Preprocessing and Analysis of LC-MS-Based Proteomic Data
Tsai, Tsung-Heng; Wang, Minkun; Ressom, Habtom W.
2016-01-01
Liquid chromatography coupled with mass spectrometry (LC-MS) has been widely used for profiling protein expression levels. This chapter is focused on LC-MS data preprocessing, which is a crucial step in the analysis of LC-MS based proteomics. We provide a high-level overview, highlight associated challenges, and present a step-by-step example for analysis of data from LC-MS based untargeted proteomic study. Furthermore, key procedures and relevant issues with the subsequent analysis by multiple reaction monitoring (MRM) are discussed. PMID:26519169
Shotgun proteomics of plant plasma membrane and microdomain proteins using nano-LC-MS/MS.
Takahashi, Daisuke; Li, Bin; Nakayama, Takato; Kawamura, Yukio; Uemura, Matsuo
2014-01-01
Shotgun proteomics allows the comprehensive analysis of proteins extracted from plant cells, subcellular organelles, and membranes. Previously, two-dimensional gel electrophoresis-based proteomics was used for mass spectrometric analysis of plasma membrane proteins. In order to get comprehensive proteome profiles of the plasma membrane including highly hydrophobic proteins with a number of transmembrane domains, a mass spectrometry-based shotgun proteomics method using nano-LC-MS/MS for proteins from the plasma membrane proteins and plasma membrane microdomain fraction is described. The results obtained are easily applicable to label-free protein semiquantification.
Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus
2014-01-01
The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Albeit quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868
MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data
Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko
2007-01-01
Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation, 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents significant advance and could be of great interest to the proteomics community. PMID:17567892
Derivative component analysis for mass spectral serum proteomic profiles.
Han, Henry
2014-01-01
As a promising way to transform medicine, mass spectrometry based proteomics technologies have seen a great progress in identifying disease biomarkers for clinical diagnosis and prognosis. However, there is a lack of effective feature selection methods that are able to capture essential data behaviors to achieve clinical level disease diagnosis. Moreover, it faces a challenge from data reproducibility, which means that no two independent studies have been found to produce same proteomic patterns. Such reproducibility issue causes the identified biomarker patterns to lose repeatability and prevents it from real clinical usage. In this work, we propose a novel machine-learning algorithm: derivative component analysis (DCA) for high-dimensional mass spectral proteomic profiles. As an implicit feature selection algorithm, derivative component analysis examines input proteomics data in a multi-resolution approach by seeking its derivatives to capture latent data characteristics and conduct de-noising. We further demonstrate DCA's advantages in disease diagnosis by viewing input proteomics data as a profile biomarker via integrating it with support vector machines to tackle the reproducibility issue, besides comparing it with state-of-the-art peers. Our results show that high-dimensional proteomics data are actually linearly separable under proposed derivative component analysis (DCA). As a novel multi-resolution feature selection algorithm, DCA not only overcomes the weakness of the traditional methods in subtle data behavior discovery, but also suggests an effective resolution to overcoming proteomics data's reproducibility problem and provides new techniques and insights in translational bioinformatics and machine learning. The DCA-based profile biomarker diagnosis makes clinical level diagnostic performances reproducible across different proteomic data, which is more robust and systematic than the existing biomarker discovery based diagnosis. Our findings demonstrate the feasibility and power of the proposed DCA-based profile biomarker diagnosis in achieving high sensitivity and conquering the data reproducibility issue in serum proteomics. Furthermore, our proposed derivative component analysis suggests the subtle data characteristics gleaning and de-noising are essential in separating true signals from red herrings for high-dimensional proteomic profiles, which can be more important than the conventional feature selection or dimension reduction. In particular, our profile biomarker diagnosis can be generalized to other omics data for derivative component analysis (DCA)'s nature of generic data analysis.
Placental Proteomics: A Shortcut to Biological Insight
Robinson, John M.; Vandré, Dale D.; Ackerman, William E.
2012-01-01
Proteomics analysis of biological samples has the potential to identify novel protein expression patterns and/or changes in protein expression patterns in different developmental or disease states. An important component of successful proteomics research, at least in its present form, is to reduce the complexity of the sample if it is derived from cells or tissues. One method to simplify complex tissues is to focus on a specific, highly purified sub-proteome. Using this approach we have developed methods to prepare highly enriched fractions of the apical plasma membrane of the syncytiotrophoblast. Through proteomics analysis of this fraction we have identified over five hundred proteins several of which were previously not known to reside in the syncytiotrophoblast. Herein, we focus on two of these, dysferlin and myoferlin. These proteins, largely known from studies of skeletal muscle, may not have been found in the human placenta were it not for discovery-based proteomics analysis. This new knowledge, acquired through a discovery-driven approach, can now be applied for the generation of hypothesis-based experimentation. Thus discovery-based and hypothesis-based research are complimentary approaches that when coupled together can hasten scientific discoveries. PMID:19070895
Top-down proteomics for the analysis of proteolytic events - Methods, applications and perspectives.
Tholey, Andreas; Becker, Alexander
2017-11-01
Mass spectrometry based proteomics is an indispensable tool for almost all research areas relevant for the understanding of proteolytic processing, ranging from the identification of substrates, products and cleavage sites up to the analysis of structural features influencing protease activity. The majority of methods for these studies are based on bottom-up proteomics performing analysis at peptide level. As this approach is characterized by a number of pitfalls, e.g. loss of molecular information, there is an ongoing effort to establish top-down proteomics, performing separation and MS analysis both at intact protein level. We briefly introduce major approaches of bottom-up proteomics used in the field of protease research and highlight the shortcomings of these methods. We then discuss the present state-of-the-art of top-down proteomics. Together with the discussion of known challenges we show the potential of this approach and present a number of successful applications of top-down proteomics in protease research. This article is part of a Special Issue entitled: Proteolysis as a Regulatory Event in Pathophysiology edited by Stefan Rose-John. Copyright © 2017 Elsevier B.V. All rights reserved.
Findeisen, Peter; Neumaier, Michael
2009-01-01
Proteomics analysis has been heralded as a novel tool for identifying new and specific biomarkers that may improve diagnosis and monitoring of various disease states. Recent years have brought a number of proteomics profiling technologies. Although proteomics profiling has resulted in the detection of disease-associated differences and modification of proteins, current proteomics technologies display certain limitations that are hampering the introduction of these new technologies into clinical laboratory diagnostics and routine applications. In this review, we summarize current advances in mass spectrometry based biomarker discovery. The promises and challenges of this new technology are discussed with particular emphasis on diagnostic perspectives of mass-spectrometry based proteomics profiling for malignant diseases.
A proteomics performance standard to support measurement quality in proteomics.
Beasley-Green, Ashley; Bunk, David; Rudnick, Paul; Kilpatrick, Lisa; Phinney, Karen
2012-04-01
The emergence of MS-based proteomic platforms as a prominent technology utilized in biochemical and biomedical research has increased the need for high-quality MS measurements. To address this need, National Institute of Standards and Technology (NIST) reference material (RM) 8323 yeast protein extract is introduced as a proteomics quality control material for benchmarking the preanalytical and analytical performance of proteomics-based experimental workflows. RM 8323 yeast protein extract is based upon the well-characterized eukaryote Saccharomyces cerevisiae and can be utilized in the design and optimization of proteomics-based methodologies from sample preparation to data analysis. To demonstrate its utility as a proteomics quality control material, we coupled LC-MS/MS measurements of RM 8323 with the NIST MS Quality Control (MSQC) performance metrics to quantitatively assess the LC-MS/MS instrumentation parameters that influence measurement accuracy, repeatability, and reproducibility. Due to the complexity of the yeast proteome, we also demonstrate how NIST RM 8323, along with the NIST MSQC performance metrics, can be used in the evaluation and optimization of proteomics-based sample preparation methods. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Comparative shotgun proteomics using spectral count data and quasi-likelihood modeling.
Li, Ming; Gray, William; Zhang, Haixia; Chung, Christine H; Billheimer, Dean; Yarbrough, Wendell G; Liebler, Daniel C; Shyr, Yu; Slebos, Robbert J C
2010-08-06
Shotgun proteomics provides the most powerful analytical platform for global inventory of complex proteomes using liquid chromatography-tandem mass spectrometry (LC-MS/MS) and allows a global analysis of protein changes. Nevertheless, sampling of complex proteomes by current shotgun proteomics platforms is incomplete, and this contributes to variability in assessment of peptide and protein inventories by spectral counting approaches. Thus, shotgun proteomics data pose challenges in comparing proteomes from different biological states. We developed an analysis strategy using quasi-likelihood Generalized Linear Modeling (GLM), included in a graphical interface software package (QuasiTel) that reads standard output from protein assemblies created by IDPicker, an HTML-based user interface to query shotgun proteomic data sets. This approach was compared to four other statistical analysis strategies: Student t test, Wilcoxon rank test, Fisher's Exact test, and Poisson-based GLM. We analyzed the performance of these tests to identify differences in protein levels based on spectral counts in a shotgun data set in which equimolar amounts of 48 human proteins were spiked at different levels into whole yeast lysates. Both GLM approaches and the Fisher Exact test performed adequately, each with their unique limitations. We subsequently compared the proteomes of normal tonsil epithelium and HNSCC using this approach and identified 86 proteins with differential spectral counts between normal tonsil epithelium and HNSCC. We selected 18 proteins from this comparison for verification of protein levels between the individual normal and tumor tissues using liquid chromatography-multiple reaction monitoring mass spectrometry (LC-MRM-MS). This analysis confirmed the magnitude and direction of the protein expression differences in all 6 proteins for which reliable data could be obtained. Our analysis demonstrates that shotgun proteomic data sets from different tissue phenotypes are sufficiently rich in quantitative information and that statistically significant differences in proteins spectral counts reflect the underlying biology of the samples.
Comparative Shotgun Proteomics Using Spectral Count Data and Quasi-Likelihood Modeling
2010-01-01
Shotgun proteomics provides the most powerful analytical platform for global inventory of complex proteomes using liquid chromatography−tandem mass spectrometry (LC−MS/MS) and allows a global analysis of protein changes. Nevertheless, sampling of complex proteomes by current shotgun proteomics platforms is incomplete, and this contributes to variability in assessment of peptide and protein inventories by spectral counting approaches. Thus, shotgun proteomics data pose challenges in comparing proteomes from different biological states. We developed an analysis strategy using quasi-likelihood Generalized Linear Modeling (GLM), included in a graphical interface software package (QuasiTel) that reads standard output from protein assemblies created by IDPicker, an HTML-based user interface to query shotgun proteomic data sets. This approach was compared to four other statistical analysis strategies: Student t test, Wilcoxon rank test, Fisher’s Exact test, and Poisson-based GLM. We analyzed the performance of these tests to identify differences in protein levels based on spectral counts in a shotgun data set in which equimolar amounts of 48 human proteins were spiked at different levels into whole yeast lysates. Both GLM approaches and the Fisher Exact test performed adequately, each with their unique limitations. We subsequently compared the proteomes of normal tonsil epithelium and HNSCC using this approach and identified 86 proteins with differential spectral counts between normal tonsil epithelium and HNSCC. We selected 18 proteins from this comparison for verification of protein levels between the individual normal and tumor tissues using liquid chromatography−multiple reaction monitoring mass spectrometry (LC−MRM-MS). This analysis confirmed the magnitude and direction of the protein expression differences in all 6 proteins for which reliable data could be obtained. Our analysis demonstrates that shotgun proteomic data sets from different tissue phenotypes are sufficiently rich in quantitative information and that statistically significant differences in proteins spectral counts reflect the underlying biology of the samples. PMID:20586475
Integrated Analysis of Transcriptomic and Proteomic Data
Haider, Saad; Pal, Ranadip
2013-01-01
Until recently, understanding the regulatory behavior of cells has been pursued through independent analysis of the transcriptome or the proteome. Based on the central dogma, it was generally assumed that there exist a direct correspondence between mRNA transcripts and generated protein expressions. However, recent studies have shown that the correlation between mRNA and Protein expressions can be low due to various factors such as different half lives and post transcription machinery. Thus, a joint analysis of the transcriptomic and proteomic data can provide useful insights that may not be deciphered from individual analysis of mRNA or protein expressions. This article reviews the existing major approaches for joint analysis of transcriptomic and proteomic data. We categorize the different approaches into eight main categories based on the initial algorithm and final analysis goal. We further present analogies with other domains and discuss the existing research problems in this area. PMID:24082820
Colangelo, Christopher M.; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L.; Carriero, Nicholas J.; Gulcicek, Erol E.; Lam, TuKiet T.; Wu, Terence; Bjornson, Robert D.; Bruce, Can; Nairn, Angus C.; Rinehart, Jesse; Miller, Perry L.; Williams, Kenneth R.
2015-01-01
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry (LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED’s database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. PMID:25712262
Choi, Hyungwon; Kim, Sinae; Fermin, Damian; Tsou, Chih-Chiang; Nesvizhskii, Alexey I
2015-11-03
We introduce QPROT, a statistical framework and computational tool for differential protein expression analysis using protein intensity data. QPROT is an extension of the QSPEC suite, originally developed for spectral count data, adapted for the analysis using continuously measured protein-level intensity data. QPROT offers a new intensity normalization procedure and model-based differential expression analysis, both of which account for missing data. Determination of differential expression of each protein is based on the standardized Z-statistic based on the posterior distribution of the log fold change parameter, guided by the false discovery rate estimated by a well-known Empirical Bayes method. We evaluated the classification performance of QPROT using the quantification calibration data from the clinical proteomic technology assessment for cancer (CPTAC) study and a recently published Escherichia coli benchmark dataset, with evaluation of FDR accuracy in the latter. QPROT is a statistical framework with computational software tool for comparative quantitative proteomics analysis. It features various extensions of QSPEC method originally built for spectral count data analysis, including probabilistic treatment of missing values in protein intensity data. With the increasing popularity of label-free quantitative proteomics data, the proposed method and accompanying software suite will be immediately useful for many proteomics laboratories. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015 Elsevier B.V. All rights reserved.
Colangelo, Christopher M; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L; Carriero, Nicholas J; Gulcicek, Erol E; Lam, TuKiet T; Wu, Terence; Bjornson, Robert D; Bruce, Can; Nairn, Angus C; Rinehart, Jesse; Miller, Perry L; Williams, Kenneth R
2015-02-01
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography-tandem mass spectrometry (LC-MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Building ProteomeTools based on a complete synthetic human proteome
Zolg, Daniel P.; Wilhelm, Mathias; Schnatbaum, Karsten; Zerweck, Johannes; Knaute, Tobias; Delanghe, Bernard; Bailey, Derek J.; Gessulat, Siegfried; Ehrlich, Hans-Christian; Weininger, Maximilian; Yu, Peng; Schlegl, Judith; Kramer, Karl; Schmidt, Tobias; Kusebauch, Ulrike; Deutsch, Eric W.; Aebersold, Ruedi; Moritz, Robert L.; Wenschuh, Holger; Moehring, Thomas; Aiche, Stephan; Huhmer, Andreas; Reimer, Ulf; Kuster, Bernhard
2018-01-01
The ProteomeTools project builds molecular and digital tools from the human proteome to facilitate biomedical and life science research. Here, we report the generation and multimodal LC-MS/MS analysis of >330,000 synthetic tryptic peptides representing essentially all canonical human gene products and exemplify the utility of this data. The resource will be extended to >1 million peptides and all data will be shared with the community via ProteomicsDB and proteomeXchange. PMID:28135259
Rigbolt, Kristoffer T G; Vanselow, Jens T; Blagoev, Blagoy
2011-08-01
Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data we developed the Graphical Proteomics Data Explorer (GProX)(1). The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data is available and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net.
Rigbolt, Kristoffer T. G.; Vanselow, Jens T.; Blagoev, Blagoy
2011-01-01
Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data we developed the Graphical Proteomics Data Explorer (GProX)1. The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data is available and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net. PMID:21602510
Goeminne, Ludger J E; Gevaert, Kris; Clement, Lieven
2018-01-16
Label-free shotgun proteomics is routinely used to assess proteomes. However, extracting relevant information from the massive amounts of generated data remains difficult. This tutorial provides a strong foundation on analysis of quantitative proteomics data. We provide key statistical concepts that help researchers to design proteomics experiments and we showcase how to analyze quantitative proteomics data using our recent free and open-source R package MSqRob, which was developed to implement the peptide-level robust ridge regression method for relative protein quantification described by Goeminne et al. MSqRob can handle virtually any experimental proteomics design and outputs proteins ordered by statistical significance. Moreover, its graphical user interface and interactive diagnostic plots provide easy inspection and also detection of anomalies in the data and flaws in the data analysis, allowing deeper assessment of the validity of results and a critical review of the experimental design. Our tutorial discusses interactive preprocessing, data analysis and visualization of label-free MS-based quantitative proteomics experiments with simple and more complex designs. We provide well-documented scripts to run analyses in bash mode on GitHub, enabling the integration of MSqRob in automated pipelines on cluster environments (https://github.com/statOmics/MSqRob). The concepts outlined in this tutorial aid in designing better experiments and analyzing the resulting data more appropriately. The two case studies using the MSqRob graphical user interface will contribute to a wider adaptation of advanced peptide-based models, resulting in higher quality data analysis workflows and more reproducible results in the proteomics community. We also provide well-documented scripts for experienced users that aim at automating MSqRob on cluster environments. Copyright © 2017 Elsevier B.V. All rights reserved.
NCI Launches Proteomics Assay Portal | Office of Cancer Clinical Proteomics Research
In a paper recently published by the journal Nature Methods, Investigators from the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (NCI-CPTAC) announced the launch of a proteomics Assay Portal for multiple reaction monitoring-mass spectrometry (MRM-MS) assays. This community web-based repository for well-characterized quantitative proteomic assays currently consists of 456 unique peptide assays to 282 unique proteins and ser
[Techniques for rapid production of monoclonal antibodies for use with antibody technology].
Kamada, Haruhiko
2012-01-01
A monoclonal antibody (Mab), due to its specific binding ability to a target protein, can potentially be one of the most useful tools for the functional analysis of proteins in recent proteomics-based research. However, the production of Mab is a very time-consuming and laborious process (i.e., preparation of recombinant antigens, immunization of animals, preparation of hybridomas), making it the rate-limiting step in using Mabs in high-throughput proteomics research, which heavily relies on comprehensive and rapid methods. Therefore, there is a great demand for new methods to efficiently generate Mabs against a group of proteins identified by proteome analysis. Here, we describe a useful method called "Antibody proteomic technique" for the rapid generations of Mabs to pharmaceutical target, which were identified by proteomic analyses of disease samples (ex. tumor tissue, etc.). We also introduce another method to find profitable targets on vasculature, which is called "Vascular proteomic technique". Our results suggest that this method for the rapid generation of Mabs to proteins may be very useful in proteomics-based research as well as in clinical applications.
Picotti, Paola; Clement-Ziza, Mathieu; Lam, Henry; Campbell, David S.; Schmidt, Alexander; Deutsch, Eric W.; Röst, Hannes; Sun, Zhi; Rinner, Oliver; Reiter, Lukas; Shen, Qin; Michaelson, Jacob J.; Frei, Andreas; Alberti, Simon; Kusebauch, Ulrike; Wollscheid, Bernd; Moritz, Robert; Beyer, Andreas; Aebersold, Ruedi
2013-01-01
Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map one supporting discovery- (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways. PMID:23334424
Recent advances in proteomics of cereals.
Bansal, Monika; Sharma, Madhu; Kanwar, Priyanka; Goyal, Aakash
Cereals contribute a major part of human nutrition and are considered as an integral source of energy for human diets. With genomic databases already available in cereals such as rice, wheat, barley, and maize, the focus has now moved to proteome analysis. Proteomics studies involve the development of appropriate databases based on developing suitable separation and purification protocols, identification of protein functions, and can confirm their functional networks based on already available data from other sources. Tremendous progress has been made in the past decade in generating huge data-sets for covering interactions among proteins, protein composition of various organs and organelles, quantitative and qualitative analysis of proteins, and to characterize their modulation during plant development, biotic, and abiotic stresses. Proteomics platforms have been used to identify and improve our understanding of various metabolic pathways. This article gives a brief review of efforts made by different research groups on comparative descriptive and functional analysis of proteomics applications achieved in the cereal science so far.
Comparative bioinformatics analyses and profiling of lysosome-related organelle proteomes
NASA Astrophysics Data System (ADS)
Hu, Zhang-Zhi; Valencia, Julio C.; Huang, Hongzhan; Chi, An; Shabanowitz, Jeffrey; Hearing, Vincent J.; Appella, Ettore; Wu, Cathy
2007-01-01
Complete and accurate profiling of cellular organelle proteomes, while challenging, is important for the understanding of detailed cellular processes at the organelle level. Mass spectrometry technologies coupled with bioinformatics analysis provide an effective approach for protein identification and functional interpretation of organelle proteomes. In this study, we have compiled human organelle reference datasets from large-scale proteomic studies and protein databases for seven lysosome-related organelles (LROs), as well as the endoplasmic reticulum and mitochondria, for comparative organelle proteome analysis. Heterogeneous sources of human organelle proteins and rodent homologs are mapped to human UniProtKB protein entries based on ID and/or peptide mappings, followed by functional annotation and categorization using the iProXpress proteomic expression analysis system. Cataloging organelle proteomes allows close examination of both shared and unique proteins among various LROs and reveals their functional relevance. The proteomic comparisons show that LROs are a closely related family of organelles. The shared proteins indicate the dynamic and hybrid nature of LROs, while the unique transmembrane proteins may represent additional candidate marker proteins for LROs. This comparative analysis, therefore, provides a basis for hypothesis formulation and experimental validation of organelle proteins and their functional roles.
HTAPP: High-Throughput Autonomous Proteomic Pipeline
Yu, Kebing; Salomon, Arthur R.
2011-01-01
Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic datasets is critically important. The high-throughput autonomous proteomic pipeline (HTAPP) described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is comprised of software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of HTAPP focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples. PMID:20336676
Nanoliter-Scale Oil-Air-Droplet Chip-Based Single Cell Proteomic Analysis.
Li, Zi-Yi; Huang, Min; Wang, Xiu-Kun; Zhu, Ying; Li, Jin-Song; Wong, Catherine C L; Fang, Qun
2018-04-17
Single cell proteomic analysis provides crucial information on cellular heterogeneity in biological systems. Herein, we describe a nanoliter-scale oil-air-droplet (OAD) chip for achieving multistep complex sample pretreatment and injection for single cell proteomic analysis in the shotgun mode. By using miniaturized stationary droplet microreaction and manipulation techniques, our system allows all sample pretreatment and injection procedures to be performed in a nanoliter-scale droplet with minimum sample loss and a high sample injection efficiency (>99%), thus substantially increasing the analytical sensitivity for single cell samples. We applied the present system in the proteomic analysis of 100 ± 10, 50 ± 5, 10, and 1 HeLa cell(s), and protein IDs of 1360, 612, 192, and 51 were identified, respectively. The OAD chip-based system was further applied in single mouse oocyte analysis, with 355 protein IDs identified at the single oocyte level, which demonstrated its special advantages of high enrichment of sequence coverage, hydrophobic proteins, and enzymatic digestion efficiency over the traditional in-tube system.
Using Public Data for Comparative Proteome Analysis in Precision Medicine Programs.
Hughes, Christopher S; Morin, Gregg B
2018-03-01
Maximizing the clinical utility of information obtained in longitudinal precision medicine programs would benefit from robust comparative analyses to known information to assess biological features of patient material toward identifying the underlying features driving their disease phenotype. Herein, the potential for utilizing publically deposited mass-spectrometry-based proteomics data to perform inter-study comparisons of cell-line or tumor-tissue materials is investigated. To investigate the robustness of comparison between MS-based proteomics studies carried out with different methodologies, deposited data representative of label-free (MS1) and isobaric tagging (MS2 and MS3 quantification) are utilized. In-depth quantitative proteomics data acquired from analysis of ovarian cancer cell lines revealed the robust recapitulation of observable gene expression dynamics between individual studies carried out using significantly different methodologies. The observed signatures enable robust inter-study clustering of cell line samples. In addition, the ability to classify and cluster tumor samples based on observed gene expression trends when using a single patient sample is established. With this analysis, relevant gene expression dynamics are obtained from a single patient tumor, in the context of a precision medicine analysis, by leveraging a large cohort of repository data as a comparator. Together, these data establish the potential for state-of-the-art MS-based proteomics data to serve as resources for robust comparative analyses in precision medicine applications. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Recent advances in mass spectrometry-based proteomics of gastric cancer.
Kang, Changwon; Lee, Yejin; Lee, J Eugene
2016-10-07
The last decade has witnessed remarkable technological advances in mass spectrometry-based proteomics. The development of proteomics techniques has enabled the reliable analysis of complex proteomes, leading to the identification and quantification of thousands of proteins in gastric cancer cells, tissues, and sera. This quantitative information has been used to profile the anomalies in gastric cancer and provide insights into the pathogenic mechanism of the disease. In this review, we mainly focus on the advances in mass spectrometry and quantitative proteomics that were achieved in the last five years and how these up-and-coming technologies are employed to track biochemical changes in gastric cancer cells. We conclude by presenting a perspective on quantitative proteomics and its future applications in the clinic and translational gastric cancer research.
Yang, Yongxin; Bu, Dengpan; Zhao, Xiaowei; Sun, Peng; Wang, Jiaqi; Zhou, Lingyun
2013-04-05
To aid in unraveling diverse genetic and biological unknowns, a proteomic approach was used to analyze the whey proteome in cow, yak, buffalo, goat, and camel milk based on the isobaric tag for relative and absolute quantification (iTRAQ) techniques. This analysis is the first to produce proteomic data for the milk from the above-mentioned animal species: 211 proteins have been identified and 113 proteins have been categorized according to molecular function, cellular components, and biological processes based on gene ontology annotation. The results of principal component analysis showed significant differences in proteomic patterns among goat, camel, cow, buffalo, and yak milk. Furthermore, 177 differentially expressed proteins were submitted to advanced hierarchical clustering. The resulting clustering pattern included three major sample clusters: (1) cow, buffalo, and yak milk; (2) goat, cow, buffalo, and yak milk; and (3) camel milk. Certain proteins were chosen as characterization traits for a given species: whey acidic protein and quinone oxidoreductase for camel milk, biglycan for goat milk, uncharacterized protein (Accession Number: F1MK50 ) for yak milk, clusterin for buffalo milk, and primary amine oxidase for cow milk. These results help reveal the quantitative milk whey proteome pattern for analyzed species. This provides information for evaluating adulteration of specific specie milk and may provide potential directions for application of specific milk protein production based on physiological differences among animal species.
Advancing Clinical Proteomics via Analysis Based on Biological Complexes: A Tale of Five Paradigms.
Goh, Wilson Wen Bin; Wong, Limsoon
2016-09-02
Despite advances in proteomic technologies, idiosyncratic data issues, for example, incomplete coverage and inconsistency, resulting in large data holes, persist. Moreover, because of naïve reliance on statistical testing and its accompanying p values, differential protein signatures identified from such proteomics data have little diagnostic power. Thus, deploying conventional analytics on proteomics data is insufficient for identifying novel drug targets or precise yet sensitive biomarkers. Complex-based analysis is a new analytical approach that has potential to resolve these issues but requires formalization. We categorize complex-based analysis into five method classes or paradigms and propose an even-handed yet comprehensive evaluation rubric based on both simulated and real data. The first four paradigms are well represented in the literature. The fifth and newest paradigm, the network-paired (NP) paradigm, represented by a method called Extremely Small SubNET (ESSNET), dominates in precision-recall and reproducibility, maintains strong performance in small sample sizes, and sensitively detects low-abundance complexes. In contrast, the commonly used over-representation analysis (ORA) and direct-group (DG) test paradigms maintain good overall precision but have severe reproducibility issues. The other two paradigms considered here are the hit-rate and rank-based network analysis paradigms; both of these have good precision-recall and reproducibility, but they do not consider low-abundance complexes. Therefore, given its strong performance, NP/ESSNET may prove to be a useful approach for improving the analytical resolution of proteomics data. Additionally, given its stability, it may also be a powerful new approach toward functional enrichment tests, much like its ORA and DG counterparts.
Multidimensional proteomics for cell biology.
Larance, Mark; Lamond, Angus I
2015-05-01
The proteome is a dynamic system in which each protein has interconnected properties - dimensions - that together contribute to the phenotype of a cell. Measuring these properties has proved challenging owing to their diversity and dynamic nature. Advances in mass spectrometry-based proteomics now enable the measurement of multiple properties for thousands of proteins, including their abundance, isoform expression, turnover rate, subcellular localization, post-translational modifications and interactions. Complementing these experimental developments are new data analysis, integration and visualization tools as well as data-sharing resources. Together, these advances in the multidimensional analysis of the proteome are transforming our understanding of various cellular and physiological processes.
Computer aided manual validation of mass spectrometry-based proteomic data.
Curran, Timothy G; Bryson, Bryan D; Reigelhaupt, Michael; Johnson, Hannah; White, Forest M
2013-06-15
Advances in mass spectrometry-based proteomic technologies have increased the speed of analysis and the depth provided by a single analysis. Computational tools to evaluate the accuracy of peptide identifications from these high-throughput analyses have not kept pace with technological advances; currently the most common quality evaluation methods are based on statistical analysis of the likelihood of false positive identifications in large-scale data sets. While helpful, these calculations do not consider the accuracy of each identification, thus creating a precarious situation for biologists relying on the data to inform experimental design. Manual validation is the gold standard approach to confirm accuracy of database identifications, but is extremely time-intensive. To palliate the increasing time required to manually validate large proteomic datasets, we provide computer aided manual validation software (CAMV) to expedite the process. Relevant spectra are collected, catalogued, and pre-labeled, allowing users to efficiently judge the quality of each identification and summarize applicable quantitative information. CAMV significantly reduces the burden associated with manual validation and will hopefully encourage broader adoption of manual validation in mass spectrometry-based proteomics. Copyright © 2013 Elsevier Inc. All rights reserved.
Gregori, Josep; Villarreal, Laura; Sánchez, Alex; Baselga, José; Villanueva, Josep
2013-12-16
The microarray community has shown that the low reproducibility observed in gene expression-based biomarker discovery studies is partially due to relying solely on p-values to get the lists of differentially expressed genes. Their conclusions recommended complementing the p-value cutoff with the use of effect-size criteria. The aim of this work was to evaluate the influence of such an effect-size filter on spectral counting-based comparative proteomic analysis. The results proved that the filter increased the number of true positives and decreased the number of false positives and the false discovery rate of the dataset. These results were confirmed by simulation experiments where the effect size filter was used to evaluate systematically variable fractions of differentially expressed proteins. Our results suggest that relaxing the p-value cut-off followed by a post-test filter based on effect size and signal level thresholds can increase the reproducibility of statistical results obtained in comparative proteomic analysis. Based on our work, we recommend using a filter consisting of a minimum absolute log2 fold change of 0.8 and a minimum signal of 2-4 SpC on the most abundant condition for the general practice of comparative proteomics. The implementation of feature filtering approaches could improve proteomic biomarker discovery initiatives by increasing the reproducibility of the results obtained among independent laboratories and MS platforms. Quality control analysis of microarray-based gene expression studies pointed out that the low reproducibility observed in the lists of differentially expressed genes could be partially attributed to the fact that these lists are generated relying solely on p-values. Our study has established that the implementation of an effect size post-test filter improves the statistical results of spectral count-based quantitative proteomics. The results proved that the filter increased the number of true positives whereas decreased the false positives and the false discovery rate of the datasets. The results presented here prove that a post-test filter applying a reasonable effect size and signal level thresholds helps to increase the reproducibility of statistical results in comparative proteomic analysis. Furthermore, the implementation of feature filtering approaches could improve proteomic biomarker discovery initiatives by increasing the reproducibility of results obtained among independent laboratories and MS platforms. This article is part of a Special Issue entitled: Standardization and Quality Control in Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.
The Challenge of Human Spermatozoa Proteome: A Systematic Review.
Gilany, Kambiz; Minai-Tehrani, Arash; Amini, Mehdi; Agharezaee, Niloofar; Arjmand, Babak
2017-01-01
Currently, there are 20,197 human protein-coding genes in the most expertly curated database (UniProtKB/Swiss-Pro). Big efforts have been made by the international consortium, the Chromosome-Centric Human Proteome Project (C-HPP) and independent researchers, to map human proteome. In brief, anno 2017 the human proteome was outlined. The male factor contributes to 50% of infertility in couples. However, there are limited human spermatozoa proteomic studies. Firstly, the development of the mapping of the human spermatozoa was analyzed. The human spermatozoa have been used as a model for missing proteins. It has been shown that human spermatozoa are excellent sources for finding missing proteins. Y chromosome proteome mapping is led by Iran. However, it seems that it is extremely challenging to map the human spermatozoa Y chromosome proteins based on current mass spectrometry-based proteomics technology. Post-translation modifications (PTMs) of human spermatozoa proteome are the most unexplored area and currently the exact role of PTMs in male infertility is unknown. Additionally, the clinical human spermatozoa proteomic analysis, anno 2017 was done in this study.
SELDI-TOF-based serum proteomic pattern diagnostics for early detection of cancer.
Petricoin, Emanuel F; Liotta, Lance A
2004-02-01
Proteomics is more than just generating lists of proteins that increase or decrease in expression as a cause or consequence of pathology. The goal should be to characterize the information flow through the intercellular protein circuitry that communicates with the extracellular microenvironment and then ultimately to the serum/plasma macroenvironment. The nature of this information can be a cause, or a consequence, of disease and toxicity-based processes. Serum proteomic pattern diagnostics is a new type of proteomic platform in which patterns of proteomic signatures from high dimensional mass spectrometry data are used as a diagnostic classifier. This approach has recently shown tremendous promise in the detection of early-stage cancers. The biomarkers found by SELDI-TOF-based pattern recognition analysis are mostly low molecular weight fragments produced at the specific tumor microenvironment.
Pietrowska, M; Marczak, L; Polanska, J; Nowicka, E; Behrent, K; Tarnawski, R; Stobiecki, M; Polanski, A; Widlak, P
2010-01-01
Mass spectrometry-based analysis of the serum proteome allows identifying multi-peptide patterns/signatures specific for blood of cancer patients, thus having high potential value for cancer diagnostics. However, because of problems with optimization and standardization of experimental and computational design, none of identified proteome patterns/signatures was approved for diagnostics in clinical practice as yet. Here we compared two methods of serum sample preparation for mass spectrometry-based proteome pattern analysis aimed to identify biomarkers that could be used in early detection of breast cancer patients. Blood samples were collected in a group of 92 patients diagnosed at early (I and II) stages of the disease before the start of therapy, and in a group of age-matched healthy controls (104 women). Serum specimens were purified and analyzed using MALDI-ToF spectrometry, either directly or after membrane filtration (50 kDa cut-off) to remove albumin and other large serum proteins. Mass spectra of the low-molecular-weight fraction (2-10 kDa) of the serum proteome were resolved using the Gaussian mixture decomposition, and identified spectral components were used to build classifiers that differentiated samples from breast cancer patients and healthy persons. Mass spectra of complete serum and membrane-filtered albumin-depleted samples have apparently different structure and peaks specific for both types of samples could be identified. The optimal classifier built for the complete serum specimens consisted of 8 spectral components, and had 81% specificity and 72% sensitivity, while that built for the membrane-filtered samples consisted of 4 components, and had 80% specificity and 81% sensitivity. We concluded that pre-processing of samples to remove albumin might be recommended before MALDI-ToF mass spectrometric analysis of the low-molecular-weight components of human serum Keywords: albumin removal; breast cancer; clinical proteomics; mass spectrometry; pattern analysis; serum proteome.
Zhu, Xiaoyu; Liu, Xin; Cheng, Zhongyi; Zhu, Jun; Xu, Lei; Wang, Fengsong; Qi, Wulin; Yan, Jiawei; Liu, Ning; Sun, Zimin; Liu, Huilan; Peng, Xiaojun; Hao, Yingchan; Zheng, Nan; Wu, Quan
2016-01-29
Valproic acid (VPA) and suberoylanilide hydroxamic acid (SAHA) are both HDAC inhibitors (HDACi). Previous studies indicated that both inhibitors show therapeutic effects on acute myeloid leukaemia (AML), while the differential impacts of the two different HDACi on AML treatment still remains elusive. In this study, using 3-plex SILAC based quantitative proteomics technique, anti-acetyllysine antibody based affinity enrichment, high resolution LC-MS/MS and intensive bioinformatic analysis, the quantitative proteome and acetylome in SAHA and VPA treated AML HL60 cells were extensively studied. In total, 5,775 proteins and 1,124 lysine acetylation sites were successfully obtained in response to VAP and SAHA treatment. It is found that VPA and SAHA treatment differently induced proteome and acetylome profiling in AML HL60 cells. This study revealed the differential impacts of VPA and SAHA on proteome/acetylome in AML cells, deepening our understanding of HDAC inhibitor mediated AML therapeutics.
Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi
2017-06-23
The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from genome sequences, though there are over lapped proteins. Based on the demonstrated application of data stored in the database for functional analyses, it is suggested that these data will be useful for analyses of biological mechanisms in soybean. Furthermore, coupled with recent advances in information and communication technology, the usefulness of this database would increase in the analyses of biological mechanisms. Copyright © 2017 Elsevier B.V. All rights reserved.
Proteomics wants cRacker: automated standardized data analysis of LC-MS derived proteomic data.
Zauber, Henrik; Schulze, Waltraud X
2012-11-02
The large-scale analysis of thousands of proteins under various experimental conditions or in mutant lines has gained more and more importance in hypothesis-driven scientific research and systems biology in the past years. Quantitative analysis by large scale proteomics using modern mass spectrometry usually results in long lists of peptide ion intensities. The main interest for most researchers, however, is to draw conclusions on the protein level. Postprocessing and combining peptide intensities of a proteomic data set requires expert knowledge, and the often repetitive and standardized manual calculations can be time-consuming. The analysis of complex samples can result in very large data sets (lists with several 1000s to 100,000 entries of different peptides) that cannot easily be analyzed using standard spreadsheet programs. To improve speed and consistency of the data analysis of LC-MS derived proteomic data, we developed cRacker. cRacker is an R-based program for automated downstream proteomic data analysis including data normalization strategies for metabolic labeling and label free quantitation. In addition, cRacker includes basic statistical analysis, such as clustering of data, or ANOVA and t tests for comparison between treatments. Results are presented in editable graphic formats and in list files.
Advances in Proteomics Data Analysis and Display Using an Accurate Mass and Time Tag Approach
Zimmer, Jennifer S.D.; Monroe, Matthew E.; Qian, Wei-Jun; Smith, Richard D.
2007-01-01
Proteomics has recently demonstrated utility in understanding cellular processes on the molecular level as a component of systems biology approaches and for identifying potential biomarkers of various disease states. The large amount of data generated by utilizing high efficiency (e.g., chromatographic) separations coupled to high mass accuracy mass spectrometry for high-throughput proteomics analyses presents challenges related to data processing, analysis, and display. This review focuses on recent advances in nanoLC-FTICR-MS-based proteomics approaches and the accompanying data processing tools that have been developed to display and interpret the large volumes of data being produced. PMID:16429408
Seliger, Barbara; Dressler, Sven P.; Wang, Ena; Kellner, Roland; Recktenwald, Christian V.; Lottspeich, Friedrich; Marincola, Francesco M.; Baumgärtner, Maja; Atkins, Derek; Lichtenfels, Rudolf
2012-01-01
Results obtained from expression profilings of renal cell carcinoma using different “ome”-based approaches and comprehensive data analysis demonstrated that proteome-based technologies and cDNA microarray analyses complement each other during the discovery phase for disease-related candidate biomarkers. The integration of the respective data revealed the uniqueness and complementarities of the different technologies. While comparative cDNA microarray analyses though restricted to upregulated targets largely revealed genes involved in controlling gene/protein expression (19%) and signal transduction processes (13%), proteomics/PROTEOMEX-defined candidate biomarkers include enzymes of the cellular metabolism (36%), transport proteins (12%) and cell motility/structural molecules (10%). Candidate biomarkers defined by proteomics and PROTEOMEX are frequently shared, whereas the sharing rate between cDNA microarray and proteome-based profilings is limited. Putative candidate biomarkers provide insights into their cellular (dys)function and their diagnostic/prognostic value but still warrant further validation in larger patient numbers. Based on the fact that merely 3 candidate biomarkers were shared by all applied technologies, namely annexin A4, tubulin alpha-1A chain and ubiquitin carboxyl-terminal hydrolase L1 the analysis at a single hierarchical level of biological regulation seems to provide only limited results thus emphasizing the importance and benefit of performing rather combinatorial screenings which can complement the standard clinical predictors. PMID:19235166
Systems Proteomics for Translational Network Medicine
Arrell, D. Kent; Terzic, Andre
2012-01-01
Universal principles underlying network science, and their ever-increasing applications in biomedicine, underscore the unprecedented capacity of systems biology based strategies to synthesize and resolve massive high throughput generated datasets. Enabling previously unattainable comprehension of biological complexity, systems approaches have accelerated progress in elucidating disease prediction, progression, and outcome. Applied to the spectrum of states spanning health and disease, network proteomics establishes a collation, integration, and prioritization algorithm to guide mapping and decoding of proteome landscapes from large-scale raw data. Providing unparalleled deconvolution of protein lists into global interactomes, integrative systems proteomics enables objective, multi-modal interpretation at molecular, pathway, and network scales, merging individual molecular components, their plurality of interactions, and functional contributions for systems comprehension. As such, network systems approaches are increasingly exploited for objective interpretation of cardiovascular proteomics studies. Here, we highlight network systems proteomic analysis pipelines for integration and biological interpretation through protein cartography, ontological categorization, pathway and functional enrichment and complex network analysis. PMID:22896016
Lehmann, Roland; Schmidt, André; Pastuschek, Jana; Müller, Mario M; Fritzsche, Andreas; Dieterle, Stefan; Greb, Robert R; Markert, Udo R; Slevogt, Hortense
2018-06-25
The proteomic analysis of complex body fluids by liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis requires the selection of suitable sample preparation techniques and optimal parameter settings in data analysis software packages to obtain reliable results. Proteomic analysis of follicular fluid, as a representative of a complex body fluid similar to serum or plasma, is difficult as it contains a vast amount of high abundant proteins and a variety of proteins with different concentrations. However, the accessibility of this complex body fluid for LC-MS/MS analysis is an opportunity to gain insights into the status, the composition of fertility-relevant proteins including immunological factors or for the discovery of new diagnostic and prognostic markers for, for example, the treatment of infertility. In this study, we compared different sample preparation methods (FASP, eFASP and in-solution digestion) and three different data analysis software packages (Proteome Discoverer with SEQUEST, Mascot and MaxQuant with Andromeda) combined with semi- and full-tryptic databank search options to obtain a maximum coverage of the follicular fluid proteome. We found that the most comprehensive proteome coverage is achieved by the eFASP sample preparation method using SDS in the initial denaturing step and the SEQUEST-based semi-tryptic data analysis. In conclusion, we have developed a fractionation-free methodical workflow for in depth LC-MS/MS-based analysis for the standardized investigation of human follicle fluid as an important representative of a complex body fluid. Taken together, we were able to identify a total of 1392 proteins in follicular fluid. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
ProteoSign: an end-user online differential proteomics statistical analysis platform.
Efstathiou, Georgios; Antonakis, Andreas N; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Divanach, Peter; Trudgian, David C; Thomas, Benjamin; Papanikolaou, Nikolas; Aivaliotis, Michalis; Acuto, Oreste; Iliopoulos, Ioannis
2017-07-03
Profiling of proteome dynamics is crucial for understanding cellular behavior in response to intrinsic and extrinsic stimuli and maintenance of homeostasis. Over the last 20 years, mass spectrometry (MS) has emerged as the most powerful tool for large-scale identification and characterization of proteins. Bottom-up proteomics, the most common MS-based proteomics approach, has always been challenging in terms of data management, processing, analysis and visualization, with modern instruments capable of producing several gigabytes of data out of a single experiment. Here, we present ProteoSign, a freely available web application, dedicated in allowing users to perform proteomics differential expression/abundance analysis in a user-friendly and self-explanatory way. Although several non-commercial standalone tools have been developed for post-quantification statistical analysis of proteomics data, most of them are not end-user appealing as they often require very stringent installation of programming environments, third-party software packages and sometimes further scripting or computer programming. To avoid this bottleneck, we have developed a user-friendly software platform accessible via a web interface in order to enable proteomics laboratories and core facilities to statistically analyse quantitative proteomics data sets in a resource-efficient manner. ProteoSign is available at http://bioinformatics.med.uoc.gr/ProteoSign and the source code at https://github.com/yorgodillo/ProteoSign. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Proteomics in medical microbiology.
Cash, P
2000-04-01
The techniques of proteomics (high resolution two-dimensional electrophoresis and protein characterisation) are widely used for microbiological research to analyse global protein synthesis as an indicator of gene expression. The rapid progress in microbial proteomics has been achieved through the wide availability of whole genome sequences for a number of bacterial groups. Beyond providing a basic understanding of microbial gene expression, proteomics has also played a role in medical areas of microbiology. Progress has been made in the use of the techniques for investigating the epidemiology and taxonomy of human microbial pathogens, the identification of novel pathogenic mechanisms and the analysis of drug resistance. In each of these areas, proteomics has provided new insights that complement genomic-based investigations. This review describes the current progress in these research fields and highlights some of the technical challenges existing for the application of proteomics in medical microbiology. The latter concern the analysis of genetically heterogeneous bacterial populations and the integration of the proteomic and genomic data for these bacteria. The characterisation of the proteomes of bacterial pathogens growing in their natural hosts remains a future challenge.
Matallana-Surget, Sabine; Derock, Jérémy; Leroy, Baptiste; Badri, Hanène; Deschoenmaeker, Frédéric; Wattiez, Ruddy
2014-01-01
The filamentous cyanobacterium Arthrospira platensis has a long history of use as a food supply and it has been used by the European Space Agency in the MELiSSA project, an artificial microecosystem which supports life during long-term manned space missions. This study assesses progress in the field of cyanobacterial shotgun proteomics and light/dark diurnal cycles by focusing on Arthrospira platensis. Several fractionation workflows including gel-free and gel-based protein/peptide fractionation procedures were used and combined with LC-MS/MS analysis, enabling the overall identification of 1306 proteins, which represents 21% coverage of the theoretical proteome. A total of 30 proteins were found to be significantly differentially regulated under light/dark growth transition. Interestingly, most of the proteins showing differential abundance were related to photosynthesis, the Calvin cycle and translation processes. A novel aspect and major achievement of this work is the successful improvement of the cyanobacterial proteome coverage using a 3D LC-MS/MS approach, based on an immobilized metal affinity chromatography, a suitable tool that enabled us to eliminate the most abundant protein, the allophycocyanin. We also demonstrated that cell growth follows a light/dark cycle in A. platensis. This preliminary proteomic study has highlighted new characteristics of the Arthrospira platensis proteome in terms of diurnal regulation.
Matallana-Surget, Sabine; Derock, Jérémy; Leroy, Baptiste; Badri, Hanène; Deschoenmaeker, Frédéric; Wattiez, Ruddy
2014-01-01
The filamentous cyanobacterium Arthrospira platensis has a long history of use as a food supply and it has been used by the European Space Agency in the MELiSSA project, an artificial microecosystem which supports life during long-term manned space missions. This study assesses progress in the field of cyanobacterial shotgun proteomics and light/dark diurnal cycles by focusing on Arthrospira platensis. Several fractionation workflows including gel-free and gel-based protein/peptide fractionation procedures were used and combined with LC-MS/MS analysis, enabling the overall identification of 1306 proteins, which represents 21% coverage of the theoretical proteome. A total of 30 proteins were found to be significantly differentially regulated under light/dark growth transition. Interestingly, most of the proteins showing differential abundance were related to photosynthesis, the Calvin cycle and translation processes. A novel aspect and major achievement of this work is the successful improvement of the cyanobacterial proteome coverage using a 3D LC-MS/MS approach, based on an immobilized metal affinity chromatography, a suitable tool that enabled us to eliminate the most abundant protein, the allophycocyanin. We also demonstrated that cell growth follows a light/dark cycle in A. platensis. This preliminary proteomic study has highlighted new characteristics of the Arthrospira platensis proteome in terms of diurnal regulation. PMID:24914774
Proteomics Analysis of Bladder Cancer Exosomes*
Welton, Joanne L.; Khanna, Sanjay; Giles, Peter J.; Brennan, Paul; Brewis, Ian A.; Staffurth, John; Mason, Malcolm D.; Clayton, Aled
2010-01-01
Exosomes are nanometer-sized vesicles, secreted by various cell types, present in biological fluids that are particularly rich in membrane proteins. Ex vivo analysis of exosomes may provide biomarker discovery platforms and form non-invasive tools for disease diagnosis and monitoring. These vesicles have never before been studied in the context of bladder cancer, a major malignancy of the urological tract. We present the first proteomics analysis of bladder cancer cell exosomes. Using ultracentrifugation on a sucrose cushion, exosomes were highly purified from cultured HT1376 bladder cancer cells and verified as low in contaminants by Western blotting and flow cytometry of exosome-coated beads. Solubilization in a buffer containing SDS and DTT was essential for achieving proteomics analysis using an LC-MALDI-TOF/TOF MS approach. We report 353 high quality identifications with 72 proteins not previously identified by other human exosome proteomics studies. Overrepresentation analysis to compare this data set with previous exosome proteomics studies (using the ExoCarta database) revealed that the proteome was consistent with that of various exosomes with particular overlap with exosomes of carcinoma origin. Interrogating the Gene Ontology database highlighted a strong association of this proteome with carcinoma of bladder and other sites. The data also highlighted how homology among human leukocyte antigen haplotypes may confound MASCOT designation of major histocompatability complex Class I nomenclature, requiring data from PCR-based human leukocyte antigen haplotyping to clarify anomalous identifications. Validation of 18 MS protein identifications (including basigin, galectin-3, trophoblast glycoprotein (5T4), and others) was performed by a combination of Western blotting, flotation on linear sucrose gradients, and flow cytometry, confirming their exosomal expression. Some were confirmed positive on urinary exosomes from a bladder cancer patient. In summary, the exosome proteomics data set presented is of unrivaled quality. The data will aid in the development of urine exosome-based clinical tools for monitoring disease and will inform follow-up studies into varied aspects of exosome manufacture and function. PMID:20224111
Liquid Chromatography Mass Spectrometry-Based Proteomics: Biological and Technological Aspects
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karpievitch, Yuliya V.; Polpitiya, Ashoka D.; Anderson, Gordon A.
2010-12-01
Mass spectrometry-based proteomics has become the tool of choice for identifying and quantifying the proteome of an organism. Though recent years have seen a tremendous improvement in instrument performance and the computational tools used, significant challenges remain, and there are many opportunities for statisticians to make important contributions. In the most widely used "bottom-up" approach to proteomics, complex mixtures of proteins are first subjected to enzymatic cleavage, the resulting peptide products are separated based on chemical or physical properties and analyzed using a mass spectrometer. The two fundamental challenges in the analysis of bottom-up MS-based proteomics are: (1) Identifying themore » proteins that are present in a sample, and (2) Quantifying the abundance levels of the identified proteins. Both of these challenges require knowledge of the biological and technological context that gives rise to observed data, as well as the application of sound statistical principles for estimation and inference. We present an overview of bottom-up proteomics and outline the key statistical issues that arise in protein identification and quantification.« less
Wiśniewski, Jacek R; Mann, Matthias
2016-07-01
Proteomics and other protein-based analysis methods such as Western blotting all face the challenge of discriminating changes in the levels of proteins of interest from inadvertent changes in the amount loaded for analysis. Mass-spectrometry-based proteomics can now estimate the relative and absolute amounts of thousands of proteins across diverse biological systems. We reasoned that this new technology could prove useful for selection of very stably expressed proteins that could serve as better loading controls than those traditionally employed. Large-scale proteomic analyses of SDS lysates of cultured cells and tissues revealed deglycase DJ-1 as the protein with the lowest variability in abundance among different cell types in human, mouse, and amphibian cells. The protein constitutes 0.069 ± 0.017% of total cellular protein and occurs at a specific concentration of 34.6 ± 8.7 pmol/mg of total protein. Since DJ-1 is ubiquitous and therefore easily detectable with several peptides, it can be helpful in normalization of proteomic data sets. In addition, DJ-1 appears to be an advantageous loading control for Western blot that is superior to those used commonly used, allowing comparisons between tissues and cells originating from evolutionarily distant vertebrate species. Notably, this is not possible by the detection and quantitation of housekeeping proteins, which are often used in the Western blot technique. The approach introduced here can be applied to select the most appropriate loading controls for MS-based proteomics or Western blotting in any biological system.
Arntzen, Magnus Ø; Thiede, Bernd
2012-02-01
Apoptosis is the most commonly described form of programmed cell death, and dysfunction is implicated in a large number of human diseases. Many quantitative proteome analyses of apoptosis have been performed to gain insight in proteins involved in the process. This resulted in large and complex data sets that are difficult to evaluate. Therefore, we developed the ApoptoProteomics database for storage, browsing, and analysis of the outcome of large scale proteome analyses of apoptosis derived from human, mouse, and rat. The proteomics data of 52 publications were integrated and unified with protein annotations from UniProt-KB, the caspase substrate database homepage (CASBAH), and gene ontology. Currently, more than 2300 records of more than 1500 unique proteins were included, covering a large proportion of the core signaling pathways of apoptosis. Analysis of the data set revealed a high level of agreement between the reported changes in directionality reported in proteomics studies and expected apoptosis-related function and may disclose proteins without a current recognized involvement in apoptosis based on gene ontology. Comparison between induction of apoptosis by the intrinsic and the extrinsic apoptotic signaling pathway revealed slight differences. Furthermore, proteomics has significantly contributed to the field of apoptosis in identifying hundreds of caspase substrates. The database is available at http://apoptoproteomics.uio.no.
Arntzen, Magnus Ø.; Thiede, Bernd
2012-01-01
Apoptosis is the most commonly described form of programmed cell death, and dysfunction is implicated in a large number of human diseases. Many quantitative proteome analyses of apoptosis have been performed to gain insight in proteins involved in the process. This resulted in large and complex data sets that are difficult to evaluate. Therefore, we developed the ApoptoProteomics database for storage, browsing, and analysis of the outcome of large scale proteome analyses of apoptosis derived from human, mouse, and rat. The proteomics data of 52 publications were integrated and unified with protein annotations from UniProt-KB, the caspase substrate database homepage (CASBAH), and gene ontology. Currently, more than 2300 records of more than 1500 unique proteins were included, covering a large proportion of the core signaling pathways of apoptosis. Analysis of the data set revealed a high level of agreement between the reported changes in directionality reported in proteomics studies and expected apoptosis-related function and may disclose proteins without a current recognized involvement in apoptosis based on gene ontology. Comparison between induction of apoptosis by the intrinsic and the extrinsic apoptotic signaling pathway revealed slight differences. Furthermore, proteomics has significantly contributed to the field of apoptosis in identifying hundreds of caspase substrates. The database is available at http://apoptoproteomics.uio.no. PMID:22067098
Pacific Northwest National Laboratory (PNNL) investigators in the Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI), announces the public release of 98 targeted mass spectrometry-based assays for ovarian cancer research studies. Chosen based on proteogenomic observations from the recently published multi-institutional collaborative project between PNNL and Johns Hopkins University that comprehensively examined the collections of proteins in the tumors of ovarian cancer patients (highlighted in a paper in
Timms, John F; Hale, Oliver J; Cramer, Rainer
2016-06-01
The last 20 years have seen significant improvements in the analytical capabilities of biological mass spectrometry (MS). Studies using advanced MS have resulted in new insights into cell biology and the etiology of diseases as well as its use in clinical applications. This review discusses recent developments in MS-based technologies and their cancer-related applications with a focus on proteomics. It also discusses the issues around translating the research findings to the clinic and provides an outline of where the field is moving. Expert commentary: Proteomics has been problematic to adapt for the clinical setting. However, MS-based techniques continue to demonstrate potential in novel clinical uses beyond classical cancer proteomics.
USDA-ARS?s Scientific Manuscript database
Cold-induced sweetening in potato tubers is a costly problem for food industry. To systematically identify the proteins associated with this process, we employed a comparative proteomics approach using isobaric, stable isotope coded labels to compare the proteomes of potato tubers after 0 and 5 mont...
Nouri, Mohammad-Zaman; Komatsu, Setsuko
2010-05-01
To study the soybean plasma membrane proteome under osmotic stress, two methods were used: a gel-based and a LC MS/MS-based proteomics method. Two-day-old seedlings were subjected to 10% PEG for 2 days. Plasma membranes were purified from seedlings using a two-phase partitioning method and their purity was verified by measuring ATPase activity. Using the gel-based proteomics, four and eight protein spots were identified as up- and downregulated, respectively, whereas in the nanoLC MS/MS approach, 11 and 75 proteins were identified as up- and downregulated, respectively, under PEG treatment. Out of osmotic stress responsive proteins, most of the transporter proteins and all proteins with high number of transmembrane helices as well as low-abundance proteins could be identified by the LC MS/MS-based method. Three homologues of plasma membrane H(+)-ATPase, which are transporter proteins involved in ion efflux, were upregulated under osmotic stress. Gene expression of this protein was increased after 12 h of stress exposure. Among the identified proteins, seven proteins were mutual in two proteomics techniques, in which calnexin was the highly upregulated protein. Accumulation of calnexin in plasma membrane was confirmed by immunoblot analysis. These results suggest that under hyperosmotic conditions, calnexin accumulates in the plasma membrane and ion efflux accelerates by upregulation of plasma membrane H(+)-ATPase protein.
Lawless, Craig; Hubbard, Simon J.; Fan, Jun; Bessant, Conrad; Hermjakob, Henning; Jones, Andrew R.
2012-01-01
Abstract New methods for performing quantitative proteome analyses based on differential labeling protocols or label-free techniques are reported in the literature on an almost monthly basis. In parallel, a correspondingly vast number of software tools for the analysis of quantitative proteomics data has also been described in the literature and produced by private companies. In this article we focus on the review of some of the most popular techniques in the field and present a critical appraisal of several software packages available to process and analyze the data produced. We also describe the importance of community standards to support the wide range of software, which may assist researchers in the analysis of data using different platforms and protocols. It is intended that this review will serve bench scientists both as a useful reference and a guide to the selection and use of different pipelines to perform quantitative proteomics data analysis. We have produced a web-based tool (http://www.proteosuite.org/?q=other_resources) to help researchers find appropriate software for their local instrumentation, available file formats, and quantitative methodology. PMID:22804616
Proteomics in Heart Failure: Top-down or Bottom-up?
Gregorich, Zachery R.; Chang, Ying-Hua; Ge, Ying
2014-01-01
Summary The pathophysiology of heart failure (HF) is diverse, owing to multiple etiologies and aberrations in a number of cellular processes. Therefore, it is essential to understand how defects in the molecular pathways that mediate cellular responses to internal and external stressors function as a system to drive the HF phenotype. Mass spectrometry (MS)-based proteomics strategies have great potential for advancing our understanding of disease mechanisms at the systems level because proteins are the effector molecules for all cell functions and, thus, are directly responsible for determining cell phenotype. Two MS-based proteomics strategies exist: peptide-based bottom-up and protein-based top-down proteomics—each with its own unique strengths and weaknesses for interrogating the proteome. In this review, we will discuss the advantages and disadvantages of bottom-up and top-down MS for protein identification, quantification, and the analysis of post-translational modifications, as well as highlight how both of these strategies have contributed to our understanding of the molecular and cellular mechanisms underlying HF. Additionally, the challenges associated with both proteomics approaches will be discussed and insights will be offered regarding the future of MS-based proteomics in HF research. PMID:24619480
Achievements and perspectives of top-down proteomics.
Armirotti, Andrea; Damonte, Gianluca
2010-10-01
Over the last years, top-down (TD) MS has gained a remarkable space in proteomics, rapidly trespassing the limit between a promising approach and a solid, established technique. Several research groups worldwide have implemented TD analysis in their routine work on proteomics, deriving structural information on proteins with the level of accuracy that is impossible to achieve with classical bottom-up approaches. Complete maps of PTMs and assessment of single aminoacid polymorphisms are only a few of the results that can be obtained with this technique. Despite some existing technical and economical limitations, TD analysis is at present the most powerful instrument for MS-based proteomics and its implementation in routine workflow is a rapidly approaching turning point in proteomics. In this review article, the state-of-the-art of TD approach is described along with its major advantages and drawbacks and the most recent trends in TD analysis are discussed. References for all the covered topics are reported in the text, with the aim to support both newcomers and mass spectrometrists already introduced to TD proteomics.
Wu, Qi; Yuan, Huiming; Zhang, Lihua; Zhang, Yukui
2012-06-20
With the acceleration of proteome research, increasing attention has been paid to multidimensional liquid chromatography-mass spectrometry (MDLC-MS) due to its high peak capacity and separation efficiency. Recently, many efforts have been put to improve MDLC-based strategies including "top-down" and "bottom-up" to enable highly sensitive qualitative and quantitative analysis of proteins, as well as accelerate the whole analytical procedure. Integrated platforms with combination of sample pretreatment, multidimensional separations and identification were also developed to achieve high throughput and sensitive detection of proteomes, facilitating highly accurate and reproducible quantification. This review summarized the recent advances of such techniques and their applications in qualitative and quantitative analysis of proteomes. Copyright © 2012 Elsevier B.V. All rights reserved.
Proteomic analysis of tissue samples in translational breast cancer research.
Gromov, Pavel; Moreira, José M A; Gromova, Irina
2014-06-01
In the last decade, many proteomic technologies have been applied, with varying success, to the study of tissue samples of breast carcinoma for protein expression profiling in order to discover protein biomarkers/signatures suitable for: characterization and subtyping of tumors; early diagnosis, and both prognosis and prediction of outcome of chemotherapy. The purpose of this review is to critically appraise what has been achieved to date using proteomic technologies and to bring forward novel strategies - based on the analysis of clinically relevant samples - that promise to accelerate the translation of basic discoveries into the daily breast cancer clinical practice. In particular, we address major issues in experimental design by reviewing the strengths and weaknesses of current proteomic strategies in the context of the analysis of human breast tissue specimens.
Di Silvestre, Dario; Brambilla, Francesca; Scardoni, Giovanni; Brunetti, Pietro; Motta, Sara; Matteucci, Marco; Laudanna, Carlo; Recchia, Fabio A; Lionetti, Vincenzo; Mauri, Pierluigi
2017-05-01
We have demonstrated that intramyocardial delivery of human mesenchymal stem cells preconditioned with a hyaluronan mixed ester of butyric and retinoic acid (MSCp + ) is more effective in preventing the decay of regional myocardial contractility in a swine model of myocardial infarction (MI). However, the understanding of the role of MSCp + in proteomic remodeling of cardiac infarcted tissue is not complete. We therefore sought to perform a comprehensive analysis of the proteome of infarct remote (RZ) and border zone (BZ) of pigs treated with MSCp + or unconditioned stem cells. Heart tissues were analyzed by MudPIT and differentially expressed proteins were selected by a label-free approach based on spectral counting. Protein profiles were evaluated by using PPI networks and their topological analysis. The proteomic remodeling was largely prevented in MSCp + group. Extracellular proteins involved in fibrosis were down-regulated, while energetic pathways were globally up-regulated. Cardioprotectant pathways involved in the production of keto acid metabolites were also activated. Additionally, we found that new hub proteins support the cardioprotective phenotype characterizing the left ventricular BZ treated with MSCp + . In fact, the up-regulation of angiogenic proteins NCL and RAC1 can be explained by the increase of capillary density induced by MSCp + . Our results show that angiogenic pathways appear to be uniquely positioned to integrate signaling with energetic pathways involving cardiac repair. Our findings prompt the use of proteomics-based network analysis to optimize new approaches preventing the post-ischemic proteomic remodeling that may underlie the limited self-repair ability of adult heart. Copyright © 2017 Elsevier B.V. All rights reserved.
Hamzeiy, Hamid; Cox, Jürgen
2017-02-01
Computational workflows for mass spectrometry-based shotgun proteomics and untargeted metabolomics share many steps. Despite the similarities, untargeted metabolomics is lagging behind in terms of reliable fully automated quantitative data analysis. We argue that metabolomics will strongly benefit from the adaptation of successful automated proteomics workflows to metabolomics. MaxQuant is a popular platform for proteomics data analysis and is widely considered to be superior in achieving high precursor mass accuracies through advanced nonlinear recalibration, usually leading to five to ten-fold better accuracy in complex LC-MS/MS runs. This translates to a sharp decrease in the number of peptide candidates per measured feature, thereby strongly improving the coverage of identified peptides. We argue that similar strategies can be applied to untargeted metabolomics, leading to equivalent improvements in metabolite identification. Copyright © 2016 The Author(s). Published by Elsevier Ltd.. All rights reserved.
A systems biology-led insight into the role of the proteome in neurodegenerative diseases.
Fasano, Mauro; Monti, Chiara; Alberio, Tiziana
2016-09-01
Multifactorial disorders are the result of nonlinear interactions of several factors; therefore, a reductionist approach does not appear to be appropriate. Proteomics is a global approach that can be efficiently used to investigate pathogenetic mechanisms of neurodegenerative diseases. Here, we report a general introduction about the systems biology approach and mechanistic insights recently obtained by over-representation analysis of proteomics data of cellular and animal models of Alzheimer's disease, Parkinson's disease and other neurodegenerative disorders, as well as of affected human tissues. Expert commentary: As an inductive method, proteomics is based on unbiased observations that further require validation of generated hypotheses. Pathway databases and over-representation analysis tools allow researchers to assign an expectation value to pathogenetic mechanisms linked to neurodegenerative diseases. The systems biology approach based on omics data may be the key to unravel the complex mechanisms underlying neurodegeneration.
P-MartCancer-Interactive Online Software to Enable Analysis of Shotgun Cancer Proteomic Datasets.
Webb-Robertson, Bobbie-Jo M; Bramer, Lisa M; Jensen, Jeffrey L; Kobold, Markus A; Stratton, Kelly G; White, Amanda M; Rodland, Karin D
2017-11-01
P-MartCancer is an interactive web-based software environment that enables statistical analyses of peptide or protein data, quantitated from mass spectrometry-based global proteomics experiments, without requiring in-depth knowledge of statistical programming. P-MartCancer offers a series of statistical modules associated with quality assessment, peptide and protein statistics, protein quantification, and exploratory data analyses driven by the user via customized workflows and interactive visualization. Currently, P-MartCancer offers access and the capability to analyze multiple cancer proteomic datasets generated through the Clinical Proteomics Tumor Analysis Consortium at the peptide, gene, and protein levels. P-MartCancer is deployed as a web service (https://pmart.labworks.org/cptac.html), alternatively available via Docker Hub (https://hub.docker.com/r/pnnl/pmart-web/). Cancer Res; 77(21); e47-50. ©2017 AACR . ©2017 American Association for Cancer Research.
Jimenez, Connie R; Piersma, Sander; Pham, Thang V
2007-12-01
Proteomics aims to create a link between genomic information, biological function and disease through global studies of protein expression, modification and protein-protein interactions. Recent advances in key proteomics tools, such as mass spectrometry (MS) and (bio)informatics, provide tremendous opportunities for biomarker-related clinical applications. In this review, we focus on two complementary MS-based approaches with high potential for the discovery of biomarker patterns and low-abundant candidate biomarkers in biofluids: high-throughput matrix-assisted laser desorption/ionization time-of-flight mass spectroscopy-based methods for peptidome profiling and label-free liquid chromatography-based methods coupled to MS for in-depth profiling of biofluids with a focus on subproteomes, including the low-molecular-weight proteome, carrier-bound proteome and N-linked glycoproteome. The two approaches differ in their aims, throughput and sensitivity. We discuss recent progress and challenges in the analysis of plasma/serum and proximal fluids using these strategies and highlight the potential of liquid chromatography-MS-based proteomics of cancer cell and tumor secretomes for the discovery of candidate blood-based biomarkers. Strategies for candidate validation are also described.
EBprot: Statistical analysis of labeling-based quantitative proteomics data.
Koh, Hiromi W L; Swa, Hannah L F; Fermin, Damian; Ler, Siok Ghee; Gunaratne, Jayantha; Choi, Hyungwon
2015-08-01
Labeling-based proteomics is a powerful method for detection of differentially expressed proteins (DEPs). The current data analysis platform typically relies on protein-level ratios, which is obtained by summarizing peptide-level ratios for each protein. In shotgun proteomics, however, some proteins are quantified with more peptides than others, and this reproducibility information is not incorporated into the differential expression (DE) analysis. Here, we propose a novel probabilistic framework EBprot that directly models the peptide-protein hierarchy and rewards the proteins with reproducible evidence of DE over multiple peptides. To evaluate its performance with known DE states, we conducted a simulation study to show that the peptide-level analysis of EBprot provides better receiver-operating characteristic and more accurate estimation of the false discovery rates than the methods based on protein-level ratios. We also demonstrate superior classification performance of peptide-level EBprot analysis in a spike-in dataset. To illustrate the wide applicability of EBprot in different experimental designs, we applied EBprot to a dataset for lung cancer subtype analysis with biological replicates and another dataset for time course phosphoproteome analysis of EGF-stimulated HeLa cells with multiplexed labeling. Through these examples, we show that the peptide-level analysis of EBprot is a robust alternative to the existing statistical methods for the DE analysis of labeling-based quantitative datasets. The software suite is freely available on the Sourceforge website http://ebprot.sourceforge.net/. All MS data have been deposited in the ProteomeXchange with identifier PXD001426 (http://proteomecentral.proteomexchange.org/dataset/PXD001426/). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Lavallée-Adam, Mathieu
2017-01-01
PSEA-Quant analyzes quantitative mass spectrometry-based proteomics datasets to identify enrichments of annotations contained in repositories such as the Gene Ontology and Molecular Signature databases. It allows users to identify the annotations that are significantly enriched for reproducibly quantified high abundance proteins. PSEA-Quant is available on the web and as a command-line tool. It is compatible with all label-free and isotopic labeling-based quantitative proteomics methods. This protocol describes how to use PSEA-Quant and interpret its output. The importance of each parameter as well as troubleshooting approaches are also discussed. PMID:27010334
Liu, Kehui; Zhang, Jiyang; Fu, Bin; Xie, Hongwei; Wang, Yingchun; Qian, Xiaohong
2014-07-01
Precise protein quantification is essential in comparative proteomics. Currently, quantification bias is inevitable when using proteotypic peptide-based quantitative proteomics strategy for the differences in peptides measurability. To improve quantification accuracy, we proposed an "empirical rule for linearly correlated peptide selection (ERLPS)" in quantitative proteomics in our previous work. However, a systematic evaluation on general application of ERLPS in quantitative proteomics under diverse experimental conditions needs to be conducted. In this study, the practice workflow of ERLPS was explicitly illustrated; different experimental variables, such as, different MS systems, sample complexities, sample preparations, elution gradients, matrix effects, loading amounts, and other factors were comprehensively investigated to evaluate the applicability, reproducibility, and transferability of ERPLS. The results demonstrated that ERLPS was highly reproducible and transferable within appropriate loading amounts and linearly correlated response peptides should be selected for each specific experiment. ERLPS was used to proteome samples from yeast to mouse and human, and in quantitative methods from label-free to O18/O16-labeled and SILAC analysis, and enabled accurate measurements for all proteotypic peptide-based quantitative proteomics over a large dynamic range. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The emergence of top-down proteomics in clinical research
2013-01-01
Proteomic technology has advanced steadily since the development of 'soft-ionization' techniques for mass-spectrometry-based molecular identification more than two decades ago. Now, the large-scale analysis of proteins (proteomics) is a mainstay of biological research and clinical translation, with researchers seeking molecular diagnostics, as well as protein-based markers for personalized medicine. Proteomic strategies using the protease trypsin (known as bottom-up proteomics) were the first to be developed and optimized and form the dominant approach at present. However, researchers are now beginning to understand the limitations of bottom-up techniques, namely the inability to characterize and quantify intact protein molecules from a complex mixture of digested peptides. To overcome these limitations, several laboratories are taking a whole-protein-based approach, in which intact protein molecules are the analytical targets for characterization and quantification. We discuss these top-down techniques and how they have been applied to clinical research and are likely to be applied in the near future. Given the recent improvements in mass-spectrometry-based proteomics and stronger cooperation between researchers, clinicians and statisticians, both peptide-based (bottom-up) strategies and whole-protein-based (top-down) strategies are set to complement each other and help researchers and clinicians better understand and detect complex disease phenotypes. PMID:23806018
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Xing; Xu, Yanli; Meng, Qian
Colorectal cancer (CRC) is one of the most common types of malignant tumor worldwide. Currently, although many researchers have been devoting themselves in CRC studies, the process of locating biomarkers for CRC early diagnosis and prognostic is still very slow. Using a centrifugal proteomic reactor-based proteomic analysis of minute amount of colonic biopsies by enteroscopy sampling, 2620 protein groups were quantified between cancer mucosa and adjacent normal colorectal mucosa. Of which, 403 protein groups were differentially expressed with statistic significance between cancer and normal tissues, including 195 up-regulated and 208 down-regulated proteins in cancer tissues. Three proteins (SOD3, PRELP andmore » NGAL) were selected for further Western blot validation. And the resulting Western blot experimental results were consistent with the quantitative proteomic data. SOD3 and PRELP are down-regulated in CRC mucosa comparing to adjacent normal tissue, while NGAL is up-regulated in CRC mucosa. In conclusion, the centrifugal proteomic reactor-based label-free quantitative proteomic approach provides a highly sensitive and powerful tool for analyzing minute protein sample from tiny colorectal biopsies, which may facilitate CRC biomarkers discovery for diagnoses and prognoses. -- Highlights: •Minute amount of colonic biopsies by endoscopy is suitable for proteomic analysis. •Centrifugal proteomic reactor can be used for processing tiny clinic biopsy sample. •SOD3 and PRELP are down-regulated in CRC, while NGAL is up-regulated in CRC.« less
To label or not to label: applications of quantitative proteomics in neuroscience research.
Filiou, Michaela D; Martins-de-Souza, Daniel; Guest, Paul C; Bahn, Sabine; Turck, Christoph W
2012-02-01
Proteomics has provided researchers with a sophisticated toolbox of labeling-based and label-free quantitative methods. These are now being applied in neuroscience research where they have already contributed to the elucidation of fundamental mechanisms and the discovery of candidate biomarkers. In this review, we evaluate and compare labeling-based and label-free quantitative proteomic techniques for applications in neuroscience research. We discuss the considerations required for the analysis of brain and central nervous system specimens, the experimental design of quantitative proteomic workflows as well as the feasibility, advantages, and disadvantages of the available techniques for neuroscience-oriented questions. Furthermore, we assess the use of labeled standards as internal controls for comparative studies in humans and review applications of labeling-based and label-free mass spectrometry approaches in relevant model organisms and human subjects. Providing a comprehensive guide of feasible and meaningful quantitative proteomic methodologies for neuroscience research is crucial not only for overcoming current limitations but also for gaining useful insights into brain function and translating proteomics from bench to bedside. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Alberio, Tiziana; Pieroni, Luisa; Ronci, Maurizio; Banfi, Cristina; Bongarzone, Italia; Bottoni, Patrizia; Brioschi, Maura; Caterino, Marianna; Chinello, Clizia; Cormio, Antonella; Cozzolino, Flora; Cunsolo, Vincenzo; Fontana, Simona; Garavaglia, Barbara; Giusti, Laura; Greco, Viviana; Lucacchini, Antonio; Maffioli, Elisa; Magni, Fulvio; Monteleone, Francesca; Monti, Maria; Monti, Valentina; Musicco, Clara; Petrosillo, Giuseppe; Porcelli, Vito; Saletti, Rosaria; Scatena, Roberto; Soggiu, Alessio; Tedeschi, Gabriella; Zilocchi, Mara; Roncada, Paola; Urbani, Andrea; Fasano, Mauro
2017-12-01
The Mitochondrial Human Proteome Project aims at understanding the function of the mitochondrial proteome and its crosstalk with the proteome of other organelles. Being able to choose a suitable and validated enrichment protocol of functional mitochondria, based on the specific needs of the downstream proteomics analysis, would greatly help the researchers in the field. Mitochondrial fractions from ten model cell lines were prepared using three enrichment protocols and analyzed on seven different LC-MS/MS platforms. All data were processed using neXtProt as reference database. The data are available for the Human Proteome Project purposes through the ProteomeXchange Consortium with the identifier PXD007053. The processed data sets were analyzed using a suite of R routines to perform a statistical analysis and to retrieve subcellular and submitochondrial localizations. Although the overall number of identified total and mitochondrial proteins was not significantly dependent on the enrichment protocol, specific line to line differences were observed. Moreover, the protein lists were mapped to a network representing the functional mitochondrial proteome, encompassing mitochondrial proteins and their first interactors. More than 80% of the identified proteins resulted in nodes of this network but with a different ability in coisolating mitochondria-associated structures for each enrichment protocol/cell line pair.
Analysis of high accuracy, quantitative proteomics data in the MaxQB database.
Schaab, Christoph; Geiger, Tamar; Stoehr, Gabriele; Cox, Juergen; Mann, Matthias
2012-03-01
MS-based proteomics generates rapidly increasing amounts of precise and quantitative information. Analysis of individual proteomic experiments has made great strides, but the crucial ability to compare and store information across different proteome measurements still presents many challenges. For example, it has been difficult to avoid contamination of databases with low quality peptide identifications, to control for the inflation in false positive identifications when combining data sets, and to integrate quantitative data. Although, for example, the contamination with low quality identifications has been addressed by joint analysis of deposited raw data in some public repositories, we reasoned that there should be a role for a database specifically designed for high resolution and quantitative data. Here we describe a novel database termed MaxQB that stores and displays collections of large proteomics projects and allows joint analysis and comparison. We demonstrate the analysis tools of MaxQB using proteome data of 11 different human cell lines and 28 mouse tissues. The database-wide false discovery rate is controlled by adjusting the project specific cutoff scores for the combined data sets. The 11 cell line proteomes together identify proteins expressed from more than half of all human genes. For each protein of interest, expression levels estimated by label-free quantification can be visualized across the cell lines. Similarly, the expression rank order and estimated amount of each protein within each proteome are plotted. We used MaxQB to calculate the signal reproducibility of the detected peptides for the same proteins across different proteomes. Spearman rank correlation between peptide intensity and detection probability of identified proteins was greater than 0.8 for 64% of the proteome, whereas a minority of proteins have negative correlation. This information can be used to pinpoint false protein identifications, independently of peptide database scores. The information contained in MaxQB, including high resolution fragment spectra, is accessible to the community via a user-friendly web interface at http://www.biochem.mpg.de/maxqb.
Liquid chromatography tandem-mass spectrometry (LC-MS/MS)- based methods such as isobaric tags for relative and absolute quantification (iTRAQ) and tandem mass tags (TMT) have been shown to provide overall better quantification accuracy and reproducibility over other LC-MS/MS techniques. However, large scale projects like the Clinical Proteomic Tumor Analysis Consortium (CPTAC) require comparisons across many genomically characterized clinical specimens in a single study and often exceed the capability of traditional iTRAQ-based quantification.
Wimmer, Helge; Gundacker, Nina C; Griss, Johannes; Haudek, Verena J; Stättner, Stefan; Mohr, Thomas; Zwickl, Hannes; Paulitschke, Verena; Baron, David M; Trittner, Wolfgang; Kubicek, Markus; Bayer, Editha; Slany, Astrid; Gerner, Christopher
2009-06-01
Interpretation of proteome data with a focus on biomarker discovery largely relies on comparative proteome analyses. Here, we introduce a database-assisted interpretation strategy based on proteome profiles of primary cells. Both 2-D-PAGE and shotgun proteomics are applied. We obtain high data concordance with these two different techniques. When applying mass analysis of tryptic spot digests from 2-D gels of cytoplasmic fractions, we typically identify several hundred proteins. Using the same protein fractions, we usually identify more than thousand proteins by shotgun proteomics. The data consistency obtained when comparing these independent data sets exceeds 99% of the proteins identified in the 2-D gels. Many characteristic differences in protein expression of different cells can thus be independently confirmed. Our self-designed SQL database (CPL/MUW - database of the Clinical Proteomics Laboratories at the Medical University of Vienna accessible via www.meduniwien.ac.at/proteomics/database) facilitates (i) quality management of protein identification data, which are based on MS, (ii) the detection of cell type-specific proteins and (iii) of molecular signatures of specific functional cell states. Here, we demonstrate, how the interpretation of proteome profiles obtained from human liver tissue and hepatocellular carcinoma tissue is assisted by the Clinical Proteomics Laboratories at the Medical University of Vienna-database. Therefore, we suggest that the use of reference experiments supported by a tailored database may substantially facilitate data interpretation of proteome profiling experiments.
Couto, Narciso; Schooling, Sarah R; Dutcher, John R; Barber, Jill
2015-10-02
In the present work, two different proteomic platforms, gel-based and gel-free, were used to map the matrix and outer membrane vesicle exoproteomes of Pseudomonas aeruginosa PAO1 biofilms. These two proteomic strategies allowed us a confident identification of 207 and 327 proteins from enriched outer membrane vesicles and whole matrix isolated from biofilms. Because of the physicochemical characteristics of these subproteomes, the two strategies showed complementarity, and thus, the most comprehensive analysis of P. aeruginosa exoproteome to date was achieved. Under our conditions, outer membrane vesicles contribute approximately 20% of the whole matrix proteome, demonstrating that membrane vesicles are an important component of the matrix. The proteomic profiles were analyzed in terms of their biological context, namely, a biofilm. Accordingly relevant metabolic processes involved in cellular adaptation to the biofilm lifestyle as well as those related to P. aeruginosa virulence capabilities were a key feature of the analyses. The diversity of the matrix proteome corroborates the idea of high heterogeneity within the biofilm; cells can display different levels of metabolism and can adapt to local microenvironments making this proteomic analysis challenging. In addition to analyzing our own primary data, we extend the analysis to published data by other groups in order to deepen our understanding of the complexity inherent within biofilm populations.
Assembling proteomics data as a prerequisite for the analysis of large scale experiments
Schmidt, Frank; Schmid, Monika; Thiede, Bernd; Pleißner, Klaus-Peter; Böhme, Martina; Jungblut, Peter R
2009-01-01
Background Despite the complete determination of the genome sequence of a huge number of bacteria, their proteomes remain relatively poorly defined. Beside new methods to increase the number of identified proteins new database applications are necessary to store and present results of large- scale proteomics experiments. Results In the present study, a database concept has been developed to address these issues and to offer complete information via a web interface. In our concept, the Oracle based data repository system SQL-LIMS plays the central role in the proteomics workflow and was applied to the proteomes of Mycobacterium tuberculosis, Helicobacter pylori, Salmonella typhimurium and protein complexes such as 20S proteasome. Technical operations of our proteomics labs were used as the standard for SQL-LIMS template creation. By means of a Java based data parser, post-processed data of different approaches, such as LC/ESI-MS, MALDI-MS and 2-D gel electrophoresis (2-DE), were stored in SQL-LIMS. A minimum set of the proteomics data were transferred in our public 2D-PAGE database using a Java based interface (Data Transfer Tool) with the requirements of the PEDRo standardization. Furthermore, the stored proteomics data were extractable out of SQL-LIMS via XML. Conclusion The Oracle based data repository system SQL-LIMS played the central role in the proteomics workflow concept. Technical operations of our proteomics labs were used as standards for SQL-LIMS templates. Using a Java based parser, post-processed data of different approaches such as LC/ESI-MS, MALDI-MS and 1-DE and 2-DE were stored in SQL-LIMS. Thus, unique data formats of different instruments were unified and stored in SQL-LIMS tables. Moreover, a unique submission identifier allowed fast access to all experimental data. This was the main advantage compared to multi software solutions, especially if personnel fluctuations are high. Moreover, large scale and high-throughput experiments must be managed in a comprehensive repository system such as SQL-LIMS, to query results in a systematic manner. On the other hand, these database systems are expensive and require at least one full time administrator and specialized lab manager. Moreover, the high technical dynamics in proteomics may cause problems to adjust new data formats. To summarize, SQL-LIMS met the requirements of proteomics data handling especially in skilled processes such as gel-electrophoresis or mass spectrometry and fulfilled the PSI standardization criteria. The data transfer into a public domain via DTT facilitated validation of proteomics data. Additionally, evaluation of mass spectra by post-processing using MS-Screener improved the reliability of mass analysis and prevented storage of data junk. PMID:19166578
Zhang, Ting; Guo, Yueshuai; Guo, Xuejiang; Zhou, Tao; Chen, Daozhen; Xiang, Jingying; Zhou, Zuomin
2013-01-01
Intrahepatic cholestasis of pregnancy (ICP) usually occurs in the third trimester and associated with increased risks in fetal complications. Currently, the exact cause of this disease is unknown. In this study we aim to investigate the potential proteins in placenta, which may participate in the molecular mechanisms of ICP-related fetal complications using iTRAQ-based proteomics approach. The iTRAQ analysis combined with liquid chromatography-tandem mass spectrometry (LC-MS/MS) was performed to separate differentially expressed placental proteins from 4 pregnant women with ICP and 4 healthy pregnant women. Bioinformatics analysis was used to find the relative processes that these differentially expressed proteins were involved in. Three apoptosis related proteins ERp29, PRDX6 and MPO that resulted from iTRAQ-based proteomics were further verified in placenta by Western blotting and immunohistochemistry. Placental apoptosis was also detected by TUNEL assay. Proteomics results showed there were 38 differentially expressed proteins from pregnant women with ICP and healthy pregnant women, 29 were upregulated and 9 were downregulated in placenta from pregnant women with ICP. Bioinformatics analysis showed most of the identified proteins was functionally related to specific cell processes, including apoptosis, oxidative stress, lipid metabolism. The expression levels of ERp29, PRDX6 and MPO were consistent with the proteomics data. The apoptosis index in placenta from ICP patients was significantly increased. This preliminary work provides a better understanding of the proteomic alterations of placenta from pregnant women with ICP and may provide us some new insights into the pathophysiology and potential novel treatment targets for ICP.
Bilić, Petra; Guillemin, Nicolas; Kovačević, Alan; Beer Ljubić, Blanka; Jović, Ines; Galan, Asier; Eckersall, Peter David; Burchmore, Richard; Mrljak, Vladimir
2018-05-15
Idiopathic dilated cardiomyopathy (iDCM) is a primary myocardial disorder with an unknown aetiology, characterized by reduced contractility and ventricular dilation of the left or both ventricles. Naturally occurring canine iDCM was used herein to identify serum proteomic signature of the disease compared to the healthy state, providing an insight into underlying mechanisms and revealing proteins with biomarker potential. To achieve this, we used high-throughput label-based quantitative LC-MS/MS proteomics approach and bioinformatics analysis of the in silico inferred interactome protein network created from the initial list of differential proteins. To complement the proteomic analysis, serum biochemical parameters and levels of know biomarkers of cardiac function were measured. Several proteins with biomarker potential were identified, such as inter-alpha-trypsin inhibitor heavy chain H4, microfibril-associated glycoprotein 4 and apolipoprotein A-IV, which were validated using an independent method (Western blotting) and showed high specificity and sensitivity according to the receiver operating characteristic curve analysis. Bioinformatics analysis revealed involvement of different pathways in iDCM, such as complement cascade activation, lipoprotein particles dynamics, elastic fibre formation, GPCR signalling and respiratory electron transport chain. Idiopathic dilated cardiomyopathy is a severe primary myocardial disease of unknown cause, affecting both humans and dogs. This study is a contribution to the canine heart disease research by means of proteomic and bioinformatic state of the art analyses, following similar approach in human iDCM research. Importantly, we used serum as non-invasive and easily accessible biological source of information and contributed to the scarce data on biofluid proteome research on this topic. Bioinformatics analysis revealed biological pathways modulated in canine iDCM with potential of further targeted research. Also, several proteins with biomarker potential have been identified and successfully validated. Copyright © 2018 Elsevier B.V. All rights reserved.
Birth of plant proteomics in India: a new horizon.
Narula, Kanika; Pandey, Aarti; Gayali, Saurabh; Chakraborty, Niranjan; Chakraborty, Subhra
2015-09-08
In the post-genomic era, proteomics is acknowledged as the next frontier for biological research. Although India has a long and distinguished tradition in protein research, the initiation of proteomics studies was a new horizon. Protein research witnessed enormous progress in protein separation, high-resolution refinements, biochemical identification of the proteins, protein-protein interaction, and structure-function analysis. Plant proteomics research, in India, began its journey on investigation of the proteome profiling, complexity analysis, protein trafficking, and biochemical modeling. The research article by Bhushan et al. in 2006 marked the birth of the plant proteomics research in India. Since then plant proteomics studies expanded progressively and are now being carried out in various institutions spread across the country. The compilation presented here seeks to trace the history of development in the area during the past decade based on publications till date. In this review, we emphasize on outcomes of the field providing prospects on proteomic pathway analyses. Finally, we discuss the connotation of strategies and the potential that would provide the framework of plant proteome research. The past decades have seen rapidly growing number of sequenced plant genomes and associated genomic resources. To keep pace with this increasing body of data, India is in the provisional phase of proteomics research to develop a comparative hub for plant proteomes and protein families, but it requires a strong impetus from intellectuals, entrepreneurs, and government agencies. Here, we aim to provide an overview of past, present and future of Indian plant proteomics, which would serve as an evaluation platform for those seeking to incorporate proteomics into their research programs. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.
Bianco, Linda; Perrotta, Gaetano
2015-01-01
Filamentous fungi possess the extraordinary ability to digest complex biomasses and mineralize numerous xenobiotics, as consequence of their aptitude to sensing the environment and regulating their intra and extra cellular proteins, producing drastic changes in proteome and secretome composition. Recent advancement in proteomic technologies offers an exciting opportunity to reveal the fluctuations of fungal proteins and enzymes, responsible for their metabolic adaptation to a large variety of environmental conditions. Here, an overview of the most commonly used proteomic strategies will be provided; this paper will range from sample preparation to gel-free and gel-based proteomics, discussing pros and cons of each mentioned state-of-the-art technique. The main focus will be kept on filamentous fungi. Due to the biotechnological relevance of lignocellulose degrading fungi, special attention will be finally given to their extracellular proteome, or secretome. Secreted proteins and enzymes will be discussed in relation to their involvement in bio-based processes, such as biomass deconstruction and mycoremediation. PMID:25775160
Bianco, Linda; Perrotta, Gaetano
2015-03-12
Filamentous fungi possess the extraordinary ability to digest complex biomasses and mineralize numerous xenobiotics, as consequence of their aptitude to sensing the environment and regulating their intra and extra cellular proteins, producing drastic changes in proteome and secretome composition. Recent advancement in proteomic technologies offers an exciting opportunity to reveal the fluctuations of fungal proteins and enzymes, responsible for their metabolic adaptation to a large variety of environmental conditions. Here, an overview of the most commonly used proteomic strategies will be provided; this paper will range from sample preparation to gel-free and gel-based proteomics, discussing pros and cons of each mentioned state-of-the-art technique. The main focus will be kept on filamentous fungi. Due to the biotechnological relevance of lignocellulose degrading fungi, special attention will be finally given to their extracellular proteome, or secretome. Secreted proteins and enzymes will be discussed in relation to their involvement in bio-based processes, such as biomass deconstruction and mycoremediation.
Statistical design of quantitative mass spectrometry-based proteomic experiments.
Oberg, Ann L; Vitek, Olga
2009-05-01
We review the fundamental principles of statistical experimental design, and their application to quantitative mass spectrometry-based proteomics. We focus on class comparison using Analysis of Variance (ANOVA), and discuss how randomization, replication and blocking help avoid systematic biases due to the experimental procedure, and help optimize our ability to detect true quantitative changes between groups. We also discuss the issues of pooling multiple biological specimens for a single mass analysis, and calculation of the number of replicates in a future study. When applicable, we emphasize the parallels between designing quantitative proteomic experiments and experiments with gene expression microarrays, and give examples from that area of research. We illustrate the discussion using theoretical considerations, and using real-data examples of profiling of disease.
Data from quantitative label free proteomics analysis of rat spleen.
Dudekula, Khadar; Le Bihan, Thierry
2016-09-01
The dataset presented in this work has been obtained using a label-free quantitative proteomic analysis of rat spleen. A robust method for extraction of proteins from rat spleen tissue and LC-MS-MS analysis was developed using a urea and SDS-based buffer. Different fractionation methods were compared. A total of 3484 different proteins were identified from the pool of all experiments run in this study (a total of 2460 proteins with at least two peptides). A total of 1822 proteins were identified from nine non-fractionated pulse gels, 2288 proteins and 2864 proteins were identified by SDS-PAGE fractionation into three and five fractions respectively. The proteomics data are deposited in ProteomeXchange Consortium via PRIDE PXD003520, Progenesis and Maxquant output are presented in the supported information. The generated list of proteins under different regimes of fractionation allow assessing the nature of the identified proteins; variability in the quantitative analysis associated with the different sampling strategy and allow defining a proper number of replicates for future quantitative analysis.
Rice proteome analysis: a step toward functional analysis of the rice genome.
Komatsu, Setsuko; Tanaka, Naoki
2005-03-01
The technique of proteome analysis using 2-DE has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this review, we describe construction of the rice proteome database, the cataloging of rice proteins, and the functional characterization of some of the proteins identified. Initially, proteins extracted from various tissues and organelles were separated by 2-DE and an image analyzer was used to construct a display or reference map of the proteins. The rice proteome database currently contains 23 reference maps based on 2-DE of proteins from different rice tissues and subcellular compartments. These reference maps comprise 13 129 rice proteins, and the amino acid sequences of 5092 of these proteins are entered in the database. Major proteins involved in growth or stress responses have been identified by using a proteomics approach and some of these proteins have unique functions. Furthermore, initial work has also begun on analyzing the phosphoproteome and protein-protein interactions in rice. The information obtained from the rice proteome database will aid in the molecular cloning of rice genes and in predicting the function of unknown proteins.
Beyond the proteome: Mass Spectrometry Special Interest Group (MS-SIG) at ISMB/ECCB 2013
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ryu, Soyoung; Payne, Samuel H.; Schaab, Christoph
2014-07-02
Mass spectrometry special interest group (MS-SIG) aims to bring together experts from the global research community to discuss highlights and challenges in the field of mass spectrometry (MS)-based proteomics and computational biology. The rapid echnological developments in MS-based proteomics have enabled the generation of a large amount of meaningful information on hundreds to thousands of proteins simultaneously from a biological sample; however, the complexity of the MS data require sophisticated computational algorithms and software for data analysis and interpretation. This year’s MS-SIG meeting theme was ‘Beyond the Proteome’ with major focuses on improving protein identification/quantification and using proteomics data tomore » solve interesting problems in systems biology and clinical research.« less
Making a protein extract from plant pathogenic fungi for gel- and LC-based proteomics.
Fernández, Raquel González; Redondo, Inmaculada; Jorrin-Novo, Jesus V
2014-01-01
Proteomic technologies have become a successful tool to provide relevant information on fungal biology. In the case of plant pathogenic fungi, this approach would allow a deeper knowledge of the interaction and the biological cycle of the pathogen, as well as the identification of pathogenicity and virulence factors. These two elements open up new possibilities for crop disease diagnosis and environment-friendly crop protection. Phytopathogenic fungi, due to its particular cellular characteristics, can be considered as a recalcitrant biological material, which makes it difficult to obtain quality protein samples for proteomic analysis. This chapter focuses on protein extraction for gel- and LC-based proteomics with specific protocols of our current research with Botrytis cinerea.
Completed | Office of Cancer Clinical Proteomics Research
Prior to the current Clinical Proteomic Tumor Analysis Consortium (CPTAC), previously funded initiatives associated with clinical proteomics research included: Clinical Proteomic Tumor Analysis Consortium (CPTAC 2.0) Clinical Proteomic Technologies for Cancer Initiative (CPTC) Mouse Proteomic Technologies Initiative
Sandra, Koen; Moshir, Mahan; D'hondt, Filip; Tuytten, Robin; Verleysen, Katleen; Kas, Koen; François, Isabelle; Sandra, Pat
2009-04-15
Multidimensional liquid-based separation techniques are described for maximizing the resolution of the enormous number of peptides generated upon tryptic digestion of proteomes, and hence, reduce the spatial and temporal complexity of the sample to a level that allows successful mass spectrometric analysis. This review complements the previous contribution on unidimensional high performance liquid chromatography (HPLC). Both chromatography and electrophoresis will be discussed albeit with reversed-phase HPLC (RPLC) as the final separation dimension prior to MS analysis.
Proteomic platform for the identification of proteins in olive (Olea europaea) pulp.
Capriotti, Anna Laura; Cavaliere, Chiara; Foglia, Patrizia; Piovesana, Susy; Samperi, Roberto; Stampachiacchiere, Serena; Laganà, Aldo
2013-10-24
The nutritional and cancer-protective properties of the oil extracted mechanically from the ripe fruits of Olea europaea trees are attracting constantly more attention worldwide. The preparation of high-quality protein samples from plant tissues for proteomic analysis poses many challenging problems. In this study we employed a proteomic platform based on two different extraction methods, SDS and CHAPS based protocols, followed by two precipitation protocols, TCA/acetone and MeOH precipitation, in order to increase the final number of identified proteins. The use of advanced MS techniques in combination with the Swissprot and NCBI Viridiplantae databases and TAIR10 Arabidopsis database allowed us to identify 1265 proteins, of which 22 belong to O. europaea. The application of this proteomic platform for protein extraction and identification will be useful also for other proteomic studies on recalcitrant plant/fruit tissues. Copyright © 2013. Published by Elsevier B.V.
Proteomics of Plant Pathogenic Fungi
González-Fernández, Raquel; Prats, Elena; Jorrín-Novo, Jesús V.
2010-01-01
Plant pathogenic fungi cause important yield losses in crops. In order to develop efficient and environmental friendly crop protection strategies, molecular studies of the fungal biological cycle, virulence factors, and interaction with its host are necessary. For that reason, several approaches have been performed using both classical genetic, cell biology, and biochemistry and the modern, holistic, and high-throughput, omic techniques. This work briefly overviews the tools available for studying Plant Pathogenic Fungi and is amply focused on MS-based Proteomics analysis, based on original papers published up to December 2009. At a methodological level, different steps in a proteomic workflow experiment are discussed. Separate sections are devoted to fungal descriptive (intracellular, subcellular, extracellular) and differential expression proteomics and interactomics. From the work published we can conclude that Proteomics, in combination with other techniques, constitutes a powerful tool for providing important information about pathogenicity and virulence factors, thus opening up new possibilities for crop disease diagnosis and crop protection. PMID:20589070
Proteomics of plant pathogenic fungi.
González-Fernández, Raquel; Prats, Elena; Jorrín-Novo, Jesús V
2010-01-01
Plant pathogenic fungi cause important yield losses in crops. In order to develop efficient and environmental friendly crop protection strategies, molecular studies of the fungal biological cycle, virulence factors, and interaction with its host are necessary. For that reason, several approaches have been performed using both classical genetic, cell biology, and biochemistry and the modern, holistic, and high-throughput, omic techniques. This work briefly overviews the tools available for studying Plant Pathogenic Fungi and is amply focused on MS-based Proteomics analysis, based on original papers published up to December 2009. At a methodological level, different steps in a proteomic workflow experiment are discussed. Separate sections are devoted to fungal descriptive (intracellular, subcellular, extracellular) and differential expression proteomics and interactomics. From the work published we can conclude that Proteomics, in combination with other techniques, constitutes a powerful tool for providing important information about pathogenicity and virulence factors, thus opening up new possibilities for crop disease diagnosis and crop protection.
Loroch, Stefan; Schommartz, Tim; Brune, Wolfram; Zahedi, René Peiman; Sickmann, Albert
2015-05-01
Quantitative proteomics and phosphoproteomics have become key disciplines in understanding cellular processes. Fundamental research can be done using cell culture providing researchers with virtually infinite sample amounts. In contrast, clinical, pre-clinical and biomedical research is often restricted to minute sample amounts and requires an efficient analysis with only micrograms of protein. To address this issue, we generated a highly sensitive workflow for combined LC-MS-based quantitative proteomics and phosphoproteomics by refining an ERLIC-based 2D phosphoproteomics workflow into an ERLIC-based 3D workflow covering the global proteome as well. The resulting 3D strategy was successfully used for an in-depth quantitative analysis of both, the proteome and the phosphoproteome of murine cytomegalovirus-infected mouse fibroblasts, a model system for host cell manipulation by a virus. In a 2-plex SILAC experiment with 150 μg of a tryptic digest per condition, the 3D strategy enabled the quantification of ~75% more proteins and even ~134% more peptides compared to the 2D strategy. Additionally, we could quantify ~50% more phosphoproteins by non-phosphorylated peptides, concurrently yielding insights into changes on the levels of protein expression and phosphorylation. Beside its sensitivity, our novel three-dimensional ERLIC-strategy has the potential for semi-automated sample processing rendering it a suitable future perspective for clinical, pre-clinical and biomedical research. Copyright © 2015. Published by Elsevier B.V.
Bordbar, Aarash; Jamshidi, Neema; Palsson, Bernhard O
2011-07-12
The development of high-throughput technologies capable of whole cell measurements of genes, proteins, and metabolites has led to the emergence of systems biology. Integrated analysis of the resulting omic data sets has proved to be hard to achieve. Metabolic network reconstructions enable complex relationships amongst molecular components to be represented formally in a biologically relevant manner while respecting physical constraints. In silico models derived from such reconstructions can then be queried or interrogated through mathematical simulations. Proteomic profiling studies of the mature human erythrocyte have shown more proteins present related to metabolic function than previously thought; however the significance and the causal consequences of these findings have not been explored. Erythrocyte proteomic data was used to reconstruct the most expansive description of erythrocyte metabolism to date, following extensive manual curation, assessment of the literature, and functional testing. The reconstruction contains 281 enzymes representing functions from glycolysis to cofactor and amino acid metabolism. Such a comprehensive view of erythrocyte metabolism implicates the erythrocyte as a potential biomarker for different diseases as well as a 'cell-based' drug-screening tool. The analysis shows that 94 erythrocyte enzymes are implicated in morbid single nucleotide polymorphisms, representing 142 pathologies. In addition, over 230 FDA-approved and experimental pharmaceuticals have enzymatic targets in the erythrocyte. The advancement of proteomic technologies and increased generation of high-throughput proteomic data have created the need for a means to analyze these data in a coherent manner. Network reconstructions provide a systematic means to integrate and analyze proteomic data in a biologically meaning manner. Analysis of the red cell proteome has revealed an unexpected level of complexity in the functional capabilities of human erythrocyte metabolism.
Han, Jinzhi; Gao, Peng; Zhao, Shengming; Bie, Xiaomei; Lu, Zhaoxin; Zhang, Chong; Lv, Fengxia
2017-01-06
LI-F type peptides (AMP-jsa9) produced by Paenibacillus polymyxa JSa-9 are a group of cyclic lipodepsipeptide antibiotics that exhibit a broad antimicrobial spectrum against Gram-positive bacteria and filamentous fungi, especially Bacillus cereus and Fusarium moniliforme. In this study, to better understand the antibacterial mechanism of AMP-jsa9 against B. cereus, the ultrastructure of AMP-jsa9-treated B. cereus cells was observed by both atomic force microscopy and transmission electron microscopy, and quantitative proteomic analysis was performed on proteins extracted from treated and untreated bacterial cells by using isobaric tag for relative and absolute quantitation (iTRAQ) labeling and LC-MS/MS analysis to access differentially expressed proteins. Furthermore, multiple experiments were conducted to validate the results of the proteomic analysis, including determinations of ATP, NAD (+) H, NADP (+) H, reactive oxygen species (ROS), the activities of catalase (CAT) and superoxide dismutase (SOD), and the relative expression of target genes by quantitative real-time PCR. Bacterial cells exposed to AMP-jsa9 showed irregular surfaces with bleb projections and concaves; we hypothesize that AMP-jsa9 penetrated the cell wall and was anchored on the cytoplasmic membrane and that ROS accumulated in the cell membrane after treatment with AMP-jsa9, modulating the bacterial membrane properties and increasing membrane permeability. Consequently, the blebs were formed on the cell wall by the impulsive force of the leakage of intercellular contents. iTRAQ-based proteomic analysis detected a total of 1317 proteins, including 176 differentially expressed proteins (75 upregulated (fold >2) and 101 downregulated (fold <0.5)). Based on proteome analysis, the putative pathways of AMP-jsa9 action against B. cereus can be summarized as: (i) inhibition of bacterial sporulation, thiamine biosynthesis, energy metabolism, DNA transcription and translation, and cell wall biosynthesis, through direct regulation of protein levels; and (ii) indirect effects on the same pathways through the accumulation of ROS and the consequent impairment of cellular functions, resulting from downregulation of antioxidant proteins, especially CAT and SOD. The mode of action of LI-F type antimicrobial peptides (AMP-jsa9) against B. cereus was elucidated at the proteomic level. Two pathways of AMP-jsa9 action upon B. cereus cells were identified and the mechanism of bleb formation on the surfaces of bacterial cells was predicted based on the results of ultrastructural observation and proteomic analysis. These results are helpful in understanding the mechanism of LI-F type peptides and in providing the theoretical base for applying AMP-jsa9 or its analogs to combat Gram-positive pathogenic bacteria in the food and feed industries. Copyright © 2016 Elsevier B.V. All rights reserved.
Nakamura, Tatsuji; Kuromitsu, Junro; Oda, Yoshiya
2008-03-01
Two-dimensional liquid-chromatographic (LC) separation followed by mass spectrometric (MS) analysis was examined for the identification of peptides in complex mixtures as an alternative to widely used two-dimensional gel electrophoresis followed by MS analysis for use in proteomics. The present method involves the off-line coupling of a narrow-bore, polymer-based, reversed-phase column using an acetonitrile gradient in an alkaline mobile phase in the first dimension with octadecylsilanized silica (ODS)-based nano-LC/MS in the second dimension. After the first separation, successive fractions were acidified and dried off-line, then loaded on the second dimension column. Both columns separate peptides according to hydrophobicity under different pH conditions, but more peptides were identified than with the conventional technique for shotgun proteomics, that is, the combination of a strong cation exchange column with an ODS column, and the system was robust because no salts were included in the mobile phases. The suitability of the method for proteomics measurements was evaluated.
Miernyk, Ján A; Hajduch, Martin
2011-04-01
Seeds comprise a protective covering, a small embryonic plant, and a nutrient-storage organ. Seeds are protein-rich, and have been the subject of many mass spectrometry-based analyses. Seed storage proteins (SSP), which are transient depots for reduced nitrogen, have been studied for decades by cell biologists, and many of the complicated aspects of their processing, assembly, and compartmentation are now well understood. Unfortunately, the abundance and complexity of the SSP requires that they be avoided or removed prior to gel-based analysis of non-SSP. While much of the extant data from MS-based proteomic analysis of seeds is descriptive, it has nevertheless provided a preliminary metabolic picture explaining much of their biology. Contemporary studies are moving more toward analysis of protein interactions and posttranslational modifications, and functions of metabolic networks. Many aspects of the biology of seeds make then an attractive platform for heterologous protein expression. Herein we present a broad review of the results from the proteomic studies of seeds, and speculate on a potential future research directions. Copyright © 2010 Elsevier B.V. All rights reserved.
Morris, Jeffrey S
2012-01-01
In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational aspects of comparative proteomic studies, and summarizes contributions I along with numerous collaborators have made. First, there is an overview of comparative proteomics technologies, followed by a discussion of important experimental design and preprocessing issues that must be considered before statistical analysis can be done. Next, the two key approaches to analyzing proteomics data, feature extraction and functional modeling, are described. Feature extraction involves detection and quantification of discrete features like peaks or spots that theoretically correspond to different proteins in the sample. After an overview of the feature extraction approach, specific methods for mass spectrometry ( Cromwell ) and 2D gel electrophoresis ( Pinnacle ) are described. The functional modeling approach involves modeling the proteomic data in their entirety as functions or images. A general discussion of the approach is followed by the presentation of a specific method that can be applied, wavelet-based functional mixed models, and its extensions. All methods are illustrated by application to two example proteomic data sets, one from mass spectrometry and one from 2D gel electrophoresis. While the specific methods presented are applied to two specific proteomic technologies, MALDI-TOF and 2D gel electrophoresis, these methods and the other principles discussed in the paper apply much more broadly to other expression proteomics technologies.
QC-ART: A tool for real-time quality control assessment of mass spectrometry-based proteomics data.
Stanfill, Bryan A; Nakayasu, Ernesto S; Bramer, Lisa M; Thompson, Allison M; Ansong, Charles K; Clauss, Therese; Gritsenko, Marina A; Monroe, Matthew E; Moore, Ronald J; Orton, Daniel J; Piehowski, Paul D; Schepmoes, Athena A; Smith, Richard D; Webb-Robertson, Bobbie-Jo; Metz, Thomas O
2018-04-17
Liquid chromatography-mass spectrometry (LC-MS)-based proteomics studies of large sample cohorts can easily require from months to years to complete. Acquiring consistent, high-quality data in such large-scale studies is challenging because of normal variations in instrumentation performance over time, as well as artifacts introduced by the samples themselves, such as those due to collection, storage and processing. Existing quality control methods for proteomics data primarily focus on post-hoc analysis to remove low-quality data that would degrade downstream statistics; they are not designed to evaluate the data in near real-time, which would allow for interventions as soon as deviations in data quality are detected. In addition to flagging analyses that demonstrate outlier behavior, evaluating how the data structure changes over time can aide in understanding typical instrument performance or identify issues such as a degradation in data quality due to the need for instrument cleaning and/or re-calibration. To address this gap for proteomics, we developed Quality Control Analysis in Real-Time (QC-ART), a tool for evaluating data as they are acquired in order to dynamically flag potential issues with instrument performance or sample quality. QC-ART has similar accuracy as standard post-hoc analysis methods with the additional benefit of real-time analysis. We demonstrate the utility and performance of QC-ART in identifying deviations in data quality due to both instrument and sample issues in near real-time for LC-MS-based plasma proteomics analyses of a sample subset of The Environmental Determinants of Diabetes in the Young cohort. We also present a case where QC-ART facilitated the identification of oxidative modifications, which are often underappreciated in proteomic experiments. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Global iTRAQ-based proteomic profiling of Toxoplasma gondii oocysts during sporulation.
Zhou, Chun-Xue; Zhu, Xing-Quan; Elsheikha, Hany M; He, Shuai; Li, Qian; Zhou, Dong-Hui; Suo, Xun
2016-10-04
Toxoplasma gondii is a medically and economically important protozoan parasite. However, the molecular mechanisms of its sporulation remain largely unknown. Here, we applied iTRAQ coupled with 2D LC-MS/MS proteomic analysis to investigate the proteomic expression profile of T. gondii oocysts during sporulation. Of the 2095 non-redundant proteins identified, 587 were identified as differentially expressed proteins (DEPs). Based on Gene Ontology enrichment and KEGG pathway analyses the majority of these DEPs were found related to the metabolism of amino acids, carbon and energy. Protein interaction network analysis generated by STRING identified ATP-citrate lyase (ACL), GMP synthase, IMP dehydrogenase (IMPDH), poly (ADP-ribose) glycohydrolase (PARG), and bifunctional dihydrofolate reductase-thymidylate synthase (DHFR-TS) as the top five hubs. We also identified 25 parasite virulence factors that were expressed at relatively high levels in sporulated oocysts compared to non-sporulated oocysts, which might contribute to the infectivity of mature oocysts. Considering the importance of oocysts in the dissemination of toxoplasmosis these findings may help in the search of protein targets with a key role in infectiousness and ecological success of oocysts, creating new opportunities for the development of better means for disease prevention. The development of new preventative interventions against T. gondii infection relies on an improved understanding of the proteome and chemical pathways of this parasite. To identify proteins required for the development of environmentally resistant and infective T. gondii oocysts, we compared the proteome of non-sporulated (immature) oocysts with the proteome of sporulated (mature, infective) oocysts. iTRAQ 2D-LC-MS/MS analysis revealed proteomic changes that distinguish non-sporulated from sporulated oocysts. Many of the differentially expressed proteins were involved in metabolic pathways and 25 virulence factors were identified upregulated in the sporulated oocysts. This work provides the first quantitative characterization of the proteomic variations that occur in T. gondii oocyst stage during sporulation. Copyright © 2016. Published by Elsevier B.V.
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V.; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J.; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wiśniewski, Jacek R.; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes; including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools. PMID:17090601
Brusniak, Mi-Youn; Bodenmiller, Bernd; Campbell, David; Cooke, Kelly; Eddes, James; Garbutt, Andrew; Lau, Hollis; Letarte, Simon; Mueller, Lukas N; Sharma, Vagisha; Vitek, Olga; Zhang, Ning; Aebersold, Ruedi; Watts, Julian D
2008-01-01
Background Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. Conclusion The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field. PMID:19087345
Using proteomics to study sexual reproduction in angiosperms
USDA-ARS?s Scientific Manuscript database
While a relative latecomer to the post-genomics era of functional biology, the application of mass spectrometry-based proteomic analysis has increased exponentially over the past 10 years. Some of this increase is the result of transition of chemists physicists, and mathematicians to the study of ...
Assessment of Sample Preparation Bias in Mass Spectrometry-Based Proteomics.
Klont, Frank; Bras, Linda; Wolters, Justina C; Ongay, Sara; Bischoff, Rainer; Halmos, Gyorgy B; Horvatovich, Péter
2018-04-17
For mass spectrometry-based proteomics, the selected sample preparation strategy is a key determinant for information that will be obtained. However, the corresponding selection is often not based on a fit-for-purpose evaluation. Here we report a comparison of in-gel (IGD), in-solution (ISD), on-filter (OFD), and on-pellet digestion (OPD) workflows on the basis of targeted (QconCAT-multiple reaction monitoring (MRM) method for mitochondrial proteins) and discovery proteomics (data-dependent acquisition, DDA) analyses using three different human head and neck tissues (i.e., nasal polyps, parotid gland, and palatine tonsils). Our study reveals differences between the sample preparation methods, for example, with respect to protein and peptide losses, quantification variability, protocol-induced methionine oxidation, and asparagine/glutamine deamidation as well as identification of cysteine-containing peptides. However, none of the methods performed best for all types of tissues, which argues against the existence of a universal sample preparation method for proteome analysis.
The Assay Development Working Group (ADWG) of the CPTAC Program is currently drafting a document to propose best practices for generation, quantification, storage, and handling of peptide standards used for mass spectrometry-based assays, as well as interpretation of quantitative proteomic data based on peptide standards. The ADWG is seeking input from commercial entities that provide peptide standards for mass spectrometry-based assays or that perform amino acid analysis.
Shui, Wenqing; Xiong, Yun; Xiao, Weidi; Qi, Xianni; Zhang, Yong; Lin, Yuping; Guo, Yufeng; Zhang, Zhidan; Wang, Qinhong; Ma, Yanhe
2015-01-01
Saccharomyces cerevisiae has been intensively studied in responses to different environmental stresses such as heat shock through global omic analysis. However, the S. cerevisiae industrial strains with superior thermotolerance have not been explored in any proteomic studies for elucidating the tolerance mechanism. Recently a new diploid strain was obtained through evolutionary engineering of a parental industrial strain, and it exhibited even higher resistance to prolonged thermal stress. Herein, we performed iTRAQ-based quantitative proteomic analysis on both the parental and evolved industrial strains to further understand the mechanism of thermotolerant adaptation. Out of ∼2600 quantifiable proteins from biological quadruplicates, 193 and 204 proteins were differentially regulated in the parental and evolved strains respectively during heat-stressed growth. The proteomic response of the industrial strains cultivated under prolonged thermal stress turned out to be substantially different from that of the laboratory strain exposed to sudden heat shock. Further analysis of transcription factors underlying the proteomic perturbation also indicated the distinct regulatory mechanism of thermotolerance. Finally, a cochaperone Mdj1 and a metabolic enzyme Adh1 were selected to investigate their roles in mediating heat-stressed growth and ethanol production of yeasts. Our proteomic characterization of the industrial strain led to comprehensive understanding of the molecular basis of thermotolerance, which would facilitate future improvement in the industrially important trait of S. cerevisiae by rational engineering. PMID:25926660
Lavallée-Adam, Mathieu; Yates, John R
2016-03-24
PSEA-Quant analyzes quantitative mass spectrometry-based proteomics datasets to identify enrichments of annotations contained in repositories such as the Gene Ontology and Molecular Signature databases. It allows users to identify the annotations that are significantly enriched for reproducibly quantified high abundance proteins. PSEA-Quant is available on the Web and as a command-line tool. It is compatible with all label-free and isotopic labeling-based quantitative proteomics methods. This protocol describes how to use PSEA-Quant and interpret its output. The importance of each parameter as well as troubleshooting approaches are also discussed. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Mol, Praseeda; Kannegundla, Uday; Dey, Gourav; Gopalakrishnan, Lathika; Dammalli, Manjunath; Kumar, Manish; Patil, Arun H; Basavaraju, Marappa; Rao, Akhila; Ramesha, Kerekoppa P; Prasad, Thottethodi Subrahmanya Keshava
2018-03-01
Bovine milk is important for both veterinary medicine and human nutrition. Understanding the bovine milk proteome at different stages of lactation has therefore broad significance for integrative biology and clinical medicine as well. Indeed, different lactation stages have marked influence on the milk yield, milk constituents, and nourishment of the neonates. We performed a comparative proteome analysis of the bovine milk obtained at different stages of lactation from the Indian indigenous cattle Malnad Gidda (Bos indicus), a widely available breed. The milk differential proteome during the lactation stages in B. indicus has not been investigated to date. Using high-resolution mass spectrometry-based quantitative proteomics of the bovine whey proteins at early, mid, and late lactation stages, we identified a total of 564 proteins, out of which 403 proteins were found to be differentially abundant at different lactation stages. As is expected of any body fluid proteome, 51% of the proteins identified in the milk were found to have signal peptides. Gene ontology analyses were carried out to categorize proteins altered across different lactation stages based on biological process and molecular function, which enabled us to correlate their significance in each lactation stage. We also investigated the potential pathways enriched in different lactation stages using bioinformatics pathway analysis tools. To the best of our knowledge, this study represents the first and largest inventory of milk proteins identified to date for an Indian cattle breed. We believe that the current study broadly informs both veterinary omics research and the emerging field of nutriproteomics during lactation stages.
Proteomic Expression Patterns in Fathead Minnows Exposed to Trenbolone and Flutamide
Insights into androgen signaling in the liver of fathead minnow (Pimephales promelas) was obtained using non-gel based proteomics analysis. We exposed female fathead minnows for 48 hr through the water to a prototypical androgen (17b-trenbolone, 5 ?g/L), a prototypical anti-andr...
Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.
Dowsey, Andrew W; Dunn, Michael J; Yang, Guang-Zhong
2008-04-01
The quest for high-throughput proteomics has revealed a number of challenges in recent years. Whilst substantial improvements in automated protein separation with liquid chromatography and mass spectrometry (LC/MS), aka 'shotgun' proteomics, have been achieved, large-scale open initiatives such as the Human Proteome Organization (HUPO) Brain Proteome Project have shown that maximal proteome coverage is only possible when LC/MS is complemented by 2D gel electrophoresis (2-DE) studies. Moreover, both separation methods require automated alignment and differential analysis to relieve the bioinformatics bottleneck and so make high-throughput protein biomarker discovery a reality. The purpose of this article is to describe a fully automatic image alignment framework for the integration of 2-DE into a high-throughput differential expression proteomics pipeline. The proposed method is based on robust automated image normalization (RAIN) to circumvent the drawbacks of traditional approaches. These use symbolic representation at the very early stages of the analysis, which introduces persistent errors due to inaccuracies in modelling and alignment. In RAIN, a third-order volume-invariant B-spline model is incorporated into a multi-resolution schema to correct for geometric and expression inhomogeneity at multiple scales. The normalized images can then be compared directly in the image domain for quantitative differential analysis. Through evaluation against an existing state-of-the-art method on real and synthetically warped 2D gels, the proposed analysis framework demonstrates substantial improvements in matching accuracy and differential sensitivity. High-throughput analysis is established through an accelerated GPGPU (general purpose computation on graphics cards) implementation. Supplementary material, software and images used in the validation are available at http://www.proteomegrid.org/rain/.
2012-01-01
Background Accurate diagnostic and monitoring tools for ulcerative colitis (UC) are missing. Our aim was to describe the proteomic profile of UC and search for markers associated with disease exacerbation. Therefore, we aimed to characterize specific proteins associated with inflamed colon mucosa from patients with acute UC using mass spectrometry-based proteomic analysis. Methods Biopsies were sampled from rectum, sigmoid colon and left colonic flexure from twenty patients with active proctosigmoiditis and from four healthy controls for proteomics and histology. Proteomic profiles of whole colonic biopsies were characterized using 2D-gel electrophoresis, and peptide mass fingerprinting using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) was applied for identification of differently expressed protein spots. Results A total of 597 spots were annotated by image analysis and 222 of these had a statistically different protein level between inflamed and non-inflamed tissue in the patient group. Principal component analysis clearly grouped non-inflamed samples separately from the inflamed samples indicating that the proteomic signature of colon mucosa with acute UC is strong. Totally, 43 individual protein spots were identified, including proteins involved in energy metabolism (triosephosphate isomerase, glycerol-3-phosphate-dehydrogenase, alpha enolase and L-lactate dehydrogenase B-chain) and in oxidative stress (superoxide dismutase, thioredoxins and selenium binding protein). Conclusions A distinct proteomic profile of inflamed tissue in UC patients was found. Specific proteins involved in energy metabolism and oxidative stress were identified as potential candidate markers for UC. PMID:22726388
P2P proteomics -- data sharing for enhanced protein identification
2012-01-01
Background In order to tackle the important and challenging problem in proteomics of identifying known and new protein sequences using high-throughput methods, we propose a data-sharing platform that uses fully distributed P2P technologies to share specifications of peer-interaction protocols and service components. By using such a platform, information to be searched is no longer centralised in a few repositories but gathered from experiments in peer proteomics laboratories, which can subsequently be searched by fellow researchers. Methods The system distributively runs a data-sharing protocol specified in the Lightweight Communication Calculus underlying the system through which researchers interact via message passing. For this, researchers interact with the system through particular components that link to database querying systems based on BLAST and/or OMSSA and GUI-based visualisation environments. We have tested the proposed platform with data drawn from preexisting MS/MS data reservoirs from the 2006 ABRF (Association of Biomolecular Resource Facilities) test sample, which was extensively tested during the ABRF Proteomics Standards Research Group 2006 worldwide survey. In particular we have taken the data available from a subset of proteomics laboratories of Spain's National Institute for Proteomics, ProteoRed, a network for the coordination, integration and development of the Spanish proteomics facilities. Results and Discussion We performed queries against nine databases including seven ProteoRed proteomics laboratories, the NCBI Swiss-Prot database and the local database of the CSIC/UAB Proteomics Laboratory. A detailed analysis of the results indicated the presence of a protein that was supported by other NCBI matches and highly scored matches in several proteomics labs. The analysis clearly indicated that the protein was a relatively high concentrated contaminant that could be present in the ABRF sample. This fact is evident from the information that could be derived from the proposed P2P proteomics system, however it is not straightforward to arrive to the same conclusion by conventional means as it is difficult to discard organic contamination of samples. The actual presence of this contaminant was only stated after the ABRF study of all the identifications reported by the laboratories. PMID:22293032
Lim, Sanghyun; Borza, Tudor; Peters, Rick D; Coffin, Robert H; Al-Mughrabi, Khalil I; Pinto, Devanand M; Wang-Pruski, Gefu
2013-11-20
Phosphite (salts of phosphorous acid; Phi)-based fungicides are increasingly used in controlling oomycete pathogens, such as the late blight agent Phytophthora infestans. In plants, low amounts of Phi induce pathogen resistance through an indirect mode of action. We used iTRAQ-based quantitative proteomics to investigate the effects of phosphite on potato plants before and after infection with P. infestans. Ninety-three (62 up-regulated and 31 down-regulated) differentially regulated proteins, from a total of 1172 reproducibly identified proteins, were identified in the leaf proteome of Phi-treated potato plants. Four days post-inoculation with P. infestans, 16 of the 31 down-regulated proteins remained down-regulated and 42 of the 62 up-regulated proteins remained up-regulated, including 90% of the defense proteins. This group includes pathogenesis-related, stress-responsive, and detoxification-related proteins. Callose deposition and ultrastructural analyses of leaf tissues after infection were used to complement the proteomics approach. This study represents the first comprehensive proteomics analysis of the indirect mode of action of Phi, demonstrating broad effects on plant defense and plant metabolism. The proteomics data and the microscopy study suggest that Phi triggers a hypersensitive response that is responsible for induced resistance of potato leaves against P. infestans. Phosphie triggers complex functional changes in potato leaves that are responsible for the induced resistance against Phytophthora infestans. This article is part of a Special Issue entitled: Translational Plant Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.
Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline*
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W.; Moritz, Robert L.
2015-01-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. PMID:25418363
Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W; Moritz, Robert L
2015-02-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Elamin, Ashraf; Titz, Bjoern; Dijon, Sophie; Merg, Celine; Geertz, Marcel; Schneider, Thomas; Martin, Florian; Schlage, Walter K; Frentzel, Stefan; Talamo, Fabio; Phillips, Blaine; Veljkovic, Emilija; Ivanov, Nikolai V; Vanscheeuwijck, Patrick; Peitsch, Manuel C; Hoeng, Julia
2016-08-11
Smoking is associated with several serious diseases, such as lung cancer and chronic obstructive pulmonary disease (COPD). Within our systems toxicology framework, we are assessing whether potential modified risk tobacco products (MRTP) can reduce smoking-related health risks compared to conventional cigarettes. In this article, we evaluated to what extent 2D-PAGE/MALDI MS/MS (2D-PAGE) can complement the iTRAQ LC-MS/MS results from a previously reported mouse inhalation study, in which we assessed a prototypic MRTP (pMRTP). Selected differentially expressed proteins identified by both LC-MS/MS and 2D-PAGE approaches were further verified using reverse-phase protein microarrays. LC-MS/MS captured the effects of cigarette smoke (CS) on the lung proteome more comprehensively than 2D-PAGE. However, an integrated analysis of both proteomics data sets showed that 2D-PAGE data complement the LC-MS/MS results by supporting the overall trend of lower effects of pMRTP aerosol than CS on the lung proteome. Biological effects of CS exposure supported by both methods included increases in immune-related, surfactant metabolism, proteasome, and actin cytoskeleton protein clusters. Overall, while 2D-PAGE has its value, especially as a complementary method for the analysis of effects on intact proteins, LC-MS/MS approaches will likely be the method of choice for proteome analysis in systems toxicology investigations. Quantitative proteomics is anticipated to play a growing role within systems toxicology assessment frameworks in the future. To further understand how different proteomics technologies can contribute to toxicity assessment, we conducted a quantitative proteomics analysis using 2D-PAGE and isobaric tag-based LC-MS/MS approaches and compared the results produced from the 2 approaches. Using a prototypic modified risk tobacco product (pMRTP) as our test item, we show compared with cigarette smoke, how 2D-PAGE results can complement and support LC-MS/MS data, demonstrating the much lower effects of pMRTP aerosol than cigarette smoke on the mouse lung proteome. The combined analysis of 2D-PAGE and LC-MS/MS data identified an effect of cigarette smoke on the proteasome and actin cytoskeleton in the lung. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Cell death proteomics database: consolidating proteomics data on cell death.
Arntzen, Magnus Ø; Bull, Vibeke H; Thiede, Bernd
2013-05-03
Programmed cell death is a ubiquitous process of utmost importance for the development and maintenance of multicellular organisms. More than 10 different types of programmed cell death forms have been discovered. Several proteomics analyses have been performed to gain insight in proteins involved in the different forms of programmed cell death. To consolidate these studies, we have developed the cell death proteomics (CDP) database, which comprehends data from apoptosis, autophagy, cytotoxic granule-mediated cell death, excitotoxicity, mitotic catastrophe, paraptosis, pyroptosis, and Wallerian degeneration. The CDP database is available as a web-based database to compare protein identifications and quantitative information across different experimental setups. The proteomics data of 73 publications were integrated and unified with protein annotations from UniProt-KB and gene ontology (GO). Currently, more than 6,500 records of more than 3,700 proteins are included in the CDP. Comparing apoptosis and autophagy using overrepresentation analysis of GO terms, the majority of enriched processes were found in both, but also some clear differences were perceived. Furthermore, the analysis revealed differences and similarities of the proteome between autophagosomal and overall autophagy. The CDP database represents a useful tool to consolidate data from proteome analyses of programmed cell death and is available at http://celldeathproteomics.uio.no.
Sethi, Manveen K; Thaysen-Andersen, Morten; Kim, Hoguen; Park, Cheol Keun; Baker, Mark S; Packer, Nicolle H; Paik, Young-Ki; Hancock, William S; Fanayan, Susan
2015-08-03
Modern proteomics has proven instrumental in our understanding of the molecular deregulations associated with the development and progression of cancer. Herein, we profile membrane-enriched proteome of tumor and adjacent normal tissues from eight CRC patients using label-free nanoLC-MS/MS-based quantitative proteomics and advanced pathway analysis. Of the 948 identified proteins, 184 proteins were differentially expressed (P<0.05, fold change>1.5) between the tumor and non-tumor tissue (69 up-regulated and 115 down-regulated in tumor tissues). The CRC tumor and non-tumor tissues clustered tightly in separate groups using hierarchical cluster analysis of the differentially expressed proteins, indicating a strong CRC-association of this proteome subset. Specifically, cancer associated proteins such as FN1, TNC, DEFA1, ITGB2, MLEC, CDH17, EZR and pathways including actin cytoskeleton and RhoGDI signaling were deregulated. Stage-specific proteome signatures were identified including up-regulated ribosomal proteins and down-regulated annexin proteins in early stage CRC. Finally, EGFR(+) CRC tissues showed an EGFR-dependent down-regulation of cell adhesion molecules, relative to EGFR(-) tissues. Taken together, this study provides a detailed map of the altered proteome and associated protein pathways in CRC, which enhances our mechanistic understanding of CRC biology and opens avenues for a knowledge-driven search for candidate CRC protein markers. Copyright © 2015 Elsevier B.V. All rights reserved.
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes.
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wisniewski, Jacek R; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes; including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at http://www.mapuproteome.com using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools.
Martínez-Bartolomé, Salvador; Medina-Aunon, J Alberto; López-García, Miguel Ángel; González-Tejedo, Carmen; Prieto, Gorka; Navajas, Rosana; Salazar-Donate, Emilio; Fernández-Costa, Carolina; Yates, John R; Albar, Juan Pablo
2018-04-06
Mass-spectrometry-based proteomics has evolved into a high-throughput technology in which numerous large-scale data sets are generated from diverse analytical platforms. Furthermore, several scientific journals and funding agencies have emphasized the storage of proteomics data in public repositories to facilitate its evaluation, inspection, and reanalysis. (1) As a consequence, public proteomics data repositories are growing rapidly. However, tools are needed to integrate multiple proteomics data sets to compare different experimental features or to perform quality control analysis. Here, we present a new Java stand-alone tool, Proteomics Assay COMparator (PACOM), that is able to import, combine, and simultaneously compare numerous proteomics experiments to check the integrity of the proteomic data as well as verify data quality. With PACOM, the user can detect source of errors that may have been introduced in any step of a proteomics workflow and that influence the final results. Data sets can be easily compared and integrated, and data quality and reproducibility can be visually assessed through a rich set of graphical representations of proteomics data features as well as a wide variety of data filters. Its flexibility and easy-to-use interface make PACOM a unique tool for daily use in a proteomics laboratory. PACOM is available at https://github.com/smdb21/pacom .
Zhou, Li; Wang, Kui; Li, Qifu; Nice, Edouard C; Zhang, Haiyuan; Huang, Canhua
2016-01-01
Cancer is a common disease that is a leading cause of death worldwide. Currently, early detection and novel therapeutic strategies are urgently needed for more effective management of cancer. Importantly, protein profiling using clinical proteomic strategies, with spectacular sensitivity and precision, offer excellent promise for the identification of potential biomarkers that would direct the development of targeted therapeutic anticancer drugs for precision medicine. In particular, clinical sample sources, including tumor tissues and body fluids (blood, feces, urine and saliva), have been widely investigated using modern high-throughput mass spectrometry-based proteomic approaches combined with bioinformatic analysis, to pursue the possibilities of precision medicine for targeted cancer therapy. Discussed in this review are the current advantages and limitations of clinical proteomics, the available strategies of clinical proteomics for the management of precision medicine, as well as the challenges and future perspectives of clinical proteomics-driven precision medicine for targeted cancer therapy.
Proteome studies of filamentous fungi.
Baker, Scott E; Panisko, Ellen A
2011-01-01
The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide variety of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, nongel-based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of variations on the general methods and technologies for identifying peptides in a given sample. We present a method that can serve as a "baseline" for proteomic studies of fungi.
Zeng, Yunliu; Pan, Zhiyong; Ding, Yuduan; Zhu, Andan; Cao, Hongbo; Xu, Qiang; Deng, Xiuxin
2011-11-01
Here, a comprehensive proteomic analysis of the chromoplasts purified from sweet orange using Nycodenz density gradient centrifugation is reported. A GeLC-MS/MS shotgun approach was used to identify the proteins of pooled chromoplast samples. A total of 493 proteins were identified from purified chromoplasts, of which 418 are putative plastid proteins based on in silico sequence homology and functional analyses. Based on the predicted functions of these identified plastid proteins, a large proportion (∼60%) of the chromoplast proteome of sweet orange is constituted by proteins involved in carbohydrate metabolism, amino acid/protein synthesis, and secondary metabolism. Of note, HDS (hydroxymethylbutenyl 4-diphosphate synthase), PAP (plastid-lipid-associated protein), and psHSPs (plastid small heat shock proteins) involved in the synthesis or storage of carotenoid and stress response are among the most abundant proteins identified. A comparison of chromoplast proteomes between sweet orange and tomato suggested a high level of conservation in a broad range of metabolic pathways. However, the citrus chromoplast was characterized by more extensive carotenoid synthesis, extensive amino acid synthesis without nitrogen assimilation, and evidence for lipid metabolism concerning jasmonic acid synthesis. In conclusion, this study provides an insight into the major metabolic pathways as well as some unique characteristics of the sweet orange chromoplasts at the whole proteome level.
Using HPLC-Mass Spectrometry to Teach Proteomics Concepts with Problem-Based Techniques
ERIC Educational Resources Information Center
Short, Michael; Short, Anne; Vankempen, Rachel; Seymour, Michael; Burnatowska-Hledin, Maria
2010-01-01
Practical instruction of proteomics concepts was provided using high-performance liquid chromatography coupled with a mass selective detection system (HPLC-MS) for the analysis of simulated protein digests. The samples were prepared from selected dipeptides in order to facilitate the mass spectral identification. As part of the prelaboratory…
Yin, Xiaojian; Sakata, Katsumi; Nanjo, Yohei; Komatsu, Setsuko
2014-06-25
Flooding has a severe negative effect on soybean cultivation in the early stages of growth. To obtain a better understanding of the response mechanisms of soybean to flooding stress, initial changes in root tip proteins under flooding were analyzed using two proteomic techniques. Two-day-old soybeans were treated with flooding for 3, 6, 12, and 24h. The weight of soybeans increased during the first 3h of flooding, but root elongation was not observed. Using gel-based and gel-free proteomic techniques, 115 proteins were identified in root tips, of which 9 proteins were commonly detected by both methods. The 71 proteins identified by the gel-free proteomics were analyzed by a hierarchical clustering method based on induction levels during the flooding, and the proteins were divided into 5 clusters. Additional interaction analysis of the proteins revealed that ten proteins belonging to cluster I formed the center of a protein interaction network. mRNA expression analysis of these ten proteins showed that citrate lyase and heat shock protein 70 were down-regulated, whereas calreticulin was up-regulated in initial phase of flooding. These results suggest that flooding stress to soybean induces calcium-related signal transduction, which might play important roles in the early responses to flooding. Flooding has a severe negative effect on soybean cultivation, particularly in the early stages of growth. To better understand the response mechanisms of soybean to the early stages of flooding stress, two proteomic techniques were used. Two-day-old soybeans were treated without or with flooding for 3, 6, 12, and 24h. The fresh weight of soybeans increased during the first 3h of flooding stress, but the growth then slowed and no root elongation was observed. Using gel-based and gel-free proteomic techniques, 115 proteins were identified in root tips, of which 9 proteins were commonly detected by both methods. The 71 proteins identified by the gel-free proteomics were analyzed by a hierarchical clustering method based on induction levels during the flooding stress, and 5 protein clusters were recognized. Protein interaction analysis revealed that ten proteins belonging to cluster I formed the center of a protein interaction network. mRNA expression analysis of these ten proteins showed that citrate lyase and heat shock protein 70 were down-regulated in response to flooding stress, whereas calreticulin was up-regulated. These results suggest that flooding stress to soybean induces calcium-related signal transduction, which might play important roles in the early responses to flooding. Copyright © 2014 Elsevier B.V. All rights reserved.
Hegedűs, Tamás; Chaubey, Pururawa Mayank; Várady, György; Szabó, Edit; Sarankó, Hajnalka; Hofstetter, Lia; Roschitzki, Bernd; Sarkadi, Balázs
2015-01-01
Based on recent results, the determination of the easily accessible red blood cell (RBC) membrane proteins may provide new diagnostic possibilities for assessing mutations, polymorphisms or regulatory alterations in diseases. However, the analysis of the current mass spectrometry-based proteomics datasets and other major databases indicates inconsistencies—the results show large scattering and only a limited overlap for the identified RBC membrane proteins. Here, we applied membrane-specific proteomics studies in human RBC, compared these results with the data in the literature, and generated a comprehensive and expandable database using all available data sources. The integrated web database now refers to proteomic, genetic and medical databases as well, and contains an unexpected large number of validated membrane proteins previously thought to be specific for other tissues and/or related to major human diseases. Since the determination of protein expression in RBC provides a method to indicate pathological alterations, our database should facilitate the development of RBC membrane biomarker platforms and provide a unique resource to aid related further research and diagnostics. Database URL: http://rbcc.hegelab.org PMID:26078478
Enhanced Missing Proteins Detection in NCI60 Cell Lines Using an Integrative Search Engine Approach.
Guruceaga, Elizabeth; Garin-Muga, Alba; Prieto, Gorka; Bejarano, Bartolomé; Marcilla, Miguel; Marín-Vicente, Consuelo; Perez-Riverol, Yasset; Casal, J Ignacio; Vizcaíno, Juan Antonio; Corrales, Fernando J; Segura, Victor
2017-12-01
The Human Proteome Project (HPP) aims deciphering the complete map of the human proteome. In the past few years, significant efforts of the HPP teams have been dedicated to the experimental detection of the missing proteins, which lack reliable mass spectrometry evidence of their existence. In this endeavor, an in depth analysis of shotgun experiments might represent a valuable resource to select a biological matrix in design validation experiments. In this work, we used all the proteomic experiments from the NCI60 cell lines and applied an integrative approach based on the results obtained from Comet, Mascot, OMSSA, and X!Tandem. This workflow benefits from the complementarity of these search engines to increase the proteome coverage. Five missing proteins C-HPP guidelines compliant were identified, although further validation is needed. Moreover, 165 missing proteins were detected with only one unique peptide, and their functional analysis supported their participation in cellular pathways as was also proposed in other studies. Finally, we performed a combined analysis of the gene expression levels and the proteomic identifications from the common cell lines between the NCI60 and the CCLE project to suggest alternatives for further validation of missing protein observations.
Enhanced Missing Proteins Detection in NCI60 Cell Lines Using an Integrative Search Engine Approach
2017-01-01
The Human Proteome Project (HPP) aims deciphering the complete map of the human proteome. In the past few years, significant efforts of the HPP teams have been dedicated to the experimental detection of the missing proteins, which lack reliable mass spectrometry evidence of their existence. In this endeavor, an in depth analysis of shotgun experiments might represent a valuable resource to select a biological matrix in design validation experiments. In this work, we used all the proteomic experiments from the NCI60 cell lines and applied an integrative approach based on the results obtained from Comet, Mascot, OMSSA, and X!Tandem. This workflow benefits from the complementarity of these search engines to increase the proteome coverage. Five missing proteins C-HPP guidelines compliant were identified, although further validation is needed. Moreover, 165 missing proteins were detected with only one unique peptide, and their functional analysis supported their participation in cellular pathways as was also proposed in other studies. Finally, we performed a combined analysis of the gene expression levels and the proteomic identifications from the common cell lines between the NCI60 and the CCLE project to suggest alternatives for further validation of missing protein observations. PMID:28960077
Hepatic SILAC proteomic data from PANDER transgenic model.
Athanason, Mark G; Stevens, Stanley M; Burkhardt, Brant R
2016-12-01
This article contains raw and processed data related to research published in "Quantitative Proteomic Profiling Reveals Hepatic Lipogenesis and Liver X Receptor Activation in the PANDER Transgenic Model" (M.G. Athanason, W.A. Ratliff, D. Chaput, C.B. MarElia, M.N. Kuehl, S.M., Jr. Stevens, B.R. Burkhardt (2016)) [1], and was generated by "spike-in" SILAC-based proteomic analysis of livers obtained from the PANcreatic-Derived factor (PANDER) transgenic mouse (PANTG) under various metabolic conditions [1]. The mass spectrometry output of the PANTG and wild-type B6SJLF mice liver tissue and resulting proteome search from MaxQuant 1.2.2.5 employing the Andromeda search algorithm against the UniprotKB reference database for Mus musculus has been deposited to the ProteomeXchange Consortium (http://www.proteomexchange.org) via the PRIDE partner repository with dataset identifiers PRIDE: PXD004171 and doi:10.6019/PXD004171. Protein ratio values representing PANTG/wild-type obtained by MaxQuant analysis were input into the Perseus processing suite to determine statistical significance using the Significance A outlier test (p<0.05). Differentially expressed proteins using this approach were input into Ingenuity Pathway Analysis to determined altered pathways and upstream regulators that were altered in PANTG mice.
Rice proteome database: a step toward functional analysis of the rice genome.
Komatsu, Setsuko
2005-09-01
The technique of proteome analysis using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this study, the proteins of rice were cataloged, a rice proteome database was constructed, and a functional characterization of some of the identified proteins was undertaken. Proteins extracted from various tissues and subcellular compartments in rice were separated by 2D-PAGE and an image analyzer was used to construct a display of the proteins. The Rice Proteome Database contains 23 reference maps based on 2D-PAGE of proteins from various rice tissues and subcellular compartments. These reference maps comprise 13129 identified proteins, and the amino acid sequences of 5092 proteins are entered in the database. Major proteins involved in growth or stress responses were identified using the proteome approach. Some of these proteins, including a beta-tubulin, calreticulin, and ribulose-1,5-bisphosphate carboxylase/oxygenase activase in rice, have unexpected functions. The information obtained from the Rice Proteome Database will aid in cloning the genes for and predicting the function of unknown proteins.
Yu, Yanbao; Leng, Taohua; Yun, Dong; Liu, Na; Yao, Jun; Dai, Ying; Yang, Pengyuan; Chen, Xian
2013-01-01
Emerging evidences indicate that blood platelets function in multiple biological processes including immune response, bone metastasis and liver regeneration in addition to their known roles in hemostasis and thrombosis. Global elucidation of platelet proteome will provide the molecular base of these platelet functions. Here, we set up a high throughput platform for maximum exploration of the rat/human platelet proteome using integrated proteomics technologies, and then applied to identify the largest number of the proteins expressed in both rat and human platelets. After stringent statistical filtration, a total of 837 unique proteins matched with at least two unique peptides were precisely identified, making it the first comprehensive protein database so far for rat platelets. Meanwhile, quantitative analyses of the thrombin-stimulated platelets offered great insights into the biological functions of platelet proteins and therefore confirmed our global profiling data. A comparative proteomic analysis between rat and human platelets was also conducted, which revealed not only a significant similarity, but also an across-species evolutionary link that the orthologous proteins representing ‘core proteome’, and the ‘evolutionary proteome’ is actually a relatively static proteome. PMID:20443191
Quantitative Proteomic Analysis of the Hfq-Regulon in Sinorhizobium meliloti 2011
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing target mRNA genes. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNAs genes and the broadly pleitropic phenotype associated to the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with 15N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq on regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti. PMID:23119037
Quantitative proteomic analysis of the Hfq-regulon in Sinorhizobium meliloti 2011.
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing target mRNA genes. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNAs genes and the broadly pleitropic phenotype associated to the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with (15)N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq on regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti.
Zhang, Aihua; Zhou, Xiaohang; Zhao, Hongwei; Zou, Shiyu; Ma, Chung Wah; Liu, Qi; Sun, Hui; Liu, Liang; Wang, Xijun
2017-01-31
An integrative metabolomics and proteomics approach can provide novel insights in the understanding of biological systems. We have integrated proteome and metabolome data sets for a holistic view of the molecular mechanisms in disease. Using quantitative iTRAQ-LC-MS/MS proteomics coupled with UPLC-Q-TOF-HDMS based metabolomics, we determined the protein and metabolite expression changes in the kidney-yang deficiency syndrome (KYDS) rat model and further investigated the intervention effects of the Jinkui Shenqi Pill (JSP). The VIP-plot of the orthogonal PLS-DA (OPLS-DA) was used for discovering the potential biomarkers to clarify the therapeutic mechanisms of JSP in treating KYDS. The results showed that JSP can alleviate the kidney impairment induced by KYDS. Sixty potential biomarkers, including 5-l-glutamyl-taurine, phenylacetaldehyde, 4,6-dihydroxyquinoline, and xanthurenic acid etc., were definitely up- or down-regulated. The regulatory effect of JSP on the disturbed metabolic pathways was proved by the established metabonomic method. Using pathway analyses, we identified the disturbed metabolic pathways such as taurine and hypotaurine metabolism, pyrimidine metabolism, tyrosine metabolism, tryptophan metabolism, histidine metabolism, steroid hormone biosynthesis, etc. Furthermore, using iTRAQ-based quantitative proteomics analysis, seventeen differential proteins were identified and significantly altered by the JSP treatment. These proteins appear to be involved in Wnt, chemokine, PPAR, and MAPK signaling pathways, etc. Functional pathway analysis revealed that most of the proteins were found to play a key role in the regulation of metabolism pathways. Bioinformatics analysis with the IPA software found that these differentially-expressed moleculars had a strong correlation with the α-adrenergic signaling, FGF signaling, etc. Our data indicate that high-throughput metabolomics and proteomics can provide an insight on the herbal preparations affecting the metabolic disorders using high resolution mass spectrometry.
Proteomic identification of fat-browning markers in cultured white adipocytes treated with curcumin.
Kim, Sang Woo; Choi, Jae Heon; Mukherjee, Rajib; Hwang, Ki-Chul; Yun, Jong Won
2016-04-01
We previously reported that curcumin induces browning of primary white adipocytes via enhanced expression of brown adipocyte-specific genes. In this study, we attempted to identify target proteins responsible for this fat-browning effect by analyzing proteomic changes in cultured white adipocytes in response to curcumin treatment. To elucidate the role of curcumin in fat-browning, we conducted comparative proteomic analysis of primary adipocytes between control and curcumin-treated cells using two-dimensional electrophoresis combined with MALDI-TOF-MS. We also investigated fatty acid metabolic targets, mitochondrial biogenesis, and fat-browning-associated proteins using combined proteomic and network analyses. Proteomic analysis revealed that 58 protein spots from a total of 325 matched spots showed differential expression between control and curcumin-treated adipocytes. Using network analysis, most of the identified proteins were proven to be involved in various metabolic and cellular processes based on the PANTHER classification system. One of the most striking findings is that hormone-sensitive lipase (HSL) was highly correlated with main browning markers based on the STRING database. HSL and two browning markers (UCP1, PGC-1α) were co-immunoprecipitated with these markers, suggesting that HSL possibly plays a role in fat-browning of white adipocytes. Our results suggest that curcumin increased HSL levels and other browning-specific markers, suggesting its possible role in augmentation of lipolysis and suppression of lipogenesis by trans-differentiation from white adipocytes into brown adipocytes (beige).
Griss, Johannes; Jones, Andrew R; Sachsenberg, Timo; Walzer, Mathias; Gatto, Laurent; Hartler, Jürgen; Thallinger, Gerhard G; Salek, Reza M; Steinbeck, Christoph; Neuhauser, Nadin; Cox, Jürgen; Neumann, Steffen; Fan, Jun; Reisinger, Florian; Xu, Qing-Wei; Del Toro, Noemi; Pérez-Riverol, Yasset; Ghali, Fawaz; Bandeira, Nuno; Xenarios, Ioannis; Kohlbacher, Oliver; Vizcaíno, Juan Antonio; Hermjakob, Henning
2014-10-01
The HUPO Proteomics Standards Initiative has developed several standardized data formats to facilitate data sharing in mass spectrometry (MS)-based proteomics. These allow researchers to report their complete results in a unified way. However, at present, there is no format to describe the final qualitative and quantitative results for proteomics and metabolomics experiments in a simple tabular format. Many downstream analysis use cases are only concerned with the final results of an experiment and require an easily accessible format, compatible with tools such as Microsoft Excel or R. We developed the mzTab file format for MS-based proteomics and metabolomics results to meet this need. mzTab is intended as a lightweight supplement to the existing standard XML-based file formats (mzML, mzIdentML, mzQuantML), providing a comprehensive summary, similar in concept to the supplemental material of a scientific publication. mzTab files can contain protein, peptide, and small molecule identifications together with experimental metadata and basic quantitative information. The format is not intended to store the complete experimental evidence but provides mechanisms to report results at different levels of detail. These range from a simple summary of the final results to a representation of the results including the experimental design. This format is ideally suited to make MS-based proteomics and metabolomics results available to a wider biological community outside the field of MS. Several software tools for proteomics and metabolomics have already adapted the format as an output format. The comprehensive mzTab specification document and extensive additional documentation can be found online. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Griss, Johannes; Jones, Andrew R.; Sachsenberg, Timo; Walzer, Mathias; Gatto, Laurent; Hartler, Jürgen; Thallinger, Gerhard G.; Salek, Reza M.; Steinbeck, Christoph; Neuhauser, Nadin; Cox, Jürgen; Neumann, Steffen; Fan, Jun; Reisinger, Florian; Xu, Qing-Wei; del Toro, Noemi; Pérez-Riverol, Yasset; Ghali, Fawaz; Bandeira, Nuno; Xenarios, Ioannis; Kohlbacher, Oliver; Vizcaíno, Juan Antonio; Hermjakob, Henning
2014-01-01
The HUPO Proteomics Standards Initiative has developed several standardized data formats to facilitate data sharing in mass spectrometry (MS)-based proteomics. These allow researchers to report their complete results in a unified way. However, at present, there is no format to describe the final qualitative and quantitative results for proteomics and metabolomics experiments in a simple tabular format. Many downstream analysis use cases are only concerned with the final results of an experiment and require an easily accessible format, compatible with tools such as Microsoft Excel or R. We developed the mzTab file format for MS-based proteomics and metabolomics results to meet this need. mzTab is intended as a lightweight supplement to the existing standard XML-based file formats (mzML, mzIdentML, mzQuantML), providing a comprehensive summary, similar in concept to the supplemental material of a scientific publication. mzTab files can contain protein, peptide, and small molecule identifications together with experimental metadata and basic quantitative information. The format is not intended to store the complete experimental evidence but provides mechanisms to report results at different levels of detail. These range from a simple summary of the final results to a representation of the results including the experimental design. This format is ideally suited to make MS-based proteomics and metabolomics results available to a wider biological community outside the field of MS. Several software tools for proteomics and metabolomics have already adapted the format as an output format. The comprehensive mzTab specification document and extensive additional documentation can be found online. PMID:24980485
DOGMA: domain-based transcriptome and proteome quality assessment.
Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten
2016-09-01
Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Molecular Diagnosis and Biomarker Identification on SELDI proteomics data by ADTBoost method.
Wang, Lu-Yong; Chakraborty, Amit; Comaniciu, Dorin
2005-01-01
Clinical proteomics is an emerging field that will have great impact on molecular diagnosis, identification of disease biomarkers, drug discovery and clinical trials in the post-genomic era. Protein profiling in tissues and fluids in disease and pathological control and other proteomics techniques will play an important role in molecular diagnosis with therapeutics and personalized healthcare. We introduced a new robust diagnostic method based on ADTboost algorithm, a novel algorithm in proteomics data analysis to improve classification accuracy. It generates classification rules, which are often smaller and easier to interpret. This method often gives most discriminative features, which can be utilized as biomarkers for diagnostic purpose. Also, it has a nice feature of providing a measure of prediction confidence. We carried out this method in amyotrophic lateral sclerosis (ALS) disease data acquired by surface enhanced laser-desorption/ionization-time-of-flight mass spectrometry (SELDI-TOF MS) experiments. Our method is shown to have outstanding prediction capacity through the cross-validation, ROC analysis results and comparative study. Our molecular diagnosis method provides an efficient way to distinguish ALS disease from neurological controls. The results are expressed in a simple and straightforward alternating decision tree format or conditional format. We identified most discriminative peaks in proteomic data, which can be utilized as biomarkers for diagnosis. It will have broad application in molecular diagnosis through proteomics data analysis and personalized medicine in this post-genomic era.
Venkataramanan, Keerthi P; Min, Lie; Hou, Shuyu; Jones, Shawn W; Ralston, Matthew T; Lee, Kelvin H; Papoutsakis, E Terry
2015-01-01
Clostridium acetobutylicum is a model organism for both clostridial biology and solvent production. The organism is exposed to its own toxic metabolites butyrate and butanol, which trigger an adaptive stress response. Integrative analysis of proteomic and RNAseq data may provide novel insights into post-transcriptional regulation. The identified iTRAQ-based quantitative stress proteome is made up of 616 proteins with a 15 % genome coverage. The differentially expressed proteome correlated poorly with the corresponding differential RNAseq transcriptome. Up to 31 % of the differentially expressed proteins under stress displayed patterns opposite to those of the transcriptome, thus suggesting significant post-transcriptional regulation. The differential proteome of the translation machinery suggests that cells employ a different subset of ribosomal proteins under stress. Several highly upregulated proteins but with low mRNA levels possessed mRNAs with long 5'UTRs and strong RBS scores, thus supporting the argument that regulatory elements on the long 5'UTRs control their translation. For example, the oxidative stress response rubrerythrin was upregulated only at the protein level up to 40-fold without significant mRNA changes. We also identified many leaderless transcripts, several displaying different transcriptional start sites, thus suggesting mRNA-trimming mechanisms under stress. Downregulation of Rho and partner proteins pointed to changes in transcriptional elongation and termination under stress. The integrative proteomic-transcriptomic analysis demonstrated complex expression patterns of a large fraction of the proteome. Such patterns could not have been detected with one or the other omic analyses. Our analysis proposes the involvement of specific molecular mechanisms of post-transcriptional regulation to explain the observed complex stress response.
MASH Suite Pro: A Comprehensive Software Tool for Top-Down Proteomics*
Cai, Wenxuan; Guner, Huseyin; Gregorich, Zachery R.; Chen, Albert J.; Ayaz-Guner, Serife; Peng, Ying; Valeja, Santosh G.; Liu, Xiaowen; Ge, Ying
2016-01-01
Top-down mass spectrometry (MS)-based proteomics is arguably a disruptive technology for the comprehensive analysis of all proteoforms arising from genetic variation, alternative splicing, and posttranslational modifications (PTMs). However, the complexity of top-down high-resolution mass spectra presents a significant challenge for data analysis. In contrast to the well-developed software packages available for data analysis in bottom-up proteomics, the data analysis tools in top-down proteomics remain underdeveloped. Moreover, despite recent efforts to develop algorithms and tools for the deconvolution of top-down high-resolution mass spectra and the identification of proteins from complex mixtures, a multifunctional software platform, which allows for the identification, quantitation, and characterization of proteoforms with visual validation, is still lacking. Herein, we have developed MASH Suite Pro, a comprehensive software tool for top-down proteomics with multifaceted functionality. MASH Suite Pro is capable of processing high-resolution MS and tandem MS (MS/MS) data using two deconvolution algorithms to optimize protein identification results. In addition, MASH Suite Pro allows for the characterization of PTMs and sequence variations, as well as the relative quantitation of multiple proteoforms in different experimental conditions. The program also provides visualization components for validation and correction of the computational outputs. Furthermore, MASH Suite Pro facilitates data reporting and presentation via direct output of the graphics. Thus, MASH Suite Pro significantly simplifies and speeds up the interpretation of high-resolution top-down proteomics data by integrating tools for protein identification, quantitation, characterization, and visual validation into a customizable and user-friendly interface. We envision that MASH Suite Pro will play an integral role in advancing the burgeoning field of top-down proteomics. PMID:26598644
Proteomic analysis of Medulloblastoma reveals functional biology with translational potential.
Rivero-Hinojosa, Samuel; Lau, Ling San; Stampar, Mojca; Staal, Jerome; Zhang, Huizhen; Gordish-Dressman, Heather; Northcott, Paul A; Pfister, Stefan M; Taylor, Michael D; Brown, Kristy J; Rood, Brian R
2018-06-07
Genomic characterization has begun to redefine diagnostic classifications of cancers. However, it remains a challenge to infer disease phenotypes from genomic alterations alone. To help realize the promise of genomics, we have performed a quantitative proteomics investigation using Stable Isotope Labeling by Amino Acids in Cell Culture (SILAC) and 41 tissue samples spanning the 4 genomically based subgroups of medulloblastoma and control cerebellum. We have identified and quantitated thousands of proteins across these groups and find that we are able to recapitulate the genomic subgroups based upon subgroup restricted and differentially abundant proteins while also identifying subgroup specific protein isoforms. Integrating our proteomic measurements with genomic data, we calculate a poor correlation between mRNA and protein abundance. Using EPIC 850 k methylation array data on the same tissues, we also investigate the influence of copy number alterations and DNA methylation on the proteome in an attempt to characterize the impact of these genetic features on the proteome. Reciprocally, we are able to use the proteome to identify which genomic alterations result in altered protein abundance and thus are most likely to impact biology. Finally, we are able to assemble protein-based pathways yielding potential avenues for clinical intervention. From these, we validate the EIF4F cap-dependent translation pathway as a novel druggable pathway in medulloblastoma. Thus, quantitative proteomics complements genomic platforms to yield a more complete understanding of functional tumor biology and identify novel therapeutic targets for medulloblastoma.
Gao, Jing; Zhong, Shaoyun; Zhou, Yanting; He, Han; Peng, Shuying; Zhu, Zhenyun; Liu, Xing; Zheng, Jing; Xu, Bin; Zhou, Hu
2017-06-06
Detergents and salts are widely used in lysis buffers to enhance protein extraction from biological samples, facilitating in-depth proteomic analysis. However, these detergents and salt additives must be efficiently removed from the digested samples prior to LC-MS/MS analysis to obtain high-quality mass spectra. Although filter-aided sample preparation (FASP), acetone precipitation (AP), followed by in-solution digestion, and strong cation exchange-based centrifugal proteomic reactors (CPRs) are commonly used for proteomic sample processing, little is known about their efficiencies at removing detergents and salt additives. In this study, we (i) developed an integrative workflow for the quantification of small molecular additives in proteomic samples, developing a multiple reaction monitoring (MRM)-based LC-MS approach for the quantification of six additives (i.e., Tris, urea, CHAPS, SDS, SDC, and Triton X-100) and (ii) systematically evaluated the relationships between the level of additive remaining in samples following sample processing and the number of peptides/proteins identified by mass spectrometry. Although FASP outperformed the other two methods, the results were complementary in terms of peptide/protein identification, as well as the GRAVY index and amino acid distributions. This is the first systematic and quantitative study of the effect of detergents and salt additives on protein identification. This MRM-based approach can be used for an unbiased evaluation of the performance of new sample preparation methods. Data are available via ProteomeXchange under identifier PXD005405.
Proteomic and Bioinformatic Profile of Primary Human Oral Epithelial Cells
Ghosh, Santosh K.; Yohannes, Elizabeth; Bebek, Gurkan; Weinberg, Aaron; Jiang, Bin; Willard, Belinda; Chance, Mark R.; Kinter, Michael T.; McCormick, Thomas S.
2012-01-01
Wounding of the oral mucosa occurs frequently in a highly septic environment. Remarkably, these wounds heal quickly and the oral cavity, for the most part, remains healthy. Deciphering the normal human oral epithelial cell (NHOEC) proteome is critical for understanding the mechanism(s) of protection elicited when the mucosal barrier is intact, as well as when it is breached. Combining 2D gel electrophoresis with shotgun proteomics resulted in identification of 1662 NHOEC proteins. Proteome annotations were performed based on protein classes, molecular functions, disease association and membership in canonical and metabolic signaling pathways. Comparing the NHOEC proteome with a database of innate immunity-relevant interactions (InnateDB) identified 64 common proteins associated with innate immunity. Comparison with published salivary proteomes revealed that 738/1662 NHOEC proteins were common, suggesting that significant numbers of salivary proteins are of epithelial origin. Gene ontology analysis showed similarities in the distributions of NHOEC and saliva proteomes with regard to biological processes, and molecular functions. We also assessed the inter-individual variability of the NHOEC proteome and observed it to be comparable with other primary cells. The baseline proteome described in this study should serve as a resource for proteome studies of the oral mucosa, especially in relation to disease processes. PMID:23035736
ERIC Educational Resources Information Center
Brown, Cecelia
2003-01-01
Discusses the growth in use and acceptance of Web-based genomic and proteomic databases (GPD) in scholarly communication. Confirms the role of GPD in the scientific literature cycle, suggests GPD are a storage and retrieval mechanism for molecular biology information, and recommends that existing models of scientific communication be updated to…
Functional analysis of proteins and protein species using shotgun proteomics and linear mathematics.
Hoehenwarter, Wolfgang; Chen, Yanmei; Recuenco-Munoz, Luis; Wienkoop, Stefanie; Weckwerth, Wolfram
2011-07-01
Covalent post-translational modification of proteins is the primary modulator of protein function in the cell. It greatly expands the functional potential of the proteome compared to the genome. In the past few years shotgun proteomics-based research, where the proteome is digested into peptides prior to mass spectrometric analysis has been prolific in this area. It has determined the kinetics of tens of thousands of sites of covalent modification on an equally large number of proteins under various biological conditions and uncovered a transiently active regulatory network that extends into diverse branches of cellular physiology. In this review, we discuss this work in light of the concept of protein speciation, which emphasizes the entire post-translationally modified molecule and its interactions and not just the modification site as the functional entity. Sometimes, particularly when considering complex multisite modification, all of the modified molecular species involved in the investigated condition, the protein species must be completely resolved for full understanding. We present a mathematical technique that delivers a good approximation for shotgun proteomics data.
Saliva Proteomics Analysis Offers Insights on Type 1 Diabetes Pathology in a Pediatric Population
Pappa, Eftychia; Vastardis, Heleni; Mermelekas, George; Gerasimidi-Vazeou, Andriani; Zoidakis, Jerome; Vougas, Konstantinos
2018-01-01
The composition of the salivary proteome is affected by pathological conditions. We analyzed by high resolution mass spectrometry approaches saliva samples collected from children and adolescents with type 1 diabetes and healthy controls. The list of more than 2000 high confidence protein identifications constitutes a comprehensive characterization of the salivary proteome. Patients with good glycemic regulation and healthy individuals have comparable proteomic profiles. In contrast, a significant number of differentially expressed proteins were identified in the saliva of patients with poor glycemic regulation compared to patients with good glycemic control and healthy children. These proteins are involved in biological processes relevant to diabetic pathology such as endothelial damage and inflammation. Moreover, a putative preventive therapeutic approach was identified based on bioinformatic analysis of the deregulated salivary proteins. Thus, thorough characterization of saliva proteins in diabetic pediatric patients established a connection between molecular changes and disease pathology. This proteomic and bioinformatic approach highlights the potential of salivary diagnostics in diabetes pathology and opens the way for preventive treatment of the disease. PMID:29755368
A human protein atlas for normal and cancer tissues based on antibody proteomics.
Uhlén, Mathias; Björling, Erik; Agaton, Charlotta; Szigyarto, Cristina Al-Khalili; Amini, Bahram; Andersen, Elisabet; Andersson, Ann-Catrin; Angelidou, Pia; Asplund, Anna; Asplund, Caroline; Berglund, Lisa; Bergström, Kristina; Brumer, Harry; Cerjan, Dijana; Ekström, Marica; Elobeid, Adila; Eriksson, Cecilia; Fagerberg, Linn; Falk, Ronny; Fall, Jenny; Forsberg, Mattias; Björklund, Marcus Gry; Gumbel, Kristoffer; Halimi, Asif; Hallin, Inga; Hamsten, Carl; Hansson, Marianne; Hedhammar, My; Hercules, Görel; Kampf, Caroline; Larsson, Karin; Lindskog, Mats; Lodewyckx, Wald; Lund, Jan; Lundeberg, Joakim; Magnusson, Kristina; Malm, Erik; Nilsson, Peter; Odling, Jenny; Oksvold, Per; Olsson, Ingmarie; Oster, Emma; Ottosson, Jenny; Paavilainen, Linda; Persson, Anja; Rimini, Rebecca; Rockberg, Johan; Runeson, Marcus; Sivertsson, Asa; Sköllermo, Anna; Steen, Johanna; Stenvall, Maria; Sterky, Fredrik; Strömberg, Sara; Sundberg, Mårten; Tegel, Hanna; Tourle, Samuel; Wahlund, Eva; Waldén, Annelie; Wan, Jinghong; Wernérus, Henrik; Westberg, Joakim; Wester, Kenneth; Wrethagen, Ulla; Xu, Lan Lan; Hober, Sophia; Pontén, Fredrik
2005-12-01
Antibody-based proteomics provides a powerful approach for the functional study of the human proteome involving the systematic generation of protein-specific affinity reagents. We used this strategy to construct a comprehensive, antibody-based protein atlas for expression and localization profiles in 48 normal human tissues and 20 different cancers. Here we report a new publicly available database containing, in the first version, approximately 400,000 high resolution images corresponding to more than 700 antibodies toward human proteins. Each image has been annotated by a certified pathologist to provide a knowledge base for functional studies and to allow queries about protein profiles in normal and disease tissues. Our results suggest it should be possible to extend this analysis to the majority of all human proteins thus providing a valuable tool for medical and biological research.
PrePhyloPro: phylogenetic profile-based prediction of whole proteome linkages
Niu, Yulong; Liu, Chengcheng; Moghimyfiroozabad, Shayan; Yang, Yi
2017-01-01
Direct and indirect functional links between proteins as well as their interactions as part of larger protein complexes or common signaling pathways may be predicted by analyzing the correlation of their evolutionary patterns. Based on phylogenetic profiling, here we present a highly scalable and time-efficient computational framework for predicting linkages within the whole human proteome. We have validated this method through analysis of 3,697 human pathways and molecular complexes and a comparison of our results with the prediction outcomes of previously published co-occurrency model-based and normalization methods. Here we also introduce PrePhyloPro, a web-based software that uses our method for accurately predicting proteome-wide linkages. We present data on interactions of human mitochondrial proteins, verifying the performance of this software. PrePhyloPro is freely available at http://prephylopro.org/phyloprofile/. PMID:28875072
Yang, Shuai; Zhang, Xinlei; Diao, Lihong; Guo, Feifei; Wang, Dan; Liu, Zhongyang; Li, Honglei; Zheng, Junjie; Pan, Jingshan; Nice, Edouard C; Li, Dong; He, Fuchu
2015-09-04
The Chromosome-centric Human Proteome Project (C-HPP) aims to catalog genome-encoded proteins using a chromosome-by-chromosome strategy. As the C-HPP proceeds, the increasing requirement for data-intensive analysis of the MS/MS data poses a challenge to the proteomic community, especially small laboratories lacking computational infrastructure. To address this challenge, we have updated the previous CAPER browser into a higher version, CAPER 3.0, which is a scalable cloud-based system for data-intensive analysis of C-HPP data sets. CAPER 3.0 uses cloud computing technology to facilitate MS/MS-based peptide identification. In particular, it can use both public and private cloud, facilitating the analysis of C-HPP data sets. CAPER 3.0 provides a graphical user interface (GUI) to help users transfer data, configure jobs, track progress, and visualize the results comprehensively. These features enable users without programming expertise to easily conduct data-intensive analysis using CAPER 3.0. Here, we illustrate the usage of CAPER 3.0 with four specific mass spectral data-intensive problems: detecting novel peptides, identifying single amino acid variants (SAVs) derived from known missense mutations, identifying sample-specific SAVs, and identifying exon-skipping events. CAPER 3.0 is available at http://prodigy.bprc.ac.cn/caper3.
Xue, Lu; Lin, Lin; Zhou, Wenbin; Chen, Wendong; Tang, Jun; Sun, Xiujie; Huang, Peiwu; Tian, Ruijun
2018-06-09
Plasma proteome profiling by LC-MS based proteomics has drawn great attention recently for biomarker discovery from blood liquid biopsy. Due to standard multi-step sample preparation could potentially cause plasma protein degradation and analysis variation, integrated proteomics sample preparation technologies became promising solution towards this end. Here, we developed a fully integrated proteomics sample preparation technology for both fast and deep plasma proteome profiling under its native pH. All the sample preparation steps, including protein digestion and two-dimensional fractionation by both mixed-mode ion exchange and high-pH reversed phase mechanism were integrated into one spintip device for the first time. The mixed-mode ion exchange beads design achieved the sample loading at neutral pH and protein digestion within 30 min. Potential sample loss and protein degradation by pH changing could be voided. 1 μL of plasma sample with depletion of high abundant proteins was processed by the developed technology with 12 equally distributed fractions and analyzed with 12 h of LC-MS gradient time, resulting in the identification of 862 proteins. The combination of the Mixed-mode-SISPROT and data-independent MS method achieved fast plasma proteome profiling in 2 h with high identification overlap and quantification precision for a proof-of-concept study of plasma samples from 5 healthy donors. We expect that the Mixed-mode-SISPROT become a generally applicable sample preparation technology for clinical oriented plasma proteome profiling. Copyright © 2018 Elsevier B.V. All rights reserved.
Zhang, Kai; Tang, Chaohua; Liang, Xiaowei; Zhao, Qingyu; Zhang, Junmin
2018-01-10
Salbutamol, a selective β 2 -agonist, endangers the safety of animal products as a result of illegal use in food animals. In this study, an iTRAQ-based untargeted quantitative proteomic approach was applied to screen potential protein biomarkers in plasma of cattle before and after treatment with salbutamol for 21 days. A total of 62 plasma proteins were significantly affected by salbutamol treatment, which can be used as potential biomarkers to screen for the illegal use of salbutamol in beef cattle. Enzyme-linked immunosorbent assay measurements of five selected proteins demonstrated the reliability of iTRAQ-based proteomics in screening of candidate biomarkers among the plasma proteins. The plasma samples collected before and after salbutamol treatment were well-separated by principal component analysis (PCA) using the differentially expressed proteins. These results suggested that an iTRAQ-based untargeted quantitative proteomic strategy combined with PCA pattern recognition methods can discriminate differences in plasma protein profiles collected before and after salbutamol treatment.
van Herwijnen, Martijn J.C.; Zonneveld, Marijke I.; Goerdayal, Soenita; Nolte – 't Hoen, Esther N.M.; Garssen, Johan; Stahl, Bernd; Maarten Altelaar, A.F.; Redegeld, Frank A.; Wauben, Marca H.M.
2016-01-01
Breast milk contains several macromolecular components with distinctive functions, whereby milk fat globules and casein micelles mainly provide nutrition to the newborn, and whey contains molecules that can stimulate the newborn's developing immune system and gastrointestinal tract. Although extracellular vesicles (EV) have been identified in breast milk, their physiological function and composition has not been addressed in detail. EV are submicron sized vehicles released by cells for intercellular communication via selectively incorporated lipids, nucleic acids, and proteins. Because of the difficulty in separating EV from other milk components, an in-depth analysis of the proteome of human milk-derived EV is lacking. In this study, an extensive LC-MS/MS proteomic analysis was performed of EV that had been purified from breast milk of seven individual donors using a recently established, optimized density-gradient-based EV isolation protocol. A total of 1963 proteins were identified in milk-derived EV, including EV-associated proteins like CD9, Annexin A5, and Flotillin-1, with a remarkable overlap between the different donors. Interestingly, 198 of the identified proteins are not present in the human EV database Vesiclepedia, indicating that milk-derived EV harbor proteins not yet identified in EV of different origin. Similarly, the proteome of milk-derived EV was compared with that of other milk components. For this, data from 38 published milk proteomic studies were combined in order to construct the total milk proteome, which consists of 2698 unique proteins. Remarkably, 633 proteins identified in milk-derived EV have not yet been identified in human milk to date. Interestingly, these novel proteins include proteins involved in regulation of cell growth and controlling inflammatory signaling pathways, suggesting that milk-derived EVs could support the newborn's developing gastrointestinal tract and immune system. Overall, this study provides an expansion of the whole milk proteome and illustrates that milk-derived EV are macromolecular components with a unique functional proteome. PMID:27601599
Tipton, Jeremiah D; Tran, John C; Catherman, Adam D; Ahlf, Dorothy R; Durbin, Kenneth R; Lee, Ji Eun; Kellie, John F; Kelleher, Neil L; Hendrickson, Christopher L; Marshall, Alan G
2012-03-06
Current high-throughput top-down proteomic platforms provide routine identification of proteins less than 25 kDa with 4-D separations. This short communication reports the application of technological developments over the past few years that improve protein identification and characterization for masses greater than 25 kDa. Advances in separation science have allowed increased numbers of proteins to be identified, especially by nanoliquid chromatography (nLC) prior to mass spectrometry (MS) analysis. Further, a goal of high-throughput top-down proteomics is to extend the mass range for routine nLC MS analysis up to 80 kDa because gene sequence analysis predicts that ~70% of the human proteome is transcribed to be less than 80 kDa. Normally, large proteins greater than 50 kDa are identified and characterized by top-down proteomics through fraction collection and direct infusion at relatively low throughput. Further, other MS-based techniques provide top-down protein characterization, however at low resolution for intact mass measurement. Here, we present analysis of standard (up to 78 kDa) and whole cell lysate proteins by Fourier transform ion cyclotron resonance mass spectrometry (nLC electrospray ionization (ESI) FTICR MS). The separation platform reduced the complexity of the protein matrix so that, at 14.5 T, proteins from whole cell lysate up to 72 kDa are baseline mass resolved on a nano-LC chromatographic time scale. Further, the results document routine identification of proteins at improved throughput based on accurate mass measurement (less than 10 ppm mass error) of precursor and fragment ions for proteins up to 50 kDa.
Proteomics of the Human Placenta: Promises and Realities
Robinson, J.M.; Ackerman, W.E.; Kniss, D.A.; Takizawa, T.; Vandré, D.D.
2015-01-01
Proteomics is an area of study that sets as its ultimate goal the global analysis of all of the proteins expressed in a biological system of interest. However, technical limitations currently hamper proteome-wide analyses of complex systems. In a more practical sense, a desired outcome of proteomics research is the translation of large protein data sets into formats that provide meaningful information regarding clinical conditions (e.g., biomarkers to serve as diagnostic and/or prognostic indicators of disease). Herein, we discuss placental proteomics by describing existing studies, pointing out their strengths and weaknesses. In so doing, we strive to inform investigators interested in this area of research about the current gap between hyperbolic promises and realities. Additionally, we discuss the utility of proteomics in discovery-based research, particularly as regards the capacity to unearth novel insights into placental biology. Importantly, when considering under studied systems such as the human placenta and diseases associated with abnormalities in placental function, proteomics can serve as a robust ‘shortcut’ to obtaining information unlikely to be garnered using traditional approaches. PMID:18222537
Zeng, Yunliu; Pan, Zhiyong; Ding, Yuduan; Zhu, Andan; Cao, Hongbo; Xu, Qiang; Deng, Xiuxin
2011-01-01
Here, a comprehensive proteomic analysis of the chromoplasts purified from sweet orange using Nycodenz density gradient centrifugation is reported. A GeLC-MS/MS shotgun approach was used to identify the proteins of pooled chromoplast samples. A total of 493 proteins were identified from purified chromoplasts, of which 418 are putative plastid proteins based on in silico sequence homology and functional analyses. Based on the predicted functions of these identified plastid proteins, a large proportion (∼60%) of the chromoplast proteome of sweet orange is constituted by proteins involved in carbohydrate metabolism, amino acid/protein synthesis, and secondary metabolism. Of note, HDS (hydroxymethylbutenyl 4-diphosphate synthase), PAP (plastid-lipid-associated protein), and psHSPs (plastid small heat shock proteins) involved in the synthesis or storage of carotenoid and stress response are among the most abundant proteins identified. A comparison of chromoplast proteomes between sweet orange and tomato suggested a high level of conservation in a broad range of metabolic pathways. However, the citrus chromoplast was characterized by more extensive carotenoid synthesis, extensive amino acid synthesis without nitrogen assimilation, and evidence for lipid metabolism concerning jasmonic acid synthesis. In conclusion, this study provides an insight into the major metabolic pathways as well as some unique characteristics of the sweet orange chromoplasts at the whole proteome level. PMID:21841170
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Jing; Ma, Zihao; Carr, Steven A.
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC).more » Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies. Molecular & Cellular Proteomics 16: 10.1074/mcp.M116.060301, 121–134, 2017.« less
Clark, David J; Fondrie, William E; Liao, Zhongping; Hanson, Phyllis I; Fulton, Amy; Mao, Li; Yang, Austin J
2015-10-20
Exosomes are microvesicles of endocytic origin constitutively released by multiple cell types into the extracellular environment. With evidence that exosomes can be detected in the blood of patients with various malignancies, the development of a platform that uses exosomes as a diagnostic tool has been proposed. However, it has been difficult to truly define the exosome proteome due to the challenge of discerning contaminant proteins that may be identified via mass spectrometry using various exosome enrichment strategies. To better define the exosome proteome in breast cancer, we incorporated a combination of Tandem-Mass-Tag (TMT) quantitative proteomics approach and Support Vector Machine (SVM) cluster analysis of three conditioned media derived fractions corresponding to a 10 000g cellular debris pellet, a 100 000g crude exosome pellet, and an Optiprep enriched exosome pellet. The quantitative analysis identified 2 179 proteins in all three fractions, with known exosomal cargo proteins displaying at least a 2-fold enrichment in the exosome fraction based on the TMT protein ratios. Employing SVM cluster analysis allowed for the classification 251 proteins as "true" exosomal cargo proteins. This study provides a robust and vigorous framework for the future development of using exosomes as a potential multiprotein marker phenotyping tool that could be useful in breast cancer diagnosis and monitoring disease progression.
Top-down Proteomics in Health and Disease: Challenges and Opportunities
Gregorich, Zachery R.; Ge, Ying
2014-01-01
Proteomics is essential for deciphering how molecules interact as a system and for understanding the functions of cellular systems in human disease; however, the unique characteristics of the human proteome, which include a high dynamic range of protein expression and extreme complexity due to a plethora of post-translational modifications (PTMs) and sequence variations, make such analyses challenging. An emerging “top-down” mass spectrometry (MS)-based proteomics approach, which provides a “bird’s eye” view of all proteoforms, has unique advantages for the assessment of PTMs and sequence variations. Recently, a number of studies have showcased the potential of top-down proteomics for unraveling of disease mechanisms and discovery of new biomarkers. Nevertheless, the top-down approach still faces significant challenges in terms of protein solubility, separation, and the detection of large intact proteins, as well as the under-developed data analysis tools. Consequently, new technological developments are urgently needed to advance the field of top-down proteomics. Herein, we intend to provide an overview of the recent applications of top-down proteomics in biomedical research. Moreover, we will outline the challenges and opportunities facing top-down proteomics strategies aimed at understanding and diagnosing human diseases. PMID:24723472
A comprehensive and scalable database search system for metaproteomics.
Chatterjee, Sandip; Stupp, Gregory S; Park, Sung Kyu Robin; Ducom, Jean-Christophe; Yates, John R; Su, Andrew I; Wolan, Dennis W
2016-08-16
Mass spectrometry-based shotgun proteomics experiments rely on accurate matching of experimental spectra against a database of protein sequences. Existing computational analysis methods are limited in the size of their sequence databases, which severely restricts the proteomic sequencing depth and functional analysis of highly complex samples. The growing amount of public high-throughput sequencing data will only exacerbate this problem. We designed a broadly applicable metaproteomic analysis method (ComPIL) that addresses protein database size limitations. Our approach to overcome this significant limitation in metaproteomics was to design a scalable set of sequence databases assembled for optimal library querying speeds. ComPIL was integrated with a modified version of the search engine ProLuCID (termed "Blazmass") to permit rapid matching of experimental spectra. Proof-of-principle analysis of human HEK293 lysate with a ComPIL database derived from high-quality genomic libraries was able to detect nearly all of the same peptides as a search with a human database (~500x fewer peptides in the database), with a small reduction in sensitivity. We were also able to detect proteins from the adenovirus used to immortalize these cells. We applied our method to a set of healthy human gut microbiome proteomic samples and showed a substantial increase in the number of identified peptides and proteins compared to previous metaproteomic analyses, while retaining a high degree of protein identification accuracy and allowing for a more in-depth characterization of the functional landscape of the samples. The combination of ComPIL with Blazmass allows proteomic searches to be performed with database sizes much larger than previously possible. These large database searches can be applied to complex meta-samples with unknown composition or proteomic samples where unexpected proteins may be identified. The protein database, proteomic search engine, and the proteomic data files for the 5 microbiome samples characterized and discussed herein are open source and available for use and additional analysis.
Comparative analysis of genomics and proteomics in Bacillus thuringiensis 4.0718.
Rang, Jie; He, Hao; Wang, Ting; Ding, Xuezhi; Zuo, Mingxing; Quan, Meifang; Sun, Yunjun; Yu, Ziquan; Hu, Shengbiao; Xia, Liqiu
2015-01-01
Bacillus thuringiensis is a widely used biopesticide that produced various insecticidal active substances during its life cycle. Separation and purification of numerous insecticide active substances have been difficult because of the relatively short half-life of such substances. On the other hand, substances can be synthetized at different times during development, so samples at different stages have to be studied, further complicating the analysis. A dual genomic and proteomic approach would enhance our ability to identify such substances, and particularily using mass spectrometry-based proteomic methods. The comparative analysis for genomic and proteomic data have showed that not all of the products deduced from the annotated genome could be identified among the proteomic data. For instance, genome annotation results showed that 39 coding sequences in the whole genome were related to insect pathogenicity, including five cry genes. However, Cry2Ab, Cry1Ia, Cytotoxin K, Bacteriocin, Exoenzyme C3 and Alveolysin could not be detected in the proteomic data obtained. The sporulation-related proteins were also compared analysis, results showed that the great majority sporulation-related proteins can be detected by mass spectrometry. This analysis revealed Spo0A~P, SigF, SigE(+), SigK(+) and SigG(+), all known to play an important role in the process of spore formation regulatory network, also were displayed in the proteomic data. Through the comparison of the two data sets, it was possible to infer that some genes were silenced or were expressed at very low levels. For instance, found that cry2Ab seems to lack a functional promoter while cry1Ia may not be expressed due to the presence of transposons. With this comparative study a relatively complete database can be constructed and used to transform hereditary material, thereby prompting the high expression of toxic proteins. A theoretical basis is provided for constructing highly virulent engineered bacteria and for promoting the application of proteogenomics in the life sciences.
Lima, D. C.; Duarte, F. T.; Medeiros, V. K. S.; Carvalho, P. C.; Nogueira, F. C. S.; Araujo, G. D. T.; Domont, G. B.; Batistuzzo de Medeiros, S. R.
2016-01-01
Chromobacterium violaceum is a free-living bacillus with several genes that enables it survival under different harsh environments such as oxidative and temperature stresses. Here we performed a label-free quantitative proteomic study to unravel the molecular mechanisms that enable C. violaceum to survive oxidative stress. To achieve this, total proteins extracted from control and C. violaceum cultures exposed during two hours with 8 mM hydrogen peroxide were analyzed using GeLC-MS proteomics. Analysis revealed that under the stress condition, the bacterium expressed proteins that protected it from the damage caused by reactive oxygen condition and decreasing the abundance of proteins responsible for bacterial growth and catabolism. GeLC-MS proteomics analysis provided an overview of the metabolic pathways involved in the response of C. violaceum to oxidative stress ultimately aggregating knowledge of the response of this organism to environmental stress. This study identified approximately 1500 proteins, generating the largest proteomic coverage of C. violaceum so far. We also detected proteins with unknown function that we hypothesize to be part of new mechanisms related to oxidative stress defense. Finally, we identified the mechanism of clustered regularly interspaced short palindromic repeats (CRISPR), which has not yet been reported for this organism. PMID:27321545
Lima, D C; Duarte, F T; Medeiros, V K S; Carvalho, P C; Nogueira, F C S; Araujo, G D T; Domont, G B; Batistuzzo de Medeiros, S R
2016-06-20
Chromobacterium violaceum is a free-living bacillus with several genes that enables it survival under different harsh environments such as oxidative and temperature stresses. Here we performed a label-free quantitative proteomic study to unravel the molecular mechanisms that enable C. violaceum to survive oxidative stress. To achieve this, total proteins extracted from control and C. violaceum cultures exposed during two hours with 8 mM hydrogen peroxide were analyzed using GeLC-MS proteomics. Analysis revealed that under the stress condition, the bacterium expressed proteins that protected it from the damage caused by reactive oxygen condition and decreasing the abundance of proteins responsible for bacterial growth and catabolism. GeLC-MS proteomics analysis provided an overview of the metabolic pathways involved in the response of C. violaceum to oxidative stress ultimately aggregating knowledge of the response of this organism to environmental stress. This study identified approximately 1500 proteins, generating the largest proteomic coverage of C. violaceum so far. We also detected proteins with unknown function that we hypothesize to be part of new mechanisms related to oxidative stress defense. Finally, we identified the mechanism of clustered regularly interspaced short palindromic repeats (CRISPR), which has not yet been reported for this organism.
Differential expression profiling of serum proteins and metabolites for biomarker discovery
NASA Astrophysics Data System (ADS)
Roy, Sushmita Mimi; Anderle, Markus; Lin, Hua; Becker, Christopher H.
2004-11-01
A liquid chromatography-mass spectrometry (LC-MS) proteomics and metabolomics platform is presented for quantitative differential expression analysis. Proteome profiles obtained from 1.5 [mu]L of human serum show ~5000 de-isotoped and quantifiable molecular ions. Approximately 1500 metabolites are observed from 100 [mu]L of serum. Quantification is based on reproducible sample preparation and linear signal intensity as a function of concentration. The platform is validated using human serum, but is generally applicable to all biological fluids and tissues. The median coefficient of variation (CV) for ~5000 proteomic and ~1500 metabolomic molecular ions is approximately 25%. For the case of C-reactive protein, results agree with quantification by immunoassay. The independent contributions of two sources of variance, namely sample preparation and LC-MS analysis, are respectively quantified as 20.4 and 15.1% for the proteome, and 19.5 and 13.5% for the metabolome, for median CV values. Furthermore, biological diversity for ~20 healthy individuals is estimated by measuring the variance of ~6500 proteomic and metabolomic molecular ions in sera for each sample; the median CV is 22.3% for the proteome and 16.7% for the metabolome. Finally, quantitative differential expression profiling is applied to a clinical study comparing healthy individuals and rheumatoid arthritis (RA) patients.
Proteomic Analysis of the Cell Cycle of Procylic Form Trypanosoma brucei.
Crozier, Thomas W M; Tinti, Michele; Wheeler, Richard J; Ly, Tony; Ferguson, Michael A J; Lamond, Angus I
2018-06-01
We describe a single-step centrifugal elutriation method to produce synchronous Gap1 (G1)-phase procyclic trypanosomes at a scale amenable for proteomic analysis of the cell cycle. Using ten-plex tandem mass tag (TMT) labeling and mass spectrometry (MS)-based proteomics technology, the expression levels of 5325 proteins were quantified across the cell cycle in this parasite. Of these, 384 proteins were classified as cell-cycle regulated and subdivided into nine clusters with distinct temporal regulation. These groups included many known cell cycle regulators in trypanosomes, which validates the approach. In addition, we identify 40 novel cell cycle regulated proteins that are essential for trypanosome survival and thus represent potential future drug targets for the prevention of trypanosomiasis. Through cross-comparison to the TrypTag endogenous tagging microscopy database, we were able to validate the cell-cycle regulated patterns of expression for many of the proteins of unknown function detected in our proteomic analysis. A convenient interface to access and interrogate these data is also presented, providing a useful resource for the scientific community. Data are available via ProteomeXchange with identifier PXD008741 (https://www.ebi.ac.uk/pride/archive/). © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Data Portal | Office of Cancer Clinical Proteomics Research
The CPTAC Data Portal is a centralized repository for the public dissemination of proteomic sequence datasets collected by CPTAC, along with corresponding genomic sequence datasets. In addition, available are analyses of CPTAC's raw mass spectrometry-based data files (mapping of spectra to peptide sequences and protein identification) by individual investigators from CPTAC and by a Common Data Analysis Pipeline.
Pan, Sheng; Rush, John; Peskind, Elaine R; Galasko, Douglas; Chung, Kathryn; Quinn, Joseph; Jankovic, Joseph; Leverenz, James B; Zabetian, Cyrus; Pan, Catherine; Wang, Yan; Oh, Jung Hun; Gao, Jean; Zhang, Jianpeng; Montine, Thomas; Zhang, Jing
2008-02-01
Targeted quantitative proteomics by mass spectrometry aims to selectively detect one or a panel of peptides/proteins in a complex sample and is particularly appealing for novel biomarker verification/validation because it does not require specific antibodies. Here, we demonstrated the application of targeted quantitative proteomics in searching, identifying, and quantifying selected peptides in human cerebrospinal spinal fluid (CSF) using a matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometer (MALDI TOF/TOF)-based platform. The approach involved two major components: the use of isotopic-labeled synthetic peptides as references for targeted identification and quantification and a highly selective mass spectrometric analysis based on the unique characteristics of the MALDI instrument. The platform provides high confidence for targeted peptide detection in a complex system and can potentially be developed into a high-throughput system. Using the liquid chromatography (LC) MALDI TOF/TOF platform and the complementary identification strategy, we were able to selectively identify and quantify a panel of targeted peptides in the whole proteome of CSF without prior depletion of abundant proteins. The effectiveness and robustness of the approach associated with different sample complexity, sample preparation strategies, as well as mass spectrometric quantification were evaluated. Other issues related to chromatography separation and the feasibility for high-throughput analysis were also discussed. Finally, we applied targeted quantitative proteomics to analyze a subset of previously identified candidate markers in CSF samples of patients with Parkinson's disease (PD) at different stages and Alzheimer's disease (AD) along with normal controls.
Han, Mee-Jung
2017-11-28
The Escherichia coli K-12 and B strains are among the most frequently used bacterial hosts for scientific research and biotechnological applications. However, omics analyses have revealed that E. coli K-12 and B exhibit notably different genotypic and phenotypic attributes, even though they were derived from the same ancestor. In a previous study, we identified a limited number of proteins from the two strains using two-dimensional gel electrophoresis and tandem mass spectrometry (MS/MS). In this study, an in-depth analysis of the physiological behavior of the E. coli K-12 and B strains at the proteomic level was performed using six-plex isobaric tandem mass tag-based quantitative MS. Additionally, the best lysis buffer for increasing the efficiency of protein extraction was selected from three tested buffers prior to the quantitative proteomic analysis. This study identifies the largest number of proteins in the two E. coli strains reported to date and is the first to show the dynamics of these proteins. Notable differences in proteins associated with key cellular properties, including some metabolic pathways, the biosynthesis and degradation of amino acids, membrane integrity, cellular tolerance, and motility, were found between the two representative strains. Compared with previous studies, these proteomic results provide a more holistic view of the overall state of E. coli cells based on a single proteomic study and reveal significant insights into why the two strains show distinct phenotypes. Additionally, the resulting data provide in-depth information that will help fine-tune processes in the future.
TRIC: an automated alignment strategy for reproducible protein quantification in targeted proteomics
Röst, Hannes L.; Liu, Yansheng; D’Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C.; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi
2016-01-01
Large scale, quantitative proteomic studies have become essential for the analysis of clinical cohorts, large perturbation experiments and systems biology studies. While next-generation mass spectrometric techniques such as SWATH-MS have substantially increased throughput and reproducibility, ensuring consistent quantification of thousands of peptide analytes across multiple LC-MS/MS runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we have developed the TRIC software which utilizes fragment ion data to perform cross-run alignment, consistent peak-picking and quantification for high throughput targeted proteomics. TRIC uses a graph-based alignment strategy based on non-linear retention time correction to integrate peak elution information from all LC-MS/MS runs acquired in a study. When compared to state-of-the-art SWATH-MS data analysis, the algorithm was able to reduce the identification error by more than 3-fold at constant recall, while correcting for highly non-linear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem (iPS) cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups and substantially increased the quantitative completeness and biological information in the data, providing insights into protein dynamics of iPS cells. Overall, this study demonstrates the importance of consistent quantification in highly challenging experimental setups, and proposes an algorithm to automate this task, constituting the last missing piece in a pipeline for automated analysis of massively parallel targeted proteomics datasets. PMID:27479329
Expanding the bovine milk proteome through extensive fractionation.
Nissen, Asger; Bendixen, Emøke; Ingvartsen, Klaus Lønne; Røntved, Christine Maria
2013-01-01
Bovine milk is an agricultural product of tremendous value worldwide. It contains proteins, fat, lactose, vitamins, and minerals. It provides nutrition and immunological protection (e.g., in the gastrointestinal tract) to the newborn and young calf. It also forms an important part of human nutrition. The repertoire of proteins in milk (i.e., its proteome) is vast and complex. The milk proteome can be described in detail by mass spectrometry-based proteomics. However, the high concentration of dominating proteins in milk reduces mass spectrometry detection sensitivity and limits detection of low abundant proteins. Further, the general health and udder health of the dairy cows delivering the milk may influence the composition of the milk proteome. To gain a more exhaustive and true picture of the milk proteome, we performed an extensive preanalysis fractionation of raw composite milk collected from documented healthy cows in early lactation. Four simple and industrially applicable techniques exploring the physical and chemical properties of milk, including acidification, filtration, and centrifugation, were used for separation of the proteins. This resulted in 5 different fractions, whose content of proteins were compared with the proteins of nonfractionated milk using 2-dimensional liquid chromatography tandem mass spectrometry analysis. To validate the proteome analysis, spectral counts and ELISA were performed on 7 proteins using the ELISA for estimation of the detection sensitivity limit of the 2-dimensional liquid chromatography tandem mass spectrometry analysis. Each fractionation technique resulted in identification of a unique subset of proteins. However, high-speed centrifugation of milk to whey was by far the best method to achieve high and repeatable proteome coverage. The total number of milk proteins initially detected in nonfractionated milk and the fractions were 635 in 2 replicates. Removal of dominant proteins and filtering for redundancy across the different fractions reduced the number to 376 unique proteins in 2 replicates. In addition, 366 proteins were detected by this process in 1 replicate. Hence, by applying different fractionation techniques to milk, we expanded the milk proteome. The milk proteome map may serve as a reference for scientists working in the dairy sector. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
The National Cancer Institute will hold a public pre-application webinar on Friday, December 11 at 12:00 p.m. (EST) for the Funding Opportunity Announcements (FOAs) RFA-CA-15-021 entitled “Proteome Characterization Centers for Clinical Proteomic Tumor Analysis Consortium (U24), RFA-CA-15-022 entitled “Proteogenomic Translational Research Centers for Clinical Proteomic Tumor Analysis Consortium (U01)”, and RFA-CA-15-023 entitled “Proteogenomic Data Analysis Centers for Clinical Proteomic Tumor Analysis Consortium (U24)”.
Zamora-Briseño, Jesús Alejandro; Reyes-Hernández, Sandi Julissa; Zapata, Luis Carlos Rodríguez
2018-06-02
Plant response to water stress involves the activation of mechanisms expected to help them cope with water scarcity. Among these mechanisms, proteome-wide adjustment is well known. This includes actions to save energy, protect cellular and molecular components, and maintain vital functions of the cell. Intrinsically disordered proteins, which are proteins without a rigid three-dimensional structure, are seen as emerging multifunctional cellular components of proteomes. They are highly abundant in eukaryotic proteomes, and numerous functions for these proteins have been proposed. Here, we discuss several reasons why the collection of intrinsically disordered proteins in a proteome (disordome) could be subjected to an active regulation during conditions of water scarcity in plants. We also discuss the potential misinterpretations of disordome content estimations made so far due to bias-prone data and the need for reliable analysis based on experimental data in order to acknowledge the plasticity nature of the disordome.
Malmström, Erik; Kilsgård, Ola; Hauri, Simon; Smeds, Emanuel; Herwald, Heiko; Malmström, Lars; Malmström, Johan
2016-01-01
The plasma proteome is highly dynamic and variable, composed of proteins derived from surrounding tissues and cells. To investigate the complex processes that control the composition of the plasma proteome, we developed a mass spectrometry-based proteomics strategy to infer the origin of proteins detected in murine plasma. The strategy relies on the construction of a comprehensive protein tissue atlas from cells and highly vascularized organs using shotgun mass spectrometry. The protein tissue atlas was transformed to a spectral library for highly reproducible quantification of tissue-specific proteins directly in plasma using SWATH-like data-independent mass spectrometry analysis. We show that the method can determine drastic changes of tissue-specific protein profiles in blood plasma from mouse animal models with sepsis. The strategy can be extended to several other species advancing our understanding of the complex processes that contribute to the plasma proteome dynamics. PMID:26732734
Quantitative proteomics in Giardia duodenalis-Achievements and challenges.
Emery, Samantha J; Lacey, Ernest; Haynes, Paul A
2016-08-01
Giardia duodenalis (syn. G. lamblia and G. intestinalis) is a protozoan parasite of vertebrates and a major contributor to the global burden of diarrheal diseases and gastroenteritis. The publication of multiple genome sequences in the G. duodenalis species complex has provided important insights into parasite biology, and made post-genomic technologies, including proteomics, significantly more accessible. The aims of proteomics are to identify and quantify proteins present in a cell, and assign functions to them within the context of dynamic biological systems. In Giardia, proteomics in the post-genomic era has transitioned from reliance on gel-based systems to utilisation of a diverse array of techniques based on bottom-up LC-MS/MS technologies. Together, these have generated crucial foundations for subcellular proteomes, elucidated intra- and inter-assemblage isolate variation, and identified pathways and markers in differentiation, host-parasite interactions and drug resistance. However, in Giardia, proteomics remains an emerging field, with considerable shortcomings evident from the published research. These include a bias towards assemblage A, a lack of emphasis on quantitative analytical techniques, and limited information on post-translational protein modifications. Additionally, there are multiple areas of research for which proteomic data is not available to add value to published transcriptomic data. The challenge of amalgamating data in the systems biology paradigm necessitates the further generation of large, high-quality quantitative datasets to accurately model parasite biology. This review surveys the current proteomic research available for Giardia and evaluates their technical and quantitative approaches, while contextualising their biological insights into parasite pathology, isolate variation and eukaryotic evolution. Finally, we propose areas of priority for the generation of future proteomic data to explore fundamental questions in Giardia, including the analysis of post-translational modifications, and the design of MS-based assays for validation of differentially expressed proteins in large datasets. Copyright © 2016 Elsevier B.V. All rights reserved.
Morisawa, Hiraku; Hirota, Mikako; Toda, Tosifusa
2006-01-01
Background In the post-genome era, most research scientists working in the field of proteomics are confronted with difficulties in management of large volumes of data, which they are required to keep in formats suitable for subsequent data mining. Therefore, a well-developed open source laboratory information management system (LIMS) should be available for their proteomics research studies. Results We developed an open source LIMS appropriately customized for 2-D gel electrophoresis-based proteomics workflow. The main features of its design are compactness, flexibility and connectivity to public databases. It supports the handling of data imported from mass spectrometry software and 2-D gel image analysis software. The LIMS is equipped with the same input interface for 2-D gel information as a clickable map on public 2DPAGE databases. The LIMS allows researchers to follow their own experimental procedures by reviewing the illustrations of 2-D gel maps and well layouts on the digestion plates and MS sample plates. Conclusion Our new open source LIMS is now available as a basic model for proteome informatics, and is accessible for further improvement. We hope that many research scientists working in the field of proteomics will evaluate our LIMS and suggest ways in which it can be improved. PMID:17018156
Lavallée-Adam, Mathieu; Rauniyar, Navin; McClatchy, Daniel B; Yates, John R
2014-12-05
The majority of large-scale proteomics quantification methods yield long lists of quantified proteins that are often difficult to interpret and poorly reproduced. Computational approaches are required to analyze such intricate quantitative proteomics data sets. We propose a statistical approach to computationally identify protein sets (e.g., Gene Ontology (GO) terms) that are significantly enriched with abundant proteins with reproducible quantification measurements across a set of replicates. To this end, we developed PSEA-Quant, a protein set enrichment analysis algorithm for label-free and label-based protein quantification data sets. It offers an alternative approach to classic GO analyses, models protein annotation biases, and allows the analysis of samples originating from a single condition, unlike analogous approaches such as GSEA and PSEA. We demonstrate that PSEA-Quant produces results complementary to GO analyses. We also show that PSEA-Quant provides valuable information about the biological processes involved in cystic fibrosis using label-free protein quantification of a cell line expressing a CFTR mutant. Finally, PSEA-Quant highlights the differences in the mechanisms taking place in the human, rat, and mouse brain frontal cortices based on tandem mass tag quantification. Our approach, which is available online, will thus improve the analysis of proteomics quantification data sets by providing meaningful biological insights.
2015-01-01
The majority of large-scale proteomics quantification methods yield long lists of quantified proteins that are often difficult to interpret and poorly reproduced. Computational approaches are required to analyze such intricate quantitative proteomics data sets. We propose a statistical approach to computationally identify protein sets (e.g., Gene Ontology (GO) terms) that are significantly enriched with abundant proteins with reproducible quantification measurements across a set of replicates. To this end, we developed PSEA-Quant, a protein set enrichment analysis algorithm for label-free and label-based protein quantification data sets. It offers an alternative approach to classic GO analyses, models protein annotation biases, and allows the analysis of samples originating from a single condition, unlike analogous approaches such as GSEA and PSEA. We demonstrate that PSEA-Quant produces results complementary to GO analyses. We also show that PSEA-Quant provides valuable information about the biological processes involved in cystic fibrosis using label-free protein quantification of a cell line expressing a CFTR mutant. Finally, PSEA-Quant highlights the differences in the mechanisms taking place in the human, rat, and mouse brain frontal cortices based on tandem mass tag quantification. Our approach, which is available online, will thus improve the analysis of proteomics quantification data sets by providing meaningful biological insights. PMID:25177766
Mudaliar, Manikhandan; Tassi, Riccardo; Thomas, Funmilola C; McNeilly, Tom N; Weidt, Stefan K; McLaughlin, Mark; Wilson, David; Burchmore, Richard; Herzyk, Pawel; Eckersall, P David; Zadoks, Ruth N
2016-08-16
Mastitis, inflammation of the mammary gland, is the most common and costly disease of dairy cattle in the western world. It is primarily caused by bacteria, with Streptococcus uberis as one of the most prevalent causative agents. To characterize the proteome during Streptococcus uberis mastitis, an experimentally induced model of intramammary infection was used. Milk whey samples obtained from 6 cows at 6 time points were processed using label-free relative quantitative proteomics. This proteomic analysis complements clinical, bacteriological and immunological studies as well as peptidomic and metabolomic analysis of the same challenge model. A total of 2552 non-redundant bovine peptides were identified, and from these, 570 bovine proteins were quantified. Hierarchical cluster analysis and principal component analysis showed clear clustering of results by stage of infection, with similarities between pre-infection and resolution stages (0 and 312 h post challenge), early infection stages (36 and 42 h post challenge) and late infection stages (57 and 81 h post challenge). Ingenuity pathway analysis identified upregulation of acute phase protein pathways over the course of infection, with dominance of different acute phase proteins at different time points based on differential expression analysis. Antimicrobial peptides, notably cathelicidins and peptidoglycan recognition protein, were upregulated at all time points post challenge and peaked at 57 h, which coincided with 10 000-fold decrease in average bacterial counts. The integration of clinical, bacteriological, immunological and quantitative proteomics and other-omic data provides a more detailed systems level view of the host response to mastitis than has been achieved previously.
A two-dimensional proteome map of the aflatoxigenic fungus Aspergillus flavus.
Pechanova, Olga; Pechan, Tibor; Rodriguez, Jose M; Williams, W Paul; Brown, Ashli E
2013-05-01
The filamentous fungus Aspergillus flavus is an opportunistic soil-borne pathogen that produces aflatoxins, the most potent naturally occurring carcinogenic compounds known. This work represents the first gel-based profiling analysis of A. flavus proteome and establishes a 2D proteome map. Using 2DE and MALDI-TOF-MS/MS, we identified 538 mycelial proteins of the aflatoxigenic strain NRRL 3357, the majority of which were functionally annotated as related to various cellular metabolic and biosynthetic processes. Additionally, a few enzymes from the aflatoxin synthesis pathway were also identified. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Jeromson, Stewart; Mackenzie, Ivor; Doherty, Mary K; Whitfield, Phillip D; Bell, Gordon; Dick, James; Shaw, Andy; Rao, Francesco V; Ashcroft, Stephen P; Philp, Andrew; Galloway, Stuart D R; Gallagher, Iain; Hamilton, D Lee
2018-06-01
In striated muscle, eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) have differential effects on the metabolism of glucose and differential effects on the metabolism of protein. We have shown that, despite similar incorporation, treatment of C 2 C 12 myotubes (CM) with EPA but not DHA improves glucose uptake and protein accretion. We hypothesized that these differential effects of EPA and DHA may be due to divergent shifts in lipidomic profiles leading to altered proteomic profiles. We therefore carried out an assessment of the impact of treating CM with EPA and DHA on lipidomic and proteomic profiles. Fatty acid methyl esters (FAME) analysis revealed that both EPA and DHA led to similar but substantials changes in fatty acid profiles with the exception of arachidonic acid, which was decreased only by DHA, and docosapentanoic acid (DPA), which was increased only by EPA treatment. Global lipidomic analysis showed that EPA and DHA induced large alterations in the cellular lipid profiles and in particular, the phospholipid classes. Subsequent targeted analysis confirmed that the most differentially regulated species were phosphatidylcholines and phosphatidylethanolamines containing long-chain fatty acids with five (EPA treatment) or six (DHA treatment) double bonds. As these are typically membrane-associated lipid species we hypothesized that these treatments differentially altered the membrane-associated proteome. Stable isotope labeling by amino acids in cell culture (SILAC)-based proteomics of the membrane fraction revealed significant divergence in the effects of EPA and DHA on the membrane-associated proteome. We conclude that the EPA-specific increase in polyunsaturated long-chain fatty acids in the phospholipid fraction is associated with an altered membrane-associated proteome and these may be critical events in the metabolic remodeling induced by EPA treatment.
P-MartCancer–Interactive Online Software to Enable Analysis of Shotgun Cancer Proteomic Datasets
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Bramer, Lisa M.; Jensen, Jeffrey L.
P-MartCancer is a new interactive web-based software environment that enables biomedical and biological scientists to perform in-depth analyses of global proteomics data without requiring direct interaction with the data or with statistical software. P-MartCancer offers a series of statistical modules associated with quality assessment, peptide and protein statistics, protein quantification and exploratory data analyses driven by the user via customized workflows and interactive visualization. Currently, P-MartCancer offers access to multiple cancer proteomic datasets generated through the Clinical Proteomics Tumor Analysis Consortium (CPTAC) at the peptide, gene and protein levels. P-MartCancer is deployed using Azure technologies (http://pmart.labworks.org/cptac.html), the web-service is alternativelymore » available via Docker Hub (https://hub.docker.com/r/pnnl/pmart-web/) and many statistical functions can be utilized directly from an R package available on GitHub (https://github.com/pmartR).« less
Supramolecular Affinity Chromatography for Methylation-Targeted Proteomics.
Garnett, Graham A E; Starke, Melissa J; Shaurya, Alok; Li, Janessa; Hof, Fraser
2016-04-05
Proteome-wide studies of post-translationally methylated species using mass spectrometry are complicated by high sample diversity, competition for ionization among peptides, and mass redundancies. Antibody-based enrichment has powered methylation proteomics until now, but the reliability, pan-specificity, polyclonal nature, and stability of the available pan-specific antibodies are problematic and do not provide a standard, reliable platform for investigators. We have invented an anionic supramolecular host that can form host-guest complexes selectively with methyllysine-containing peptides and used it to create a methylysine-affinity column. The column resolves peptides on the basis of methylation-a feat impossible with a comparable commercial cation-exchange column. A proteolyzed nuclear extract was separated on the methyl-affinity column prior to standard proteomics analysis. This experiment demonstrates that such chemical methyl-affinity columns are capable of enriching and improving the analysis of methyllysine residues from complex protein mixtures. We discuss the importance of this advance in the context of biomolecule-driven enrichment methods.
A proteomic landscape of diffuse-type gastric cancer.
Ge, Sai; Xia, Xia; Ding, Chen; Zhen, Bei; Zhou, Quan; Feng, Jinwen; Yuan, Jiajia; Chen, Rui; Li, Yumei; Ge, Zhongqi; Ji, Jiafu; Zhang, Lianhai; Wang, Jiayuan; Li, Zhongwu; Lai, Yumei; Hu, Ying; Li, Yanyan; Li, Yilin; Gao, Jing; Chen, Lin; Xu, Jianming; Zhang, Chunchao; Jung, Sung Yun; Choi, Jong Min; Jain, Antrix; Liu, Mingwei; Song, Lei; Liu, Wanlin; Guo, Gaigai; Gong, Tongqing; Huang, Yin; Qiu, Yang; Huang, Wenwen; Shi, Tieliu; Zhu, Weimin; Wang, Yi; He, Fuchu; Shen, Lin; Qin, Jun
2018-03-08
The diffuse-type gastric cancer (DGC) is a subtype of gastric cancer with the worst prognosis and few treatment options. Here we present a dataset from 84 DGC patients, composed of a proteome of 11,340 gene products and mutation information of 274 cancer driver genes covering paired tumor and nearby tissue. DGC can be classified into three subtypes (PX1-3) based on the altered proteome alone. PX1 and PX2 exhibit dysregulation in the cell cycle and PX2 features an additional EMT process; PX3 is enriched in immune response proteins, has the worst survival, and is insensitive to chemotherapy. Data analysis revealed four major vulnerabilities in DGC that may be targeted for treatment, and allowed the nomination of potential immunotherapy targets for DGC patients, particularly for those in PX3. This dataset provides a rich resource for information and knowledge mining toward altered signaling pathways in DGC and demonstrates the benefit of proteomic analysis in cancer molecular subtyping.
Yu, Kebing; Salomon, Arthur R
2009-12-01
Recently, dramatic progress has been achieved in expanding the sensitivity, resolution, mass accuracy, and scan rate of mass spectrometers able to fragment and identify peptides through MS/MS. Unfortunately, this enhanced ability to acquire proteomic data has not been accompanied by a concomitant increase in the availability of flexible tools allowing users to rapidly assimilate, explore, and analyze this data and adapt to various experimental workflows with minimal user intervention. Here we fill this critical gap by providing a flexible relational database called PeptideDepot for organization of expansive proteomic data sets, collation of proteomic data with available protein information resources, and visual comparison of multiple quantitative proteomic experiments. Our software design, built upon the synergistic combination of a MySQL database for safe warehousing of proteomic data with a FileMaker-driven graphical user interface for flexible adaptation to diverse workflows, enables proteomic end-users to directly tailor the presentation of proteomic data to the unique analysis requirements of the individual proteomics lab. PeptideDepot may be deployed as an independent software tool or integrated directly with our high throughput autonomous proteomic pipeline used in the automated acquisition and post-acquisition analysis of proteomic data.
Shi, Jing; Zhang, Longteng; Lei, Yutian; Shen, Huixing; Yu, Xunpei; Luo, Yongkang
2018-06-15
An iTRAQ-based strategy was applied to investigate proteome changes in mud shrimp during long-term frozen storage under different conditions. A total of 226 proteins was identified as differential abundance proteins (DAPs) in mud shrimp from two frozen treatment groups (-20 °C and -40 °C) compared with the fresh control group. The proteome changes in mud shrimp muscle stored under -20 °C was much greater than that under -40 °C. Correlation analysis between DAPs and quality traits of mud shrimp muscle showed that 12 proteins were correlated closely with color (L ∗ , a ∗ , and b ∗ value) and texture (hardness, elasticity, and chewiness). Bioinformatic analysis revealed that most of these proteins were involved in protein structure, metabolic enzymes, and protein turnover. Among them, several proteins might be potential protein markers for color, and some proteins are good candidate predictors for textural properties of mud shrimp muscle. Copyright © 2018 Elsevier Ltd. All rights reserved.
Andromeda: a peptide search engine integrated into the MaxQuant environment.
Cox, Jürgen; Neuhauser, Nadin; Michalski, Annette; Scheltema, Richard A; Olsen, Jesper V; Mann, Matthias
2011-04-01
A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
Dyrlund, Thomas F; Poulsen, Ebbe T; Scavenius, Carsten; Sanggaard, Kristian W; Enghild, Jan J
2012-09-01
Data processing and analysis of proteomics data are challenging and time consuming. In this paper, we present MS Data Miner (MDM) (http://sourceforge.net/p/msdataminer), a freely available web-based software solution aimed at minimizing the time required for the analysis, validation, data comparison, and presentation of data files generated in MS software, including Mascot (Matrix Science), Mascot Distiller (Matrix Science), and ProteinPilot (AB Sciex). The program was developed to significantly decrease the time required to process large proteomic data sets for publication. This open sourced system includes a spectra validation system and an automatic screenshot generation tool for Mascot-assigned spectra. In addition, a Gene Ontology term analysis function and a tool for generating comparative Excel data reports are included. We illustrate the benefits of MDM during a proteomics study comprised of more than 200 LC-MS/MS analyses recorded on an AB Sciex TripleTOF 5600, identifying more than 3000 unique proteins and 3.5 million peptides. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
iTRAQ-Based Quantitative Proteomic Analysis of the Initiation of Head Regeneration in Planarians.
Geng, Xiaofang; Wang, Gaiping; Qin, Yanli; Zang, Xiayan; Li, Pengfei; Geng, Zhi; Xue, Deming; Dong, Zimei; Ma, Kexue; Chen, Guangwen; Xu, Cunshuan
2015-01-01
The planarian Dugesia japonica has amazing ability to regenerate a head from the anterior ends of the amputated stump with maintenance of the original anterior-posterior polarity. Although planarians present an attractive system for molecular investigation of regeneration and research has focused on clarifying the molecular mechanism of regeneration initiation in planarians at transcriptional level, but the initiation mechanism of planarian head regeneration (PHR) remains unclear at the protein level. Here, a global analysis of proteome dynamics during the early stage of PHR was performed using isobaric tags for relative and absolute quantitation (iTRAQ)-based quantitative proteomics strategy, and our data are available via ProteomeXchange with identifier PXD002100. The results showed that 162 proteins were differentially expressed at 2 h and 6 h following amputation. Furthermore, the analysis of expression patterns and functional enrichment of the differentially expressed proteins showed that proteins involved in muscle contraction, oxidation reduction and protein synthesis were up-regulated in the initiation of PHR. Moreover, ingenuity pathway analysis showed that predominant signaling pathways such as ILK, calcium, EIF2 and mTOR signaling which were associated with cell migration, cell proliferation and protein synthesis were likely to be involved in the initiation of PHR. The results for the first time demonstrated that muscle contraction and ILK signaling might played important roles in the initiation of PHR at the global protein level. The findings of this research provide a molecular basis for further unraveling the mechanism of head regeneration initiation in planarians.
Hong, Haifa; Ye, Lincai; Chen, Huiwen; Xia, Yu; Liu, Yue; Liu, Jinfen; Lu, Yanan; Zhang, Haibo
2015-08-01
We aimed to evaluate global changes in protein expression associated with patency by undertaking proteomic analysis of human constricted and patent ductus arteriosus (DA). Ten constricted and 10 patent human DAs were excised from infants with ductal-dependent heart disease during surgery. Using isobaric tags for relative and absolute quantitation-based quantitative proteomics, 132 differentially expressed proteins were identified. Of 132 proteins, voltage-gated sodium channel 1.3 (SCN3A), myosin 1d (Myo1d), Rho GTPase activating protein 26 (ARHGAP26), and retinitis pigmentosa 1 (RP1) were selected for validation by Western blot and quantitative real-time polymerase chain reaction analyses. Significant upregulation of SCN3A, Myo1d, and RP1 messenger RNA, and protein levels was observed in the patent DA group (all P ≤ 0.048). ARHGAP26 messenger RNA and protein levels were decreased in patent DA tissue (both P ≤ 0.018). Immunohistochemistry analysis revealed that Myo1d, ARHGAP26, and RP1 were specifically expressed in the subendothelial region of constricted DAs; however, diffuse expression of these proteins was noted in the patent group. Proteomic analysis revealed global changes in the expression of proteins that regulate oxygen sensing, ion channels, smooth muscle cell migration, nervous system, immune system, and metabolism, suggesting a basis for the systemic regulation of DA patency by diverse signaling pathways, which will be confirmed in further studies.
Proteomic profiling of early degenerative retina of RCS rats.
Zhu, Zhi-Hong; Fu, Yan; Weng, Chuan-Huang; Zhao, Cong-Jian; Yin, Zheng-Qin
2017-01-01
To identify the underlying cellular and molecular changes in retinitis pigmentosa (RP). Label-free quantification-based proteomics analysis, with its advantages of being more economic and consisting of simpler procedures, has been used with increasing frequency in modern biological research. Dystrophic RCS rats, the first laboratory animal model for the study of RP, possess a similar pathological course as human beings with the diseases. Thus, we employed a comparative proteomics analysis approach for in-depth proteome profiling of retinas from dystrophic RCS rats and non-dystrophic congenic controls through Linear Trap Quadrupole - orbitrap MS/MS, to identify the significant differentially expressed proteins (DEPs). Bioinformatics analyses, including Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway annotation and upstream regulatory analysis, were then performed on these retina proteins. Finally, a Western blotting experiment was carried out to verify the difference in the abundance of transcript factor E2F1. In this study, we identified a total of 2375 protein groups from the retinal protein samples of RCS rats and non-dystrophic congenic controls. Four hundred thirty-four significantly DEPs were selected by Student's t -test. Based on the results of the bioinformatics analysis, we identified mitochondrial dysfunction and transcription factor E2F1 as the key initiation factors in early retinal degenerative process. We showed that the mitochondrial dysfunction and the transcription factor E2F1 substantially contribute to the disease etiology of RP. The results provide a new potential therapeutic approach for this retinal degenerative disease.
Dissecting plasmodesmata molecular composition by mass spectrometry-based proteomics.
Salmon, Magali S; Bayer, Emmanuelle M F
2012-01-01
In plants, the intercellular communication through the membranous channels called plasmodesmata (PD; singular plasmodesma) plays pivotal roles in the orchestration of development, defence responses, and viral propagation. PD are dynamic structures embedded in the plant cell wall that are defined by specialized domains of the endoplasmic reticulum (ER) and the plasma membrane (PM). PD structure and unique functions are guaranteed by their particular molecular composition. Yet, up to recent years and despite numerous approaches such as mutant screens, immunolocalization, or screening of random cDNAs, only few PD proteins had been conclusively identified and characterized. A clear breakthrough in the search of PD constituents came from mass-spectrometry-based proteomic approaches coupled with subcellular fractionation strategies. Due to their position, firmly anchored in the extracellular matrix, PD are notoriously difficult to isolate for biochemical analysis. Proteomic-based approaches have therefore first relied on the use of cell wall fractions containing embedded PD then on "free" PD fractions whereby PD membranes were released from the walls by enzymatic degradation. To discriminate between likely contaminants and PD protein candidates, bioinformatics tools have often been used in combination with proteomic approaches. GFP fusion proteins of selected candidates have confirmed the PD association of several protein families. Here we review the accomplishments and limitations of the proteomic-based strategies to unravel the functional and structural complexity of PD. We also discuss the role of the identified PD-associated proteins.
Quantitative body fluid proteomics in medicine - A focus on minimal invasiveness.
Csősz, Éva; Kalló, Gergő; Márkus, Bernadett; Deák, Eszter; Csutak, Adrienne; Tőzsér, József
2017-02-05
Identification of new biomarkers specific for various pathological conditions is an important field in medical sciences. Body fluids have emerging potential in biomarker studies especially those which are continuously available and can be collected by non-invasive means. Changes in the protein composition of body fluids such as tears, saliva, sweat, etc. may provide information on both local and systemic conditions of medical relevance. In this review, our aim is to discuss the quantitative proteomics techniques used in biomarker studies, and to present advances in quantitative body fluid proteomics of non-invasively collectable body fluids with relevance to biomarker identification. The advantages and limitations of the widely used quantitative proteomics techniques are also presented. Based on the reviewed literature, we suggest an ideal pipeline for body fluid analyses aiming at biomarkers discoveries: starting from identification of biomarker candidates by shotgun quantitative proteomics or protein arrays, through verification of potential biomarkers by targeted mass spectrometry, to the antibody-based validation of biomarkers. The importance of body fluids as a rich source of biomarkers is discussed. Quantitative proteomics is a challenging part of proteomics applications. The body fluids collected by non-invasive means have high relevance in medicine; they are good sources for biomarkers used in establishing the diagnosis, follow up of disease progression and predicting high risk groups. The review presents the most widely used quantitative proteomics techniques in body fluid analysis and lists the potential biomarkers identified in tears, saliva, sweat, nasal mucus and urine for local and systemic diseases. Copyright © 2016 Elsevier B.V. All rights reserved.
SILAC-Based Comparative Proteomic Analysis of Lysosomes from Mammalian Cells Using LC-MS/MS.
Thelen, Melanie; Winter, Dominic; Braulke, Thomas; Gieselmann, Volkmar
2017-01-01
Mass spectrometry-based proteomics of lysosomal proteins has led to significant advances in understanding lysosomal function and pathology. The ever-increasing sensitivity and resolution of mass spectrometry in combination with labeling procedures which allow comparative quantitative proteomics can be applied to shed more light on the steadily increasing range of lysosomal functions. In addition, investigation of alterations in lysosomal protein composition in the many lysosomal storage diseases may yield further insights into the molecular pathology of these disorders. Here, we describe a protocol which allows to determine quantitative differences in the lysosomal proteome of cells which are genetically and/or biochemically different or have been exposed to certain stimuli. The method is based on stable isotope labeling of amino acids in cell culture (SILAC). Cells are exposed to superparamagnetic iron oxide particles which are endocytosed and delivered to lysosomes. After homogenization of cells, intact lysosomes are rapidly enriched by passing the cell homogenates over a magnetic column. Lysosomes are eluted after withdrawal of the magnetic field and subjected to mass spectrometry.
Wang, Li-Chao; Wei, Wen-Hui; Zhang, Xiao-Wen; Liu, Dan; Zeng, Ke-Wu; Tu, Peng-Fei
2018-01-01
Drastic macrophages activation triggered by exogenous infection or endogenous stresses is thought to be implicated in the pathogenesis of various inflammatory diseases. Carnosic acid (CA), a natural phenolic diterpene extracted from Salvia officinalis plant, has been reported to possess anti-inflammatory activity. However, its role in macrophages activation as well as potential molecular mechanism is largely unexplored. In the current study, we sought to elucidate the anti-inflammatory property of CA using an integrated approach based on unbiased proteomics and bioinformatics analysis. CA significantly inhibited the robust increase of nitric oxide and TNF-α, downregulated COX2 protein expression, and lowered the transcriptional level of inflammatory genes including Nos2, Tnfα, Cox2, and Mcp1 in LPS-stimulated RAW264.7 cells, a murine model of peritoneal macrophage cell line. The LC-MS/MS-based shotgun proteomics analysis showed CA negatively regulated 217 LPS-elicited proteins which were involved in multiple inflammatory processes including MAPK, nuclear factor (NF)-κB, and FoxO signaling pathways. A further molecular biology analysis revealed that CA effectually inactivated IKKβ/IκB-α/NF-κB, ERK/JNK/p38 MAPKs, and FoxO1/3 signaling pathways. Collectively, our findings demonstrated the role of CA in regulating inflammation response and provide some insights into the proteomics-guided pharmacological mechanism study of natural products. PMID:29713284
Sun, Xiaochuan; Wang, Yan; Xu, Liang; Li, Chao; Zhang, Wei; Luo, Xiaobo; Jiang, Haiyan; Liu, Liwang
2017-01-01
To understand the molecular mechanism underlying salt stress response in radish, iTRAQ-based proteomic analysis was conducted to investigate the differences in protein species abundance under different salt treatments. In total, 851, 706, and 685 differential abundance protein species (DAPS) were identified between CK vs. Na100, CK vs. Na200, and Na100 vs. Na200, respectively. Functional annotation analysis revealed that salt stress elicited complex proteomic alterations in radish roots involved in carbohydrate and energy metabolism, protein metabolism, signal transduction, transcription regulation, stress and defense and transport. Additionally, the expression levels of nine genes encoding DAPS were further verified using RT-qPCR. The integrative analysis of transcriptomic and proteomic data in conjunction with miRNAs was further performed to strengthen the understanding of radish response to salinity. The genes responsible for signal transduction, ROS scavenging and transport activities as well as several key miRNAs including miR171, miR395, and miR398 played crucial roles in salt stress response in radish. Based on these findings, a schematic genetic regulatory network of salt stress response was proposed. This study provided valuable insights into the molecular mechanism underlying salt stress response in radish roots and would facilitate developing effective strategies toward genetically engineered salt-tolerant radish and other root vegetable crops. PMID:28769938
Image analysis tools and emerging algorithms for expression proteomics
English, Jane A.; Lisacek, Frederique; Morris, Jeffrey S.; Yang, Guang-Zhong; Dunn, Michael J.
2012-01-01
Since their origins in academic endeavours in the 1970s, computational analysis tools have matured into a number of established commercial packages that underpin research in expression proteomics. In this paper we describe the image analysis pipeline for the established 2-D Gel Electrophoresis (2-DE) technique of protein separation, and by first covering signal analysis for Mass Spectrometry (MS), we also explain the current image analysis workflow for the emerging high-throughput ‘shotgun’ proteomics platform of Liquid Chromatography coupled to MS (LC/MS). The bioinformatics challenges for both methods are illustrated and compared, whilst existing commercial and academic packages and their workflows are described from both a user’s and a technical perspective. Attention is given to the importance of sound statistical treatment of the resultant quantifications in the search for differential expression. Despite wide availability of proteomics software, a number of challenges have yet to be overcome regarding algorithm accuracy, objectivity and automation, generally due to deterministic spot-centric approaches that discard information early in the pipeline, propagating errors. We review recent advances in signal and image analysis algorithms in 2-DE, MS, LC/MS and Imaging MS. Particular attention is given to wavelet techniques, automated image-based alignment and differential analysis in 2-DE, Bayesian peak mixture models and functional mixed modelling in MS, and group-wise consensus alignment methods for LC/MS. PMID:21046614
Liu, Qun; Peng, Yong-Bo; Qi, Lian-Wen; Cheng, Xiao-Lan; Xu, Xiao-Jun; Liu, Le-Le; Liu, E-Hu; Li, Ping
2012-01-01
Cervical cancer is one of the most common cancers among women in the world. 6-Shogaol is a natural compound isolated from the rhizome of ginger (Zingiber officinale). In this paper, we demonstrated that 6-shogaol induced apoptosis and G2/M phase arrest in human cervical cancer HeLa cells. Endoplasmic reticulum stress and mitochondrial pathway were involved in 6-shogaol-mediated apoptosis. Proteomic analysis based on label-free strategy by liquid chromatography chip quadrupole time-of-flight mass spectrometry was subsequently proposed to identify, in a non-target-biased manner, the molecular changes in cellular proteins in response to 6-shogaol treatment. A total of 287 proteins were differentially expressed in response to 24 h treatment with 15 μM 6-shogaol in HeLa cells. Significantly changed proteins were subjected to functional pathway analysis by multiple analyzing software. Ingenuity pathway analysis (IPA) suggested that 14-3-3 signaling is a predominant canonical pathway involved in networks which may be significantly associated with the process of apoptosis and G2/M cell cycle arrest induced by 6-shogaol. In conclusion, this work developed an unbiased protein analysis strategy by shotgun proteomics and bioinformatics analysis. Data observed provide a comprehensive analysis of the 6-shogaol-treated HeLa cell proteome and reveal protein alterations that are associated with its anticancer mechanism.
Liu, Qun; Peng, Yong-Bo; Qi, Lian-Wen; Cheng, Xiao-Lan; Xu, Xiao-Jun; Liu, Le-Le; Liu, E-Hu; Li, Ping
2012-01-01
Cervical cancer is one of the most common cancers among women in the world. 6-Shogaol is a natural compound isolated from the rhizome of ginger (Zingiber officinale). In this paper, we demonstrated that 6-shogaol induced apoptosis and G2/M phase arrest in human cervical cancer HeLa cells. Endoplasmic reticulum stress and mitochondrial pathway were involved in 6-shogaol-mediated apoptosis. Proteomic analysis based on label-free strategy by liquid chromatography chip quadrupole time-of-flight mass spectrometry was subsequently proposed to identify, in a non-target-biased manner, the molecular changes in cellular proteins in response to 6-shogaol treatment. A total of 287 proteins were differentially expressed in response to 24 h treatment with 15 μM 6-shogaol in HeLa cells. Significantly changed proteins were subjected to functional pathway analysis by multiple analyzing software. Ingenuity pathway analysis (IPA) suggested that 14-3-3 signaling is a predominant canonical pathway involved in networks which may be significantly associated with the process of apoptosis and G2/M cell cycle arrest induced by 6-shogaol. In conclusion, this work developed an unbiased protein analysis strategy by shotgun proteomics and bioinformatics analysis. Data observed provide a comprehensive analysis of the 6-shogaol-treated HeLa cell proteome and reveal protein alterations that are associated with its anticancer mechanism. PMID:23243437
Methods, Tools and Current Perspectives in Proteogenomics *
Ruggles, Kelly V.; Krug, Karsten; Wang, Xiaojing; Clauser, Karl R.; Wang, Jing; Payne, Samuel H.; Fenyö, David; Zhang, Bing; Mani, D. R.
2017-01-01
With combined technological advancements in high-throughput next-generation sequencing and deep mass spectrometry-based proteomics, proteogenomics, i.e. the integrative analysis of proteomic and genomic data, has emerged as a new research field. Early efforts in the field were focused on improving protein identification using sample-specific genomic and transcriptomic sequencing data. More recently, integrative analysis of quantitative measurements from genomic and proteomic studies have identified novel insights into gene expression regulation, cell signaling, and disease. Many methods and tools have been developed or adapted to enable an array of integrative proteogenomic approaches and in this article, we systematically classify published methods and tools into four major categories, (1) Sequence-centric proteogenomics; (2) Analysis of proteogenomic relationships; (3) Integrative modeling of proteogenomic data; and (4) Data sharing and visualization. We provide a comprehensive review of methods and available tools in each category and highlight their typical applications. PMID:28456751
A HUPO test sample study reveals common problems in mass spectrometry-based proteomics
Bell, Alexander W.; Deutsch, Eric W.; Au, Catherine E.; Kearney, Robert E.; Beavis, Ron; Sechi, Salvatore; Nilsson, Tommy; Bergeron, John J.M.
2009-01-01
We carried out a test sample study to try to identify errors leading to irreproducibility, including incompleteness of peptide sampling, in LC-MS-based proteomics. We distributed a test sample consisting of an equimolar mix of 20 highly purified recombinant human proteins, to 27 laboratories for identification. Each protein contained one or more unique tryptic peptides of 1250 Da to also test for ion selection and sampling in the mass spectrometer. Of the 27 labs, initially only 7 labs reported all 20 proteins correctly, and only 1 lab reported all the tryptic peptides of 1250 Da. Nevertheless, a subsequent centralized analysis of the raw data revealed that all 20 proteins and most of the 1250 Da peptides had in fact been detected by all 27 labs. The centralized analysis allowed us to determine sources of problems encountered in the study, which include missed identifications (false negatives), environmental contamination, database matching, and curation of protein identifications. Improved search engines and databases are likely to increase the fidelity of mass spectrometry-based proteomics. PMID:19448641
Salunkhe, Vishal; De Cuyper, Iris M; Papadopoulos, Petros; van der Meer, Pieter F; Daal, Brunette B; Villa-Fajardo, María; de Korte, Dirk; van den Berg, Timo K; Gutiérrez, Laura
2018-03-19
Platelet concentrates (PCs) represent a blood transfusion product with a major concern for safety as their storage temperature (20-24°C) allows bacterial growth, and their maximum storage time period (less than a week) precludes complete microbiological testing. Pathogen inactivation technologies (PITs) provide an additional layer of safety to the blood transfusion products from known and unknown pathogens such as bacteria, viruses, and parasites. In this context, PITs, such as Mirasol Pathogen Reduction Technology (PRT), have been developed and are implemented in many countries. However, several studies have shown in vitro that Mirasol PRT induces a certain level of platelet shape change, hyperactivation, basal degranulation, and increased oxidative damage during storage. It has been suggested that Mirasol PRT might accelerate what has been described as the platelet storage lesion (PSL), but supportive molecular signatures have not been obtained. We aimed at dissecting the influence of both variables, that is, Mirasol PRT and storage time, at the proteome level. We present comprehensive proteomics data analysis of Control PCs and PCs treated with Mirasol PRT at storage days 1, 2, 6, and 8. Our workflow was set to perform proteomics analysis using a gel-free and label-free quantification (LFQ) approach. Semi-quantification was based on LFQ signal intensities of identified proteins using MaxQuant/Perseus software platform. Data are available via ProteomeXchange with identifier PXD008119. We identified marginal differences between Mirasol PRT and Control PCs during storage. However, those significant changes at the proteome level were specifically related to the functional aspects previously described to affect platelets upon Mirasol PRT. In addition, the effect of Mirasol PRT on the platelet proteome appeared not to be exclusively due to an accelerated or enhanced PSL. In summary, semi-quantitative proteomics allows to discern between proteome changes due to Mirasol PRT or PSL, and proves to be a methodology suitable to phenotype platelets in an unbiased manner, in various physiological contexts.
Use of focused ultrasonication in activity-based profiling of deubiquitinating enzymes in tissue.
Nanduri, Bindu; Shack, Leslie A; Rai, Aswathy N; Epperson, William B; Baumgartner, Wes; Schmidt, Ty B; Edelmann, Mariola J
2016-12-15
To develop a reproducible tissue lysis method that retains enzyme function for activity-based protein profiling, we compared four different methods to obtain protein extracts from bovine lung tissue: focused ultrasonication, standard sonication, mortar & pestle method, and homogenization combined with standard sonication. Focused ultrasonication and mortar & pestle methods were sufficiently effective for activity-based profiling of deubiquitinases in tissue, and focused ultrasonication also had the fastest processing time. We used focused-ultrasonicator for subsequent activity-based proteomic analysis of deubiquitinases to test the compatibility of this method in sample preparation for activity-based chemical proteomics. Copyright © 2016 Elsevier Inc. All rights reserved.
Proteomics Analysis of the Causative Agent of Typhoid Fever
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ansong, Charles; Yoon, Hyunjin; Norbeck, Angela D.
2008-02-01
Typhoid fever is a potentially fatal disease caused by the bacterial pathogen Salmonella enterica serovar Typhi (S. typhi). S. typhi infection is a complex process that involves numerous bacterially-encoded virulence determinants, and these are thought to confer both stringent human host specificity and a high mortality rate. In the present study we used a liquid chromatography-mass spectrometry (LC-MS) based proteomics strategy to investigate the proteome of logarithmic, stationary phase, and low pH/low magnesium (MgM) S. typhi cultures. This represents the first large scale comprehensive characterization of the S. typhi proteome. Our analysis identified a total of 2066 S. typhi proteins.more » In an effort to identify putative S. typhi-specific virulence factors, we then compared our S. typhi results to those obtained in a previously published study of the S. typhimurium proteome under similar conditions (Adkins J.N. et al (2006) Mol Cell Prot). Comparative proteomic analysis of S. typhi (strain Ty2) and S. typhimurium (strains LT2 and 14028) revealed a subset of highly expressed proteins unique to S. typhi that were exclusively detected under conditions that mimic the infective state in macrophage cells. These proteins included CdtB, HlyE, and a conserved protein encoded by t1476. The differential expression of selected proteins was confirmed by Western blot analysis. Taken together with the current literature, our observations suggest that this subset of proteins may play a role in S. typhi pathogenesis and human host specificity. In addition, we observed products of the biotin (bio) operon displayed a higher abundance in the more virulent strains S. typhi-Ty2 and S. typhimurium-14028 compared to the virulence attenuated S. typhimurium strain LT2, suggesting bio proteins may contribute to Salmonella pathogenesis.« less
Welham, Nathan V; Chang, Zhen; Smith, Lloyd M; Frey, Brian L
2013-01-01
Natural biologic scaffolds for tissue engineering are commonly generated by decellularization of tissues and organs. Despite some preclinical and clinical success, in vivo scaffold remodeling and functional outcomes remain variable, presumably due to the influence of unidentified bioactive molecules on the scaffold-host interaction. Here, we used 2D electrophoresis and high-resolution mass spectrometry-based proteomic analyses to evaluate decellularization effectiveness and identify potentially bioactive protein remnants in a human vocal fold mucosa model. We noted proteome, phosphoproteome and O-glycoproteome depletion post-decellularization, and identified >200 unique protein species within the decellularized scaffold. Gene ontology-based enrichment analysis revealed a dominant set of functionally-related ontology terms associated with extracellular matrix assembly, organization, morphology and patterning, consistent with preservation of a tissue-specific niche for later cell seeding and infiltration. We further identified a subset of ontology terms associated with bioactive (some of which are antigenic) cellular proteins, despite histological and immunohistochemical data indicating complete decellularization. These findings demonstrate the value of mass spectrometry-based proteomics in identifying agents potentially responsible for variation in host response to engineered tissues derived from decellularized scaffolds. This work has implications for the manufacturing of biologic scaffolds from any tissue or organ, as well as for prediction and monitoring of the scaffold-host interaction in vivo. Copyright © 2012 Elsevier Ltd. All rights reserved.
Vijay, Sonam; Rawal, Ritu; Kadian, Kavita; Singh, Jagbir; Adak, Tridibesh; Sharma, Arun
2018-05-08
Midgut invasion, a major bottleneck for malaria parasites transmission is considered as a potential target for vector-parasite interaction studies. New intervention strategies are required to explore the midgut proteins and their potential role in refractoriness for malaria control in Anopheles mosquitoes. To better understand the midgut functional proteins of An. culicifacies susceptible and refractory species, proteomic approaches coupled with bioinformatics analysis is an effective means in order to understand the mechanism of refractoriness. In the present study, an integrated in solution- in gel trypsin digestion approach, along with Isobaric tag for relative and absolute quantitation (iTRAQ)-Liquid chromatography/Mass spectrometry (LC/MS/MS) and data mining were performed to identify the proteomic profile and differentially expressed proteins in Anopheles culicifacies susceptible species A and refractory species B. Shot gun proteomics approaches led to the identification of 80 proteins in An. culicifacies susceptible species A and 92 in refractory species B and catalogue was prepared. iTRAQ based proteomic analysis identified 48 differentially expressed proteins from total 130 proteins. Of these, 41 were downregulated and 7 were upregulated in refractory species B in comparison to susceptible species A. We report that the altered midgut proteins identified in naturally refractory mosquitoes are involved in oxidative phosphorylation, antioxidant and proteolysis process that may suggest their role in parasite growth inhibition. Furthermore, real time polymerase chain reaction (PCR) analysis of few proteins indicated higher expression of iTRAQ upregulated protein in refractory species than susceptible species. This study elucidates the first proteome of the midguts of An. culicifacies sibling species that attempts to analyze unique proteogenomic interactions to provide insights for better understanding of the mechanism of refractoriness. Functional implications of these upregulated proteins in refractory species may reflect the phenotypic characteristics of the mosquitoes and will improve our understandings of blood meal digestion process, parasite vector interactions and proteomes of other vectors of human diseases for development of novel vector control strategies.
Protein and gene model inference based on statistical modeling in k-partite graphs.
Gerster, Sarah; Qeli, Ermir; Ahrens, Christian H; Bühlmann, Peter
2010-07-06
One of the major goals of proteomics is the comprehensive and accurate description of a proteome. Shotgun proteomics, the method of choice for the analysis of complex protein mixtures, requires that experimentally observed peptides are mapped back to the proteins they were derived from. This process is also known as protein inference. We present Markovian Inference of Proteins and Gene Models (MIPGEM), a statistical model based on clearly stated assumptions to address the problem of protein and gene model inference for shotgun proteomics data. In particular, we are dealing with dependencies among peptides and proteins using a Markovian assumption on k-partite graphs. We are also addressing the problems of shared peptides and ambiguous proteins by scoring the encoding gene models. Empirical results on two control datasets with synthetic mixtures of proteins and on complex protein samples of Saccharomyces cerevisiae, Drosophila melanogaster, and Arabidopsis thaliana suggest that the results with MIPGEM are competitive with existing tools for protein inference.
Monitoring Peptidase Activities in Complex Proteomes by MALDI-TOF Mass Spectrometry
Villanueva, Josep; Nazarian, Arpi; Lawlor, Kevin; Tempst, Paul
2009-01-01
Measuring enzymatic activities in biological fluids is a form of activity-based proteomics and may be utilized as a means of developing disease biomarkers. Activity-based assays allow amplification of output signals, thus potentially visualizing low-abundant enzymes on a virtually transparent whole-proteome background. The protocol presented here describes a semi-quantitative in vitro assay of proteolytic activities in complex proteomes by monitoring breakdown of designer peptide-substrates using robotic extraction and a MALDI-TOF mass spectrometric read-out. Relative quantitation of the peptide metabolites is done by comparison with spiked internal standards, followed by statistical analysis of the resulting mini-peptidome. Partial automation provides reproducibility and throughput essential for comparing large sample sets. The approach may be employed for diagnostic or predictive purposes and enables profiling of 96 samples in 30 hours. It could be tailored to many diagnostic and pharmaco-dynamic purposes, as a read-out of catalytic and metabolic activities in body fluids or tissues. PMID:19617888
National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have released a dataset of proteins and phosphopeptides identified through deep proteomic and phosphoproteomic analysis of breast tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA).
Song, Ehwang; Gao, Yuqian; Wu, Chaochao; ...
2017-07-19
Here, mass spectrometry (MS) based targeted proteomic methods such as selected reaction monitoring (SRM) are becoming the method of choice for preclinical verification of candidate protein biomarkers. The Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute has investigated the standardization and analytical validation of the SRM assays and demonstrated robust analytical performance on different instruments across different laboratories. An Assay Portal has also been established by CPTAC to provide the research community a resource consisting of large set of targeted MS-based assays, and a depository to share assays publicly, providing that assays meet the guidelines proposed bymore » CPTAC. Herein, we report 98 SRM assays covering 70 candidate protein biomarkers previously reported as associated with ovarian cancer that have been thoroughly characterized according to the CPTAC Assay Characterization Guidance Document. The experiments, methods and results for characterizing these SRM assays for their MS response, repeatability, selectivity, stability, and reproducible detection of endogenous analytes are described in detail.« less
National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have just released a comprehensive dataset of the proteomic analysis of high grade serous ovarian tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA). This is one of the largest public datasets covering the proteome, phosphoproteome and glycoproteome with complementary deep genomic sequencing data on the same tumor.
HIV-1 Vpr modulates macrophage metabolic pathways: a SILAC-based quantitative analysis.
Barrero, Carlos A; Datta, Prasun K; Sen, Satarupa; Deshmane, Satish; Amini, Shohreh; Khalili, Kamel; Merali, Salim
2013-01-01
Human immunodeficiency virus type 1 encoded viral protein Vpr is essential for infection of macrophages by HIV-1. Furthermore, these macrophages are resistant to cell death and are viral reservoir. However, the impact of Vpr on the macrophage proteome is yet to be comprehended. The goal of the present study was to use a stable-isotope labeling by amino acids in cell culture (SILAC) coupled with mass spectrometry-based proteomics approach to characterize the Vpr response in macrophages. Cultured human monocytic cells, U937, were differentiated into macrophages and transduced with adenovirus construct harboring the Vpr gene. More than 600 proteins were quantified in SILAC coupled with LC-MS/MS approach, among which 136 were significantly altered upon Vpr overexpression in macrophages. Quantified proteins were selected and clustered by biological functions, pathway and network analysis using Ingenuity computational pathway analysis. The proteomic data illustrating increase in abundance of enzymes in the glycolytic pathway (pentose phosphate and pyruvate metabolism) was further validated by western blot analysis. In addition, the proteomic data demonstrate down regulation of some key mitochondrial enzymes such as glutamate dehydrogenase 2 (GLUD2), adenylate kinase 2 (AK2) and transketolase (TKT). Based on these observations we postulate that HIV-1 hijacks the macrophage glucose metabolism pathway via the Vpr-hypoxia inducible factor 1 alpha (HIF-1 alpha) axis to induce expression of hexokinase (HK), glucose-6-phosphate dehyrogenase (G6PD) and pyruvate kinase muscle type 2 (PKM2) that facilitates viral replication and biogenesis, and long-term survival of macrophages. Furthermore, dysregulation of mitochondrial glutamate metabolism in macrophages can contribute to neurodegeneration via neuroexcitotoxic mechanisms in the context of NeuroAIDS.
Use of proteomic methods in the analysis of human body fluids in Alzheimer research.
Zürbig, Petra; Jahn, Holger
2012-12-01
Proteomics is the study of the entire population of proteins and peptides in an organism or a part of it, such as a cell, tissue, or fluids like cerebrospinal fluid, plasma, serum, urine, or saliva. It is widely assumed that changes in the composition of the proteome may reflect disease states and provide clues to its origin, eventually leading to targets for new treatments. The ability to perform large-scale proteomic studies now is based jointly on recent advances in our analytical methods. Separation techniques like CE and 2DE have developed and matured. Detection methods like MS have also improved greatly in the last 5 years. These developments have also driven the fields of bioinformatics, needed to deal with the increased data production and systems biology. All these developing methods offer specific advantages but also come with certain limitations. This review describes the different proteomic methods used in the field, their limitations, and their possible pitfalls. Based on a literature search in PubMed, we identified 112 studies that applied proteomic techniques to identify biomarkers for Alzheimer disease. This review describes the results of these studies on proteome changes in human body fluids of Alzheimer patients reviewing the most important studies. We extracted a list of 366 proteins and peptides that were identified by these studies as potential targets in Alzheimer research. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Mapping the Small Molecule Interactome by Mass Spectrometry.
Flaxman, Hope A; Woo, Christina M
2018-01-16
Mapping small molecule interactions throughout the proteome provides the critical structural basis for functional analysis of their impact on biochemistry. However, translation of mass spectrometry-based proteomics methods to directly profile the interaction between a small molecule and the whole proteome is challenging because of the substoichiometric nature of many interactions, the diversity of covalent and noncovalent interactions involved, and the subsequent computational complexity associated with their spectral assignment. Recent advances in chemical proteomics have begun fill this gap to provide a structural basis for the breadth of small molecule-protein interactions in the whole proteome. Innovations enabling direct characterization of the small molecule interactome include faster, more sensitive instrumentation coupled to chemical conjugation, enrichment, and labeling methods that facilitate detection and assignment. These methods have started to measure molecular interaction hotspots due to inherent differences in local amino acid reactivity and binding affinity throughout the proteome. Measurement of the small molecule interactome is producing structural insights and methods for probing and engineering protein biochemistry. Direct structural characterization of the small molecule interactome is a rapidly emerging area pushing new frontiers in biochemistry at the interface of small molecules and the proteome.
Agrawal, Ganesh Kumar; Sarkar, Abhijit; Righetti, Pier Giorgio; Pedreschi, Romina; Carpentier, Sebastien; Wang, Tai; Barkla, Bronwyn J; Kohli, Ajay; Ndimba, Bongani Kaiser; Bykova, Natalia V; Rampitsch, Christof; Zolla, Lello; Rafudeen, Mohamed Suhail; Cramer, Rainer; Bindschedler, Laurence Veronique; Tsakirpaloglou, Nikolaos; Ndimba, Roya Janeen; Farrant, Jill M; Renaut, Jenny; Job, Dominique; Kikuchi, Shoshi; Rakwal, Randeep
2013-01-01
Tremendous progress in plant proteomics driven by mass spectrometry (MS) techniques has been made since 2000 when few proteomics reports were published and plant proteomics was in its infancy. These achievements include the refinement of existing techniques and the search for new techniques to address food security, safety, and health issues. It is projected that in 2050, the world's population will reach 9-12 billion people demanding a food production increase of 34-70% (FAO, 2009) from today's food production. Provision of food in a sustainable and environmentally committed manner for such a demand without threatening natural resources, requires that agricultural production increases significantly and that postharvest handling and food manufacturing systems become more efficient requiring lower energy expenditure, a decrease in postharvest losses, less waste generation and food with longer shelf life. There is also a need to look for alternative protein sources to animal based (i.e., plant based) to be able to fulfill the increase in protein demands by 2050. Thus, plant biology has a critical role to play as a science capable of addressing such challenges. In this review, we discuss proteomics especially MS, as a platform, being utilized in plant biology research for the past 10 years having the potential to expedite the process of understanding plant biology for human benefits. The increasing application of proteomics technologies in food security, analysis, and safety is emphasized in this review. But, we are aware that no unique approach/technology is capable to address the global food issues. Proteomics-generated information/resources must be integrated and correlated with other omics-based approaches, information, and conventional programs to ensure sufficient food and resources for human development now and in the future. © 2013 Wiley Periodicals, Inc.
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23467006
Ultrahigh pressure fast size exclusion chromatography for top-down proteomics.
Chen, Xin; Ge, Ying
2013-09-01
Top-down MS-based proteomics has gained a solid growth over the past few years but still faces significant challenges in the LC separation of intact proteins. In top-down proteomics, it is essential to separate the high mass proteins from the low mass species due to the exponential decay in S/N as a function of increasing molecular mass. SEC is a favored LC method for size-based separation of proteins but suffers from notoriously low resolution and detrimental dilution. Herein, we reported the use of ultrahigh pressure (UHP) SEC for rapid and high-resolution separation of intact proteins for top-down proteomics. Fast separation of intact proteins (6-669 kDa) was achieved in < 7 min with high resolution and high efficiency. More importantly, we have shown that this UHP-SEC provides high-resolution separation of intact proteins using a MS-friendly volatile solvent system, allowing the direct top-down MS analysis of SEC-eluted proteins without an additional desalting step. Taken together, we have demonstrated that UHP-SEC is an attractive LC strategy for the size separation of proteins with great potential for top-down proteomics. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kakourou, Alexia; Vach, Werner; Nicolardi, Simone; van der Burgt, Yuri; Mertens, Bart
2016-10-01
Mass spectrometry based clinical proteomics has emerged as a powerful tool for high-throughput protein profiling and biomarker discovery. Recent improvements in mass spectrometry technology have boosted the potential of proteomic studies in biomedical research. However, the complexity of the proteomic expression introduces new statistical challenges in summarizing and analyzing the acquired data. Statistical methods for optimally processing proteomic data are currently a growing field of research. In this paper we present simple, yet appropriate methods to preprocess, summarize and analyze high-throughput MALDI-FTICR mass spectrometry data, collected in a case-control fashion, while dealing with the statistical challenges that accompany such data. The known statistical properties of the isotopic distribution of the peptide molecules are used to preprocess the spectra and translate the proteomic expression into a condensed data set. Information on either the intensity level or the shape of the identified isotopic clusters is used to derive summary measures on which diagnostic rules for disease status allocation will be based. Results indicate that both the shape of the identified isotopic clusters and the overall intensity level carry information on the class outcome and can be used to predict the presence or absence of the disease.
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Webb-Robertson, Bobbie-Jo M.; Wiberg, Holli K.; Matzke, Melissa M.; ...
2015-04-09
In this review, we apply selected imputation strategies to label-free liquid chromatography–mass spectrometry (LC–MS) proteomics datasets to evaluate the accuracy with respect to metrics of variance and classification. We evaluate several commonly used imputation approaches for individual merits and discuss the caveats of each approach with respect to the example LC–MS proteomics data. In general, local similarity-based approaches, such as the regularized expectation maximization and least-squares adaptive algorithms, yield the best overall performances with respect to metrics of accuracy and robustness. However, no single algorithm consistently outperforms the remaining approaches, and in some cases, performing classification without imputation sometimes yieldedmore » the most accurate classification. Thus, because of the complex mechanisms of missing data in proteomics, which also vary from peptide to protein, no individual method is a single solution for imputation. In summary, on the basis of the observations in this review, the goal for imputation in the field of computational proteomics should be to develop new approaches that work generically for this data type and new strategies to guide users in the selection of the best imputation for their dataset and analysis objectives.« less
Next-Generation Proteomics and Its Application to Clinical Breast Cancer Research.
Mardamshina, Mariya; Geiger, Tamar
2017-10-01
Proteomics technology aims to map the protein landscapes of biological samples, and it can be applied to a variety of samples, including cells, tissues, and body fluids. Because the proteins are the main functional molecules in the cells, their levels reflect much more accurately the cellular phenotype and the regulatory processes within them than gene levels, mutations, and even mRNA levels. With the advancement in the technology, it is possible now to obtain comprehensive views of the biological systems and to study large patient cohorts in a streamlined manner. In this review we discuss the technological advancements in mass spectrometry-based proteomics, which allow analysis of breast cancer tissue samples, leading to the first large-scale breast cancer proteomics studies. Furthermore, we discuss the technological developments in blood-based biomarker discovery, which provide the basis for future development of assays for routine clinical use. Although these are only the first steps in implementation of proteomics into the clinic, extensive collaborative work between these worlds will undoubtedly lead to major discoveries and advances in clinical practice. Copyright © 2017 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.
Schönke, Milena; Björnholm, Marie; Chibalin, Alexander V; Zierath, Juleen R; Deshmukh, Atul S
2018-03-01
Skeletal muscle insulin resistance, an early metabolic defect in the pathogenesis of type 2 diabetes (T2D), may be a cause or consequence of altered protein expression profiles. Proteomics technology offers enormous promise to investigate molecular mechanisms underlying pathologies, however, the analysis of skeletal muscle is challenging. Using state-of-the-art multienzyme digestion and filter-aided sample preparation (MED-FASP) and a mass spectrometry (MS)-based workflow, we performed a global proteomics analysis of skeletal muscle from leptin-deficient, obese, insulin resistant (ob/ob) and lean mice in mere two fractions in a short time (8 h per sample). We identified more than 6000 proteins with 118 proteins differentially regulated in obesity. This included protein kinases, phosphatases, and secreted and fiber type associated proteins. Enzymes involved in lipid metabolism in skeletal muscle from ob/ob mice were increased, providing evidence against reduced fatty acid oxidation in lipid-induced insulin resistance. Mitochondrial and peroxisomal proteins, as well as components of pyruvate and lactate metabolism, were increased. Finally, the skeletal muscle proteome from ob/ob mice displayed a shift toward the "slow fiber type." This detailed characterization of an obese rodent model of T2D demonstrates an efficient workflow for skeletal muscle proteomics, which may easily be adapted to other complex tissues. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Cassidy, Liam; Prasse, Daniela; Linke, Dennis; Schmitz, Ruth A; Tholey, Andreas
2016-10-07
The recent discovery of an increasing number of small open reading frames (sORF) creates the need for suitable analytical technologies for the comprehensive identification of the corresponding gene products. For biological and functional studies the knowledge of the entire set of proteins and sORF gene products is essential. Consequently in the present study we evaluated analytical approaches that will allow for simultaneous analysis of widest parts of the proteome together with the predicted sORF. We performed a full proteome analysis of the methane producing archaeon Methanosarcina mazei strain Gö1 cytosolic proteome using a high/low pH reversed phase LC-MS bottom-up approach. The second analytical approach was based on semi-top-down strategy, encompassing a separation at intact protein level using a GelFree system, followed by digestion and LC-MS analysis. A high overlap in identified proteins was found for both approaches yielding the most comprehensive coverage of the cytosolic proteome of this organism achieved so far. The application of the second approach in combination with an adjustment of the search criteria for database searches further led to a significant increase of sORF peptide identifications, finally allowing to detect and identify 28 sORF gene products.
Global Analysis Reveals the Complexity of the Human Glomerular Extracellular Matrix
Byron, Adam; Humphries, Jonathan D.; Randles, Michael J.; Carisey, Alex; Murphy, Stephanie; Knight, David; Brenchley, Paul E.; Zent, Roy; Humphries, Martin J.
2014-01-01
The glomerulus contains unique cellular and extracellular matrix (ECM) components, which are required for intact barrier function. Studies of the cellular components have helped to build understanding of glomerular disease; however, the full composition and regulation of glomerular ECM remains poorly understood. We used mass spectrometry-based proteomics of enriched ECM extracts for a global analysis of human glomerular ECM in vivo and identified a tissue-specific proteome of 144 structural and regulatory ECM proteins. This catalog includes all previously identified glomerular components plus many new and abundant components. Relative protein quantification showed a dominance of collagen IV, collagen I, and laminin isoforms in the glomerular ECM together with abundant collagen VI and TINAGL1. Protein network analysis enabled the creation of a glomerular ECM interactome, which revealed a core of highly connected structural components. More than one half of the glomerular ECM proteome was validated using colocalization studies and data from the Human Protein Atlas. This study yields the greatest number of ECM proteins relative to previous investigations of whole glomerular extracts, highlighting the importance of sample enrichment. It also shows that the composition of glomerular ECM is far more complex than previously appreciated and suggests that many more ECM components may contribute to glomerular development and disease processes. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the dataset identifier PXD000456. PMID:24436468
Lee, Jinoo; Valkova, Nelly; White, Mark P; Kültz, Dietmar
2006-09-01
We used dogfish shark (Squalus acanthias) as a model for proteome analysis of six different tissues to evaluate tissue-specific protein expression on a global scale and to deduce specific functions and the relatedness of multiple tissues from their proteomes. Proteomes of heart, brain, kidney, intestine, gill, and rectal gland were separated by two-dimensional gel electrophoresis (2DGE), gel images were matched using Delta 2D software and then evaluated for tissue-specific proteins. Sixty-one proteins (4%) were found to be in only a single type of tissue and 535 proteins (36%) were equally abundant in all six tissues. Relatedness between tissues was assessed based on tissue-specific expression patterns of all 1465 consistently resolved protein spots. This analysis revealed that tissues with osmoregulatory function (kidney, intestine, gill, rectal gland) were more similar in their overall proteomes than non-osmoregulatory tissues (heart, brain). Sixty-one proteins were identified by MALDI-TOF/TOF mass spectrometry and biological functions characteristic of osmoregulatory tissues were derived from gene ontology and molecular pathway analysis. Our data demonstrate that the molecular machinery for energy and urea metabolism and the Rho-GTPase/cytoskeleton pathway are enriched in osmoregulatory tissues of sharks. Our work provides a strong rationale for further study of the contribution of these mechanisms to the osmoregulation of marine sharks.
A Bioinformatics Workflow for Variant Peptide Detection in Shotgun Proteomics*
Li, Jing; Su, Zengliu; Ma, Ze-Qiang; Slebos, Robbert J. C.; Halvey, Patrick; Tabb, David L.; Liebler, Daniel C.; Pao, William; Zhang, Bing
2011-01-01
Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from been identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow on data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics. PMID:21389108
Zhu, Ying; Zhao, Rui; Piehowski, Paul D.; ...
2017-09-01
One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here in this study, we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-μm-i.d. columns increase signal intensity by >3-fold relative to those using 75-μm-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos MS significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap),more » leading to a ~3-fold increase in peptide identifications and 1.7-fold increase in identified protein groups for 2 ng tryptic digests of the bacterium S. oneidensis. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by ~95% for 0.5 ng samples and by ~42% for 2 ng samples. Using the best combination of the above variables, we were able to identify >3,000 proteins from 10 ng tryptic digests from both HeLa and THP-1 mammalian cell lines. We also identified >950 proteins from subnanogram archaeal/bacterial cocultures. Finally, the present ultrasensitive LC-MS platform achieves a level of proteome coverage not previously realized for ultra-small sample loadings, and is expected to facilitate the analysis of subnanogram samples, including single mammalian cells.« less
Halligan, Brian D.; Geiger, Joey F.; Vallejos, Andrew K.; Greene, Andrew S.; Twigger, Simon N.
2009-01-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step by step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center website (http://proteomics.mcw.edu/vipdac). PMID:19358578
Halligan, Brian D; Geiger, Joey F; Vallejos, Andrew K; Greene, Andrew S; Twigger, Simon N
2009-06-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step-by-step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center Web site ( http://proteomics.mcw.edu/vipdac ).
Yu, Kebing; Salomon, Arthur R.
2010-01-01
Recently, dramatic progress has been achieved in expanding the sensitivity, resolution, mass accuracy, and scan rate of mass spectrometers able to fragment and identify peptides through tandem mass spectrometry (MS/MS). Unfortunately, this enhanced ability to acquire proteomic data has not been accompanied by a concomitant increase in the availability of flexible tools allowing users to rapidly assimilate, explore, and analyze this data and adapt to a variety of experimental workflows with minimal user intervention. Here we fill this critical gap by providing a flexible relational database called PeptideDepot for organization of expansive proteomic data sets, collation of proteomic data with available protein information resources, and visual comparison of multiple quantitative proteomic experiments. Our software design, built upon the synergistic combination of a MySQL database for safe warehousing of proteomic data with a FileMaker-driven graphical user interface for flexible adaptation to diverse workflows, enables proteomic end-users to directly tailor the presentation of proteomic data to the unique analysis requirements of the individual proteomics lab. PeptideDepot may be deployed as an independent software tool or integrated directly with our High Throughput Autonomous Proteomic Pipeline (HTAPP) used in the automated acquisition and post-acquisition analysis of proteomic data. PMID:19834895
A tutorial for software development in quantitative proteomics using PSI standard formats☆
Gonzalez-Galarza, Faviel F.; Qi, Da; Fan, Jun; Bessant, Conrad; Jones, Andrew R.
2014-01-01
The Human Proteome Organisation — Proteomics Standards Initiative (HUPO-PSI) has been working for ten years on the development of standardised formats that facilitate data sharing and public database deposition. In this article, we review three HUPO-PSI data standards — mzML, mzIdentML and mzQuantML, which can be used to design a complete quantitative analysis pipeline in mass spectrometry (MS)-based proteomics. In this tutorial, we briefly describe the content of each data model, sufficient for bioinformaticians to devise proteomics software. We also provide guidance on the use of recently released application programming interfaces (APIs) developed in Java for each of these standards, which makes it straightforward to read and write files of any size. We have produced a set of example Java classes and a basic graphical user interface to demonstrate how to use the most important parts of the PSI standards, available from http://code.google.com/p/psi-standard-formats-tutorial. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23584085
van Herwijnen, Martijn J C; Zonneveld, Marijke I; Goerdayal, Soenita; Nolte-'t Hoen, Esther N M; Garssen, Johan; Stahl, Bernd; Maarten Altelaar, A F; Redegeld, Frank A; Wauben, Marca H M
2016-11-01
Breast milk contains several macromolecular components with distinctive functions, whereby milk fat globules and casein micelles mainly provide nutrition to the newborn, and whey contains molecules that can stimulate the newborn's developing immune system and gastrointestinal tract. Although extracellular vesicles (EV) have been identified in breast milk, their physiological function and composition has not been addressed in detail. EV are submicron sized vehicles released by cells for intercellular communication via selectively incorporated lipids, nucleic acids, and proteins. Because of the difficulty in separating EV from other milk components, an in-depth analysis of the proteome of human milk-derived EV is lacking. In this study, an extensive LC-MS/MS proteomic analysis was performed of EV that had been purified from breast milk of seven individual donors using a recently established, optimized density-gradient-based EV isolation protocol. A total of 1963 proteins were identified in milk-derived EV, including EV-associated proteins like CD9, Annexin A5, and Flotillin-1, with a remarkable overlap between the different donors. Interestingly, 198 of the identified proteins are not present in the human EV database Vesiclepedia, indicating that milk-derived EV harbor proteins not yet identified in EV of different origin. Similarly, the proteome of milk-derived EV was compared with that of other milk components. For this, data from 38 published milk proteomic studies were combined in order to construct the total milk proteome, which consists of 2698 unique proteins. Remarkably, 633 proteins identified in milk-derived EV have not yet been identified in human milk to date. Interestingly, these novel proteins include proteins involved in regulation of cell growth and controlling inflammatory signaling pathways, suggesting that milk-derived EVs could support the newborn's developing gastrointestinal tract and immune system. Overall, this study provides an expansion of the whole milk proteome and illustrates that milk-derived EV are macromolecular components with a unique functional proteome. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
The National Cancer Institute is soliciting applications for the reissuance of its Clinical Proteomic Tumor Analysis Consortium (CPTAC) program. CPTAC will support broad efforts focused on several cancer types to explore further the complexities of cancer proteomes and their connections to abnormalities in cancer genomes.
Senachak, Jittisak; Cheevadhanarak, Supapon; Hongsthong, Apiradee
2015-07-29
Spirulina (Arthrospira) platensis is the only cyanobacterium that in addition to being studied at the molecular level and subjected to gene manipulation, can also be mass cultivated in outdoor ponds for commercial use as a food supplement. Thus, encountering environmental changes, including temperature stresses, is common during the mass production of Spirulina. The use of cyanobacteria as an experimental platform, especially for photosynthetic gene manipulation in plants and bacteria, is becoming increasingly important. Understanding the mechanisms and protein-protein interaction networks that underlie low- and high-temperature responses is relevant to Spirulina mass production. To accomplish this goal, high-throughput techniques such as OMICs analyses are used. Thus, large datasets must be collected, managed and subjected to information extraction. Therefore, databases including (i) proteomic analysis and protein-protein interaction (PPI) data and (ii) domain/motif visualization tools are required for potential use in temperature response models for plant chloroplasts and photosynthetic bacteria. A web-based repository was developed including an embedded database, SpirPro, and tools for network visualization. Proteome data were analyzed integrated with protein-protein interactions and/or metabolic pathways from KEGG. The repository provides various information, ranging from raw data (2D-gel images) to associated results, such as data from interaction and/or pathway analyses. This integration allows in silico analyses of protein-protein interactions affected at the metabolic level and, particularly, analyses of interactions between and within the affected metabolic pathways under temperature stresses for comparative proteomic analysis. The developed tool, which is coded in HTML with CSS/JavaScript and depicted in Scalable Vector Graphics (SVG), is designed for interactive analysis and exploration of the constructed network. SpirPro is publicly available on the web at http://spirpro.sbi.kmutt.ac.th . SpirPro is an analysis platform containing an integrated proteome and PPI database that provides the most comprehensive data on this cyanobacterium at the systematic level. As an integrated database, SpirPro can be applied in various analyses, such as temperature stress response networking analysis in cyanobacterial models and interacting domain-domain analysis between proteins of interest.
Comparative proteomic analysis of outer membrane protein 43 (omp43)-deficient Bartonella henselae.
Kang, Jun-Gu; Lee, Hee-Woo; Ko, Sungjin; Chae, Joon-Seok
2018-01-31
Outer membrane proteins (OMPs) of Gram-negative bacteria constitute the first line of defense protecting cells against environmental stresses including chemical, biophysical, and biological attacks. Although the 43-kDa OMP (OMP43) is major porin protein among Bartonella henselae -derived OMPs, its function remains unreported. In this study, OMP43-deficient mutant B. henselae (Δomp43) was generated to investigate OMP43 function. Interestingly, Δ omp 43 exhibited weaker proliferative ability than that of wild-type (WT) B. henselae . To study the differences in proteomic expression between WT and Δ omp 43, two-dimensional gel electrophoresis-based proteomic analysis was performed. Based on Clusters of Orthologus Groups functional assignments, 12 proteins were associated with metabolism, 7 proteins associated with information storage and processing, and 3 proteins associated with cellular processing and signaling. By semi-quantitative reverse transcriptase polymerase chain reaction, increases in tld D, efp, ntr X, pdh A, pur B, and ATPA mRNA expression and decreases in Rho and yfe A mRNA expression were confirmed in Δ omp 43. In conclusion, this is the first report showing that a loss of OMP43 expression in B. henselae leads to retarded proliferation. Furthermore, our proteomic data provide useful information for the further investigation of mechanisms related to the growth of B. henselae.
Megger, Dominik A.; Philipp, Jos; Le-Trilling, Vu Thuy Khanh; Sitek, Barbara; Trilling, Mirko
2017-01-01
Interferons (IFNs) are pleotropic cytokines secreted upon encounter of pathogens and tumors. Applying their antipathogenic, antiproliferative, and immune stimulatory capacities, recombinant IFNs are frequently prescribed as drugs to treat different diseases. IFNs act by changing the gene expression profile of cells. Due to characteristics such as rapid gene induction and signaling, IFNs also represent prototypical model systems for various aspects of biomedical research (e.g., signal transduction). In regard to the signaling and activated promoters, IFNs can be subdivided into two groups. Here, alterations of the cellular proteome of human cells treated with IFNα and IFNγ were elucidated in a time-resolved manner by quantitative proteome analysis. The majority of protein regulations were strongly IFN type and time dependent. In addition to the expected upregulation of IFN-responsive proteins, an astonishing number of proteins became profoundly repressed especially by IFNγ. Thus, our comprehensive analysis revealed important insights into the human IFN-regulated proteome and its dynamics of protein induction and repression. Interestingly, the new class of IFN-repressed genes comprises known host factors for highly relevant pathogens such as HIV, dengue virus, and hepatitis C virus. PMID:28959263
Hu, Shuangfang; Yu, Yigang; Wu, Xinwei; Xia, Xingzhou; Xiao, Xinglong; Wu, Hui
2017-10-01
Cronobacter sakazakii is a foodborne pathogen throughout the world and survives extremely desiccation stress. However, the molecular basis involved in desiccation resistance of C. sakazakii is still unknown. In this study, the potential desiccation resistance factors of C. sakazakii ATCC 29544 were determined using iTRAQ-based quantitative proteomic analysis. A total of 2775 proteins were identified by iTRAQ, of which 233 showed a different protein expression between control group and desiccation stress group. Among these 233 proteins identified as desiccation resistance proteins, there were 146 proteins downregulated and 87 proteins upregulated. According to the comprehensive proteome coverage analysis, C. sakazakii increased its resistance to desiccation by reducing the gene involved with unnecessary survival functions such as those used for virulence, adhesion, invasion and flagella assembly, while increasing gene expression of genes used in withstanding osmotic stress such as those genes involved in trehalose and betaine uptake. However, the mechanism involved in amino acid metabolism in an osmotic stress response, including the producing of γ-aminobutyric acid in C. sakazakii is still uncertain. This is the first report to determine the potential desiccation resistant factors of C. sakazakii at the proteomic levels. Copyright © 2017. Published by Elsevier Ltd.
Megger, Dominik A; Philipp, Jos; Le-Trilling, Vu Thuy Khanh; Sitek, Barbara; Trilling, Mirko
2017-01-01
Interferons (IFNs) are pleotropic cytokines secreted upon encounter of pathogens and tumors. Applying their antipathogenic, antiproliferative, and immune stimulatory capacities, recombinant IFNs are frequently prescribed as drugs to treat different diseases. IFNs act by changing the gene expression profile of cells. Due to characteristics such as rapid gene induction and signaling, IFNs also represent prototypical model systems for various aspects of biomedical research (e.g., signal transduction). In regard to the signaling and activated promoters, IFNs can be subdivided into two groups. Here, alterations of the cellular proteome of human cells treated with IFNα and IFNγ were elucidated in a time-resolved manner by quantitative proteome analysis. The majority of protein regulations were strongly IFN type and time dependent. In addition to the expected upregulation of IFN-responsive proteins, an astonishing number of proteins became profoundly repressed especially by IFNγ. Thus, our comprehensive analysis revealed important insights into the human IFN-regulated proteome and its dynamics of protein induction and repression. Interestingly, the new class of IFN-repressed genes comprises known host factors for highly relevant pathogens such as HIV, dengue virus, and hepatitis C virus.
Yang, Mei; Cong, Min; Peng, Xiuming; Wu, Junrui; Wu, Rina; Liu, Biao; Ye, Wenhui; Yue, Xiqing
2016-05-18
Milk fat globule membrane (MFGM) proteins have many functions. To explore the different proteomics of human and bovine MFGM, MFGM proteins were separated from human and bovine colostrum and mature milk, and analyzed by the iTRAQ proteomic approach. A total of 411 proteins were recognized and quantified. Among these, 232 kinds of differentially expressed proteins were identified. These differentially expressed proteins were analyzed based on multivariate analysis, gene ontology (GO) annotation and KEGG pathway. Biological processes involved were response to stimulus, localization, establishment of localization, and the immune system process. Cellular components engaged were the extracellular space, extracellular region parts, cell fractions, and vesicles. Molecular functions touched upon were protein binding, nucleotide binding, and enzyme inhibitor activity. The KEGG pathway analysis showed several pathways, including regulation of the actin cytoskeleton, focal adhesion, neurotrophin signaling pathway, leukocyte transendothelial migration, tight junction, complement and coagulation cascades, vascular endothelial growth factor signaling pathway, and adherens junction. These results enhance our understanding of different proteomes of human and bovine MFGM across different lactation phases, which could provide important information and potential directions for the infant milk powder and functional food industries.
Informed-Proteomics: open-source software package for top-down proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Park, Jungkap; Piehowski, Paul D.; Wilkins, Christopher
Top-down proteomics involves the analysis of intact proteins. This approach is very attractive as it allows for analyzing proteins in their endogenous form without proteolysis, preserving valuable information about post-translation modifications, isoforms, proteolytic processing or their combinations collectively called proteoforms. Moreover, the quality of the top-down LC-MS/MS datasets is rapidly increasing due to advances in the liquid chromatography and mass spectrometry instrumentation and sample processing protocols. However, the top-down mass spectra are substantially more complex compare to the more conventional bottom-up data. To take full advantage of the increasing quality of the top-down LC-MS/MS datasets there is an urgent needmore » to develop algorithms and software tools for confident proteoform identification and quantification. In this study we present a new open source software suite for top-down proteomics analysis consisting of an LC-MS feature finding algorithm, a database search algorithm, and an interactive results viewer. The presented tool along with several other popular tools were evaluated using human-in-mouse xenograft luminal and basal breast tumor samples that are known to have significant differences in protein abundance based on bottom-up analysis.« less
Kankeu, Cynthia; Clarke, Kylie; Van Haver, Delphi; Gevaert, Kris; Impens, Francis; Dittrich, Anna; Roderick, H Llewelyn; Passante, Egle; Huber, Heinrich J
2018-05-17
The rat cardiomyoblast cell line H9C2 has emerged as a valuable tool for studying cardiac development, mechanisms of disease and toxicology. We present here a rigorous proteomic analysis that monitored the changes in protein expression during differentiation of H9C2 cells into cardiomyocyte-like cells over time. Quantitative mass spectrometry followed by gene ontology (GO) enrichment analysis revealed that early changes in H9C2 differentiation are related to protein pathways of cardiac muscle morphogenesis and sphingolipid synthesis. These changes in the proteome were followed later in the differentiation time-course by alterations in the expression of proteins involved in cation transport and beta-oxidation. Studying the temporal profile of the H9C2 proteome during differentiation in further detail revealed eight clusters of co-regulated proteins that can be associated with early, late, continuous and transient up- and downregulation. Subsequent reactome pathway analysis based on these eight clusters further corroborated and detailed the results of the GO analysis. Specifically, this analysis confirmed that proteins related to pathways in muscle contraction are upregulated early and transiently, and proteins relevant to extracellular matrix organization are downregulated early. In contrast, upregulation of proteins related to cardiac metabolism occurs at later time points. Finally, independent validation of the proteomics results by immunoblotting confirmed hereto unknown regulators of cardiac structure and ionic metabolism. Our results are consistent with a 'function follows form' model of differentiation, whereby early and transient alterations of structural proteins enable subsequent changes that are relevant to the characteristic physiology of cardiomyocytes.
Megger, Dominik Andre; Rosowski, Kristin; Ahrens, Maike; Bracht, Thilo; Eisenacher, Martin; Schlaak, Jörg F; Weber, Frank; Hoffmann, Andreas-Claudius; Meyer, Helmut E; Baba, Hideo A; Sitek, Barbara
2017-03-01
Human hepatocellular carcinoma (HCC) is a severe malignant disease, and accurate and reliable diagnostic markers are still needed. This study was aimed for the discovery of novel marker candidates by quantitative proteomics. Proteomic differences between HCC and nontumorous liver tissue were studied by mass spectrometry. Among several significantly upregulated proteins, translocator protein 18 (TSPO) and Ras-related protein Rab-1A (RAB1A) were selected for verification by immunohistochemistry in an independent cohort. For RAB1A, a high accuracy for the discrimination of HCC and nontumorous liver tissue was observed. RAB1A was verified to be a potent biomarker candidate for HCC.
A Combined Metabolomic and Proteomic Analysis of Gestational Diabetes Mellitus
Hajduk, Joanna; Klupczynska, Agnieszka; Dereziński, Paweł; Matysiak, Jan; Kokot, Piotr; Nowak, Dorota M.; Gajęcka, Marzena; Nowak-Markwitz, Ewa; Kokot, Zenon J.
2015-01-01
The aim of this pilot study was to apply a novel combined metabolomic and proteomic approach in analysis of gestational diabetes mellitus. The investigation was performed with plasma samples derived from pregnant women with diagnosed gestational diabetes mellitus (n = 18) and a matched control group (n = 13). The mass spectrometry-based analyses allowed to determine 42 free amino acids and low molecular-weight peptide profiles. Different expressions of several peptides and altered amino acid profiles were observed in the analyzed groups. The combination of proteomic and metabolomic data allowed obtaining the model with a high discriminatory power, where amino acids ethanolamine, l-citrulline, l-asparagine, and peptide ions with m/z 1488.59; 4111.89 and 2913.15 had the highest contribution to the model. The sensitivity (94.44%) and specificity (84.62%), as well as the total group membership classification value (90.32%) calculated from the post hoc classification matrix of a joint model were the highest when compared with a single analysis of either amino acid levels or peptide ion intensities. The obtained results indicated a high potential of integration of proteomic and metabolomics analysis regardless the sample size. This promising approach together with clinical evaluation of the subjects can also be used in the study of other diseases. PMID:26694367
Runau, Franscois; Arshad, Ali; Isherwood, John; Norris, Leonie; Howells, Lynne; Metcalfe, Matthew; Dennison, Ashley
2015-06-01
Pancreatic cancer is a disease with a significantly poor prognosis. Despite modern advances in other medical, surgical, and oncologic therapy, the outcome from pancreatic cancer has improved little over the last 40 years. To improve the management of this difficult disease, trials investigating the use of dietary and parenteral fish oils rich in omega-3 (ω-3) fatty acids, exhibiting proven anti-inflammatory and anticarcinogenic properties, have revealed favorable results in pancreatic cancers. Proteomics is the large-scale study of proteins that attempts to characterize the complete set of proteins encoded by the genome of an organism and that, with the use of sensitive mass spectrometric-based techniques, has allowed high-throughput analysis of the proteome to aid identification of putative biomarkers pertinent to given disease states. These biomarkers provide useful insight into potentially discovering new markers for early detection or elucidating the efficacy of treatment on pancreatic cancers. Here, our review identifies potential proteomic-based biomarkers in pancreatic cancer relating to apoptosis, cell proliferation, angiogenesis, and metabolic regulation in clinical studies. We also reviewed proteomic biomarkers from the administration of ω-3 fatty acids that act on similar anticarcinogenic pathways as above and reflect that proteomic studies on the effect of ω-3 fatty acids in pancreatic cancer will yield favorable results. © 2015 American Society for Parenteral and Enteral Nutrition.
Chemical probes for analysis of carbonylated proteins: a review
Yan, Liang-Jun; Forster, Michael J.
2010-01-01
Protein carbonylation is a major form of protein oxidation and is widely used as an indicator of oxidative stress. Carbonyl groups do not have distinguishing UV or visible, spectrophotometric absorbance/fluorescence characteristics and thus their detection and quantification can only be achieved using specific chemical probes. In this paper, we review the advantages and disadvantages of several chemical probes that have been and are still being used for protein carbonyl analysis. These probes include 2, 4-dinitrophenylhydazine (DNPH), tritiated sodium borohydride ([3H]NaBH4), biotin-containing probes, and fluorescence probes. As our discussions lean toward gel-based approaches, utilizations of these probes in 2D gel-based proteomic analysis of carbonylated proteins are illustrated where applicable. Analysis of carbonylated proteins by ELISA, immunofluorescent imaging, near infrared fluorescence detection, and gel-free proteomic approaches are also discussed where appropriate. Additionally, potential applications of blue native gel electrophoresis as a tool for first dimensional separation in 2D gel-based analysis of carbonylated proteins are discussed as well. PMID:20732835
2013-01-01
Despite its prominence for characterization of complex mixtures, LC–MS/MS frequently fails to identify many proteins. Network-based analysis methods, based on protein–protein interaction networks (PPINs), biological pathways, and protein complexes, are useful for recovering non-detected proteins, thereby enhancing analytical resolution. However, network-based analysis methods do come in varied flavors for which the respective efficacies are largely unknown. We compare the recovery performance and functional insights from three distinct instances of PPIN-based approaches, viz., Proteomics Expansion Pipeline (PEP), Functional Class Scoring (FCS), and Maxlink, in a test scenario of valproic acid (VPA)-treated mice. We find that the most comprehensive functional insights, as well as best non-detected protein recovery performance, are derived from FCS utilizing real biological complexes. This outstrips other network-based methods such as Maxlink or Proteomics Expansion Pipeline (PEP). From FCS, we identified known biological complexes involved in epigenetic modifications, neuronal system development, and cytoskeletal rearrangements. This is congruent with the observed phenotype where adult mice showed an increase in dendritic branching to allow the rewiring of visual cortical circuitry and an improvement in their visual acuity when tested behaviorally. In addition, PEP also identified a novel complex, comprising YWHAB, NR1, NR2B, ACTB, and TJP1, which is functionally related to the observed phenotype. Although our results suggest different network analysis methods can produce different results, on the whole, the findings are mutually supportive. More critically, the non-overlapping information each provides can provide greater holistic understanding of complex phenotypes. PMID:23557376
CPTAC | Office of Cancer Clinical Proteomics Research
The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) is a national effort to accelerate the understanding of the molecular basis of cancer through the application of large-scale proteome and genome analysis, or proteogenomics.
Droit, Arnaud; Hunter, Joanna M; Rouleau, Michèle; Ethier, Chantal; Picard-Cloutier, Aude; Bourgais, David; Poirier, Guy G
2007-01-01
Background In the "post-genome" era, mass spectrometry (MS) has become an important method for the analysis of proteins and the rapid advancement of this technique, in combination with other proteomics methods, results in an increasing amount of proteome data. This data must be archived and analysed using specialized bioinformatics tools. Description We herein describe "PARPs database," a data analysis and management pipeline for liquid chromatography tandem mass spectrometry (LC-MS/MS) proteomics. PARPs database is a web-based tool whose features include experiment annotation, protein database searching, protein sequence management, as well as data-mining of the peptides and proteins identified. Conclusion Using this pipeline, we have successfully identified several interactions of biological significance between PARP-1 and other proteins, namely RFC-1, 2, 3, 4 and 5. PMID:18093328
Efficient visualization of high-throughput targeted proteomics experiments: TAPIR.
Röst, Hannes L; Rosenberger, George; Aebersold, Ruedi; Malmström, Lars
2015-07-15
Targeted mass spectrometry comprises a set of powerful methods to obtain accurate and consistent protein quantification in complex samples. To fully exploit these techniques, a cross-platform and open-source software stack based on standardized data exchange formats is required. We present TAPIR, a fast and efficient Python visualization software for chromatograms and peaks identified in targeted proteomics experiments. The input formats are open, community-driven standardized data formats (mzML for raw data storage and TraML encoding the hierarchical relationships between transitions, peptides and proteins). TAPIR is scalable to proteome-wide targeted proteomics studies (as enabled by SWATH-MS), allowing researchers to visualize high-throughput datasets. The framework integrates well with existing automated analysis pipelines and can be extended beyond targeted proteomics to other types of analyses. TAPIR is available for all computing platforms under the 3-clause BSD license at https://github.com/msproteomicstools/msproteomicstools. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Chen, Jin-Qiu; Wakefield, Lalage M; Goldstein, David J
2015-06-06
There is an emerging demand for the use of molecular profiling to facilitate biomarker identification and development, and to stratify patients for more efficient treatment decisions with reduced adverse effects. In the past decade, great strides have been made to advance genomic, transcriptomic and proteomic approaches to address these demands. While there has been much progress with these large scale approaches, profiling at the protein level still faces challenges due to limitations in clinical sample size, poor reproducibility, unreliable quantitation, and lack of assay robustness. A novel automated capillary nano-immunoassay (CNIA) technology has been developed. This technology offers precise and accurate measurement of proteins and their post-translational modifications using either charge-based or size-based separation formats. The system not only uses ultralow nanogram levels of protein but also allows multi-analyte analysis using a parallel single-analyte format for increased sensitivity and specificity. The high sensitivity and excellent reproducibility of this technology make it particularly powerful for analysis of clinical samples. Furthermore, the system can distinguish and detect specific protein post-translational modifications that conventional Western blot and other immunoassays cannot easily capture. This review will summarize and evaluate the latest progress to optimize the CNIA system for comprehensive, quantitative protein and signaling event characterization. It will also discuss how the technology has been successfully applied in both discovery research and clinical studies, for signaling pathway dissection, proteomic biomarker assessment, targeted treatment evaluation and quantitative proteomic analysis. Lastly, a comparison of this novel system with other conventional immuno-assay platforms is performed.
Proteomic profiling of early degenerative retina of RCS rats
Zhu, Zhi-Hong; Fu, Yan; Weng, Chuan-Huang; Zhao, Cong-Jian; Yin, Zheng-Qin
2017-01-01
AIM To identify the underlying cellular and molecular changes in retinitis pigmentosa (RP). METHODS Label-free quantification-based proteomics analysis, with its advantages of being more economic and consisting of simpler procedures, has been used with increasing frequency in modern biological research. Dystrophic RCS rats, the first laboratory animal model for the study of RP, possess a similar pathological course as human beings with the diseases. Thus, we employed a comparative proteomics analysis approach for in-depth proteome profiling of retinas from dystrophic RCS rats and non-dystrophic congenic controls through Linear Trap Quadrupole - orbitrap MS/MS, to identify the significant differentially expressed proteins (DEPs). Bioinformatics analyses, including Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway annotation and upstream regulatory analysis, were then performed on these retina proteins. Finally, a Western blotting experiment was carried out to verify the difference in the abundance of transcript factor E2F1. RESULTS In this study, we identified a total of 2375 protein groups from the retinal protein samples of RCS rats and non-dystrophic congenic controls. Four hundred thirty-four significantly DEPs were selected by Student's t-test. Based on the results of the bioinformatics analysis, we identified mitochondrial dysfunction and transcription factor E2F1 as the key initiation factors in early retinal degenerative process. CONCLUSION We showed that the mitochondrial dysfunction and the transcription factor E2F1 substantially contribute to the disease etiology of RP. The results provide a new potential therapeutic approach for this retinal degenerative disease. PMID:28730077
Vrana, Julie A.; Theis, Jason D.; Dasari, Surendra; Mereuta, Oana M.; Dispenzieri, Angela; Zeldenrust, Steven R.; Gertz, Morie A.; Kurtin, Paul J.; Grogg, Karen L.; Dogan, Ahmet
2014-01-01
Examination of abdominal subcutaneous fat aspirates is a practical, sensitive and specific method for the diagnosis of systemic amyloidosis. Here we describe the development and implementation of a clinical assay using mass spectrometry-based proteomics to type amyloidosis in subcutaneous fat aspirates. First, we validated the assay comparing amyloid-positive (n=43) and -negative (n=26) subcutaneous fat aspirates. The assay classified amyloidosis with 88% sensitivity and 96% specificity. We then implemented the assay as a clinical test, and analyzed 366 amyloid-positive subcutaneous fat aspirates in a 4-year period as part of routine clinical care. The assay had a sensitivity of 90%, and diverse amyloid types, including immunoglobulin light chain (74%), transthyretin (13%), serum amyloid A (%1), gelsolin (1%), and lysozyme (1%), were identified. Using bioinformatics, we identified a universal amyloid proteome signature, which has high sensitivity and specificity for amyloidosis similar to that of Congo red staining. We curated proteome databases which included variant proteins associated with systemic amyloidosis, and identified clonotypic immunoglobulin variable gene usage in immunoglobulin light chain amyloidosis, and the variant peptides in hereditary transthyretin amyloidosis. In conclusion, mass spectrometry-based proteomic analysis of subcutaneous fat aspirates offers a powerful tool for the diagnosis and typing of systemic amyloidosis. The assay reveals the underlying pathogenesis by identifying variable gene usage in immunoglobulin light chains and the variant peptides in hereditary amyloidosis. PMID:24747948
Jarnuczak, Andrew F; Lee, Dave C H; Lawless, Craig; Holman, Stephen W; Eyers, Claire E; Hubbard, Simon J
2016-09-02
Quantitative mass spectrometry-based proteomics of complex biological samples remains challenging in part due to the variability and charge competition arising during electrospray ionization (ESI) of peptides and the subsequent transfer and detection of ions. These issues preclude direct quantification from signal intensity alone in the absence of a standard. A deeper understanding of the governing principles of peptide ionization and exploitation of the inherent ionization and detection parameters of individual peptides is thus of great value. Here, using the yeast proteome as a model system, we establish the concept of peptide F-factor as a measure of detectability, closely related to ionization efficiency. F-factor is calculated by normalizing peptide precursor ion intensity by absolute abundance of the parent protein. We investigated F-factor characteristics in different shotgun proteomics experiments, including across multiple ESI-based LC-MS platforms. We show that F-factors mirror previously observed physicochemical predictors as peptide detectability but demonstrate a nonlinear relationship between hydrophobicity and peptide detectability. Similarly, we use F-factors to show how peptide ion coelution adversely affects detectability and ionization. We suggest that F-factors have great utility for understanding peptide detectability and gas-phase ion chemistry in complex peptide mixtures, selection of surrogate peptides in targeted MS studies, and for calibration of peptide ion signal in label-free workflows. Data are available via ProteomeXchange with identifier PXD003472.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gritsenko, Marina A.; Xu, Zhe; Liu, Tao
Comprehensive, quantitative information on abundances of proteins and their post-translational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labelling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification andmore » quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples, and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.« less
Gritsenko, Marina A; Xu, Zhe; Liu, Tao; Smith, Richard D
2016-01-01
Comprehensive, quantitative information on abundances of proteins and their posttranslational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labeling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification and quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.
Human body fluid proteome analysis
Hu, Shen; Loo, Joseph A.; Wong, David T.
2010-01-01
The focus of this article is to review the recent advances in proteome analysis of human body fluids, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, and amniotic fluid, as well as its applications to human disease biomarker discovery. We aim to summarize the proteomics technologies currently used for global identification and quantification of body fluid proteins, and elaborate the putative biomarkers discovered for a variety of human diseases through human body fluid proteome (HBFP) analysis. Some critical concerns and perspectives in this emerging field are also discussed. With the advances made in proteomics technologies, the impact of HBFP analysis in the search for clinically relevant disease biomarkers would be realized in the future. PMID:17083142
Human body fluid proteome analysis.
Hu, Shen; Loo, Joseph A; Wong, David T
2006-12-01
The focus of this article is to review the recent advances in proteome analysis of human body fluids, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, and amniotic fluid, as well as its applications to human disease biomarker discovery. We aim to summarize the proteomics technologies currently used for global identification and quantification of body fluid proteins, and elaborate the putative biomarkers discovered for a variety of human diseases through human body fluid proteome (HBFP) analysis. Some critical concerns and perspectives in this emerging field are also discussed. With the advances made in proteomics technologies, the impact of HBFP analysis in the search for clinically relevant disease biomarkers would be realized in the future.
HiQuant: Rapid Postquantification Analysis of Large-Scale MS-Generated Proteomics Data.
Bryan, Kenneth; Jarboui, Mohamed-Ali; Raso, Cinzia; Bernal-Llinares, Manuel; McCann, Brendan; Rauch, Jens; Boldt, Karsten; Lynn, David J
2016-06-03
Recent advances in mass-spectrometry-based proteomics are now facilitating ambitious large-scale investigations of the spatial and temporal dynamics of the proteome; however, the increasing size and complexity of these data sets is overwhelming current downstream computational methods, specifically those that support the postquantification analysis pipeline. Here we present HiQuant, a novel application that enables the design and execution of a postquantification workflow, including common data-processing steps, such as assay normalization and grouping, and experimental replicate quality control and statistical analysis. HiQuant also enables the interpretation of results generated from large-scale data sets by supporting interactive heatmap analysis and also the direct export to Cytoscape and Gephi, two leading network analysis platforms. HiQuant may be run via a user-friendly graphical interface and also supports complete one-touch automation via a command-line mode. We evaluate HiQuant's performance by analyzing a large-scale, complex interactome mapping data set and demonstrate a 200-fold improvement in the execution time over current methods. We also demonstrate HiQuant's general utility by analyzing proteome-wide quantification data generated from both a large-scale public tyrosine kinase siRNA knock-down study and an in-house investigation into the temporal dynamics of the KSR1 and KSR2 interactomes. Download HiQuant, sample data sets, and supporting documentation at http://hiquant.primesdb.eu .
Open reading frames associated with cancer in the dark matter of the human genome.
Delgado, Ana Paula; Brandao, Pamela; Chapado, Maria Julia; Hamid, Sheilin; Narayanan, Ramaswamy
2014-01-01
The uncharacterized proteins (open reading frames, ORFs) in the human genome offer an opportunity to discover novel targets for cancer. A systematic analysis of the dark matter of the human proteome for druggability and biomarker discovery is crucial to mining the genome. Numerous data mining tools are available to mine these ORFs to develop a comprehensive knowledge base for future target discovery and validation. Using the Genetic Association Database, the ORFs of the human dark matter proteome were screened for evidence of association with neoplasms. The Phenome-Genome Integrator tool was used to establish phenotypic association with disease traits including cancer. Batch analysis of the tools for protein expression analysis, gene ontology and motifs and domains was used to characterize the ORFs. Sixty-two ORFs were identified for neoplasm association. The expression Quantitative Trait Loci (eQTL) analysis identified thirteen ORFs related to cancer traits. Protein expression, motifs and domain analysis and genome-wide association studies verified the relevance of these OncoORFs in diverse tumors. The OncoORFs are also associated with a wide variety of human diseases and disorders. Our results link the OncoORFs to diverse diseases and disorders. This suggests a complex landscape of the uncharacterized proteome in human diseases. These results open the dark matter of the proteome to novel cancer target research. Copyright© 2014, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
Song, Yimeng; Zhong, Lijun; Zhou, Juntuo; Lu, Min; Xing, Tianying; Ma, Lulin; Shen, Jing
2017-12-01
Renal cell carcinoma (RCC) is a malignant and metastatic cancer with 95% mortality, and clear cell RCC (ccRCC) is the most observed among the five major subtypes of RCC. Specific biomarkers that can distinguish cancer tissues from adjacent normal tissues should be developed to diagnose this disease in early stages and conduct a reliable prognostic evaluation. Data-independent acquisition (DIA) strategy has been widely employed in proteomic analysis because of various advantages, including enhanced protein coverage and reliable data acquisition. In this study, a DIA workflow is constructed on a quadrupole-Orbitrap LC-MS platform to reveal dysregulated proteins between ccRCC and adjacent normal tissues. More than 4000 proteins are identified, 436 of these proteins are dysregulated in ccRCC tissues. Bioinformatic analysis reveals that multiple pathways and Gene Ontology items are strongly associated with ccRCC. The expression levels of L-lactate dehydrogenase A chain, annexin A4, nicotinamide N-methyltransferase, and perilipin-2 examined through RT-qPCR, Western blot, and immunohistochemistry confirm the validity of the proteomic analysis results. The proposed DIA workflow yields optimum time efficiency and data reliability and provides a good choice for proteomic analysis in biological and clinical studies, and these dysregulated proteins might be potential biomarkers for ccRCC diagnosis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Bensaddek, Dalila; Narayan, Vikram; Nicolas, Armel; Murillo, Alejandro Brenes; Gartner, Anton; Kenyon, Cynthia J; Lamond, Angus I
2016-02-01
Proteomics studies typically analyze proteins at a population level, using extracts prepared from tens of thousands to millions of cells. The resulting measurements correspond to average values across the cell population and can mask considerable variation in protein expression and function between individual cells or organisms. Here, we report the development of micro-proteomics for the analysis of Caenorhabditis elegans, a eukaryote composed of 959 somatic cells and ∼1500 germ cells, measuring the worm proteome at a single organism level to a depth of ∼3000 proteins. This includes detection of proteins across a wide dynamic range of expression levels (>6 orders of magnitude), including many chromatin-associated factors involved in chromosome structure and gene regulation. We apply the micro-proteomics workflow to measure the global proteome response to heat-shock in individual nematodes. This shows variation between individual animals in the magnitude of proteome response following heat-shock, including variable induction of heat-shock proteins. The micro-proteomics pipeline thus facilitates the investigation of stochastic variation in protein expression between individuals within an isogenic population of C. elegans. All data described in this study are available online via the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd), an open access, searchable database resource. © 2015 The Authors. PROTEOMICS Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rudnick, Paul A.; Markey, Sanford P.; Roth, Jeri; Mirokhin, Yuri; Yan, Xinjian; Tchekhovskoi, Dmitrii V.; Edwards, Nathan J.; Thangudu, Ratna R.; Ketchum, Karen A.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Stephen E.
2016-01-01
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has produced large proteomics datasets from the mass spectrometric interrogation of tumor samples previously analyzed by The Cancer Genome Atlas (TCGA) program. The availability of the genomic and proteomic data is enabling proteogenomic study for both reference (i.e., contained in major sequence databases) and non-reference markers of cancer. The CPTAC labs have focused on colon, breast, and ovarian tissues in the first round of analyses; spectra from these datasets were produced from 2D LC-MS/MS analyses and represent deep coverage. To reduce the variability introduced by disparate data analysis platforms (e.g., software packages, versions, parameters, sequence databases, etc.), the CPTAC Common Data Analysis Platform (CDAP) was created. The CDAP produces both peptide-spectrum-match (PSM) reports and gene-level reports. The pipeline processes raw mass spectrometry data according to the following: (1) Peak-picking and quantitative data extraction, (2) database searching, (3) gene-based protein parsimony, and (4) false discovery rate (FDR)-based filtering. The pipeline also produces localization scores for the phosphopeptide enrichment studies using the PhosphoRS program. Quantitative information for each of the datasets is specific to the sample processing, with PSM and protein reports containing the spectrum-level or gene-level (“rolled-up”) precursor peak areas and spectral counts for label-free or reporter ion log-ratios for 4plex iTRAQ™. The reports are available in simple tab-delimited formats and, for the PSM-reports, in mzIdentML. The goal of the CDAP is to provide standard, uniform reports for all of the CPTAC data, enabling comparisons between different samples and cancer types as well as across the major ‘omics fields. PMID:26860878
Rudnick, Paul A; Markey, Sanford P; Roth, Jeri; Mirokhin, Yuri; Yan, Xinjian; Tchekhovskoi, Dmitrii V; Edwards, Nathan J; Thangudu, Ratna R; Ketchum, Karen A; Kinsinger, Christopher R; Mesri, Mehdi; Rodriguez, Henry; Stein, Stephen E
2016-03-04
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has produced large proteomics data sets from the mass spectrometric interrogation of tumor samples previously analyzed by The Cancer Genome Atlas (TCGA) program. The availability of the genomic and proteomic data is enabling proteogenomic study for both reference (i.e., contained in major sequence databases) and nonreference markers of cancer. The CPTAC laboratories have focused on colon, breast, and ovarian tissues in the first round of analyses; spectra from these data sets were produced from 2D liquid chromatography-tandem mass spectrometry analyses and represent deep coverage. To reduce the variability introduced by disparate data analysis platforms (e.g., software packages, versions, parameters, sequence databases, etc.), the CPTAC Common Data Analysis Platform (CDAP) was created. The CDAP produces both peptide-spectrum-match (PSM) reports and gene-level reports. The pipeline processes raw mass spectrometry data according to the following: (1) peak-picking and quantitative data extraction, (2) database searching, (3) gene-based protein parsimony, and (4) false-discovery rate-based filtering. The pipeline also produces localization scores for the phosphopeptide enrichment studies using the PhosphoRS program. Quantitative information for each of the data sets is specific to the sample processing, with PSM and protein reports containing the spectrum-level or gene-level ("rolled-up") precursor peak areas and spectral counts for label-free or reporter ion log-ratios for 4plex iTRAQ. The reports are available in simple tab-delimited formats and, for the PSM-reports, in mzIdentML. The goal of the CDAP is to provide standard, uniform reports for all of the CPTAC data to enable comparisons between different samples and cancer types as well as across the major omics fields.
Cambiaghi, Alice; Díaz, Ramón; Martinez, Julia Bauzá; Odena, Antonia; Brunelli, Laura; Caironi, Pietro; Masson, Serge; Baselli, Giuseppe; Ristagno, Giuseppe; Gattinoni, Luciano; de Oliveira, Eliandre; Pastorelli, Roberta; Ferrario, Manuela
2018-04-27
In this work, we examined plasma metabolome, proteome and clinical features in patients with severe septic shock enrolled in the multicenter ALBIOS study. The objective was to identify changes in the levels of metabolites involved in septic shock progression and to integrate this information with the variation occurring in proteins and clinical data. Mass spectrometry-based targeted metabolomics and untargeted proteomics allowed us to quantify absolute metabolites concentration and relative proteins abundance. We computed the ratio D7/D1 to take into account their variation from day 1 (D1) to day 7 (D7) after shock diagnosis. Patients were divided into two groups according to 28-day mortality. Three different elastic net logistic regression models were built: one on metabolites only, one on metabolites and proteins and one to integrate metabolomics and proteomics data with clinical parameters. Linear discriminant analysis and Partial least squares Discriminant Analysis were also implemented. All the obtained models correctly classified the observations in the testing set. By looking at the variable importance (VIP) and the selected features, the integration of metabolomics with proteomics data showed the importance of circulating lipids and coagulation cascade in septic shock progression, thus capturing a further layer of biological information complementary to metabolomics information.
Find Pairs: The Module for Protein Quantification of the PeakQuant Software Suite
Eisenacher, Martin; Kohl, Michael; Wiese, Sebastian; Hebeler, Romano; Meyer, Helmut E.
2012-01-01
Abstract Accurate quantification of proteins is one of the major tasks in current proteomics research. To address this issue, a wide range of stable isotope labeling techniques have been developed, allowing one to quantitatively study thousands of proteins by means of mass spectrometry. In this article, the FindPairs module of the PeakQuant software suite is detailed. It facilitates the automatic determination of protein abundance ratios based on the automated analysis of stable isotope-coded mass spectrometric data. Furthermore, it implements statistical methods to determine outliers due to biological as well as technical variance of proteome data obtained in replicate experiments. This provides an important means to evaluate the significance in obtained protein expression data. For demonstrating the high applicability of FindPairs, we focused on the quantitative analysis of proteome data acquired in 14N/15N labeling experiments. We further provide a comprehensive overview of the features of the FindPairs software, and compare these with existing quantification packages. The software presented here supports a wide range of proteomics applications, allowing one to quantitatively assess data derived from different stable isotope labeling approaches, such as 14N/15N labeling, SILAC, and iTRAQ. The software is publicly available at http://www.medizinisches-proteom-center.de/software and free for academic use. PMID:22909347
Cilia, M.; Fish, T.; Yang, X.; Mclaughlin, M.; Thannhauser, T. W.
2009-01-01
Protein extraction methods can vary widely in reproducibility and in representation of the total proteome, yet there are limited data comparing protein isolation methods. The methodical comparison of protein isolation methods is the first critical step for proteomic studies. To address this, we compared three methods for isolation, purification, and solubilization of insect proteins. The aphid Schizaphis graminum, an agricultural pest, was the source of insect tissue. Proteins were extracted using TCA in acetone (TCA-acetone), phenol, or multi-detergents in a chaotrope solution. Extracted proteins were solubilized in a multiple chaotrope solution and examined using 1-D and 2-D electrophoresis and compared directly using 2-D Difference Gel Electrophoresis (2-D DIGE). Mass spectrometry was used to identify proteins from each extraction type. We were unable to ascribe the differences in the proteins extracted to particular physical characteristics, cell location, or biological function. The TCA-acetone extraction yielded the greatest amount of protein from aphid tissues. Each extraction method isolated a unique subset of the aphid proteome. The TCA-acetone method was explored further for its quantitative reliability using 2-D DIGE. Principal component analysis showed that little of the variation in the data was a result of technical issues, thus demonstrating that the TCA-acetone extraction is a reliable method for preparing aphid proteins for a quantitative proteomics experiment. These data suggest that although the TCA-acetone method is a suitable method for quantitative aphid proteomics, a combination of extraction approaches is recommended for increasing proteome coverage when using gel-based separation techniques. PMID:19721822
Dammeier, Sascha; Nahnsen, Sven; Veit, Johannes; Wehner, Frank; Ueffing, Marius; Kohlbacher, Oliver
2016-01-04
Standard forensic procedures to examine bullets after an exchange of fire include a mechanical or ballistic reconstruction of the event. While this is routine to identify which projectile hit a subject by DNA analysis of biological material on the surface of the projectile, it is rather difficult to determine which projectile caused the lethal injury--often the crucial point with regard to legal proceedings. With respect to fundamental law it is the duty of the public authority to make every endeavor to solve every homicide case. To improve forensic examinations, we present a forensic proteomic method to investigate biological material from a projectile's surface and determine the tissues traversed by it. To obtain a range of relevant samples, different major bovine organs were penetrated with projectiles experimentally. After tryptic "on-surface" digestion, mass-spectrometry-based proteome analysis, and statistical data analysis, we were able to achieve a cross-validated organ classification accuracy of >99%. Different types of anticipated external variables exhibited no prominent influence on the findings. In addition, shooting experiments were performed to validate the results. Finally, we show that these concepts could be applied to a real case of murder to substantially improve the forensic reconstruction.
Stare, Tjaša; Stare, Katja; Weckwerth, Wolfram; Wienkoop, Stefanie; Gruden, Kristina
2017-07-06
Plant diseases caused by viral infection are affecting all major crops. Being an obligate intracellular organisms, chemical control of these pathogens is so far not applied in the field except to control the insect vectors of the viruses. Understanding of molecular responses of plant immunity is therefore economically important, guiding the enforcement of crop resistance. To disentangle complex regulatory mechanisms of the plant immune responses, understanding system as a whole is a must. However, integrating data from different molecular analysis (transcriptomics, proteomics, metabolomics, smallRNA regulation etc.) is not straightforward. We evaluated the response of potato ( Solanum tuberosum L.) following the infection with potato virus Y (PVY). The response has been analyzed on two molecular levels, with microarray transcriptome analysis and mass spectroscopy-based proteomics. Within this report, we performed detailed analysis of the results on both levels and compared two different approaches for analysis of proteomic data (spectral count versus MaxQuant). To link the data on different molecular levels, each protein was mapped to the corresponding potato transcript according to StNIB paralogue grouping. Only 33% of the proteins mapped to microarray probes in a one-to-one relation and additionally many showed discordance in detected levels of proteins with corresponding transcripts. We discussed functional importance of true biological differences between both levels and showed that the reason for the discordance between transcript and protein abundance lies partly in complexity and structure of biological regulation of proteome and transcriptome and partly in technical issues contributing to it.
Stare, Tjaša; Stare, Katja; Weckwerth, Wolfram; Wienkoop, Stefanie
2017-01-01
Plant diseases caused by viral infection are affecting all major crops. Being an obligate intracellular organisms, chemical control of these pathogens is so far not applied in the field except to control the insect vectors of the viruses. Understanding of molecular responses of plant immunity is therefore economically important, guiding the enforcement of crop resistance. To disentangle complex regulatory mechanisms of the plant immune responses, understanding system as a whole is a must. However, integrating data from different molecular analysis (transcriptomics, proteomics, metabolomics, smallRNA regulation etc.) is not straightforward. We evaluated the response of potato (Solanum tuberosum L.) following the infection with potato virus Y (PVY). The response has been analyzed on two molecular levels, with microarray transcriptome analysis and mass spectroscopy-based proteomics. Within this report, we performed detailed analysis of the results on both levels and compared two different approaches for analysis of proteomic data (spectral count versus MaxQuant). To link the data on different molecular levels, each protein was mapped to the corresponding potato transcript according to StNIB paralogue grouping. Only 33% of the proteins mapped to microarray probes in a one-to-one relation and additionally many showed discordance in detected levels of proteins with corresponding transcripts. We discussed functional importance of true biological differences between both levels and showed that the reason for the discordance between transcript and protein abundance lies partly in complexity and structure of biological regulation of proteome and transcriptome and partly in technical issues contributing to it. PMID:28684682
Coorssen, Jens R; Yergey, Alfred L
2015-12-03
Molecular mechanisms underlying health and disease function at least in part based on the flexibility and fine-tuning afforded by protein isoforms and post-translational modifications. The ability to effectively and consistently resolve these protein species or proteoforms, as well as assess quantitative changes is therefore central to proteomic analyses. Here we discuss the pros and cons of currently available and developing analytical techniques from the perspective of the full spectrum of available tools and their current applications, emphasizing the concept of fitness-for-purpose in experimental design based on consideration of sample size and complexity; this necessarily also addresses analytical reproducibility and its variance. Data quality is considered the primary criterion, and we thus emphasize that the standards of Analytical Chemistry must apply throughout any proteomic analysis.
NASA Astrophysics Data System (ADS)
Wang, Wei; Sun, Yeqing; Zhao, Qian; Han, Lu
2016-07-01
Highly ionizing radiation (HZE) in space is considered as main factor causing biological effects. Radiobiological studies during space flights are unrepeatable due to the variable space radiation environment, ground-base ion radiations are usually performed to simulate of the space biological effect. Spaceflights present a low-dose rate (0.1˜~0.3mGy/day) radiation environment inside aerocrafts while ground-base ion radiations present a much higher dose rate (100˜~500mGy/min). Whether ground-base ion radiation can reflect effects of space radiation is worth of evaluation. In this research, we compared the functional proteomic profiles of rice plants between on-ground simulated HZE particle radiation and spaceflight treatments. Three independent ground-base seed ionizing radiation experiments with different cumulative doses (dose range: 2˜~20000mGy) and different liner energy transfer (LET) values (13.3˜~500keV/μμm) and two independent seed spaceflight experiments onboard Chinese 20th satellite and SZ-6 spacecraft were carried out. Alterations in the proteome were analyzed by two-dimensional difference gel electrophoresis (2-D DIGE) with MALDI-TOF/TOF mass spectrometry identifications. 45 and 59 proteins showed significant (p<0.05) and reproducible quantitative differences in ground-base ion radiation and spaceflight experiments respectively. The functions of ground-base radiation and spaceflight proteins were both involved in a wide range of biological processes. Gene Ontology enrichment analysis further revealed that ground-base radiation responsive proteins were mainly involved in removal of superoxide radicals, defense response to stimulus and photosynthesis, while spaceflight responsive proteins mainly participate in nucleoside metabolic process, protein folding and phosphorylation. The results implied that ground-base radiations cannot truly reflect effects of spaceflight radiations, ground-base radiation was a kind of indirect effect to rice causing oxidation and metabolism stresses, but space radiation was a kind of direct effect leading to macromolecule (DNA and protein) damage and signal pathway disorders. This functional proteomic analysis work might provide a new evaluation method for further on-ground simulated HZE radiation experiments.
Megger, Dominik A; Padden, Juliet; Rosowski, Kristin; Uszkoreit, Julian; Bracht, Thilo; Eisenacher, Martin; Gerges, Christian; Neuhaus, Horst; Schumacher, Brigitte; Schlaak, Jörg F; Sitek, Barbara
2017-02-10
The proteome analysis of bile fluid represents a promising strategy to identify biomarker candidates for various diseases of the hepatobiliary system. However, to obtain substantive results in biomarker discovery studies large patient cohorts necessarily need to be analyzed. Consequently, this would lead to an unmanageable number of samples to be analyzed if sample preparation protocols with extensive fractionation methods are applied. Hence, the performance of simple workflows allowing for "one sample, one shot" experiments have been evaluated in this study. In detail, sixteen different protocols implying modifications at the stages of desalting, delipidation, deglycosylation and tryptic digestion have been examined. Each method has been individually evaluated regarding various performance criteria and comparative analyses have been conducted to uncover possible complementarities. Here, the best performance in terms of proteome coverage has been assessed for a combination of acetone precipitation with in-gel digestion. Finally, a mapping of all obtained protein identifications with putative biomarkers for hepatocellular carcinoma (HCC) and cholangiocellular carcinoma (CCC) revealed several proteins easily detectable in bile fluid. These results can build the basis for future studies with large and well-defined patient cohorts in a more disease-related context. Human bile fluid is a proximal body fluid and supposed to be a potential source of disease markers. However, due to its biochemical composition, the proteome analysis of bile fluid still represents a challenging task and is therefore mostly conducted using extensive fractionation procedures. This in turn leads to a high number of mass spectrometric measurements for one biological sample. Considering the fact that in order to overcome the biological variability a high number of biological samples needs to be analyzed in biomarker discovery studies, this leads to the dilemma of an unmanageable number of necessary MS-based analyses. Hence, easy sample preparation protocols are demanded representing a compromise between proteome coverage and simplicity. In the presented study, such protocols have been evaluated regarding various technical criteria (e.g. identification rates, missed cleavages, chromatographic separation) uncovering the strengths and weaknesses of various methods. Furthermore, a cumulative bile proteome list has been generated that extends the current bile proteome catalog by 248 proteins. Finally, a mapping with putative biomarkers for hepatocellular carcinoma (HCC) and cholangiocellular carcinoma (CCC) derived from tissue-based studies, revealed several of these proteins being easily and reproducibly detectable in human bile. Therefore, the presented technical work represents a solid base for future disease-related studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Integrated Proteomic Approaches for Understanding Toxicity of Environmental Chemicals
To apply quantitative proteomic analysis to the evaluation of toxicity of environmental chemicals, we have developed an integrated proteomic technology platform. This platform has been applied to the analysis of the toxic effects and pathways of many important environmental chemi...
Functional protease profiling for diagnosis of malignant disease.
Findeisen, Peter; Neumaier, Michael
2012-01-01
Clinical proteomic profiling by mass spectrometry (MS) aims at uncovering specific alterations within mass profiles of clinical specimens that are of diagnostic value for the detection and classification of various diseases including cancer. However, despite substantial progress in the field, the clinical proteomic profiling approaches have not matured into routine diagnostic applications so far. Their limitations are mainly related to high-abundance proteins and their complex processing by a multitude of endogenous proteases thus making rigorous standardization difficult. MS is biased towards the detection of low-molecular-weight peptides. Specifically, in serum specimens, the particular fragments of proteolytically degraded proteins are amenable to MS analysis. Proteases are known to be involved in tumour progression and tumour-specific proteases are released into the blood stream presumably as a result of invasive progression and metastasis. Thus, the determination of protease activity in clinical specimens from patients with malignant disease can offer diagnostic and also therapeutic options. The identification of specific substrates for tumour proteases in complex biological samples is challenging, but proteomic screens for proteases/substrate interactions are currently experiencing impressive progress. Such proteomic screens include peptide-based libraries, differential isotope labelling in combination with MS, quantitative degradomic analysis of proteolytically generated neo-N-termini, monitoring the degradation of exogenous reporter peptides with MS, and activity-based protein profiling. In the present article, we summarize and discuss the current status of proteomic techniques to identify tumour-specific protease-substrate interactions for functional protease profiling. Thereby, we focus on the potential diagnostic use of the respective approaches. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Di, Guilan; Li, Hui; Zhang, Chao; Zhao, Yanjing; Zhou, Chuanjiang; Naeem, Sajid; Li, Li; Kong, Xianghui
2017-07-01
Outbreaks of infectious diseases in common carp Cyprinus carpio, a major cultured fish in northern regions of China, constantly result in significant economic losses. Until now, information proteomic on immune defence remains limited. In the present study, a profile of intestinal mucosa immune response in Cyprinus carpio was investigated after 0, 12, 36 and 84 h after challenging tissues with Aeromonas hydrophila at a concentration of 1.4 × 10 8 CFU/mL. Proteomic profiles in different samples were compared using label-free quantitative proteomic approach. Based on MASCOT database search, 1149 proteins were identified in samples after normalisation of proteins. Treated groups 1 (T1) and 2 (T2) were first clustered together and then clustered with control (C group). The distance between C and treated group 3 (T3) represented the maxima according to hierarchical cluster analysis. Therefore, comparative analysis between C and T3 was selected in the following analysis. A total of 115 proteins with differential abundance were detected to show conspicuous expressing variances. A total of 52 up-regulated proteins and 63 down-regulated proteins were detected in T3. Gene ontology analysis showed that identified up-regulated differentially expressed proteins in T3 were mainly localised in the hemoglobin complex, and down-regulated proteins in T3 were mainly localised in the major histocompatibility complex II protein complex. Forty-six proteins of differential abundance (40% of 115) were involved in immune response, with 17 up-regulated and 29 down-regulated proteins detected in T3. This study is the first to report proteome response of carp intestinal mucosa against A. hydrophila infection; information obtained contribute to understanding defence mechanisms of carp intestinal mucosa. Copyright © 2017 Elsevier Ltd. All rights reserved.
Top-down Proteomics: Technology Advancements and Applications to Heart Diseases
Cai, Wenxuan; Tucholski, Trisha M.; Gregorich, Zachery R.; Ge, Ying
2016-01-01
Introduction Diseases of the heart are a leading cause of morbidity and mortality for both men and women worldwide, and impose significant economic burdens on the healthcare systems. Despite substantial effort over the last several decades, the molecular mechanisms underlying diseases of the heart remain poorly understood. Areas covered Altered protein post-translational modifications (PTMs) and protein isoform switching are increasingly recognized as important disease mechanisms. Top-down high-resolution mass spectrometry (MS)-based proteomics has emerged as the most powerful method for the comprehensive analysis of PTMs and protein isoforms. Here, we will review recent technology developments in the field of top-down proteomics, as well as highlight recent studies utilizing top-down proteomics to decipher the cardiac proteome for the understanding of the molecular mechanisms underlying diseases of the heart. Expert commentary Top-down proteomics is a premier method for the global and comprehensive study of protein isoforms and their PTMs, enabling the identification of novel protein isoforms and PTMs, characterization of sequence variations, and quantification of disease-associated alterations. Despite significant challenges, continuous development of top-down proteomics technology will greatly aid the dissection of the molecular mechanisms underlying diseases of the hearts for the identification of novel biomarkers and therapeutic targets. PMID:27448560
Petricoin, Emanuel F; Rajapaske, Vinodh; Herman, Eugene H; Arekani, Ali M; Ross, Sally; Johann, Donald; Knapton, Alan; Zhang, J; Hitt, Ben A; Conrads, Thomas P; Veenstra, Timothy D; Liotta, Lance A; Sistare, Frank D
2004-01-01
Proteomics is more than just generating lists of proteins that increase or decrease in expression as a cause or consequence of pathology. The goal should be to characterize the information flow through the intercellular protein circuitry which communicates with the extracellular microenvironment and then ultimately to the serum/plasma macroenvironment. The nature of this information can be a cause, or a consequence, of disease and toxicity based processes as cascades of reinforcing information percolate through the system and become reflected in changing proteomic information content of the circulation. Serum Proteomic Pattern Diagnostics is a new type of proteomic platform in which patterns of proteomic signatures from high dimensional mass spectrometry data are used as a diagnostic classifier. While this approach has shown tremendous promise in early detection of cancers, detection of drug-induced toxicity may also be possible with this same technology. Analysis of serum from rat models of anthracycline and anthracenedione induced cardiotoxicity indicate the potential clinical utility of diagnostic proteomic patterns where low molecular weight peptides and protein fragments may have higher accuracy than traditional biomarkers of cardiotoxicity such as troponins. These fragments may one day be harvested by circulating nanoparticles designed to absorb, enrich and amplify the diagnostic biomarker repertoire generated even at the critical initial stages of toxicity.
Proteomics of Skeletal Muscle: Focus on Insulin Resistance and Exercise Biology
Deshmukh, Atul S.
2016-01-01
Skeletal muscle is the largest tissue in the human body and plays an important role in locomotion and whole body metabolism. It accounts for ~80% of insulin stimulated glucose disposal. Skeletal muscle insulin resistance, a primary feature of Type 2 diabetes, is caused by a decreased ability of muscle to respond to circulating insulin. Physical exercise improves insulin sensitivity and whole body metabolism and remains one of the most promising interventions for the prevention of Type 2 diabetes. Insulin resistance and exercise adaptations in skeletal muscle might be a cause, or consequence, of altered protein expressions profiles and/or their posttranslational modifications (PTMs). Mass spectrometry (MS)-based proteomics offer enormous promise for investigating the molecular mechanisms underlying skeletal muscle insulin resistance and exercise-induced adaptation; however, skeletal muscle proteomics are challenging. This review describes the technical limitations of skeletal muscle proteomics as well as emerging developments in proteomics workflow with respect to samples preparation, liquid chromatography (LC), MS and computational analysis. These technologies have not yet been fully exploited in the field of skeletal muscle proteomics. Future studies that involve state-of-the-art proteomics technology will broaden our understanding of exercise-induced adaptations as well as molecular pathogenesis of insulin resistance. This could lead to the identification of new therapeutic targets. PMID:28248217
Madio, Bruno; Undheim, Eivind A B; King, Glenn F
2017-08-23
More than a century of research on sea anemone venoms has shown that they contain a diversity of biologically active proteins and peptides. However, recent omics studies have revealed that much of the venom proteome remains unexplored. We used, for the first time, a combination of proteomic and transcriptomic techniques to obtain a holistic overview of the venom arsenal of the well-studied sea anemone Stichodactyla haddoni. A purely search-based approach to identify putative toxins in a transcriptome from tentacles regenerating after venom extraction identified 508 unique toxin-like transcripts grouped into 63 families. However, proteomic analysis of venom revealed that 52 of these toxin families are likely false positives. In contrast, the combination of transcriptomic and proteomic data enabled positive identification of 23 families of putative toxins, 12 of which have no homology known proteins or peptides. Our data highlight the importance of using proteomics of milked venom to correctly identify venom proteins/peptides, both known and novel, while minimizing false positive identifications from non-toxin homologues identified in transcriptomes of venom-producing tissues. This work lays the foundation for uncovering the role of individual toxins in sea anemone venom and how they contribute to the envenomation of prey, predators, and competitors. Proteomic analysis of milked venom combined with analysis of a tentacle transcriptome revealed the full extent of the venom arsenal of the sea anemone Stichodactyla haddoni. This combined approach led to the discovery of 12 entirely new families of disulfide-rich peptides and proteins in a genus of anemones that have been studied for over a century. Copyright © 2017 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Ying; Zhao, Rui; Piehowski, Paul D.
One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-µm-i.d. columns increase signal intensity by >3-fold relative to those using 75-µm-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos mass spectrometer significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap), leading tomore » a ~3× increase in peptide identifications and 1.7× increase in identified protein groups for 2 ng tryptic digests of bacterial lysate. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by ~ 95% for 0.5 ng samples and by ~42% for 2 ng samples. The present platform is capable of identifying >3000 protein groups from tryptic digestion of cell lysates equivalent to 50 HeLa cells and 100 THP-1 cells (~10 ng total proteins), respectively, and >950 proteins from subnanogram bacterial and archaeal cell lysates. The present ultrasensitive LC-MS platform is expected to enable deep proteome coverage for subnanogram samples, including single mammalian cells.« less
Identification of new intrinsic proteins in Arabidopsis plasma membrane proteome.
Marmagne, Anne; Rouet, Marie-Aude; Ferro, Myriam; Rolland, Norbert; Alcon, Carine; Joyard, Jacques; Garin, Jérome; Barbier-Brygoo, Hélène; Ephritikhine, Geneviève
2004-07-01
Identification and characterization of anion channel genes in plants represent a goal for a better understanding of their central role in cell signaling, osmoregulation, nutrition, and metabolism. Though channel activities have been well characterized in plasma membrane by electrophysiology, the corresponding molecular entities are little documented. Indeed, the hydrophobic protein equipment of plant plasma membrane still remains largely unknown, though several proteomic approaches have been reported. To identify new putative transport systems, we developed a new proteomic strategy based on mass spectrometry analyses of a plasma membrane fraction enriched in hydrophobic proteins. We produced from Arabidopsis cell suspensions a highly purified plasma membrane fraction and characterized it in detail by immunological and enzymatic tests. Using complementary methods for the extraction of hydrophobic proteins and mass spectrometry analyses on mono-dimensional gels, about 100 proteins have been identified, 95% of which had never been found in previous proteomic studies. The inventory of the plasma membrane proteome generated by this approach contains numerous plasma membrane integral proteins, one-third displaying at least four transmembrane segments. The plasma membrane localization was confirmed for several proteins, therefore validating such proteomic strategy. An in silico analysis shows a correlation between the putative functions of the identified proteins and the expected roles for plasma membrane in transport, signaling, cellular traffic, and metabolism. This analysis also reveals 10 proteins that display structural properties compatible with transport functions and will constitute interesting targets for further functional studies.
Sang, Ting; Shan, Xi; Li, Bin; Shu, Sheng; Sun, Jin; Guo, Shirong
2016-08-01
Our results based on proteomics data and physiological alterations proposed the putative mechanism of exogenous Spd enhanced salinity tolerance in cucumber seedlings. Current studies showed that exogenous spermidine (Spd) could alleviate harmful effects of salinity. It is important to increase our understanding of the beneficial physiological responses of exogenous Spd treatment, and to determine the molecular responses underlying these responses. Here, we combined a physiological analysis with iTRAQ-based comparative proteomics of cucumber (Cucumis sativus L.) leaves, treated with 0.1 mM exogenous Spd, 75 mM NaCl and/or exogenous Spd. A total of 221 differentially expressed proteins were found and involved in 30 metabolic pathways, such as photosynthesis, carbohydrate metabolism, amino acid metabolism, stress response, signal transduction and antioxidant. Based on functional classification of the differentially expressed proteins and the physiological responses, we found cucumber seedlings treated with Spd under salt stress had higher photosynthesis efficiency, upregulated tetrapyrrole synthesis, stronger ROS scavenging ability and more protein biosynthesis activity than NaCl treatment, suggesting that these pathways may promote salt tolerance under high salinity. This study provided insights into how exogenous Spd protects photosynthesis and enhances salt tolerance in cucumber seedlings.
Mahadevan, Chidambareswaren; Krishnan, Anu; Saraswathy, Gayathri G.; Surendran, Arun; Jaleel, Abdul; Sakuntala, Manjula
2016-01-01
Black pepper (Piper nigrum L.), a tropical spice crop of global acclaim, is susceptible to Phytophthora capsici, an oomycete pathogen which causes the highly destructive foot rot disease. A systematic understanding of this phytopathosystem has not been possible owing to lack of genome or proteome information. In this study, we explain an integrated transcriptome-assisted label-free quantitative proteomics pipeline to study the basal immune components of black pepper when challenged with P. capsici. We report a global identification of 532 novel leaf proteins from black pepper, of which 518 proteins were functionally annotated using BLAST2GO tool. A label-free quantitation of the protein datasets revealed 194 proteins common to diseased and control protein datasets of which 22 proteins showed significant up-regulation and 134 showed significant down-regulation. Ninety-three proteins were identified exclusively on P. capsici infected leaf tissues and 245 were expressed only in mock (control) infected samples. In-depth analysis of our data gives novel insights into the regulatory pathways of black pepper which are compromised during the infection. Differential down-regulation was observed in a number of critical pathways like carbon fixation in photosynthetic organism, cyano-amino acid metabolism, fructose, and mannose metabolism, glutathione metabolism, and phenylpropanoid biosynthesis. The proteomics results were validated with real-time qRT-PCR analysis. We were also able to identify the complete coding sequences for all the proteins of which few selected genes were cloned and sequence characterized for further confirmation. Our study is the first report of a quantitative proteomics dataset in black pepper which provides convincing evidence on the effectiveness of a transcriptome-based label-free proteomics approach for elucidating the host response to biotic stress in a non-model spice crop like P. nigrum, for which genome information is unavailable. Our dataset will serve as a useful resource for future studies in this plant. Data are available via ProteomeXchange with identifier PXD003887. PMID:27379110
Mahadevan, Chidambareswaren; Krishnan, Anu; Saraswathy, Gayathri G; Surendran, Arun; Jaleel, Abdul; Sakuntala, Manjula
2016-01-01
Black pepper (Piper nigrum L.), a tropical spice crop of global acclaim, is susceptible to Phytophthora capsici, an oomycete pathogen which causes the highly destructive foot rot disease. A systematic understanding of this phytopathosystem has not been possible owing to lack of genome or proteome information. In this study, we explain an integrated transcriptome-assisted label-free quantitative proteomics pipeline to study the basal immune components of black pepper when challenged with P. capsici. We report a global identification of 532 novel leaf proteins from black pepper, of which 518 proteins were functionally annotated using BLAST2GO tool. A label-free quantitation of the protein datasets revealed 194 proteins common to diseased and control protein datasets of which 22 proteins showed significant up-regulation and 134 showed significant down-regulation. Ninety-three proteins were identified exclusively on P. capsici infected leaf tissues and 245 were expressed only in mock (control) infected samples. In-depth analysis of our data gives novel insights into the regulatory pathways of black pepper which are compromised during the infection. Differential down-regulation was observed in a number of critical pathways like carbon fixation in photosynthetic organism, cyano-amino acid metabolism, fructose, and mannose metabolism, glutathione metabolism, and phenylpropanoid biosynthesis. The proteomics results were validated with real-time qRT-PCR analysis. We were also able to identify the complete coding sequences for all the proteins of which few selected genes were cloned and sequence characterized for further confirmation. Our study is the first report of a quantitative proteomics dataset in black pepper which provides convincing evidence on the effectiveness of a transcriptome-based label-free proteomics approach for elucidating the host response to biotic stress in a non-model spice crop like P. nigrum, for which genome information is unavailable. Our dataset will serve as a useful resource for future studies in this plant. Data are available via ProteomeXchange with identifier PXD003887.
de Jesus, Jemmyson Romário; Galazzi, Rodrigo Moretto; de Lima, Tatiani Brenelli; Banzato, Cláudio Eduardo Muller; de Almeida Lima E Silva, Luiz Fernando; de Rosalmeida Dantas, Clarissa; Gozzo, Fábio Cézar; Arruda, Marco Aurélio Zezzi
2017-12-01
An exploratory analysis using proteomic strategies in blood serum of patients with bipolar disorder (BD), and with other psychiatric conditions such as Schizophrenia (SCZ), can provide a better understanding of this disorder, as well as their discrimination based on their proteomic profile. The proteomic profile of blood serum samples obtained from patients with BD using lithium or other drugs (N=14), healthy controls, including non-family (HCNF; N=3) and family (HCF; N=9), patients with schizophrenia (SCZ; N=23), and patients using lithium for other psychiatric conditions (OD; N=4) were compared. Four methods for simplifying the serum samples proteome were evaluated for both removing the most abundant proteins and for enriching those of lower-abundance: protein depletion with acetonitrile (ACN), dithiothreitol (DTT), sequential depletion using DTT and ACN, and protein equalization using commercial ProteoMiner® kit (PM). For proteomic evaluation, 2-D DIGE and nanoLC-MS/MS analysis were employed. PM method was the best strategy for removing proteins of high abundance. Through 2-D DIGE gel image comparison, 37 protein spots were found differentially abundant (p<0.05, Student's t-test), which exhibited ≥2.0-fold change of the average value of normalized spot intensities in the serum of SCZ, BD and OD patients compared to subject controls (HCF and HCNF). From these spots detected, 13 different proteins were identified: ApoA1, ApoE, ApoC3, ApoA4, Samp, SerpinA1, TTR, IgK, Alb, VTN, TR, C4A and C4B. Proteomic analysis allowed the discrimination of patients with BD from patients with other mental disorders, such as SCZ. The findings in this exploratory study may also contribute for better understanding the pathophysiology of these disorders and finding potential serum biomarkers for these conditions. Copyright © 2017 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.
The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) announces the release of the cancer proteome confirmatory colon study data. The goal of the study is to analyze the proteomes of approximately 100 confirmatory colon tumor patients, which includes tumor and adjacent normal samples, with liquid chromatography-tandem mass spectrometry (LC-MS/MS) global proteomic and phosphoproteomic profiling.
Data Independent Acquisition analysis in ProHits 4.0.
Liu, Guomin; Knight, James D R; Zhang, Jian Ping; Tsou, Chih-Chiang; Wang, Jian; Lambert, Jean-Philippe; Larsen, Brett; Tyers, Mike; Raught, Brian; Bandeira, Nuno; Nesvizhskii, Alexey I; Choi, Hyungwon; Gingras, Anne-Claude
2016-10-21
Affinity purification coupled with mass spectrometry (AP-MS) is a powerful technique for the identification and quantification of physical interactions. AP-MS requires careful experimental design, appropriate control selection and quantitative workflows to successfully identify bona fide interactors amongst a large background of contaminants. We previously introduced ProHits, a Laboratory Information Management System for interaction proteomics, which tracks all samples in a mass spectrometry facility, initiates database searches and provides visualization tools for spectral counting-based AP-MS approaches. More recently, we implemented Significance Analysis of INTeractome (SAINT) within ProHits to provide scoring of interactions based on spectral counts. Here, we provide an update to ProHits to support Data Independent Acquisition (DIA) with identification software (DIA-Umpire and MSPLIT-DIA), quantification tools (through DIA-Umpire, or externally via targeted extraction), and assessment of quantitative enrichment (through mapDIA) and scoring of interactions (through SAINT-intensity). With additional improvements, notably support of the iProphet pipeline, facilitated deposition into ProteomeXchange repositories and enhanced export and viewing functions, ProHits 4.0 offers a comprehensive suite of tools to facilitate affinity proteomics studies. It remains challenging to score, annotate and analyze proteomics data in a transparent manner. ProHits was previously introduced as a LIMS to enable storing, tracking and analysis of standard AP-MS data. In this revised version, we expand ProHits to include integration with a number of identification and quantification tools based on Data-Independent Acquisition (DIA). ProHits 4.0 also facilitates data deposition into public repositories, and the transfer of data to new visualization tools. Copyright © 2016 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Lespinats, Sylvain; Pinker-Domenig, Katja; Wengert, Georg; Houben, Ivo; Lobbes, Marc; Stadlbauer, Andreas; Meyer-Bäse, Anke
2016-05-01
Glioma-derived cancer stem cells (GSCs) are tumor-initiating cells and may be refractory to radiation and chemotherapy and thus have important implications for tumor biology and therapeutics. The analysis and interpretation of large proteomic data sets requires the development of new data mining and visualization approaches. Traditional techniques are insufficient to interpret and visualize these resulting experimental data. The emphasis of this paper lies in the application of novel approaches for the visualization, clustering and projection representation to unveil hidden data structures relevant for the accurate interpretation of biological experiments. These qualitative and quantitative methods are applied to the proteomic analysis of data sets derived from the GSCs. The achieved clustering and visualization results provide a more detailed insight into the protein-level fold changes and putative upstream regulators for the GSCs. However the extracted molecular information is insufficient in classifying GSCs and paving the pathway to an improved therapeutics of the heterogeneous glioma.
Shen, Xiaomeng; Hu, Qiang; Li, Jun; Wang, Jianmin; Qu, Jun
2015-10-02
Comprehensive and accurate evaluation of data quality and false-positive biomarker discovery is critical to direct the method development/optimization for quantitative proteomics, which nonetheless remains challenging largely due to the high complexity and unique features of proteomic data. Here we describe an experimental null (EN) method to address this need. Because the method experimentally measures the null distribution (either technical or biological replicates) using the same proteomic samples, the same procedures and the same batch as the case-vs-contol experiment, it correctly reflects the collective effects of technical variability (e.g., variation/bias in sample preparation, LC-MS analysis, and data processing) and project-specific features (e.g., characteristics of the proteome and biological variation) on the performances of quantitative analysis. To show a proof of concept, we employed the EN method to assess the quantitative accuracy and precision and the ability to quantify subtle ratio changes between groups using different experimental and data-processing approaches and in various cellular and tissue proteomes. It was found that choices of quantitative features, sample size, experimental design, data-processing strategies, and quality of chromatographic separation can profoundly affect quantitative precision and accuracy of label-free quantification. The EN method was also demonstrated as a practical tool to determine the optimal experimental parameters and rational ratio cutoff for reliable protein quantification in specific proteomic experiments, for example, to identify the necessary number of technical/biological replicates per group that affords sufficient power for discovery. Furthermore, we assessed the ability of EN method to estimate levels of false-positives in the discovery of altered proteins, using two concocted sample sets mimicking proteomic profiling using technical and biological replicates, respectively, where the true-positives/negatives are known and span a wide concentration range. It was observed that the EN method correctly reflects the null distribution in a proteomic system and accurately measures false altered proteins discovery rate (FADR). In summary, the EN method provides a straightforward, practical, and accurate alternative to statistics-based approaches for the development and evaluation of proteomic experiments and can be universally adapted to various types of quantitative techniques.
Singh, Varinder; Singh, Baldev; Joshi, Robin; Jaju, Puneet
2017-01-01
Withania somnifera is a high value medicinal plant which is used against large number of ailments. The medicinal properties of the plant attributes to a wide array of important secondary metabolites. The plant is predominantly infected with leaf spot pathogen Alternaria alternata, which leads to substantial biodeterioration of pharmaceutically important metabolites. To develop an effective strategy to combat this disease, proteomics based approach could be useful. Hence, in the present study, three different protein extraction methods tris-buffer based, phenol based and trichloroacetic acid-acetone (TCA-acetone) based method were comparatively evaluated for two-dimensional electrophoresis (2-DE) analysis of W. somnifera. TCA-acetone method was found to be most effective and was further used to identify differentially expressed proteins in response to fungal infection. Thirty-eight differentially expressed proteins were identified by matrix assisted laser desorption/ionization time of flight-mass spectrometry (MALDI TOF/TOF MS/MS). The known proteins were categorized into eight different groups based on their function and maximum proteins belonged to energy and metabolism, cell structure, stress and defense and RNA/DNA categories. Differential expression of some key proteins were also crosschecked at transcriptomic level by using qRT-PCR and were found to be consistent with the 2-DE data. These outcomes enable us to evaluate modifications that take place at the proteomic level during a compatible host pathogen interaction. The comparative proteome analysis conducted in this paper revealed the involvement of many key proteins in the process of pathogenesis and further investigation of these identified proteins could assist in the discovery of new strategies for the development of pathogen resistance in the plant. PMID:28575108
Zougman, Alexandre; Banks, Rosamonde E
2015-01-01
Recently we introduced the concept of Suspension Trapping (STrap) for bottom-up proteomics sample processing that is based upon SDS-mediated protein extraction, swift detergent removal and rapid reactor-type protein digestion in a quartz depth filter trap. As the depth filter surface is made of silica, it is readily modifiable with various functional groups using the silane coupling chemistries. Thus, during the digest, peptides possessing specific features could be targeted for enrichment by the functionalized depth filter material while non-targeted peptides could be collected as an unbound distinct fraction after the digest. In the example presented here the quartz depth filter surface is functionalized with the pyridyldithiol group therefore enabling reversible in-situ capture of the cysteine-containing peptides generated during the STrap-based digest. The described C-STrap method retains all advantages of the original STrap methodology and provides robust foundation for the conception of the targeted in-situ peptide fractionation in the STrap format for bottom-up proteomics. The presented data support the method's use in qualitative and semi-quantitative proteomics experiments.
CPTAC Proteomics Data on UCSC Genome Browser | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium scientists are working together with the University of California, Santa Cruz (UCSC) Genomics Institute to provide public access to cancer proteomics data via the UCSC Genome Browser. This effort extends accessibility of the CPTAC data to more researchers and provides an additional level of analysis to assist the cancer biology community.
Red blood cell (RBC) membrane proteomics--Part I: Proteomics and RBC physiology.
Pasini, Erica M; Lutz, Hans U; Mann, Matthias; Thomas, Alan W
2010-01-03
Membrane proteomics is concerned with accurately and sensitively identifying molecules involved in cell compartmentalisation, including those controlling the interface between the cell and the outside world. The high lipid content of the environment in which these proteins are found often causes a particular set of problems that must be overcome when isolating the required material before effective HPLC-MS approaches can be performed. The membrane is an unusually dynamic cellular structure since it interacts with an ever changing environment. A full understanding of this critical cell component will ultimately require, in addition to proteomics, lipidomics, glycomics, interactomics and study of post-translational modifications. Devoid of nucleus and organelles in mammalian species other than camelids, and constantly in motion in the blood stream, red blood cells (RBCs) are the sole mammalian oxygen transporter. The fact that mature mammalian RBCs have no internal membrane-bound organelles, somewhat simplifies proteomics analysis of the plasma membrane and the fact that it has no nucleus disqualifies microarray based methods. Proteomics has the potential to provide a better understanding of this critical interface, and thereby assist in identifying new approaches to diseases. (c) 2009 Elsevier B.V. All rights reserved.
Progress and pitfalls in finding the 'missing proteins' from the human proteome map.
Segura, Victor; Garin-Muga, Alba; Guruceaga, Elizabeth; Corrales, Fernando J
2017-01-01
The Human Proteome Project was launched with two main goals: the comprehensive and systematic definition of the human proteome map and the development of ready to use analytical tools to measure relevant proteins in their biological context in health and disease. Despite the great progress in this endeavour, there is still a group of reluctant proteins with no, or scarce, experimental evidence supporting their existence. These are called the 'missing proteins' and represent one of the biggest challenges to complete the human proteome map. Areas covered: This review focuses on the description of the missing proteome based on the HUPO standards, the analysis of the reasons explaining the difficulty of detecting missing proteins and the strategies currently used in the search for missing proteins. The present and future of the quest for the missing proteins is critically revised hereafter. Expert commentary: An overarching multidisciplinary effort is currently being done under the HUPO umbrella to allow completion of the human proteome map. It is expected that the detection of missing proteins will grow in the coming years since the methods and the best tissue/cell type sample for their search are already on the table.
Sandra, Koen; Moshir, Mahan; D'hondt, Filip; Verleysen, Katleen; Kas, Koen; Sandra, Pat
2008-04-15
Sample complexity and dynamic range constitute enormous challenges in proteome analysis. The back-end technology in typical proteomics platforms, namely mass spectrometry (MS), can only tolerate a certain complexity, has a limited dynamic range per spectrum and is very sensitive towards ion suppression. Therefore, component overlap has to be minimized for successful mass spectrometric analysis and subsequent protein identification and quantification. The present review describes the advances that have been made in liquid-based separation techniques with focus on the recent developments to boost the resolving power. The review is divided in two parts; the first part deals with unidimensional liquid chromatography and the second part with bi- and multidimensional liquid-based separation techniques. Part 1 mainly focuses on reversed-phase HPLC due to the fact that it is and will, in the near future, remain the technique of choice to be hyphenated with MS. The impact of increasing the column length, decreasing the particle diameter, replacing the traditional packed beds by monolithics, amongst others, is described. The review is complemented with data obtained in the laboratories of the authors.
Plant proteome analysis: a 2006 update.
Jorrín, Jesús V; Maldonado, Ana M; Castillejo, Ma Angeles
2007-08-01
This 2006 'Plant Proteomics Update' is a continuation of the two previously published in 'Proteomics' by 2004 (Canovas et al., Proteomics 2004, 4, 285-298) and 2006 (Rossignol et al., Proteomics 2006, 6, 5529-5548) and it aims to bring up-to-date the contribution of proteomics to plant biology on the basis of the original research papers published throughout 2006, with references to those appearing last year. According to the published papers and topics addressed, we can conclude that, as observed for the three previous years, there has been a quantitative, but not qualitative leap in plant proteomics. The full potential of proteomics is far from being exploited in plant biology research, especially if compared to other organisms, mainly yeast and humans, and a number of challenges, mainly technological, remain to be tackled. The original papers published last year numbered nearly 100 and deal with the proteome of at least 26 plant species, with a high percentage for Arabidopsis thaliana (28) and rice (11). Scientific objectives ranged from proteomic analysis of organs/tissues/cell suspensions (57) or subcellular fractions (29), to the study of plant development (12), the effect of hormones and signalling molecules (8) and response to symbionts (4) and stresses (27). A small number of contributions have covered PTMs (8) and protein interactions (4). 2-DE (specifically IEF-SDS-PAGE) coupled to MS still constitutes the almost unique platform utilized in plant proteome analysis. The application of gel-free protein separation methods and 'second generation' proteomic techniques such as multidimensional protein identification technology (MudPIT), and those for quantitative proteomics including DIGE, isotope-coded affinity tags (ICAT), iTRAQ and stable isotope labelling by amino acids in cell culture (SILAC) still remains anecdotal. This review is divided into seven sections: Introduction, Methodology, Subcellular proteomes, Development, Responses to biotic and abiotic stresses, PTMs and Protein interactions. Section 8 summarizes the major pitfalls and challenges of plant proteomics.
Zhang, Ying; Wang, Xi; Cui, Dan; Zhu, Jun
2016-12-01
Human whole saliva is a vital body fluid for studying the physiology and pathology of the oral cavity. As a powerful technique for biomarker discovery, MS-based proteomic strategies have been introduced for saliva analysis and identified hundreds of proteins and N-glycosylation sites. However, there is still a lack of quantitative analysis, which is necessary for biomarker screening and biological research. In this study, we establish an integrated workflow by the combination of stable isotope dimethyl labeling, HILIC enrichment, and high resolution MS for both quantification of the global proteome and N-glycoproteome of human saliva from oral ulcer patients. With the help of advanced bioinformatics, we comprehensively studied oral ulcers at both protein and glycoprotein scales. Bioinformatics analyses revealed that starch digestion and protein degradation activities are inhibited while the immune response is promoted in oral ulcer saliva. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
MALDI versus ESI: The Impact of the Ion Source on Peptide Identification.
Nadler, Wiebke Maria; Waidelich, Dietmar; Kerner, Alexander; Hanke, Sabrina; Berg, Regina; Trumpp, Andreas; Rösli, Christoph
2017-03-03
For mass spectrometry-based proteomic analyses, electrospray ionization (ESI) and matrix-assisted laser desorption/ionization (MALDI) are the commonly used ionization techniques. To investigate the influence of the ion source on peptide detection in large-scale proteomics, an optimized GeLC/MS workflow was developed and applied either with ESI/MS or with MALDI/MS for the proteomic analysis of different human cell lines of pancreatic origin. Statistical analysis of the resulting data set with more than 72 000 peptides emphasized the complementary character of the two methods, as the percentage of peptides identified with both approaches was as low as 39%. Significant differences between the resulting peptide sets were observed with respect to amino acid composition, charge-related parameters, hydrophobicity, and modifications of the detected peptides and could be linked to factors governing the respective ion yields in ESI and MALDI.
Investigators from the National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) who comprehensively analyzed 95 human colorectal tumor samples, have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, provides a more comprehensive view of the biological features that drive cancer than genomic analysis alone and may help identify the most important targets for cancer detection and intervention.
Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.
Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin
2014-06-13
Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858bp, including 26,413 clusters and 27,794 singletons. A total of 42,515(78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10(-5), of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741(43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Guidelines for reporting quantitative mass spectrometry based experiments in proteomics.
Martínez-Bartolomé, Salvador; Deutsch, Eric W; Binz, Pierre-Alain; Jones, Andrew R; Eisenacher, Martin; Mayer, Gerhard; Campos, Alex; Canals, Francesc; Bech-Serra, Joan-Josep; Carrascal, Montserrat; Gay, Marina; Paradela, Alberto; Navajas, Rosana; Marcilla, Miguel; Hernáez, María Luisa; Gutiérrez-Blázquez, María Dolores; Velarde, Luis Felipe Clemente; Aloria, Kerman; Beaskoetxea, Jabier; Medina-Aunon, J Alberto; Albar, Juan P
2013-12-16
Mass spectrometry is already a well-established protein identification tool and recent methodological and technological developments have also made possible the extraction of quantitative data of protein abundance in large-scale studies. Several strategies for absolute and relative quantitative proteomics and the statistical assessment of quantifications are possible, each having specific measurements and therefore, different data analysis workflows. The guidelines for Mass Spectrometry Quantification allow the description of a wide range of quantitative approaches, including labeled and label-free techniques and also targeted approaches such as Selected Reaction Monitoring (SRM). The HUPO Proteomics Standards Initiative (HUPO-PSI) has invested considerable efforts to improve the standardization of proteomics data handling, representation and sharing through the development of data standards, reporting guidelines, controlled vocabularies and tooling. In this manuscript, we describe a key output from the HUPO-PSI-namely the MIAPE Quant guidelines, which have developed in parallel with the corresponding data exchange format mzQuantML [1]. The MIAPE Quant guidelines describe the HUPO-PSI proposal concerning the minimum information to be reported when a quantitative data set, derived from mass spectrometry (MS), is submitted to a database or as supplementary information to a journal. The guidelines have been developed with input from a broad spectrum of stakeholders in the proteomics field to represent a true consensus view of the most important data types and metadata, required for a quantitative experiment to be analyzed critically or a data analysis pipeline to be reproduced. It is anticipated that they will influence or be directly adopted as part of journal guidelines for publication and by public proteomics databases and thus may have an impact on proteomics laboratories across the world. This article is part of a Special Issue entitled: Standardization and Quality Control. Copyright © 2013 Elsevier B.V. All rights reserved.
Uddin, Reaz; Jamil, Faiza
2018-06-01
Pseudomonas aeruginosa is an opportunistic gram-negative bacterium that has the capability to acquire resistance under hostile conditions and become a threat worldwide. It is involved in nosocomial infections. In the current study, potential novel drug targets against P. aeruginosa have been identified using core proteomic analysis and Protein-Protein Interactions (PPIs) studies. The non-redundant reference proteome of 68 strains having complete genome and latest assembly version of P. aeruginosa were downloaded from ftp NCBI RefSeq server in October 2016. The standalone CD-HIT tool was used to cluster ortholog proteins (having >=80% amino acid identity) present in all strains. The pan-proteome was clustered in 12,380 Clusters of Orthologous Proteins (COPs). By using in-house shell scripts, 3252 common COPs were extracted out and designated as clusters of core proteome. The core proteome of PAO1 strain was selected by fetching PAO1's proteome from common COPs. As a result, 1212 proteins were shortlisted that are non-homologous to the human but essential for the survival of the pathogen. Among these 1212 proteins, 321 proteins are conserved hypothetical proteins. Considering their potential as drug target, those 321 hypothetical proteins were selected and their probable functions were characterized. Based on the druggability criteria, 18 proteins were shortlisted. The interacting partners were identified by investigating the PPIs network using STRING v10 database. Subsequently, 8 proteins were shortlisted as 'hub proteins' and proposed as potential novel drug targets against P. aeruginosa. The study is interesting for the scientific community working to identify novel drug targets against MDR pathogens particularly P. aeruginosa. Copyright © 2018 Elsevier Ltd. All rights reserved.
Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O
2015-08-25
Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
Preparation of the low molecular weight serum proteome for mass spectrometry analysis.
Waybright, Timothy J; Chan, King C; Veenstra, Timothy D; Xiao, Zhen
2013-01-01
The discovery of viable biomarkers or indicators of disease states is complicated by the inherent complexity of the chosen biological specimen. Every sample, whether it is serum, plasma, urine, tissue, cells, or a host of others, contains thousands of large and small components, each interacting in multiple ways. The need to concentrate on a group of these components to narrow the focus on a potential biomarker candidate becomes, out of necessity, a priority, especially in the search for immune-related low molecular weight serum biomarkers. One such method in the field of proteomics is to divide the sample proteome into groups based on the size of the protein, analyze each group, and mine the data for statistically significant items. This chapter details a portion of this method, concentrating on a method for fractionating and analyzing the low molecular weight proteome of human serum.
Integrated proteogenomic characterization of human high grade serous ovarian cancer
Zhang, Bai; McDermott, Jason E; Zhou, Jian-Ying; Petyuk, Vladislav A; Chen, Li; Ray, Debjit; Sun, Shisheng; Yang, Feng; Chen, Lijun; Wang, Jing; Shah, Punit; Cha, Seong Won; Aiyetan, Paul; Woo, Sunghee; Tian, Yuan; Gritsenko, Marina A; Clauss, Therese R; Choi, Caitlin; Monroe, Matthew E; Thomas, Stefani; Nie, Song; Wu, Chaochao; Moore, Ronald J; Yu, Kun-Hsing; Tabb, David L; Fenyö, David; Bafna, Vineet; Wang, Yue; Rodriguez, Henry; Boja, Emily S; Hiltke, Tara; Rivers, Robert C; Sokoll, Lori; Zhu, Heng; Shih, Ie-Ming; Cope, Leslie; Pandey, Akhilesh; Zhang, Bing; Snyder, Michael P; Levine, Douglas A; Smith, Richard D
2016-01-01
SUMMARY To provide a detailed analysis of the molecular components and underlying mechanisms associated with ovarian cancer, we performed a comprehensive mass spectrometry-based proteomic characterization of 174 ovarian tumors previously analyzed by The Cancer Genome Atlas (TCGA), of which 169 were high-grade serous carcinomas (HGSC). Integrating our proteomic measurements with the genomic data yielded a number of insights into disease such as how different copy number alternations influence the proteome, the proteins associated with chromosomal instability, the sets of signaling pathways that diverse genome rearrangements converge on, as well as the ones most associated with short overall survival. Specific protein acetylations associated with homologous recombination deficiency suggest a potential means for stratifying patients for therapy. In addition to providing a valuable resource, these findings provide a view of how the somatic genome drives the cancer proteome and associations between protein and post-translational modification levels and clinical outcomes in HGSC. PMID:27372738
Journet, Agnès; Klein, Gérard; Brugière, Sabine; Vandenbrouck, Yves; Chapel, Agnès; Kieffer, Sylvie; Bruley, Christophe; Masselon, Christophe; Aubry, Laurence
2012-01-01
The cellular slime mold Dictyostelium discoideum is a soil-living eukaryote, which feeds on microorganisms engulfed by phagocytosis. Axenic laboratory strains have been produced that are able to use liquid growth medium internalized by macropinocytosis as the source of food. To better define the macropinocytosis process, we established the inventory of proteins associated with this pathway using mass spectrometry-based proteomics. Using a magnetic purification procedure and high-performance LC-MS/MS proteome analysis, a list of 2108 non-redundant proteins was established, of which 24% featured membrane-spanning domains. Bioinformatics analyses indicated that the most abundant proteins were linked to signaling, vesicular trafficking and the cytoskeleton. The present repertoire validates our purification method and paves the way for a future proteomics approach to study the dynamics of macropinocytosis. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Stable isotope dimethyl labelling for quantitative proteomics and beyond
Hsu, Jue-Liang; Chen, Shu-Hui
2016-01-01
Stable-isotope reductive dimethylation, a cost-effective, simple, robust, reliable and easy-to- multiplex labelling method, is widely applied to quantitative proteomics using liquid chromatography-mass spectrometry. This review focuses on biological applications of stable-isotope dimethyl labelling for a large-scale comparative analysis of protein expression and post-translational modifications based on its unique properties of the labelling chemistry. Some other applications of the labelling method for sample preparation and mass spectrometry-based protein identification and characterization are also summarized. This article is part of the themed issue ‘Quantitative mass spectrometry’. PMID:27644970
A Proteomics View of the Molecular Mechanisms and Biomarkers of Glaucomatous Neurodegeneration
Tezel, Gülgün
2013-01-01
Despite improving understanding of glaucoma, key molecular players of neurodegeneration that can be targeted for treatment of glaucoma, or molecular biomarkers that can be useful for clinical testing, remain unclear. Proteomics technology offers a powerful toolbox to accomplish these important goals of the glaucoma research and is increasingly being applied to identify molecular mechanisms and biomarkers of glaucoma. Recent studies of glaucoma using proteomics analysis techniques have resulted in the lists of differentially expressed proteins in human glaucoma and animal models. The global analysis of protein expression in glaucoma has been followed by cell-specific proteome analysis of retinal ganglion cells and astrocytes. The proteomics data have also guided targeted studies to identify post-translational modifications and protein-protein interactions during glaucomatous neurodegeneration. In addition, recent applications of proteomics have provided a number of potential biomarker candidates. Proteomics technology holds great promise to move glaucoma research forward toward new treatment strategies and biomarker discovery. By reviewing the major proteomics approaches and their applications in the field of glaucoma, this article highlights the power of proteomics in translational and clinical research related to glaucoma and also provides a framework for future research to functionally test the importance of specific molecular pathways and validate candidate biomarkers. PMID:23396249
The MaxQuant computational platform for mass spectrometry-based shotgun proteomics.
Tyanova, Stefka; Temu, Tikira; Cox, Juergen
2016-12-01
MaxQuant is one of the most frequently used platforms for mass-spectrometry (MS)-based proteomics data analysis. Since its first release in 2008, it has grown substantially in functionality and can be used in conjunction with more MS platforms. Here we present an updated protocol covering the most important basic computational workflows, including those designed for quantitative label-free proteomics, MS1-level labeling and isobaric labeling techniques. This protocol presents a complete description of the parameters used in MaxQuant, as well as of the configuration options of its integrated search engine, Andromeda. This protocol update describes an adaptation of an existing protocol that substantially modifies the technique. Important concepts of shotgun proteomics and their implementation in MaxQuant are briefly reviewed, including different quantification strategies and the control of false-discovery rates (FDRs), as well as the analysis of post-translational modifications (PTMs). The MaxQuant output tables, which contain information about quantification of proteins and PTMs, are explained in detail. Furthermore, we provide a short version of the workflow that is applicable to data sets with simple and standard experimental designs. The MaxQuant algorithms are efficiently parallelized on multiple processors and scale well from desktop computers to servers with many cores. The software is written in C# and is freely available at http://www.maxquant.org.
Quantitation of heat-shock proteins in clinical samples using mass spectrometry.
Kaur, Punit; Asea, Alexzander
2011-01-01
Mass spectrometry (MS) is a powerful analytical tool for proteomics research and drug and biomarker discovery. MS enables identification and quantification of known and unknown compounds by revealing their structural and chemical properties. Proper sample preparation for MS-based analysis is a critical step in the proteomics workflow because the quality and reproducibility of sample extraction and preparation for downstream analysis significantly impact the separation and identification capabilities of mass spectrometers. The highly expressed proteins represent potential biomarkers that could aid in diagnosis, therapy, or drug development. Because the proteome is so complex, there is no one standard method for preparing protein samples for MS analysis. Protocols differ depending on the type of sample, source, experiment, and method of analysis. Molecular chaperones play significant roles in almost all biological functions due to their capacity for detecting intracellular denatured/unfolded proteins, initiating refolding or denaturation of such malfolded protein sequences and more recently for their role in the extracellular milieu as chaperokines. In this chapter, we describe the latest techniques for quantitating the expression of molecular chaperones in human clinical samples.
Xing, Tong; Wang, Chong; Zhao, Xue; Dai, Chen; Zhou, Guanghong; Xu, Xinglian
2017-04-05
The current study was designed to investigate changes in the protein profiles of pale, soft, and exudative (PSE)-like muscles of broilers subjected to transportation under high-temperature conditions, using isobaric tags for relative and absolute analysis quantitation (iTRAQ). Arbor Acres chickens (n = 112) were randomly divided into two treatments: unstressed control (CON) and 0.5 h of transport (T). Birds were transported according to a designed protocol. Pectoralis major (PM) muscle samples in the T group were collected and classified as normal (T-NOR) or PSE-like (T-PSE). Plasma activities of stress indicators, muscle microstructure, and proteome were measured. Results indicated that broilers in the T-PSE group exhibited higher activities of plasma stress indicators. The microstructure of T-PSE group showed a looser network and larger intercellular spaces in comparison to the other groups. Proteomic analysis, based on iTRAQ, revealed 29 differentially expressed proteins in the T-NOR and T-PSE groups that were involved in protein turnover, signal transduction, stress and defense, calcium handling, cell structure, and metabolism. In particular, proteins relating to the glycolysis pathway, calcium signaling, and molecular chaperones exhibited significant differences that may contribute to the inferior post-mortem meat quality. Overall, the proteomic results provide a further understanding of the mechanism of meat quality changes in response to stress.
Proteomic analysis of urine in rats chronically exposed to fluoride.
Kobayashi, Claudia Ayumi Nakai; Leite, Aline de Lima; da Silva, Thelma Lopes; dos Santos, Lucilene Delazari; Nogueira, Fábio César Sousa; Santos, Keity Souza; de Oliveira, Rodrigo Cardoso; Palma, Mario Sérgio; Domont, Gilberto Barbosa; Buzalaf, Marília Afonso Rabelo
2011-01-01
Urine is an ideal source of materials to search for potential disease-related biomarkers as it is produced by the affected tissues and can be easily obtained by noninvasive methods. 2-DE-based proteomic approach was used to better understand the molecular mechanisms of injury induced by fluoride (F(-)) and define potential biomarkers of dental fluorosis. Three groups of weanling male Wistar rats were treated with drinking water containing 0 (control), 5, or 50 ppm F(-) for 60 days (n = 15/group). During the experimental period, the animals were kept individually in metabolic cages, to analyze the water and food consumption, as well as fecal and urinary F(-) excretion. Urinary proteome profiles were examined using 2-DE and Colloidal Coomassie Brilliant Blue staining. A dose-response regarding F(-) intake and excretion was detected. Quantitative intensity analysis revealed 8, 11, and 8 significantly altered proteins between control vs. 5 ppm F(-), control vs. 50 ppm F(-) and 5 ppm F(-) vs. 50 ppm F(-) groups, respectively. Two proteins regulated by androgens (androgen-regulated 20-KDa protein and α-2μ-globulin) and one related to detoxification (aflatoxin-B1-aldehyde-reductase) were identified by MALDI-TOF-TOF MS/MS. Thus, proteomic analysis can help to better understand the mechanisms underlying F(-) toxicity, even in low doses. Copyright © 2010 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jacobs, Jon M.; Yang, Xiaohua; Luft, Benjamin J.
2005-04-01
The Borrelia burgdorferi spirochete is the causative agent of Lyme disease, the most common tick-borne disease in the United States. It has been studied extensively to help understand its pathogenicity of infection and how it can persist in different mammalian hosts. We report the proteomic analysis of the archetype B. burgdorferi B31 strain and two other strains (ND40, and JD-1) having different Borrelia pathotypes using strong cation exchange fractionation of proteolytic peptides followed by high-resolution, reversed phase capillary liquid chromatography coupled with ion trap tandem mass spectrometric (LC-MS/MS) analysis. Protein identification was facilitated by the availability of the complete B31more » genome sequence. A total of 665 Borrelia proteins were identified representing ~38 % coverage of the theoretical B31 proteome. A significant overlap was observed between the identified proteins in direct comparisons between any two strains (>72%), but distinct differences were observed among identified hypothetical and outer membrane proteins of the three strains. Such a concurrent proteomic overview of three Borrelia strains based upon only the B31 genome sequence is shown to provide significant insights into the presence or absence of specific proteins and a broad overall comparison among strains.« less
FunRich proteomics software analysis, let the fun begin!
Benito-Martin, Alberto; Peinado, Héctor
2015-08-01
Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Pan, Hai-Tao; Ding, Hai-Gang; Fang, Min; Yu, Bin; Cheng, Yi; Tan, Ya-Jing; Fu, Qi-Qin; Lu, Bo; Cai, Hong-Guang; Jin, Xin; Xia, Xian-Qing; Zhang, Tao
2018-01-01
Recurrent miscarriage (RM) affects 5% of women, it has an adverse emotional impact on women. Because of the complexities of early development, the mechanism of recurrent miscarriage is still unclear. We hypothesized that abnormal placenta leads to early recurrent miscarriage (ERM). The aim of this study was to identify ERM associated factors in human placenta villous tissue using proteomics. Investigation of these differences in protein expression in parallel profiling is essential to understand the comprehensive pathophysiological mechanism underlying recurrent miscarriage (RM). To gain more insight into mechanisms of recurrent miscarriage (RM), a comparative proteome profile of the human placenta villous tissue in normal and RM pregnancies was analyzed using iTRAQ technology and bioinformatics analysis used by Ingenuity Pathway Analysis (IPA) software. In this study, we employed an iTRAQ based proteomics analysis of four placental villous tissues from patients with early recurrent miscarriage (ERM) and four from normal pregnant women. Finally, we identified 2805 proteins and 79,998 peptides between patients with RM and normal matched group. Further analysis identified 314 differentially expressed proteins in placental villous tissue (≥1.3-fold, Student's t-test, p < 0.05); 209 proteins showed the increased expression while 105 proteins showed decreased expression. These 314 proteins were analyzed by Ingenuity Pathway Analysis (IPA) and were found to play important roles in the growth of embryo. Furthermore, network analysis show that Angiotensinogen (AGT), MAPK14 and Prothrombin (F2) are core factors in early embryonic development. We used another 8 independent samples (4 cases and 4 controls) to cross validation of the proteomic data. This study has identified several proteins that are associated with early development, these results may supply new insight into mechanisms behind recurrent miscarriage. Copyright © 2017 Elsevier Ltd. All rights reserved.
Kistler, Andreas D.; Serra, Andreas L.; Siwy, Justyna; Poster, Diane; Krauer, Fabienne; Torres, Vicente E.; Mrug, Michal; Grantham, Jared J.; Bae, Kyongtae T.; Bost, James E.; Mullen, William; Wüthrich, Rudolf P.; Mischak, Harald; Chapman, Arlene B.
2013-01-01
Treatment options for autosomal dominant polycystic kidney disease (ADPKD) will likely become available in the near future, hence reliable diagnostic and prognostic biomarkers for the disease are strongly needed. Here, we aimed to define urinary proteomic patterns in ADPKD patients, which aid diagnosis and risk stratification. By capillary electrophoresis online coupled to mass spectrometry (CE-MS), we compared the urinary peptidome of 41 ADPKD patients to 189 healthy controls and identified 657 peptides with significantly altered excretion, of which 209 could be sequenced using tandem mass spectrometry. A support-vector-machine based diagnostic biomarker model based on the 142 most consistent peptide markers achieved a diagnostic sensitivity of 84.5% and specificity of 94.2% in an independent validation cohort, consisting of 251 ADPKD patients from five different centers and 86 healthy controls. The proteomic alterations in ADPKD included, but were not limited to markers previously associated with acute kidney injury (AKI). The diagnostic biomarker model was highly specific for ADPKD when tested in a cohort consisting of 481 patients with a variety of renal and extrarenal diseases, including AKI. Similar to ultrasound, sensitivity and specificity of the diagnostic score depended on patient age and genotype. We were furthermore able to identify biomarkers for disease severity and progression. A proteomic severity score was developed to predict height adjusted total kidney volume (htTKV) based on proteomic analysis of 134 ADPKD patients and showed a correlation of r = 0.415 (p<0.0001) with htTKV in an independent validation cohort consisting of 158 ADPKD patients. In conclusion, the performance of peptidomic biomarker scores is superior to any other biochemical markers of ADPKD and the proteomic biomarker patterns are a promising tool for prognostic evaluation of ADPKD. PMID:23326375
2013-01-01
Background Guanine-cytosine (GC) composition is an important feature of genomes. Likewise, amino acid composition is a distinct, but less valued, feature of proteomes. A major concern is that it is not clear what valuable information can be acquired from amino acid composition data. To address this concern, in-depth analyses of the amino acid composition of the complete proteomes from 63 archaea, 270 bacteria, and 128 eukaryotes were performed. Results Principal component analysis of the amino acid matrices showed that the main contributors to proteomic architecture were genomic GC variation, phylogeny, and environmental influences. GC pressure drove positive selection on Ala, Arg, Gly, Pro, Trp, and Val, and adverse selection on Asn, Lys, Ile, Phe, and Tyr. The physico-chemical framework of the complete proteomes withstood GC pressure by frequency complementation of GC-dependent amino acid pairs with similar physico-chemical properties. Gln, His, Ser, and Val were responsible for phylogeny and their constituted components could differentiate archaea, bacteria, and eukaryotes. Environmental niche was also a significant factor in determining proteomic architecture, especially for archaea for which the main amino acids were Cys, Leu, and Thr. In archaea, hyperthermophiles, acidophiles, mesophiles, psychrophiles, and halophiles gathered successively along the environment-based principal component. Concordance between proteomic architecture and the genetic code was also related closely to genomic GC content, phylogeny, and lifestyles. Conclusions Large-scale analyses of the complete proteomes of a wide range of organisms suggested that amino acid composition retained the trace of GC variation, phylogeny, and environmental influences during evolution. The findings from this study will help in the development of a global understanding of proteome evolution, and even biological evolution. PMID:24088322
CPTAC Assay Portal: a repository of targeted proteomic assays
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whiteaker, Jeffrey R.; Halusa, Goran; Hoofnagle, Andrew N.
2014-06-27
To address these issues, the Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI) has launched an Assay Portal (http://assays.cancer.gov) to serve as a public repository of well-characterized quantitative, MS-based, targeted proteomic assays. The purpose of the CPTAC Assay Portal is to facilitate widespread adoption of targeted MS assays by disseminating SOPs, reagents, and assay characterization data for highly characterized assays. A primary aim of the NCI-supported portal is to bring together clinicians or biologists and analytical chemists to answer hypothesis-driven questions using targeted, MS-based assays. Assay content is easily accessed through queries and filters, enabling investigatorsmore » to find assays to proteins relevant to their areas of interest. Detailed characterization data are available for each assay, enabling researchers to evaluate assay performance prior to launching the assay in their own laboratory.« less
Computational clustering for viral reference proteomes
Chen, Chuming; Huang, Hongzhan; Mazumder, Raja; Natale, Darren A.; McGarvey, Peter B.; Zhang, Jian; Polson, Shawn W.; Wang, Yuqi; Wu, Cathy H.
2016-01-01
Motivation: The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies. Results: We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt’s curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs. Availability and implementation: http://proteininformationresource.org/rps/viruses/ Contact: chenc@udel.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153712
Wen, Qiong; Zhang, Li; Mao, Hai-Ping; Tang, Xue-Qing; Rong, Rong; Fan, Jin-Jin; Yu, Xue-Qing
2013-08-30
Peritoneal membranes can be categorized as high, high average, low average, and low transporters, based on the removal or transport rate of solutes. In this study, we used proteomic analysis to determine the differences in proteins removed by different types of peritoneal membranes. Peritoneal transport characteristics in patients who received peritoneal dialysis therapy were assessed by a peritoneal equilibration test. Two-dimensional differential gel electrophoresis technology followed by quantitative analysis was performed to study the variation in protein expression from peritoneal dialysis effluents (PDE) among different groups. Proteins were identified by MALDI-TOF-MS/MS analyses. Further validation in PDE or serum was performed utilizing ELISA analysis. Proteomics analysis revealed ten protein spots with significant differences in intensity levels among different groups, including vitamin D-binding protein, complement C3, apolipoprotein-A1, complement factor C4A, haptoglobin, alpha-1 antitrypsin, immunoglobulin kappa light chain, alpha-2-microglobulin, retinol-binding protein 4 and transthyretin. The levels of vitamin D-binding protein, complement C3, and apolipoprotein-A1 in PDE derived from different groups were greatly varied (P<0.05). However, no significant difference was found in the serum levels of these proteins among different groups (P>0.05 for all groups). This study provides a novel overview of the differences in PDE proteomes of four types of peritoneal membranes. Vitamin D-binding protein, complement C3, and apolipoprotein-A1 showed enhanced expression in PDE of patients with high transporter. Copyright © 2013 Elsevier Inc. All rights reserved.
Determination of burn patient outcome by large-scale quantitative discovery proteomics
Finnerty, Celeste C.; Jeschke, Marc G.; Qian, Wei-Jun; Kaushal, Amit; Xiao, Wenzhong; Liu, Tao; Gritsenko, Marina A.; Moore, Ronald J.; Camp, David G.; Moldawer, Lyle L.; Elson, Constance; Schoenfeld, David; Gamelli, Richard; Gibran, Nicole; Klein, Matthew; Arnoldo, Brett; Remick, Daniel; Smith, Richard D.; Davis, Ronald; Tompkins, Ronald G.; Herndon, David N.
2013-01-01
Objective Emerging proteomics techniques can be used to establish proteomic outcome signatures and to identify candidate biomarkers for survival following traumatic injury. We applied high-resolution liquid chromatography-mass spectrometry (LC-MS) and multiplex cytokine analysis to profile the plasma proteome of survivors and non-survivors of massive burn injury to determine the proteomic survival signature following a major burn injury. Design Proteomic discovery study. Setting Five burn hospitals across the U.S. Patients Thirty-two burn patients (16 non-survivors and 16 survivors), 19–89 years of age, were admitted within 96 h of injury to the participating hospitals with burns covering >20% of the total body surface area and required at least one surgical intervention. Interventions None. Measurements and Main Results We found differences in circulating levels of 43 proteins involved in the acute phase response, hepatic signaling, the complement cascade, inflammation, and insulin resistance. Thirty-two of the proteins identified were not previously known to play a role in the response to burn. IL-4, IL-8, GM-CSF, MCP-1, and β2-microglobulin correlated well with survival and may serve as clinical biomarkers. Conclusions These results demonstrate the utility of these techniques for establishing proteomic survival signatures and for use as a discovery tool to identify candidate biomarkers for survival. This is the first clinical application of a high-throughput, large-scale LC-MS-based quantitative plasma proteomic approach for biomarker discovery for the prediction of patient outcome following burn, trauma or critical illness. PMID:23507713
Role of Proteomics in the Development of Personalized Medicine.
Jain, Kewal K
2016-01-01
Advances in proteomic technologies have made import contribution to the development of personalized medicine by facilitating detection of protein biomarkers, proteomics-based molecular diagnostics, as well as protein biochips and pharmacoproteomics. Application of nanobiotechnology in proteomics, nanoproteomics, has further enhanced applications in personalized medicine. Proteomics-based molecular diagnostics will have an important role in the diagnosis of certain conditions and understanding the pathomechanism of disease. Proteomics will be a good bridge between diagnostics and therapeutics; the integration of these will be important for advancing personalized medicine. Use of proteomic biomarkers and combination of pharmacoproteomics with pharmacogenomics will enable stratification of clinical trials and improve monitoring of patients for development of personalized therapies. Proteomics is an important component of several interacting technologies used for development of personalized medicine, which is depicted graphically. Finally, cancer is a good example of applications of proteomic technologies for personalized management of cancer. © 2016 Elsevier Inc. All rights reserved.
Proteomic analysis of isolated chlamydomonas centrioles reveals orthologs of ciliary-disease genes.
Keller, Lani C; Romijn, Edwin P; Zamora, Ivan; Yates, John R; Marshall, Wallace F
2005-06-21
The centriole is one of the most enigmatic organelles in the cell. Centrioles are cylindrical, microtubule-based barrels found in the core of the centrosome. Centrioles also act as basal bodies during interphase to nucleate the assembly of cilia and flagella. There are currently only a handful of known centriole proteins. We used mass-spectrometry-based MudPIT (multidimensional protein identification technology) to identify the protein composition of basal bodies (centrioles) isolated from the green alga Chlamydomonas reinhardtii. This analysis detected the majority of known centriole proteins, including centrin, epsilon tubulin, and the cartwheel protein BLD10p. By combining proteomic data with information about gene expression and comparative genomics, we identified 45 cross-validated centriole candidate proteins in two classes. Members of the first class of proteins (BUG1-BUG27) are encoded by genes whose expression correlates with flagellar assembly and which therefore may play a role in ciliogenesis-related functions of basal bodies. Members of the second class (POC1-POC18) are implicated by comparative-genomics and -proteomics studies to be conserved components of the centriole. We confirmed centriolar localization for the human homologs of four candidate proteins. Three of the cross-validated centriole candidate proteins are encoded by orthologs of genes (OFD1, NPHP-4, and PACRG) implicated in mammalian ciliary function and disease, suggesting that oral-facial-digital syndrome and nephronophthisis may involve a dysfunction of centrioles and/or basal bodies. By analyzing isolated Chlamydomonas basal bodies, we have been able to obtain the first reported proteomic analysis of the centriole.
Wierer, Michael; Prestel, Matthias; Schiller, Herbert B; Yan, Guangyao; Schaab, Christoph; Azghandi, Sepiede; Werner, Julia; Kessler, Thorsten; Malik, Rainer; Murgia, Marta; Aherrahrou, Zouhair; Schunkert, Heribert; Dichgans, Martin; Mann, Matthias
2018-02-01
Atherosclerosis leads to vascular lesions that involve major rearrangements of the vascular proteome, especially of the extracellular matrix (ECM). Using single aortas from ApoE knock out mice, we quantified formation of plaques by single-run, high-resolution mass spectrometry (MS)-based proteomics. To probe localization on a proteome-wide scale we employed quantitative detergent solubility profiling. This compartment- and time-resolved resource of atherogenesis comprised 5117 proteins, 182 of which changed their expression status in response to vessel maturation and atherosclerotic plaque development. In the insoluble ECM proteome, 65 proteins significantly changed, including relevant collagens, matrix metalloproteinases and macrophage derived proteins. Among novel factors in atherosclerosis, we identified matrilin-2, the collagen IV crosslinking enzyme peroxidasin as well as the poorly characterized MAM-domain containing 2 (Mamdc2) protein as being up-regulated in the ECM during atherogenesis. Intriguingly, three subunits of the osteoclast specific V-ATPase complex were strongly increased in mature plaques with an enrichment in macrophages thus implying an active de-mineralization function. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Wierer, Michael; Prestel, Matthias; Schiller, Herbert B.; Yan, Guangyao; Schaab, Christoph; Azghandi, Sepiede; Werner, Julia; Kessler, Thorsten; Malik, Rainer; Murgia, Marta; Aherrahrou, Zouhair; Schunkert, Heribert; Dichgans, Martin; Mann, Matthias
2018-01-01
Atherosclerosis leads to vascular lesions that involve major rearrangements of the vascular proteome, especially of the extracellular matrix (ECM). Using single aortas from ApoE knock out mice, we quantified formation of plaques by single-run, high-resolution mass spectrometry (MS)-based proteomics. To probe localization on a proteome-wide scale we employed quantitative detergent solubility profiling. This compartment- and time-resolved resource of atherogenesis comprised 5117 proteins, 182 of which changed their expression status in response to vessel maturation and atherosclerotic plaque development. In the insoluble ECM proteome, 65 proteins significantly changed, including relevant collagens, matrix metalloproteinases and macrophage derived proteins. Among novel factors in atherosclerosis, we identified matrilin-2, the collagen IV crosslinking enzyme peroxidasin as well as the poorly characterized MAM-domain containing 2 (Mamdc2) protein as being up-regulated in the ECM during atherogenesis. Intriguingly, three subunits of the osteoclast specific V-ATPase complex were strongly increased in mature plaques with an enrichment in macrophages thus implying an active de-mineralization function. PMID:29208753
Al Kindi, Mahmood A; Colella, Alex D; Chataway, Tim K; Jackson, Michael W; Wang, Jing J; Gordon, Tom P
2016-04-01
The structures of epitopes bound by autoantibodies against RNA-protein complexes have been well-defined over several decades, but little is known of the clonality, immunoglobulin (Ig) variable (V) gene usage and mutational status of the autoantibodies themselves at the level of the secreted (serum) proteome. A novel proteomic workflow is presented based on affinity purification of specific Igs from serum, high-resolution two-dimensional gel electrophoresis, and de novo and database-driven sequencing of V-region proteins by mass spectrometry. Analysis of anti-Ro52/Ro60/La proteomes in primary Sjögren's syndrome (SS) and anti-Sm and anti-ribosomal P proteomes in systemic lupus erythematosus (SLE) has revealed that these antibody responses are dominated by restricted sets of public (shared) clonotypes, consistent with common pathways of production across unrelated individuals. The discovery of shared sets of specific V-region peptides can be exploited for diagnostic biomarkers in targeted mass spectrometry platforms and for tracking and removal of pathogenic clones. Copyright © 2016 Elsevier B.V. All rights reserved.
Recent advances in proteomic applications for schistosomiasis research: potential clinical impact.
Sotillo, Javier; Doolan, Denise; Loukas, Alex
2017-02-01
Schistosomiasis is a neglected tropical disease affecting hundreds of millions of people worldwide. Recent advances in the field of proteomics and the development of new and highly sensitive mass spectrometers and quantitative techniques have provided new tools for advancing the molecular biology, cell biology, diagnosis and vaccine development for public health threats such as schistosomiasis. Areas covered: In this review we describe the latest advances in research that utilizes proteomics-based tools to address some of the key challenges to developing effective interventions against schistosomiasis. We also provide information about the potential of extracellular vesicles to advance the fight against this devastating disease. Expert commentary: Different proteins are already being tested as vaccines against schistosomiasis with promising results. The re-analysis of the Schistosoma spp. proteomes using new and more sensitive mass spectrometers as well as better separation approaches will help identify more vaccine targets in a rational and informed manner. In addition, the recent development of new proteome microarrays will facilitate characterisation of novel markers of infection as well as new vaccine and diagnostic candidate antigens.
Persi, Erez; Horn, David
2013-01-01
We present a novel analysis of compositional order (CO) based on the occurrence of Frequent amino-acid Triplets (FTs) that appear much more than random in protein sequences. The method captures all types of proteomic compositional order including single amino-acid runs, tandem repeats, periodic structure of motifs and otherwise low complexity amino-acid regions. We introduce new order measures, distinguishing between ‘regularity’, ‘periodicity’ and ‘vocabulary’, to quantify these phenomena and to facilitate the identification of evolutionary effects. Detailed analysis of representative species across the tree-of-life demonstrates that CO proteins exhibit numerous functional enrichments, including a wide repertoire of particular patterns of dependencies on regularity and periodicity. Comparison between human and mouse proteomes further reveals the interplay of CO with evolutionary trends, such as faster substitution rate in mouse leading to decrease of periodicity, while innovation along the human lineage leads to larger regularity. Large-scale analysis of 94 proteomes leads to systematic ordering of all major taxonomic groups according to FT-vocabulary size. This is measured by the count of Different Frequent Triplets (DFT) in proteomes. The latter provides a clear hierarchical delineation of vertebrates, invertebrates, plants, fungi and prokaryotes, with thermophiles showing the lowest level of FT-vocabulary. Among eukaryotes, this ordering correlates with phylogenetic proximity. Interestingly, in all kingdoms CO accumulation in the proteome has universal characteristics. We suggest that CO is a genomic-information correlate of both macroevolution and various protein functions. The results indicate a mechanism of genomic ‘innovation’ at the peptide level, involved in protein elongation, shaped in a universal manner by mutational and selective forces. PMID:24278003
The Escherichia coli Proteome: Past, Present, and Future Prospects†
Han, Mee-Jung; Lee, Sang Yup
2006-01-01
Proteomics has emerged as an indispensable methodology for large-scale protein analysis in functional genomics. The Escherichia coli proteome has been extensively studied and is well defined in terms of biochemical, biological, and biotechnological data. Even before the entire E. coli proteome was fully elucidated, the largest available data set had been integrated to decipher regulatory circuits and metabolic pathways, providing valuable insights into global cellular physiology and the development of metabolic and cellular engineering strategies. With the recent advent of advanced proteomic technologies, the E. coli proteome has been used for the validation of new technologies and methodologies such as sample prefractionation, protein enrichment, two-dimensional gel electrophoresis, protein detection, mass spectrometry (MS), combinatorial assays with n-dimensional chromatographies and MS, and image analysis software. These important technologies will not only provide a great amount of additional information on the E. coli proteome but also synergistically contribute to other proteomic studies. Here, we review the past development and current status of E. coli proteome research in terms of its biological, biotechnological, and methodological significance and suggest future prospects. PMID:16760308
Tumor Cold Ischemia | Office of Cancer Clinical Proteomics Research
In a recently published manuscript in the journal of Molecular and Cellular Proteomics, researchers from the National Cancer Institutes (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigated the effect of cold ischemia on the proteome of fresh frozen tumors.
Davidi, Lital; Levin, Yishai; Ben-Dor, Shifra; Pick, Uri
2015-01-01
The halotolerant green alga Dunaliella bardawil is unique in that it accumulates under stress two types of lipid droplets: cytoplasmatic lipid droplets (CLD) and β-carotene-rich (βC) plastoglobuli. Recently, we isolated and analyzed the lipid and pigment compositions of these lipid droplets. Here, we describe their proteome analysis. A contamination filter and an enrichment filter were utilized to define core proteins. A proteome database of Dunaliella salina/D. bardawil was constructed to aid the identification of lipid droplet proteins. A total of 124 and 42 core proteins were identified in βC-plastoglobuli and CLD, respectively, with only eight common proteins. Dunaliella spp. CLD resemble cytoplasmic droplets from Chlamydomonas reinhardtii and contain major lipid droplet-associated protein and enzymes involved in lipid and sterol metabolism. The βC-plastoglobuli proteome resembles the C. reinhardtii eyespot and Arabidopsis (Arabidopsis thaliana) plastoglobule proteomes and contains carotene-globule-associated protein, plastid-lipid-associated protein-fibrillins, SOUL heme-binding proteins, phytyl ester synthases, β-carotene biosynthesis enzymes, and proteins involved in membrane remodeling/lipid droplet biogenesis: VESICLE-INDUCING PLASTID PROTEIN1, synaptotagmin, and the eyespot assembly proteins EYE3 and SOUL3. Based on these and previous results, we propose models for the biogenesis of βC-plastoglobuli and the biosynthesis of β-carotene within βC-plastoglobuli and hypothesize that βC-plastoglobuli evolved from eyespot lipid droplets. PMID:25404729
Metabolomics method to comprehensively analyze amino acids in different domains.
Gu, Haiwei; Du, Jianhai; Carnevale Neto, Fausto; Carroll, Patrick A; Turner, Sally J; Chiorean, E Gabriela; Eisenman, Robert N; Raftery, Daniel
2015-04-21
Amino acids play essential roles in both metabolism and the proteome. Many studies have profiled free amino acids (FAAs) or proteins; however, few have connected the measurement of FAA with individual amino acids in the proteome. In this study, we developed a metabolomics method to comprehensively analyze amino acids in different domains, using two examples of different sample types and disease models. We first examined the responses of FAAs and insoluble-proteome amino acids (IPAAs) to the Myc oncogene in Tet21N human neuroblastoma cells. The metabolic and proteomic amino acid profiles were quite different, even under the same Myc condition, and their combination provided a better understanding of the biological status. In addition, amino acids were measured in 3 domains (FAAs, free and soluble-proteome amino acids (FSPAAs), and IPAAs) to study changes in serum amino acid profiles related to colon cancer. A penalized logistic regression model based on the amino acids from the three domains had better sensitivity and specificity than that from each individual domain. To the best of our knowledge, this is the first study to perform a combined analysis of amino acids in different domains, and indicates the useful biological information available from a metabolomics analysis of the protein pellet. This study lays the foundation for further quantitative tracking of the distribution of amino acids in different domains, with opportunities for better diagnosis and mechanistic studies of various diseases.
An object model and database for functional genomics.
Jones, Andrew; Hunt, Ela; Wastling, Jonathan M; Pizarro, Angel; Stoeckert, Christian J
2004-07-10
Large-scale functional genomics analysis is now feasible and presents significant challenges in data analysis, storage and querying. Data standards are required to enable the development of public data repositories and to improve data sharing. There is an established data format for microarrays (microarray gene expression markup language, MAGE-ML) and a draft standard for proteomics (PEDRo). We believe that all types of functional genomics experiments should be annotated in a consistent manner, and we hope to open up new ways of comparing multiple datasets used in functional genomics. We have created a functional genomics experiment object model (FGE-OM), developed from the microarray model, MAGE-OM and two models for proteomics, PEDRo and our own model (Gla-PSI-Glasgow Proposal for the Proteomics Standards Initiative). FGE-OM comprises three namespaces representing (i) the parts of the model common to all functional genomics experiments; (ii) microarray-specific components; and (iii) proteomics-specific components. We believe that FGE-OM should initiate discussion about the contents and structure of the next version of MAGE and the future of proteomics standards. A prototype database called RNA And Protein Abundance Database (RAPAD), based on FGE-OM, has been implemented and populated with data from microbial pathogenesis. FGE-OM and the RAPAD schema are available from http://www.gusdb.org/fge.html, along with a set of more detailed diagrams. RAPAD can be accessed by registration at the site.
Cuadrat, Rafael R C; da Serra Cruz, Sérgio Manuel; Tschoeke, Diogo Antônio; Silva, Edno; Tosta, Frederico; Jucá, Henrique; Jardim, Rodrigo; Campos, Maria Luiza M; Mattoso, Marta; Dávila, Alberto M R
2014-08-01
A key focus in 21(st) century integrative biology and drug discovery for neglected tropical and other diseases has been the use of BLAST-based computational methods for identification of orthologous groups in pathogenic organisms to discern orthologs, with a view to evaluate similarities and differences among species, and thus allow the transfer of annotation from known/curated proteins to new/non-annotated ones. We used here a profile-based sensitive methodology to identify distant homologs, coupled to the NCBI's COG (Unicellular orthologs) and KOG (Eukaryote orthologs), permitting us to perform comparative genomics analyses on five protozoan genomes. OrthoSearch was used in five protozoan proteomes showing that 3901 and 7473 orthologs can be identified by comparison with COG and KOG proteomes, respectively. The core protozoa proteome inferred was 418 Protozoa-COG orthologous groups and 704 Protozoa-KOG orthologous groups: (i) 31.58% (132/418) belongs to the category J (translation, ribosomal structure, and biogenesis), and 9.81% (41/418) to the category O (post-translational modification, protein turnover, chaperones) using COG; (ii) 21.45% (151/704) belongs to the categories J, and 13.92% (98/704) to the O using KOG. The phylogenomic analysis showed four well-supported clades for Eukarya, discriminating Multicellular [(i) human, fly, plant and worm] and Unicellular [(ii) yeast, (iii) fungi, and (iv) protozoa] species. These encouraging results attest to the usefulness of the profile-based methodology for comparative genomics to accelerate semi-automatic re-annotation, especially of the protozoan proteomes. This approach may also lend itself for applications in global health, for example, in the case of novel drug target discovery against pathogenic organisms previously considered difficult to research with traditional drug discovery tools.
Cuadrat, Rafael R. C.; da Serra Cruz, Sérgio Manuel; Tschoeke, Diogo Antônio; Silva, Edno; Tosta, Frederico; Jucá, Henrique; Jardim, Rodrigo; Campos, Maria Luiza M.; Mattoso, Marta
2014-01-01
Abstract A key focus in 21st century integrative biology and drug discovery for neglected tropical and other diseases has been the use of BLAST-based computational methods for identification of orthologous groups in pathogenic organisms to discern orthologs, with a view to evaluate similarities and differences among species, and thus allow the transfer of annotation from known/curated proteins to new/non-annotated ones. We used here a profile-based sensitive methodology to identify distant homologs, coupled to the NCBI's COG (Unicellular orthologs) and KOG (Eukaryote orthologs), permitting us to perform comparative genomics analyses on five protozoan genomes. OrthoSearch was used in five protozoan proteomes showing that 3901 and 7473 orthologs can be identified by comparison with COG and KOG proteomes, respectively. The core protozoa proteome inferred was 418 Protozoa-COG orthologous groups and 704 Protozoa-KOG orthologous groups: (i) 31.58% (132/418) belongs to the category J (translation, ribosomal structure, and biogenesis), and 9.81% (41/418) to the category O (post-translational modification, protein turnover, chaperones) using COG; (ii) 21.45% (151/704) belongs to the categories J, and 13.92% (98/704) to the O using KOG. The phylogenomic analysis showed four well-supported clades for Eukarya, discriminating Multicellular [(i) human, fly, plant and worm] and Unicellular [(ii) yeast, (iii) fungi, and (iv) protozoa] species. These encouraging results attest to the usefulness of the profile-based methodology for comparative genomics to accelerate semi-automatic re-annotation, especially of the protozoan proteomes. This approach may also lend itself for applications in global health, for example, in the case of novel drug target discovery against pathogenic organisms previously considered difficult to research with traditional drug discovery tools. PMID:24960463
Cho, Jin-Young; Lee, Hyoung-Joo; Jeong, Seul-Ki; Paik, Young-Ki
2017-12-01
Mass spectrometry (MS) is a widely used proteome analysis tool for biomedical science. In an MS-based bottom-up proteomic approach to protein identification, sequence database (DB) searching has been routinely used because of its simplicity and convenience. However, searching a sequence DB with multiple variable modification options can increase processing time, false-positive errors in large and complicated MS data sets. Spectral library searching is an alternative solution, avoiding the limitations of sequence DB searching and allowing the detection of more peptides with high sensitivity. Unfortunately, this technique has less proteome coverage, resulting in limitations in the detection of novel and whole peptide sequences in biological samples. To solve these problems, we previously developed the "Combo-Spec Search" method, which uses manually multiple references and simulated spectral library searching to analyze whole proteomes in a biological sample. In this study, we have developed a new analytical interface tool called "Epsilon-Q" to enhance the functions of both the Combo-Spec Search method and label-free protein quantification. Epsilon-Q performs automatically multiple spectral library searching, class-specific false-discovery rate control, and result integration. It has a user-friendly graphical interface and demonstrates good performance in identifying and quantifying proteins by supporting standard MS data formats and spectrum-to-spectrum matching powered by SpectraST. Furthermore, when the Epsilon-Q interface is combined with the Combo-Spec search method, called the Epsilon-Q system, it shows a synergistic function by outperforming other sequence DB search engines for identifying and quantifying low-abundance proteins in biological samples. The Epsilon-Q system can be a versatile tool for comparative proteome analysis based on multiple spectral libraries and label-free quantification.
Biochemical Markers of Brain Injury: An Integrated Proteomics-Based Approach
2006-02-01
Anthony J Williams, X-C May Lu, Renwu Chen, Zhilin Liao, Rebeca Connors, Kevin K Wang, Ron L Hayes, Frank C Tortella, Jitendra R Dave. High throughput... YANG , A., et al. (2002). Evalu- ation of two-dimensional differential gel electrophoresis for proteomic expression analysis of a model breast cancer cell...apoptosis. J. Biol. Chem. 279, 1030–1039. Kuida K., Zheng T. S., Na S., Kuan C., Yang D., Karasuyama H., Rakic P. and Flavell R. A. (1996) Decreased apoptosis
Characterization of individual mouse cerebrospinal fluid proteomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, Jeffrey S.; Angel, Thomas E.; Chavkin, Charles
2014-03-20
Analysis of cerebrospinal fluid (CSF) offers key insight into the status of the central nervous system. Characterization of murine CSF proteomes can provide a valuable resource for studying central nervous system injury and disease in animal models. However, the small volume of CSF in mice has thus far limited individual mouse proteome characterization. Through non-terminal CSF extractions in C57Bl/6 mice and high-resolution liquid chromatography-mass spectrometry analysis of individual murine samples, we report the most comprehensive proteome characterization of individual murine CSF to date. Utilizing stringent protein inclusion criteria that required the identification of at least two unique peptides (1% falsemore » discovery rate at the peptide level) we identified a total of 566 unique proteins, including 128 proteins from three individual CSF samples that have been previously identified in brain tissue. Our methods and analysis provide a mechanism for individual murine CSF proteome analysis.« less
Application of Large-Scale Aptamer-Based Proteomic Profiling to Planned Myocardial Infarctions.
Jacob, Jaison; Ngo, Debby; Finkel, Nancy; Pitts, Rebecca; Gleim, Scott; Benson, Mark D; Keyes, Michelle J; Farrell, Laurie A; Morgan, Thomas; Jennings, Lori L; Gerszten, Robert E
2018-03-20
Emerging proteomic technologies using novel affinity-based reagents allow for efficient multiplexing with high-sample throughput. To identify early biomarkers of myocardial injury, we recently applied an aptamer-based proteomic profiling platform that measures 1129 proteins to samples from patients undergoing septal alcohol ablation for hypertrophic cardiomyopathy, a human model of planned myocardial injury. Here, we examined the scalability of this approach using a markedly expanded platform to study a far broader range of human proteins in the context of myocardial injury. We applied a highly multiplexed, expanded proteomic technique that uses single-stranded DNA aptamers to assay 4783 human proteins (4137 distinct human gene targets) to derivation and validation cohorts of planned myocardial injury, individuals with spontaneous myocardial infarction, and at-risk controls. We found 376 target proteins that significantly changed in the blood after planned myocardial injury in a derivation cohort (n=20; P <1.05E-05, 1-way repeated measures analysis of variance, Bonferroni threshold). Two hundred forty-seven of these proteins were validated in an independent planned myocardial injury cohort (n=15; P <1.33E-04, 1-way repeated measures analysis of variance); >90% were directionally consistent and reached nominal significance in the validation cohort. Among the validated proteins that were increased within 1 hour after planned myocardial injury, 29 were also elevated in patients with spontaneous myocardial infarction (n=63; P <6.17E-04). Many of the novel markers identified in our study are intracellular proteins not previously identified in the peripheral circulation or have functional roles relevant to myocardial injury. For example, the cardiac LIM protein, cysteine- and glycine-rich protein 3, is thought to mediate cardiac mechanotransduction and stress responses, whereas the mitochondrial ATP synthase F 0 subunit component is a vasoactive peptide on its release from cells. Last, we performed aptamer-affinity enrichment coupled with mass spectrometry to technically verify aptamer specificity for a subset of the new biomarkers. Our results demonstrate the feasibility of large-scale aptamer multiplexing at a level that has not previously been reported and with sample throughput that greatly exceeds other existing proteomic methods. The expanded aptamer-based proteomic platform provides a unique opportunity for biomarker and pathway discovery after myocardial injury. © 2017 American Heart Association, Inc.
Proteomics Analysis of the Nucleolus in Adenovirus-infected Cells
Lam, Yun W.; Evans, Vanessa C.; Heesom, Kate J.; Lamond, Angus I.; Matthews, David A.
2010-01-01
Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined. PMID:19812395
Proteomics analysis of the nucleolus in adenovirus-infected cells.
Lam, Yun W; Evans, Vanessa C; Heesom, Kate J; Lamond, Angus I; Matthews, David A
2010-01-01
Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined.
PIQMIe: a web server for semi-quantitative proteomics data management and analysis
Kuzniar, Arnold; Kanaar, Roland
2014-01-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. PMID:24861615
Proteome reference map and regulation network of neonatal rat cardiomyocyte
Li, Zi-jian; Liu, Ning; Han, Qi-de; Zhang, You-yi
2011-01-01
Aim: To study and establish a proteome reference map and regulation network of neonatal rat cardiomyocyte. Methods: Cultured cardiomyocytes of neonatal rats were used. All proteins expressed in the cardiomyocytes were separated and identified by two-dimensional polyacrylamide gel electrophoresis (2-DE) and matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS). Biological networks and pathways of the neonatal rat cardiomyocytes were analyzed using the Ingenuity Pathway Analysis (IPA) program (www.ingenuity.com). A 2-DE database was made accessible on-line by Make2ddb package on a web server. Results: More than 1000 proteins were separated on 2D gels, and 148 proteins were identified. The identified proteins were used for the construction of an extensible markup language-based database. Biological networks and pathways were constructed to analyze the functions associate with cardiomyocyte proteins in the database. The 2-DE database of rat cardiomyocyte proteins can be accessed at http://2d.bjmu.edu.cn. Conclusion: A proteome reference map and regulation network of the neonatal rat cardiomyocytes have been established, which may serve as an international platform for storage, analysis and visualization of cardiomyocyte proteomic data. PMID:21841810
PIQMIe: a web server for semi-quantitative proteomics data management and analysis.
Kuzniar, Arnold; Kanaar, Roland
2014-07-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Parsing and Quantification of Raw Orbitrap Mass Spectrometer Data Using RawQuant.
Kovalchik, Kevin A; Moggridge, Sophie; Chen, David D Y; Morin, Gregg B; Hughes, Christopher S
2018-06-01
Effective analysis of protein samples by mass spectrometry (MS) requires careful selection and optimization of a range of experimental parameters. As the output from the primary detection device, the "raw" MS data file can be used to gauge the success of a given sample analysis. However, the closed-source nature of the standard raw MS file can complicate effective parsing of the data contained within. To ease and increase the range of analyses possible, the RawQuant tool was developed to enable parsing of raw MS files derived from Thermo Orbitrap instruments to yield meta and scan data in an openly readable text format. RawQuant can be commanded to export user-friendly files containing MS 1 , MS 2 , and MS 3 metadata as well as matrices of quantification values based on isobaric tagging approaches. In this study, the utility of RawQuant is demonstrated in several scenarios: (1) reanalysis of shotgun proteomics data for the identification of the human proteome, (2) reanalysis of experiments utilizing isobaric tagging for whole-proteome quantification, and (3) analysis of a novel bacterial proteome and synthetic peptide mixture for assessing quantification accuracy when using isobaric tags. Together, these analyses successfully demonstrate RawQuant for the efficient parsing and quantification of data from raw Thermo Orbitrap MS files acquired in a range of common proteomics experiments. In addition, the individual analyses using RawQuant highlights parametric considerations in the different experimental sets and suggests targetable areas to improve depth of coverage in identification-focused studies and quantification accuracy when using isobaric tags.
An, SuFang; Gong, FangPing; Wang, Wei
2012-01-01
Pinellia ternata tuber is one of the well-known Chinese traditional medicines. In order to understand the pharmacological properties of tuber proteins, it is necessary to perform proteome analysis of P. ternata tubers. However, a few high-abundance proteins (HAPs), mainly mannose-binding lectin (agglutinin), exist in aggregates of various sizes in the tubers and seriously interfere with proteome profiling by two-dimensional electrophoresis (2-DE). Therefore, selective depletion of these HAPs is a prerequisite for enhanced proteome analysis of P. ternata tubers. Based on differential protein solubility, we developed a novel protocol involving two sequential extractions for depletion of some HAPs and prefractionation of tuber proteins prior to 2-DE. The first extraction using 10% acetic acid selectively extracted acid-soluble HAPs and the second extraction using the SDS-containing buffer extracted remaining acid-insoluble proteins. After application of the protocol, 2-DE profiles of P. ternata tuber proteins were greatly improved and more protein spots were detected, especially low-abundance proteins. Moreover, the subunit composition of P. ternata lectin was analyzed by electrophoresis. Native lectin consists of two hydrogen-bonded subunits (11 kDa and 25 kDa) and the 11 kDa subunit was a glycoprotein. Subsequently, major HAPs in the tubers were analyzed by mass spectrometry, with nine protein spots being identified as lectin isoforms. The methodology was easy to perform and required no specialized apparatus. It would be useful for proteome analysis of other tuber plants of Araceae. PMID:23185632
Wu, Xiaolin; Xiong, Erhui; An, Sufang; Gong, Fangping; Wang, Wei
2012-01-01
Pinellia ternata tuber is one of the well-known Chinese traditional medicines. In order to understand the pharmacological properties of tuber proteins, it is necessary to perform proteome analysis of P. ternata tubers. However, a few high-abundance proteins (HAPs), mainly mannose-binding lectin (agglutinin), exist in aggregates of various sizes in the tubers and seriously interfere with proteome profiling by two-dimensional electrophoresis (2-DE). Therefore, selective depletion of these HAPs is a prerequisite for enhanced proteome analysis of P. ternata tubers. Based on differential protein solubility, we developed a novel protocol involving two sequential extractions for depletion of some HAPs and prefractionation of tuber proteins prior to 2-DE. The first extraction using 10% acetic acid selectively extracted acid-soluble HAPs and the second extraction using the SDS-containing buffer extracted remaining acid-insoluble proteins. After application of the protocol, 2-DE profiles of P. ternata tuber proteins were greatly improved and more protein spots were detected, especially low-abundance proteins. Moreover, the subunit composition of P. ternata lectin was analyzed by electrophoresis. Native lectin consists of two hydrogen-bonded subunits (11 kDa and 25 kDa) and the 11 kDa subunit was a glycoprotein. Subsequently, major HAPs in the tubers were analyzed by mass spectrometry, with nine protein spots being identified as lectin isoforms. The methodology was easy to perform and required no specialized apparatus. It would be useful for proteome analysis of other tuber plants of Araceae.
[Methods of quantitative proteomics].
Kopylov, A T; Zgoda, V G
2007-01-01
In modern science proteomic analysis is inseparable from other fields of systemic biology. Possessing huge resources quantitative proteomics operates colossal information on molecular mechanisms of life. Advances in proteomics help researchers to solve complex problems of cell signaling, posttranslational modification, structure and functional homology of proteins, molecular diagnostics etc. More than 40 various methods have been developed in proteomics for quantitative analysis of proteins. Although each method is unique and has certain advantages and disadvantages all these use various isotope labels (tags). In this review we will consider the most popular and effective methods employing both chemical modifications of proteins and also metabolic and enzymatic methods of isotope labeling.
PROTICdb: a web-based application to store, track, query, and compare plant proteome data.
Ferry-Dumazet, Hélène; Houel, Gwenn; Montalent, Pierre; Moreau, Luc; Langella, Olivier; Negroni, Luc; Vincent, Delphine; Lalanne, Céline; de Daruvar, Antoine; Plomion, Christophe; Zivy, Michel; Joets, Johann
2005-05-01
PROTICdb is a web-based application, mainly designed to store and analyze plant proteome data obtained by two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) and mass spectrometry (MS). The purposes of PROTICdb are (i) to store, track, and query information related to proteomic experiments, i.e., from tissue sampling to protein identification and quantitative measurements, and (ii) to integrate information from the user's own expertise and other sources into a knowledge base, used to support data interpretation (e.g., for the determination of allelic variants or products of post-translational modifications). Data insertion into the relational database of PROTICdb is achieved either by uploading outputs of image analysis and MS identification software, or by filling web forms. 2-D PAGE annotated maps can be displayed, queried, and compared through a graphical interface. Links to external databases are also available. Quantitative data can be easily exported in a tabulated format for statistical analyses. PROTICdb is based on the Oracle or the PostgreSQL Database Management System and is freely available upon request at the following URL: http://moulon.inra.fr/ bioinfo/PROTICdb.
Proteome analysis in the assessment of ageing.
Nkuipou-Kenfack, Esther; Koeck, Thomas; Mischak, Harald; Pich, Andreas; Schanstra, Joost P; Zürbig, Petra; Schumacher, Björn
2014-11-01
Based on demographic trends, the societies in many developed countries are facing an increasing number and proportion of people over the age of 65. The raise in elderly populations along with improved health-care will be concomitant with an increased prevalence of ageing-associated chronic conditions like cardiovascular, renal, and respiratory diseases, arthritis, dementia, and diabetes mellitus. This is expected to pose unprecedented challenges both for individuals and societies and their health care systems. An ultimate goal of ageing research is therefore the understanding of physiological ageing and the achievement of 'healthy' ageing by decreasing age-related pathologies. However, on a molecular level, ageing is a complex multi-mechanistic process whose contributing factors may vary individually, partly overlap with pathological alterations, and are often poorly understood. Proteome analysis potentially allows modelling of these multifactorial processes. This review summarises recent proteomic research on age-related changes identified in animal models and human studies. We combined this information with pathway analysis to identify molecular mechanisms associated with ageing. We identified some molecular pathways that are affected in most or even all organs and others that are organ-specific. However, appropriately powered studies are needed to confirm these findings based in in silico evaluation. Copyright © 2014 Elsevier B.V. All rights reserved.
Luo, Yanzhang; Mok, Tin Seak; Lin, Xiuxian; Zhang, Wanling; Cui, Yizhi; Guo, Jiahui; Chen, Xing; Zhang, Tao; Wang, Tong
2017-01-01
Nasopharyngeal carcinoma (NPC) is a serious threat to public health, and the biomarker discovery is of urgent needs. The data-independent mode (DIA) based sequential window acquisition of all theoretical fragment-ion spectra (SWATH) mass spectrometry (MS) has been proved to be precise in protein quantitation and efficient for cancer biomarker researches. In this study, we performed the first SWATH-MS analysis comparing the NPC and normal tissues. Spike-in stable isotope labeling by amino acids in cell culture (super-SILAC) MS was used as a shotgun reference. We identified and quantified 1414 proteins across all SWATH-MS analyses. We found that SWATH-MS had a unique feature to preferentially detect proteins with smaller molecular weights than either super-SILAC MS or human proteome background. With SWATH-MS, 29 significant differentially express proteins (DEPs) were identified. Among them, carbonic anhydrase 2 (CA2) was selected for further validation per novelty, MS quality and other supporting rationale. With the tissue microarray analysis, we found that CA2 had an AUC of 0.94 in differentiating NPC from normal tissue samples. In conclusion, SWATH-MS has unique features in proteome analysis, and it leads to the identification of CA2 as a potentially new diagnostic biomarker for NPC. PMID:28117408
Yan, Guokai; Lestari, Retno; Long, Baisheng; Fan, Qiwen; Wang, Zhichang; Guo, Xiaozhen; Yu, Jie; Hu, Jun; Yang, Xingya; Chen, Changqing; Liu, Lu; Li, Xiuzhi; Purnomoadi, Agung; Achmadi, Joelal; Yan, Xianghua
2016-03-17
L-Arginine (Arg) is a versatile amino acid that plays crucial roles in a wide range of physiological and pathological processes. In this study, to investigate the alteration induced by Arg supplementation in proteome scale, isobaric tags for relative and absolute quantification (iTRAQ) based proteomic approach was employed to comparatively characterize the differentially expressed proteins between Arg deprivation (Ctrl) and Arg supplementation (+Arg) treated human liver hepatocellular carcinoma (HepG2) cells. A total of 21 proteins were identified as differentially expressed proteins and these 21 proteins were all up-regulated by Arg supplementation. Six amino acid metabolism-related proteins, mostly metabolic enzymes, showed differential expressions. Intriguingly, Ingenuity Pathway Analysis (IPA) based pathway analysis suggested that the three ethanol degradation pathways were significantly altered between Ctrl and +Arg. Western blotting and enzymatic activity assays validated that the key enzymes ADH1C, ALDH1A1, and ALDH2, which are mainly involved in ethanol degradation pathways, were highly differentially expressed, and activated between Ctrl and +Arg in HepG2 cells. Furthermore, 10 mM Arg significantly attenuated the cytotoxicity induced by 100 mM ethanol treatment (P < 0.0001). This study is the first time to reveal that Arg activates ethanol degradation pathways in HepG2 cells.
Zhang, Bo; Pirmoradian, Mohammad; Chernobrovkin, Alexey; Zubarev, Roman A.
2014-01-01
Based on conventional data-dependent acquisition strategy of shotgun proteomics, we present a new workflow DeMix, which significantly increases the efficiency of peptide identification for in-depth shotgun analysis of complex proteomes. Capitalizing on the high resolution and mass accuracy of Orbitrap-based tandem mass spectrometry, we developed a simple deconvolution method of “cloning” chimeric tandem spectra for cofragmented peptides. Additional to a database search, a simple rescoring scheme utilizes mass accuracy and converts the unwanted cofragmenting events into a surprising advantage of multiplexing. With the combination of cloning and rescoring, we obtained on average nine peptide-spectrum matches per second on a Q-Exactive workbench, whereas the actual MS/MS acquisition rate was close to seven spectra per second. This efficiency boost to 1.24 identified peptides per MS/MS spectrum enabled analysis of over 5000 human proteins in single-dimensional LC-MS/MS shotgun experiments with an only two-hour gradient. These findings suggest a change in the dominant “one MS/MS spectrum - one peptide” paradigm for data acquisition and analysis in shotgun data-dependent proteomics. DeMix also demonstrated higher robustness than conventional approaches in terms of lower variation among the results of consecutive LC-MS/MS runs. PMID:25100859
MPA Portable: A Stand-Alone Software Package for Analyzing Metaproteome Samples on the Go.
Muth, Thilo; Kohrs, Fabian; Heyer, Robert; Benndorf, Dirk; Rapp, Erdmann; Reichl, Udo; Martens, Lennart; Renard, Bernhard Y
2018-01-02
Metaproteomics, the mass spectrometry-based analysis of proteins from multispecies samples faces severe challenges concerning data analysis and results interpretation. To overcome these shortcomings, we here introduce the MetaProteomeAnalyzer (MPA) Portable software. In contrast to the original server-based MPA application, this newly developed tool no longer requires computational expertise for installation and is now independent of any relational database system. In addition, MPA Portable now supports state-of-the-art database search engines and a convenient command line interface for high-performance data processing tasks. While search engine results can easily be combined to increase the protein identification yield, an additional two-step workflow is implemented to provide sufficient analysis resolution for further postprocessing steps, such as protein grouping as well as taxonomic and functional annotation. Our new application has been developed with a focus on intuitive usability, adherence to data standards, and adaptation to Web-based workflow platforms. The open source software package can be found at https://github.com/compomics/meta-proteome-analyzer .
Mass spectrometry based proteomics: existing capabilities and future directions
Angel, Thomas E.; Aryal, Uma K.; Hengel, Shawna M.; Baker, Erin S.; Kelly, Ryan T.; Robinson, Errol W.; Smith, Richard D.
2012-01-01
Mass spectrometry (MS)-based proteomics is emerging as a broadly effective means for identification, characterization, and quantification of proteins that are integral components of the processes essential for life. Characterization of proteins at the proteome and sub-proteome (e.g., the phosphoproteome, proteoglycome, or degradome/peptidome) levels provides a foundation for understanding fundamental aspects of biology. Emerging technologies such as ion mobility separations coupled with MS and microchip-based-proteome measurements combined with MS instrumentation and chromatographic separation techniques, such as nanoscale reversed phase liquid chromatography and capillary electrophoresis, show great promise for both broad undirected and targeted highly sensitive measurements. MS-based proteomics is increasingly contribute to our understanding of the dynamics, interactions, and roles that proteins and peptides play, advancing our understanding of biology on a systems wide level for a wide range of applications including investigations of microbial communities, bioremediation, and human health. PMID:22498958
Reddy, Panga Jaipal; Sinha, Sneha; Ray, Sandipan; Sathe, Gajanan J.; Chatterjee, Aditi; Prasad, T. S. Keshava; Dhali, Snigdha; Srikanth, Rapole; Panda, Dulal; Srivastava, Sanjeeva
2015-01-01
Curcumin is a natural dietary compound with antimicrobial activity against various gram positive and negative bacteria. This study aims to investigate the proteome level alterations in Bacillus subtilis due to curcumin treatment and identification of its molecular/cellular targets to understand the mechanism of action. We have performed a comprehensive proteomic analysis of B. subtilis AH75 strain at different time intervals of curcumin treatment (20, 60 and 120 min after the drug exposure, three replicates) to compare the protein expression profiles using two complementary quantitative proteomic techniques, 2D-DIGE and iTRAQ. To the best of our knowledge, this is the first comprehensive longitudinal investigation describing the effect of curcumin treatment on B. subtilis proteome. The proteomics analysis revealed several interesting targets such UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1, putative septation protein SpoVG and ATP-dependent Clp protease proteolytic subunit. Further, in silico pathway analysis using DAVID and KOBAS has revealed modulation of pathways related to the fatty acid metabolism and cell wall synthesis, which are crucial for cell viability. Our findings revealed that curcumin treatment lead to inhibition of the cell wall and fatty acid synthesis in addition to differential expression of many crucial proteins involved in modulation of bacterial metabolism. Findings obtained from proteomics analysis were further validated using 5-cyano-2,3-ditolyl tetrazolium chloride (CTC) assay for respiratory activity, resazurin assay for metabolic activity and membrane integrity assay by potassium and inorganic phosphate leakage measurement. The gene expression analysis of selected cell wall biosynthesis enzymes has strengthened the proteomics findings and indicated the major effect of curcumin on cell division. PMID:25874956
Reddy, Panga Jaipal; Sinha, Sneha; Ray, Sandipan; Sathe, Gajanan J; Chatterjee, Aditi; Prasad, T S Keshava; Dhali, Snigdha; Srikanth, Rapole; Panda, Dulal; Srivastava, Sanjeeva
2015-01-01
Curcumin is a natural dietary compound with antimicrobial activity against various gram positive and negative bacteria. This study aims to investigate the proteome level alterations in Bacillus subtilis due to curcumin treatment and identification of its molecular/cellular targets to understand the mechanism of action. We have performed a comprehensive proteomic analysis of B. subtilis AH75 strain at different time intervals of curcumin treatment (20, 60 and 120 min after the drug exposure, three replicates) to compare the protein expression profiles using two complementary quantitative proteomic techniques, 2D-DIGE and iTRAQ. To the best of our knowledge, this is the first comprehensive longitudinal investigation describing the effect of curcumin treatment on B. subtilis proteome. The proteomics analysis revealed several interesting targets such UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1, putative septation protein SpoVG and ATP-dependent Clp protease proteolytic subunit. Further, in silico pathway analysis using DAVID and KOBAS has revealed modulation of pathways related to the fatty acid metabolism and cell wall synthesis, which are crucial for cell viability. Our findings revealed that curcumin treatment lead to inhibition of the cell wall and fatty acid synthesis in addition to differential expression of many crucial proteins involved in modulation of bacterial metabolism. Findings obtained from proteomics analysis were further validated using 5-cyano-2,3-ditolyl tetrazolium chloride (CTC) assay for respiratory activity, resazurin assay for metabolic activity and membrane integrity assay by potassium and inorganic phosphate leakage measurement. The gene expression analysis of selected cell wall biosynthesis enzymes has strengthened the proteomics findings and indicated the major effect of curcumin on cell division.
Wei, Dahai; Zhang, Xiaobo
2010-01-01
The virus-host interaction is essential to understanding the role that viruses play in ecological and geochemical processes in deep-sea vent ecosystems. Virus-induced changes in cellular gene expression and host physiology have been studied extensively. However, the molecular mechanism of interaction between a bacteriophage and its host at high temperature remains poorly understood. In the present study, the virus-induced gene expression profile of Geobacillus sp. E263, a thermophile isolated from a deep-sea hydrothermal ecosystem, was characterized. Based on proteomic analysis and random arbitrarily primed PCR (RAP-PCR) of Geobacillus sp. E263 cultured under non-bacteriophage GVE2 infection and GVE2 infection conditions, there were two types of protein/gene profiles in response to GVE2 infection. Twenty differentially expressed genes and proteins were revealed that could be grouped into 3 different categories based on cellular function, suggesting a coordinated response to infection. These differentially expressed genes and proteins were further confirmed by Northern blot analysis. To characterize the host proteins in response to virus infection, aspartate aminotransferase (AST) was inactivated to construct the AST mutant of Geobacillus sp. E263. The results showed that the AST protein was essential in virus infection. Thus, transcriptional and proteomic analyses and functional analysis revealed previously unknown host responses to deep-sea thermophilic virus infection. PMID:20015994
Yi, Zhou; Manil-Ségalen, Marion; Sago, Laila; Glatigny, Annie; Redeker, Virginie; Legouis, Renaud; Mucchielli-Giorgi, Marie-Hélène
2016-05-06
Affinity purifications followed by mass spectrometric analysis are used to identify protein-protein interactions. Because quantitative proteomic data are noisy, it is necessary to develop statistical methods to eliminate false-positives and identify true partners. We present here a novel approach for filtering false interactors, named "SAFER" for mass Spectrometry data Analysis by Filtering of Experimental Replicates, which is based on the reproducibility of the replicates and the fold-change of the protein intensities between bait and control. To identify regulators or targets of autophagy, we characterized the interactors of LGG1, a ubiquitin-like protein involved in autophagosome formation in C. elegans. LGG-1 partners were purified by affinity, analyzed by nanoLC-MS/MS mass spectrometry, and quantified by a label-free proteomic approach based on the mass spectrometric signal intensity of peptide precursor ions. Because the selection of confident interactions depends on the method used for statistical analysis, we compared SAFER with several statistical tests and different scoring algorithms on this set of data. We show that SAFER recovers high-confidence interactors that have been ignored by the other methods and identified new candidates involved in the autophagy process. We further validated our method on a public data set and conclude that SAFER notably improves the identification of protein interactors.
Label free quantitative proteomics analysis on the cisplatin resistance in ovarian cancer cells.
Wang, F; Zhu, Y; Fang, S; Li, S; Liu, S
2017-05-20
Quantitative proteomics has been made great progress in recent years. Label free quantitative proteomics analysis based on the mass spectrometry is widely used. Using this technique, we determined the differentially expressed proteins in the cisplatin-sensitive ovarian cancer cells COC1 and cisplatin-resistant cells COC1/DDP before and after the application of cisplatin. Using the GO analysis, we classified those proteins into different subgroups bases on their cellular component, biological process, and molecular function. We also used KEGG pathway analysis to determine the key signal pathways that those proteins were involved in. There are 710 differential proteins between COC1 and COC1/DDP cells, 783 between COC1 and COC1/DDP cells treated with cisplatin, 917 between the COC1/DDP cells and COC1/DDP cells treated with LaCl3, 775 between COC1/DDP cells treated with cisplatin and COC1/DDP cells treated with cisplatin and LaCl3. Among the same 411 differentially expressed proteins in cisplatin-sensitive COC1 cells and cisplain-resistant COC1/DDP cells before and after cisplatin treatment, 14% of them were localized on the cell membrane. According to the KEGG results, differentially expressed proteins were classified into 21 groups. The most abundant proteins were involved in spliceosome. This study lays a foundation for deciphering the mechanism for drug resistance in ovarian tumor.
Moon, Yoon-Jung; Kwon, Joseph; Yun, Sung-Ho; Lim, Hye Li; Kim, Jonghyun; Kim, Soo Jung; Kang, Sung Gyun; Lee, Jung-Hyun; Kim, Seung Il; Chung, Young-Ho
2015-01-01
The hyperthermophilic archaeon Thermococcus onnurineus NA1 has been shown to produce H2 when using CO, formate, or starch as a growth substrate. This strain can also utilize elemental sulfur as a terminal electron acceptor for heterotrophic growth. To gain insight into sulfur metabolism, the proteome of T. onnurineus NA1 cells grown under sulfur culture conditions was quantified and compared with those grown under H2-evolving substrate culture conditions. Using label-free nano-UPLC-MSE-based comparative proteomic analysis, approximately 38.4% of the total identified proteome (589 proteins) was found to be significantly up-regulated (≥1.5-fold) under sulfur culture conditions. Many of these proteins were functionally associated with carbon fixation, Fe–S cluster biogenesis, ATP synthesis, sulfur reduction, protein glycosylation, protein translocation, and formate oxidation. Based on the abundances of the identified proteins in this and other genomic studies, the pathways associated with reductive sulfur metabolism, H2-metabolism, and oxidative stress defense were proposed. The results also revealed markedly lower expression levels of enzymes involved in the sulfur assimilation pathway, as well as cysteine desulfurase, under sulfur culture condition. The present results provide the first global atlas of proteome changes triggered by sulfur, and may facilitate an understanding of how hyperthermophilic archaea adapt to sulfur-rich, extreme environments. PMID:25915030
He, Ruifeng; Kim, Min-Jeong; Nelson, William; Balbuena, Tiago S; Kim, Ryan; Kramer, Robin; Crow, John A; May, Greg D; Thelen, Jay J; Soderlund, Carol A; Gang, David R
2012-02-01
The common reed (Phragmites australis), one of the most widely distributed of all angiosperms, uses its rhizomes (underground stems) to invade new territory, making it one of the most successful weedy species worldwide. Characterization of the rhizome transcriptome and proteome is needed to identify candidate genes and proteins involved in rhizome growth, development, metabolism, and invasiveness. We employed next-generation sequencing technologies including 454 and Illumina platforms to characterize the reed rhizome transcriptome and used quantitative proteomics techniques to identify the rhizome proteome. Combining 336514 Roche 454 Titanium reads and 103350802 Illumina paired-end reads in a de novo hybrid assembly yielded 124450 unique transcripts with an average length of 549 bp, of which 54317 were annotated. Rhizome-specific and differentially expressed transcripts were identified between rhizome apical tips (apical meristematic region) and rhizome elongation zones. A total of 1280 nonredundant proteins were identified and quantified using GeLC-MS/MS based label-free proteomics, where 174 and 77 proteins were preferentially expressed in the rhizome elongation zone and apical tip tissues, respectively. Genes involved in allelopathy and in controlling development and potentially invasiveness were identified. In addition to being a valuable sequence and protein data resource for studying plant rhizome species, our results provide useful insights into identifying specific genes and proteins with potential roles in rhizome differentiation, development, and function.
Alvarez, Sophie; Roy Choudhury, Swarup; Hicks, Leslie M; Pandey, Sona
2013-03-01
Abscisic acid (ABA) is proposed to be perceived by multiple receptors in plants. We have previously reported on the role of two GPCR-type G-proteins (GTG proteins) as plasma membrane-localized ABA receptors in Arabidopsis thaliana. However, due to the presence of multiple transmembrane domains, detailed structural and biochemical characterization of GTG proteins remains limited. Since ABA induces substantial changes in the proteome of plants, a labeling LC-based quantitative proteomics approach was applied to elucidate the global effects and possible downstream targets of GTG1/GTG2 proteins. Quantitative differences in protein abundance between wild-type and gtg1gtg2 were analyzed for evaluation of the effect of ABA on the root proteome and its dependence on the presence of functional GTG1/GTG2 proteins. The results presented in this study reveal the most comprehensive ABA-responsive root proteome reported to date in Arabidopsis. Notably, the majority of ABA-responsive proteins required the presence of GTG proteins, supporting their key role in ABA signaling. These observations were further confirmed by additional experiments. Overall, comparison of the ABA-dependent protein abundance changes in wild-type versus gtg1gtg2 provides clues to their possible links with some of the well-established effectors of the ABA signaling pathways and their role in mediating phytohormone cross-talk.
Analyzing large-scale proteomics projects with latent semantic indexing.
Klie, Sebastian; Martens, Lennart; Vizcaíno, Juan Antonio; Côté, Richard; Jones, Phil; Apweiler, Rolf; Hinneburg, Alexander; Hermjakob, Henning
2008-01-01
Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.
NCI's Office of Cancer Clinical Proteomics Research authored a review of the current state of clinical proteomics in the peer-reviewed Journal of Proteome Research. The review highlights outcomes from the CPTC program and also provides a thorough overview of the different technologies that have pushed the field forward. Additionally, the review provides a vision for moving the field forward through linking advances in genomic and proteomic analysis to develop new, molecularly targeted interventions.
Clinical proteomic analysis of scrub typhus infection.
Park, Edmond Changkyun; Lee, Sang-Yeop; Yun, Sung Ho; Choi, Chi-Won; Lee, Hayoung; Song, Hyun Seok; Jun, Sangmi; Kim, Gun-Hwa; Lee, Chang-Seop; Kim, Seung Il
2018-01-01
Scrub typhus is an acute and febrile infectious disease caused by the Gram-negative α-proteobacterium Orientia tsutsugamushi from the family Rickettsiaceae that is widely distributed in Northern, Southern and Eastern Asia. In the present study, we analysed the serum proteome of scrub typhus patients to investigate specific clinical protein patterns in an attempt to explain pathophysiology and discover potential biomarkers of infection. Serum samples were collected from three patients (before and after treatment with antibiotics) and three healthy subjects. One-dimensional sodium dodecyl sulphate-polyacrylamide gel electrophoresis followed by liquid chromatography-tandem mass spectrometry was performed to identify differentially abundant proteins using quantitative proteomic approaches. Bioinformatic analysis was then performed using Ingenuity Pathway Analysis. Proteomic analysis identified 236 serum proteins, of which 32 were differentially expressed in normal subjects, naive scrub typhus patients and patients treated with antibiotics. Comparative bioinformatic analysis of the identified proteins revealed up-regulation of proteins involved in immune responses, especially complement system, following infection with O. tsutsugamushi , and normal expression was largely rescued by antibiotic treatment. This is the first proteomic study of clinical serum samples from scrub typhus patients. Proteomic analysis identified changes in protein expression upon infection with O. tsutsugamushi and following antibiotic treatment. Our results provide valuable information for further investigation of scrub typhus therapy and diagnosis.
O'Hurley, Gillian; Busch, Christer; Fagerberg, Linn; Hallström, Björn M.; Stadler, Charlotte; Tolf, Anna; Lundberg, Emma; Schwenk, Jochen M.; Jirström, Karin; Bjartell, Anders; Gallagher, William M.; Uhlén, Mathias; Pontén, Fredrik
2015-01-01
To better understand prostate function and disease, it is important to define and explore the molecular constituents that signify the prostate gland. The aim of this study was to define the prostate specific transcriptome and proteome, in comparison to 26 other human tissues. Deep sequencing of mRNA (RNA-seq) and immunohistochemistry-based protein profiling were combined to identify prostate specific gene expression patterns and to explore tissue biomarkers for potential clinical use in prostate cancer diagnostics. We identified 203 genes with elevated expression in the prostate, 22 of which showed more than five-fold higher expression levels compared to all other tissue types. In addition to previously well-known proteins we identified two poorly characterized proteins, TMEM79 and ACOXL, with potential to differentiate between benign and cancerous prostatic glands in tissue biopsies. In conclusion, we have applied a genome-wide analysis to identify the prostate specific proteome using transcriptomics and antibody-based protein profiling to identify genes with elevated expression in the prostate. Our data provides a starting point for further functional studies to explore the molecular repertoire of normal and diseased prostate including potential prostate cancer markers such as TMEM79 and ACOXL. PMID:26237329
Röst, Hannes L; Liu, Yansheng; D'Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi
2016-09-01
Next-generation mass spectrometric (MS) techniques such as SWATH-MS have substantially increased the throughput and reproducibility of proteomic analysis, but ensuring consistent quantification of thousands of peptide analytes across multiple liquid chromatography-tandem MS (LC-MS/MS) runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we developed TRIC (http://proteomics.ethz.ch/tric/), a software tool that utilizes fragment-ion data to perform cross-run alignment, consistent peak-picking and quantification for high-throughput targeted proteomics. TRIC reduced the identification error compared to a state-of-the-art SWATH-MS analysis without alignment by more than threefold at constant recall while correcting for highly nonlinear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups. Thus, TRIC fills a gap in the pipeline for automated analysis of massively parallel targeted proteomics data sets.
Nuez-Ortín, Waldo G; Carter, Chris G; Nichols, Peter D; Wilson, Richard
2016-07-01
Understanding diet- and environmentally induced physiological changes in fish larvae is a major goal for the aquaculture industry. Proteomic analysis of whole fish larvae comprising multiple tissues offers considerable potential but is challenging due to the very large dynamic range of protein abundance. To extend the coverage of the larval phase of the Atlantic salmon (Salmo salar) proteome, we applied a two-step sequential extraction (SE) method, based on differential protein solubility, using a nondenaturing buffer containing 150 mM NaCl followed by a denaturing buffer containing 7 M urea and 2 M thiourea. Extracts prepared using SE and one-step direct extraction were characterized via label-free shotgun proteomics using nanoLC-MS/MS (LTQ-Orbitrap). SE partitioned the proteins into two fractions of approximately equal amounts, but with very distinct protein composition, leading to identification of ∼40% more proteins than direct extraction. This fractionation strategy enabled the most detailed characterization of the salmon larval proteome to date and provides a platform for greater understanding of physiological changes in whole fish larvae. The MS data are available via the ProteomeXchange Consortium PRIDE partner repository, dataset PXD003366. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Plasma proteomic analysis reveals altered protein abundances in cardiovascular disease.
Lygirou, Vasiliki; Latosinska, Agnieszka; Makridakis, Manousos; Mullen, William; Delles, Christian; Schanstra, Joost P; Zoidakis, Jerome; Pieske, Burkert; Mischak, Harald; Vlahou, Antonia
2018-04-17
Cardiovascular disease (CVD) describes the pathological conditions of the heart and blood vessels. Despite the large number of studies on CVD and its etiology, its key modulators remain largely unknown. To this end, we performed a comprehensive proteomic analysis of blood plasma, with the scope to identify disease-associated changes after placing them in the context of existing knowledge, and generate a well characterized dataset for further use in CVD multi-omics integrative analysis. LC-MS/MS was employed to analyze plasma from 32 subjects (19 cases of various CVD phenotypes and 13 controls) in two steps: discovery (13 cases and 8 controls) and test (6 cases and 5 controls) set analysis. Following label-free quantification, the detected proteins were correlated to existing plasma proteomics datasets (plasma proteome database; PPD) and functionally annotated (Cytoscape, Ingenuity Pathway Analysis). Differential expression was defined based on identification confidence (≥ 2 peptides per protein), statistical significance (Mann-Whitney p value ≤ 0.05) and a minimum of twofold change. Peptides detected in at least 50% of samples per group were considered, resulting in a total of 3796 identified proteins (838 proteins based on ≥ 2 peptides). Pathway annotation confirmed the functional relevance of the findings (representation of complement cascade, fibrin clot formation, platelet degranulation, etc.). Correlation of the relative abundance of the proteins identified in the discovery set with their reported concentrations in the PPD was significant, confirming the validity of the quantification method. The discovery set analysis revealed 100 differentially expressed proteins between cases and controls, 39 of which were verified (≥ twofold change) in the test set. These included proteins already studied in the context of CVD (such as apolipoprotein B, alpha-2-macroglobulin), as well as novel findings (such as low density lipoprotein receptor related protein 2 [LRP2], protein SZT2) for which a mechanism of action is suggested. This proteomic study provides a comprehensive dataset to be used for integrative and functional studies in the field. The observed protein changes reflect known CVD-related processes (e.g. lipid uptake, inflammation) but also novel hypotheses for further investigation including a potential pleiotropic role of LPR2 but also links of SZT2 to CVD.
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-01-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose function indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7)1 not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. PMID:27215607
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes.
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-08-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose function indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7)(1) not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Ramus, Claire; Hovasse, Agnès; Marcellin, Marlène; Hesse, Anne-Marie; Mouton-Barbosa, Emmanuelle; Bouyssié, David; Vaca, Sebastian; Carapito, Christine; Chaoui, Karima; Bruley, Christophe; Garin, Jérôme; Cianférani, Sarah; Ferro, Myriam; Van Dorssaeler, Alain; Burlet-Schiltz, Odile; Schaeffer, Christine; Couté, Yohann; Gonzalez de Peredo, Anne
2016-01-30
Proteomic workflows based on nanoLC-MS/MS data-dependent-acquisition analysis have progressed tremendously in recent years. High-resolution and fast sequencing instruments have enabled the use of label-free quantitative methods, based either on spectral counting or on MS signal analysis, which appear as an attractive way to analyze differential protein expression in complex biological samples. However, the computational processing of the data for label-free quantification still remains a challenge. Here, we used a proteomic standard composed of an equimolar mixture of 48 human proteins (Sigma UPS1) spiked at different concentrations into a background of yeast cell lysate to benchmark several label-free quantitative workflows, involving different software packages developed in recent years. This experimental design allowed to finely assess their performances in terms of sensitivity and false discovery rate, by measuring the number of true and false-positive (respectively UPS1 or yeast background proteins found as differential). The spiked standard dataset has been deposited to the ProteomeXchange repository with the identifier PXD001819 and can be used to benchmark other label-free workflows, adjust software parameter settings, improve algorithms for extraction of the quantitative metrics from raw MS data, or evaluate downstream statistical methods. Bioinformatic pipelines for label-free quantitative analysis must be objectively evaluated in their ability to detect variant proteins with good sensitivity and low false discovery rate in large-scale proteomic studies. This can be done through the use of complex spiked samples, for which the "ground truth" of variant proteins is known, allowing a statistical evaluation of the performances of the data processing workflow. We provide here such a controlled standard dataset and used it to evaluate the performances of several label-free bioinformatics tools (including MaxQuant, Skyline, MFPaQ, IRMa-hEIDI and Scaffold) in different workflows, for detection of variant proteins with different absolute expression levels and fold change values. The dataset presented here can be useful for tuning software tool parameters, and also testing new algorithms for label-free quantitative analysis, or for evaluation of downstream statistical methods. Copyright © 2015 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Santra, Tapesh; Delatola, Eleni Ioanna
2016-07-01
Presence of considerable noise and missing data points make analysis of mass-spectrometry (MS) based proteomic data a challenging task. The missing values in MS data are caused by the inability of MS machines to reliably detect proteins whose abundances fall below the detection limit. We developed a Bayesian algorithm that exploits this knowledge and uses missing data points as a complementary source of information to the observed protein intensities in order to find differentially expressed proteins by analysing MS based proteomic data. We compared its accuracy with many other methods using several simulated datasets. It consistently outperformed other methods. We then used it to analyse proteomic screens of a breast cancer (BC) patient cohort. It revealed large differences between the proteomic landscapes of triple negative and Luminal A, which are the most and least aggressive types of BC. Unexpectedly, majority of these differences could be attributed to the direct transcriptional activity of only seven transcription factors some of which are known to be inactive in triple negative BC. We also identified two new proteins which significantly correlated with the survival of BC patients, and therefore may have potential diagnostic/prognostic values.
Doulgeraki, Agapi I.; Efthimiou, Georgios; Paramithiotis, Spiros; Pappas, Katherine M.; Typas, Milton A.; Nychas, George-John
2017-01-01
The emergence of methicillin-resistant Staphylococcus aureus (MRSA) in food has provoked a great concern about the presence of MRSA in associated foodstuff. Although MRSA is often detected in various retailed meat products, it seems that food handlers are more strongly associated with this type of food contamination. Thus, it can be easily postulated that any food could be contaminated with this pathogen in an industrial environment or in household and cause food poisoning. To this direction, the effect of rocket (Eruca sativa) extract on MRSA growth and proteome was examined in the present study. This goal was achieved with the comparative study of the MRSA strain COL proteome, cultivated in rocket extract versus the standard Luria-Bertani growth medium. The obtained results showed that MRSA was able to grow in rocket extract. In addition, proteome analysis using 2-DE method showed that MRSA strain COL is taking advantage of the sugar-, lipid-, and vitamin-rich substrate in the liquid rocket extract, although its growth was delayed in rocket extract compared to Luria–Bertani medium. This work could initiate further research about bacterial metabolism in plant-based media and defense mechanisms against plant-derived antibacterials. PMID:28529502
Doulgeraki, Agapi I; Efthimiou, Georgios; Paramithiotis, Spiros; Pappas, Katherine M; Typas, Milton A; Nychas, George-John
2017-01-01
The emergence of methicillin-resistant Staphylococcus aureus (MRSA) in food has provoked a great concern about the presence of MRSA in associated foodstuff. Although MRSA is often detected in various retailed meat products, it seems that food handlers are more strongly associated with this type of food contamination. Thus, it can be easily postulated that any food could be contaminated with this pathogen in an industrial environment or in household and cause food poisoning. To this direction, the effect of rocket (Eruca sativa ) extract on MRSA growth and proteome was examined in the present study. This goal was achieved with the comparative study of the MRSA strain COL proteome, cultivated in rocket extract versus the standard Luria-Bertani growth medium. The obtained results showed that MRSA was able to grow in rocket extract. In addition, proteome analysis using 2-DE method showed that MRSA strain COL is taking advantage of the sugar-, lipid-, and vitamin-rich substrate in the liquid rocket extract, although its growth was delayed in rocket extract compared to Luria-Bertani medium. This work could initiate further research about bacterial metabolism in plant-based media and defense mechanisms against plant-derived antibacterials.
Marine proteomics: a critical assessment of an emerging technology.
Slattery, Marc; Ankisetty, Sridevi; Corrales, Jone; Marsh-Hunkin, K Erica; Gochfeld, Deborah J; Willett, Kristine L; Rimoldi, John M
2012-10-26
The application of proteomics to marine sciences has increased in recent years because the proteome represents the interface between genotypic and phenotypic variability and, thus, corresponds to the broadest possible biomarker for eco-physiological responses and adaptations. Likewise, proteomics can provide important functional information regarding biosynthetic pathways, as well as insights into mechanism of action, of novel marine natural products. The goal of this review is to (1) explore the application of proteomics methodologies to marine systems, (2) assess the technical approaches that have been used, and (3) evaluate the pros and cons of this proteomic research, with the intent of providing a critical analysis of its future roles in marine sciences. To date, proteomics techniques have been utilized to investigate marine microbe, plant, invertebrate, and vertebrate physiology, developmental biology, seafood safety, susceptibility to disease, and responses to environmental change. However, marine proteomics studies often suffer from poor experimental design, sample processing/optimization difficulties, and data analysis/interpretation issues. Moreover, a major limitation is the lack of available annotated genomes and proteomes for most marine organisms, including several "model species". Even with these challenges in mind, there is no doubt that marine proteomics is a rapidly expanding and powerful integrative molecular research tool from which our knowledge of the marine environment, and the natural products from this resource, will be significantly expanded.
Mishra, Pragya; Singh, Shweta; Rathinam, Maniraj; Nandiganti, Muralimohan; Ram Kumar, Nikhil; Thangaraj, Arulprakash; Thimmegowda, Vinutha; Krishnan, Veda; Mishra, Vagish; Jain, Neha; Rai, Vandna; Pattanayak, Debasis; Sreevathsa, Rohini
2017-02-22
Safety assessment of genetically modified plants is an important aspect prior to deregulation. Demonstration of substantial equivalence of the transgenics compared to their nontransgenic counterparts can be performed using different techniques at various molecular levels. The present study is a first-ever comprehensive evaluation of pigeon pea transgenics harboring two independent cry genes, cry2Aa and cry1AcF. The absence of unintended effects in the transgenic seed components was demonstrated by proteome and nutritional composition profiling. Analysis revealed that no significant differences were found in the various nutritional compositional analyses performed. Additionally, 2-DGE-based proteome analysis of the transgenic and nontransgenic seed protein revealed that there were no major changes in the protein profile, although a minor fold change in the expression of a few proteins was observed. Furthermore, the study also demonstrated that neither the integration of T-DNA nor the expression of the cry genes resulted in the production of unintended effects in the form of new toxins or allergens.
Multivariate proteomic profiling identifies novel accessory proteins of coated vesicles
Antrobus, Robin; Hirst, Jennifer; Bhumbra, Gary S.; Kozik, Patrycja; Jackson, Lauren P.; Sahlender, Daniela A.
2012-01-01
Despite recent advances in mass spectrometry, proteomic characterization of transport vesicles remains challenging. Here, we describe a multivariate proteomics approach to analyzing clathrin-coated vesicles (CCVs) from HeLa cells. siRNA knockdown of coat components and different fractionation protocols were used to obtain modified coated vesicle-enriched fractions, which were compared by stable isotope labeling of amino acids in cell culture (SILAC)-based quantitative mass spectrometry. 10 datasets were combined through principal component analysis into a “profiling” cluster analysis. Overall, 136 CCV-associated proteins were predicted, including 36 new proteins. The method identified >93% of established CCV coat proteins and assigned >91% correctly to intracellular or endocytic CCVs. Furthermore, the profiling analysis extends to less well characterized types of coated vesicles, and we identify and characterize the first AP-4 accessory protein, which we have named tepsin. Finally, our data explain how sequestration of TACC3 in cytosolic clathrin cages causes the severe mitotic defects observed in auxilin-depleted cells. The profiling approach can be adapted to address related cell and systems biological questions. PMID:22472443
Alexander, William M; Ficarro, Scott B; Adelmant, Guillaume; Marto, Jarrod A
2017-08-01
The continued evolution of modern mass spectrometry instrumentation and associated methods represents a critical component in efforts to decipher the molecular mechanisms which underlie normal physiology and understand how dysregulation of biological pathways contributes to human disease. The increasing scale of these experiments combined with the technological diversity of mass spectrometers presents several challenges for community-wide data access, analysis, and distribution. Here we detail a redesigned version of multiplierz, our Python software library which leverages our common application programming interface (mzAPI) for analysis and distribution of proteomic data. New features include support for a wider range of native mass spectrometry file types, interfaces to additional database search engines, compatibility with new reporting formats, and high-level tools to perform post-search proteomic analyses. A GUI desktop environment, mzDesktop, provides access to multiplierz functionality through a user friendly interface. multiplierz is available for download from: https://github.com/BlaisProteomics/multiplierz; and mzDesktop is available for download from: https://sourceforge.net/projects/multiplierz/. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic analysis of endoplasmic reticulum stress responses in rice seeds.
Qian, Dandan; Tian, Lihong; Qu, Leqing
2015-09-23
The defects in storage proteins secretion in the endosperm of transgenic rice seeds often leads to endoplasmic reticulum (ER) stress, which produces floury and shrunken seeds, but the mechanism of this response remains unclear. We used an iTRAQ-based proteomics analysis of ER-stressed rice seeds due to the endosperm-specific suppression of OsSar1 to identify changes in the protein levels in response to ER stress. ER stress changed the expression of 405 proteins in rice seed by >2.0- fold compared with the wild-type control. Of these proteins, 140 were upregulated and 265 were downregulated. The upregulated proteins were mainly involved in protein modification, transport and degradation, and the downregulated proteins were mainly involved in metabolism and stress/defense responses. A KOBAS analysis revealed that protein-processing in the ER and degradation-related proteasome were the predominant upregulated pathways in the rice endosperm in response to ER stress. Trans-Golgi protein transport was also involved in the ER stress response. Combined with bioinformatic and molecular biology analyses, our proteomic data will facilitate our understanding of the systemic responses to ER stress in rice seeds.
Havugimana, Pierre C; Hu, Pingzhao; Emili, Andrew
2017-10-01
Elucidation of the networks of physical (functional) interactions present in cells and tissues is fundamental for understanding the molecular organization of biological systems, the mechanistic basis of essential and disease-related processes, and for functional annotation of previously uncharacterized proteins (via guilt-by-association or -correlation). After a decade in the field, we felt it timely to document our own experiences in the systematic analysis of protein interaction networks. Areas covered: Researchers worldwide have contributed innovative experimental and computational approaches that have driven the rapidly evolving field of 'functional proteomics'. These include mass spectrometry-based methods to characterize macromolecular complexes on a global-scale and sophisticated data analysis tools - most notably machine learning - that allow for the generation of high-quality protein association maps. Expert commentary: Here, we recount some key lessons learned, with an emphasis on successful workflows, and challenges, arising from our own and other groups' ongoing efforts to generate, interpret and report proteome-scale interaction networks in increasingly diverse biological contexts.
Ji, Xiaoyu; Liu, Xiaoqiang; Peng, Yuanxia; Zhan, Ruoting; Xu, Hui; Ge, Xijin
2017-12-09
Emodin has a strong antibacterial activity, including methicillin-resistant Staphylococcus aureus (MRSA). However, the mechanism by which emodin induces growth inhibition against MRSA remains unclear. In this study, the isobaric tags for relative and absolute quantitation (iTRAQ) proteomics approach was used to investigate the modes of action of emodin on a MRSA isolate and methicillin-sensitive S. aureus ATCC29213(MSSA). Proteomic analysis showed that expression levels of 145 and 122 proteins were changed significantly in MRSA and MSSA, respectively, after emodin treatment. Comparative analysis of the functions of differentially expressed proteins between the two strains was performed via bioinformatics tools blast2go and STRING database. Proteins related to pyruvate pathway imbalance induction, protein synthesis inhibition, and DNA synthesis suppression were found in both methicillin-sensitive and resistant strains. Moreover, Interference proteins related to membrane damage mechanism were also observed in MRSA. Our findings indicate that emodin is a potential antibacterial agent targeting MRSA via multiple mechanisms. Copyright © 2017 Elsevier Inc. All rights reserved.
Martinez-Pinna, Roxana; Gonzalez de Peredo, Anne; Monsarrat, Bernard; Burlet-Schiltz, Odile; Martin-Ventura, Jose Luis
2014-08-01
To find potential biomarkers of abdominal aortic aneurysms (AAA), we performed a differential proteomic study based on human plasma-derived microvesicles. Exosomes and microparticles isolated from plasma of AAA patients and control subjects (n = 10 each group) were analyzed by a label-free quantitative MS-based strategy. Homemade and publicly available software packages have been used for MS data analysis. The application of two kinds of bioinformatic tools allowed us to find differential protein profiles from AAA patients. Some of these proteins found by the two analysis methods belong to main pathological mechanisms of AAA such as oxidative stress, immune-inflammation, and thrombosis. Data analysis from label-free MS-based experiments requires the use of sophisticated bioinformatic approaches to perform quantitative studies from complex protein mixtures. The application of two of these bioinformatic tools provided us a preliminary list of differential proteins found in plasma-derived microvesicles not previously associated to AAA, which could help us to understand the pathological mechanisms related to this disease. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Efficient Site-Specific Labeling of Proteins via Cysteines
Kim, Younggyu; Ho, Sam O.; Gassman, Natalie R.; Korlann, You; Landorf, Elizabeth V.; Collart, Frank R.; Weiss, Shimon
2011-01-01
Methods for chemical modifications of proteins have been crucial for the advancement of proteomics. In particular, site-specific covalent labeling of proteins with fluorophores and other moieties has permitted the development of a multitude of assays for proteome analysis. A common approach for such a modification is solvent-accessible cysteine labeling using thiol-reactive dyes. Cysteine is very attractive for site-specific conjugation due to its relative rarity throughout the proteome and the ease of its introduction into a specific site along the protein's amino acid chain. This is achieved by site-directed mutagenesis, most often without perturbing the protein's function. Bottlenecks in this reaction, however, include the maintenance of reactive thiol groups without oxidation before the reaction, and the effective removal of unreacted molecules prior to fluorescence studies. Here, we describe an efficient, specific, and rapid procedure for cysteine labeling starting from well-reduced proteins in the solid state. The efficacy and specificity of the improved procedure are estimated using a variety of single-cysteine proteins and thiol-reactive dyes. Based on UV/vis absorbance spectra, coupling efficiencies are typically in the range 70–90%, and specificities are better than ~95%. The labeled proteins are evaluated using fluorescence assays, proving that the covalent modification does not alter their function. In addition to maleimide-based conjugation, this improved procedure may be used for other thiol-reactive conjugations such as haloacetyl, alkyl halide, and disulfide interchange derivatives. This facile and rapid procedure is well suited for high throughput proteome analysis. PMID:18275130
Efficient site-specific labeling of proteins via cysteines.
Kim, Younggyu; Ho, Sam O; Gassman, Natalie R; Korlann, You; Landorf, Elizabeth V; Collart, Frank R; Weiss, Shimon
2008-03-01
Methods for chemical modifications of proteins have been crucial for the advancement of proteomics. In particular, site-specific covalent labeling of proteins with fluorophores and other moieties has permitted the development of a multitude of assays for proteome analysis. A common approach for such a modification is solvent-accessible cysteine labeling using thiol-reactive dyes. Cysteine is very attractive for site-specific conjugation due to its relative rarity throughout the proteome and the ease of its introduction into a specific site along the protein's amino acid chain. This is achieved by site-directed mutagenesis, most often without perturbing the protein's function. Bottlenecks in this reaction, however, include the maintenance of reactive thiol groups without oxidation before the reaction, and the effective removal of unreacted molecules prior to fluorescence studies. Here, we describe an efficient, specific, and rapid procedure for cysteine labeling starting from well-reduced proteins in the solid state. The efficacy and specificity of the improved procedure are estimated using a variety of single-cysteine proteins and thiol-reactive dyes. Based on UV/vis absorbance spectra, coupling efficiencies are typically in the range 70-90%, and specificities are better than approximately 95%. The labeled proteins are evaluated using fluorescence assays, proving that the covalent modification does not alter their function. In addition to maleimide-based conjugation, this improved procedure may be used for other thiol-reactive conjugations such as haloacetyl, alkyl halide, and disulfide interchange derivatives. This facile and rapid procedure is well suited for high throughput proteome analysis.
A novel algorithm for validating peptide identification from a shotgun proteomics search engine.
Jian, Ling; Niu, Xinnan; Xia, Zhonghang; Samir, Parimal; Sumanasekera, Chiranthani; Mu, Zheng; Jennings, Jennifer L; Hoek, Kristen L; Allos, Tara; Howard, Leigh M; Edwards, Kathryn M; Weil, P Anthony; Link, Andrew J
2013-03-01
Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.
Linking the proteins--elucidation of proteome-scale networks using mass spectrometry.
Pflieger, Delphine; Gonnet, Florence; de la Fuente van Bentem, Sergio; Hirt, Heribert; de la Fuente, Alberto
2011-01-01
Proteomes are intricate. Typically, thousands of proteins interact through physical association and post-translational modifications (PTMs) to give rise to the emergent functions of cells. Understanding these functions requires one to study proteomes as "systems" rather than collections of individual protein molecules. The abstraction of the interacting proteome to "protein networks" has recently gained much attention, as networks are effective representations, that lose specific molecular details, but provide the ability to see the proteome as a whole. Mostly two aspects of the proteome have been represented by network models: proteome-wide physical protein-protein-binding interactions organized into Protein Interaction Networks (PINs), and proteome-wide PTM relations organized into Protein Signaling Networks (PSNs). Mass spectrometry (MS) techniques have been shown to be essential to reveal both of these aspects on a proteome-wide scale. Techniques such as affinity purification followed by MS have been used to elucidate protein-protein interactions, and MS-based quantitative phosphoproteomics is critical to understand the structure and dynamics of signaling through the proteome. We here review the current state-of-the-art MS-based analytical pipelines for the purpose to characterize proteome-scale networks. Copyright © 2010 Wiley Periodicals, Inc.
Liquid Chromatography-Mass Spectrometry-based Quantitative Proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xie, Fang; Liu, Tao; Qian, Weijun
2011-07-22
Liquid chromatography-mass spectrometry (LC-MS)-based quantitative proteomics has become increasingly applied for a broad range of biological applications due to growing capabilities for broad proteome coverage and good accuracy in quantification. Herein, we review the current LC-MS-based quantification methods with respect to their advantages and limitations, and highlight their potential applications.
Wang, Shuzhen; Chen, Wenyue; Yang, Changdeng; Yao, Jian; Xiao, Wenfei; Xin, Ya; Qiu, Jieren; Hu, Weimin; Yao, Haigen; Ying, Wu; Fu, Yaping; Tong, Jianxin; Chen, Zhongzhong; Ruan, Songlin; Ma, Huasheng
2016-09-13
Polyploidy has pivotal influences on rice (Oryza sativa L.) morphology and physiology, and is very important for understanding rice domestication and improving agricultural traits. Diploid (DP) and triploid (TP) rice shows differences in morphological parameters, such as plant height, leaf length, leaf width and the physiological index of chlorophyll content. However, the underlying mechanisms determining these morphological differences are remain to be defined. To better understand the proteomic changes between DP and TP, tandem mass tags (TMT) mass spectrometry (MS)/MS was used to detect the significant changes to protein expression between DP and TP. Results indicated that both photosynthesis and metabolic pathways were highly significantly associated with proteomic alteration between DP and TP based on biological process and pathway enrichment analysis, and 13 higher abundance chloroplast proteins involving in these two pathways were identified in TP. Quantitative real-time PCR analysis demonstrated that 5 of the 13 chloroplast proteins ATPF, PSAA, PSAB, PSBB and RBL in TP were higher abundance compared with those in DP. This study integrates morphology, physiology and proteomic profiling alteration of DP and TP to address their underlying different molecular mechanisms. Our finding revealed that ATPF, PSAA, PSAB, PSBB and RBL can induce considerable expression changes in TP and may affect the development and growth of rice through photosynthesis and metabolic pathways.
Skyline: an open source document editor for creating and analyzing targeted proteomics experiments.
MacLean, Brendan; Tomazela, Daniela M; Shulman, Nicholas; Chambers, Matthew; Finney, Gregory L; Frewen, Barbara; Kern, Randall; Tabb, David L; Liebler, Daniel C; MacCoss, Michael J
2010-04-01
Skyline is a Windows client application for targeted proteomics method creation and quantitative data analysis. It is open source and freely available for academic and commercial use. The Skyline user interface simplifies the development of mass spectrometer methods and the analysis of data from targeted proteomics experiments performed using selected reaction monitoring (SRM). Skyline supports using and creating MS/MS spectral libraries from a wide variety of sources to choose SRM filters and verify results based on previously observed ion trap data. Skyline exports transition lists to and imports the native output files from Agilent, Applied Biosystems, Thermo Fisher Scientific and Waters triple quadrupole instruments, seamlessly connecting mass spectrometer output back to the experimental design document. The fast and compact Skyline file format is easily shared, even for experiments requiring many sample injections. A rich array of graphs displays results and provides powerful tools for inspecting data integrity as data are acquired, helping instrument operators to identify problems early. The Skyline dynamic report designer exports tabular data from the Skyline document model for in-depth analysis with common statistical tools. Single-click, self-updating web installation is available at http://proteome.gs.washington.edu/software/skyline. This web site also provides access to instructional videos, a support board, an issues list and a link to the source code project.
Cloud parallel processing of tandem mass spectrometry based proteomics data.
Mohammed, Yassene; Mostovenko, Ekaterina; Henneman, Alex A; Marissen, Rob J; Deelder, André M; Palmblad, Magnus
2012-10-05
Data analysis in mass spectrometry based proteomics struggles to keep pace with the advances in instrumentation and the increasing rate of data acquisition. Analyzing this data involves multiple steps requiring diverse software, using different algorithms and data formats. Speed and performance of the mass spectral search engines are continuously improving, although not necessarily as needed to face the challenges of acquired big data. Improving and parallelizing the search algorithms is one possibility; data decomposition presents another, simpler strategy for introducing parallelism. We describe a general method for parallelizing identification of tandem mass spectra using data decomposition that keeps the search engine intact and wraps the parallelization around it. We introduce two algorithms for decomposing mzXML files and recomposing resulting pepXML files. This makes the approach applicable to different search engines, including those relying on sequence databases and those searching spectral libraries. We use cloud computing to deliver the computational power and scientific workflow engines to interface and automate the different processing steps. We show how to leverage these technologies to achieve faster data analysis in proteomics and present three scientific workflows for parallel database as well as spectral library search using our data decomposition programs, X!Tandem and SpectraST.
NCI's Proteome Characterization Centers Announced | Office of Cancer Clinical Proteomics Research
The National Cancer Institute (NCI), part of the National Institutes of Health, announces the launch of a Clinical Proteomic Tumor Analysis Consortium (CPTAC). CPTAC is a comprehensive, coordinated team effort to accelerate the understanding of the molecular basis of cancer through the application of robust, quantitative, proteomic technologies and workflows.
Pan, Lang; Zhang, Jian; Wang, Junzhi; Yu, Qin; Bai, Lianyang; Dong, Liyao
2017-05-08
American sloughgrass (Beckmannia syzigachne Steud.) is a weed widely distributed in wheat fields of China. In recent years, the evolution of herbicide (fenoxaprop-P-ethyl)-resistant populations has decreased the susceptibility of B. syzigachne. This study compared 4 B. syzigachne populations (3 resistant and 1 susceptible) using iTRAQ to characterize fenoxaprop-P-ethyl resistance in B. syzigachne at the proteomic level. Through searching the UniProt database, 3104 protein species were identified from 13,335 unique peptides. Approximately 2834 protein species were assigned to 23 functional classifications provided by the COG database. Among these, 2299 protein species were assigned to 125 predicted pathways. The resistant biotype contained 8 protein species that changed in abundance relative to the susceptible biotype; they were involved in photosynthesis, oxidative phosphorylation, and fatty acid biosynthesis pathways. In contrast to previous studies comparing only 1 resistant and 1 susceptible population, our use of 3 fenoxaprop-resistant B. syzigachne populations with different genetic backgrounds minimized irrelevant differential expression and eliminated false positives. Therefore, we could more confidently link the differentially expressed proteins to herbicide resistance. Proteomic analysis demonstrated that fenoxaprop-P-ethyl resistance is associated with photosynthetic capacity, a connection that might be related to the target-site mutations in resistant B. syzigachne. This is the first large-scale proteomics study examining herbicide stress responses in different B. syzigachne biotypes. This study has biological relevance because it is the first to employ proteomic analysis for understanding the mechanisms underlying Beckmannia syzigachne herbicide resistance. The plant is a major weed in China and negatively affects crop yield, but has developed considerable resistance to the most common herbicide, fenoxaprop-P-ethyl. Through comparisons of resistant and sensitive biotypes, our study identified multiple proteins (involved in photosynthesis, oxidative phosphorylation, and fatty acid biosynthesis) that are putatively linked to B. syzigachne herbicide response. This large-scale proteomics study, sorely lacking in weed science, contributes valuable data that can be applied to more fine-tuned analyses on the functions of specific proteins in herbicide resistance. Copyright © 2017 Elsevier B.V. All rights reserved.
Proteomic analysis of ligamentum flavum from patients with lumbar spinal stenosis.
Kamita, Masahiro; Mori, Taiki; Sakai, Yoshihito; Ito, Sadayuki; Gomi, Masahiro; Miyamoto, Yuko; Harada, Atsushi; Niida, Shumpei; Yamada, Tesshi; Watanabe, Ken; Ono, Masaya
2015-05-01
Lumbar spinal stenosis (LSS) is a syndromic degenerative spinal disease and is characterized by spinal canal narrowing with subsequent neural compression causing gait disturbances. Although LSS is a major age-related musculoskeletal disease that causes large decreases in the daily living activities of the elderly, its molecular pathology has not been investigated using proteomics. Thus, we used several proteomic technologies to analyze the ligamentum flavum (LF) of individuals with LSS. Using comprehensive proteomics with strong cation exchange fractionation, we detected 1288 proteins in these LF samples. A GO analysis of the comprehensive proteome revealed that more than 30% of the identified proteins were extracellular. Next, we used 2D image converted analysis of LC/MS to compare LF obtained from individuals with LSS to that obtained from individuals with disc herniation (nondegenerative control). We detected 64 781 MS peaks and identified 1675 differentially expressed peptides derived from 286 proteins. We verified four differentially expressed proteins (fibronectin, serine protease HTRA1, tenascin, and asporin) by quantitative proteomics using SRM/MRM. The present proteomic study is the first to identify proteins from degenerated and hypertrophied LF in LSS, which will help in studying LSS. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Thiele, Thomas; Steil, Leif; Völker, Uwe; Greinacher, Andreas
2007-01-01
Blood-based therapeutics are cellular or plasma components derived from human blood. Their production requires appropriate selection and treatment of the donor and processing of cells or plasma proteins. In contrast to clearly defined, chemically synthesized drugs, blood-derived therapeutics are highly complex mixtures of plasma proteins or even more complex cells. Pathogen transmission by the product as well as changes in the integrity of blood constituents resulting in loss of function or immune modulation are currently important issues in transfusion medicine. Protein modifications can occur during various steps of the production process, such as acquisition, enrichment of separate components (e.g. coagulation factors, cell populations), virus inactivation, conservation, and storage. Contemporary proteomic strategies allow a comprehensive assessment of protein modifications with high coverage, offer capabilities for qualitative and even quantitative analysis, and for high-throughput protein identification. Traditionally, proteomics approaches predominantly relied on two-dimensional gel electrophoresis (2-DE). Even if 2-DE is still state of the art, it has inherent limitations that are mainly based on the physicochemical properties of the proteins analyzed; for example, proteins with extremes in molecular mass and hydrophobicity (most membrane proteins) are difficult to assess by 2-DE. These limitations have fostered the development of mass spectrometry centered on non-gel-based separation approaches, which have proven to be highly successful and are thus complementing and even partially replacing 2-DE-based approaches. Although blood constituents have been extensively analyzed by proteomics, this technology has not been widely applied to assess or even improve blood-derived therapeutics, or to monitor the production processes. As proteomic technologies have the capacity to provide comprehensive information about changes occurring during processing and storage of blood products, proteomics can potentially guide improvement of pathogen inactivation procedures and engineering of stem cells, and may also allow a better understanding of factors influencing the immunogenicity of blood-derived therapeutics. An important development in proteomics is the reduction of inter-assay variability. This now allows the screening of samples taken from the same product over time or before and after processing. Optimized preparation procedures and storage conditions will reduce the risk of protein alterations, which in turn may contribute to better recovery, reduced exposure to allogeneic proteins, and increased transfusion safety.
Silva, Wanderson M; Carvalho, Rodrigo D; Soares, Siomar C; Bastos, Isabela Fs; Folador, Edson L; Souza, Gustavo Hmf; Le Loir, Yves; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco
2014-12-04
Corynebacterium pseudotuberculosis biovar ovis is a facultative intracellular pathogen, and the etiological agent of caseous lymphadenitis in small ruminants. During the infection process, the bacterium is subjected to several stress conditions, including nitrosative stress, which is caused by nitric oxide (NO). In silico analysis of the genome of C. pseudotuberculosis ovis 1002 predicted several genes that could influence the resistance of this pathogen to nitrosative stress. Here, we applied high-throughput proteomics using high definition mass spectrometry to characterize the functional genome of C. pseudotuberculosis ovis 1002 in the presence of NO-donor Diethylenetriamine/nitric oxide adduct (DETA/NO), with the aim of identifying proteins involved in nitrosative stress resistance. We characterized 835 proteins, representing approximately 41% of the predicted proteome of C. pseudotuberculosis ovis 1002, following exposure to nitrosative stress. In total, 102 proteins were exclusive to the proteome of DETA/NO-induced cells, and a further 58 proteins were differentially regulated between the DETA/NO and control conditions. An interactomic analysis of the differential proteome of C. pseudotuberculosis in response to nitrosative stress was also performed. Our proteomic data set suggested the activation of both a general stress response and a specific nitrosative stress response, as well as changes in proteins involved in cellular metabolism, detoxification, transcriptional regulation, and DNA synthesis and repair. Our proteomic analysis validated previously-determined in silico data for C. pseudotuberculosis ovis 1002. In addition, proteomic screening performed in the presence of NO enabled the identification of a set of factors that can influence the resistance and survival of C. pseudotuberculosis during exposure to nitrosative stress.
A new funding opportunity in support of the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) seeks to prospectively procure tumor samples, collected for proteomics investigation.
MinOmics, an Integrative and Immersive Tool for Multi-Omics Analysis.
Maes, Alexandre; Martinez, Xavier; Druart, Karen; Laurent, Benoist; Guégan, Sean; Marchand, Christophe H; Lemaire, Stéphane D; Baaden, Marc
2018-06-21
Proteomic and transcriptomic technologies resulted in massive biological datasets, their interpretation requiring sophisticated computational strategies. Efficient and intuitive real-time analysis remains challenging. We use proteomic data on 1417 proteins of the green microalga Chlamydomonas reinhardtii to investigate physicochemical parameters governing selectivity of three cysteine-based redox post translational modifications (PTM): glutathionylation (SSG), nitrosylation (SNO) and disulphide bonds (SS) reduced by thioredoxins. We aim to understand underlying molecular mechanisms and structural determinants through integration of redox proteome data from gene- to structural level. Our interactive visual analytics approach on an 8.3 m2 display wall of 25 MPixel resolution features stereoscopic three dimensions (3D) representation performed by UnityMol WebGL. Virtual reality headsets complement the range of usage configurations for fully immersive tasks. Our experiments confirm that fast access to a rich cross-linked database is necessary for immersive analysis of structural data. We emphasize the possibility to display complex data structures and relationships in 3D, intrinsic to molecular structure visualization, but less common for omics-network analysis. Our setup is powered by MinOmics, an integrated analysis pipeline and visualization framework dedicated to multi-omics analysis. MinOmics integrates data from various sources into a materialized physical repository. We evaluate its performance, a design criterion for the framework.
Proteomics: a new approach to the study of disease.
Chambers, G; Lawrie, L; Cash, P; Murray, G I
2000-11-01
The global analysis of cellular proteins has recently been termed proteomics and is a key area of research that is developing in the post-genome era. Proteomics uses a combination of sophisticated techniques including two-dimensional (2D) gel electrophoresis, image analysis, mass spectrometry, amino acid sequencing, and bio-informatics to resolve comprehensively, to quantify, and to characterize proteins. The application of proteomics provides major opportunities to elucidate disease mechanisms and to identify new diagnostic markers and therapeutic targets. This review aims to explain briefly the background to proteomics and then to outline proteomic techniques. Applications to the study of human disease conditions ranging from cancer to infectious diseases are reviewed. Finally, possible future advances are briefly considered, especially those which may lead to faster sample throughput and increased sensitivity for the detection of individual proteins. Copyright 2000 John Wiley & Sons, Ltd.
Gunawardena, Harsha P; O'Brien, Jonathon; Wrobel, John A; Xie, Ling; Davies, Sherri R; Li, Shunqiang; Ellis, Matthew J; Qaqish, Bahjat F; Chen, Xian
2016-02-01
Single quantitative platforms such as label-based or label-free quantitation (LFQ) present compromises in accuracy, precision, protein sequence coverage, and speed of quantifiable proteomic measurements. To maximize the quantitative precision and the number of quantifiable proteins or the quantifiable coverage of tissue proteomes, we have developed a unified approach, termed QuantFusion, that combines the quantitative ratios of all peptides measured by both LFQ and label-based methodologies. Here, we demonstrate the use of QuantFusion in determining the proteins differentially expressed in a pair of patient-derived tumor xenografts (PDXs) representing two major breast cancer (BC) subtypes, basal and luminal. Label-based in-spectra quantitative peptides derived from amino acid-coded tagging (AACT, also known as SILAC) of a non-malignant mammary cell line were uniformly added to each xenograft with a constant predefined ratio, from which Ratio-of-Ratio estimates were obtained for the label-free peptides paired with AACT peptides in each PDX tumor. A mixed model statistical analysis was used to determine global differential protein expression by combining complementary quantifiable peptide ratios measured by LFQ and Ratio-of-Ratios, respectively. With minimum number of replicates required for obtaining the statistically significant ratios, QuantFusion uses the distinct mechanisms to "rescue" the missing data inherent to both LFQ and label-based quantitation. Combined quantifiable peptide data from both quantitative schemes increased the overall number of peptide level measurements and protein level estimates. In our analysis of the PDX tumor proteomes, QuantFusion increased the number of distinct peptide ratios by 65%, representing differentially expressed proteins between the BC subtypes. This quantifiable coverage improvement, in turn, not only increased the number of measurable protein fold-changes by 8% but also increased the average precision of quantitative estimates by 181% so that some BC subtypically expressed proteins were rescued by QuantFusion. Thus, incorporating data from multiple quantitative approaches while accounting for measurement variability at both the peptide and global protein levels make QuantFusion unique for obtaining increased coverage and quantitative precision for tissue proteomes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Perturbations in the Urinary Exosome in Transplant Rejection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sigdel, Tara K.; NG, Yolanda; Lee, Sangho
Background: Urine exosomes, vesicles exocytosed into urine by all renal epithelial cell types, occur under normal physiologic and disease states. Exosome contents may mirror disease-specific proteome perturbations in kidney injury. Analysis methodologies for the exosomal fraction of the urinary proteome were developed and for comparing the urinary exosomal fraction versus unfractionated proteome for biomarker discovery. Methods: Urine exosomes were isolated by centrifugal filtration from mid-stream, second morning void, urine samples collected from kidney transplant recipients with and without biopsy matched acute rejection. The proteomes of unfractionated whole urine (Uw) and urine exosomes (Uexo) underwent mass spectrometry-based quantitative proteomics analysis. Themore » proteome data were analyzed for significant differential protein abundances in acute rejection (AR). Results: Identifications of 1018 and 349 proteins, Uw and Uexo fractions, respectively, demonstrated a 279 protein overlap between the two urinary compartments with 25%(70) of overlapping proteins unique to Uexoand represented membrane bound proteins (p=9.31e-7). Of 349 urine exosomal proteins identified in transplant patients 220 were not previously identified in the normal urine exosomal fraction. Uexo proteins (11), functioning in the inflammatory / stress response, were more abundant in patients with biopsy-confirmed acute rejection, 3 of which were exclusive to Uexo. Uexo AR-specific biomarkers (8) were also detected in Uw, but since they were observed at significantly lower abundances in Uw, they were not significant for AR in Uw. Conclusions: A rapid urinary exosome isolation method and quantitative measurement of enriched Uexo proteins was applied. Urine proteins specific to the exosomal fraction were detected either in unfractionated urine (at low abundances) or by Uexo fraction analysis. Perturbed proteins in the exosomal compartment of urine collected from kidney transplant patients were specific to inflammatory responses, and were not observed in the Uexo fraction from normal healthy subjects. Uexo specific protein alterations in renal disease provide potential mechanistic insights and offer a unique panel of sensitive biomarkers for monitoring for acute transplant rejection.« less
Comparative lipidomics and proteomics analysis of platelet lipid rafts using different detergents.
Rabani, Vahideh; Davani, Siamak; Gambert-Nicot, Ségolène; Meneveau, Nicolas; Montange, Damien
2016-11-01
Lipid rafts play a pivotal role in physiological functions of platelets. Their isolation using nonionic mild detergents is considered as the gold standard method, but there is no consensual detergent for lipid raft studies. We aimed to investigate which detergent is the most suitable for lipid raft isolation from platelet membrane, based on lipidomics and proteomics analysis. Platelets were obtained from healthy donors. Twelve sucrose fractions were extracted by three different detergents, namely Brij 35, Lubrol WX, and Triton X100, at 0.05% and 1%. After lipidomics analysis and determination of fractions enriched in cholesterol (Ch) and sphingomyelin (SM), proteomics analysis was performed. Lipid rafts were mainly observed in 1-4 fractions, and non-rafts were distributed on 5-12 fractions. Considering the concentration of Ch and SM, Lubrol WX 1% and Triton X100 1% were more suitable detergents as they were able to isolate lipid raft fractions that were more enriched than non-raft fractions. By proteomics analysis, overall, 822 proteins were identified in platelet membrane. Lipid raft fractions isolated with Lubrol WX 0.05% and Triton X100 1% contained mainly plasma membrane proteins. However, only Lubrol WX 0.05 and 1% and Triton X100 1% were able to extract non-denaturing proteins with more than 10 transmembrane domains. Our results suggest that Triton X100 1% is the most suitable detergent for global lipid and protein studies on platelet plasma membrane. However, the detergent should be adapted if investigation of an association between specific proteins and lipid rafts is planned.
State-of-the-art nanoplatform-integrated MALDI-MS impacting resolutions in urinary proteomics.
Gopal, Judy; Muthu, Manikandan; Chun, Se-Chul; Wu, Hui-Fen
2015-06-01
Urine proteomics has become a subject of interest, since it has led to a number of breakthroughs in disease diagnostics. Urine contains information not only from the kidney and the urinary tract but also from other organs, thus urinary proteome analysis allows for identification of biomarkers for both urogenital and systemic diseases. The following review gives a brief overview of the analytical techniques that have been in practice for urinary proteomics. MALDI-MS technique and its current application status in this area of clinical research have been discussed. The review comments on the challenges facing the conventional MALDI-MS technique and the upgradation of this technique with the introduction of nanotechnology. This review projects nano-based techniques such as nano-MALDI-MS, surface-assisted laser desorption/ionization, and nanostructure-initiator MS as the platforms that have the potential in trafficking MALDI-MS from the lab to the bedside. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Global Proteomics Analysis of the Response to Starvation in C. elegans*
Larance, Mark; Pourkarimi, Ehsan; Wang, Bin; Brenes Murillo, Alejandro; Kent, Robert; Lamond, Angus I.; Gartner, Anton
2015-01-01
Periodic starvation of animals induces large shifts in metabolism but may also influence many other cellular systems and can lead to adaption to prolonged starvation conditions. To date, there is limited understanding of how starvation affects gene expression, particularly at the protein level. Here, we have used mass-spectrometry-based quantitative proteomics to identify global changes in the Caenorhabditis elegans proteome due to acute starvation of young adult animals. Measuring changes in the abundance of over 5,000 proteins, we show that acute starvation rapidly alters the levels of hundreds of proteins, many involved in central metabolic pathways, highlighting key regulatory responses. Surprisingly, we also detect changes in the abundance of chromatin-associated proteins, including specific linker histones, histone variants, and histone posttranslational modifications associated with the epigenetic control of gene expression. To maximize community access to these data, they are presented in an online searchable database, the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd/). PMID:25963834
Mass spectrometry-based proteomics: basic principles and emerging technologies and directions.
Van Riper, Susan K; de Jong, Ebbing P; Carlis, John V; Griffin, Timothy J
2013-01-01
As the main catalytic and structural molecules within living systems, proteins are the most likely biomolecules to be affected by radiation exposure. Proteomics, the comprehensive characterization of proteins within complex biological samples, is therefore a research approach ideally suited to assess the effects of radiation exposure on cells and tissues. For comprehensive characterization of proteomes, an analytical platform capable of quantifying protein abundance, identifying post-translation modifications and revealing members of protein complexes on a system-wide level is necessary. Mass spectrometry (MS), coupled with technologies for sample fractionation and automated data analysis, provides such a versatile and powerful platform. In this chapter we offer a view on the current state of MS-proteomics, and focus on emerging technologies within three areas: (1) New instrumental methods; (2) New computational methods for peptide identification; and (3) Label-free quantification. These emerging technologies should be valuable for researchers seeking to better understand biological effects of radiation on living systems.
A multi-center study benchmarks software tools for label-free proteome quantification
Gillet, Ludovic C; Bernhardt, Oliver M.; MacLean, Brendan; Röst, Hannes L.; Tate, Stephen A.; Tsou, Chih-Chiang; Reiter, Lukas; Distler, Ute; Rosenberger, George; Perez-Riverol, Yasset; Nesvizhskii, Alexey I.; Aebersold, Ruedi; Tenzer, Stefan
2016-01-01
The consistent and accurate quantification of proteins by mass spectrometry (MS)-based proteomics depends on the performance of instruments, acquisition methods and data analysis software. In collaboration with the software developers, we evaluated OpenSWATH, SWATH2.0, Skyline, Spectronaut and DIA-Umpire, five of the most widely used software methods for processing data from SWATH-MS (sequential window acquisition of all theoretical fragment ion spectra), a method that uses data-independent acquisition (DIA) for label-free protein quantification. We analyzed high-complexity test datasets from hybrid proteome samples of defined quantitative composition acquired on two different MS instruments using different SWATH isolation windows setups. For consistent evaluation we developed LFQbench, an R-package to calculate metrics of precision and accuracy in label-free quantitative MS, and report the identification performance, robustness and specificity of each software tool. Our reference datasets enabled developers to improve their software tools. After optimization, all tools provided highly convergent identification and reliable quantification performance, underscoring their robustness for label-free quantitative proteomics. PMID:27701404
A multicenter study benchmarks software tools for label-free proteome quantification.
Navarro, Pedro; Kuharev, Jörg; Gillet, Ludovic C; Bernhardt, Oliver M; MacLean, Brendan; Röst, Hannes L; Tate, Stephen A; Tsou, Chih-Chiang; Reiter, Lukas; Distler, Ute; Rosenberger, George; Perez-Riverol, Yasset; Nesvizhskii, Alexey I; Aebersold, Ruedi; Tenzer, Stefan
2016-11-01
Consistent and accurate quantification of proteins by mass spectrometry (MS)-based proteomics depends on the performance of instruments, acquisition methods and data analysis software. In collaboration with the software developers, we evaluated OpenSWATH, SWATH 2.0, Skyline, Spectronaut and DIA-Umpire, five of the most widely used software methods for processing data from sequential window acquisition of all theoretical fragment-ion spectra (SWATH)-MS, which uses data-independent acquisition (DIA) for label-free protein quantification. We analyzed high-complexity test data sets from hybrid proteome samples of defined quantitative composition acquired on two different MS instruments using different SWATH isolation-window setups. For consistent evaluation, we developed LFQbench, an R package, to calculate metrics of precision and accuracy in label-free quantitative MS and report the identification performance, robustness and specificity of each software tool. Our reference data sets enabled developers to improve their software tools. After optimization, all tools provided highly convergent identification and reliable quantification performance, underscoring their robustness for label-free quantitative proteomics.
Zhu, Ying; Piehowski, Paul D; Zhao, Rui; Chen, Jing; Shen, Yufeng; Moore, Ronald J; Shukla, Anil K; Petyuk, Vladislav A; Campbell-Thompson, Martha; Mathews, Clayton E; Smith, Richard D; Qian, Wei-Jun; Kelly, Ryan T
2018-02-28
Nanoscale or single-cell technologies are critical for biomedical applications. However, current mass spectrometry (MS)-based proteomic approaches require samples comprising a minimum of thousands of cells to provide in-depth profiling. Here, we report the development of a nanoPOTS (nanodroplet processing in one pot for trace samples) platform for small cell population proteomics analysis. NanoPOTS enhances the efficiency and recovery of sample processing by downscaling processing volumes to <200 nL to minimize surface losses. When combined with ultrasensitive liquid chromatography-MS, nanoPOTS allows identification of ~1500 to ~3000 proteins from ~10 to ~140 cells, respectively. By incorporating the Match Between Runs algorithm of MaxQuant, >3000 proteins are consistently identified from as few as 10 cells. Furthermore, we demonstrate quantification of ~2400 proteins from single human pancreatic islet thin sections from type 1 diabetic and control donors, illustrating the application of nanoPOTS for spatially resolved proteome measurements from clinical tissues.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Ying; Piehowski, Paul D.; Zhao, Rui
Nanoscale or single-cell technologies are critical for biomedical applications. However, current mass spectrometry (MS)-based proteomic approaches require samples comprising a minimum of thousands of cells to provide in-depth profiling. Here in this paper, we report the development of a nanoPOTS (nanodroplet processing in one pot for trace samples) platform for small cell population proteomics analysis. NanoPOTS enhances the efficiency and recovery of sample processing by downscaling processing volumes to <200 nL to minimize surface losses. When combined with ultrasensitive liquid chromatography-MS, nanoPOTS allows identification of ~1500 to ~3000 proteins from ~10 to ~140 cells, respectively. By incorporating the Match Betweenmore » Runs algorithm of MaxQuant, >3000 proteins are consistently identified from as few as 10 cells. Furthermore, we demonstrate quantification of ~2400 proteins from single human pancreatic islet thin sections from type 1 diabetic and control donors, illustrating the application of nanoPOTS for spatially resolved proteome measurements from clinical tissues.« less
Zhu, Ying; Piehowski, Paul D.; Zhao, Rui; ...
2018-02-28
Nanoscale or single-cell technologies are critical for biomedical applications. However, current mass spectrometry (MS)-based proteomic approaches require samples comprising a minimum of thousands of cells to provide in-depth profiling. Here in this paper, we report the development of a nanoPOTS (nanodroplet processing in one pot for trace samples) platform for small cell population proteomics analysis. NanoPOTS enhances the efficiency and recovery of sample processing by downscaling processing volumes to <200 nL to minimize surface losses. When combined with ultrasensitive liquid chromatography-MS, nanoPOTS allows identification of ~1500 to ~3000 proteins from ~10 to ~140 cells, respectively. By incorporating the Match Betweenmore » Runs algorithm of MaxQuant, >3000 proteins are consistently identified from as few as 10 cells. Furthermore, we demonstrate quantification of ~2400 proteins from single human pancreatic islet thin sections from type 1 diabetic and control donors, illustrating the application of nanoPOTS for spatially resolved proteome measurements from clinical tissues.« less
Integrated Proteogenomic Characterization of Human High-Grade Serous Ovarian Cancer.
Zhang, Hui; Liu, Tao; Zhang, Zhen; Payne, Samuel H; Zhang, Bai; McDermott, Jason E; Zhou, Jian-Ying; Petyuk, Vladislav A; Chen, Li; Ray, Debjit; Sun, Shisheng; Yang, Feng; Chen, Lijun; Wang, Jing; Shah, Punit; Cha, Seong Won; Aiyetan, Paul; Woo, Sunghee; Tian, Yuan; Gritsenko, Marina A; Clauss, Therese R; Choi, Caitlin; Monroe, Matthew E; Thomas, Stefani; Nie, Song; Wu, Chaochao; Moore, Ronald J; Yu, Kun-Hsing; Tabb, David L; Fenyö, David; Bafna, Vineet; Wang, Yue; Rodriguez, Henry; Boja, Emily S; Hiltke, Tara; Rivers, Robert C; Sokoll, Lori; Zhu, Heng; Shih, Ie-Ming; Cope, Leslie; Pandey, Akhilesh; Zhang, Bing; Snyder, Michael P; Levine, Douglas A; Smith, Richard D; Chan, Daniel W; Rodland, Karin D
2016-07-28
To provide a detailed analysis of the molecular components and underlying mechanisms associated with ovarian cancer, we performed a comprehensive mass-spectrometry-based proteomic characterization of 174 ovarian tumors previously analyzed by The Cancer Genome Atlas (TCGA), of which 169 were high-grade serous carcinomas (HGSCs). Integrating our proteomic measurements with the genomic data yielded a number of insights into disease, such as how different copy-number alternations influence the proteome, the proteins associated with chromosomal instability, the sets of signaling pathways that diverse genome rearrangements converge on, and the ones most associated with short overall survival. Specific protein acetylations associated with homologous recombination deficiency suggest a potential means for stratifying patients for therapy. In addition to providing a valuable resource, these findings provide a view of how the somatic genome drives the cancer proteome and associations between protein and post-translational modification levels and clinical outcomes in HGSC. VIDEO ABSTRACT. Copyright © 2016 Elsevier Inc. All rights reserved.
Developmental cigarette smoke exposure II: Hepatic proteome profiles in 6 month old adult offspring.
Neal, Rachel E; Chen, Jing; Webb, Cindy; Stocke, Kendall; Gambrell, Caitlin; Greene, Robert M; Pisano, M Michele
2016-10-01
Utilizing a mouse model of 'active' developmental cigarette smoke exposure (CSE) [gestational day (GD) 1 through postnatal day (PD) 21] characterized by offspring low birth weight, the impact of developmental CSE on liver proteome profiles of adult offspring at 6 months of age was determined. Liver tissue was collected from Sham- and CSE-offspring for 2D-SDS-PAGE based proteome analysis with Partial Least Squares-Discriminant Analysis (PLS-DA). A similar study conducted at the cessation of exposure to cigarette smoke documented decreased gluconeogenesis coupled to oxidative stress in weanling offspring. In the current study, exposure throughout development to cigarette smoke resulted in impaired hepatic carbohydrate metabolism, decreased serum glucose levels, and increased gluconeogenic regulatory enzyme abundances during the fed-state coupled to decreased expression of SIRT1 as well as increased PEPCK and PGC1α expression. Together these findings indicate inappropriately timed gluconeogenesis that may reflect impaired insulin signaling in mature offspring exposed to 'active' developmental CSE. Copyright © 2016 Elsevier Inc. All rights reserved.
Li, Siyang; Plouffe, Brian D.; Belov, Arseniy M.; Ray, Somak; Wang, Xianzhe; Murthy, Shashi K.; Karger, Barry L.; Ivanov, Alexander R.
2015-01-01
Isolation and molecular characterization of rare cells (e.g. circulating tumor and stem cells) within biological fluids and tissues has significant potential in clinical diagnostics and personalized medicine. The present work describes an integrated platform of sample procurement, preparation, and analysis for deep proteomic profiling of rare cells in blood. Microfluidic magnetophoretic isolation of target cells spiked into 1 ml of blood at the level of 1000–2000 cells/ml, followed by focused acoustics-assisted sample preparation has been coupled with one-dimensional PLOT-LC-MS methodology. The resulting zeptomole detection sensitivity enabled identification of ∼4000 proteins with injection of the equivalent of only 100–200 cells per analysis. The characterization of rare cells in limited volumes of physiological fluids is shown by the isolation and quantitative proteomic profiling of first MCF-7 cells spiked into whole blood as a model system and then two CD133+ endothelial progenitor and hematopoietic cells in whole blood from volunteers. PMID:25755294
On September 4, 2013, NCI’s Clinical Proteomics Tumor Analysis Consortium (CPTAC) publicly released proteomic data produced from colorectal tumor samples previously analyzed by The Cancer Genome Atlas (TCGA). This is the initial release of proteomic tumor data designed to complement genomic data on the same tumors. The data is publicly available at the CPTAC data portal.
USDA-ARS?s Scientific Manuscript database
2-DE analysis of complex plant proteomes has limited dynamic resolution because only abundant proteins can be detected. Proteomic assessment of the low abundance proteins within leaf tissue is difficult when it is comprised of 30 – 50% of the CO2 fixation enzyme Rubisco. Resolution can be improved t...
Adaptation of Decoy Fusion Strategy for Existing Multi-Stage Search Workflows
NASA Astrophysics Data System (ADS)
Ivanov, Mark V.; Levitsky, Lev I.; Gorshkov, Mikhail V.
2016-09-01
A number of proteomic database search engines implement multi-stage strategies aiming at increasing the sensitivity of proteome analysis. These approaches often employ a subset of the original database for the secondary stage of analysis. However, if target-decoy approach (TDA) is used for false discovery rate (FDR) estimation, the multi-stage strategies may violate the underlying assumption of TDA that false matches are distributed uniformly across the target and decoy databases. This violation occurs if the numbers of target and decoy proteins selected for the second search are not equal. Here, we propose a method of decoy database generation based on the previously reported decoy fusion strategy. This method allows unbiased TDA-based FDR estimation in multi-stage searches and can be easily integrated into existing workflows utilizing popular search engines and post-search algorithms.
Fang, Yilin; Wilkins, Michael J; Yabusaki, Steven B; Lipton, Mary S; Long, Philip E
2012-12-01
Accurately predicting the interactions between microbial metabolism and the physical subsurface environment is necessary to enhance subsurface energy development, soil and groundwater cleanup, and carbon management. This study was an initial attempt to confirm the metabolic functional roles within an in silico model using environmental proteomic data collected during field experiments. Shotgun global proteomics data collected during a subsurface biostimulation experiment were used to validate a genome-scale metabolic model of Geobacter metallireducens-specifically, the ability of the metabolic model to predict metal reduction, biomass yield, and growth rate under dynamic field conditions. The constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes. Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low abundances of proteins associated with amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment.
Åhrman, Emma; Hallgren, Oskar; Malmström, Lars; Hedström, Ulf; Malmström, Anders; Bjermer, Leif; Zhou, Xiao-Hong; Westergren-Thorsson, Gunilla; Malmström, Johan
2018-03-01
Remodeling of the extracellular matrix (ECM) is a common feature in lung diseases such as chronic obstructive pulmonary disease (COPD) and idiopathic pulmonary fibrosis (IPF). Here, we applied a sequential tissue extraction strategy to describe disease-specific remodeling of human lung tissue in disease, using end-stages of COPD and IPF. Our strategy was based on quantitative comparison of the disease proteomes, with specific focus on the matrisome, using data-independent acquisition and targeted data analysis (SWATH-MS). Our work provides an in-depth proteomic characterization of human lung tissue during impaired tissue remodeling. In addition, we show important quantitative and qualitative effects of the solubility of matrisome proteins. COPD was characterized by a disease-specific increase in ECM regulators, metalloproteinase inhibitor 3 (TIMP3) and matrix metalloproteinase 28 (MMP-28), whereas for IPF, impairment in cell adhesion proteins, such as collagen VI and laminins, was most prominent. For both diseases, we identified increased levels of proteins involved in the regulation of endopeptidase activity, with several proteins belonging to the serpin family. The established human lung quantitative proteome inventory and the construction of a tissue-specific protein assay library provides a resource for future quantitative proteomic analyses of human lung tissues. We present a sequential tissue extraction strategy to determine changes in extractability of matrisome proteins in end-stage COPD and IPF compared to healthy control tissue. Extensive quantitative analysis of the proteome changes of the disease states revealed altered solubility of matrisome proteins involved in ECM regulators and cell-ECM communication. The results highlight disease-specific remodeling mechanisms associated with COPD and IPF. Copyright © 2018 Elsevier B.V. All rights reserved.
de Bernonville, Thomas Dugé; Albenne, Cécile; Arlat, Matthieu; Hoffmann, Laurent; Lauber, Emmanuelle; Jamet, Elisabeth
2014-01-01
Proteomic analysis of xylem sap has recently become a major field of interest to understand several biological questions related to plant development and responses to environmental clues. The xylem sap appears as a dynamic fluid undergoing changes in its proteome upon abiotic and biotic stresses. Unlike cell compartments which are amenable to purification in sufficient amount prior to proteomic analysis, the xylem sap has to be collected in particular conditions to avoid contamination by intracellular proteins and to obtain enough material. A model plant like Arabidopsis thaliana is not suitable for such an analysis because efficient harvesting of xylem sap is difficult. The analysis of the xylem sap proteome also requires specific procedures to concentrate proteins and to focus on proteins predicted to be secreted. Indeed, xylem sap proteins appear to be synthesized and secreted in the root stele or to originate from dying differentiated xylem cells. This chapter describes protocols to collect xylem sap from Brassica species and to prepare total and N-glycoprotein extracts for identification of proteins by mass spectrometry analyses and bioinformatics.
Proteome analysis of snake venom toxins: pharmacological insights.
Georgieva, Dessislava; Arni, Raghuvir K; Betzel, Christian
2008-12-01
Snake venoms are an extremely rich source of pharmacologically active proteins with a considerable clinical and medical potential. To date, this potential has not been fully explored, mainly because of our incomplete knowledge of the venom proteome and the pharmacological properties of its components, in particular those devoid of enzymatic activity. This review summarizes the latest achievements in the determination of snake venom proteome, based primarily on the development of new strategies and techniques. Detailed knowledge of the venom toxin composition and biological properties of the protein constituents should provide the scaffold for the design of new more effective drugs for the treatment of the hemostatic system and heart disorders, inflammation, cancer and consequences of snake bites, as well as new tools for clinical diagnostic and assays of hemostatic parameters.
Takahashi, Eri; Okumura, Akinori; Unoki-Kubota, Hiroyuki; Hirano, Hisashi; Kasuga, Masato; Kaburagi, Yasushi
2013-06-12
To identify candidate serum molecules associated with the progression of type 2 diabetes mellitus (T2DM), we carried out differential proteomic analysis using the KK-A(y) mouse, an animal model of T2DM with obesity. We employed an iTRAQ-based quantitative proteomic approach to analyze the proteomic changes in the sera collected from a pair of 4-week-old KK-A(y) versus C57BL/6 mice. Among the 227 proteins identified, a total of 45 proteins were differentially expressed in KK-A(y) versus C57BL/6 mice. We comparatively analyzed a series of the sera collected at 4 and 12weeks of age from KK-A(y) and C57BL/6 mice for the target protein using multiple reaction monitoring analysis, and identified 8 differentially expressed proteins between the sera of these mice at both time points. Among them, serine (or cysteine) peptidase inhibitor, clade A, member 3K (SERPINA3K) levels were elevated significantly in the sera of KK-A(y) mice compared to C57BL/6 mice. An in vitro assay revealed that the human homologue SERPINA3 increased the transendothelial permeability of retinal microvascular endothelial cells, which may be involved in the pathogenesis of diabetes and/or diabetic retinopathy. With the identified proteins, our proteomics study could provide valuable clues for a better understanding of the underlying mechanisms associated with T2DM. In this paper, we investigated the serum proteome of KK-A(y) mice in a pre-diabetic state compared to that of wild type controls in an attempt to uncover early diagnostic markers of diabetes that are maintained through a diabetic phenotype. We used iTRAQ-based two-dimensional LC-MS/MS serum profiling, and identified several differentially expressed proteins at the pre-diabetic stage. The differential expression was confirmed by multiple reaction monitoring assay, which is fast gaining ground as a sensitive, specific, and cost-effective methodology for relative quantification of the candidate proteins. Using these techniques, we have identified eight candidate proteins of interest including SERPINA3K, which may be important in the pathology of T2DM and/or diabetic retinopathy. Copyright © 2013 Elsevier B.V. All rights reserved.
Ayyar, Vivaswath S; Almon, Richard R; DuBois, Debra C; Sukumaran, Siddharth; Qu, Jun; Jusko, William J
2017-05-08
Corticosteroids (CS) are anti-inflammatory agents that cause extensive pharmacogenomic and proteomic changes in multiple tissues. An understanding of the proteome-wide effects of CS in liver and its relationships to altered hepatic and systemic physiology remains incomplete. Here, we report the application of a functional pharmacoproteomic approach to gain integrated insight into the complex nature of CS responses in liver in vivo. An in-depth functional analysis was performed using rich pharmacodynamic (temporal-based) proteomic data measured over 66h in rat liver following a single dose of methylprednisolone (MPL). Data mining identified 451 differentially regulated proteins. These proteins were analyzed on the basis of temporal regulation, cellular localization, and literature-mined functional information. Of the 451 proteins, 378 were clustered into six functional groups based on major clinically-relevant effects of CS in liver. MPL-responsive proteins were highly localized in the mitochondria (20%) and cytosol (24%). Interestingly, several proteins were related to hepatic stress and signaling processes, which appear to be involved in secondary signaling cascades and in protecting the liver from CS-induced oxidative damage. Consistent with known adverse metabolic effects of CS, several rate-controlling enzymes involved in amino acid metabolism, gluconeogenesis, and fatty-acid metabolism were altered by MPL. In addition, proteins involved in the metabolism of endogenous compounds, xenobiotics, and therapeutic drugs including cytochrome P450 and Phase-II enzymes were differentially regulated. Proteins related to the inflammatory acute-phase response were up-regulated in response to MPL. Functionally-similar proteins showed large diversity in their temporal profiles, indicating complex mechanisms of regulation by CS. Clinical use of corticosteroid (CS) therapy is frequent and chronic. However, current knowledge on the proteome-level effects of CS in liver and other tissues is sparse. While transcriptomic regulation following methylprednisolone (MPL) dosing has been temporally examined in rat liver, proteomic assessments are needed to better characterize the tissue-specific functional aspects of MPL actions. This study describes a functional pharmacoproteomic analysis of dynamic changes in MPL-regulated proteins in liver and provides biological insight into how steroid-induced perturbations on a molecular level may relate to both adverse and therapeutic responses presented clinically. Copyright © 2017 Elsevier B.V. All rights reserved.
Design and analysis issues in quantitative proteomics studies.
Karp, Natasha A; Lilley, Kathryn S
2007-09-01
Quantitative proteomics is the comparison of distinct proteomes which enables the identification of protein species which exhibit changes in expression or post-translational state in response to a given stimulus. Many different quantitative techniques are being utilized and generate large datasets. Independent of the technique used, these large datasets need robust data analysis to ensure valid conclusions are drawn from such studies. Approaches to address the problems that arise with large datasets are discussed to give insight into the types of statistical analyses of data appropriate for the various experimental strategies that can be employed by quantitative proteomic studies. This review also highlights the importance of employing a robust experimental design and highlights various issues surrounding the design of experiments. The concepts and examples discussed within will show how robust design and analysis will lead to confident results that will ensure quantitative proteomics delivers.
Establishing Substantial Equivalence: Proteomics
NASA Astrophysics Data System (ADS)
Lovegrove, Alison; Salt, Louise; Shewry, Peter R.
Wheat is a major crop in world agriculture and is consumed after processing into a range of food products. It is therefore of great importance to determine the consequences (intended and unintended) of transgenesis in wheat and whether genetically modified lines are substantially equivalent to those produced by conventional plant breeding. Proteomic analysis is one of several approaches which can be used to address these questions. Two-dimensional PAGE (2D PAGE) remains the most widely available method for proteomic analysis, but is notoriously difficult to reproduce between laboratories. We therefore describe methods which have been developed as standard operating procedures in our laboratory to ensure the reproducibility of proteomic analyses of wheat using 2D PAGE analysis of grain proteins.
Chakraborty, Sandeep; Nascimento, Rafael; Zaini, Paulo A; Gouran, Hossein; Rao, Basuthkar J; Goulart, Luiz R; Dandekar, Abhaya M
2016-01-01
Background. Xylella fastidiosa, the causative agent of various plant diseases including Pierce's disease in the US, and Citrus Variegated Chlorosis in Brazil, remains a continual source of concern and economic losses, especially since almost all commercial varieties are sensitive to this Gammaproteobacteria. Differential expression of proteins in infected tissue is an established methodology to identify key elements involved in plant defense pathways. Methods. In the current work, we developed a methodology named CHURNER that emphasizes relevant protein functions from proteomic data, based on identification of proteins with similar structures that do not necessarily have sequence homology. Such clustering emphasizes protein functions which have multiple copies that are up/down-regulated, and highlights similar proteins which are differentially regulated. As a working example we present proteomic data enumerating differentially expressed proteins in xylem sap from grapevines that were infected with X. fastidiosa. Results. Analysis of this data by CHURNER highlighted pathogenesis related PR-1 proteins, reinforcing this as the foremost protein function in xylem sap involved in the grapevine defense response to X. fastidiosa. β-1, 3-glucanase, which has both anti-microbial and anti-fungal activities, is also up-regulated. Simultaneously, chitinases are found to be both up and down-regulated by CHURNER, and thus the net gain of this protein function loses its significance in the defense response. Discussion. We demonstrate how structural data can be incorporated in the pipeline of proteomic data analysis prior to making inferences on the importance of individual proteins to plant defense mechanisms. We expect CHURNER to be applicable to any proteomic data set.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mu, Jun; Institute of Neuroscience and the Collaborative Innovation Center for Brain Science, Chongqing Medical University, Chongqing; Chongqing Key Laboratory of Neurobiology, Chongqing
Purpose: Tuberculous meningitis (TBM) remains to be one of the most deadly infectious diseases. The pathogen interacts with the host immune system, the process of which is largely unknown. Various cellular processes of Mycobacterium tuberculosis (MTB) centers around lipid metabolism. To determine the lipid metabolism related proteins, a quantitative proteomic study was performed here to identify differential proteins in the cerebrospinal fluid (CSF) obtained from TBM patients (n = 12) and healthy controls (n = 12). Methods: CSF samples were desalted, concentrated, labelled with isobaric tags for relative and absolute quantitation (iTRAQ™), and analyzed by multi-dimensional liquid chromatography-tandem mass spectrometry (LC-MS/MS). Gene ontology andmore » proteomic phenotyping analysis of the differential proteins were conducted using Database for Annotation, Visualization, and Integrated Discovery (DAVID) Bioinformatics Resources. ApoE and ApoB were selected for validation by ELISA. Results: Proteomic phenotyping of the 4 differential proteins was invloved in the lipid metabolism. ELISA showed significantly increased ApoB levels in TBM subjects compared to healthy controls. Area under the receiver operating characteristic curve analysis demonstrated ApoB levels could distinguish TBM subjects from healthy controls and viral meningitis subjects with 89.3% sensitivity and 92% specificity. Conclusions: CSF lipid metabolism disregulation, especially elevated expression of ApoB, gives insights into the pathogenesis of TBM. Further evaluation of these findings in larger studies including anti-tuberculosis medicated and unmedicated patient cohorts with other center nervous system infectious diseases is required for successful clinical translation. - Highlights: • The first proteomic study on the cerebrospinal fluid of tuberculous meningitis patients using iTRAQ. • Identify 4 differential proteins invloved in the lipid metabolism. • Elevated expression of ApoB gives insights into the pathogenesis of TBM.« less
Murad, André M; Laumann, Raul A; Lima, Thaina de A; Sarmento, Rubia B C; Noronha, Eliane F; Rocha, Thales L; Valadares-Inglis, Maria C; Franco, Octávio L
2006-01-01
Cowpea crops are severely attacked by Callosobruchus maculatus, a Coleopteran that at the larval stage penetrates into stored seeds and feeds on cotyledons. Cowpea weevil control could be based in utilization of bacteria and fungi to reduce pest development. Entomopathogenic fungi, such as Metarhizium anisopliae, are able to control insect-pests and are widely applied in biological control. This report evaluated ten M. anisopliae isolates according to their virulence, correlating chitinolytic, proteolytic and alpha-amylolytic activities, as well proteomic analysis by two dimensional gels of fungal secretions in response to an induced medium containing C. maculatus shells, indicating novel biotechnological tools capable of improving cowpea crop resistance.
Welker, F
2018-02-20
The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from "cross-species proteomic effects": the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts.
Wen, Bo; Xu, Shaohang; Sheynkman, Gloria M; Feng, Qiang; Lin, Liang; Wang, Quanhui; Xu, Xun; Wang, Jun; Liu, Siqi
2014-11-01
Single nucleotide variations (SNVs) located within a reading frame can result in single amino acid polymorphisms (SAPs), leading to alteration of the corresponding amino acid sequence as well as function of a protein. Accurate detection of SAPs is an important issue in proteomic analysis at the experimental and bioinformatic level. Herein, we present sapFinder, an R software package, for detection of the variant peptides based on tandem mass spectrometry (MS/MS)-based proteomics data. This package automates the construction of variation-associated databases from public SNV repositories or sample-specific next-generation sequencing (NGS) data and the identification of SAPs through database searching, post-processing and generation of HTML-based report with visualized interface. sapFinder is implemented as a Bioconductor package in R. The package and the vignette can be downloaded at http://bioconductor.org/packages/devel/bioc/html/sapFinder.html and are provided under a GPL-2 license. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
2017-01-01
Mass-spectrometry-based, high-throughput proteomics experiments produce large amounts of data. While typically acquired to answer specific biological questions, these data can also be reused in orthogonal ways to reveal new biological knowledge. We here present a novel method for such orthogonal data reuse of public proteomics data. Our method elucidates biological relationships between proteins based on the co-occurrence of these proteins across human experiments in the PRIDE database. The majority of the significantly co-occurring protein pairs that were detected by our method have been successfully mapped to existing biological knowledge. The validity of our novel method is substantiated by the extremely few pairs that can be mapped to existing knowledge based on random associations between the same set of proteins. Moreover, using literature searches and the STRING database, we were able to derive meaningful biological associations for unannotated protein pairs that were detected using our method, further illustrating that as-yet unknown associations present highly interesting targets for follow-up analysis. PMID:28480704
Deng, Ning; Li, Zhenye; Pan, Chao; Duan, Huilong
2015-01-01
Study of complex proteome brings forward higher request for the quantification method using mass spectrometry technology. In this paper, we present a mass spectrometry label-free quantification tool for complex proteomes, called freeQuant, which integrated quantification with functional analysis effectively. freeQuant consists of two well-integrated modules: label-free quantification and functional analysis with biomedical knowledge. freeQuant supports label-free quantitative analysis which makes full use of tandem mass spectrometry (MS/MS) spectral count, protein sequence length, shared peptides, and ion intensity. It adopts spectral count for quantitative analysis and builds a new method for shared peptides to accurately evaluate abundance of isoforms. For proteins with low abundance, MS/MS total ion count coupled with spectral count is included to ensure accurate protein quantification. Furthermore, freeQuant supports the large-scale functional annotations for complex proteomes. Mitochondrial proteomes from the mouse heart, the mouse liver, and the human heart were used to evaluate the usability and performance of freeQuant. The evaluation showed that the quantitative algorithms implemented in freeQuant can improve accuracy of quantification with better dynamic range.
Quantitative proteomics reveals the central changes of wheat in response to powdery mildew.
Fu, Ying; Zhang, Hong; Mandal, Siddikun Nabi; Wang, Changyou; Chen, Chunhuan; Ji, Wanquan
2016-01-01
Powdery mildew (Pm), caused by Blumeria graminis f. sp. tritici (Bgt), is one of the most important crop diseases, causing severe economic losses to wheat production worldwide. However, there are few reports about the proteomic response to Bgt infection in resistant wheat. Hence, quantitative proteomic analysis of N9134, a resistant wheat line, was performed to explore the molecular mechanism of wheat in defense against Bgt. Comparing the leaf proteins of Bgt-inoculated N9134 with that of mock-inoculated controls, a total of 2182 protein-species were quantified by iTRAQ at 24, 48 and 72h postinoculation (hpi) with Bgt, of which 394 showed differential accumulation. These differentially accumulated protein-species (DAPs) mainly included pathogenesis-related (PR) polypeptides, oxidative stress responsive proteins and components involved in primary metabolic pathways. KEGG enrichment analysis showed that phenylpropanoid biosynthesis, phenylalanine metabolism and photosynthesis-antenna proteins were the key pathways in response to Bgt infection. InterProScan 5 and the Gibbs Motif Sampler cluster 394 DAPs into eight conserved motifs, which shared leucine repeats and histidine sites in the sequence motifs. Moreover, eight separate protein-protein interaction (PPI) networks were predicted from STRING database. This study provides a powerful platform for further exploration of the molecular mechanism underlying resistant wheat responding to Bgt. Powdery mildew, caused by Blumeria graminis f. sp. tritici (Bgt), is a destructive pathogenic disease in wheat-producing regions worldwide, resulting in severe yield reductions. Although many resistant wheat varieties have been cultivated, there are few reports about the proteomic response to Bgt infection in resistant wheat. Therefore, an iTRAQ-based quantitative proteomic analysis of a resistant wheat line (N9134) in response to Bgt infection has been performed. This paper provides new insights into the underlying molecular mechanism of wheat in response to Bgt. The proteomic analysis can significantly narrow the field of potential defense-related protein-species, and is conducive to recognize the critical or effector protein under Bgt infection more precisely. Taken together, large amounts of high-throughput data provide a powerful platform for further exploration of the molecular mechanism on wheat-Bgt interactions. Copyright © 2015 Elsevier B.V. All rights reserved.
Mudaliar, Manikhandan; Tassi, Riccardo; Thomas, Funmilola C.; McNeilly, Tom N.; Weidt, Stefan K.; McLaughlin, Mark; Wilson, David; Burchmore, Richard; Herzyk, Pawel; Eckersall, P. David
2016-01-01
Mastitis, inflammation of the mammary gland, is the most common and costly disease of dairy cattle in the western world. It is primarily caused by bacteria, with Streptococcus uberis as one of the most prevalent causative agents. To characterize the proteome during Streptococcus uberis mastitis, an experimentally induced model of intramammary infection was used. Milk whey samples obtained from 6 cows at 6 time points were processed using label-free relative quantitative proteomics. This proteomic analysis complements clinical, bacteriological and immunological studies as well as peptidomic and metabolomic analysis of the same challenge model. A total of 2552 non-redundant bovine peptides were identified, and from these, 570 bovine proteins were quantified. Hierarchical cluster analysis and principal component analysis showed clear clustering of results by stage of infection, with similarities between pre-infection and resolution stages (0 and 312 h post challenge), early infection stages (36 and 42 h post challenge) and late infection stages (57 and 81 h post challenge). Ingenuity pathway analysis identified upregulation of acute phase protein pathways over the course of infection, with dominance of different acute phase proteins at different time points based on differential expression analysis. Antimicrobial peptides, notably cathelicidins and peptidoglycan recognition protein, were upregulated at all time points post challenge and peaked at 57 h, which coincided with 10 000-fold decrease in average bacterial counts. The integration of clinical, bacteriological, immunological and quantitative proteomics and other-omic data provides a more detailed systems level view of the host response to mastitis than has been achieved previously. PMID:27412694
2013-09-01
REFERENCES (1) Harsha, H. C.; Pandey, A. Phosphoproteomics in cancer. Mol. Oncol. 2010, 4 (6), 482−95. (2) Iliuk , A.; Liu, X. S.; Xue, L.; Liu, X...based proteomics and peptidomics for biomarker discovery in neurodegenerative diseases. Int. J. Clin. Exp. Pathol. 2009, 2 (2), 132−48. (4) Iliuk , A...phosphoproteins by immobilized metal (Fe3+) affinity chromatography. Anal. Biochem. 1986, 154 (1), 250−4. (6) Iliuk , A. B.; Martin, V. A.; Alicie, B. M
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whiteaker, Jeffrey R.; Halusa, Goran; Hoofnagle, Andrew N.
2016-02-12
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI) has launched an Assay Portal (http://assays.cancer.gov) to serve as an open-source repository of well-characterized targeted proteomic assays. The portal is designed to curate and disseminate highly characterized, targeted mass spectrometry (MS)-based assays by providing detailed assay performance characterization data, standard operating procedures, and access to reagents. Assay content is accessed via the portal through queries to find assays targeting proteins associated with specific cellular pathways, protein complexes, or specific chromosomal regions. The position of the peptide analytes for which there are available assays are mapped relative tomore » other features of interest in the protein, such as sequence domains, isoforms, single nucleotide polymorphisms, and post-translational modifications. The overarching goals are to enable robust quantification of all human proteins and to standardize the quantification of targeted MS-based assays to ultimately enable harmonization of results over time and across laboratories.« less
Whiteaker, Jeffrey R; Halusa, Goran N; Hoofnagle, Andrew N; Sharma, Vagisha; MacLean, Brendan; Yan, Ping; Wrobel, John A; Kennedy, Jacob; Mani, D R; Zimmerman, Lisa J; Meyer, Matthew R; Mesri, Mehdi; Boja, Emily; Carr, Steven A; Chan, Daniel W; Chen, Xian; Chen, Jing; Davies, Sherri R; Ellis, Matthew J C; Fenyö, David; Hiltke, Tara; Ketchum, Karen A; Kinsinger, Chris; Kuhn, Eric; Liebler, Daniel C; Liu, Tao; Loss, Michael; MacCoss, Michael J; Qian, Wei-Jun; Rivers, Robert; Rodland, Karin D; Ruggles, Kelly V; Scott, Mitchell G; Smith, Richard D; Thomas, Stefani; Townsend, R Reid; Whiteley, Gordon; Wu, Chaochao; Zhang, Hui; Zhang, Zhen; Rodriguez, Henry; Paulovich, Amanda G
2016-01-01
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI) has launched an Assay Portal (http://assays.cancer.gov) to serve as an open-source repository of well-characterized targeted proteomic assays. The portal is designed to curate and disseminate highly characterized, targeted mass spectrometry (MS)-based assays by providing detailed assay performance characterization data, standard operating procedures, and access to reagents. Assay content is accessed via the portal through queries to find assays targeting proteins associated with specific cellular pathways, protein complexes, or specific chromosomal regions. The position of the peptide analytes for which there are available assays are mapped relative to other features of interest in the protein, such as sequence domains, isoforms, single nucleotide polymorphisms, and posttranslational modifications. The overarching goals are to enable robust quantification of all human proteins and to standardize the quantification of targeted MS-based assays to ultimately enable harmonization of results over time and across laboratories.
Yang, Ting; Chen, Fei; Xu, Feifei; Wang, Fengliang; Xu, Qingqing; Chen, Yun
2014-09-25
P-glycoprotein (P-gp) can efflux drugs from cancer cells, and its overexpression is commonly associated with multi-drug resistance (MDR). Thus, the accurate quantification of P-gp would help predict the response to chemotherapy and for prognosis of breast cancer patients. An advanced liquid chromatography-tandem mass spectrometry (LC/MS/MS)-based targeted proteomics assay was developed and validated for monitoring P-gp levels in breast tissue. Tryptic peptide 368IIDNKPSIDSYSK380 was selected as a surrogate analyte for quantification, and immuno-depleted tissue extract was used as a surrogate matrix. Matched pairs of breast tissue samples from 60 patients who were suspected to have drug resistance were subject to analysis. The levels of P-gp were quantified. Using data from normal tissue, we suggested a P-gp reference interval. The experimental values of tumor tissue samples were compared with those obtained from Western blotting and immunohistochemistry (IHC). The result indicated that the targeted proteomics approach was comparable to IHC but provided a lower limit of quantification (LOQ) and could afford more reliable results at low concentrations than the other two methods. LC/MS/MS-based targeted proteomics may allow the quantification of P-gp in breast tissue in a more accurate manner. Copyright © 2014 Elsevier B.V. All rights reserved.
2011-01-01
Background Since its inception, proteomics has essentially operated in a discovery mode with the goal of identifying and quantifying the maximal number of proteins in a sample. Increasingly, proteomic measurements are also supporting hypothesis-driven studies, in which a predetermined set of proteins is consistently detected and quantified in multiple samples. Selected reaction monitoring (SRM) is a targeted mass spectrometric technique that supports the detection and quantification of specific proteins in complex samples at high sensitivity and reproducibility. Here, we describe ATAQS, an integrated software platform that supports all stages of targeted, SRM-based proteomics experiments including target selection, transition optimization and post acquisition data analysis. This software will significantly facilitate the use of targeted proteomic techniques and contribute to the generation of highly sensitive, reproducible and complete datasets that are particularly critical for the discovery and validation of targets in hypothesis-driven studies in systems biology. Result We introduce a new open source software pipeline, ATAQS (Automated and Targeted Analysis with Quantitative SRM), which consists of a number of modules that collectively support the SRM assay development workflow for targeted proteomic experiments (project management and generation of protein, peptide and transitions and the validation of peptide detection by SRM). ATAQS provides a flexible pipeline for end-users by allowing the workflow to start or end at any point of the pipeline, and for computational biologists, by enabling the easy extension of java algorithm classes for their own algorithm plug-in or connection via an external web site. This integrated system supports all steps in a SRM-based experiment and provides a user-friendly GUI that can be run by any operating system that allows the installation of the Mozilla Firefox web browser. Conclusions Targeted proteomics via SRM is a powerful new technique that enables the reproducible and accurate identification and quantification of sets of proteins of interest. ATAQS is the first open-source software that supports all steps of the targeted proteomics workflow. ATAQS also provides software API (Application Program Interface) documentation that enables the addition of new algorithms to each of the workflow steps. The software, installation guide and sample dataset can be found in http://tools.proteomecenter.org/ATAQS/ATAQS.html PMID:21414234
Proteomic analysis of albumin and globulin fractions of pea (Pisum sativum L.) seeds.
Dziuba, Jerzy; Szerszunowicz, Iwona; Nałęcz, Dorota; Dziuba, Marta
2014-01-01
Proteomic analysis is emerging as a highly useful tool in food research, including studies of food allergies. Two-dimensional gel electrophoresis involving isoelectric focusing and sodium dodecyl sulfate polyacrylamide gel electrophoresis is the most effective method of separating hundreds or even thousands of proteins. In this study, albumin and globulin tractions of pea seeds cv. Ramrod were subjected to proteomic analysis. Selected potentially alergenic proteins were identified based on their molecular weights and isoelectric points. Pea seeds (Pisum sativum L.) cv. Ramrod harvested over a period of two years (Plant Breeding Station in Piaski-Szelejewo) were used in the experiment. The isolated albumins, globulins and legumin and vicilin fractions of globulins were separated by two-dimensional gel electrophoresis. Proteomic images were analysed in the ImageMaster 2D Platinum program with the use of algorithms from the Melanie application. The relative content, isoelectric points and molecular weights were computed for all identified proteins. Electrophoregrams were analysed by matching spot positions from three independent replications. The proteomes of albumins, globulins and legumin and vicilin fractions of globulins produced up to several hundred spots (proteins). Spots most characteristic of a given fraction were identified by computer analysis and spot matching. The albumin proteome accumulated spots of relatively high intensity over a broad range of pi values of ~4.2-8.1 in 3 molecular weight (MW) ranges: I - high molecular-weight albumins with MW of ~50-110 kDa, II - average molecular-weight albumins with MW of ~20-35 kDa, and III - low molecular-weight albumins with MW of ~13-17 kDa. 2D gel electrophoregrams revealed the presence of 81 characteristic spots, including 24 characteristic of legumin and 14 - of vicilin. Two-dimensional gel electrophoresis proved to be a useful tool for identifying pea proteins. Patterns of spots with similar isoelectric points and different molecular weights or spots with different isoelectric points and similar molecular weights play an important role in proteome analysis. The regions characteristic of albumin, globulin and legumin and vicilin fractions of globulin with typical MW and pi values were identified as the results of performed 2D electrophoretic separations of pea proteins. 2D gel electrophoresis of albumins and the vicilin fraction of globulins revealed the presence of 4 and 2 spots, respectively, representing potentially allergenic proteins. They probably corresponded to vicilin fragments synthesized during post-translational modification of the analysed protein.
Estimation of the proteomic cancer co-expression sub networks by using association estimators.
Erdoğan, Cihat; Kurt, Zeyneb; Diri, Banu
2017-01-01
In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators' performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists.
Sample handling for mass spectrometric proteomic investigations of human sera.
West-Nielsen, Mikkel; Høgdall, Estrid V; Marchiori, Elena; Høgdall, Claus K; Schou, Christian; Heegaard, Niels H H
2005-08-15
Proteomic investigations of sera are potentially of value for diagnosis, prognosis, choice of therapy, and disease activity assessment by virtue of discovering new biomarkers and biomarker patterns. Much debate focuses on the biological relevance and the need for identification of such biomarkers while less effort has been invested in devising standard procedures for sample preparation and storage in relation to model building based on complex sets of mass spectrometric (MS) data. Thus, development of standardized methods for collection and storage of patient samples together with standards for transportation and handling of samples are needed. This requires knowledge about how sample processing affects MS-based proteome analyses and thereby how nonbiological biased classification errors are avoided. In this study, we characterize the effects of sample handling, including clotting conditions, storage temperature, storage time, and freeze/thaw cycles, on MS-based proteomics of human serum by using principal components analysis, support vector machine learning, and clustering methods based on genetic algorithms as class modeling and prediction methods. Using spiking to artificially create differentiable sample groups, this integrated approach yields data that--even when working with sample groups that differ more than may be expected in biological studies--clearly demonstrate the need for comparable sampling conditions for samples used for modeling and for the samples that are going into the test set group. Also, the study emphasizes the difference between class prediction and class comparison studies as well as the advantages and disadvantages of different modeling methods.
Estimation of the proteomic cancer co-expression sub networks by using association estimators
Kurt, Zeyneb; Diri, Banu
2017-01-01
In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators’ performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists. PMID:29145449
Chen, Xiaoguang; Xu, Cunshuan
2017-06-01
After planarian tail is cut off, posterior end of the remaining fragment will regenerate a new tail within about 1 week. However, many details of this process remain unclear up to date. For this reason, we performed the dynamic proteomic analysis of the regenerating tail fragments at 6, 12, 24, 72, 120, and 168 h post-amputation (hpa). Using two-dimensional electrophoresis (2-DE) in combination with MALDI-TOF-TOF/MS analysis, a total of 1088 peptides were identified as significantly changed between tail-cutting groups and 0-h group, 482 of which have identifiable protein names. Of these 482 proteins, there were 111 originating from the Turbellaria. Protein functional categorization showed that these 111 proteins are mainly related to differentiation and development, transcription and translation, cell signal transduction, and cell proliferation. The screening of key protein considered the transcription factor Smad4 as important protein for planarian tail regeneration. Cell signaling pathway analysis, combined with proteomic profiling of regenerating tail fragment, showed that TGFβ/Smad4 pathway was activated during planarian tail regeneration. Based on a comprehensive analysis of 2-DE MALDI-TOF-TOF/MS and bioinformatics analyses, it could be concluded that TGFβ/Smad4 pathway perhaps plays an important role in tail regeneration via promoting cell differentiation.
Xu, Zhongwei; Jin, Xiaohan; Cai, Wei; Zhou, Maobin; Shao, Ping; Yang, Zhen; Fu, Rong; Cao, Jin; Liu, Yan; Yu, Fang; Fan, Rong; Zhang, Yan; Zou, Shuang; Zhou, Xin; Yang, Ning; Chen, Xu; Li, Yuming
2018-04-20
Early-onset preeclampsia (EOS-PE) refers to preeclampsia that occurred before 34 gestation weeks. This study is conducted to explore the relationship between mitochondrial dysfunction and the pathogenesis of EOS-PE using proteomic strategy. To identify altering expressed mitochondrial proteins between severe EOS-PE and healthy pregnancies, enrichment of mitochondria coupled with iTRAQ-based quantitative proteomic method is performed. Immunohistochemistry (IHC) and western blot are performed to detect the alteration of changing expression proteins, and confirmed the accuracy of proteomic results. A total of 1372 proteins were quantified and 132 altering expressed proteins were screened, including 86 downregulated expression proteins and 46 upregulated expression proteins (p < 0.05). Bioinformatics analysis showed that differentially expressed proteins participated in numerous biological processes, including oxidation-reduction process, respiratory electron transport chain, and oxidative phosphorylation. Especially, mitochondria-related molecules, PRDX2, PARK7, BNIP3, BCL2, PDHA1, SUCLG1, ACADM, and NDUFV1, are involved in energy-production process in the matrix and membrane of mitochondria. Results of the experiment show that abnormal electron transport, excessive oxidative stress, and mitochondrion disassembly might be the main cause of mitochondrial dysfunction, and is related to the pathogenesis of EOS-PE. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic Analysis of the Human Olfactory Bulb.
Dammalli, Manjunath; Dey, Gourav; Madugundu, Anil K; Kumar, Manish; Rodrigues, Benvil; Gowda, Harsha; Siddaiah, Bychapur Gowrishankar; Mahadevan, Anita; Shankar, Susarla Krishna; Prasad, Thottethodi Subrahmanya Keshava
2017-08-01
The importance of olfaction to human health and disease is often underappreciated. Olfactory dysfunction has been reported in association with a host of common complex diseases, including neurological diseases such as Alzheimer's disease and Parkinson's disease. For health, olfaction or the sense of smell is also important for most mammals, for optimal engagement with their environment. Indeed, animals have developed sophisticated olfactory systems to detect and interpret the rich information presented to them to assist in day-to-day activities such as locating food sources, differentiating food from poisons, identifying mates, promoting reproduction, avoiding predators, and averting death. In this context, the olfactory bulb is a vital component of the olfactory system receiving sensory information from the axons of the olfactory receptor neurons located in the nasal cavity and the first place that processes the olfactory information. We report in this study original observations on the human olfactory bulb proteome in healthy subjects, using a high-resolution mass spectrometry-based proteomic approach. We identified 7750 nonredundant proteins from human olfactory bulbs. Bioinformatics analysis of these proteins showed their involvement in biological processes associated with signal transduction, metabolism, transport, and olfaction. These new observations provide a crucial baseline molecular profile of the human olfactory bulb proteome, and should assist the future discovery of biomarker proteins and novel diagnostics associated with diseases characterized by olfactory dysfunction.
Predicting Amyloidogenic Proteins in the Proteomes of Plants.
Antonets, Kirill S; Nizhnikov, Anton A
2017-10-16
Amyloids are protein fibrils with characteristic spatial structure. Though amyloids were long perceived to be pathogens that cause dozens of incurable pathologies in humans and mammals, it is currently clear that amyloids also represent a functionally important form of protein structure implicated in a variety of biological processes in organisms ranging from archaea and bacteria to fungi and animals. Despite their social significance, plants remain the most poorly studied group of organisms in the field of amyloid biology. To date, amyloid properties have only been demonstrated in vitro or in heterologous systems for a small number of plant proteins. Here, for the first time, we performed a comprehensive analysis of the distribution of potentially amyloidogenic proteins in the proteomes of approximately 70 species of land plants using the Waltz and SARP (Sequence Analysis based on the Ranking of Probabilities) bioinformatic algorithms. We analyzed more than 2.9 million protein sequences and found that potentially amyloidogenic proteins are abundant in plant proteomes. We found that such proteins are overrepresented among membrane as well as DNA- and RNA-binding proteins of plants. Moreover, seed storage and defense proteins of most plant species are rich in amyloidogenic regions. Taken together, our data demonstrate the diversity of potentially amyloidogenic proteins in plant proteomes and suggest biological processes where formation of amyloids might be functionally important.
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.
Savitski, Mikhail M; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus
2015-09-01
Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets
Savitski, Mikhail M.; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus
2015-01-01
Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target–decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target–decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The “picked” protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The “picked” target–decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used “classic” protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. PMID:25987413
DOE Office of Scientific and Technical Information (OSTI.GOV)
Denef, Vincent; Shah, Manesh B; Verberkmoes, Nathan C
The recent surge in microbial genomic sequencing, combined with the development of high-throughput liquid chromatography-mass-spectrometry-based (LC/LC-MS/MS) proteomics, has raised the question of the extent to which genomic information of one strain or environmental sample can be used to profile proteomes of related strains or samples. Even with decreasing sequencing costs, it remains impractical to obtain genomic sequence for every strain or sample analyzed. Here, we evaluate how shotgun proteomics is affected by amino acid divergence between the sample and the genomic database using a probability-based model and a random mutation simulation model constrained by experimental data. To assess the effectsmore » of nonrandom distribution of mutations, we also evaluated identification levels using in silico peptide data from sequenced isolates with average amino acid identities (AAI) varying between 76 and 98%. We compared the predictions to experimental protein identification levels for a sample that was evaluated using a database that included genomic information for the dominant organism and for a closely related variant (95% AAI). The range of models set the boundaries at which half of the proteins in a proteomic experiment can be identified to be 77-92% AAI between orthologs in the sample and database. Consistent with this prediction, experimental data indicated loss of half the identifiable proteins at 90% AAI. Additional analysis indicated a 6.4% reduction of the initial protein coverage per 1% amino acid divergence and total identification loss at 86% AAI. Consequently, shotgun proteomics is capable of cross-strain identifications but avoids most crossspecies false positives.« less
Mass spectrometry-based proteomics for translational research: a technical overview.
Paulo, Joao A; Kadiyala, Vivek; Banks, Peter A; Steen, Hanno; Conwell, Darwin L
2012-03-01
Mass spectrometry-based investigation of clinical samples enables the high-throughput identification of protein biomarkers. We provide an overview of mass spectrometry-based proteomic techniques that are applicable to the investigation of clinical samples. We address sample collection, protein extraction and fractionation, mass spectrometry modalities, and quantitative proteomics. Finally, we examine the limitations and further potential of such technologies. Liquid chromatography fractionation coupled with tandem mass spectrometry is well suited to handle mixtures of hundreds or thousands of proteins. Mass spectrometry-based proteome elucidation can reveal potential biomarkers and aid in the development of hypotheses for downstream investigation of the molecular mechanisms of disease.
Mass Spectrometry-Based Proteomics for Translational Research: A Technical Overview
Paulo, Joao A.; Kadiyala, Vivek; Banks, Peter A.; Steen, Hanno; Conwell, Darwin L.
2012-01-01
Mass spectrometry-based investigation of clinical samples enables the high-throughput identification of protein biomarkers. We provide an overview of mass spectrometry-based proteomic techniques that are applicable to the investigation of clinical samples. We address sample collection, protein extraction and fractionation, mass spectrometry modalities, and quantitative proteomics. Finally, we examine the limitations and further potential of such technologies. Liquid chromatography fractionation coupled with tandem mass spectrometry is well suited to handle mixtures of hundreds or thousands of proteins. Mass spectrometry-based proteome elucidation can reveal potential biomarkers and aid in the development of hypotheses for downstream investigation of the molecular mechanisms of disease. PMID:22461744
Early Prediction of Lupus Nephritis Using Advanced Proteomics
2010-06-01
SELDI-TOF-MS. Additional proteomic profiling studies using NMR- and MS-based metabonomics have been completed, and LC/MS based protein profiling using...Flight mass spectrometry (SELDI-TOF-MS). Changes in proteomic profiles will be confirmed and enhanced using NMR- and MS-based metabonomics , by Dr...performed using NMR- and MS-based metabonomics at Miami University, in the laboratory of Dr. Michael Kennedy. Initial spectra and profiles obtained show
Van Cutsem, Emmanuel; Simonart, Géraldine; Degand, Hervé; Faber, Anne-Marie; Morsomme, Pierre; Boutry, Marc
2011-02-01
Nicotiana tabacum leaves are covered by trichomes involved in the secretion of large amounts of secondary metabolites, some of which play a major role in plant defense. However, little is known about the metabolic pathways that operate in these structures. We undertook a proteomic analysis of N. tabacum trichomes in order to identify their protein complement. Efficient trichome isolation was obtained by abrading frozen leaves. After homogenization, soluble proteins and a microsomal fraction were prepared by centrifugation. Gel-based and gel-free proteomic analyses were then performed. 2-DE analysis of soluble proteins led to the identification of 1373 protein spots, which were digested and analyzed by MS/MS, leading to 680 unique identifications. Both soluble proteins and microsomal fraction were analyzed by LC MALDI-MS/MS after trypsin digestion, leading to 858 identifications, many of which had not been identified after 2-DE, indicating that the two methods complement each other. Many enzymes putatively involved in secondary metabolism were identified, including enzymes involved in the synthesis of terpenoid precursors and in acyl sugar production. Several transporters were also identified, some of which might be involved in secondary metabolite transport. Various (a)biotic stress response proteins were also detected, supporting the role of trichomes in plant defense. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data.
Tekwe, Carmen D; Carroll, Raymond J; Dabney, Alan R
2012-08-01
Protein abundance in quantitative proteomics is often based on observed spectral features derived from liquid chromatography mass spectrometry (LC-MS) or LC-MS/MS experiments. Peak intensities are largely non-normal in distribution. Furthermore, LC-MS-based proteomics data frequently have large proportions of missing peak intensities due to censoring mechanisms on low-abundance spectral features. Recognizing that the observed peak intensities detected with the LC-MS method are all positive, skewed and often left-censored, we propose using survival methodology to carry out differential expression analysis of proteins. Various standard statistical techniques including non-parametric tests such as the Kolmogorov-Smirnov and Wilcoxon-Mann-Whitney rank sum tests, and the parametric survival model and accelerated failure time-model with log-normal, log-logistic and Weibull distributions were used to detect any differentially expressed proteins. The statistical operating characteristics of each method are explored using both real and simulated datasets. Survival methods generally have greater statistical power than standard differential expression methods when the proportion of missing protein level data is 5% or more. In particular, the AFT models we consider consistently achieve greater statistical power than standard testing procedures, with the discrepancy widening with increasing missingness in the proportions. The testing procedures discussed in this article can all be performed using readily available software such as R. The R codes are provided as supplemental materials. ctekwe@stat.tamu.edu.
Majeran, Wojciech; Friso, Giulia; Ponnala, Lalit; Connolly, Brian; Huang, Mingshu; Reidel, Edwin; Zhang, Cankui; Asakura, Yukari; Bhuiyan, Nazmul H; Sun, Qi; Turgeon, Robert; van Wijk, Klaas J
2010-11-01
C(4) grasses, such as maize (Zea mays), have high photosynthetic efficiency through combined biochemical and structural adaptations. C(4) photosynthesis is established along the developmental axis of the leaf blade, leading from an undifferentiated leaf base just above the ligule into highly specialized mesophyll cells (MCs) and bundle sheath cells (BSCs) at the tip. To resolve the kinetics of maize leaf development and C(4) differentiation and to obtain a systems-level understanding of maize leaf formation, the accumulation profiles of proteomes of the leaf and the isolated BSCs with their vascular bundle along the developmental gradient were determined using large-scale mass spectrometry. This was complemented by extensive qualitative and quantitative microscopy analysis of structural features (e.g., Kranz anatomy, plasmodesmata, cell wall, and organelles). More than 4300 proteins were identified and functionally annotated. Developmental protein accumulation profiles and hierarchical cluster analysis then determined the kinetics of organelle biogenesis, formation of cellular structures, metabolism, and coexpression patterns. Two main expression clusters were observed, each divided in subclusters, suggesting that a limited number of developmental regulatory networks organize concerted protein accumulation along the leaf gradient. The coexpression with BSC and MC markers provided strong candidates for further analysis of C(4) specialization, in particular transporters and biogenesis factors. Based on the integrated information, we describe five developmental transitions that provide a conceptual and practical template for further analysis. An online protein expression viewer is provided through the Plant Proteome Database.
A catalogue of molecular aberrations that cause ovarian cancer is critical for developing and deploying diagnostics and therapies that will improve patients’ lives. Because a comprehensive molecular view of cancer is important for ultimately guiding treatment, the National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) has released the cancer proteome confirmatory ovarian study data sets.
Alvarez, Sophie; Roy Choudhury, Swarup; Pandey, Sona
2014-03-07
Wheat is one of the most highly cultivated cereals in the world. Like other cultivated crops, wheat production is significantly affected by abiotic stresses such as drought. Multiple wheat varieties suitable for different geographical regions of the world have been developed that are adapted to different environmental conditions; however, the molecular basis of such adaptations remains unknown in most cases. We have compared the quantitative proteomics profile of the roots of two different wheat varieties, Nesser (drought-tolerant) and Opata (drought-sensitive), in the absence and presence of abscisic acid (ABA, as a proxy for drought). A labeling LC-based quantitative proteomics approach using iTRAQ was applied to elucidate the changes in protein abundance levels. Quantitative differences in protein levels were analyzed for the evaluation of inherent differences between the two varieties as well as the overall and variety-specific effect of ABA on the root proteome. This study reveals the most elaborate ABA-responsive root proteome identified to date in wheat. A large number of proteins exhibited inherently different expression levels between Nesser and Opata. Additionally, significantly higher numbers of proteins were ABA-responsive in Nesser roots compared with Opata roots. Furthermore, several proteins showed variety-specific regulation by ABA, suggesting their role in drought adaptation.
Advances of Proteomic Sciences in Dentistry.
Khurshid, Zohaib; Zohaib, Sana; Najeeb, Shariq; Zafar, Muhammad Sohail; Rehman, Rabia; Rehman, Ihtesham Ur
2016-05-13
Applications of proteomics tools revolutionized various biomedical disciplines such as genetics, molecular biology, medicine, and dentistry. The aim of this review is to highlight the major milestones in proteomics in dentistry during the last fifteen years. Human oral cavity contains hard and soft tissues and various biofluids including saliva and crevicular fluid. Proteomics has brought revolution in dentistry by helping in the early diagnosis of various diseases identified by the detection of numerous biomarkers present in the oral fluids. This paper covers the role of proteomics tools for the analysis of oral tissues. In addition, dental materials proteomics and their future directions are discussed.
Van, Phu T; Schmid, Amy K; King, Nichole L; Kaur, Amardeep; Pan, Min; Whitehead, Kenia; Koide, Tie; Facciotti, Marc T; Goo, Young Ah; Deutsch, Eric W; Reiss, David J; Mallick, Parag; Baliga, Nitin S
2008-09-01
The relatively small numbers of proteins and fewer possible post-translational modifications in microbes provide a unique opportunity to comprehensively characterize their dynamic proteomes. We have constructed a PeptideAtlas (PA) covering 62.7% of the predicted proteome of the extremely halophilic archaeon Halobacterium salinarum NRC-1 by compiling approximately 636 000 tandem mass spectra from 497 mass spectrometry runs in 88 experiments. Analysis of the PA with respect to biophysical properties of constituent peptides, functional properties of parent proteins of detected peptides, and performance of different mass spectrometry approaches has highlighted plausible strategies for improving proteome coverage and selecting signature peptides for targeted proteomics. Notably, discovery of a significant correlation between absolute abundances of mRNAs and proteins has helped identify low abundance of proteins as the major limitation in peptide detection. Furthermore, we have discovered that iTRAQ labeling for quantitative proteomic analysis introduces a significant bias in peptide detection by mass spectrometry. Therefore, despite identifying at least one proteotypic peptide for almost all proteins in the PA, a context-dependent selection of proteotypic peptides appears to be the most effective approach for targeted proteomics.
Stable isotope labelling methods in mass spectrometry-based quantitative proteomics.
Chahrour, Osama; Cobice, Diego; Malone, John
2015-09-10
Mass-spectrometry based proteomics has evolved as a promising technology over the last decade and is undergoing a dramatic development in a number of different areas, such as; mass spectrometric instrumentation, peptide identification algorithms and bioinformatic computational data analysis. The improved methodology allows quantitative measurement of relative or absolute protein amounts, which is essential for gaining insights into their functions and dynamics in biological systems. Several different strategies involving stable isotopes label (ICAT, ICPL, IDBEST, iTRAQ, TMT, IPTL, SILAC), label-free statistical assessment approaches (MRM, SWATH) and absolute quantification methods (AQUA) are possible, each having specific strengths and weaknesses. Inductively coupled plasma mass spectrometry (ICP-MS), which is still widely recognised as elemental detector, has recently emerged as a complementary technique to the previous methods. The new application area for ICP-MS is targeting the fast growing field of proteomics related research, allowing absolute protein quantification using suitable elemental based tags. This document describes the different stable isotope labelling methods which incorporate metabolic labelling in live cells, ICP-MS based detection and post-harvest chemical label tagging for protein quantification, in addition to summarising their pros and cons. Copyright © 2015 Elsevier B.V. All rights reserved.
Topic model-based mass spectrometric data analysis in cancer biomarker discovery studies.
Wang, Minkun; Tsai, Tsung-Heng; Di Poto, Cristina; Ferrarini, Alessia; Yu, Guoqiang; Ressom, Habtom W
2016-08-18
A fundamental challenge in quantitation of biomolecules for cancer biomarker discovery is owing to the heterogeneous nature of human biospecimens. Although this issue has been a subject of discussion in cancer genomic studies, it has not yet been rigorously investigated in mass spectrometry based proteomic and metabolomic studies. Purification of mass spectometric data is highly desired prior to subsequent analysis, e.g., quantitative comparison of the abundance of biomolecules in biological samples. We investigated topic models to computationally analyze mass spectrometric data considering both integrated peak intensities and scan-level features, i.e., extracted ion chromatograms (EICs). Probabilistic generative models enable flexible representation in data structure and infer sample-specific pure resources. Scan-level modeling helps alleviate information loss during data preprocessing. We evaluated the capability of the proposed models in capturing mixture proportions of contaminants and cancer profiles on LC-MS based serum proteomic and GC-MS based tissue metabolomic datasets acquired from patients with hepatocellular carcinoma (HCC) and liver cirrhosis as well as synthetic data we generated based on the serum proteomic data. The results we obtained by analysis of the synthetic data demonstrated that both intensity-level and scan-level purification models can accurately infer the mixture proportions and the underlying true cancerous sources with small average error ratios (<7 %) between estimation and ground truth. By applying the topic model-based purification to mass spectrometric data, we found more proteins and metabolites with significant changes between HCC cases and cirrhotic controls. Candidate biomarkers selected after purification yielded biologically meaningful pathway analysis results and improved disease discrimination power in terms of the area under ROC curve compared to the results found prior to purification. We investigated topic model-based inference methods to computationally address the heterogeneity issue in samples analyzed by LC/GC-MS. We observed that incorporation of scan-level features have the potential to lead to more accurate purification results by alleviating the loss in information as a result of integrating peaks. We believe cancer biomarker discovery studies that use mass spectrometric analysis of human biospecimens can greatly benefit from topic model-based purification of the data prior to statistical and pathway analyses.
The journal Molecular & Cellular Proteomics (MCP), in collaboration with the Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI), part of the National Institutes of Health, announce new guidelines and requirements for papers describing the development and application of targeted mass spectrometry measurements of peptides, modified peptides and proteins (Mol Cell Proteomics 2017; PMID: 28183812). NCI’s participation is part of NIH’s overall effort to address the r
An estimated 252,710 new cases of female breast cancer, accounting for 15% of all new cancer cases, occurred in 2017. To better understand proteogenomic abnormalities in breast cancer, the National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) announces the release of the cancer proteome confirmatory breast study data. The goal of the study was to comprehensively characterize the proteome and phosphoproteome on approximately 100 prospectively collected breast tumor and adjacent normal tissues.
Proteomic analysis of bovine nucleolus.
Patel, Amrutlal K; Olson, Doug; Tikoo, Suresh K
2010-09-01
Nucleolus is the most prominent subnuclear structure, which performs a wide variety of functions in the eukaryotic cellular processes. In order to understand the structural and functional role of the nucleoli in bovine cells, we analyzed the proteomic composition of the bovine nucleoli. The nucleoli were isolated from Madin Darby bovine kidney cells and subjected to proteomic analysis by LC-MS/MS after fractionation by SDS-PAGE and strong cation exchange chromatography. Analysis of the data using the Mascot database search and the GPM database search identified 311 proteins in the bovine nucleoli, which contained 22 proteins previously not identified in the proteomic analysis of human nucleoli. Analysis of the identified proteins using the GoMiner software suggested that the bovine nucleoli contained proteins involved in ribosomal biogenesis, cell cycle control, transcriptional, translational and post-translational regulation, transport, and structural organization. Copyright © 2010 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
Proteomics Quality Control: Quality Control Software for MaxQuant Results.
Bielow, Chris; Mastrobuoni, Guido; Kempa, Stefan
2016-03-04
Mass spectrometry-based proteomics coupled to liquid chromatography has matured into an automatized, high-throughput technology, producing data on the scale of multiple gigabytes per instrument per day. Consequently, an automated quality control (QC) and quality analysis (QA) capable of detecting measurement bias, verifying consistency, and avoiding propagation of error is paramount for instrument operators and scientists in charge of downstream analysis. We have developed an R-based QC pipeline called Proteomics Quality Control (PTXQC) for bottom-up LC-MS data generated by the MaxQuant software pipeline. PTXQC creates a QC report containing a comprehensive and powerful set of QC metrics, augmented with automated scoring functions. The automated scores are collated to create an overview heatmap at the beginning of the report, giving valuable guidance also to nonspecialists. Our software supports a wide range of experimental designs, including stable isotope labeling by amino acids in cell culture (SILAC), tandem mass tags (TMT), and label-free data. Furthermore, we introduce new metrics to score MaxQuant's Match-between-runs (MBR) functionality by which peptide identifications can be transferred across Raw files based on accurate retention time and m/z. Last but not least, PTXQC is easy to install and use and represents the first QC software capable of processing MaxQuant result tables. PTXQC is freely available at https://github.com/cbielow/PTXQC .
Application of tissue mesodissection to molecular cancer diagnostics.
Krizman, David; Adey, Nils; Parry, Robert
2015-02-01
To demonstrate clinical application of a mesodissection platform that was developed to combine advantages of laser-based instrumentation with the speed/ease of manual dissection for automated dissection of tissue off standard glass slides. Genomic analysis for KRAS gene mutation was performed on formalin fixed paraffin embedded (FFPE) cancer patient tissue that was dissected using the mesodissection platform. Selected reaction monitoring proteomic analysis for quantitative Her2 protein expression was performed on FFPE patient tumour tissue dissected by a laser-based instrument and the MilliSect instrument. Genomic analysis demonstrates highly confident detection of KRAS mutation specifically in lung cancer cells and not the surrounding benign, non-tumour tissue. Proteomic analysis demonstrates Her2 quantitative protein expression in breast cancer cells dissected manually, by laser-based instrumentation and by MilliSect instrumentation (mesodissection). Slide-mounted tissue dissection is commonly performed using laser-based instruments or manually scraping tissue by scalpel. Here we demonstrate that the mesodissection platform as performed by the MilliSect instrument for tissue dissection is cost-effective; it functions comparably to laser-based dissection and which can be adopted into a clinical diagnostic workflow. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Keller, Martin; Hettich, Robert
2009-03-01
The increase in sequencing capacity led to a new wave of metagenomic projects, enabling and setting the prerequisite for the application of environmental proteomics technologies. This review describes the current status of environmental proteomics. It describes sample preparation as well as the two major technologies applied within this field: two-dimensional electrophoresis-based environmental proteomics and liquid chromatography-mass spectrometry-based environmental proteomics. It also highlights current publications and describes major scientific findings. The review closes with a discussion of critical improvements in the area of integrating experimental mass spectrometry technologies with bioinformatics as well as improved sample handling.
Proteomic analysis of Rhodotorula mucilaginosa: dealing with the issues of a non-conventional yeast.
Addis, Maria Filippa; Tanca, Alessandro; Landolfo, Sara; Abbondio, Marcello; Cutzu, Raffaela; Biosa, Grazia; Pagnozzi, Daniela; Uzzau, Sergio; Mannazzu, Ilaria
2016-08-01
Red yeasts ascribed to the species Rhodotorula mucilaginosa are gaining increasing attention, due to their numerous biotechnological applications, spanning carotenoid production, liquid bioremediation, heavy metal biotransformation and antifungal and plant growth-promoting actions, but also for their role as opportunistic pathogens. Nevertheless, their characterization at the 'omic' level is still scarce. Here, we applied different proteomic workflows to R. mucilaginosa with the aim of assessing their potential in generating information on proteins and functions of biotechnological interest, with a particular focus on the carotenogenic pathway. After optimization of protein extraction, we tested several gel-based (including 2D-DIGE) and gel-free sample preparation techniques, followed by tandem mass spectrometry analysis. Contextually, we evaluated different bioinformatic strategies for protein identification and interpretation of the biological significance of the dataset. When 2D-DIGE analysis was applied, not all spots returned a unambiguous identification and no carotenogenic enzymes were identified, even upon the application of different database search strategies. Then, the application of shotgun proteomic workflows with varying levels of sensitivity provided a picture of the information depth that can be reached with different analytical resources, and resulted in a plethora of information on R. mucilaginosa metabolism. However, also in these cases no proteins related to the carotenogenic pathway were identified, thus indicating that further improvements in sequence databases and functional annotations are strictly needed for increasing the outcome of proteomic analysis of this and other non-conventional yeasts. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Standardization approaches in absolute quantitative proteomics with mass spectrometry.
Calderón-Celis, Francisco; Encinar, Jorge Ruiz; Sanz-Medel, Alfredo
2017-07-31
Mass spectrometry-based approaches have enabled important breakthroughs in quantitative proteomics in the last decades. This development is reflected in the better quantitative assessment of protein levels as well as to understand post-translational modifications and protein complexes and networks. Nowadays, the focus of quantitative proteomics shifted from the relative determination of proteins (ie, differential expression between two or more cellular states) to absolute quantity determination, required for a more-thorough characterization of biological models and comprehension of the proteome dynamism, as well as for the search and validation of novel protein biomarkers. However, the physico-chemical environment of the analyte species affects strongly the ionization efficiency in most mass spectrometry (MS) types, which thereby require the use of specially designed standardization approaches to provide absolute quantifications. Most common of such approaches nowadays include (i) the use of stable isotope-labeled peptide standards, isotopologues to the target proteotypic peptides expected after tryptic digestion of the target protein; (ii) use of stable isotope-labeled protein standards to compensate for sample preparation, sample loss, and proteolysis steps; (iii) isobaric reagents, which after fragmentation in the MS/MS analysis provide a final detectable mass shift, can be used to tag both analyte and standard samples; (iv) label-free approaches in which the absolute quantitative data are not obtained through the use of any kind of labeling, but from computational normalization of the raw data and adequate standards; (v) elemental mass spectrometry-based workflows able to provide directly absolute quantification of peptides/proteins that contain an ICP-detectable element. A critical insight from the Analytical Chemistry perspective of the different standardization approaches and their combinations used so far for absolute quantitative MS-based (molecular and elemental) proteomics is provided in this review. © 2017 Wiley Periodicals, Inc.
Yang, Qiao-Song; Wu, Jun-Hua; Li, Chun-Yu; Wei, Yue-Rong; Sheng, Ou; Hu, Chun-Hua; Kuang, Rui-Bin; Huang, Yong-Hong; Peng, Xin-Xiang; McCardle, James A; Chen, Wei; Yang, Yong; Rose, Jocelyn K C; Zhang, Sheng; Yi, Gan-Jun
2012-12-01
Banana and its close relative, plantain are globally important crops and there is considerable interest in optimizing their cultivation. Plantain has superior cold tolerance compared with banana and a thorough understanding of the molecular mechanisms and responses of plantain to cold stress has great potential value for developing cold tolerant banana cultivars. In this study, we used iTRAQ-based comparative proteomic analysis to investigate the temporal responses of plantain to cold stress. Plantain seedlings were exposed for 0, 6, and 24 h of cold stress at 8 °C and subsequently allowed to recover for 24 h at 28 °C. A total of 3477 plantain proteins were identified, of which 809 showed differential expression from the three treatments. The majority of differentially expressed proteins were predicted to be involved in oxidation-reduction, including oxylipin biosynthesis, whereas others were associated with photosynthesis, photorespiration, and several primary metabolic processes, such as carbohydrate metabolic process and fatty acid beta-oxidation. Western blot analysis and enzyme activity assays were performed on seven differentially expressed, cold-response candidate plantain proteins to validate the proteomics data. Similar analyses of the seven candidate proteins were performed in cold-sensitive banana to examine possible functional conservation, and to compare the results to equivalent responses between the two species. Consistent results were achieved by Western blot and enzyme activity assays, demonstrating that the quantitative proteomics data collected in this study are reliable. Our results suggest that an increase of antioxidant capacity through adapted ROS scavenging capability, reduced production of ROS, and decreased lipid peroxidation contribute to molecular mechanisms for the increased cold tolerance in plantain. To the best of our knowledge, this is the first report of a global investigation on molecular responses of plantain to cold stress by proteomic analysis.
Yang, Qiao-Song; Wu, Jun-Hua; Li, Chun-Yu; Wei, Yue-Rong; Sheng, Ou; Hu, Chun-Hua; Kuang, Rui-Bin; Huang, Yong-Hong; Peng, Xin-Xiang; McCardle, James A.; Chen, Wei; Yang, Yong; Rose, Jocelyn K. C.; Zhang, Sheng; Yi, Gan-Jun
2012-01-01
Banana and its close relative, plantain are globally important crops and there is considerable interest in optimizing their cultivation. Plantain has superior cold tolerance compared with banana and a thorough understanding of the molecular mechanisms and responses of plantain to cold stress has great potential value for developing cold tolerant banana cultivars. In this study, we used iTRAQ-based comparative proteomic analysis to investigate the temporal responses of plantain to cold stress. Plantain seedlings were exposed for 0, 6, and 24 h of cold stress at 8 °C and subsequently allowed to recover for 24 h at 28 °C. A total of 3477 plantain proteins were identified, of which 809 showed differential expression from the three treatments. The majority of differentially expressed proteins were predicted to be involved in oxidation-reduction, including oxylipin biosynthesis, whereas others were associated with photosynthesis, photorespiration, and several primary metabolic processes, such as carbohydrate metabolic process and fatty acid beta-oxidation. Western blot analysis and enzyme activity assays were performed on seven differentially expressed, cold-response candidate plantain proteins to validate the proteomics data. Similar analyses of the seven candidate proteins were performed in cold-sensitive banana to examine possible functional conservation, and to compare the results to equivalent responses between the two species. Consistent results were achieved by Western blot and enzyme activity assays, demonstrating that the quantitative proteomics data collected in this study are reliable. Our results suggest that an increase of antioxidant capacity through adapted ROS scavenging capability, reduced production of ROS, and decreased lipid peroxidation contribute to molecular mechanisms for the increased cold tolerance in plantain. To the best of our knowledge, this is the first report of a global investigation on molecular responses of plantain to cold stress by proteomic analysis. PMID:22982374
Halobacterium salinarum NRC-1 PeptideAtlas: strategies for targeted proteomics
Van, Phu T.; Schmid, Amy K.; King, Nichole L.; Kaur, Amardeep; Pan, Min; Whitehead, Kenia; Koide, Tie; Facciotti, Marc T.; Goo, Young-Ah; Deutsch, Eric W.; Reiss, David J.; Mallick, Parag; Baliga, Nitin S.
2009-01-01
The relatively small numbers of proteins and fewer possible posttranslational modifications in microbes provides a unique opportunity to comprehensively characterize their dynamic proteomes. We have constructed a Peptide Atlas (PA) for 62.7% of the predicted proteome of the extremely halophilic archaeon Halobacterium salinarum NRC-1 by compiling approximately 636,000 tandem mass spectra from 497 mass spectrometry runs in 88 experiments. Analysis of the PA with respect to biophysical properties of constituent peptides, functional properties of parent proteins of detected peptides, and performance of different mass spectrometry approaches has helped highlight plausible strategies for improving proteome coverage and selecting signature peptides for targeted proteomics. Notably, discovery of a significant correlation between absolute abundances of mRNAs and proteins has helped identify low abundance of proteins as the major limitation in peptide detection. Furthermore we have discovered that iTRAQ labeling for quantitative proteomic analysis introduces a significant bias in peptide detection by mass spectrometry. Therefore, despite identifying at least one proteotypic peptide for almost all proteins in the PA, a context-dependent selection of proteotypic peptides appears to be the most effective approach for targeted proteomics. PMID:18652504
A genome-wide structure-based survey of nucleotide binding proteins in M. tuberculosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhagavat, Raghu; Kim, Heung -Bok; Kim, Chang -Yub
Nucleoside tri-phosphates (NTP) form an important class of small molecule ligands that participate in, and are essential to a large number of biological processes. Here, we seek to identify the NTP binding proteome (NTPome) in M. tuberculosis (M.tb), a deadly pathogen. Identifying the NTPome is useful not only for gaining functional insights of the individual proteins but also for identifying useful drug targets. From an earlier study, we had structural models of M.tb at a proteome scale from which a set of 13,858 small molecule binding pockets were identified. We use a set of NTP binding sub-structural motifs derived frommore » a previous study and scan the M.tb pocketome, and find that 1,768 proteins or 43% of the proteome can theoretically bind NTP ligands. Using an experimental proteomics approach involving dye-ligand affinity chromatography, we confirm NTP binding to 47 different proteins, of which 4 are hypothetical proteins. Our analysis also provides the precise list of binding site residues in each case, and the probable ligand binding pose. In conclusion, as the list includes a number of known and potential drug targets, the identification of NTP binding can directly facilitate structure-based drug design of these targets.« less
NASA Astrophysics Data System (ADS)
Keller, Heath; Cox, James R.
2004-04-01
Students taking courses in different disciplines can work together to add unique elements to their educational experience. A model for this type of pedagogical approach has been established in the Proteomics Stock Market Project, a collaborative effort between instructors and students in the Department of Chemistry and Department of Management, Marketing, and Business Administration at Murray State University. Stage I involved biochemistry students investigating the topic of proteomics and choosing companies for potential investment based only on scientific investigation. Marketing and management students completed Stage II and provided an investment analysis on the companies selected in Stage I. In Stage III, the biochemistry students focused on a particular company and investigated a protein-based therapeutic product. Blackboard software was utilized in each stage of the project to facilitate the exchange of information and electronic documents. This project was designed to give biochemistry students an appreciation for the emerging field of proteomics and the marketing and management students a flavor for real-world applications of business principles. During the project, students were exposed to ideas and concepts not typically covered in their courses. With this involvement, the students had the opportunity to gain a broader perspective of course content compared to a more traditional curriculum.
Boja, Emily S; Rodriguez, Henry
2012-04-01
Traditional shotgun proteomics used to detect a mixture of hundreds to thousands of proteins through mass spectrometric analysis, has been the standard approach in research to profile protein content in a biological sample which could lead to the discovery of new (and all) protein candidates with diagnostic, prognostic, and therapeutic values. In practice, this approach requires significant resources and time, and does not necessarily represent the goal of the researcher who would rather study a subset of such discovered proteins (including their variations or posttranslational modifications) under different biological conditions. In this context, targeted proteomics is playing an increasingly important role in the accurate measurement of protein targets in biological samples in the hope of elucidating the molecular mechanism of cellular function via the understanding of intricate protein networks and pathways. One such (targeted) approach, selected reaction monitoring (or multiple reaction monitoring) mass spectrometry (MRM-MS), offers the capability of measuring multiple proteins with higher sensitivity and throughput than shotgun proteomics. Developing and validating MRM-MS-based assays, however, is an extensive and iterative process, requiring a coordinated and collaborative effort by the scientific community through the sharing of publicly accessible data and datasets, bioinformatic tools, standard operating procedures, and well characterized reagents. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A genome-wide structure-based survey of nucleotide binding proteins in M. tuberculosis
Bhagavat, Raghu; Kim, Heung -Bok; Kim, Chang -Yub; ...
2017-10-02
Nucleoside tri-phosphates (NTP) form an important class of small molecule ligands that participate in, and are essential to a large number of biological processes. Here, we seek to identify the NTP binding proteome (NTPome) in M. tuberculosis (M.tb), a deadly pathogen. Identifying the NTPome is useful not only for gaining functional insights of the individual proteins but also for identifying useful drug targets. From an earlier study, we had structural models of M.tb at a proteome scale from which a set of 13,858 small molecule binding pockets were identified. We use a set of NTP binding sub-structural motifs derived frommore » a previous study and scan the M.tb pocketome, and find that 1,768 proteins or 43% of the proteome can theoretically bind NTP ligands. Using an experimental proteomics approach involving dye-ligand affinity chromatography, we confirm NTP binding to 47 different proteins, of which 4 are hypothetical proteins. Our analysis also provides the precise list of binding site residues in each case, and the probable ligand binding pose. In conclusion, as the list includes a number of known and potential drug targets, the identification of NTP binding can directly facilitate structure-based drug design of these targets.« less
Amjadi, Fatemehsadat; Mehdizadeh, Mehdi; Ashrafi, Mahnaz; Nasrabadi, Davood; Taleahmad, Sara; Mirzaei, Mehdi; Gupta, Vivek; Salekdeh, Ghasem Hosseini; Aflatoonian, Reza
2018-04-21
What is the molecular basis of infertility related to uterine dysfunction in women with polycystic ovary syndrome (PCOS)? In this study, differences in protein expression between PCOS and normal endometrium were identified using a proteomic approach based on two-dimensional electrophoresis (2-DE) coupled with mass spectrometry (MS). The proteome of endometrium were analysed during the proliferative (on day 2 or 3 before ovulation, n = 6) and luteal phases (on day 3-5 after ovulation, n = 6) from healthy women and PCOS patients (12-14 days after spontaneous bleeding, n = 12). The differentially expressed proteins were categorized based on the biological process using the DAVID bioinformatics resources. Over 803 reproducible protein spots were detected on gels, and 150 protein spots showed different intensities between PCOS and normal women during the proliferative and luteal phases. MS analysis detected 70 proteins out of 150 spots. For four of the 70 proteins, 14-3-3 protein, annexin A5, SERPINA1 and cathepsin D, 2-DE results were validated and localized by Western blot and immunohistochemistry, respectively, and their gene expression profiles were confirmed by real-time quantitative PCR. The obtained results corresponded to the proteomic analysis. The differentially expressed proteins identified are known to be involved in apoptosis, oxidative stress, inflammation and the cytoskeleton. The processes related to the differentially expressed proteins play important roles in fecundity and fecundability. The present study may reveal the cause of various endometrial aberrations as a limiting factor for achieving pregnancy in PCOS women. Copyright © 2018 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.
Bergerat, Agnes; Decano, Julius; Wu, Chang-Jiun; Choi, Hyungwon; Nesvizhskii, Alexey I; Moran, Ann Marie; Ruiz-Opazo, Nelson; Steffen, Martin; Herrera, Victoria LM
2011-01-01
Stroke is the third leading cause of death in the United States with high rates of morbidity among survivors. The search to fill the unequivocal need for new therapeutic approaches would benefit from unbiased proteomic analyses of animal models of spontaneous stroke in the prestroke stage. Since brain microvessels play key roles in neurovascular coupling, we investigated prestroke microvascular proteome changes. Proteomic analysis of cerebral cortical microvessels (cMVs) was done by tandem mass spectrometry comparing two prestroke time points. Metaprotein-pathway analyses of proteomic spectral count data were done to identify risk factor–induced changes, followed by QSPEC-analyses of individual protein changes associated with increased stroke susceptibility. We report 26 cMV proteome profiles from male and female stroke-prone and non–stroke-prone rats at 2 months and 4.5 months of age prior to overt stroke events. We identified 1,934 proteins by two or more peptides. Metaprotein pathway analysis detected age-associated changes in energy metabolism and cell-to-microenvironment interactions, as well as sex-specific changes in energy metabolism and endothelial leukocyte transmigration pathways. Stroke susceptibility was associated independently with multiple protein changes associated with ischemia, angiogenesis or involved in blood brain barrier (BBB) integrity. Immunohistochemical analysis confirmed aquaporin-4 and laminin-α1 induction in cMVs, representative of proteomic changes with >65 Bayes factor (BF), associated with stroke susceptibility. Altogether, proteomic analysis demonstrates significant molecular changes in ischemic cerebral microvasculature in the prestroke stage, which could contribute to the observed model phenotype of microhemorrhages and postischemic hemorrhagic transformation. These pathways comprise putative targets for translational research of much needed novel diagnostic and therapeutic approaches for stroke. PMID:21519634
Skyline: an open source document editor for creating and analyzing targeted proteomics experiments
MacLean, Brendan; Tomazela, Daniela M.; Shulman, Nicholas; Chambers, Matthew; Finney, Gregory L.; Frewen, Barbara; Kern, Randall; Tabb, David L.; Liebler, Daniel C.; MacCoss, Michael J.
2010-01-01
Summary: Skyline is a Windows client application for targeted proteomics method creation and quantitative data analysis. It is open source and freely available for academic and commercial use. The Skyline user interface simplifies the development of mass spectrometer methods and the analysis of data from targeted proteomics experiments performed using selected reaction monitoring (SRM). Skyline supports using and creating MS/MS spectral libraries from a wide variety of sources to choose SRM filters and verify results based on previously observed ion trap data. Skyline exports transition lists to and imports the native output files from Agilent, Applied Biosystems, Thermo Fisher Scientific and Waters triple quadrupole instruments, seamlessly connecting mass spectrometer output back to the experimental design document. The fast and compact Skyline file format is easily shared, even for experiments requiring many sample injections. A rich array of graphs displays results and provides powerful tools for inspecting data integrity as data are acquired, helping instrument operators to identify problems early. The Skyline dynamic report designer exports tabular data from the Skyline document model for in-depth analysis with common statistical tools. Availability: Single-click, self-updating web installation is available at http://proteome.gs.washington.edu/software/skyline. This web site also provides access to instructional videos, a support board, an issues list and a link to the source code project. Contact: brendanx@u.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20147306
Urine Sample Preparation in 96-Well Filter Plates for Quantitative Clinical Proteomics
2015-01-01
Urine is an important, noninvasively collected body fluid source for the diagnosis and prognosis of human diseases. Liquid chromatography mass spectrometry (LC-MS) based shotgun proteomics has evolved as a sensitive and informative technique to discover candidate disease biomarkers from urine specimens. Filter-aided sample preparation (FASP) generates peptide samples from protein mixtures of cell lysate or body fluid origin. Here, we describe a FASP method adapted to 96-well filter plates, named 96FASP. Soluble urine concentrates containing ∼10 μg of total protein were processed by 96FASP and LC-MS resulting in 700–900 protein identifications at a 1% false discovery rate (FDR). The experimental repeatability, as assessed by label-free quantification and Pearson correlation analysis for shared proteins among replicates, was high (R ≥ 0.97). Application to urinary pellet lysates which is of particular interest in the context of urinary tract infection analysis was also demonstrated. On average, 1700 proteins (±398) were identified in five experiments. In a pilot study using 96FASP for analysis of eight soluble urine samples, we demonstrated that protein profiles of technical replicates invariably clustered; the protein profiles for distinct urine donors were very different from each other. Robust, highly parallel methods to generate peptide mixtures from urine and other body fluids are critical to increase cost-effectiveness in clinical proteomics projects. This 96FASP method has potential to become a gold standard for high-throughput quantitative clinical proteomics. PMID:24797144
Gonzalez-Begne, Mireya; Lu, Bingwen; Liao, Lujian; Xu, Tao; Bedi, Gurrinder; Melvin, James E.; Yates, John R.
2011-01-01
In-depth analysis of the salivary proteome is fundamental to understanding the functions of salivary proteins in the oral cavity and to reveal disease biomarkers involved in different pathophysiological conditions, with the ultimate goal of improving patient diagnosis and prognosis. Submandibular and sublingual glands contribute saliva rich in glycoproteins to the total saliva output, making them valuable sources for glycoproteomic analysis. Lectin-affinity chromatography coupled to mass spectrometry-based shotgun proteomics was used to explore the submandibular/sublingual (SM/SL) saliva glycoproteome. A total of 262 N- and O-linked glycoproteins were identified by multidimensional protein identification technology (MudPIT). Only 38 were previously described in SM and SL salivas from the human salivary N-linked glycoproteome, while 224 were unique. Further comparison analysis with SM/SL saliva of the human saliva proteome, revealed 125 glycoproteins not formerly reported in this secretion. KEGG pathway analyses demonstrated that many of these glycoproteins are involved in processes such as complement and coagulation cascades, cell communication, glycosphingolipid biosynthesis neo-lactoseries, O-glycan biosynthesis, glycan structures-biosynthesis 2, starch and sucrose metabolism, peptidoglycan biosynthesis or others pathways. In summary, lectin-affinity chromatography coupled to MudPIT mass spectrometry identified many novel glycoproteins in SM/SL saliva. These new additions to the salivary proteome may prove to be a critical step for providing reliable biomarkers in the diagnosis of a myriad of oral and systemic diseases. PMID:21936497
QCloud: A cloud-based quality control system for mass spectrometry-based proteomics laboratories
Chiva, Cristina; Olivella, Roger; Borràs, Eva; Espadas, Guadalupe; Pastor, Olga; Solé, Amanda
2018-01-01
The increasing number of biomedical and translational applications in mass spectrometry-based proteomics poses new analytical challenges and raises the need for automated quality control systems. Despite previous efforts to set standard file formats, data processing workflows and key evaluation parameters for quality control, automated quality control systems are not yet widespread among proteomics laboratories, which limits the acquisition of high-quality results, inter-laboratory comparisons and the assessment of variability of instrumental platforms. Here we present QCloud, a cloud-based system to support proteomics laboratories in daily quality assessment using a user-friendly interface, easy setup, automated data processing and archiving, and unbiased instrument evaluation. QCloud supports the most common targeted and untargeted proteomics workflows, it accepts data formats from different vendors and it enables the annotation of acquired data and reporting incidences. A complete version of the QCloud system has successfully been developed and it is now open to the proteomics community (http://qcloud.crg.eu). QCloud system is an open source project, publicly available under a Creative Commons License Attribution-ShareAlike 4.0. PMID:29324744
Li, Xiao-jun; Yi, Eugene C; Kemp, Christopher J; Zhang, Hui; Aebersold, Ruedi
2005-09-01
There is an increasing interest in the quantitative proteomic measurement of the protein contents of substantially similar biological samples, e.g. for the analysis of cellular response to perturbations over time or for the discovery of protein biomarkers from clinical samples. Technical limitations of current proteomic platforms such as limited reproducibility and low throughput make this a challenging task. A new LC-MS-based platform is able to generate complex peptide patterns from the analysis of proteolyzed protein samples at high throughput and represents a promising approach for quantitative proteomics. A crucial component of the LC-MS approach is the accurate evaluation of the abundance of detected peptides over many samples and the identification of peptide features that can stratify samples with respect to their genetic, physiological, or environmental origins. We present here a new software suite, SpecArray, that generates a peptide versus sample array from a set of LC-MS data. A peptide array stores the relative abundance of thousands of peptide features in many samples and is in a format identical to that of a gene expression microarray. A peptide array can be subjected to an unsupervised clustering analysis to stratify samples or to a discriminant analysis to identify discriminatory peptide features. We applied the SpecArray to analyze two sets of LC-MS data: one was from four repeat LC-MS analyses of the same glycopeptide sample, and another was from LC-MS analysis of serum samples of five male and five female mice. We demonstrate through these two study cases that the SpecArray software suite can serve as an effective software platform in the LC-MS approach for quantitative proteomics.
The accurate quantitation of proteins or peptides using Mass Spectrometry (MS) is gaining prominence in the biomedical research community as an alternative method for analyte measurement. The Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigators have been at the forefront in the promotion of reproducible MS techniques, through the development and application of standardized proteomic methods for protein quantitation on biologically relevant samples.
accumulation," J. Proteomics (2013) "Comparative Proteomics Lends Insight into Genotype-Specific Pathogenicity," J. Proteomics (2013) "De Novo Transcriptomic Analysis of Hydrogen Production in the amino acid changes in the small envelope protein and rescued by a novel glycosolation site," J
Nyman, Tuula A; Lorey, Martina B; Cypryk, Wojciech; Matikainen, Sampsa
2017-05-01
The immune system is our defense system against microbial infections and tissue injury, and understanding how it works in detail is essential for developing drugs for different diseases. Mass spectrometry-based proteomics can provide in-depth information on the molecular mechanisms involved in immune responses. Areas covered: Summarized are the key immunology findings obtained with MS-based proteomics in the past five years, with a focus on inflammasome activation, global protein secretion, mucosal immunology, immunopeptidome and T cells. Special focus is on extracellular vesicle-mediated protein secretion and its role in immune responses. Expert commentary: Proteomics is an essential part of modern omics-scale immunology research. To date, MS-based proteomics has been used in immunology to study protein expression levels, their subcellular localization, secretion, post-translational modifications, and interactions in immune cells upon activation by different stimuli. These studies have made major contributions to understanding the molecular mechanisms involved in innate and adaptive immune responses. New developments in proteomics offer constantly novel possibilities for exploring the immune system. Examples of these techniques include mass cytometry and different MS-based imaging approaches which can be widely used in immunology.
Characterization of the Low-Molecular-Weight Human Plasma Peptidome.
Greening, David W; Simpson, Richard J
2017-01-01
The human plasma proteome represents an important secreted sub-proteome. Proteomic analysis of blood plasma with mass spectrometry is a challenging task. The high complexity and wide dynamic range of proteins as well as the presence of several proteins at very high concentrations complicate the profiling of the human plasma proteome. The peptidome (or low-molecular-weight fraction, LMF) of the human plasma proteome is an invaluable source of biological information, especially in the context of identifying plasma-based markers of disease. Peptides are generated by active synthesis and proteolytic processing, often yielding proteolytic fragments that mediate a variety of physiological and pathological functions. As such, degradomic studies, investigating cleavage products via peptidomics and top-down proteomics in particular, have warranted significant research interest. However, due to their molecular weight, abundance, and solubility, issues with identifying specific cleavage sites and coverage of peptide fragments remain challenging. Peptidomics is currently focused toward comprehensively studying peptides cleaved from precursor proteins by endogenous proteases. This protocol outlines a standardized rapid and reproducible procedure for peptidomic profiling of human plasma using centrifugal ultrafiltration and mass spectrometry. Ultrafiltration is a convective process that uses anisotropic semipermeable membranes to separate macromolecular species on the basis of size. We have optimized centrifugal ultrafiltration (cellulose triacetate membrane) for plasma fractionation with respect to buffer and solvent composition, centrifugal force, duration, and temperature to facilitate recovery >95% and enrichment of the human plasma peptidome. This method serves as a comprehensive and facile process to enrich and identify a key, underrepresented sub-proteome of human blood plasma.
Li, Jiajia; Ding, Xianlong; Han, Shaohuai; He, Tingting; Zhang, Hao; Yang, Longshu; Yang, Shouping; Gai, Junyi
2016-04-14
To further elucidate the molecular mechanism of cytoplasmic male sterility (CMS) in soybean, a differential proteomic analysis was completed between the CMS line NJCMS1A and its maintainer NJCMS1B using iTRAQ-based strategy. As a result, 180 differential abundance proteins (DAPs) were identified, of which, 60 were down-regulated and 120 were up-regulated in NJCMS1A compared with NJCMS1B. Bioinformatic analysis showed that 167 DAPs were annotated in 41 Gene Ontology functional groups, 106 DAPs were classified into 20 clusters of orthologous groups of protein categories, and 128 DAPs were enrichment in 53 KEGG pathways. Fifteen differential level proteins/genes with the same expression pattern were identified in the further conjoint analysis of DAPs and the previously reported differential expression genes. Moreover, multiple reaction monitoring test, qRT-PCR analysis and enzyme activity assay validated that the iTRAQ results were reliable. Based on functional analysis of DAPs, we concluded that male sterility in NJCMS1A might be related to insufficiencies in energy supply, unbalance of protein synthesis and degradation, disruption of flavonoid synthesis, programmed cell death, abnormalities of substance metabolism, etc. These results might facilitate our understanding of the molecular mechanisms behind CMS in soybean. Soybean is an important global crop that provides protein and oil. Heterosis is a significantly potential approach to increase the yield of soybean. Cytoplasmic male sterility (CMS) plays a vital role in the production of hybrid seeds. However, the genetic and molecular mechanisms of male sterility in soybean still need to be further elucidated. In the present paper, a differential proteomic analysis was carried out and the results showed that several key proteins involved in key pathways were associated with male sterility in soybean. This work provides a new insight to understand the genetic and molecular mechanisms underlying CMS in soybean. Copyright © 2016 Elsevier B.V. All rights reserved.
dbHiMo: a web-based epigenomics platform for histone-modifying enzymes
Choi, Jaeyoung; Kim, Ki-Tae; Huh, Aram; Kwon, Seomun; Hong, Changyoung; Asiegbu, Fred O.; Jeon, Junhyun; Lee, Yong-Hwan
2015-01-01
Over the past two decades, epigenetics has evolved into a key concept for understanding regulation of gene expression. Among many epigenetic mechanisms, covalent modifications such as acetylation and methylation of lysine residues on core histones emerged as a major mechanism in epigenetic regulation. Here, we present the database for histone-modifying enzymes (dbHiMo; http://hme.riceblast.snu.ac.kr/) aimed at facilitating functional and comparative analysis of histone-modifying enzymes (HMEs). HMEs were identified by applying a search pipeline built upon profile hidden Markov model (HMM) to proteomes. The database incorporates 11 576 HMEs identified from 603 proteomes including 483 fungal, 32 plants and 51 metazoan species. The dbHiMo provides users with web-based personalized data browsing and analysis tools, supporting comparative and evolutionary genomics. With comprehensive data entries and associated web-based tools, our database will be a valuable resource for future epigenetics/epigenomics studies. Database URL: http://hme.riceblast.snu.ac.kr/ PMID:26055100
dbHiMo: a web-based epigenomics platform for histone-modifying enzymes.
Choi, Jaeyoung; Kim, Ki-Tae; Huh, Aram; Kwon, Seomun; Hong, Changyoung; Asiegbu, Fred O; Jeon, Junhyun; Lee, Yong-Hwan
2015-01-01
Over the past two decades, epigenetics has evolved into a key concept for understanding regulation of gene expression. Among many epigenetic mechanisms, covalent modifications such as acetylation and methylation of lysine residues on core histones emerged as a major mechanism in epigenetic regulation. Here, we present the database for histone-modifying enzymes (dbHiMo; http://hme.riceblast.snu.ac.kr/) aimed at facilitating functional and comparative analysis of histone-modifying enzymes (HMEs). HMEs were identified by applying a search pipeline built upon profile hidden Markov model (HMM) to proteomes. The database incorporates 11,576 HMEs identified from 603 proteomes including 483 fungal, 32 plants and 51 metazoan species. The dbHiMo provides users with web-based personalized data browsing and analysis tools, supporting comparative and evolutionary genomics. With comprehensive data entries and associated web-based tools, our database will be a valuable resource for future epigenetics/epigenomics studies. © The Author(s) 2015. Published by Oxford University Press.
Mohammed, Yassene; Domański, Dominik; Jackson, Angela M; Smith, Derek S; Deelder, André M; Palmblad, Magnus; Borchers, Christoph H
2014-06-25
One challenge in Multiple Reaction Monitoring (MRM)-based proteomics is to select the most appropriate surrogate peptides to represent a target protein. We present here a software package to automatically generate these most appropriate surrogate peptides for an LC/MRM-MS analysis. Our method integrates information about the proteins, their tryptic peptides, and the suitability of these peptides for MRM which is available online in UniProtKB, NCBI's dbSNP, ExPASy, PeptideAtlas, PRIDE, and GPMDB. The scoring algorithm reflects our knowledge in choosing the best candidate peptides for MRM, based on the uniqueness of the peptide in the targeted proteome, its physiochemical properties, and whether it previously has been observed. The modularity of the workflow allows further extension and additional selection criteria to be incorporated. We have developed a simple Web interface where the researcher provides the protein accession number, the subject organism, and peptide-specific options. Currently, the software is designed for human and mouse proteomes, but additional species can be easily be added. Our software improved the peptide selection by eliminating human error, considering multiple data sources and all of the isoforms of the protein, and resulted in faster peptide selection - approximately 50 proteins per hour compared to 8 per day. Compiling a list of optimal surrogate peptides for target proteins to be analyzed by LC/MRM-MS has been a cumbersome process, in which expert researchers retrieved information from different online repositories and used their own reasoning to find the most appropriate peptides. Our scientific workflow automates this process by integrating information from different data sources including UniProt, Global Proteome Machine, NCBI's dbSNP, and PeptideAtlas, simulating the researchers' reasoning, and incorporating their knowledge of how to select the best proteotypic peptides for an MRM analysis. The developed software can help to standardize the selection of peptides, eliminate human error, and increase productivity. Copyright © 2014 Elsevier B.V. All rights reserved.
Nanjo, Yohei; Jang, Hee-Young; Kim, Hong-Sig; Hiraga, Susumu; Woo, Sun-Hee; Komatsu, Setsuko
2014-10-01
Flooding of fields due to heavy and/or continuous rainfall influences soybean production. To identify soybean varieties with flooding tolerance at the seedling emergence stage, 128 soybean varieties were evaluated using a flooding tolerance index, which is based on plant survival rates, the lack of apparent damage and lateral root development, and post-flooding radicle elongation rate. The soybean varieties were ranked according to their flooding tolerance index, and it was found that the tolerance levels of soybean varieties exhibit a continuum of differences between varieties. Subsequently, tolerant, moderately tolerant and sensitive varieties were selected and subjected to comparative proteomic analysis to clarify the tolerance mechanism. Proteomic analysis of the radicles, combined with correlation analysis, showed that the ratios of RNA binding/processing related proteins and flooding stress indicator proteins were significantly correlated with flooding tolerance index. The RNA binding/processing related proteins were positively correlated in untreated soybeans, whereas flooding stress indicator proteins were negatively correlated in flooded soybeans. These results suggest that flooding tolerance is regulated by mechanisms through multiple factors and is associated with abundance levels of the identified proteins. Copyright © 2014 Elsevier Ltd. All rights reserved.
Basco, Davide; Nicchia, Grazia Paola; Desaphy, Jean-François; Camerino, Diana Conte; Frigeri, Antonio; Svelto, Maria
2010-12-01
Muscle atrophy occurring in several pathophysiological conditions determines decreases in muscle protein synthesis, increases in the rate of proteolysis and changes in muscle fiber composition. To determine the effect of muscle atrophy induced by hindlimb unloading (HU) on membrane proteins from rat soleus, a proteomic approach based on two-dimensional Blue Native/SDS-PAGE was performed. Proteomic analysis of normal and HU soleus muscle demonstrates statistically significant changes in the relative level of 36 proteins. Among the proteins identified by mass spectrometry, most are involved in pathways associated with muscle fuel utilization, indicating a shift in metabolism from oxidative to glycolytic. Moreover, immunoblotting analysis revealed an increase in aquaporin-4 (AQP4) water channel and an alteration of proteins belonging to the dystrophin-glycoprotein complex (DGC). AQP4 and DGC are regulated in soleus muscle subjected to simulated microgravity in response to compensatory mechanisms induced by muscle atrophy, and they parallel the slow-to-fast twitch conversion that occurs in soleus fibers during HU. In conclusion, the alterations of soleus muscle membrane proteome may play a pivotal role in the mechanisms involved in disuse-induced muscle atrophy.
Bengtsson, Oskar; Arntzen, Magnus Ø; Mathiesen, Geir; Skaugen, Morten; Eijsink, Vincent G H
2016-01-10
Analysis of the secretomes of filamentous fungi growing on insoluble lignocellulosic substrates is of major current interest because of the industrial potential of secreted fungal enzymes. Importantly, such studies can help identifying key enzymes from a large arsenal of bioinformatically detected candidates in fungal genomes. We describe a simple, plate-based method to analyze the secretome of Hypocrea jecorina growing on insoluble substrates that allows harsh sample preparation methods promoting desorption, and subsequent identification, of substrate-bound proteins, while minimizing contamination with non-secreted proteins from leaking or lysed cells. The validity of the method was demonstrated by comparative secretome analysis of wild-type H.jecorina strain QM6a growing on bagasse, birch wood, spruce wood or pure cellulose, using label-fee quantification. The proteomic data thus obtained were consistent with existing data from transcriptomics and proteomics studies and revealed clear differences in the responses to complex lignocellulosic substrates and the response to pure cellulose. This easy method is likely to be generally applicable to filamentous fungi and to other microorganisms growing on insoluble substrates. Copyright © 2015 Elsevier B.V. All rights reserved.
Romeo, Stefania; Trupiano, Dalila; Ariani, Andrea; Renzone, Giovanni; Scippa, Gabriella S; Scaloni, Andrea; Sebastiani, Luca
2014-07-15
Contamination of soil and water by heavy metals has become a widespread problem; environmental pollution by high zinc (Zn) concentration occurs frequently. Although poplar (Populus spp.) has been identified as suitable for phytoremediation approaches, its response to high Zn concentrations are still not clearly understood. For this reason, we investigated the effects of Zn in Populus×euramericana clone I-214 roots by proteomic analysis. Comparative experiments were conducted on rooted woody cuttings grown in nutrient solutions containing 1mM (treatment) or 1μM (control) Zn concentrations. A gel-based proteomic approach coupled with morphological and chemical analysis was used to identify differentially represented proteins in treated roots and to investigate the effect of Zn treatment on the poplar root system. Data shows that Zn was accumulated preferentially in roots, that the antioxidant system, the carbohydrate/energy and amino acid metabolisms were the main pathways modulated by Zn excess, and that mitochondria and vacuoles were the cellular organelles predominately affected by Zn stress. A coordination between cell death and proliferation/growth seems to occur under this condition to counteract the Zn-induced damage. Copyright © 2014 Elsevier GmbH. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clair, Geremy; Piehowski, Paul D.; Nicola, Teodora
Global proteomics approaches allow characterization of whole tissue lysates to an impressive depth. However, it is now increasingly recognized that to better understand the complexity of multicellular organisms, global protein profiling of specific spatially defined regions/substructures of tissues (i.e. spatially-resolved proteomics) is essential. Laser capture microdissection (LCM) enables microscopic isolation of defined regions of tissues preserving crucial spatial information. However, current proteomics workflows entail several manual sample preparation steps and are challenged by the microscopic mass-limited samples generated by LCM, and that impact measurement robustness, quantification, and throughput. Here, we coupled LCM with a fully automated sample preparation workflow thatmore » with a single manual step allows: protein extraction, tryptic digestion, peptide cleanup and LC-MS/MS analysis of proteomes from microdissected tissues. Benchmarking against the current state of the art in ultrasensitive global proteomic analysis, our approach demonstrated significant improvements in quantification and throughput. Using our LCM-SNaPP proteomics approach, we characterized to a depth of more than 3,400 proteins, the ontogeny of protein changes during normal lung development in laser capture microdissected alveolar tissue containing ~4,000 cells per sample. Importantly, the data revealed quantitative changes for 350 low abundance transcription factors and signaling molecules, confirming earlier transcript-level observations and defining seven modules of coordinated transcription factor/signaling molecule expression patterns, suggesting that a complex network of temporal regulatory control directs normal lung development with epigenetic regulation fine-tuning pre-natal developmental processes. Our LCM-proteomics approach facilitates efficient, spatially-resolved, ultrasensitive global proteomics analyses in high-throughput that will be enabling for several clinical and biological applications.« less
Advances of Proteomic Sciences in Dentistry
Khurshid, Zohaib; Zohaib, Sana; Najeeb, Shariq; Zafar, Muhammad Sohail; Rehman, Rabia; Rehman, Ihtesham Ur
2016-01-01
Applications of proteomics tools revolutionized various biomedical disciplines such as genetics, molecular biology, medicine, and dentistry. The aim of this review is to highlight the major milestones in proteomics in dentistry during the last fifteen years. Human oral cavity contains hard and soft tissues and various biofluids including saliva and crevicular fluid. Proteomics has brought revolution in dentistry by helping in the early diagnosis of various diseases identified by the detection of numerous biomarkers present in the oral fluids. This paper covers the role of proteomics tools for the analysis of oral tissues. In addition, dental materials proteomics and their future directions are discussed. PMID:27187379
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Xia; Liu, Siwen; Chongqing Key Laboratory of Neurobiology, Chongqing Medical University, Chongqing 400016
2015-11-15
Background: Borna disease virus (BDV) is a neurotropic RNA virus persistently infecting mammalian hosts including humans. Lysine acetylation (Kac) is a key protein post-translational modification (PTM). The unexpectedly broad regulatory scope of Kac let us to profile the entire acetylome upon BDV infection. Methods: The acetylome was profiled through stable isotope labeling for cell culture (SILAC)-based quantitative proteomics. The quantifiable proteome was annotated using bioinformatics. Results: We identified and quantified 791 Kac sites in 473 Kac proteins in human BDV Hu-H1-infected and non-infected oligodendroglial (OL) cells. Bioinformatic analysis revealed that BDV infection alters the acetylation of metabolic proteins, membrane-associated proteinsmore » and transmembrane transporter activity, and affects the acetylation of several lysine acetyltransferases (KAT). Conclusions: Upon BDV persistence the OL acetylome is manipulated towards higher energy and transporter levels necessary for shuttling BDV proteins to and from nuclear replication sites. - Highlights: • We used SILAC-based proteomics to analyze the acetylome of BDV infected OL cells. • We quantified 791Kac sites in 473 proteins. • Bioinformatic analysis revealed altered acetylation of metabolic proteins et al. • BDV manipulates the OL acetylome towards higher energy and transporter levels. • BDV infection is associated with enriched phosphate-associated metabolic processes.« less
Jafari, M; Mirzaie, M; Khodabandeh, M; Rezadoost, H; Ghassempour, A; Aboul-Enein, H Y
2016-07-01
During recent decades, hydrophilic interaction liquid chromatography (HILIC) ahs been introduced to fractionate or purify especially polar solutes such as peptides and proteins while reversed-phase liquid chromatography (RPLC) is also a common strategy. RPLC is also a common dimension in multidimensional chromatography. In this study, the potential of HILIC vs RPLC chromatography was compared for proteome mapping of human peripheral blood mononuclear cell extract. In HILIC a silica-based stationary phase and for RPLC a C18 column were applied. Then separated proteins were eluted to an ion trap mass spectrometry system. Our results showed that the HILIC leads to more proteins being identified in comparison to RPLC. Among the total 181 identified proteins, 56 and 38 proteins were fractionated specifically by HILIC and RPLC, respectively. In order to demonstrate this, the physicochemical properties of identified proteins such as polarity and hydrophobicity were considered. This analysis indicated that polarity may play a major role in the HILIC separation of proteins vs RPLC. Using gene ontology enrichment analysis, it was also observed that differences in physicochemical properties conform to the cellular compartment and biological features. Finally, this study highlighted the potential of HILIC and the great orthogonality of RPLC in gel-free proteomic studies. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.