Cloud parallel processing of tandem mass spectrometry based proteomics data.
Mohammed, Yassene; Mostovenko, Ekaterina; Henneman, Alex A; Marissen, Rob J; Deelder, André M; Palmblad, Magnus
2012-10-05
Data analysis in mass spectrometry based proteomics struggles to keep pace with the advances in instrumentation and the increasing rate of data acquisition. Analyzing this data involves multiple steps requiring diverse software, using different algorithms and data formats. Speed and performance of the mass spectral search engines are continuously improving, although not necessarily as needed to face the challenges of acquired big data. Improving and parallelizing the search algorithms is one possibility; data decomposition presents another, simpler strategy for introducing parallelism. We describe a general method for parallelizing identification of tandem mass spectra using data decomposition that keeps the search engine intact and wraps the parallelization around it. We introduce two algorithms for decomposing mzXML files and recomposing resulting pepXML files. This makes the approach applicable to different search engines, including those relying on sequence databases and those searching spectral libraries. We use cloud computing to deliver the computational power and scientific workflow engines to interface and automate the different processing steps. We show how to leverage these technologies to achieve faster data analysis in proteomics and present three scientific workflows for parallel database as well as spectral library search using our data decomposition programs, X!Tandem and SpectraST.
Röst, Hannes L; Liu, Yansheng; D'Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi
2016-09-01
Next-generation mass spectrometric (MS) techniques such as SWATH-MS have substantially increased the throughput and reproducibility of proteomic analysis, but ensuring consistent quantification of thousands of peptide analytes across multiple liquid chromatography-tandem MS (LC-MS/MS) runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we developed TRIC (http://proteomics.ethz.ch/tric/), a software tool that utilizes fragment-ion data to perform cross-run alignment, consistent peak-picking and quantification for high-throughput targeted proteomics. TRIC reduced the identification error compared to a state-of-the-art SWATH-MS analysis without alignment by more than threefold at constant recall while correcting for highly nonlinear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups. Thus, TRIC fills a gap in the pipeline for automated analysis of massively parallel targeted proteomics data sets.
Hufnagel, Peter; Rabus, Ralf
2006-01-01
The rapidly developing proteomics technologies help to advance the global understanding of physiological and cellular processes. The lifestyle of a study organism determines the type and complexity of a given proteomic project. The complexity of this study is characterized by a broad collection of pathway-specific subproteomes, reflecting the metabolic versatility as well as the regulatory potential of the aromatic-degrading, denitrifying bacterium 'Aromatoleum' sp. strain EbN1. Differences in protein profiles were determined using a gel-based approach. Protein identification was based on a progressive application of MALDI-TOF-MS, MALDI-TOF-MS/MS and LC-ESI-MS/MS. This progression was result-driven and automated by software control. The identification rate was increased by the assembly of a project-specific list of background signals that was used for internal calibration of the MS spectra, and by the combination of two search engines using a dedicated MetaScoring algorithm. In total, intelligent bioinformatics could increase the identification yield from 53 to 70% of the analyzed 5,050 gel spots; a total of 556 different proteins were identified. MS identification was highly reproducible: most proteins were identified more than twice from parallel 2DE gels with an average sequence coverage of >50% and rather restrictive score thresholds (Mascot >or=95, ProFound >or=2.2, MetaScore >or=97). The MS technologies and bioinformatics tools that were implemented and integrated to handle this complex proteomic project are presented. In addition, we describe the basic principles and current developments of the applied technologies and provide an overview over the current state of microbial proteome research. Copyright (c) 2006 S. Karger AG, Basel.
Celorio-Mancera, Maria de la Paz; Courtiade, Juliette; Muck, Alexander; Heckel, David G.; Musser, Richard O.; Vogel, Heiko
2011-01-01
Although the importance of insect saliva in insect-host plant interactions has been acknowledged, there is very limited information on the nature and complexity of the salivary proteome in lepidopteran herbivores. We inspected the labial salivary transcriptome and proteome of Helicoverpa armigera, an important polyphagous pest species. To identify the majority of the salivary proteins we have randomly sequenced 19,389 expressed sequence tags (ESTs) from a normalized cDNA library of salivary glands. In parallel, a non-cytosolic enriched protein fraction was obtained from labial salivary glands and subjected to two-dimensional gel electrophoresis (2-DE) and de novo peptide sequencing. This procedure allowed comparison of peptides and EST sequences and enabled us to identify 65 protein spots from the secreted labial saliva 2DE proteome. The mass spectrometry analysis revealed ecdysone, glucose oxidase, fructosidase, carboxyl/cholinesterase and an uncharacterized protein previously detected in H. armigera midgut proteome. Consistently, their corresponding transcripts are among the most abundant in our cDNA library. We did find redundancy of sequence identification of saliva-secreted proteins suggesting multiple isoforms. As expected, we found several enzymes responsible for digestion and plant offense. In addition, we identified non-digestive proteins such as an arginine kinase and abundant proteins of unknown function. This identification of secreted salivary gland proteins allows a more comprehensive understanding of insect feeding and poses new challenges for the elucidation of protein function. PMID:22046331
Technical advances in proteomics: new developments in data-independent acquisition.
Hu, Alex; Noble, William S; Wolf-Yadlin, Alejandro
2016-01-01
The ultimate aim of proteomics is to fully identify and quantify the entire complement of proteins and post-translational modifications in biological samples of interest. For the last 15 years, liquid chromatography-tandem mass spectrometry (LC-MS/MS) in data-dependent acquisition (DDA) mode has been the standard for proteomics when sampling breadth and discovery were the main objectives; multiple reaction monitoring (MRM) LC-MS/MS has been the standard for targeted proteomics when precise quantification, reproducibility, and validation were the main objectives. Recently, improvements in mass spectrometer design and bioinformatics algorithms have resulted in the rediscovery and development of another sampling method: data-independent acquisition (DIA). DIA comprehensively and repeatedly samples every peptide in a protein digest, producing a complex set of mass spectra that is difficult to interpret without external spectral libraries. Currently, DIA approaches the identification breadth of DDA while achieving the reproducible quantification characteristic of MRM or its newest version, parallel reaction monitoring (PRM). In comparative de novo identification and quantification studies in human cell lysates, DIA identified up to 89% of the proteins detected in a comparable DDA experiment while providing reproducible quantification of over 85% of them. DIA analysis aided by spectral libraries derived from prior DIA experiments or auxiliary DDA data produces identification and quantification as reproducible and precise as that achieved by MRM/PRM, except on low‑abundance peptides that are obscured by stronger signals. DIA is still a work in progress toward the goal of sensitive, reproducible, and precise quantification without external spectral libraries. New software tools applied to DIA analysis have to deal with deconvolution of complex spectra as well as proper filtering of false positives and false negatives. However, the future outlook is positive, and various researchers are working on novel bioinformatics techniques to address these issues and increase the reproducibility, fidelity, and identification breadth of DIA.
Dey, Debarati; Saha, Bodhisattwa; Sircar, Gaurab; Ghosal, Kavita; Bhattacharya, Swati Gupta
2016-07-01
The worldwide prevalence of fungal allergy in recent years has augmented mining allergens from yet unexplored ones. Curvularia pallescens (CP) being a dominant aerospore in India and a major sensitiser on a wide range of allergic population, pose a serious threat to human health. Therefore, we aimed to identify novel allergens from CP in our present study. A cohort of 22 CP-sensitised patients was selected by positive Skin prick grade. Individual sera exhibited elevated specific IgE level and significant histamine release on a challenge with antigenic extract of CP. First gel-based profiling of CP proteome was done by 1- and 2-dimensional gel. Parallel 1- and 2-dimensional immunoblot were performed applying individual as well as pooled patient sera. Identification of the sero-reactive spots from the 2-dimensional gel was found to be challenging as CP was not previously sequenced. Hence, mass spectrometry-based proteomic workflow consisting of conventional database search was not alone sufficient. Therefore, de novo sequencing preceded homology search was implemented for further identification. Altogether 11 allergenic proteins including Brn-1, vacuolar protease, and fructose-bis-phosphate aldolase were identified with high statistical confidence (p<0.05). This is the first study to report on any allergens from CP. This kind of proteome-based analysis provided a catalogue of CP allergens that would lead an improved way of diagnosis and therapy of CP-related allergy. Copyright © 2016 Elsevier B.V. All rights reserved.
Analysis of the Proteome of Hair-Cell Stereocilia by Mass Spectrometry
Krey, Jocelyn F.; Wilmarth, Philip A.; David, Larry L.; Barr-Gillespie, Peter G.
2017-01-01
Characterization of proteins that mediate mechanotransduction by hair cells, the sensory cells of the inner ear, is hampered by the scarcity of these cells and their sensory organelle, the hair bundle. Mass spectrometry, with its high sensitivity and identification precision, is the ideal method for determining which proteins are present in bundles and what proteins they interact with. We describe here the isolation of mouse hair bundles, as well as preparation of bundle-protein samples for mass spectrometry. We also describe protocols for data-dependent (shotgun) and parallel-reaction-monitoring (targeted) mass spectrometry that allow us to identify and quantify proteins of the hair bundle. These sensitive methods are particularly useful for comparing proteomes of wild-type and mice with deafness mutations affecting hair-bundle proteins. (120 words; maximum 250) PMID:28109437
Rohban, Rokhsareh; Reinisch, Andreas; Etchart, Nathalie; Schallmoser, Katharina; Hofmann, Nicole A.; Szoke, Krisztina; Brinchmann, Jan E.; Rad, Ehsan Bonyadi; Rohde, Eva; Strunk, Dirk
2013-01-01
Therapeutic neo-vasculogenesis in vivo can be achieved by the co-transplantation of human endothelial colony-forming progenitor cells (ECFCs) with mesenchymal stem/progenitor cells (MSPCs). The underlying mechanism is not completely understood thus hampering the development of novel stem cell therapies. We hypothesized that proteomic profiling could be used to retrieve the in vivo signaling signature during the initial phase of human neo-vasculogenesis. ECFCs and MSPCs were therefore either transplanted alone or co-transplanted subcutaneously into immune deficient mice. Early cell signaling, occurring within the first 24 hours in vivo, was analyzed using antibody microarray proteomic profiling. Vessel formation and persistence were verified in parallel transplants for up to 24 weeks. Proteomic analysis revealed significant alteration of regulatory components including caspases, calcium/calmodulin-dependent protein kinase, DNA protein kinase, human ErbB2 receptor-tyrosine kinase as well as mitogen-activated protein kinases. Caspase-4 was selected from array results as one therapeutic candidate for targeting vascular network formation in vitro as well as modulating therapeutic vasculogenesis in vivo. As a proof-of-principle, caspase-4 and general caspase-blocking led to diminished endothelial network formation in vitro and significantly decreased vasculogenesis in vivo. Proteomic profiling ex vivo thus unraveled a signaling signature which can be used for target selection to modulate neo-vasculogenesis in vivo. PMID:23826172
Welker, F
2018-02-20
The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from "cross-species proteomic effects": the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts.
Liberato, Tarcísio; Troncone, Lanfranco Ranieri Paolo; Yamashiro, Edson T; Serrano, Solange M T; Zelanis, André
2016-03-01
Here we present a proteomic characterization of Phoneutria nigriventer venom. A shotgun proteomic approach allowed the identification, for the first time, of O-glycosyl hydrolases (chitinases) in P. nigriventer venom. The electrophoretic profiles under nonreducing and reducing conditions, and protein identification by mass spectrometry, indicated the presence of oligomeric toxin structures in the venom. Complementary proteomic approaches allowed for a qualitative and semi-quantitative profiling of P. nigriventer venom complexity, expanding its known venom proteome diversity.
Bigaud, Emilie; Corrales, Fernando J.
2016-01-01
Methylthioadenosine phosphorylase (MTAP), a key enzyme in the adenine and methionine salvage pathways, catalyzes the hydrolysis of methylthioadenosine (MTA), a compound suggested to affect pivotal cellular processes in part through the regulation of protein methylation. MTAP is expressed in a wide range of cell types and tissues, and its deletion is common to cancer cells and in liver injury. The aim of this study was to investigate the proteome and methyl proteome alterations triggered by MTAP deficiency in liver cells to define novel regulatory mechanisms that may explain the pathogenic processes of liver diseases. iTRAQ analysis resulted in the identification of 216 differential proteins (p < 0.05) that suggest deregulation of cellular pathways as those mediated by ERK or NFκB. R-methyl proteome analysis led to the identification of 74 differentially methylated proteins between SK-Hep1 and SK-Hep1+ cells, including 116 new methylation sites. Restoring normal MTA levels in SK-Hep1+ cells parallels the specific methylation of 56 proteins, including KRT8, TGF, and CTF8A, which provides a novel regulatory mechanism of their activity with potential implications in carcinogenesis. Inhibition of RNA-binding proteins methylation is especially relevant upon accumulation of MTA. As an example, methylation of quaking protein in Arg242 and Arg256 in SK-Hep1+ cells may play a pivotal role in the regulation of its activity as indicated by the up-regulation of its target protein p27kip1. The phenotype associated with a MTAP deficiency was further verified in the liver of MTAP± mice. Our data support that MTAP deficiency leads to MTA accumulation and deregulation of central cellular pathways, increasing proliferation and decreasing the susceptibility to chemotherapeutic drugs, which involves differential protein methylation. Data are available via ProteomeXchange with identifier PXD002957 (http://www.ebi.ac.uk/pride/archive/projects/PXD002957). PMID:26819315
Guidelines for reporting quantitative mass spectrometry based experiments in proteomics.
Martínez-Bartolomé, Salvador; Deutsch, Eric W; Binz, Pierre-Alain; Jones, Andrew R; Eisenacher, Martin; Mayer, Gerhard; Campos, Alex; Canals, Francesc; Bech-Serra, Joan-Josep; Carrascal, Montserrat; Gay, Marina; Paradela, Alberto; Navajas, Rosana; Marcilla, Miguel; Hernáez, María Luisa; Gutiérrez-Blázquez, María Dolores; Velarde, Luis Felipe Clemente; Aloria, Kerman; Beaskoetxea, Jabier; Medina-Aunon, J Alberto; Albar, Juan P
2013-12-16
Mass spectrometry is already a well-established protein identification tool and recent methodological and technological developments have also made possible the extraction of quantitative data of protein abundance in large-scale studies. Several strategies for absolute and relative quantitative proteomics and the statistical assessment of quantifications are possible, each having specific measurements and therefore, different data analysis workflows. The guidelines for Mass Spectrometry Quantification allow the description of a wide range of quantitative approaches, including labeled and label-free techniques and also targeted approaches such as Selected Reaction Monitoring (SRM). The HUPO Proteomics Standards Initiative (HUPO-PSI) has invested considerable efforts to improve the standardization of proteomics data handling, representation and sharing through the development of data standards, reporting guidelines, controlled vocabularies and tooling. In this manuscript, we describe a key output from the HUPO-PSI-namely the MIAPE Quant guidelines, which have developed in parallel with the corresponding data exchange format mzQuantML [1]. The MIAPE Quant guidelines describe the HUPO-PSI proposal concerning the minimum information to be reported when a quantitative data set, derived from mass spectrometry (MS), is submitted to a database or as supplementary information to a journal. The guidelines have been developed with input from a broad spectrum of stakeholders in the proteomics field to represent a true consensus view of the most important data types and metadata, required for a quantitative experiment to be analyzed critically or a data analysis pipeline to be reproduced. It is anticipated that they will influence or be directly adopted as part of journal guidelines for publication and by public proteomics databases and thus may have an impact on proteomics laboratories across the world. This article is part of a Special Issue entitled: Standardization and Quality Control. Copyright © 2013 Elsevier B.V. All rights reserved.
Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library*
Yen, Chia-Yu; Houel, Stephane; Ahn, Natalie G.; Old, William M.
2011-01-01
The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library of confidently assigned spectra. Two problems relate to library size. First, reference spectral libraries are limited to rediscovery of previously identified peptides and are not applicable to new peptides, because of their incomplete coverage of the human proteome. Second, problems arise when searching a spectral library the size of the entire human proteome. We observed that traditional dot product scoring methods do not scale well with spectral library size, showing reduction in sensitivity when library size is increased. We show that this problem can be addressed by optimizing scoring metrics for spectrum-to-spectrum searches with large spectral libraries. MS/MS spectra for the 1.3 million predicted tryptic peptides in the human proteome are simulated using a kinetic fragmentation model (MassAnalyzer version2.1) to create a proteome-wide simulated spectral library. Searches of the simulated library increase MS/MS assignments by 24% compared with Mascot, when using probabilistic and rank based scoring methods. The proteome-wide coverage of the simulated library leads to 11% increase in unique peptide assignments, compared with parallel searches of a reference spectral library. Further improvement is attained when reference spectra and simulated spectra are combined into a hybrid spectral library, yielding 52% increased MS/MS assignments compared with Mascot searches. Our study demonstrates the advantages of using probabilistic and rank based scores to improve performance of spectrum-to-spectrum search strategies. PMID:21532008
NASA Astrophysics Data System (ADS)
Diaz, K. S.; Kim, E. H.; Jones, R. M.; de Leon, K. C.; Woodcroft, B. J.; Tyson, G. W.; Rich, V. I.
2014-12-01
The growing field of metaproteomics links microbial communities to their expressed functions by using mass spectrometry methods to characterize community proteins. Comparison of mass spectrometry protein search algorithms and their biases is crucial for maximizing the quality and amount of protein identifications in mass spectral data. Available algorithms employ different approaches when mapping mass spectra to peptides against a database. We compared mass spectra from four microbial proteomes derived from high-organic content soils searched with two search algorithms: 1) Sequest HT as packaged within Proteome Discoverer (v.1.4) and 2) X!Tandem as packaged in TransProteomicPipeline (v.4.7.1). Searches used matched metagenomes, and results were filtered to allow identification of high probability proteins. There was little overlap in proteins identified by both algorithms, on average just ~24% of the total. However, when adjusted for spectral abundance, the overlap improved to ~70%. Proteome Discoverer generally outperformed X!Tandem, identifying an average of 12.5% more proteins than X!Tandem, with X!Tandem identifying more proteins only in the first two proteomes. For spectrally-adjusted results, the algorithms were similar, with X!Tandem marginally outperforming Proteome Discoverer by an average of ~4%. We then assessed differences in heat shock proteins (HSP) identification by the two algorithms by BLASTing identified proteins against the Heat Shock Protein Information Resource, because HSP hits typically account for the majority signal in proteomes, due to extraction protocols. Total HSP identifications for each of the 4 proteomes were approximately ~15%, ~11%, ~17%, and ~19%, with ~14% for total HSPs with redundancies removed. Of the ~15% average of proteins from the 4 proteomes identified as HSPs, ~10% of proteins and spectra were identified by both algorithms. On average, Proteome Discoverer identified ~9% more HSPs than X!Tandem.
Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides*
Bogdanow, Boris; Zauber, Henrik; Selbach, Matthias
2016-01-01
The principle of shotgun proteomics is to use peptide mass spectra in order to identify corresponding sequences in a protein database. The quality of peptide and protein identification and quantification critically depends on the sensitivity and specificity of this assignment process. Many peptides in proteomic samples carry biochemical modifications, and a large fraction of unassigned spectra arise from modified peptides. Spectra derived from modified peptides can erroneously be assigned to wrong amino acid sequences. However, the impact of this problem on proteomic data has not yet been investigated systematically. Here we use combinations of different database searches to show that modified peptides can be responsible for 20–50% of false positive identifications in deep proteomic data sets. These false positive hits are particularly problematic as they have significantly higher scores and higher intensities than other false positive matches. Furthermore, these wrong peptide assignments lead to hundreds of false protein identifications and systematic biases in protein quantification. We devise a “cleaned search” strategy to address this problem and show that this considerably improves the sensitivity and specificity of proteomic data. In summary, we show that modified peptides cause systematic errors in peptide and protein identification and quantification and should therefore be considered to further improve the quality of proteomic data annotation. PMID:27215553
Fraisier, Christophe; Koraka, Penelope; Belghazi, Maya; Bakli, Mahfoud; Granjeaud, Samuel; Pophillat, Matthieu; Lim, Stephanie M.; Osterhaus, Albert; Martina, Byron; Camoin, Luc; Almeras, Lionel
2014-01-01
Recent outbreaks of Chikungunya virus (CHIKV) infection have been characterized by an increasing number of severe cases with atypical manifestations including neurological complications. In parallel, the risk map of CHIKV outbreaks has expanded because of improved vector competence. These features make CHIKV infection a major public health concern that requires a better understanding of the underlying physiopathological processes for the development of antiviral strategies to protect individuals from severe disease. To decipher the mechanisms of CHIKV infection in the nervous system, a kinetic analysis on the host proteome modifications in the brain of CHIKV-infected mice sampled before and after the onset of clinical symptoms was performed. The combination of 2D-DIGE and iTRAQ proteomic approaches, followed by mass spectrometry protein identification revealed 177 significantly differentially expressed proteins. This kinetic analysis revealed a dramatic down-regulation of proteins before the appearance of the clinical symptoms followed by the increased expression of most of these proteins in the acute symptomatic phase. Bioinformatic analyses of the protein datasets enabled the identification of the major biological processes that were altered during the time course of CHIKV infection, such as integrin signaling and cytoskeleton dynamics, endosome machinery and receptor recycling related to virus transport and synapse function, regulation of gene expression, and the ubiquitin-proteasome pathway. These results reveal the putative mechanisms associated with severe CHIKV infection-mediated neurological disease and highlight the potential markers or targets that can be used to develop diagnostic and/or antiviral tools. PMID:24618821
Yeast proteome map (last update).
Perrot, Michel; Moes, Suzette; Massoni, Aurélie; Jenoe, Paul; Boucherie, Hélian
2009-10-01
The identification of proteins separated on 2-D gels is essential to exploit the full potential of 2-D gel electrophoresis for proteomic investigations. For this purpose we have undertaken the systematic identification of Saccharomyces cerevisiae proteins separated on 2-D gels. We report here the identification by mass spectrometry of 100 novel yeast protein spots that have so far not been tackled due to their scarcity on our standard 2-D gels. These identifications extend the number of protein spots identified on our yeast 2-D proteome map to 716. They correspond to 485 unique proteins. Among these, 154 were resolved into several isoforms. The present data set can now be expanded to report for the first time a map of 363 protein isoforms that significantly deepens our knowledge of the yeast proteome. The reference map and a list of all identified proteins can be accessed on the Yeast Protein Map server (www.ibgc.u-bordeaux2.fr/YPM).
TRIC: an automated alignment strategy for reproducible protein quantification in targeted proteomics
Röst, Hannes L.; Liu, Yansheng; D’Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C.; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi
2016-01-01
Large scale, quantitative proteomic studies have become essential for the analysis of clinical cohorts, large perturbation experiments and systems biology studies. While next-generation mass spectrometric techniques such as SWATH-MS have substantially increased throughput and reproducibility, ensuring consistent quantification of thousands of peptide analytes across multiple LC-MS/MS runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we have developed the TRIC software which utilizes fragment ion data to perform cross-run alignment, consistent peak-picking and quantification for high throughput targeted proteomics. TRIC uses a graph-based alignment strategy based on non-linear retention time correction to integrate peak elution information from all LC-MS/MS runs acquired in a study. When compared to state-of-the-art SWATH-MS data analysis, the algorithm was able to reduce the identification error by more than 3-fold at constant recall, while correcting for highly non-linear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem (iPS) cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups and substantially increased the quantitative completeness and biological information in the data, providing insights into protein dynamics of iPS cells. Overall, this study demonstrates the importance of consistent quantification in highly challenging experimental setups, and proposes an algorithm to automate this task, constituting the last missing piece in a pipeline for automated analysis of massively parallel targeted proteomics datasets. PMID:27479329
Computer applications making rapid advances in high throughput microbial proteomics (HTMP).
Anandkumar, Balakrishna; Haga, Steve W; Wu, Hui-Fen
2014-02-01
The last few decades have seen the rise of widely-available proteomics tools. From new data acquisition devices, such as MALDI-MS and 2DE to new database searching softwares, these new products have paved the way for high throughput microbial proteomics (HTMP). These tools are enabling researchers to gain new insights into microbial metabolism, and are opening up new areas of study, such as protein-protein interactions (interactomics) discovery. Computer software is a key part of these emerging fields. This current review considers: 1) software tools for identifying the proteome, such as MASCOT or PDQuest, 2) online databases of proteomes, such as SWISS-PROT, Proteome Web, or the Proteomics Facility of the Pathogen Functional Genomics Resource Center, and 3) software tools for applying proteomic data, such as PSI-BLAST or VESPA. These tools allow for research in network biology, protein identification, functional annotation, target identification/validation, protein expression, protein structural analysis, metabolic pathway engineering and drug discovery.
Shteynberg, David; Deutsch, Eric W.; Lam, Henry; Eng, Jimmy K.; Sun, Zhi; Tasman, Natalie; Mendoza, Luis; Moritz, Robert L.; Aebersold, Ruedi; Nesvizhskii, Alexey I.
2011-01-01
The combination of tandem mass spectrometry and sequence database searching is the method of choice for the identification of peptides and the mapping of proteomes. Over the last several years, the volume of data generated in proteomic studies has increased dramatically, which challenges the computational approaches previously developed for these data. Furthermore, a multitude of search engines have been developed that identify different, overlapping subsets of the sample peptides from a particular set of tandem mass spectrometry spectra. We present iProphet, the new addition to the widely used open-source suite of proteomic data analysis tools Trans-Proteomics Pipeline. Applied in tandem with PeptideProphet, it provides more accurate representation of the multilevel nature of shotgun proteomic data. iProphet combines the evidence from multiple identifications of the same peptide sequences across different spectra, experiments, precursor ion charge states, and modified states. It also allows accurate and effective integration of the results from multiple database search engines applied to the same data. The use of iProphet in the Trans-Proteomics Pipeline increases the number of correctly identified peptides at a constant false discovery rate as compared with both PeptideProphet and another state-of-the-art tool Percolator. As the main outcome, iProphet permits the calculation of accurate posterior probabilities and false discovery rate estimates at the level of sequence identical peptide identifications, which in turn leads to more accurate probability estimates at the protein level. Fully integrated with the Trans-Proteomics Pipeline, it supports all commonly used MS instruments, search engines, and computer platforms. The performance of iProphet is demonstrated on two publicly available data sets: data from a human whole cell lysate proteome profiling experiment representative of typical proteomic data sets, and from a set of Streptococcus pyogenes experiments more representative of organism-specific composite data sets. PMID:21876204
Analysis of high accuracy, quantitative proteomics data in the MaxQB database.
Schaab, Christoph; Geiger, Tamar; Stoehr, Gabriele; Cox, Juergen; Mann, Matthias
2012-03-01
MS-based proteomics generates rapidly increasing amounts of precise and quantitative information. Analysis of individual proteomic experiments has made great strides, but the crucial ability to compare and store information across different proteome measurements still presents many challenges. For example, it has been difficult to avoid contamination of databases with low quality peptide identifications, to control for the inflation in false positive identifications when combining data sets, and to integrate quantitative data. Although, for example, the contamination with low quality identifications has been addressed by joint analysis of deposited raw data in some public repositories, we reasoned that there should be a role for a database specifically designed for high resolution and quantitative data. Here we describe a novel database termed MaxQB that stores and displays collections of large proteomics projects and allows joint analysis and comparison. We demonstrate the analysis tools of MaxQB using proteome data of 11 different human cell lines and 28 mouse tissues. The database-wide false discovery rate is controlled by adjusting the project specific cutoff scores for the combined data sets. The 11 cell line proteomes together identify proteins expressed from more than half of all human genes. For each protein of interest, expression levels estimated by label-free quantification can be visualized across the cell lines. Similarly, the expression rank order and estimated amount of each protein within each proteome are plotted. We used MaxQB to calculate the signal reproducibility of the detected peptides for the same proteins across different proteomes. Spearman rank correlation between peptide intensity and detection probability of identified proteins was greater than 0.8 for 64% of the proteome, whereas a minority of proteins have negative correlation. This information can be used to pinpoint false protein identifications, independently of peptide database scores. The information contained in MaxQB, including high resolution fragment spectra, is accessible to the community via a user-friendly web interface at http://www.biochem.mpg.de/maxqb.
USDA-ARS?s Scientific Manuscript database
Matrix-assisted laser desorption/ionization tandem time-of-flight (MALDI-TOF-TOF) mass spectrometry is increasingly utilized for rapid top-down proteomic identification of proteins. This identification may involve analysis of either a pure protein or a protein mixture. For analysis of a pure protein...
Computer aided manual validation of mass spectrometry-based proteomic data.
Curran, Timothy G; Bryson, Bryan D; Reigelhaupt, Michael; Johnson, Hannah; White, Forest M
2013-06-15
Advances in mass spectrometry-based proteomic technologies have increased the speed of analysis and the depth provided by a single analysis. Computational tools to evaluate the accuracy of peptide identifications from these high-throughput analyses have not kept pace with technological advances; currently the most common quality evaluation methods are based on statistical analysis of the likelihood of false positive identifications in large-scale data sets. While helpful, these calculations do not consider the accuracy of each identification, thus creating a precarious situation for biologists relying on the data to inform experimental design. Manual validation is the gold standard approach to confirm accuracy of database identifications, but is extremely time-intensive. To palliate the increasing time required to manually validate large proteomic datasets, we provide computer aided manual validation software (CAMV) to expedite the process. Relevant spectra are collected, catalogued, and pre-labeled, allowing users to efficiently judge the quality of each identification and summarize applicable quantitative information. CAMV significantly reduces the burden associated with manual validation and will hopefully encourage broader adoption of manual validation in mass spectrometry-based proteomics. Copyright © 2013 Elsevier Inc. All rights reserved.
Thiele, H.; Glandorf, J.; Koerting, G.; Reidegeld, K.; Blüggel, M.; Meyer, H.; Stephan, C.
2007-01-01
In today’s proteomics research, various techniques and instrumentation bioinformatics tools are necessary to manage the large amount of heterogeneous data with an automatic quality control to produce reliable and comparable results. Therefore a data-processing pipeline is mandatory for data validation and comparison in a data-warehousing system. The proteome bioinformatics platform ProteinScape has been proven to cover these needs. The reprocessing of HUPO BPP participants’ MS data was done within ProteinScape. The reprocessed information was transferred into the global data repository PRIDE. ProteinScape as a data-warehousing system covers two main aspects: archiving relevant data of the proteomics workflow and information extraction functionality (protein identification, quantification and generation of biological knowledge). As a strategy for automatic data validation, different protein search engines are integrated. Result analysis is performed using a decoy database search strategy, which allows the measurement of the false-positive identification rate. Peptide identifications across different workflows, different MS techniques, and different search engines are merged to obtain a quality-controlled protein list. The proteomics identifications database (PRIDE), as a public data repository, is an archiving system where data are finally stored and no longer changed by further processing steps. Data submission to PRIDE is open to proteomics laboratories generating protein and peptide identifications. An export tool has been developed for transferring all relevant HUPO BPP data from ProteinScape into PRIDE using the PRIDE.xml format. The EU-funded ProDac project will coordinate the development of software tools covering international standards for the representation of proteomics data. The implementation of data submission pipelines and systematic data collection in public standards–compliant repositories will cover all aspects, from the generation of MS data in each laboratory to the conversion of all the annotating information and identifications to a standardized format. Such datasets can be used in the course of publishing in scientific journals.
Fagerquist, Clifton K
2017-01-01
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) is increasingly utilized as a rapid technique to identify microorganisms including pathogenic bacteria. However, little attention has been paid to the significant proteomic information encoded in the MS peaks that collectively constitute the MS 'fingerprint'. This review/perspective is intended to explore this topic in greater detail in the hopes that it may spur interest and further research in this area. Areas covered: This paper examines the recent literature on utilizing MALDI-TOF for bacterial identification. Critical works highlighting protein biomarker identification of bacteria, arguments for and against protein biomarker identification, proteomic approaches to biomarker identification, emergence of MALDI-TOF-TOF platforms and their use for top-down proteomic identification of bacterial proteins, protein denaturation and its effect on protein ion fragmentation, collision cross-sections and energy deposition during desorption/ionization are also explored. Expert commentary: MALDI-TOF and TOF-TOF mass spectrometry platforms will continue to provide chemical analyses that are rapid, cost-effective and high throughput. These instruments have proven their utility in the taxonomic identification of pathogenic bacteria at the genus and species level and are poised to more fully characterize these microorganisms to the benefit of clinical microbiology, food safety and other fields.
Liu, Ming-Qi; Zeng, Wen-Feng; Fang, Pan; Cao, Wei-Qian; Liu, Chao; Yan, Guo-Quan; Zhang, Yang; Peng, Chao; Wu, Jian-Qiang; Zhang, Xiao-Jin; Tu, Hui-Jun; Chi, Hao; Sun, Rui-Xiang; Cao, Yong; Dong, Meng-Qiu; Jiang, Bi-Yun; Huang, Jiang-Ming; Shen, Hua-Li; Wong, Catherine C L; He, Si-Min; Yang, Peng-Yuan
2017-09-05
The precise and large-scale identification of intact glycopeptides is a critical step in glycoproteomics. Owing to the complexity of glycosylation, the current overall throughput, data quality and accessibility of intact glycopeptide identification lack behind those in routine proteomic analyses. Here, we propose a workflow for the precise high-throughput identification of intact N-glycopeptides at the proteome scale using stepped-energy fragmentation and a dedicated search engine. pGlyco 2.0 conducts comprehensive quality control including false discovery rate evaluation at all three levels of matches to glycans, peptides and glycopeptides, improving the current level of accuracy of intact glycopeptide identification. The N-glycoproteome of samples metabolically labeled with 15 N/ 13 C were analyzed quantitatively and utilized to validate the glycopeptide identification, which could be used as a novel benchmark pipeline to compare different search engines. Finally, we report a large-scale glycoproteome dataset consisting of 10,009 distinct site-specific N-glycans on 1988 glycosylation sites from 955 glycoproteins in five mouse tissues.Protein glycosylation is a heterogeneous post-translational modification that generates greater proteomic diversity that is difficult to analyze. Here the authors describe pGlyco 2.0, a workflow for the precise one step identification of intact N-glycopeptides at the proteome scale.
Urine Sample Preparation in 96-Well Filter Plates for Quantitative Clinical Proteomics
2015-01-01
Urine is an important, noninvasively collected body fluid source for the diagnosis and prognosis of human diseases. Liquid chromatography mass spectrometry (LC-MS) based shotgun proteomics has evolved as a sensitive and informative technique to discover candidate disease biomarkers from urine specimens. Filter-aided sample preparation (FASP) generates peptide samples from protein mixtures of cell lysate or body fluid origin. Here, we describe a FASP method adapted to 96-well filter plates, named 96FASP. Soluble urine concentrates containing ∼10 μg of total protein were processed by 96FASP and LC-MS resulting in 700–900 protein identifications at a 1% false discovery rate (FDR). The experimental repeatability, as assessed by label-free quantification and Pearson correlation analysis for shared proteins among replicates, was high (R ≥ 0.97). Application to urinary pellet lysates which is of particular interest in the context of urinary tract infection analysis was also demonstrated. On average, 1700 proteins (±398) were identified in five experiments. In a pilot study using 96FASP for analysis of eight soluble urine samples, we demonstrated that protein profiles of technical replicates invariably clustered; the protein profiles for distinct urine donors were very different from each other. Robust, highly parallel methods to generate peptide mixtures from urine and other body fluids are critical to increase cost-effectiveness in clinical proteomics projects. This 96FASP method has potential to become a gold standard for high-throughput quantitative clinical proteomics. PMID:24797144
Current algorithmic solutions for peptide-based proteomics data generation and identification.
Hoopmann, Michael R; Moritz, Robert L
2013-02-01
Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics. Copyright © 2012 Elsevier Ltd. All rights reserved.
Wu, Pei-Wen; Mason, Katelyn E; Durbin-Johnson, Blythe P; Salemi, Michelle; Phinney, Brett S; Rocke, David M; Parker, Glendon J; Rice, Robert H
2017-07-01
Forensic association of hair shaft evidence with individuals is currently assessed by comparing mitochondrial DNA haplotypes of reference and casework samples, primarily for exclusionary purposes. Present work tests and validates more recent proteomic approaches to extract quantitative transcriptional and genetic information from hair samples of monozygotic twin pairs, which would be predicted to partition away from unrelated individuals if the datasets contain identifying information. Protein expression profiles and polymorphic, genetically variant hair peptides were generated from ten pairs of monozygotic twins. Profiling using the protein tryptic digests revealed that samples from identical twins had typically an order of magnitude fewer protein expression differences than unrelated individuals. The data did not indicate that the degree of difference within twin pairs increased with age. In parallel, data from the digests were used to detect genetically variant peptides that result from common nonsynonymous single nucleotide polymorphisms in genes expressed in the hair follicle. Compilation of the variants permitted sorting of the samples by hierarchical clustering, permitting accurate matching of twin pairs. The results demonstrate that genetic differences are detectable by proteomic methods and provide a framework for developing quantitative statistical estimates of personal identification that increase the value of hair shaft evidence. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Baracat-Pereira, Maria Cristina; de Oliveira Barbosa, Meire; Magalhães, Marcos Jorge; Carrijo, Lanna Clicia; Games, Patrícia Dias; Almeida, Hebréia Oliveira; Sena Netto, José Fabiano; Pereira, Matheus Rodrigues; de Barros, Everaldo Gonçalves
2012-06-01
The enrichment and isolation of proteins are considered limiting steps in proteomic studies. Identification of proteins whose expression is transient, those that are of low-abundance, and of natural peptides not described in databases, is still a great challenge. Plant extracts are in general complex, and contaminants interfere with the identification of proteins involved in important physiological processes, such as plant defense against pathogens. This review discusses the challenges and strategies of separomics applied to the identification of low-abundance proteins and peptides in plants, especially in plants challenged by pathogens. Separomics is described as a group of methodological strategies for the separation of protein molecules for proteomics. Several tools have been used to remove highly abundant proteins from samples and also non-protein contaminants. The use of chromatographic techniques, the partition of the proteome into subproteomes, and an effort to isolate proteins in their native form have allowed the isolation and identification of rare proteins involved in different processes.
Baracat-Pereira, Maria Cristina; de Oliveira Barbosa, Meire; Magalhães, Marcos Jorge; Carrijo, Lanna Clicia; Games, Patrícia Dias; Almeida, Hebréia Oliveira; Sena Netto, José Fabiano; Pereira, Matheus Rodrigues; de Barros, Everaldo Gonçalves
2012-01-01
The enrichment and isolation of proteins are considered limiting steps in proteomic studies. Identification of proteins whose expression is transient, those that are of low-abundance, and of natural peptides not described in databases, is still a great challenge. Plant extracts are in general complex, and contaminants interfere with the identification of proteins involved in important physiological processes, such as plant defense against pathogens. This review discusses the challenges and strategies of separomics applied to the identification of low-abundance proteins and peptides in plants, especially in plants challenged by pathogens. Separomics is described as a group of methodological strategies for the separation of protein molecules for proteomics. Several tools have been used to remove highly abundant proteins from samples and also non-protein contaminants. The use of chromatographic techniques, the partition of the proteome into subproteomes, and an effort to isolate proteins in their native form have allowed the isolation and identification of rare proteins involved in different processes. PMID:22802713
Schubert, Peter; Devine, Dana V
2010-01-03
Proteomics has brought new perspectives to the fields of hematology and transfusion medicine in the last decade. The steady improvement of proteomic technology is propelling novel discoveries of molecular mechanisms by studying protein expression, post-translational modifications and protein interactions. This review article focuses on the application of proteomics to the identification of molecular mechanisms leading to the deterioration of blood platelets during storage - a critical aspect in the provision of platelet transfusion products. Several proteomic approaches have been employed to analyse changes in the platelet protein profile during storage and the obtained data now need to be translated into platelet biochemistry in order to connect the results to platelet function. Targeted biochemical applications then allow the identification of points for intervention in signal transduction pathways. Once validated and placed in a transfusion context, these data will provide further understanding of the underlying molecular mechanisms leading to platelet storage lesion. Future aspects of proteomics in blood banking will aim to make use of protein markers identified for platelet storage lesion development to monitor proteome changes when alterations such as the use of additive solutions or pathogen reduction strategies are put in place in order to improve platelet quality for patients. (c) 2009 Elsevier B.V. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Immunogenic, pathogen-specific proteins have excellent potential for development of novel management modalities. Here, we describe an innovative application of proteomics called Microbial protein-Antigenome Determination (MAD) Technology for rapid identification of native microbial proteins that el...
USDA-ARS?s Scientific Manuscript database
Immunogenic, pathogen-specific proteins have excellent potential for development of novel management modalities. Here, we describe an innovative application of proteomics called Microbial protein-Antigenome Determination (MAD) Technology for rapid identification of native microbial proteins that eli...
Mühlhaus, Timo; Weiss, Julia; Hemme, Dorothea; Sommer, Frederik; Schroda, Michael
2011-01-01
Crop-plant-yield safety is jeopardized by temperature stress caused by the global climate change. To take countermeasures by breeding and/or transgenic approaches it is essential to understand the mechanisms underlying plant acclimation to heat stress. To this end proteomics approaches are most promising, as acclimation is largely mediated by proteins. Accordingly, several proteomics studies, mainly based on two-dimensional gel-tandem MS approaches, were conducted in the past. However, results often were inconsistent, presumably attributable to artifacts inherent to the display of complex proteomes via two-dimensional-gels. We describe here a new approach to monitor proteome dynamics in time course experiments. This approach involves full 15N metabolic labeling and mass spectrometry based quantitative shotgun proteomics using a uniform 15N standard over all time points. It comprises a software framework, IOMIQS, that features batch job mediated automated peptide identification by four parallelized search engines, peptide quantification and data assembly for the processing of large numbers of samples. We have applied this approach to monitor proteome dynamics in a heat stress time course using the unicellular green alga Chlamydomonas reinhardtii as model system. We were able to identify 3433 Chlamydomonas proteins, of which 1116 were quantified in at least three of five time points of the time course. Statistical analyses revealed that levels of 38 proteins significantly increased, whereas levels of 206 proteins significantly decreased during heat stress. The increasing proteins comprise 25 (co-)chaperones and 13 proteins involved in chromatin remodeling, signal transduction, apoptosis, photosynthetic light reactions, and yet unknown functions. Proteins decreasing during heat stress were significantly enriched in functional categories that mediate carbon flux from CO2 and external acetate into protein biosynthesis, which also correlated with a rapid, but fully reversible cell cycle arrest after onset of stress. Our approach opens up new perspectives for plant systems biology and provides novel insights into plant stress acclimation. PMID:21610104
Jones, K.; Kim, K.; Patel, B.; Kelsen, S.; Braverman, A.; Swinton, D.; Gafken, P.; Jones, L.; Lane, W.; Neveu, J.; Leung, H.; Shaffer, S.; Leszyk, J.; Stanley, B.; Fox, T.; Stanley, A.; Yeung, Anthony
2013-01-01
Proteomic research can benefit from simultaneous access to multiple cutting-edge mass spectrometers. 18 core facilities responded to our investigators seeking service through the ABRF Discussion Forum. Five of the facilities selected completed four plasma proteomics experiments as routine fee-for-service. Each biological experiment entailed an iTRAQ 4-plex proteome comparison of immunodepleted plasma provided as 30 labeled-peptide fractions. Identical samples were analyzed by two AB SCIEX TripleTOF 5600 and three Thermo Orbitrap (Elite/Velos Pro/Q Exactive) instruments. 480 LC-MS/MS runs delivered >250 GB of data over two months. We compare herein routine service analyses of three peptide fractions of different peptide abundance. Data files from each instrument were studied to develop optimal analysis parameters to compare with default parameters in Mascot Distiller 2.4, ProteinPilot 4.5 beta, AB Sciex MS Data Converter 1.3 beta, and Proteome Discover 1.3. Peak-picking for TripleTOFs was best by ProteinPilot 4.5 beta while Mascot Distiller and Proteome Discoverer were comparable for the Orbitraps. We compared protein identification and quantitation in SwissProt 2012_07 database by Mascot Server 2.4.01 versus ProteinPilot. By all search methods, more proteins, up to two fold, were identified using the Q Exactive than others. Q Exactive excelled also at the number of unique significant peptide ion sequences. However, software-dependent impact on subsequent interpretation, due to peptide modifications, can be critical. These findings may have special implications for iTRAQ plasma proteomics. For the low abundance peptide ions, the slope of the dynamic range drop-off in the plasma proteome is uniquely sharp compared with cell lysates. Our study provides data for testable improvements in the operation of these mass spectrometers. More importantly, we have demonstrated a new affordable expedient workflow for investigators to perform proteomic experiments through the ABRF infrastructure. (We acknowledge John Cottrell for optimizing the peak-picking parameters for Mascot Distiller).
Orton, Dennis J.; Doucette, Alan A.
2013-01-01
Identification of biomarkers capable of differentiating between pathophysiological states of an individual is a laudable goal in the field of proteomics. Protein biomarker discovery generally employs high throughput sample characterization by mass spectrometry (MS), being capable of identifying and quantifying thousands of proteins per sample. While MS-based technologies have rapidly matured, the identification of truly informative biomarkers remains elusive, with only a handful of clinically applicable tests stemming from proteomic workflows. This underlying lack of progress is attributed in large part to erroneous experimental design, biased sample handling, as well as improper statistical analysis of the resulting data. This review will discuss in detail the importance of experimental design and provide some insight into the overall workflow required for biomarker identification experiments. Proper balance between the degree of biological vs. technical replication is required for confident biomarker identification. PMID:28250400
Dormeyer, Wilma; van Hoof, Dennis; Mummery, Christine L; Krijgsveld, Jeroen; Heck, Albert J R
2008-10-01
The identification of (plasma) membrane proteins in cells can provide valuable insights into the regulation of their biological processes. Pluripotent cells such as human embryonic stem cells and embryonal carcinoma cells are capable of unlimited self-renewal and share many of the biological mechanisms that regulate proliferation and differentiation. The comparison of their membrane proteomes will help unravel the biological principles of pluripotency, and the identification of biomarker proteins in their plasma membranes is considered a crucial step to fully exploit pluripotent cells for therapeutic purposes. For these tasks, membrane proteomics is the method of choice, but as indicated by the scarce identification of membrane and plasma membrane proteins in global proteomic surveys it is not an easy task. In this minireview, we first describe the general challenges of membrane proteomics. We then review current sample preparation steps and discuss protocols that we found particularly beneficial for the identification of large numbers of (plasma) membrane proteins in human tumour- and embryo-derived stem cells. Our optimized assembled protocol led to the identification of a large number of membrane proteins. However, as the composition of cells and membranes is highly variable we still recommend adapting the sample preparation protocol for each individual system.
Clinical veterinary proteomics: Techniques and approaches to decipher the animal plasma proteome.
Ghodasara, P; Sadowski, P; Satake, N; Kopp, S; Mills, P C
2017-12-01
Over the last two decades, technological advancements in the field of proteomics have advanced our understanding of the complex biological systems of living organisms. Techniques based on mass spectrometry (MS) have emerged as powerful tools to contextualise existing genomic information and to create quantitative protein profiles from plasma, tissues or cell lines of various species. Proteomic approaches have been used increasingly in veterinary science to investigate biological processes responsible for growth, reproduction and pathological events. However, the adoption of proteomic approaches by veterinary investigators lags behind that of researchers in the human medical field. Furthermore, in contrast to human proteomics studies, interpretation of veterinary proteomic data is difficult due to the limited protein databases available for many animal species. This review article examines the current use of advanced proteomics techniques for evaluation of animal health and welfare and covers the current status of clinical veterinary proteomics research, including successful protein identification and data interpretation studies. It includes a description of an emerging tool, sequential window acquisition of all theoretical fragment ion mass spectra (SWATH-MS), available on selected mass spectrometry instruments. This newly developed data acquisition technique combines advantages of discovery and targeted proteomics approaches, and thus has the potential to advance the veterinary proteomics field by enhancing identification and reproducibility of proteomics data. Copyright © 2017 Elsevier Ltd. All rights reserved.
Bailey, Ulla-Maja; Schulz, Benjamin L
2013-04-01
Post-translational modification of proteins with glycosylation is of key importance in many biological systems in eukaryotes, influencing fundamental biological processes and regulating protein function. Changes in glycosylation are therefore of interest in understanding these processes and are also useful as clinical biomarkers of disease. The presence of glycosylation can also inhibit protease digestion and lower the quality and confidence of protein identification by mass spectrometry. While deglycosylation can improve the efficiency of subsequent protease digest and increase protein coverage, this step is often excluded from proteomic workflows. Here, we performed a systematic analysis that showed that deglycosylation with peptide-N-glycosidase F (PNGase F) prior to protease digestion with AspN or trypsin improved the quality of identification of the yeast cell wall proteome. The improvement in the confidence of identification of glycoproteins following PNGase F deglycosylation correlated with a higher density of glycosylation sites. Optimal identification across the proteome was achieved with PNGase F deglycosylation and complementary proteolysis with either AspN or trypsin. We used this combination of deglycosylation and complementary protease digest to identify changes in the yeast cell wall proteome caused by lack of the Alg3p protein, a key component of the biosynthetic pathway of protein N-glycosylation. The cell wall of yeast lacking Alg3p showed specifically increased levels of Cis3p, a protein important for cell wall integrity. Our results showed that deglycosylation prior to protease digestion improved the quality of proteomic analyses even if protein glycosylation is not of direct relevance to the study at hand. Copyright © 2013 Elsevier B.V. All rights reserved.
Proteomics for understanding miRNA biology
Huang, Tai-Chung; Pinto, Sneha M.; Pandey, Akhilesh
2013-01-01
MicroRNAs (miRNAs) are small noncoding RNAs that play important roles in posttranscriptional regulation of gene expression. Mature miRNAs associate with the RNA interference silencing complex to repress mRNA translation and/or degrade mRNA transcripts. Mass spectrometry-based proteomics has enabled identification of several core components of the canonical miRNA processing pathway and their posttranslational modifications which are pivotal in miRNA regulatory mechanisms. The use of quantitative proteomic strategies has also emerged as a key technique for experimental identification of miRNA targets by allowing direct determination of proteins whose levels are altered because of translational suppression. This review focuses on the role of proteomics and labeling strategies to understand miRNA biology. PMID:23125164
Park, Gun Wook; Hwang, Heeyoun; Kim, Kwang Hoe; Lee, Ju Yeon; Lee, Hyun Kyoung; Park, Ji Yeong; Ji, Eun Sun; Park, Sung-Kyu Robin; Yates, John R; Kwon, Kyung-Hoon; Park, Young Mok; Lee, Hyoung-Joo; Paik, Young-Ki; Kim, Jin Young; Yoo, Jong Shin
2016-11-04
In the Chromosome-Centric Human Proteome Project (C-HPP), false-positive identification by peptide spectrum matches (PSMs) after database searches is a major issue for proteogenomic studies using liquid-chromatography and mass-spectrometry-based large proteomic profiling. Here we developed a simple strategy for protein identification, with a controlled false discovery rate (FDR) at the protein level, using an integrated proteomic pipeline (IPP) that consists of four engrailed steps as follows. First, using three different search engines, SEQUEST, MASCOT, and MS-GF+, individual proteomic searches were performed against the neXtProt database. Second, the search results from the PSMs were combined using statistical evaluation tools including DTASelect and Percolator. Third, the peptide search scores were converted into E-scores normalized using an in-house program. Last, ProteinInferencer was used to filter the proteins containing two or more peptides with a controlled FDR of 1.0% at the protein level. Finally, we compared the performance of the IPP to a conventional proteomic pipeline (CPP) for protein identification using a controlled FDR of <1% at the protein level. Using the IPP, a total of 5756 proteins (vs 4453 using the CPP) including 477 alternative splicing variants (vs 182 using the CPP) were identified from human hippocampal tissue. In addition, a total of 10 missing proteins (vs 7 using the CPP) were identified with two or more unique peptides, and their tryptic peptides were validated using MS/MS spectral pattern from a repository database or their corresponding synthetic peptides. This study shows that the IPP effectively improved the identification of proteins, including alternative splicing variants and missing proteins, in human hippocampal tissues for the C-HPP. All RAW files used in this study were deposited in ProteomeXchange (PXD000395).
MASH Suite Pro: A Comprehensive Software Tool for Top-Down Proteomics*
Cai, Wenxuan; Guner, Huseyin; Gregorich, Zachery R.; Chen, Albert J.; Ayaz-Guner, Serife; Peng, Ying; Valeja, Santosh G.; Liu, Xiaowen; Ge, Ying
2016-01-01
Top-down mass spectrometry (MS)-based proteomics is arguably a disruptive technology for the comprehensive analysis of all proteoforms arising from genetic variation, alternative splicing, and posttranslational modifications (PTMs). However, the complexity of top-down high-resolution mass spectra presents a significant challenge for data analysis. In contrast to the well-developed software packages available for data analysis in bottom-up proteomics, the data analysis tools in top-down proteomics remain underdeveloped. Moreover, despite recent efforts to develop algorithms and tools for the deconvolution of top-down high-resolution mass spectra and the identification of proteins from complex mixtures, a multifunctional software platform, which allows for the identification, quantitation, and characterization of proteoforms with visual validation, is still lacking. Herein, we have developed MASH Suite Pro, a comprehensive software tool for top-down proteomics with multifaceted functionality. MASH Suite Pro is capable of processing high-resolution MS and tandem MS (MS/MS) data using two deconvolution algorithms to optimize protein identification results. In addition, MASH Suite Pro allows for the characterization of PTMs and sequence variations, as well as the relative quantitation of multiple proteoforms in different experimental conditions. The program also provides visualization components for validation and correction of the computational outputs. Furthermore, MASH Suite Pro facilitates data reporting and presentation via direct output of the graphics. Thus, MASH Suite Pro significantly simplifies and speeds up the interpretation of high-resolution top-down proteomics data by integrating tools for protein identification, quantitation, characterization, and visual validation into a customizable and user-friendly interface. We envision that MASH Suite Pro will play an integral role in advancing the burgeoning field of top-down proteomics. PMID:26598644
Shotgun proteomics of the barley seed proteome
USDA-ARS?s Scientific Manuscript database
Barley seed proteins are of prime importance to the brewing industry, human and animal nutrition and in plant breeding for cultivar identification. To obtain comprehensive proteomic data from barley seeds, acetone precipitated proteins were in-solution digested and the resulting peptides were analyz...
Aasebø, Elise; Forthun, Rakel B.; Berven, Frode; Selheim, Frode; Hernandez-Valladares, Maria
2016-01-01
The identification of protein biomarkers for acute myeloid leukemia (AML) that could find applications in AML diagnosis and prognosis, treatment and the selection for bone marrow transplant requires substantial comparative analyses of the proteomes from AML patients. In the past years, several studies have suggested some biomarkers for AML diagnosis or AML classification using methods for sample preparation with low proteome coverage and low resolution mass spectrometers. However, most of the studies did not follow up, confirm or validate their candidates with more patient samples. Current proteomics methods, new high resolution and fast mass spectrometers allow the identification and quantification of several thousands of proteins obtained from few tens of μg of AML cell lysate. Enrichment methods for posttranslational modifications (PTM), such as phosphorylation, can isolate several thousands of site-specific phosphorylated peptides from AML patient samples, which subsequently can be quantified with high confidence in new mass spectrometers. While recent reports aiming to propose proteomic or phosphoproteomic biomarkers on the studied AML patient samples have taken advantage of the technological progress, the access to large cohorts of AML patients to sample from and the availability of appropriate control samples still remain challenging. PMID:26306748
Chen, Jin-Qiu; Wakefield, Lalage M; Goldstein, David J
2015-06-06
There is an emerging demand for the use of molecular profiling to facilitate biomarker identification and development, and to stratify patients for more efficient treatment decisions with reduced adverse effects. In the past decade, great strides have been made to advance genomic, transcriptomic and proteomic approaches to address these demands. While there has been much progress with these large scale approaches, profiling at the protein level still faces challenges due to limitations in clinical sample size, poor reproducibility, unreliable quantitation, and lack of assay robustness. A novel automated capillary nano-immunoassay (CNIA) technology has been developed. This technology offers precise and accurate measurement of proteins and their post-translational modifications using either charge-based or size-based separation formats. The system not only uses ultralow nanogram levels of protein but also allows multi-analyte analysis using a parallel single-analyte format for increased sensitivity and specificity. The high sensitivity and excellent reproducibility of this technology make it particularly powerful for analysis of clinical samples. Furthermore, the system can distinguish and detect specific protein post-translational modifications that conventional Western blot and other immunoassays cannot easily capture. This review will summarize and evaluate the latest progress to optimize the CNIA system for comprehensive, quantitative protein and signaling event characterization. It will also discuss how the technology has been successfully applied in both discovery research and clinical studies, for signaling pathway dissection, proteomic biomarker assessment, targeted treatment evaluation and quantitative proteomic analysis. Lastly, a comparison of this novel system with other conventional immuno-assay platforms is performed.
Quantitative body fluid proteomics in medicine - A focus on minimal invasiveness.
Csősz, Éva; Kalló, Gergő; Márkus, Bernadett; Deák, Eszter; Csutak, Adrienne; Tőzsér, József
2017-02-05
Identification of new biomarkers specific for various pathological conditions is an important field in medical sciences. Body fluids have emerging potential in biomarker studies especially those which are continuously available and can be collected by non-invasive means. Changes in the protein composition of body fluids such as tears, saliva, sweat, etc. may provide information on both local and systemic conditions of medical relevance. In this review, our aim is to discuss the quantitative proteomics techniques used in biomarker studies, and to present advances in quantitative body fluid proteomics of non-invasively collectable body fluids with relevance to biomarker identification. The advantages and limitations of the widely used quantitative proteomics techniques are also presented. Based on the reviewed literature, we suggest an ideal pipeline for body fluid analyses aiming at biomarkers discoveries: starting from identification of biomarker candidates by shotgun quantitative proteomics or protein arrays, through verification of potential biomarkers by targeted mass spectrometry, to the antibody-based validation of biomarkers. The importance of body fluids as a rich source of biomarkers is discussed. Quantitative proteomics is a challenging part of proteomics applications. The body fluids collected by non-invasive means have high relevance in medicine; they are good sources for biomarkers used in establishing the diagnosis, follow up of disease progression and predicting high risk groups. The review presents the most widely used quantitative proteomics techniques in body fluid analysis and lists the potential biomarkers identified in tears, saliva, sweat, nasal mucus and urine for local and systemic diseases. Copyright © 2016 Elsevier B.V. All rights reserved.
Pre-fractionation strategies to resolve pea (Pisum sativum) sub-proteomes
Meisrimler, Claudia-Nicole; Menckhoff, Ljiljana; Kukavica, Biljana M.; Lüthje, Sabine
2015-01-01
Legumes are important crop plants and pea (Pisum sativum L.) has been investigated as a model with respect to several physiological aspects. The sequencing of the pea genome has not been completed. Therefore, proteomic approaches are currently limited. Nevertheless, the increasing numbers of available EST-databases as well as the high homology of the pea and medicago genome (Medicago truncatula Gaertner) allow the successful identification of proteins. Due to the un-sequenced pea genome, pre-fractionation approaches have been used in pea proteomic surveys in the past. Aside from a number of selective proteome studies on crude extracts and the chloroplast, few studies have targeted other components such as the pea secretome, an important sub-proteome of interest due to its role in abiotic and biotic stress processes. The secretome itself can be further divided into different sub-proteomes (plasma membrane, apoplast, cell wall proteins). Cell fractionation in combination with different gel-electrophoresis, chromatography methods and protein identification by mass spectrometry are important partners to gain insight into pea sub-proteomes, post-translational modifications and protein functions. Overall, pea proteomics needs to link numerous existing physiological and biochemical data to gain further insight into adaptation processes, which play important roles in field applications. Future developments and directions in pea proteomics are discussed. PMID:26539198
A standardized framing for reporting protein identifications in mzIdentML 1.2
Seymour, Sean L.; Farrah, Terry; Binz, Pierre-Alain; Chalkley, Robert J.; Cottrell, John S.; Searle, Brian C.; Tabb, David L.; Vizcaíno, Juan Antonio; Prieto, Gorka; Uszkoreit, Julian; Eisenacher, Martin; Martínez-Bartolomé, Salvador; Ghali, Fawaz; Jones, Andrew R.
2015-01-01
Inferring which protein species have been detected in bottom-up proteomics experiments has been a challenging problem for which solutions have been maturing over the past decade. While many inference approaches now function well in isolation, comparing and reconciling the results generated across different tools remains difficult. It presently stands as one of the greatest barriers in collaborative efforts such as the Human Proteome Project and public repositories like the PRoteomics IDEntifications (PRIDE) database. Here we present a framework for reporting protein identifications that seeks to improve capabilities for comparing results generated by different inference tools. This framework standardizes the terminology for describing protein identification results, associated with the HUPO-Proteomics Standards Initiative (PSI) mzIdentML standard, while still allowing for differing methodologies to reach that final state. It is proposed that developers of software for reporting identification results will adopt this terminology in their outputs. While the new terminology does not require any changes to the core mzIdentML model, it represents a significant change in practice, and, as such, the rules will be released via a new version of the mzIdentML specification (version 1.2) so that consumers of files are able to determine whether the new guidelines have been adopted by export software. PMID:25092112
Proteomic analysis of human aqueous humor using multidimensional protein identification technology
Richardson, Matthew R.; Price, Marianne O.; Price, Francis W.; Pardo, Jennifer C.; Grandin, Juan C.; You, Jinsam; Wang, Mu
2009-01-01
Aqueous humor (AH) supports avascular tissues in the anterior segment of the eye, maintains intraocular pressure, and potentially influences the pathogenesis of ocular diseases. Nevertheless, the AH proteome is still poorly defined despite several previous efforts, which were hindered by interfering high abundance proteins, inadequate animal models, and limited proteomic technologies. To facilitate future investigations into AH function, the AH proteome was extensively characterized using an advanced proteomic approach. Samples from patients undergoing cataract surgery were pooled and depleted of interfering abundant proteins and thereby divided into two fractions: albumin-bound and albumin-depleted. Multidimensional Protein Identification Technology (MudPIT) was utilized for each fraction; this incorporates strong cation exchange chromatography to reduce sample complexity before reversed-phase liquid chromatography and tandem mass spectrometric analysis. Twelve proteins had multi-peptide, high confidence identifications in the albumin-bound fraction and 50 proteins had multi-peptide, high confidence identifications in the albumin-depleted fraction. Gene ontological analyses were performed to determine which cellular components and functions were enriched. Many proteins were previously identified in the AH and for several their potential role in the AH has been investigated; however, the majority of identified proteins were novel and only speculative roles can be suggested. The AH was abundant in anti-oxidant and immunoregulatory proteins as well as anti-angiogenic proteins, which may be involved in maintaining the avascular tissues. This is the first known report to extensively characterize and describe the human AH proteome and lays the foundation for future work regarding its function in homeostatic and pathologic states. PMID:20019884
Piehowski, Paul D; Petyuk, Vladislav A; Sandoval, John D; Burnum, Kristin E; Kiebel, Gary R; Monroe, Matthew E; Anderson, Gordon A; Camp, David G; Smith, Richard D
2013-03-01
For bottom-up proteomics, there are wide variety of database-searching algorithms in use for matching peptide sequences to tandem MS spectra. Likewise, there are numerous strategies being employed to produce a confident list of peptide identifications from the different search algorithm outputs. Here we introduce a grid-search approach for determining optimal database filtering criteria in shotgun proteomics data analyses that is easily adaptable to any search. Systematic Trial and Error Parameter Selection--referred to as STEPS--utilizes user-defined parameter ranges to test a wide array of parameter combinations to arrive at an optimal "parameter set" for data filtering, thus maximizing confident identifications. The benefits of this approach in terms of numbers of true-positive identifications are demonstrated using datasets derived from immunoaffinity-depleted blood serum and a bacterial cell lysate, two common proteomics sample types. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic platform for the identification of proteins in olive (Olea europaea) pulp.
Capriotti, Anna Laura; Cavaliere, Chiara; Foglia, Patrizia; Piovesana, Susy; Samperi, Roberto; Stampachiacchiere, Serena; Laganà, Aldo
2013-10-24
The nutritional and cancer-protective properties of the oil extracted mechanically from the ripe fruits of Olea europaea trees are attracting constantly more attention worldwide. The preparation of high-quality protein samples from plant tissues for proteomic analysis poses many challenging problems. In this study we employed a proteomic platform based on two different extraction methods, SDS and CHAPS based protocols, followed by two precipitation protocols, TCA/acetone and MeOH precipitation, in order to increase the final number of identified proteins. The use of advanced MS techniques in combination with the Swissprot and NCBI Viridiplantae databases and TAIR10 Arabidopsis database allowed us to identify 1265 proteins, of which 22 belong to O. europaea. The application of this proteomic platform for protein extraction and identification will be useful also for other proteomic studies on recalcitrant plant/fruit tissues. Copyright © 2013. Published by Elsevier B.V.
Proteomics: Protein Identification Using Online Databases
ERIC Educational Resources Information Center
Eurich, Chris; Fields, Peter A.; Rice, Elizabeth
2012-01-01
Proteomics is an emerging area of systems biology that allows simultaneous study of thousands of proteins expressed in cells, tissues, or whole organisms. We have developed this activity to enable high school or college students to explore proteomic databases using mass spectrometry data files generated from yeast proteins in a college laboratory…
Proteomics for understanding miRNA biology.
Huang, Tai-Chung; Pinto, Sneha M; Pandey, Akhilesh
2013-02-01
MicroRNAs (miRNAs) are small noncoding RNAs that play important roles in posttranscriptional regulation of gene expression. Mature miRNAs associate with the RNA interference silencing complex to repress mRNA translation and/or degrade mRNA transcripts. Mass spectrometry-based proteomics has enabled identification of several core components of the canonical miRNA processing pathway and their posttranslational modifications which are pivotal in miRNA regulatory mechanisms. The use of quantitative proteomic strategies has also emerged as a key technique for experimental identification of miRNA targets by allowing direct determination of proteins whose levels are altered because of translational suppression. This review focuses on the role of proteomics and labeling strategies to understand miRNA biology. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Denef, Vincent; Shah, Manesh B; Verberkmoes, Nathan C
The recent surge in microbial genomic sequencing, combined with the development of high-throughput liquid chromatography-mass-spectrometry-based (LC/LC-MS/MS) proteomics, has raised the question of the extent to which genomic information of one strain or environmental sample can be used to profile proteomes of related strains or samples. Even with decreasing sequencing costs, it remains impractical to obtain genomic sequence for every strain or sample analyzed. Here, we evaluate how shotgun proteomics is affected by amino acid divergence between the sample and the genomic database using a probability-based model and a random mutation simulation model constrained by experimental data. To assess the effectsmore » of nonrandom distribution of mutations, we also evaluated identification levels using in silico peptide data from sequenced isolates with average amino acid identities (AAI) varying between 76 and 98%. We compared the predictions to experimental protein identification levels for a sample that was evaluated using a database that included genomic information for the dominant organism and for a closely related variant (95% AAI). The range of models set the boundaries at which half of the proteins in a proteomic experiment can be identified to be 77-92% AAI between orthologs in the sample and database. Consistent with this prediction, experimental data indicated loss of half the identifiable proteins at 90% AAI. Additional analysis indicated a 6.4% reduction of the initial protein coverage per 1% amino acid divergence and total identification loss at 86% AAI. Consequently, shotgun proteomics is capable of cross-strain identifications but avoids most crossspecies false positives.« less
PARALLEL ASSAY OF OXYGEN EQUILIBRIA OF HEMOGLOBIN
Lilly, Laura E.; Blinebry, Sara K.; Viscardi, Chelsea M.; Perez, Luis; Bonaventura, Joe; McMahon, Tim J.
2013-01-01
Methods to systematically analyze in parallel the function of multiple protein or cell samples in vivo or ex vivo (i.e. functional proteomics) in a controlled gaseous environment have thus far been limited. Here we describe an apparatus and procedure that enables, for the first time, parallel assay of oxygen equilibria in multiple samples. Using this apparatus, numerous simultaneous oxygen equilibrium curves (OECs) can be obtained under truly identical conditions from blood cell samples or purified hemoglobins (Hbs). We suggest that the ability to obtain these parallel datasets under identical conditions can be of immense value, both to biomedical researchers and clinicians who wish to monitor blood health, and to physiologists studying non-human organisms and the effects of climate change on these organisms. Parallel monitoring techniques are essential in order to better understand the functions of critical cellular proteins. The procedure can be applied to human studies, wherein an OEC can be analyzed in light of an individual’s entire genome. Here, we analyzed intraerythrocytic Hb, a protein that operates at the organism’s environmental interface and then comes into close contact with virtually all of the organism’s cells. The apparatus is theoretically scalable, and establishes a functional proteomic screen that can be correlated with genomic information on the same individuals. This new method is expected to accelerate our general understanding of protein function, an increasingly challenging objective as advances in proteomic and genomic throughput outpace the ability to study proteins’ functional properties. PMID:23827235
Schokraie, Elham; Hotz-Wagenblatt, Agnes; Warnken, Uwe; Mali, Brahim; Frohme, Marcus; Förster, Frank; Dandekar, Thomas; Hengherr, Steffen; Schill, Ralph O; Schnölzer, Martina
2010-03-03
Tardigrades are small, multicellular invertebrates which are able to survive times of unfavourable environmental conditions using their well-known capability to undergo cryptobiosis at any stage of their life cycle. Milnesium tardigradum has become a powerful model system for the analysis of cryptobiosis. While some genetic information is already available for Milnesium tardigradum the proteome is still to be discovered. Here we present to the best of our knowledge the first comprehensive study of Milnesium tardigradum on the protein level. To establish a proteome reference map we developed optimized protocols for protein extraction from tardigrades in the active state and for separation of proteins by high resolution two-dimensional gel electrophoresis. Since only limited sequence information of M. tardigradum on the genome and gene expression level is available to date in public databases we initiated in parallel a tardigrade EST sequencing project to allow for protein identification by electrospray ionization tandem mass spectrometry. 271 out of 606 analyzed protein spots could be identified by searching against the publicly available NCBInr database as well as our newly established tardigrade protein database corresponding to 144 unique proteins. Another 150 spots could be identified in the tardigrade clustered EST database corresponding to 36 unique contigs and ESTs. Proteins with annotated function were further categorized in more detail by their molecular function, biological process and cellular component. For the proteins of unknown function more information could be obtained by performing a protein domain annotation analysis. Our results include proteins like protein member of different heat shock protein families and LEA group 3, which might play important roles in surviving extreme conditions. The proteome reference map of Milnesium tardigradum provides the basis for further studies in order to identify and characterize the biochemical mechanisms of tolerance to extreme desiccation. The optimized proteomics workflow will enable application of sensitive quantification techniques to detect differences in protein expression, which are characteristic of the active and anhydrobiotic states of tardigrades.
Schokraie, Elham; Hotz-Wagenblatt, Agnes; Warnken, Uwe; Mali, Brahim; Frohme, Marcus; Förster, Frank; Dandekar, Thomas; Hengherr, Steffen; Schill, Ralph O.; Schnölzer, Martina
2010-01-01
Background Tardigrades are small, multicellular invertebrates which are able to survive times of unfavourable environmental conditions using their well-known capability to undergo cryptobiosis at any stage of their life cycle. Milnesium tardigradum has become a powerful model system for the analysis of cryptobiosis. While some genetic information is already available for Milnesium tardigradum the proteome is still to be discovered. Principal Findings Here we present to the best of our knowledge the first comprehensive study of Milnesium tardigradum on the protein level. To establish a proteome reference map we developed optimized protocols for protein extraction from tardigrades in the active state and for separation of proteins by high resolution two-dimensional gel electrophoresis. Since only limited sequence information of M. tardigradum on the genome and gene expression level is available to date in public databases we initiated in parallel a tardigrade EST sequencing project to allow for protein identification by electrospray ionization tandem mass spectrometry. 271 out of 606 analyzed protein spots could be identified by searching against the publicly available NCBInr database as well as our newly established tardigrade protein database corresponding to 144 unique proteins. Another 150 spots could be identified in the tardigrade clustered EST database corresponding to 36 unique contigs and ESTs. Proteins with annotated function were further categorized in more detail by their molecular function, biological process and cellular component. For the proteins of unknown function more information could be obtained by performing a protein domain annotation analysis. Our results include proteins like protein member of different heat shock protein families and LEA group 3, which might play important roles in surviving extreme conditions. Conclusions The proteome reference map of Milnesium tardigradum provides the basis for further studies in order to identify and characterize the biochemical mechanisms of tolerance to extreme desiccation. The optimized proteomics workflow will enable application of sensitive quantification techniques to detect differences in protein expression, which are characteristic of the active and anhydrobiotic states of tardigrades. PMID:20224743
Computational approaches to protein inference in shotgun proteomics
2012-01-01
Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in a sample has been recognized as an important computational task. This task generally consists of (1) assigning experimental tandem mass spectra to peptides derived from a protein database, and (2) mapping assigned peptides to proteins and quantifying the confidence of identified proteins. Protein identification is fundamentally a statistical inference problem with a number of methods proposed to address its challenges. In this review we categorize current approaches into rule-based, combinatorial optimization and probabilistic inference techniques, and present them using integer programing and Bayesian inference frameworks. We also discuss the main challenges of protein identification and propose potential solutions with the goal of spurring innovative research in this area. PMID:23176300
Mass Defect Labeling of Cysteine for Improving Peptide Assignment in Shotgun Proteomic Analyses
Hernandez, Hilda; Niehauser, Sarah; Boltz, Stacey A.; Gawandi, Vijay; Phillips, Robert S.; Amster, I. Jonathan
2006-01-01
A method for improving the identification of peptides in a shotgun proteome analysis using accurate mass measurement has been developed. The improvement is based upon the derivatization of cysteine residues with a novel reagent, 2,4-dibromo-(2′-iodo)acetanilide. The derivitization changes the mass defect of cysteine-containing proteolytic peptides in a manner that increases their identification specificity. Peptide masses were measured using matrix-assisted laser desorption/ionization Fourier transform ion cyclotron mass spectrometry. Reactions with protein standards show that the derivatization of cysteine is rapid and quantitative, and the data suggest that the derivatized peptides are more easily ionized or detected than unlabeled cysteine-containing peptides. The reagent was tested on a 15N-metabolically labeled proteome from M. maripaludis. Proteins were identified by their accurate mass values and from their nitrogen stoichiometry. A total of 47% of the labeled peptides are identified versus 27% for the unlabeled peptides. This procedure permits the identification of proteins from the M. maripaludis proteome that are not usually observed by the standard protocol and shows that better protein coverage is obtained with this methodology. PMID:16689545
Zhao, Yimeng; Sun, Liangliang; Zhu, Guijie; Dovichi, Norman J
2016-10-07
We used reversed-phase liquid chromatography to separate the yeast proteome into 23 fractions. These fractions were then analyzed using capillary zone electrophoresis (CZE) coupled to a Q-Exactive HF mass spectrometer using an electrokinetically pumped sheath flow interface. The parameters of the mass spectrometer were first optimized for top-down proteomics using a mixture of seven model proteins; we observed that intact protein mode with a trapping pressure of 0.2 and normalized collision energy of 20% produced the highest intact protein signals and most protein identifications. Then, we applied the optimized parameters for analysis of the fractionated yeast proteome. From this, 580 proteoforms and 180 protein groups were identified via database searching of the MS/MS spectra. This number of proteoform identifications is two times larger than that of previous CZE-MS/MS studies. An additional 3,243 protein species were detected based on the parent ion spectra. Post-translational modifications including N-terminal acetylation, signal peptide removal, and oxidation were identified.
Viglio, Simona; Stolk, Jan; Iadarola, Paolo; Giuliano, Serena; Luisetti, Maurizio; Salvini, Roberta; Fumagalli, Marco; Bardoni, Anna
2014-01-22
To improve the knowledge on a variety of severe disorders, research has moved from the analysis of individual proteins to the investigation of all proteins expressed by a tissue/organism. This global proteomic approach could prove very useful: (i) for investigating the biochemical pathways involved in disease; (ii) for generating hypotheses; or (iii) as a tool for the identification of proteins differentially expressed in response to the disease state. Proteomics has not been used yet in the field of respiratory research as extensively as in other fields, only a few reproducible and clinically applicable molecular markers, which can assist in diagnosis, having been currently identified. The continuous advances in both instrumentation and methodology, which enable sensitive and quantitative proteomic analyses in much smaller amounts of biological material than before, will hopefully promote the identification of new candidate biomarkers in this area. The aim of this report is to critically review the application over the decade 2004-2013 of very sophisticated technologies to the study of respiratory disorders. The observed changes in protein expression profiles from tissues/fluids of patients affected by pulmonary disorders opens the route for the identification of novel pathological mediators of these disorders.
Wang, Hongbin; Zhang, Yongqian; Gui, Shuqi; Zhang, Yong; Lu, Fuping; Deng, Yulin
2017-08-15
Comparisons across large numbers of samples are frequently necessary in quantitative proteomics. Many quantitative methods used in proteomics are based on stable isotope labeling, but most of these are only useful for comparing two samples. For up to eight samples, the iTRAQ labeling technique can be used. For greater numbers of samples, the label-free method has been used, but this method was criticized for low reproducibility and accuracy. An ingenious strategy has been introduced, comparing each sample against a 18 O-labeled reference sample that was created by pooling equal amounts of all samples. However, it is necessary to use proportion-known protein mixtures to investigate and evaluate this new strategy. Another problem for comparative proteomics of multiple samples is the poor coincidence and reproducibility in protein identification results across samples. In present study, a method combining 18 O-reference strategy and a quantitation and identification-decoupled strategy was investigated with proportion-known protein mixtures. The results obviously demonstrated that the 18 O-reference strategy had greater accuracy and reliability than other previously used comparison methods based on transferring comparison or label-free strategies. By the decoupling strategy, the quantification data acquired by LC-MS and the identification data acquired by LC-MS/MS are matched and correlated to identify differential expressed proteins, according to retention time and accurate mass. This strategy made protein identification possible for all samples using a single pooled sample, and therefore gave a good reproducibility in protein identification across multiple samples, and allowed for optimizing peptide identification separately so as to identify more proteins. Copyright © 2017 Elsevier B.V. All rights reserved.
Schwarz, Jennifer Jasmin; Wiese, Heike; Tölle, Regine Charlotte; Zarei, Mostafa; Dengjel, Jörn; Warscheid, Bettina; Thedieck, Kathrin
2015-01-01
The serine/threonine kinase mammalian target of rapamycin (mTOR) governs growth, metabolism, and aging in response to insulin and amino acids (aa), and is often activated in metabolic disorders and cancer. Much is known about the regulatory signaling network that encompasses mTOR, but surprisingly few direct mTOR substrates have been established to date. To tackle this gap in our knowledge, we took advantage of a combined quantitative phosphoproteomic and interactomic strategy. We analyzed the insulin- and aa-responsive phosphoproteome upon inhibition of the mTOR complex 1 (mTORC1) component raptor, and investigated in parallel the interactome of endogenous mTOR. By overlaying these two datasets, we identified acinus L as a potential novel mTORC1 target. We confirmed acinus L as a direct mTORC1 substrate by co-immunoprecipitation and MS-enhanced kinase assays. Our study delineates a triple proteomics strategy of combined phosphoproteomics, interactomics, and MS-enhanced kinase assays for the de novo-identification of mTOR network components, and provides a rich source of potential novel mTOR interactors and targets for future investigation. PMID:25907765
Soares, Renata; Franco, Catarina; Pires, Elisabete; Ventosa, Miguel; Palhinhas, Rui; Koci, Kamila; Martinho de Almeida, André; Varela Coelho, Ana
2012-07-19
Proteomic approaches are gaining increasing importance in the context of all fields of animal and veterinary sciences, including physiology, productive characterization, and disease/parasite tolerance, among others. Proteomic studies mainly aim the proteome characterization of a certain organ, tissue, cell type or organism, either in a specific condition or comparing protein differential expression within two or more selected situations. Due to the high complexity of samples, usually total protein extracts, proteomics relies heavily on separation procedures, being 2D-electrophoresis and HPLC the most common, as well as on protein identification using mass spectrometry (MS) based methodologies. Despite the increasing importance of MS in the context of animal and veterinary science studies, the usefulness of such tools is still poorly perceived by the animal science community. This is primarily due to the limited knowledge on mass spectrometry by animal scientists. Additionally, confidence and success in protein identification is hindered by the lack of information in public databases for most of farm animal species and their pathogens, with the exception of cattle (Bos taurus), pig (Sus scrofa) and chicken (Gallus gallus). In this article, we will briefly summarize the main methodologies available for protein identification using mass spectrometry providing a case study of specific applications in the field of animal science. We will also address the difficulties inherent to protein identification using MS, with particular reference to experiments using animal species poorly described in public databases. Additionally, we will suggest strategies to increase the rate of successful identifications when working with farm animal species. Copyright © 2012 Elsevier B.V. All rights reserved.
Top-down Proteomics: Technology Advancements and Applications to Heart Diseases
Cai, Wenxuan; Tucholski, Trisha M.; Gregorich, Zachery R.; Ge, Ying
2016-01-01
Introduction Diseases of the heart are a leading cause of morbidity and mortality for both men and women worldwide, and impose significant economic burdens on the healthcare systems. Despite substantial effort over the last several decades, the molecular mechanisms underlying diseases of the heart remain poorly understood. Areas covered Altered protein post-translational modifications (PTMs) and protein isoform switching are increasingly recognized as important disease mechanisms. Top-down high-resolution mass spectrometry (MS)-based proteomics has emerged as the most powerful method for the comprehensive analysis of PTMs and protein isoforms. Here, we will review recent technology developments in the field of top-down proteomics, as well as highlight recent studies utilizing top-down proteomics to decipher the cardiac proteome for the understanding of the molecular mechanisms underlying diseases of the heart. Expert commentary Top-down proteomics is a premier method for the global and comprehensive study of protein isoforms and their PTMs, enabling the identification of novel protein isoforms and PTMs, characterization of sequence variations, and quantification of disease-associated alterations. Despite significant challenges, continuous development of top-down proteomics technology will greatly aid the dissection of the molecular mechanisms underlying diseases of the hearts for the identification of novel biomarkers and therapeutic targets. PMID:27448560
Proteogenomics Dashboard for the Human Proteome Project.
Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto
2015-09-04
dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analog genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell lines experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modifications information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.
Global analysis of Brucella melitensis proteomes.
Mujer, Cesar V; Wagner, Mary Ann; Eschenbrenner, Michel; Horn, Troy; Kraycer, Jo Ann; Redkar, Rajendra; Hagius, Sue; Elzer, Philip; Delvecchio, Vito G
2002-10-01
Brucella melitensis is a facultative, intracellular, gram-negative cocco-bacillus that causes Malta fever in humans and brucellosis in animals. There are at least six species in the genus, and the disease is classified as zoonotic because several species infect humans. Using 2-D gel electrophoresis and mass spectrometry, we have initiated (i) a comprehensive mapping and identification of all the expressed proteins of B. melitensis virulent strain 16M, and (ii) a comparative study of its proteome with the attentuated vaccinal strain Rev 1. Comprehensive proteome maps of all six Brucella species will be generated in order to obtain vital information for vaccine development, identification of pathogenicity islands, and establishment of host specificity and evolutionary relatedness.
NOVEL METHODS FOR TARGET PROTEIN IDENTIFICATION USING IMMUNOPRECIPITATION - LC/MS/MS
Proteomics provides a powerful approach to screen and analyze responses to environmental exposures which induce alterations in protein expression, phosphorylation. ubiquitinylation, oxidation. and modulation of general proteome function. Post-translational modifications (PTM) of ...
Comparative proteomics lends insight into genotype-specific pathogenicity.
Guarnieri, Michael T
2013-09-01
Comparative proteomic analyses have emerged as a powerful tool for the identification of unique biomarkers and mechanisms of pathogenesis. In this issue of Proteomics, Murugaiyan et al. utilize difference gel electrophoresis (DIGE) to examine differential protein expression between nonpathogenic and pathogenic genotypes of Prototheca zopfii, a causative agent in bovine enteritis and mastitis. Their findings provide insights into molecular mechanisms of infection and evolutionary adaptation of pathogenic genotypes, demonstrating the power of comparative proteomic analyses. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomics Analysis of Bladder Cancer Exosomes*
Welton, Joanne L.; Khanna, Sanjay; Giles, Peter J.; Brennan, Paul; Brewis, Ian A.; Staffurth, John; Mason, Malcolm D.; Clayton, Aled
2010-01-01
Exosomes are nanometer-sized vesicles, secreted by various cell types, present in biological fluids that are particularly rich in membrane proteins. Ex vivo analysis of exosomes may provide biomarker discovery platforms and form non-invasive tools for disease diagnosis and monitoring. These vesicles have never before been studied in the context of bladder cancer, a major malignancy of the urological tract. We present the first proteomics analysis of bladder cancer cell exosomes. Using ultracentrifugation on a sucrose cushion, exosomes were highly purified from cultured HT1376 bladder cancer cells and verified as low in contaminants by Western blotting and flow cytometry of exosome-coated beads. Solubilization in a buffer containing SDS and DTT was essential for achieving proteomics analysis using an LC-MALDI-TOF/TOF MS approach. We report 353 high quality identifications with 72 proteins not previously identified by other human exosome proteomics studies. Overrepresentation analysis to compare this data set with previous exosome proteomics studies (using the ExoCarta database) revealed that the proteome was consistent with that of various exosomes with particular overlap with exosomes of carcinoma origin. Interrogating the Gene Ontology database highlighted a strong association of this proteome with carcinoma of bladder and other sites. The data also highlighted how homology among human leukocyte antigen haplotypes may confound MASCOT designation of major histocompatability complex Class I nomenclature, requiring data from PCR-based human leukocyte antigen haplotyping to clarify anomalous identifications. Validation of 18 MS protein identifications (including basigin, galectin-3, trophoblast glycoprotein (5T4), and others) was performed by a combination of Western blotting, flotation on linear sucrose gradients, and flow cytometry, confirming their exosomal expression. Some were confirmed positive on urinary exosomes from a bladder cancer patient. In summary, the exosome proteomics data set presented is of unrivaled quality. The data will aid in the development of urine exosome-based clinical tools for monitoring disease and will inform follow-up studies into varied aspects of exosome manufacture and function. PMID:20224111
Controlled vocabularies and ontologies in proteomics: Overview, principles and practice☆
Mayer, Gerhard; Jones, Andrew R.; Binz, Pierre-Alain; Deutsch, Eric W.; Orchard, Sandra; Montecchi-Palazzi, Luisa; Vizcaíno, Juan Antonio; Hermjakob, Henning; Oveillero, David; Julian, Randall; Stephan, Christian; Meyer, Helmut E.; Eisenacher, Martin
2014-01-01
This paper focuses on the use of controlled vocabularies (CVs) and ontologies especially in the area of proteomics, primarily related to the work of the Proteomics Standards Initiative (PSI). It describes the relevant proteomics standard formats and the ontologies used within them. Software and tools for working with these ontology files are also discussed. The article also examines the “mapping files” used to ensure correct controlled vocabulary terms that are placed within PSI standards and the fulfillment of the MIAPE (Minimum Information about a Proteomics Experiment) requirements. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23429179
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.
Savitski, Mikhail M; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus
2015-09-01
Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets
Savitski, Mikhail M.; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus
2015-01-01
Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target–decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target–decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The “picked” protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The “picked” target–decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used “classic” protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. PMID:25987413
PIQMIe: a web server for semi-quantitative proteomics data management and analysis
Kuzniar, Arnold; Kanaar, Roland
2014-01-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. PMID:24861615
PIQMIe: a web server for semi-quantitative proteomics data management and analysis.
Kuzniar, Arnold; Kanaar, Roland
2014-07-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Xue, Lu; Lin, Lin; Zhou, Wenbin; Chen, Wendong; Tang, Jun; Sun, Xiujie; Huang, Peiwu; Tian, Ruijun
2018-06-09
Plasma proteome profiling by LC-MS based proteomics has drawn great attention recently for biomarker discovery from blood liquid biopsy. Due to standard multi-step sample preparation could potentially cause plasma protein degradation and analysis variation, integrated proteomics sample preparation technologies became promising solution towards this end. Here, we developed a fully integrated proteomics sample preparation technology for both fast and deep plasma proteome profiling under its native pH. All the sample preparation steps, including protein digestion and two-dimensional fractionation by both mixed-mode ion exchange and high-pH reversed phase mechanism were integrated into one spintip device for the first time. The mixed-mode ion exchange beads design achieved the sample loading at neutral pH and protein digestion within 30 min. Potential sample loss and protein degradation by pH changing could be voided. 1 μL of plasma sample with depletion of high abundant proteins was processed by the developed technology with 12 equally distributed fractions and analyzed with 12 h of LC-MS gradient time, resulting in the identification of 862 proteins. The combination of the Mixed-mode-SISPROT and data-independent MS method achieved fast plasma proteome profiling in 2 h with high identification overlap and quantification precision for a proof-of-concept study of plasma samples from 5 healthy donors. We expect that the Mixed-mode-SISPROT become a generally applicable sample preparation technology for clinical oriented plasma proteome profiling. Copyright © 2018 Elsevier B.V. All rights reserved.
Zhou, Hu; Wang, Fangjun; Wang, Yuwei; Ning, Zhibin; Hou, Weimin; Wright, Theodore G.; Sundaram, Meenakshi; Zhong, Shumei; Yao, Zemin; Figeys, Daniel
2011-01-01
Despite their importance in many biological processes, membrane proteins are underrepresented in proteomic analysis because of their poor solubility (hydrophobicity) and often low abundance. We describe a novel approach for the identification of plasma membrane proteins and intracellular microsomal proteins that combines membrane fractionation, a centrifugal proteomic reactor for streamlined protein extraction, protein digestion and fractionation by centrifugation, and high performance liquid chromatography-electrospray ionization-tandem MS. The performance of this approach was illustrated for the study of the proteome of ER and Golgi microsomal membranes in rat hepatic cells. The centrifugal proteomic reactor identified 945 plasma membrane proteins and 955 microsomal membrane proteins, of which 63 and 47% were predicted as bona fide membrane proteins, respectively. Among these proteins, >800 proteins were undetectable by the conventional in-gel digestion approach. The majority of the membrane proteins only identified by the centrifugal proteomic reactor were proteins with ≥2 transmembrane segments or proteins with high molecular mass (e.g. >150 kDa) and hydrophobicity. The improved proteomic reactor allowed the detection of a group of endocytic and/or signaling receptor proteins on the plasma membrane, as well as apolipoproteins and glycerolipid synthesis enzymes that play a role in the assembly and secretion of apolipoprotein B100-containing very low density lipoproteins. Thus, the centrifugal proteomic reactor offers a new analytical tool for structure and function studies of membrane proteins involved in lipid and lipoprotein metabolism. PMID:21749988
Beer, Lynn A; Wang, Huan; Tang, Hsin-Yao; Cao, Zhijun; Chang-Wong, Tony; Tanyi, Janos L; Zhang, Rugang; Liu, Qin; Speicher, David W
2013-01-01
The most cancer-specific biomarkers in blood are likely to be proteins shed directly by the tumor rather than less specific inflammatory or other host responses. The use of xenograft mouse models together with in-depth proteome analysis for identification of human proteins in the mouse blood is an under-utilized strategy that can clearly identify proteins shed by the tumor. In the current study, 268 human proteins shed into mouse blood from human OVCAR-3 serous tumors were identified based upon human vs. mouse species differences using a four-dimensional plasma proteome fractionation strategy. A multi-step prioritization and verification strategy was subsequently developed to efficiently select some of the most promising biomarkers from this large number of candidates. A key step was parallel analysis of human proteins detected in the tumor supernatant, because substantially greater sequence coverage for many of the human proteins initially detected in the xenograft mouse plasma confirmed assignments as tumor-derived human proteins. Verification of candidate biomarkers in patient sera was facilitated by in-depth, label-free quantitative comparisons of serum pools from patients with ovarian cancer and benign ovarian tumors. The only proteins that advanced to multiple reaction monitoring (MRM) assay development were those that exhibited increases in ovarian cancer patients compared with benign tumor controls. MRM assays were facilely developed for all 11 novel biomarker candidates selected by this process and analysis of larger pools of patient sera suggested that all 11 proteins are promising candidate biomarkers that should be further evaluated on individual patient blood samples.
Polyphemus, Odysseus and the ovine milk proteome.
Cunsolo, Vincenzo; Fasoli, Elisa; Di Francesco, Antonella; Saletti, Rosaria; Muccilli, Vera; Gallina, Serafina; Righetti, Pier Giorgio; Foti, Salvatore
2017-01-30
In the last years the amount of ovine milk production, mainly used to formulate a wide range of different and exclusive dairy products often categorized as gourmet food, has been progressively increasing. Taking also into account that sheep milk (SM) also appears to be potentially less allergenic than cow's one, an in-depth information about its protein composition is essential to improve the comprehension of its potential benefits for human consumption. The present work reports the results of an in-depth characterization of SM whey proteome, carried out by coupling the CPLL technology with SDS-PAGE and high resolution UPLC-nESI MS/MS analysis. This approach allowed the identification of 718 different protein components, 644 of which are from unique genes. Particularly, this identification has expanded literature data about sheep whey proteome by 193 novel proteins previously undetected, many of which are involved in the defence/immunity mechanisms or in the nutrient delivery system. A comparative analysis of SM proteome known to date with cow's milk proteome, evidenced that while about 29% of SM proteins are also present in CM, 71% of the identified components appear to be unique of SM proteome and include a heterogeneous group of components which seem to have health-promoting benefits. The data have been deposited to the ProteomeXchange with identifier
Thiele, Herbert; Glandorf, Jörg; Hufnagel, Peter
2010-05-27
With the large variety of Proteomics workflows, as well as the large variety of instruments and data-analysis software available, researchers today face major challenges validating and comparing their Proteomics data. Here we present a new generation of the ProteinScape bioinformatics platform, now enabling researchers to manage Proteomics data from the generation and data warehousing to a central data repository with a strong focus on the improved accuracy, reproducibility and comparability demanded by many researchers in the field. It addresses scientists; current needs in proteomics identification, quantification and validation. But producing large protein lists is not the end point in Proteomics, where one ultimately aims to answer specific questions about the biological condition or disease model of the analyzed sample. In this context, a new tool has been developed at the Spanish Centro Nacional de Biotecnologia Proteomics Facility termed PIKE (Protein information and Knowledge Extractor) that allows researchers to control, filter and access specific information from genomics and proteomic databases, to understand the role and relationships of the proteins identified in the experiments. Additionally, an EU funded project, ProDac, has coordinated systematic data collection in public standards-compliant repositories like PRIDE. This will cover all aspects from generating MS data in the laboratory, assembling the whole annotation information and storing it together with identifications in a standardised format.
Tabb, David L.; Vega-Montoto, Lorenzo; Rudnick, Paul A.; Variyath, Asokan Mulayath; Ham, Amy-Joan L.; Bunk, David M.; Kilpatrick, Lisa E.; Billheimer, Dean D.; Blackman, Ronald K.; Cardasis, Helene L.; Carr, Steven A.; Clauser, Karl R.; Jaffe, Jacob D.; Kowalski, Kevin A.; Neubert, Thomas A.; Regnier, Fred E.; Schilling, Birgit; Tegeler, Tony J.; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Fisher, Susan J.; Gibson, Bradford W.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Steven E.; Tempst, Paul; Paulovich, Amanda G.; Liebler, Daniel C.; Spiegelman, Cliff
2009-01-01
The complexity of proteomic instrumentation for LC-MS/MS introduces many possible sources of variability. Data-dependent sampling of peptides constitutes a stochastic element at the heart of discovery proteomics. Although this variation impacts the identification of peptides, proteomic identifications are far from completely random. In this study, we analyzed interlaboratory data sets from the NCI Clinical Proteomic Technology Assessment for Cancer to examine repeatability and reproducibility in peptide and protein identifications. Included data spanned 144 LC-MS/MS experiments on four Thermo LTQ and four Orbitrap instruments. Samples included yeast lysate, the NCI-20 defined dynamic range protein mix, and the Sigma UPS 1 defined equimolar protein mix. Some of our findings reinforced conventional wisdom, such as repeatability and reproducibility being higher for proteins than for peptides. Most lessons from the data, however, were more subtle. Orbitraps proved capable of higher repeatability and reproducibility, but aberrant performance occasionally erased these gains. Even the simplest protein digestions yielded more peptide ions than LC-MS/MS could identify during a single experiment. We observed that peptide lists from pairs of technical replicates overlapped by 35–60%, giving a range for peptide-level repeatability in these experiments. Sample complexity did not appear to affect peptide identification repeatability, even as numbers of identified spectra changed by an order of magnitude. Statistical analysis of protein spectral counts revealed greater stability across technical replicates for Orbitraps, making them superior to LTQ instruments for biomarker candidate discovery. The most repeatable peptides were those corresponding to conventional tryptic cleavage sites, those that produced intense MS signals, and those that resulted from proteins generating many distinct peptides. Reproducibility among different instruments of the same type lagged behind repeatability of technical replicates on a single instrument by several percent. These findings reinforce the importance of evaluating repeatability as a fundamental characteristic of analytical technologies. PMID:19921851
Cheng, Keding; Chui, Huixia; Domish, Larissa; Hernandez, Drexler; Wang, Gehua
2016-04-01
Identification and typing of bacteria occupy a large fraction of time and work in clinical microbiology laboratories. With the certification of some MS platforms in recent years, more applications and tests of MS-based diagnosis methods for bacteria identification and typing have been created, not only on well-accepted MALDI-TOF-MS-based fingerprint matches, but also on solving the insufficiencies of MALDI-TOF-MS-based platforms and advancing the technology to areas such as targeted MS identification and typing of bacteria, bacterial toxin identification, antibiotics susceptibility/resistance tests, and MS-based diagnostic method development on unique bacteria such as Clostridium and Mycobacteria. This review summarizes the recent development in MS platforms and applications in bacteria identification and typing of common pathogenic bacteria. © 2016 The Authors. PROTEOMICS - Clinical Applications Published by WILEY-VCH Verlag GmbH & Co. KGaA.
Highly Efficient Proteolysis Accelerated by Electromagnetic Waves for Peptide Mapping
Chen, Qiwen; Liu, Ting; Chen, Gang
2011-01-01
Proteomics will contribute greatly to the understanding of gene functions in the post-genomic era. In proteome research, protein digestion is a key procedure prior to mass spectrometry identification. During the past decade, a variety of electromagnetic waves have been employed to accelerate proteolysis. This review focuses on the recent advances and the key strategies of these novel proteolysis approaches for digesting and identifying proteins. The subjects covered include microwave-accelerated protein digestion, infrared-assisted proteolysis, ultraviolet-enhanced protein digestion, laser-assisted proteolysis, and future prospects. It is expected that these novel proteolysis strategies accelerated by various electromagnetic waves will become powerful tools in proteome research and will find wide applications in high throughput protein digestion and identification. PMID:22379392
Using the Proteomics Identifications Database (PRIDE).
Martens, Lennart; Jones, Phil; Côté, Richard
2008-03-01
The Proteomics Identifications Database (PRIDE) is a public data repository designed to store, disseminate, and analyze mass spectrometry based proteomics datasets. The PRIDE database can accommodate any level of detailed metadata about the submitted results, which can be queried, explored, viewed, or downloaded via the PRIDE Web interface. The PRIDE database also provides a simple, yet powerful, access control mechanism that fully supports confidential peer-reviewing of data related to a manuscript, ensuring that these results remain invisible to the general public while allowing referees and journal editors anonymized access to the data. This unit describes in detail the functionality that PRIDE provides with regards to searching, viewing, and comparing the available data, as well as different options for submitting data to PRIDE.
Proteomics in the investigation of HIV-1 interactions with host proteins.
Li, Ming
2015-02-01
Productive HIV-1 infection depends on host machinery, including a broad array of cellular proteins. Proteomics has played a significant role in the discovery of HIV-1 host proteins. In this review, after a brief survey of the HIV-1 host proteins that were discovered by proteomic analyses, I focus on analyzing the interactions between the virion and host proteins, as well as the technologies and strategies used in those proteomic studies. With the help of proteomics, the identification and characterization of HIV-1 host proteins can be translated into novel antiretroviral therapeutics. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Colaert, Niklaas; Barsnes, Harald; Vaudel, Marc; Helsens, Kenny; Timmerman, Evy; Sickmann, Albert; Gevaert, Kris; Martens, Lennart
2011-08-05
The Thermo Proteome Discoverer program integrates both peptide identification and quantification into a single workflow for peptide-centric proteomics. Furthermore, its close integration with Thermo mass spectrometers has made it increasingly popular in the field. Here, we present a Java library to parse the msf files that constitute the output of Proteome Discoverer. The parser is also implemented as a graphical user interface allowing convenient access to the information found in the msf files, and in Rover, a program to analyze and validate quantitative proteomics information. All code, binaries, and documentation is freely available at http://thermo-msf-parser.googlecode.com.
Owens, Rebecca A.; Hammel, Stephen; Sheridan, Kevin J.; Jones, Gary W.; Doyle, Sean
2014-01-01
A combined proteomics and metabolomics approach was utilised to advance the identification and characterisation of secondary metabolites in Aspergillus fumigatus. Here, implementation of a shotgun proteomic strategy led to the identification of non-redundant mycelial proteins (n = 414) from A. fumigatus including proteins typically under-represented in 2-D proteome maps: proteins with multiple transmembrane regions, hydrophobic proteins and proteins with extremes of molecular mass and pI. Indirect identification of secondary metabolite cluster expression was also achieved, with proteins (n = 18) from LaeA-regulated clusters detected, including GliT encoded within the gliotoxin biosynthetic cluster. Biochemical analysis then revealed that gliotoxin significantly attenuates H2O2-induced oxidative stress in A. fumigatus (p>0.0001), confirming observations from proteomics data. A complementary 2-D/LC-MS/MS approach further elucidated significantly increased abundance (p<0.05) of proliferating cell nuclear antigen (PCNA), NADH-quinone oxidoreductase and the gliotoxin oxidoreductase GliT, along with significantly attenuated abundance (p<0.05) of a heat shock protein, an oxidative stress protein and an autolysis-associated chitinase, when gliotoxin and H2O2 were present, compared to H2O2 alone. Moreover, gliotoxin exposure significantly reduced the abundance of selected proteins (p<0.05) involved in de novo purine biosynthesis. Significantly elevated abundance (p<0.05) of a key enzyme, xanthine-guanine phosphoribosyl transferase Xpt1, utilised in purine salvage, was observed in the presence of H2O2 and gliotoxin. This work provides new insights into the A. fumigatus proteome and experimental strategies, plus mechanistic data pertaining to gliotoxin functionality in the organism. PMID:25198175
Fagerquist, Clifton K; Zaragoza, William J; Sultan, Omar; Woo, Nathan; Quiñones, Beatriz; Cooley, Michael B; Mandrell, Robert E
2014-05-01
We have analyzed 26 Shiga toxin-producing Escherichia coli (STEC) strains for Shiga toxin 2 (Stx2) production using matrix-assisted laser desorption ionization (MALDI)-tandem time of flight (TOF-TOF) tandem mass spectrometry (MS/MS) and top-down proteomic analysis. STEC strains were induced to overexpress Stx2 by overnight culturing on solid agar supplemented with either ciprofloxacin or mitomycin C. Harvested cells were lysed by bead beating, and unfractionated bacterial cell lysates were ionized by MALDI. The A2 fragment of the A subunit and the mature B subunit of Stx2 were analyzed by MS/MS. Sequence-specific fragment ions were used to identify amino acid subtypes of Stx2 using top-down proteomic analysis using software developed in-house at the U.S. Department of Agriculture (USDA). Stx2 subtypes (a, c, d, f, and g) were identified on the basis of the mass of the A2 fragment and the B subunit as well as from their sequence-specific fragment ions by MS/MS (postsource decay). Top-down proteomic identification was in agreement with DNA sequencing of the full Stx2 operon (stx2) for all strains. Top-down results were also compared to a bioassay using a Vero-d2EGFP cell line. Our results suggest that top-down proteomic identification is a rapid, highly specific technique for distinguishing Stx2 subtypes.
Gao, Jing; Zhong, Shaoyun; Zhou, Yanting; He, Han; Peng, Shuying; Zhu, Zhenyun; Liu, Xing; Zheng, Jing; Xu, Bin; Zhou, Hu
2017-06-06
Detergents and salts are widely used in lysis buffers to enhance protein extraction from biological samples, facilitating in-depth proteomic analysis. However, these detergents and salt additives must be efficiently removed from the digested samples prior to LC-MS/MS analysis to obtain high-quality mass spectra. Although filter-aided sample preparation (FASP), acetone precipitation (AP), followed by in-solution digestion, and strong cation exchange-based centrifugal proteomic reactors (CPRs) are commonly used for proteomic sample processing, little is known about their efficiencies at removing detergents and salt additives. In this study, we (i) developed an integrative workflow for the quantification of small molecular additives in proteomic samples, developing a multiple reaction monitoring (MRM)-based LC-MS approach for the quantification of six additives (i.e., Tris, urea, CHAPS, SDS, SDC, and Triton X-100) and (ii) systematically evaluated the relationships between the level of additive remaining in samples following sample processing and the number of peptides/proteins identified by mass spectrometry. Although FASP outperformed the other two methods, the results were complementary in terms of peptide/protein identification, as well as the GRAVY index and amino acid distributions. This is the first systematic and quantitative study of the effect of detergents and salt additives on protein identification. This MRM-based approach can be used for an unbiased evaluation of the performance of new sample preparation methods. Data are available via ProteomeXchange under identifier PXD005405.
Zaragoza, William J.; Sultan, Omar; Woo, Nathan; Quiñones, Beatriz; Cooley, Michael B.; Mandrell, Robert E.
2014-01-01
We have analyzed 26 Shiga toxin-producing Escherichia coli (STEC) strains for Shiga toxin 2 (Stx2) production using matrix-assisted laser desorption ionization (MALDI)–tandem time of flight (TOF-TOF) tandem mass spectrometry (MS/MS) and top-down proteomic analysis. STEC strains were induced to overexpress Stx2 by overnight culturing on solid agar supplemented with either ciprofloxacin or mitomycin C. Harvested cells were lysed by bead beating, and unfractionated bacterial cell lysates were ionized by MALDI. The A2 fragment of the A subunit and the mature B subunit of Stx2 were analyzed by MS/MS. Sequence-specific fragment ions were used to identify amino acid subtypes of Stx2 using top-down proteomic analysis using software developed in-house at the U.S. Department of Agriculture (USDA). Stx2 subtypes (a, c, d, f, and g) were identified on the basis of the mass of the A2 fragment and the B subunit as well as from their sequence-specific fragment ions by MS/MS (postsource decay). Top-down proteomic identification was in agreement with DNA sequencing of the full Stx2 operon (stx2) for all strains. Top-down results were also compared to a bioassay using a Vero-d2EGFP cell line. Our results suggest that top-down proteomic identification is a rapid, highly specific technique for distinguishing Stx2 subtypes. PMID:24584253
Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus
2014-01-01
The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Albeit quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868
George, Iniga S; Fennell, Anne Y; Haynes, Paul A
2015-09-01
Protein sample preparation optimisation is critical for establishing reproducible high throughput proteomic analysis. In this study, two different fractionation sample preparation techniques (in-gel digestion and in-solution digestion) for shotgun proteomics were used to quantitatively compare proteins identified in Vitis riparia leaf samples. The total number of proteins and peptides identified were compared between filter aided sample preparation (FASP) coupled with gas phase fractionation (GPF) and SDS-PAGE methods. There was a 24% increase in the total number of reproducibly identified proteins when FASP-GPF was used. FASP-GPF is more reproducible, less expensive and a better method than SDS-PAGE for shotgun proteomics of grapevine samples as it significantly increases protein identification across biological replicates. Total peptide and protein information from the two fractionation techniques is available in PRIDE with the identifier PXD001399 (http://proteomecentral.proteomexchange.org/dataset/PXD001399). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A multi-center study benchmarks software tools for label-free proteome quantification
Gillet, Ludovic C; Bernhardt, Oliver M.; MacLean, Brendan; Röst, Hannes L.; Tate, Stephen A.; Tsou, Chih-Chiang; Reiter, Lukas; Distler, Ute; Rosenberger, George; Perez-Riverol, Yasset; Nesvizhskii, Alexey I.; Aebersold, Ruedi; Tenzer, Stefan
2016-01-01
The consistent and accurate quantification of proteins by mass spectrometry (MS)-based proteomics depends on the performance of instruments, acquisition methods and data analysis software. In collaboration with the software developers, we evaluated OpenSWATH, SWATH2.0, Skyline, Spectronaut and DIA-Umpire, five of the most widely used software methods for processing data from SWATH-MS (sequential window acquisition of all theoretical fragment ion spectra), a method that uses data-independent acquisition (DIA) for label-free protein quantification. We analyzed high-complexity test datasets from hybrid proteome samples of defined quantitative composition acquired on two different MS instruments using different SWATH isolation windows setups. For consistent evaluation we developed LFQbench, an R-package to calculate metrics of precision and accuracy in label-free quantitative MS, and report the identification performance, robustness and specificity of each software tool. Our reference datasets enabled developers to improve their software tools. After optimization, all tools provided highly convergent identification and reliable quantification performance, underscoring their robustness for label-free quantitative proteomics. PMID:27701404
A multicenter study benchmarks software tools for label-free proteome quantification.
Navarro, Pedro; Kuharev, Jörg; Gillet, Ludovic C; Bernhardt, Oliver M; MacLean, Brendan; Röst, Hannes L; Tate, Stephen A; Tsou, Chih-Chiang; Reiter, Lukas; Distler, Ute; Rosenberger, George; Perez-Riverol, Yasset; Nesvizhskii, Alexey I; Aebersold, Ruedi; Tenzer, Stefan
2016-11-01
Consistent and accurate quantification of proteins by mass spectrometry (MS)-based proteomics depends on the performance of instruments, acquisition methods and data analysis software. In collaboration with the software developers, we evaluated OpenSWATH, SWATH 2.0, Skyline, Spectronaut and DIA-Umpire, five of the most widely used software methods for processing data from sequential window acquisition of all theoretical fragment-ion spectra (SWATH)-MS, which uses data-independent acquisition (DIA) for label-free protein quantification. We analyzed high-complexity test data sets from hybrid proteome samples of defined quantitative composition acquired on two different MS instruments using different SWATH isolation-window setups. For consistent evaluation, we developed LFQbench, an R package, to calculate metrics of precision and accuracy in label-free quantitative MS and report the identification performance, robustness and specificity of each software tool. Our reference data sets enabled developers to improve their software tools. After optimization, all tools provided highly convergent identification and reliable quantification performance, underscoring their robustness for label-free quantitative proteomics.
Automation, parallelism, and robotics for proteomics.
Alterovitz, Gil; Liu, Jonathan; Chow, Jijun; Ramoni, Marco F
2006-07-01
The speed of the human genome project (Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C. et al., Nature 2001, 409, 860-921) was made possible, in part, by developments in automation of sequencing technologies. Before these technologies, sequencing was a laborious, expensive, and personnel-intensive task. Similarly, automation and robotics are changing the field of proteomics today. Proteomics is defined as the effort to understand and characterize proteins in the categories of structure, function and interaction (Englbrecht, C. C., Facius, A., Comb. Chem. High Throughput Screen. 2005, 8, 705-715). As such, this field nicely lends itself to automation technologies since these methods often require large economies of scale in order to achieve cost and time-saving benefits. This article describes some of the technologies and methods being applied in proteomics in order to facilitate automation within the field as well as in linking proteomics-based information with other related research areas.
Performance Evaluation of the Q Exactive HF-X for Shotgun Proteomics.
Kelstrup, Christian D; Bekker-Jensen, Dorte B; Arrey, Tabiwang N; Hogrebe, Alexander; Harder, Alexander; Olsen, Jesper V
2018-01-05
Progress in proteomics is mainly driven by advances in mass spectrometric (MS) technologies. Here we benchmarked the performance of the latest MS instrument in the benchtop Orbitrap series, the Q Exactive HF-X, against its predecessor for proteomics applications. A new peak-picking algorithm, a brighter ion source, and optimized ion transfers enable productive MS/MS acquisition above 40 Hz at 7500 resolution. The hardware and software improvements collectively resulted in improved peptide and protein identifications across all comparable conditions, with an increase of up to 50 percent at short LC-MS gradients, yielding identification rates of more than 1000 unique peptides per minute. Alternatively, the Q Exactive HF-X is capable of achieving the same proteome coverage as its predecessor in approximately half the gradient time or at 10-fold lower sample loads. The Q Exactive HF-X also enables rapid phosphoproteomics with routine analysis of more than 5000 phosphopeptides with short single-shot 15 min LC-MS/MS measurements, or 16 700 phosphopeptides quantified across ten conditions in six gradient hours using TMT10-plex and offline peptide fractionation. Finally, exciting perspectives for data-independent acquisition are highlighted with reproducible identification of 55 000 unique peptides covering 5900 proteins in half an hour of MS analysis.
Toward an Upgraded Honey Bee (Apis mellifera L.) Genome Annotation Using Proteogenomics.
McAfee, Alison; Harpur, Brock A; Michaud, Sarah; Beavis, Ronald C; Kent, Clement F; Zayed, Amro; Foster, Leonard J
2016-02-05
The honey bee is a key pollinator in agricultural operations as well as a model organism for studying the genetics and evolution of social behavior. The Apis mellifera genome has been sequenced and annotated twice over, enabling proteomics and functional genomics methods for probing relevant aspects of their biology. One troubling trend that emerged from proteomic analyses is that honey bee peptide samples consistently result in lower peptide identification rates compared with other organisms. This suggests that the genome annotation can be improved, or atypical biological processes are interfering with the mass spectrometry workflow. First, we tested whether high levels of polymorphisms could explain some of the missed identifications by searching spectra against the reference proteome (OGSv3.2) versus a customized proteome of a single honey bee, but our results indicate that this contribution was minor. Likewise, error-tolerant peptide searches lead us to eliminate unexpected post-translational modifications as a major factor in missed identifications. We then used a proteogenomic approach with ~1500 raw files to search for missing genes and new exons, to revive discarded annotations and to identify over 2000 new coding regions. These results will contribute to a more comprehensive genome annotation and facilitate continued research on this important insect.
Absolute Quantification of Middle- to High-Abundant Plasma Proteins via Targeted Proteomics.
Dittrich, Julia; Ceglarek, Uta
2017-01-01
The increasing number of peptide and protein biomarker candidates requires expeditious and reliable quantification strategies. The utilization of liquid chromatography coupled to quadrupole tandem mass spectrometry (LC-MS/MS) for the absolute quantitation of plasma proteins and peptides facilitates the multiplexed verification of tens to hundreds of biomarkers from smallest sample quantities. Targeted proteomics assays derived from bottom-up proteomics principles rely on the identification and analysis of proteotypic peptides formed in an enzymatic digestion of the target protein. This protocol proposes a procedure for the establishment of a targeted absolute quantitation method for middle- to high-abundant plasma proteins waiving depletion or enrichment steps. Essential topics as proteotypic peptide identification and LC-MS/MS method development as well as sample preparation and calibration strategies are described in detail.
Quantitative trait loci mapping of the mouse plasma proteome (pQTL).
Holdt, Lesca M; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel
2013-02-01
A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F(2) intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins.
Quantitative Trait Loci Mapping of the Mouse Plasma Proteome (pQTL)
Holdt, Lesca M.; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel
2013-01-01
A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F2 intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins. PMID:23172855
Evaluation of Proteomic Search Engines for the Analysis of Histone Modifications
2015-01-01
Identification of histone post-translational modifications (PTMs) is challenging for proteomics search engines. Including many histone PTMs in one search increases the number of candidate peptides dramatically, leading to low search speed and fewer identified spectra. To evaluate database search engines on identifying histone PTMs, we present a method in which one kind of modification is searched each time, for example, unmodified, individually modified, and multimodified, each search result is filtered with false discovery rate less than 1%, and the identifications of multiple search engines are combined to obtain confident results. We apply this method for eight search engines on histone data sets. We find that two search engines, pFind and Mascot, identify most of the confident results at a reasonable speed, so we recommend using them to identify histone modifications. During the evaluation, we also find some important aspects for the analysis of histone modifications. Our evaluation of different search engines on identifying histone modifications will hopefully help those who are hoping to enter the histone proteomics field. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD001118. PMID:25167464
Evaluation of proteomic search engines for the analysis of histone modifications.
Yuan, Zuo-Fei; Lin, Shu; Molden, Rosalynn C; Garcia, Benjamin A
2014-10-03
Identification of histone post-translational modifications (PTMs) is challenging for proteomics search engines. Including many histone PTMs in one search increases the number of candidate peptides dramatically, leading to low search speed and fewer identified spectra. To evaluate database search engines on identifying histone PTMs, we present a method in which one kind of modification is searched each time, for example, unmodified, individually modified, and multimodified, each search result is filtered with false discovery rate less than 1%, and the identifications of multiple search engines are combined to obtain confident results. We apply this method for eight search engines on histone data sets. We find that two search engines, pFind and Mascot, identify most of the confident results at a reasonable speed, so we recommend using them to identify histone modifications. During the evaluation, we also find some important aspects for the analysis of histone modifications. Our evaluation of different search engines on identifying histone modifications will hopefully help those who are hoping to enter the histone proteomics field. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD001118.
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23467006
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Recent advances in mass spectrometry-based proteomics of gastric cancer.
Kang, Changwon; Lee, Yejin; Lee, J Eugene
2016-10-07
The last decade has witnessed remarkable technological advances in mass spectrometry-based proteomics. The development of proteomics techniques has enabled the reliable analysis of complex proteomes, leading to the identification and quantification of thousands of proteins in gastric cancer cells, tissues, and sera. This quantitative information has been used to profile the anomalies in gastric cancer and provide insights into the pathogenic mechanism of the disease. In this review, we mainly focus on the advances in mass spectrometry and quantitative proteomics that were achieved in the last five years and how these up-and-coming technologies are employed to track biochemical changes in gastric cancer cells. We conclude by presenting a perspective on quantitative proteomics and its future applications in the clinic and translational gastric cancer research.
USDA-ARS?s Scientific Manuscript database
Introduction: Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS)is increasingly utilized as a rapid technique to identify microorganisms including pathogenic bacteria. However, little attention has been paid to the significant proteomic information encoded in ...
Cehofski, Lasse Jørgensen; Honoré, Bent; Vorum, Henrik
2017-04-28
Retinal artery occlusion (RAO), retinal vein occlusion (RVO), diabetic retinopathy (DR) and age-related macular degeneration (AMD) are frequent ocular diseases with potentially sight-threatening outcomes. In the present review we discuss major findings of proteomic studies of RAO, RVO, DR and AMD, including an overview of ocular proteome changes associated with anti-vascular endothelial growth factor (VEGF) treatments. Despite the severe outcomes of RAO, the proteome of the disease remains largely unstudied. There is also limited knowledge about the proteome of RVO, but proteomic studies suggest that RVO is associated with remodeling of the extracellular matrix and adhesion processes. Proteomic studies of DR have resulted in the identification of potential therapeutic targets such as carbonic anhydrase-I. Proliferative diabetic retinopathy is the most intensively studied stage of DR. Proteomic studies have established VEGF, pigment epithelium-derived factor (PEDF) and complement components as key factors associated with AMD. The aim of this review is to highlight the major milestones in proteomics in RAO, RVO, DR and AMD. Through large-scale protein analyses, proteomics is bringing new important insights into these complex pathological conditions.
2016 update of the PRIDE database and its related tools
Vizcaíno, Juan Antonio; Csordas, Attila; del-Toro, Noemi; Dianes, José A.; Griss, Johannes; Lavidas, Ilias; Mayer, Gerhard; Perez-Riverol, Yasset; Reisinger, Florian; Ternent, Tobias; Xu, Qing-Wei; Wang, Rui; Hermjakob, Henning
2016-01-01
The PRoteomics IDEntifications (PRIDE) database is one of the world-leading data repositories of mass spectrometry (MS)-based proteomics data. Since the beginning of 2014, PRIDE Archive (http://www.ebi.ac.uk/pride/archive/) is the new PRIDE archival system, replacing the original PRIDE database. Here we summarize the developments in PRIDE resources and related tools since the previous update manuscript in the Database Issue in 2013. PRIDE Archive constitutes a complete redevelopment of the original PRIDE, comprising a new storage backend, data submission system and web interface, among other components. PRIDE Archive supports the most-widely used PSI (Proteomics Standards Initiative) data standard formats (mzML and mzIdentML) and implements the data requirements and guidelines of the ProteomeXchange Consortium. The wide adoption of ProteomeXchange within the community has triggered an unprecedented increase in the number of submitted data sets (around 150 data sets per month). We outline some statistics on the current PRIDE Archive data contents. We also report on the status of the PRIDE related stand-alone tools: PRIDE Inspector, PRIDE Converter 2 and the ProteomeXchange submission tool. Finally, we will give a brief update on the resources under development ‘PRIDE Cluster’ and ‘PRIDE Proteomes’, which provide a complementary view and quality-scored information of the peptide and protein identification data available in PRIDE Archive. PMID:26527722
Colangelo, Christopher M.; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L.; Carriero, Nicholas J.; Gulcicek, Erol E.; Lam, TuKiet T.; Wu, Terence; Bjornson, Robert D.; Bruce, Can; Nairn, Angus C.; Rinehart, Jesse; Miller, Perry L.; Williams, Kenneth R.
2015-01-01
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry (LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED’s database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. PMID:25712262
Molecular Diagnosis and Biomarker Identification on SELDI proteomics data by ADTBoost method.
Wang, Lu-Yong; Chakraborty, Amit; Comaniciu, Dorin
2005-01-01
Clinical proteomics is an emerging field that will have great impact on molecular diagnosis, identification of disease biomarkers, drug discovery and clinical trials in the post-genomic era. Protein profiling in tissues and fluids in disease and pathological control and other proteomics techniques will play an important role in molecular diagnosis with therapeutics and personalized healthcare. We introduced a new robust diagnostic method based on ADTboost algorithm, a novel algorithm in proteomics data analysis to improve classification accuracy. It generates classification rules, which are often smaller and easier to interpret. This method often gives most discriminative features, which can be utilized as biomarkers for diagnostic purpose. Also, it has a nice feature of providing a measure of prediction confidence. We carried out this method in amyotrophic lateral sclerosis (ALS) disease data acquired by surface enhanced laser-desorption/ionization-time-of-flight mass spectrometry (SELDI-TOF MS) experiments. Our method is shown to have outstanding prediction capacity through the cross-validation, ROC analysis results and comparative study. Our molecular diagnosis method provides an efficient way to distinguish ALS disease from neurological controls. The results are expressed in a simple and straightforward alternating decision tree format or conditional format. We identified most discriminative peaks in proteomic data, which can be utilized as biomarkers for diagnosis. It will have broad application in molecular diagnosis through proteomics data analysis and personalized medicine in this post-genomic era.
Colangelo, Christopher M; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L; Carriero, Nicholas J; Gulcicek, Erol E; Lam, TuKiet T; Wu, Terence; Bjornson, Robert D; Bruce, Can; Nairn, Angus C; Rinehart, Jesse; Miller, Perry L; Williams, Kenneth R
2015-02-01
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography-tandem mass spectrometry (LC-MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Russell, Jason D.; Scalf, Mark; Book, Adam J.; Ladror, Daniel T.; Vierstra, Richard D.; Smith, Lloyd M.; Coon, Joshua J.
2013-01-01
Quantification of gas-phase intact protein ions by mass spectrometry (MS) is impeded by highly-variable ionization, ion transmission, and ion detection efficiencies. Therefore, quantification of proteins using MS-associated techniques is almost exclusively done after proteolysis where peptides serve as proxies for estimating protein abundance. Advances in instrumentation, protein separations, and informatics have made large-scale sequencing of intact proteins using top-down proteomics accessible to the proteomics community; yet quantification of proteins using a top-down workflow has largely been unaddressed. Here we describe a label-free approach to determine the abundance of intact proteins separated by nanoflow liquid chromatography prior to MS analysis by using solution-phase measurements of ultraviolet light-induced intrinsic fluorescence (UV-IF). UV-IF is measured directly at the electrospray interface just prior to the capillary exit where proteins containing at least one tryptophan residue are readily detected. UV-IF quantification was demonstrated using commercially available protein standards and provided more accurate and precise protein quantification than MS ion current. We evaluated the parallel use of UV-IF and top-down tandem MS for quantification and identification of protein subunits and associated proteins from an affinity-purified 26S proteasome sample from Arabidopsis thaliana. We identified 26 unique proteins and quantified 13 tryptophan-containing species. Our analyses discovered previously unidentified N-terminal processing of the β6 (PBF1) and β7 (PBG1) subunit - such processing of PBG1 may generate a heretofore unknown additional protease active site upon cleavage. In addition, our approach permitted the unambiguous identification and quantification both isoforms of the proteasome-associated protein DSS1. PMID:23536786
Russell, Jason D; Scalf, Mark; Book, Adam J; Ladror, Daniel T; Vierstra, Richard D; Smith, Lloyd M; Coon, Joshua J
2013-01-01
Quantification of gas-phase intact protein ions by mass spectrometry (MS) is impeded by highly-variable ionization, ion transmission, and ion detection efficiencies. Therefore, quantification of proteins using MS-associated techniques is almost exclusively done after proteolysis where peptides serve as proxies for estimating protein abundance. Advances in instrumentation, protein separations, and informatics have made large-scale sequencing of intact proteins using top-down proteomics accessible to the proteomics community; yet quantification of proteins using a top-down workflow has largely been unaddressed. Here we describe a label-free approach to determine the abundance of intact proteins separated by nanoflow liquid chromatography prior to MS analysis by using solution-phase measurements of ultraviolet light-induced intrinsic fluorescence (UV-IF). UV-IF is measured directly at the electrospray interface just prior to the capillary exit where proteins containing at least one tryptophan residue are readily detected. UV-IF quantification was demonstrated using commercially available protein standards and provided more accurate and precise protein quantification than MS ion current. We evaluated the parallel use of UV-IF and top-down tandem MS for quantification and identification of protein subunits and associated proteins from an affinity-purified 26S proteasome sample from Arabidopsis thaliana. We identified 26 unique proteins and quantified 13 tryptophan-containing species. Our analyses discovered previously unidentified N-terminal processing of the β6 (PBF1) and β7 (PBG1) subunit - such processing of PBG1 may generate a heretofore unknown additional protease active site upon cleavage. In addition, our approach permitted the unambiguous identification and quantification both isoforms of the proteasome-associated protein DSS1.
Beer, Lynn A.; Wang, Huan; Tang, Hsin-Yao; Cao, Zhijun; Chang-Wong, Tony; Tanyi, Janos L.; Zhang, Rugang; Liu, Qin; Speicher, David W.
2013-01-01
The most cancer-specific biomarkers in blood are likely to be proteins shed directly by the tumor rather than less specific inflammatory or other host responses. The use of xenograft mouse models together with in-depth proteome analysis for identification of human proteins in the mouse blood is an under-utilized strategy that can clearly identify proteins shed by the tumor. In the current study, 268 human proteins shed into mouse blood from human OVCAR-3 serous tumors were identified based upon human vs. mouse species differences using a four-dimensional plasma proteome fractionation strategy. A multi-step prioritization and verification strategy was subsequently developed to efficiently select some of the most promising biomarkers from this large number of candidates. A key step was parallel analysis of human proteins detected in the tumor supernatant, because substantially greater sequence coverage for many of the human proteins initially detected in the xenograft mouse plasma confirmed assignments as tumor-derived human proteins. Verification of candidate biomarkers in patient sera was facilitated by in-depth, label-free quantitative comparisons of serum pools from patients with ovarian cancer and benign ovarian tumors. The only proteins that advanced to multiple reaction monitoring (MRM) assay development were those that exhibited increases in ovarian cancer patients compared with benign tumor controls. MRM assays were facilely developed for all 11 novel biomarker candidates selected by this process and analysis of larger pools of patient sera suggested that all 11 proteins are promising candidate biomarkers that should be further evaluated on individual patient blood samples. PMID:23544127
Perez-Riverol, Yasset; Xu, Qing-Wei; Wang, Rui; Uszkoreit, Julian; Griss, Johannes; Sanchez, Aniel; Reisinger, Florian; Csordas, Attila; Ternent, Tobias; Del-Toro, Noemi; Dianes, Jose A; Eisenacher, Martin; Hermjakob, Henning; Vizcaíno, Juan Antonio
2016-01-01
The original PRIDE Inspector tool was developed as an open source standalone tool to enable the visualization and validation of mass-spectrometry (MS)-based proteomics data before data submission or already publicly available in the Proteomics Identifications (PRIDE) database. The initial implementation of the tool focused on visualizing PRIDE data by supporting the PRIDE XML format and a direct access to private (password protected) and public experiments in PRIDE.The ProteomeXchange (PX) Consortium has been set up to enable a better integration of existing public proteomics repositories, maximizing its benefit to the scientific community through the implementation of standard submission and dissemination pipelines. Within the Consortium, PRIDE is focused on supporting submissions of tandem MS data. The increasing use and popularity of the new Proteomics Standards Initiative (PSI) data standards such as mzIdentML and mzTab, and the diversity of workflows supported by the PX resources, prompted us to design and implement a new suite of algorithms and libraries that would build upon the success of the original PRIDE Inspector and would enable users to visualize and validate PX "complete" submissions. The PRIDE Inspector Toolsuite supports the handling and visualization of different experimental output files, ranging from spectra (mzML, mzXML, and the most popular peak lists formats) and peptide and protein identification results (mzIdentML, PRIDE XML, mzTab) to quantification data (mzTab, PRIDE XML), using a modular and extensible set of open-source, cross-platform libraries. We believe that the PRIDE Inspector Toolsuite represents a milestone in the visualization and quality assessment of proteomics data. It is freely available at http://github.com/PRIDE-Toolsuite/. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Perez-Riverol, Yasset; Xu, Qing-Wei; Wang, Rui; Uszkoreit, Julian; Griss, Johannes; Sanchez, Aniel; Reisinger, Florian; Csordas, Attila; Ternent, Tobias; del-Toro, Noemi; Dianes, Jose A.; Eisenacher, Martin; Hermjakob, Henning; Vizcaíno, Juan Antonio
2016-01-01
The original PRIDE Inspector tool was developed as an open source standalone tool to enable the visualization and validation of mass-spectrometry (MS)-based proteomics data before data submission or already publicly available in the Proteomics Identifications (PRIDE) database. The initial implementation of the tool focused on visualizing PRIDE data by supporting the PRIDE XML format and a direct access to private (password protected) and public experiments in PRIDE. The ProteomeXchange (PX) Consortium has been set up to enable a better integration of existing public proteomics repositories, maximizing its benefit to the scientific community through the implementation of standard submission and dissemination pipelines. Within the Consortium, PRIDE is focused on supporting submissions of tandem MS data. The increasing use and popularity of the new Proteomics Standards Initiative (PSI) data standards such as mzIdentML and mzTab, and the diversity of workflows supported by the PX resources, prompted us to design and implement a new suite of algorithms and libraries that would build upon the success of the original PRIDE Inspector and would enable users to visualize and validate PX “complete” submissions. The PRIDE Inspector Toolsuite supports the handling and visualization of different experimental output files, ranging from spectra (mzML, mzXML, and the most popular peak lists formats) and peptide and protein identification results (mzIdentML, PRIDE XML, mzTab) to quantification data (mzTab, PRIDE XML), using a modular and extensible set of open-source, cross-platform libraries. We believe that the PRIDE Inspector Toolsuite represents a milestone in the visualization and quality assessment of proteomics data. It is freely available at http://github.com/PRIDE-Toolsuite/. PMID:26545397
NASA Astrophysics Data System (ADS)
Susnea, Iuliana; Bunk, Sebastian; Wendel, Albrecht; Hermann, Corinna; Przybylski, Michael
2011-04-01
We report here an affinity-proteomics approach that combines 2D-gel electrophoresis and immunoblotting with high performance mass spectrometry to the identification of both full length protein antigens and antigenic fragments of Chlamydophila pneumoniae (C. pneumoniae). The present affinity-mass spectrometry approach effectively utilized high resolution FTICR mass spectrometry and LC-tandem-MS for protein identification, and enabled the identification of several new highly antigenic C. pneumoniae proteins that were not hitherto reported or previously detected only in other Chlamydia species, such as Chlamydia trachomatis. Moreover, high resolution affinity-MS provided the identification of several neo-antigenic protein fragments containing N- and C-terminal, and central domains such as fragments of the membrane protein Pmp21 and the secreted chlamydial proteasome-like factor (Cpaf), representing specific biomarker candidates.
Nomura, Fumio
2015-06-01
Rapid and accurate identification of microorganisms, a prerequisite for appropriate patient care and infection control, is a critical function of any clinical microbiology laboratory. Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) is a quick and reliable method for identification of microorganisms, including bacteria, yeast, molds, and mycobacteria. Indeed, there has been a revolutionary shift in clinical diagnostic microbiology. In the present review, the state of the art and advantages of MALDI-TOF MS-based bacterial identification are described. The potential of this innovative technology for use in strain typing and detection of antibiotic resistance is also discussed. This article is part of a Special Issue entitled: Medical Proteomics. Copyright © 2014 Elsevier B.V. All rights reserved.
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V.; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J.; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wiśniewski, Jacek R.; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes; including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools. PMID:17090601
Human body fluid proteome analysis
Hu, Shen; Loo, Joseph A.; Wong, David T.
2010-01-01
The focus of this article is to review the recent advances in proteome analysis of human body fluids, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, and amniotic fluid, as well as its applications to human disease biomarker discovery. We aim to summarize the proteomics technologies currently used for global identification and quantification of body fluid proteins, and elaborate the putative biomarkers discovered for a variety of human diseases through human body fluid proteome (HBFP) analysis. Some critical concerns and perspectives in this emerging field are also discussed. With the advances made in proteomics technologies, the impact of HBFP analysis in the search for clinically relevant disease biomarkers would be realized in the future. PMID:17083142
Human body fluid proteome analysis.
Hu, Shen; Loo, Joseph A; Wong, David T
2006-12-01
The focus of this article is to review the recent advances in proteome analysis of human body fluids, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, and amniotic fluid, as well as its applications to human disease biomarker discovery. We aim to summarize the proteomics technologies currently used for global identification and quantification of body fluid proteins, and elaborate the putative biomarkers discovered for a variety of human diseases through human body fluid proteome (HBFP) analysis. Some critical concerns and perspectives in this emerging field are also discussed. With the advances made in proteomics technologies, the impact of HBFP analysis in the search for clinically relevant disease biomarkers would be realized in the future.
Understanding Cullin-RING E3 Biology through Proteomics-based Substrate Identification*
Harper, J. Wade; Tan, Meng-Kwang Marcus
2012-01-01
Protein turnover through the ubiquitin-proteasome pathway controls numerous developmental decisions and biochemical processes in eukaryotes. Central to protein ubiquitylation are ubiquitin ligases, which provide specificity in targeted ubiquitylation. With more than 600 ubiquitin ligases encoded by the human genome, many of which remain to be studied, considerable effort is being placed on the development of methods for identifying substrates of specific ubiquitin ligases. In this review, we describe proteomic technologies for the identification of ubiquitin ligase targets, with a particular focus on members of the cullin-RING E3 class of ubiquitin ligases, which use F-box proteins as substrate specific adaptor proteins. Various proteomic methods are described and are compared with genetic approaches that are available. The continued development of such methods is likely to have a substantial impact on the ubiquitin-proteasome field. PMID:22962057
Understanding cullin-RING E3 biology through proteomics-based substrate identification.
Harper, J Wade; Tan, Meng-Kwang Marcus
2012-12-01
Protein turnover through the ubiquitin-proteasome pathway controls numerous developmental decisions and biochemical processes in eukaryotes. Central to protein ubiquitylation are ubiquitin ligases, which provide specificity in targeted ubiquitylation. With more than 600 ubiquitin ligases encoded by the human genome, many of which remain to be studied, considerable effort is being placed on the development of methods for identifying substrates of specific ubiquitin ligases. In this review, we describe proteomic technologies for the identification of ubiquitin ligase targets, with a particular focus on members of the cullin-RING E3 class of ubiquitin ligases, which use F-box proteins as substrate specific adaptor proteins. Various proteomic methods are described and are compared with genetic approaches that are available. The continued development of such methods is likely to have a substantial impact on the ubiquitin-proteasome field.
Steps for successful implementation of proteomic research in the OR.
Martin, Chidima Tsion; Henry, Linda; Martin, Lisa; Ad, Niv
2010-02-01
Proteomic studies (ie, the investigation and identification of proteins found in biological samples such as blood and tissue) are at the forefront of the identification of disease biomarkers and the understanding of proteins. These studies promise to enhance diagnostic and prognostic analysis across all disciplines of clinical practice. As the practice of nursing and medicine becomes more preventative in nature and predictive in terms of patient care, successfully integrating and implementing proteomic research will become increasingly important, especially in the OR. It is imperative that perioperative nurses and researchers establish a collaborative process for specimen collection. Steps in establishing and maintaining a successful specimen collection program include implementing and evaluating a protocol, developing good communication, and keeping all participants up to date on the progress of the study. Copyright 2010 AORN, Inc. Published by Elsevier Inc. All rights reserved.
Giménez, Estela; Gay, Marina; Vilaseca, Marta
2017-01-30
Here we demonstrate the potential of nano-UPLC-LTQ-FT-MS and the Byonic™ proteomic search engine for the separation, detection, and identification of N- and O-glycopeptide glycoforms in standard glycoproteins. The use of a BEH C18 nanoACQUITY column allowed the separation of the glycopeptides present in the glycoprotein digest and a baseline-resolution of the glycoforms of the same glycopeptide on the basis of the number of sialic acids. Moreover, we evaluated several acquisition strategies in order to improve the detection and characterization of glycopeptide glycoforms with the maximum number of identification percentages. The proposed strategy is simple to set up with the technology platforms commonly used in proteomic labs. The method allows the straightforward and rapid obtention of a general glycosylated map of a given protein, including glycosites and their corresponding glycosylated structures. The MS strategy selected in this work, based on a gas phase fractionation approach, led to 136 unique peptides from four standard proteins, which represented 78% of the total number of peptides identified. Moreover, the method does not require an extra glycopeptide enrichment step, thus preventing the bias that this step could cause towards certain glycopeptide species. Data are available via ProteomeXchange with identifier PXD003578. We propose a simple and high-throughput glycoproteomics-based methodology that allows the separation of glycopeptide glycoforms on the basis of the number of sialic acids, and their automatic and rapid identification without prior knowledge of protein glycosites or type and structure of the glycans. Copyright © 2016 Elsevier B.V. All rights reserved.
Generic comparison of protein inference engines.
Claassen, Manfred; Reiter, Lukas; Hengartner, Michael O; Buhmann, Joachim M; Aebersold, Ruedi
2012-04-01
Protein identifications, instead of peptide-spectrum matches, constitute the biologically relevant result of shotgun proteomics studies. How to appropriately infer and report protein identifications has triggered a still ongoing debate. This debate has so far suffered from the lack of appropriate performance measures that allow us to objectively assess protein inference approaches. This study describes an intuitive, generic and yet formal performance measure and demonstrates how it enables experimentalists to select an optimal protein inference strategy for a given collection of fragment ion spectra. We applied the performance measure to systematically explore the benefit of excluding possibly unreliable protein identifications, such as single-hit wonders. Therefore, we defined a family of protein inference engines by extending a simple inference engine by thousands of pruning variants, each excluding a different specified set of possibly unreliable identifications. We benchmarked these protein inference engines on several data sets representing different proteomes and mass spectrometry platforms. Optimally performing inference engines retained all high confidence spectral evidence, without posterior exclusion of any type of protein identifications. Despite the diversity of studied data sets consistently supporting this rule, other data sets might behave differently. In order to ensure maximal reliable proteome coverage for data sets arising in other studies we advocate abstaining from rigid protein inference rules, such as exclusion of single-hit wonders, and instead consider several protein inference approaches and assess these with respect to the presented performance measure in the specific application context.
Sys-BodyFluid: a systematical database for human body fluid proteome research
Li, Su-Jun; Peng, Mao; Li, Hong; Liu, Bo-Shu; Wang, Chuan; Wu, Jia-Rui; Li, Yi-Xue; Zeng, Rong
2009-01-01
Recently, body fluids have widely become an important target for proteomic research and proteomic study has produced more and more body fluid related protein data. A database is needed to collect and analyze these proteome data. Thus, we developed this web-based body fluid proteome database Sys-BodyFluid. It contains eleven kinds of body fluid proteomes, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, seminal fluid, human milk and amniotic fluid. Over 10 000 proteins are presented in the Sys-BodyFluid. Sys-BodyFluid provides the detailed protein annotations, including protein description, Gene Ontology, domain information, protein sequence and involved pathways. These proteome data can be retrieved by using protein name, protein accession number and sequence similarity. In addition, users can query between these different body fluids to get the different proteins identification information. Sys-BodyFluid database can facilitate the body fluid proteomics and disease proteomics research as a reference database. It is available at http://www.biosino.org/bodyfluid/. PMID:18978022
Sys-BodyFluid: a systematical database for human body fluid proteome research.
Li, Su-Jun; Peng, Mao; Li, Hong; Liu, Bo-Shu; Wang, Chuan; Wu, Jia-Rui; Li, Yi-Xue; Zeng, Rong
2009-01-01
Recently, body fluids have widely become an important target for proteomic research and proteomic study has produced more and more body fluid related protein data. A database is needed to collect and analyze these proteome data. Thus, we developed this web-based body fluid proteome database Sys-BodyFluid. It contains eleven kinds of body fluid proteomes, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, seminal fluid, human milk and amniotic fluid. Over 10,000 proteins are presented in the Sys-BodyFluid. Sys-BodyFluid provides the detailed protein annotations, including protein description, Gene Ontology, domain information, protein sequence and involved pathways. These proteome data can be retrieved by using protein name, protein accession number and sequence similarity. In addition, users can query between these different body fluids to get the different proteins identification information. Sys-BodyFluid database can facilitate the body fluid proteomics and disease proteomics research as a reference database. It is available at http://www.biosino.org/bodyfluid/.
Seo, Moon-Hyeong; Nim, Satra; Jeon, Jouhyun; Kim, Philip M
2017-01-01
Protein-protein interactions are essential to cellular functions and signaling pathways. We recently combined bioinformatics and custom oligonucleotide arrays to construct custom-made peptide-phage libraries for screening peptide-protein interactions, an approach we call proteomic peptide-phage display (ProP-PD). In this chapter, we describe protocols for phage display for the identification of natural peptide binders for a given protein. We finally describe deep sequencing for the analysis of the proteomic peptide-phage display.
2014-09-01
total number of 538 phosphopeptides were identified, among which 350 phosphopeptides had been identified with the first round of TiO2 enrichment and 430...year research and the collection of proteomic and phosphoproteomic data is still in process. PRODUCTS Manuscripts: Yue XS , Hummon AB. Combining...of IMAC and TiO2 enrichment methods to increase phosphoproteomic identifications, manuscript in preparation. Yue XS , Hummon AB. Proteomic and
Scientific Workflow Management in Proteomics
de Bruin, Jeroen S.; Deelder, André M.; Palmblad, Magnus
2012-01-01
Data processing in proteomics can be a challenging endeavor, requiring extensive knowledge of many different software packages, all with different algorithms, data format requirements, and user interfaces. In this article we describe the integration of a number of existing programs and tools in Taverna Workbench, a scientific workflow manager currently being developed in the bioinformatics community. We demonstrate how a workflow manager provides a single, visually clear and intuitive interface to complex data analysis tasks in proteomics, from raw mass spectrometry data to protein identifications and beyond. PMID:22411703
Helsens, Kenny; Colaert, Niklaas; Barsnes, Harald; Muth, Thilo; Flikka, Kristian; Staes, An; Timmerman, Evy; Wortelkamp, Steffi; Sickmann, Albert; Vandekerckhove, Joël; Gevaert, Kris; Martens, Lennart
2010-03-01
MS-based proteomics produces large amounts of mass spectra that require processing, identification and possibly quantification before interpretation can be undertaken. High-throughput studies require automation of these various steps, and management of the data in association with the results obtained. We here present ms_lims (http://genesis.UGent.be/ms_lims), a freely available, open-source system based on a central database to automate data management and processing in MS-driven proteomics analyses.
A HUPO test sample study reveals common problems in mass spectrometry-based proteomics
Bell, Alexander W.; Deutsch, Eric W.; Au, Catherine E.; Kearney, Robert E.; Beavis, Ron; Sechi, Salvatore; Nilsson, Tommy; Bergeron, John J.M.
2009-01-01
We carried out a test sample study to try to identify errors leading to irreproducibility, including incompleteness of peptide sampling, in LC-MS-based proteomics. We distributed a test sample consisting of an equimolar mix of 20 highly purified recombinant human proteins, to 27 laboratories for identification. Each protein contained one or more unique tryptic peptides of 1250 Da to also test for ion selection and sampling in the mass spectrometer. Of the 27 labs, initially only 7 labs reported all 20 proteins correctly, and only 1 lab reported all the tryptic peptides of 1250 Da. Nevertheless, a subsequent centralized analysis of the raw data revealed that all 20 proteins and most of the 1250 Da peptides had in fact been detected by all 27 labs. The centralized analysis allowed us to determine sources of problems encountered in the study, which include missed identifications (false negatives), environmental contamination, database matching, and curation of protein identifications. Improved search engines and databases are likely to increase the fidelity of mass spectrometry-based proteomics. PMID:19448641
Identification of Carboxypeptidase Substrates by C-Terminal COFRADIC.
Tanco, Sebastian; Aviles, Francesc Xavier; Gevaert, Kris; Lorenzo, Julia; Van Damme, Petra
2017-01-01
We here present a detailed procedure for studying protein C-termini and their posttranslational modifications by C-terminal COFRADIC. In fact, this procedure can enrich for both C-terminal and N-terminal peptides through a combination of a strong cation exchange fractionation step at low pH, which removes the majority of nonterminal peptides in whole-proteome digests, while the actual COFRADIC step segregates C-terminal peptides from N-terminal peptides. When used in a differential mode, C-terminal COFRADIC allows for the identification of neo-C-termini generated by the action of proteases, which in turn leads to the identification of protease substrates. More specifically, this technology can be applied to determine the natural substrate repertoire of carboxypeptidases on a proteome-wide scale.
Trends in mass spectrometry instrumentation for proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, Richard D.
2002-12-01
Mass spectrometry has become a primary tool for proteomics due to its capabilities for rapid and sensitive protein identification and quantitation. It is now possible to identify thousands of proteins from microgram sample quantities in a single day and to quantify relative protein abundances. However, the needs for increased capabilities for proteome measurements are immense and are now driving both new strategies and instrument advances. These developments include those based on integration with multi-dimensional liquid separations and high accuracy mass measurements, and promise more than order of magnitude improvements in sensitivity, dynamic range, and throughput for proteomic analyses in themore » near future.« less
Mass spectrometry-based proteomics for translational research: a technical overview.
Paulo, Joao A; Kadiyala, Vivek; Banks, Peter A; Steen, Hanno; Conwell, Darwin L
2012-03-01
Mass spectrometry-based investigation of clinical samples enables the high-throughput identification of protein biomarkers. We provide an overview of mass spectrometry-based proteomic techniques that are applicable to the investigation of clinical samples. We address sample collection, protein extraction and fractionation, mass spectrometry modalities, and quantitative proteomics. Finally, we examine the limitations and further potential of such technologies. Liquid chromatography fractionation coupled with tandem mass spectrometry is well suited to handle mixtures of hundreds or thousands of proteins. Mass spectrometry-based proteome elucidation can reveal potential biomarkers and aid in the development of hypotheses for downstream investigation of the molecular mechanisms of disease.
Label-free proteome of water buffalo (Bubalus bubalis) seminal plasma.
Brito, Mayara F; Auler, Patrícia A; Tavares, Guilherme C; Rezende, Cristiana P; Almeida, Gabriel M F; Pereira, Felipe L; Leal, Carlos A G; Moura, Arlindo de Alencar; Figueiredo, Henrique C P; Henry, Marc
2018-06-11
The study aimed to describe the Bubalus bubalis seminal plasma proteome using a label-free shotgun UDMS E approach. A total of 859 nonredundant proteins were identified across five biological replicates with stringent identification. Proteins specifically related to sperm maturation and protection, capacitation, fertilization and metabolic activity were detected in the buffalo seminal fluid. In conclusion, we provide a comprehensive proteomic profile of buffalo seminal plasma, which establishes a foundation for further studies designed to understand regulation of sperm function and discovery of novel biomarkers for fertility. MS data are available in the ProteomeXchange with identifier PXD003728. © 2018 Blackwell Verlag GmbH.
Mass Spectrometry-Based Proteomics for Translational Research: A Technical Overview
Paulo, Joao A.; Kadiyala, Vivek; Banks, Peter A.; Steen, Hanno; Conwell, Darwin L.
2012-01-01
Mass spectrometry-based investigation of clinical samples enables the high-throughput identification of protein biomarkers. We provide an overview of mass spectrometry-based proteomic techniques that are applicable to the investigation of clinical samples. We address sample collection, protein extraction and fractionation, mass spectrometry modalities, and quantitative proteomics. Finally, we examine the limitations and further potential of such technologies. Liquid chromatography fractionation coupled with tandem mass spectrometry is well suited to handle mixtures of hundreds or thousands of proteins. Mass spectrometry-based proteome elucidation can reveal potential biomarkers and aid in the development of hypotheses for downstream investigation of the molecular mechanisms of disease. PMID:22461744
Using HPLC-Mass Spectrometry to Teach Proteomics Concepts with Problem-Based Techniques
ERIC Educational Resources Information Center
Short, Michael; Short, Anne; Vankempen, Rachel; Seymour, Michael; Burnatowska-Hledin, Maria
2010-01-01
Practical instruction of proteomics concepts was provided using high-performance liquid chromatography coupled with a mass selective detection system (HPLC-MS) for the analysis of simulated protein digests. The samples were prepared from selected dipeptides in order to facilitate the mass spectral identification. As part of the prelaboratory…
MALDI-TOF MS of Trichoderma: A model system for the identification of microfungi
USDA-ARS?s Scientific Manuscript database
This investigation aimed to assess whether MALDI-TOF MS analysis of proteomics could be applied to the study of Trichoderma, a fungal genus selected because it includes many species and is phylogenetically well defined. We also investigated whether MALDI-TOF MS analysis of proteomics would reveal ap...
P2P proteomics -- data sharing for enhanced protein identification
2012-01-01
Background In order to tackle the important and challenging problem in proteomics of identifying known and new protein sequences using high-throughput methods, we propose a data-sharing platform that uses fully distributed P2P technologies to share specifications of peer-interaction protocols and service components. By using such a platform, information to be searched is no longer centralised in a few repositories but gathered from experiments in peer proteomics laboratories, which can subsequently be searched by fellow researchers. Methods The system distributively runs a data-sharing protocol specified in the Lightweight Communication Calculus underlying the system through which researchers interact via message passing. For this, researchers interact with the system through particular components that link to database querying systems based on BLAST and/or OMSSA and GUI-based visualisation environments. We have tested the proposed platform with data drawn from preexisting MS/MS data reservoirs from the 2006 ABRF (Association of Biomolecular Resource Facilities) test sample, which was extensively tested during the ABRF Proteomics Standards Research Group 2006 worldwide survey. In particular we have taken the data available from a subset of proteomics laboratories of Spain's National Institute for Proteomics, ProteoRed, a network for the coordination, integration and development of the Spanish proteomics facilities. Results and Discussion We performed queries against nine databases including seven ProteoRed proteomics laboratories, the NCBI Swiss-Prot database and the local database of the CSIC/UAB Proteomics Laboratory. A detailed analysis of the results indicated the presence of a protein that was supported by other NCBI matches and highly scored matches in several proteomics labs. The analysis clearly indicated that the protein was a relatively high concentrated contaminant that could be present in the ABRF sample. This fact is evident from the information that could be derived from the proposed P2P proteomics system, however it is not straightforward to arrive to the same conclusion by conventional means as it is difficult to discard organic contamination of samples. The actual presence of this contaminant was only stated after the ABRF study of all the identifications reported by the laboratories. PMID:22293032
Li, Nan; Stein, Richard S L; He, Wei; Komives, Elizabeth; Wang, Wei
2013-10-01
Methylation is one of the important post-translational modifications that play critical roles in regulating protein functions. Proteomic identification of this post-translational modification and understanding how it affects protein activity remain great challenges. We tackled this problem from the aspect of methylation mediating protein-protein interaction. Using the chromodomain of human chromobox protein homolog 6 as a model system, we developed a systematic approach that integrates structure modeling, bioinformatics analysis, and peptide microarray experiments to identify lysine residues that are methylated and recognized by the chromodomain in the human proteome. Given the important role of chromobox protein homolog 6 as a reader of histone modifications, it was interesting to find that the majority of its interacting partners identified via this approach function in chromatin remodeling and transcriptional regulation. Our study not only illustrates a novel angle for identifying methyllysines on a proteome-wide scale and elucidating their potential roles in regulating protein function, but also suggests possible strategies for engineering the chromodomain-peptide interface to enhance the recognition of and manipulate the signal transduction mediated by such interactions.
Skillbäck, Tobias; Mattsson, Niklas; Hansson, Karl; Mirgorodskaya, Ekaterina; Dahlén, Rahil; van der Flier, Wiesje; Scheltens, Philip; Duits, Floor; Hansson, Oskar; Teunissen, Charlotte; Blennow, Kaj; Zetterberg, Henrik; Gobom, Johan
2017-10-17
We present a new, quantification-driven proteomic approach to identifying biomarkers. In contrast to the identification-driven approach, limited in scope to peptides that are identified by database searching in the first step, all MS data are considered to select biomarker candidates. The endopeptidome of cerebrospinal fluid from 40 Alzheimer's disease (AD) patients, 40 subjects with mild cognitive impairment, and 40 controls with subjective cognitive decline was analyzed using multiplex isobaric labeling. Spectral clustering was used to match MS/MS spectra. The top biomarker candidate cluster (215% higher in AD compared to controls, area under ROC curve = 0.96) was identified as a fragment of pleiotrophin located near the protein's C-terminus. Analysis of another cohort (n = 60 over four clinical groups) verified that the biomarker was increased in AD patients while no change in controls, Parkinson's disease or progressive supranuclear palsy was observed. The identification of the novel biomarker pleiotrophin 151-166 demonstrates that our quantification-driven proteomic approach is a promising method for biomarker discovery, which may be universally applicable in clinical proteomics.
Implementation of statistical process control for proteomic experiments via LC MS/MS.
Bereman, Michael S; Johnson, Richard; Bollinger, James; Boss, Yuval; Shulman, Nick; MacLean, Brendan; Hoofnagle, Andrew N; MacCoss, Michael J
2014-04-01
Statistical process control (SPC) is a robust set of tools that aids in the visualization, detection, and identification of assignable causes of variation in any process that creates products, services, or information. A tool has been developed termed Statistical Process Control in Proteomics (SProCoP) which implements aspects of SPC (e.g., control charts and Pareto analysis) into the Skyline proteomics software. It monitors five quality control metrics in a shotgun or targeted proteomic workflow. None of these metrics require peptide identification. The source code, written in the R statistical language, runs directly from the Skyline interface, which supports the use of raw data files from several of the mass spectrometry vendors. It provides real time evaluation of the chromatographic performance (e.g., retention time reproducibility, peak asymmetry, and resolution), and mass spectrometric performance (targeted peptide ion intensity and mass measurement accuracy for high resolving power instruments) via control charts. Thresholds are experiment- and instrument-specific and are determined empirically from user-defined quality control standards that enable the separation of random noise and systematic error. Finally, Pareto analysis provides a summary of performance metrics and guides the user to metrics with high variance. The utility of these charts to evaluate proteomic experiments is illustrated in two case studies.
Cehofski, Lasse Jørgensen; Honoré, Bent; Vorum, Henrik
2017-01-01
Retinal artery occlusion (RAO), retinal vein occlusion (RVO), diabetic retinopathy (DR) and age-related macular degeneration (AMD) are frequent ocular diseases with potentially sight-threatening outcomes. In the present review we discuss major findings of proteomic studies of RAO, RVO, DR and AMD, including an overview of ocular proteome changes associated with anti-vascular endothelial growth factor (VEGF) treatments. Despite the severe outcomes of RAO, the proteome of the disease remains largely unstudied. There is also limited knowledge about the proteome of RVO, but proteomic studies suggest that RVO is associated with remodeling of the extracellular matrix and adhesion processes. Proteomic studies of DR have resulted in the identification of potential therapeutic targets such as carbonic anhydrase-I. Proliferative diabetic retinopathy is the most intensively studied stage of DR. Proteomic studies have established VEGF, pigment epithelium-derived factor (PEDF) and complement components as key factors associated with AMD. The aim of this review is to highlight the major milestones in proteomics in RAO, RVO, DR and AMD. Through large-scale protein analyses, proteomics is bringing new important insights into these complex pathological conditions. PMID:28452939
Liu, Junyan; Liu, Yang; Gao, Mingxia; Zhang, Xiangmin
2012-08-01
A facile proteomic quantification method, fluorescent labeling absolute quantification (FLAQ), was developed. Instead of using MS for quantification, the FLAQ method is a chromatography-based quantification in combination with MS for identification. Multidimensional liquid chromatography (MDLC) with laser-induced fluorescence (LIF) detection with high accuracy and tandem MS system were employed for FLAQ. Several requirements should be met for fluorescent labeling in MS identification: Labeling completeness, minimum side-reactions, simple MS spectra, and no extra tandem MS fragmentations for structure elucidations. A fluorescence dye, 5-iodoacetamidofluorescein, was finally chosen to label proteins on all cysteine residues. The fluorescent dye was compatible with the process of the trypsin digestion and MALDI MS identification. Quantitative labeling was achieved with optimization of reacting conditions. A synthesized peptide and model proteins, BSA (35 cysteines), OVA (five cysteines), were used for verifying the completeness of labeling. Proteins were separated through MDLC and quantified based on fluorescent intensities, followed by MS identification. High accuracy (RSD% < 1.58) and wide linearity of quantification (1-10(5) ) were achieved by LIF detection. The limit of quantitation for the model protein was as low as 0.34 amol. Parts of proteins in human liver proteome were quantified and demonstrated using FLAQ. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kim, So W.; Gupta, Ravi; Lee, Seo H.; Min, Cheol W.; Agrawal, Ganesh K.; Rakwal, Randeep; Kim, Jong B.; Jo, Ick H.; Park, Soo-Yun; Kim, Jae K.; Kim, Young-Chang; Bang, Kyong H.; Kim, Sun T.
2016-01-01
Panax ginseng roots are well known for their medicinal properties and have been used in Korean and Chinese traditional medicines for 1000s of years. However, the medicinal value of P. ginseng fruits remain poorly characterized. In this study, we used an integrated biochemical, proteomics, and metabolomics approach to look into the medicinal properties of ginseng fruits. DPPH (1,1-diphenyl-2-picrylhydrazyl) and ABTS [2,2′-azino-bis (3-ethylbenzothiazoline-6-sulphonic acid)] assays showed higher antioxidant activities in ginseng fruits than leaves or roots. Two-dimensional gel electrophoresis (2-DE) profiling of ginseng fruit proteins (cv. Cheongsun) showed more than 400 spots wherein a total of 81 protein spots were identified by mass spectrometry using NCBInr, UniRef, and an in-house developed RNAseq (59,251 protein sequences)-based databases. Gene ontology analysis showed that most of the identified proteins were related to the hydrolase (18%), oxidoreductase (16%), and ATP binding (15%) activities. Further, a comparative proteome analysis of four cultivars of ginseng fruits (cvs. Yunpoong, Gumpoong, Chunpoong, and Cheongsun) led to the identification of 22 differentially modulated protein spots. Using gas chromatography-time of flight mass spectrometry (GC-TOF MS), 66 metabolites including amino acids, sugars, organic acids, phenolic acids, phytosterols, tocopherols, and policosanols were identified and quantified. Some of these are well known medicinal compounds and were not previously identified in ginseng. Interestingly, the concentration of almost all metabolites was higher in the Chunpoong and Gumpoong cultivars. Parallel comparison of the four cultivars also revealed higher amounts of the medicinal metabolites in Chunpoong and Gumpoong cultivars. Taken together, our results demonstrate that ginseng fruits are a rich source of medicinal compounds with potential beneficial health effects. PMID:27458475
Targeted proteomics identifies liquid-biopsy signatures for extracapsular prostate cancer
Kim, Yunee; Jeon, Jouhyun; Mejia, Salvador; Yao, Cindy Q; Ignatchenko, Vladimir; Nyalwidhe, Julius O; Gramolini, Anthony O; Lance, Raymond S; Troyer, Dean A; Drake, Richard R; Boutros, Paul C; Semmes, O. John; Kislinger, Thomas
2016-01-01
Biomarkers are rapidly gaining importance in personalized medicine. Although numerous molecular signatures have been developed over the past decade, there is a lack of overlap and many biomarkers fail to validate in independent patient cohorts and hence are not useful for clinical application. For these reasons, identification of novel and robust biomarkers remains a formidable challenge. We combine targeted proteomics with computational biology to discover robust proteomic signatures for prostate cancer. Quantitative proteomics conducted in expressed prostatic secretions from men with extraprostatic and organ-confined prostate cancers identified 133 differentially expressed proteins. Using synthetic peptides, we evaluate them by targeted proteomics in a 74-patient cohort of expressed prostatic secretions in urine. We quantify a panel of 34 candidates in an independent 207-patient cohort. We apply machine-learning approaches to develop clinical predictive models for prostate cancer diagnosis and prognosis. Our results demonstrate that computationally guided proteomics can discover highly accurate non-invasive biomarkers. PMID:27350604
Vizcaíno, Juan Antonio; Foster, Joseph M.; Martens, Lennart
2010-01-01
Despite the fact that data deposition is not a generalised fact yet in the field of proteomics, several mass spectrometry (MS) based proteomics repositories are publicly available for the scientific community. The main existing resources are: the Global Proteome Machine Database (GPMDB), PeptideAtlas, the PRoteomics IDEntifications database (PRIDE), Tranche, and NCBI Peptidome. In this review the capabilities of each of these will be described, paying special attention to four key properties: data types stored, applicable data submission strategies, supported formats, and available data mining and visualization tools. Additionally, the data contents from model organisms will be enumerated for each resource. There are other valuable smaller and/or more specialized repositories but they will not be covered in this review. Finally, the concept behind the ProteomeXchange consortium, a collaborative effort among the main resources in the field, will be introduced. PMID:20615486
Mass spectrometry based proteomics: existing capabilities and future directions
Angel, Thomas E.; Aryal, Uma K.; Hengel, Shawna M.; Baker, Erin S.; Kelly, Ryan T.; Robinson, Errol W.; Smith, Richard D.
2012-01-01
Mass spectrometry (MS)-based proteomics is emerging as a broadly effective means for identification, characterization, and quantification of proteins that are integral components of the processes essential for life. Characterization of proteins at the proteome and sub-proteome (e.g., the phosphoproteome, proteoglycome, or degradome/peptidome) levels provides a foundation for understanding fundamental aspects of biology. Emerging technologies such as ion mobility separations coupled with MS and microchip-based-proteome measurements combined with MS instrumentation and chromatographic separation techniques, such as nanoscale reversed phase liquid chromatography and capillary electrophoresis, show great promise for both broad undirected and targeted highly sensitive measurements. MS-based proteomics is increasingly contribute to our understanding of the dynamics, interactions, and roles that proteins and peptides play, advancing our understanding of biology on a systems wide level for a wide range of applications including investigations of microbial communities, bioremediation, and human health. PMID:22498958
Zhou, Li; Wang, Kui; Li, Qifu; Nice, Edouard C; Zhang, Haiyuan; Huang, Canhua
2016-01-01
Cancer is a common disease that is a leading cause of death worldwide. Currently, early detection and novel therapeutic strategies are urgently needed for more effective management of cancer. Importantly, protein profiling using clinical proteomic strategies, with spectacular sensitivity and precision, offer excellent promise for the identification of potential biomarkers that would direct the development of targeted therapeutic anticancer drugs for precision medicine. In particular, clinical sample sources, including tumor tissues and body fluids (blood, feces, urine and saliva), have been widely investigated using modern high-throughput mass spectrometry-based proteomic approaches combined with bioinformatic analysis, to pursue the possibilities of precision medicine for targeted cancer therapy. Discussed in this review are the current advantages and limitations of clinical proteomics, the available strategies of clinical proteomics for the management of precision medicine, as well as the challenges and future perspectives of clinical proteomics-driven precision medicine for targeted cancer therapy.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Ying; Zhao, Rui; Piehowski, Paul D.
One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-µm-i.d. columns increase signal intensity by >3-fold relative to those using 75-µm-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos mass spectrometer significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap), leading tomore » a ~3× increase in peptide identifications and 1.7× increase in identified protein groups for 2 ng tryptic digests of bacterial lysate. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by ~ 95% for 0.5 ng samples and by ~42% for 2 ng samples. The present platform is capable of identifying >3000 protein groups from tryptic digestion of cell lysates equivalent to 50 HeLa cells and 100 THP-1 cells (~10 ng total proteins), respectively, and >950 proteins from subnanogram bacterial and archaeal cell lysates. The present ultrasensitive LC-MS platform is expected to enable deep proteome coverage for subnanogram samples, including single mammalian cells.« less
Identification of new intrinsic proteins in Arabidopsis plasma membrane proteome.
Marmagne, Anne; Rouet, Marie-Aude; Ferro, Myriam; Rolland, Norbert; Alcon, Carine; Joyard, Jacques; Garin, Jérome; Barbier-Brygoo, Hélène; Ephritikhine, Geneviève
2004-07-01
Identification and characterization of anion channel genes in plants represent a goal for a better understanding of their central role in cell signaling, osmoregulation, nutrition, and metabolism. Though channel activities have been well characterized in plasma membrane by electrophysiology, the corresponding molecular entities are little documented. Indeed, the hydrophobic protein equipment of plant plasma membrane still remains largely unknown, though several proteomic approaches have been reported. To identify new putative transport systems, we developed a new proteomic strategy based on mass spectrometry analyses of a plasma membrane fraction enriched in hydrophobic proteins. We produced from Arabidopsis cell suspensions a highly purified plasma membrane fraction and characterized it in detail by immunological and enzymatic tests. Using complementary methods for the extraction of hydrophobic proteins and mass spectrometry analyses on mono-dimensional gels, about 100 proteins have been identified, 95% of which had never been found in previous proteomic studies. The inventory of the plasma membrane proteome generated by this approach contains numerous plasma membrane integral proteins, one-third displaying at least four transmembrane segments. The plasma membrane localization was confirmed for several proteins, therefore validating such proteomic strategy. An in silico analysis shows a correlation between the putative functions of the identified proteins and the expected roles for plasma membrane in transport, signaling, cellular traffic, and metabolism. This analysis also reveals 10 proteins that display structural properties compatible with transport functions and will constitute interesting targets for further functional studies.
Choosing an Optimal Database for Protein Identification from Tandem Mass Spectrometry Data.
Kumar, Dhirendra; Yadav, Amit Kumar; Dash, Debasis
2017-01-01
Database searching is the preferred method for protein identification from digital spectra of mass to charge ratios (m/z) detected for protein samples through mass spectrometers. The search database is one of the major influencing factors in discovering proteins present in the sample and thus in deriving biological conclusions. In most cases the choice of search database is arbitrary. Here we describe common search databases used in proteomic studies and their impact on final list of identified proteins. We also elaborate upon factors like composition and size of the search database that can influence the protein identification process. In conclusion, we suggest that choice of the database depends on the type of inferences to be derived from proteomics data. However, making additional efforts to build a compact and concise database for a targeted question should generally be rewarding in achieving confident protein identifications.
Dentistry proteomics: from laboratory development to clinical practice.
Rezende, Taia M B; Lima, Stella M F; Petriz, Bernardo A; Silva, Osmar N; Freire, Mirna S; Franco, Octávio L
2013-12-01
Despite all the dental information acquired over centuries and the importance of proteome research, the cross-link between these two areas only emerged around mid-nineties. Proteomic tools can help dentistry in the identification of risk factors, early diagnosis, prevention, and systematic control that will promote the evolution of treatment in all dentistry specialties. This review mainly focuses on the evolution of dentistry in different specialties based on proteomic research and how these tools can improve knowledge in dentistry. The subjects covered are an overview of proteomics in dentistry, specific information on different fields in dentistry (dental structure, restorative dentistry, endodontics, periodontics, oral pathology, oral surgery, and orthodontics) and future directions. There are many new proteomic technologies that have never been used in dentistry studies and some dentistry areas that have never been explored by proteomic tools. It is expected that a greater integration of these areas will help to understand what is still unknown in oral health and disease. Copyright © 2013 Wiley Periodicals, Inc.
Affordable proteomics: the two-hybrid systems.
Gillespie, Marc
2003-06-01
Numerous proteomic methodologies exist, but most require a heavy investment in expertise and technology. This puts these approaches out of reach for many laboratories and small companies, rarely allowing proteomics to be used as a pilot approach for biomarker or target identification. Two proteomic approaches, 2D gel electrophoresis and the two-hybrid systems, are currently available to most researchers. The two-hybrid systems, though accommodating to large-scale experiments, were originally designed as practical screens, that by comparison to current proteomics tools were small-scale, affordable and technically feasible. The screens rapidly generated data, identifying protein interactions that were previously uncharacterized. The foundation for a two-hybrid proteomic investigation can be purchased as separate kits from a number of companies. The true power of the technique lies not in its affordability, but rather in its portability. The two-hybrid system puts proteomics back into laboratories where the output of the screens can be evaluated by researchers with experience in the particular fields of basic research, cancer biology, toxicology or drug development.
Tipton, Jeremiah D; Tran, John C; Catherman, Adam D; Ahlf, Dorothy R; Durbin, Kenneth R; Lee, Ji Eun; Kellie, John F; Kelleher, Neil L; Hendrickson, Christopher L; Marshall, Alan G
2012-03-06
Current high-throughput top-down proteomic platforms provide routine identification of proteins less than 25 kDa with 4-D separations. This short communication reports the application of technological developments over the past few years that improve protein identification and characterization for masses greater than 25 kDa. Advances in separation science have allowed increased numbers of proteins to be identified, especially by nanoliquid chromatography (nLC) prior to mass spectrometry (MS) analysis. Further, a goal of high-throughput top-down proteomics is to extend the mass range for routine nLC MS analysis up to 80 kDa because gene sequence analysis predicts that ~70% of the human proteome is transcribed to be less than 80 kDa. Normally, large proteins greater than 50 kDa are identified and characterized by top-down proteomics through fraction collection and direct infusion at relatively low throughput. Further, other MS-based techniques provide top-down protein characterization, however at low resolution for intact mass measurement. Here, we present analysis of standard (up to 78 kDa) and whole cell lysate proteins by Fourier transform ion cyclotron resonance mass spectrometry (nLC electrospray ionization (ESI) FTICR MS). The separation platform reduced the complexity of the protein matrix so that, at 14.5 T, proteins from whole cell lysate up to 72 kDa are baseline mass resolved on a nano-LC chromatographic time scale. Further, the results document routine identification of proteins at improved throughput based on accurate mass measurement (less than 10 ppm mass error) of precursor and fragment ions for proteins up to 50 kDa.
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes.
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wisniewski, Jacek R; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes; including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at http://www.mapuproteome.com using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools.
Proteomics in medical microbiology.
Cash, P
2000-04-01
The techniques of proteomics (high resolution two-dimensional electrophoresis and protein characterisation) are widely used for microbiological research to analyse global protein synthesis as an indicator of gene expression. The rapid progress in microbial proteomics has been achieved through the wide availability of whole genome sequences for a number of bacterial groups. Beyond providing a basic understanding of microbial gene expression, proteomics has also played a role in medical areas of microbiology. Progress has been made in the use of the techniques for investigating the epidemiology and taxonomy of human microbial pathogens, the identification of novel pathogenic mechanisms and the analysis of drug resistance. In each of these areas, proteomics has provided new insights that complement genomic-based investigations. This review describes the current progress in these research fields and highlights some of the technical challenges existing for the application of proteomics in medical microbiology. The latter concern the analysis of genetically heterogeneous bacterial populations and the integration of the proteomic and genomic data for these bacteria. The characterisation of the proteomes of bacterial pathogens growing in their natural hosts remains a future challenge.
A new calibrant for MALDI-TOF-TOF-PSD-MS/MS of non-digested proteins for top-down proteomic analysis
USDA-ARS?s Scientific Manuscript database
RATIONALE: Matrix-assisted laser desorption/ionization (MALDI) time-of-flight-time-of-flight (TOF-TOF) tandem mass spectrometry (MS/MS) has seen increasing use for post-source decay (PSD)-MS/MS analysis of non-digested protein ions for top-down proteomic identification. However, there is no commonl...
Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.
Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin
2014-06-13
Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858bp, including 26,413 clusters and 27,794 singletons. A total of 42,515(78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10(-5), of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741(43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
FunRich proteomics software analysis, let the fun begin!
Benito-Martin, Alberto; Peinado, Héctor
2015-08-01
Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Uddin, Reaz; Sufian, Muhammad
2016-01-01
Infections caused by Salmonella enterica, a Gram-negative facultative anaerobic bacteria belonging to the family of Enterobacteriaceae, are major threats to the health of humans and animals. The recent availability of complete genome data of pathogenic strains of the S. enterica gives new avenues for the identification of drug targets and drug candidates. We have used the genomic and metabolic pathway data to identify pathways and proteins essential to the pathogen and absent from the host. We took the whole proteome sequence data of 42 strains of S. enterica and Homo sapiens along with KEGG-annotated metabolic pathway data, clustered proteins sequences using CD-HIT, identified essential genes using DEG database and discarded S. enterica homologs of human proteins in unique metabolic pathways (UMPs) and characterized hypothetical proteins with SVM-prot and InterProScan. Through this core proteomic analysis we have identified enzymes essential to the pathogen. The identification of 73 enzymes common in 42 strains of S. enterica is the real strength of the current study. We proposed all 73 unexplored enzymes as potential drug targets against the infections caused by the S. enterica. The study is comprehensive around S. enterica and simultaneously considered every possible pathogenic strain of S. enterica. This comprehensiveness turned the current study significant since, to the best of our knowledge it is the first subtractive core proteomic analysis of the unique metabolic pathways applied to any pathogen for the identification of drug targets. We applied extensive computational methods to shortlist few potential drug targets considering the druggability criteria e.g. Non-homologous to the human host, essential to the pathogen and playing significant role in essential metabolic pathways of the pathogen (i.e. S. enterica). In the current study, the subtractive proteomics through a novel approach was applied i.e. by considering only proteins of the unique metabolic pathways of the pathogens and mining the proteomic data of all completely sequenced strains of the pathogen, thus improving the quality and application of the results. We believe that the sharing of the knowledge from this study would eventually lead to bring about novel and unique therapeutic regimens against the infections caused by the S. enterica.
Tandem Mass Spectrum Sequencing: An Alternative to Database Search Engines in Shotgun Proteomics.
Muth, Thilo; Rapp, Erdmann; Berven, Frode S; Barsnes, Harald; Vaudel, Marc
2016-01-01
Protein identification via database searches has become the gold standard in mass spectrometry based shotgun proteomics. However, as the quality of tandem mass spectra improves, direct mass spectrum sequencing gains interest as a database-independent alternative. In this chapter, the general principle of this so-called de novo sequencing is introduced along with pitfalls and challenges of the technique. The main tools available are presented with a focus on user friendly open source software which can be directly applied in everyday proteomic workflows.
Audain, Enrique; Uszkoreit, Julian; Sachsenberg, Timo; Pfeuffer, Julianus; Liang, Xiao; Hermjakob, Henning; Sanchez, Aniel; Eisenacher, Martin; Reinert, Knut; Tabb, David L; Kohlbacher, Oliver; Perez-Riverol, Yasset
2017-01-06
In mass spectrometry-based shotgun proteomics, protein identifications are usually the desired result. However, most of the analytical methods are based on the identification of reliable peptides and not the direct identification of intact proteins. Thus, assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is a critical step in proteomics research. Currently, different protein inference algorithms and tools are available for the proteomics community. Here, we evaluated five software tools for protein inference (PIA, ProteinProphet, Fido, ProteinLP, MSBayesPro) using three popular database search engines: Mascot, X!Tandem, and MS-GF+. All the algorithms were evaluated using a highly customizable KNIME workflow using four different public datasets with varying complexities (different sample preparation, species and analytical instruments). We defined a set of quality control metrics to evaluate the performance of each combination of search engines, protein inference algorithm, and parameters on each dataset. We show that the results for complex samples vary not only regarding the actual numbers of reported protein groups but also concerning the actual composition of groups. Furthermore, the robustness of reported proteins when using databases of differing complexities is strongly dependant on the applied inference algorithm. Finally, merging the identifications of multiple search engines does not necessarily increase the number of reported proteins, but does increase the number of peptides per protein and thus can generally be recommended. Protein inference is one of the major challenges in MS-based proteomics nowadays. Currently, there are a vast number of protein inference algorithms and implementations available for the proteomics community. Protein assembly impacts in the final results of the research, the quantitation values and the final claims in the research manuscript. Even though protein inference is a crucial step in proteomics data analysis, a comprehensive evaluation of the many different inference methods has never been performed. Previously Journal of proteomics has published multiple studies about other benchmark of bioinformatics algorithms (PMID: 26585461; PMID: 22728601) in proteomics studies making clear the importance of those studies for the proteomics community and the journal audience. This manuscript presents a new bioinformatics solution based on the KNIME/OpenMS platform that aims at providing a fair comparison of protein inference algorithms (https://github.com/KNIME-OMICS). Six different algorithms - ProteinProphet, MSBayesPro, ProteinLP, Fido and PIA- were evaluated using the highly customizable workflow on four public datasets with varying complexities. Five popular database search engines Mascot, X!Tandem, MS-GF+ and combinations thereof were evaluated for every protein inference tool. In total >186 proteins lists were analyzed and carefully compare using three metrics for quality assessments of the protein inference results: 1) the numbers of reported proteins, 2) peptides per protein, and the 3) number of uniquely reported proteins per inference method, to address the quality of each inference method. We also examined how many proteins were reported by choosing each combination of search engines, protein inference algorithms and parameters on each dataset. The results show that using 1) PIA or Fido seems to be a good choice when studying the results of the analyzed workflow, regarding not only the reported proteins and the high-quality identifications, but also the required runtime. 2) Merging the identifications of multiple search engines gives almost always more confident results and increases the number of peptides per protein group. 3) The usage of databases containing not only the canonical, but also known isoforms of proteins has a small impact on the number of reported proteins. The detection of specific isoforms could, concerning the question behind the study, compensate for slightly shorter reports using the parsimonious reports. 4) The current workflow can be easily extended to support new algorithms and search engine combinations. Copyright © 2016. Published by Elsevier B.V.
Guan, Xiaoyan; Brownstein, Naomi C; Young, Nicolas L; Marshall, Alan G
2017-01-30
Bottom-up tandem mass spectrometry (MS/MS) is regularly used in proteomics to identify proteins from a sequence database. De novo sequencing is also available for sequencing peptides with relatively short sequence lengths. We recently showed that paired Lys-C and Lys-N proteases produce peptides of identical mass and similar retention time, but different tandem mass spectra. Such parallel experiments provide complementary information, and allow for up to 100% MS/MS sequence coverage. Here, we report digestion by paired Lys-C and Lys-N proteases of a seven-protein mixture: human hemoglobin alpha, bovine carbonic anhydrase 2, horse skeletal muscle myoglobin, hen egg white lysozyme, bovine pancreatic ribonuclease, bovine rhodanese, and bovine serum albumin, followed by reversed-phase nanoflow liquid chromatography, collision-induced dissociation, and 14.5 T Fourier transform ion cyclotron resonance mass spectrometry. Matched pairs of product peptide ions of equal precursor mass and similar retention times from each digestion are compared, leveraging single-residue transposed information with independent interferences to confidently identify fragment ion types, residues, and peptides. Selected pairs of product ion mass spectra for de novo sequenced protein segments from each member of the mixture are presented. Pairs of the transposed product ions as well as complementary information from the parallel experiments allow for both high MS/MS coverage for long peptide sequences and high confidence in the amino acid identification. Moreover, the parallel experiments in the de novo sequencing reduce false-positive matches of product ions from the single-residue transposed peptides from the same segment, and thereby further improve the confidence in protein identification. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Mesquita, Rosilene Oliveira; de Almeida Soares, Eduardo; de Barros, Everaldo Gonçalves; Loureiro, Marcelo Ehlers
2012-01-01
The most critical step in any proteomic study is protein extraction and sample preparation. Better solubilization increases the separation and resolution of gels, allowing identification of a higher number of proteins and more accurate quantitation of differences in gene expression. Despite the existence of published results for the optimization of proteomic analyses of soybean seeds, no comparable data are available for proteomic studies of soybean leaf tissue. In this work we have tested the effects of modification of a TCA-acetone method on the resolution of 2-DE gels of leaves and roots of soybean. Better focusing was obtained when both mercaptoethanol and dithiothreitol were used in the extraction buffer simultaneously. Increasing the number of washes of TCA precipitated protein with acetone, using a final wash with 80% ethanol and using sonication to ressuspend the pellet increased the number of detected proteins as well the resolution of the 2-DE gels. Using this approach we have constructed a soybean protein map. The major group of identified proteins corresponded to genes of unknown function. The second and third most abundant groups of proteins were composed of photosynthesis and metabolism related genes. The resulting protocol improved protein solubility and gel resolution allowing the identification of 122 soybean leaf proteins, 72 of which were not detected in other published soybean leaf 2-DE gel datasets, including a transcription factor and several signaling proteins. PMID:22802721
Proteomic and Bioinformatic Profile of Primary Human Oral Epithelial Cells
Ghosh, Santosh K.; Yohannes, Elizabeth; Bebek, Gurkan; Weinberg, Aaron; Jiang, Bin; Willard, Belinda; Chance, Mark R.; Kinter, Michael T.; McCormick, Thomas S.
2012-01-01
Wounding of the oral mucosa occurs frequently in a highly septic environment. Remarkably, these wounds heal quickly and the oral cavity, for the most part, remains healthy. Deciphering the normal human oral epithelial cell (NHOEC) proteome is critical for understanding the mechanism(s) of protection elicited when the mucosal barrier is intact, as well as when it is breached. Combining 2D gel electrophoresis with shotgun proteomics resulted in identification of 1662 NHOEC proteins. Proteome annotations were performed based on protein classes, molecular functions, disease association and membership in canonical and metabolic signaling pathways. Comparing the NHOEC proteome with a database of innate immunity-relevant interactions (InnateDB) identified 64 common proteins associated with innate immunity. Comparison with published salivary proteomes revealed that 738/1662 NHOEC proteins were common, suggesting that significant numbers of salivary proteins are of epithelial origin. Gene ontology analysis showed similarities in the distributions of NHOEC and saliva proteomes with regard to biological processes, and molecular functions. We also assessed the inter-individual variability of the NHOEC proteome and observed it to be comparable with other primary cells. The baseline proteome described in this study should serve as a resource for proteome studies of the oral mucosa, especially in relation to disease processes. PMID:23035736
Unraveling snake venom complexity with 'omics' approaches: challenges and perspectives.
Zelanis, André; Tashima, Alexandre Keiji
2014-09-01
The study of snake venom proteomes (venomics) has been experiencing a burst of reports, however the comprehensive knowledge of the dynamic range of proteins present within a single venom, the set of post-translational modifications (PTMs) as well as the lack of a comprehensive database related to venom proteins are among the main challenges in venomics research. The phenotypic plasticity in snake venom proteomes together with their inherent toxin proteoform diversity, points out to the use of integrative analysis in order to better understand their actual complexity. In this regard, such a systems venomics task should encompass the integration of data from transcriptomic and proteomic studies (specially the venom gland proteome), the identification of biological PTMs, and the estimation of artifactual proteomes and peptidomes generated by sample handling procedures. Copyright © 2014 Elsevier Ltd. All rights reserved.
Cassidy, Liam; Prasse, Daniela; Linke, Dennis; Schmitz, Ruth A; Tholey, Andreas
2016-10-07
The recent discovery of an increasing number of small open reading frames (sORF) creates the need for suitable analytical technologies for the comprehensive identification of the corresponding gene products. For biological and functional studies the knowledge of the entire set of proteins and sORF gene products is essential. Consequently in the present study we evaluated analytical approaches that will allow for simultaneous analysis of widest parts of the proteome together with the predicted sORF. We performed a full proteome analysis of the methane producing archaeon Methanosarcina mazei strain Gö1 cytosolic proteome using a high/low pH reversed phase LC-MS bottom-up approach. The second analytical approach was based on semi-top-down strategy, encompassing a separation at intact protein level using a GelFree system, followed by digestion and LC-MS analysis. A high overlap in identified proteins was found for both approaches yielding the most comprehensive coverage of the cytosolic proteome of this organism achieved so far. The application of the second approach in combination with an adjustment of the search criteria for database searches further led to a significant increase of sORF peptide identifications, finally allowing to detect and identify 28 sORF gene products.
Veit, Johannes; Sachsenberg, Timo; Chernev, Aleksandar; Aicheler, Fabian; Urlaub, Henning; Kohlbacher, Oliver
2016-09-02
Modern mass spectrometry setups used in today's proteomics studies generate vast amounts of raw data, calling for highly efficient data processing and analysis tools. Software for analyzing these data is either monolithic (easy to use, but sometimes too rigid) or workflow-driven (easy to customize, but sometimes complex). Thermo Proteome Discoverer (PD) is a powerful software for workflow-driven data analysis in proteomics which, in our eyes, achieves a good trade-off between flexibility and usability. Here, we present two open-source plugins for PD providing additional functionality: LFQProfiler for label-free quantification of peptides and proteins, and RNP(xl) for UV-induced peptide-RNA cross-linking data analysis. LFQProfiler interacts with existing PD nodes for peptide identification and validation and takes care of the entire quantitative part of the workflow. We show that it performs at least on par with other state-of-the-art software solutions for label-free quantification in a recently published benchmark ( Ramus, C.; J. Proteomics 2016 , 132 , 51 - 62 ). The second workflow, RNP(xl), represents the first software solution to date for identification of peptide-RNA cross-links including automatic localization of the cross-links at amino acid resolution and localization scoring. It comes with a customized integrated cross-link fragment spectrum viewer for convenient manual inspection and validation of the results.
Sheng, Quanhu; Li, Rongxia; Dai, Jie; Li, Qingrun; Su, Zhiduan; Guo, Yan; Li, Chen; Shyr, Yu; Zeng, Rong
2015-01-01
Isobaric labeling techniques coupled with high-resolution mass spectrometry have been widely employed in proteomic workflows requiring relative quantification. For each high-resolution tandem mass spectrum (MS/MS), isobaric labeling techniques can be used not only to quantify the peptide from different samples by reporter ions, but also to identify the peptide it is derived from. Because the ions related to isobaric labeling may act as noise in database searching, the MS/MS spectrum should be preprocessed before peptide or protein identification. In this article, we demonstrate that there are a lot of high-frequency, high-abundance isobaric related ions in the MS/MS spectrum, and removing isobaric related ions combined with deisotoping and deconvolution in MS/MS preprocessing procedures significantly improves the peptide/protein identification sensitivity. The user-friendly software package TurboRaw2MGF (v2.0) has been implemented for converting raw TIC data files to mascot generic format files and can be downloaded for free from https://github.com/shengqh/RCPA.Tools/releases as part of the software suite ProteomicsTools. The data have been deposited to the ProteomeXchange with identifier PXD000994. PMID:25435543
Data Portal | Office of Cancer Clinical Proteomics Research
The CPTAC Data Portal is a centralized repository for the public dissemination of proteomic sequence datasets collected by CPTAC, along with corresponding genomic sequence datasets. In addition, available are analyses of CPTAC's raw mass spectrometry-based data files (mapping of spectra to peptide sequences and protein identification) by individual investigators from CPTAC and by a Common Data Analysis Pipeline.
Tryptic digests of human serum albumin (HSA) and human lung epithelial cell lysates were used as test samples in a novel proteomics study. Peptides were separated and analyzed using 2D-nano-LC/MSMS with strong cation exchange (SCX) and reverse phase (RP) chromatography and contin...
Duan, Xiaotao; Young, Rebecca; Straubinger, Robert M.; Page, Brian J.; Cao, Jin; Wang, Hao; Yu, Haoying; Canty, John M.; Qu, Jun
2009-01-01
For label-free expression profiling of tissue proteomes, efficient protein extraction, thorough and quantitative sample cleanup and digestion procedures, as well as sufficient and reproducible chromatographic separation, are highly desirable but remain challenging. However, optimal methodology has remained elusive, especially for proteomes that are rich in membrane proteins, such as the mitochondria. Here we describe a straightforward and reproducible sample preparation procedure, coupled with a highly selective and sensitive nano-LC/Orbitrap analysis, which enables reliable and comprehensive expression profiling of tissue mitochondria. The mitochondrial proteome of swine heart was selected as a test system. Efficient protein extraction was accomplished using a strong buffer containing both ionic and non-ionic detergents. Overnight precipitation was used for cleanup of the extract, and the sample was subjected to an optimized 2-step, on-pellet digestion approach. In the first step, the protein pellet was dissolved via a 4 h tryptic digestion under vigorous agitation, which nano-LC/LTQ/ETD showed to produce large and incompletely cleaved tryptic peptides. The mixture was then reduced, alkylated, and digested into its full complement of tryptic peptides with additional trypsin. This solvent precipitation/on-pellet digestion procedure achieved significantly higher and more reproducible peptide recovery of the mitochondrial preparation, than observed using a prevalent alternative procedure for label-free expression profiling, SDS-PAGE/in-gel digestion (87% vs. 54%). Furthermore, uneven peptide losses were lower than observed with SDS-PAGE/in-gel digestion. The resulting peptides were sufficiently resolved by a 5 h gradient using a nano-LC configuration that features a low-void-volume, high chromatographic reproducibility, and an LTQ/Orbitrap analyzer for protein identification and quantification. The developed method was employed for label-free comparison of the mitochondrial proteomes of myocardium from healthy animals vs. those with hibernating myocardium. Each experimental group consisted of a relatively large number of animals (n=10), and samples were analyzed in random order to minimize quantitative false-positives. Using this approach, 904 proteins were identified and quantified with high confidence, and those mitochondrial proteins that were altered significantly between groups were compared with the results of a parallel 2D-DIGE analysis. The sample preparation and analytical strategy developed here represents an advancement that can be adapted to analyze other tissue proteomes. PMID:19290621
A Bioinformatics Workflow for Variant Peptide Detection in Shotgun Proteomics*
Li, Jing; Su, Zengliu; Ma, Ze-Qiang; Slebos, Robbert J. C.; Halvey, Patrick; Tabb, David L.; Liebler, Daniel C.; Pao, William; Zhang, Bing
2011-01-01
Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from been identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow on data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics. PMID:21389108
Weissgerber, Thomas; Sylvester, Marc; Kröninger, Lena
2014-01-01
In the present study, we compared the proteome response of Allochromatium vinosum when growing photoautotrophically in the presence of sulfide, thiosulfate, and elemental sulfur with the proteome response when the organism was growing photoheterotrophically on malate. Applying tandem mass tag analysis as well as two-dimensional (2D) PAGE, we detected 1,955 of the 3,302 predicted proteins by identification of at least two peptides (59.2%) and quantified 1,848 of the identified proteins. Altered relative protein amounts (≥1.5-fold) were observed for 385 proteins, corresponding to 20.8% of the quantified A. vinosum proteome. A significant number of the proteins exhibiting strongly enhanced relative protein levels in the presence of reduced sulfur compounds are well documented essential players during oxidative sulfur metabolism, e.g., the dissimilatory sulfite reductase DsrAB. Changes in protein levels generally matched those observed for the respective relative mRNA levels in a previous study and allowed identification of new genes/proteins participating in oxidative sulfur metabolism. One gene cluster (hyd; Alvin_2036-Alvin_2040) and one hypothetical protein (Alvin_2107) exhibiting strong responses on both the transcriptome and proteome levels were chosen for gene inactivation and phenotypic analyses of the respective mutant strains, which verified the importance of the so-called Isp hydrogenase supercomplex for efficient oxidation of sulfide and a crucial role of Alvin_2107 for the oxidation of sulfur stored in sulfur globules to sulfite. In addition, we analyzed the sulfur globule proteome and identified a new sulfur globule protein (SgpD; Alvin_2515). PMID:24487535
Zhu, Ying; Zhao, Rui; Piehowski, Paul D.; ...
2017-09-01
One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here in this study, we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-μm-i.d. columns increase signal intensity by >3-fold relative to those using 75-μm-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos MS significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap),more » leading to a ~3-fold increase in peptide identifications and 1.7-fold increase in identified protein groups for 2 ng tryptic digests of the bacterium S. oneidensis. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by ~95% for 0.5 ng samples and by ~42% for 2 ng samples. Using the best combination of the above variables, we were able to identify >3,000 proteins from 10 ng tryptic digests from both HeLa and THP-1 mammalian cell lines. We also identified >950 proteins from subnanogram archaeal/bacterial cocultures. Finally, the present ultrasensitive LC-MS platform achieves a level of proteome coverage not previously realized for ultra-small sample loadings, and is expected to facilitate the analysis of subnanogram samples, including single mammalian cells.« less
Nuclear proteome analysis of undifferentiated mouse embryonic stem and germ cells.
Buhr, Nicolas; Carapito, Christine; Schaeffer, Christine; Kieffer, Emmanuelle; Van Dorsselaer, Alain; Viville, Stéphane
2008-06-01
Embryonic stem cells (ESCs) and embryonic germ cells (EGCs) provide exciting models for understanding the underlying mechanisms that make a cell pluripotent. Indeed, such understanding would enable dedifferentiation and reprogrammation of any cell type from a patient needing a cell therapy treatment. Proteome analysis has emerged as an important technology for deciphering these biological processes and thereby ESC and EGC proteomes are increasingly studied. Nevertheless, their nuclear proteomes have only been poorly investigated up to now. In order to investigate signaling pathways potentially involved in pluripotency, proteomic analyses have been performed on mouse ESC and EGC nuclear proteins. Nuclei from ESCs and EGCs at undifferentiated stage were purified by subcellular fractionation. After 2-D separation, a subtractive strategy (subtracting culture environment contaminating spots) was applied and a comparison of ESC, (8.5 day post coïtum (dpc))-EGC and (11.5 dpc)-EGC specific nuclear proteomes was performed. A total of 33 ESC, 53 (8.5 dpc)-EGC, and 36 (11.5 dpc)-EGC spots were identified by MALDI-TOF-MS and/or nano-LC-MS/MS. This approach led to the identification of two isoforms (with and without N-terminal acetylation) of a known pluripotency marker, namely developmental pluripotency associated 5 (DPPA5), which has never been identified before in 2-D gel-MS studies of ESCs and EGCs. Furthermore, we demonstrated the efficiency of our subtracting strategy, in association with a nuclear subfractionation by the identification of a new protein (protein arginine N-methyltransferase 7; PRMT7) behaving as proteins involved in pluripotency.
Statistical design of quantitative mass spectrometry-based proteomic experiments.
Oberg, Ann L; Vitek, Olga
2009-05-01
We review the fundamental principles of statistical experimental design, and their application to quantitative mass spectrometry-based proteomics. We focus on class comparison using Analysis of Variance (ANOVA), and discuss how randomization, replication and blocking help avoid systematic biases due to the experimental procedure, and help optimize our ability to detect true quantitative changes between groups. We also discuss the issues of pooling multiple biological specimens for a single mass analysis, and calculation of the number of replicates in a future study. When applicable, we emphasize the parallels between designing quantitative proteomic experiments and experiments with gene expression microarrays, and give examples from that area of research. We illustrate the discussion using theoretical considerations, and using real-data examples of profiling of disease.
Proteomics and plant disease: advances in combating a major threat to the global food supply.
Rampitsch, Christof; Bykova, Natalia V
2012-02-01
The study of plant disease and immunity is benefiting tremendously from proteomics. Parallel streams of research from model systems, from pathogens in vitro and from the relevant pathogen-crop interactions themselves have begun to reveal a model of how plants succumb to invading pathogens and how they defend themselves without the benefit of a circulating immune system. In this review, we discuss the contribution of proteomics to these advances, drawing mainly on examples from crop-fungus interactions, from Arabidopsis-bacteria interactions, from elicitor-based model systems and from pathogen studies, to highlight also the important contribution of non-crop systems to advancing crop protection. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Deswal, Renu; Abat, Jasmeet Kaur; Sehrawat, Ankita; Gupta, Ravi; Kashyap, Prakriti; Sharma, Shruti; Sharma, Bhavana; Chaurasia, Satya Prakash; Chanu, Sougrakpam Yaiphabi; Masi, Antonio; Agrawal, Ganesh Kumar; Sarkar, Abhijit; Agrawal, Raj; Dunn, Michael J; Renaut, Jenny; Rakwal, Randeep
2014-07-01
International Plant Proteomics Organization (INPPO) outlined ten initiatives to promote plant proteomics in each and every country. With greater emphasis in developing countries, one of those was to "organize workshops at national and international levels to train manpower and exchange information". This third INPPO highlights covers the workshop organized for the very first time in a developing country, India, at the Department of Botany in University of Delhi on December 26-30, 2013 titled - "1(st) Plant Proteomics Workshop / Training Program" under the umbrella of INPPO India-Nepal chapter. Selected 20 participants received on-hand training mainly on gel-based proteomics approach along with manual booklet and parallel lectures on this and associated topics. In house, as well as invited experts drawn from other Universities and Institutes (national and international), delivered talks on different aspects of gel-based and gel-free proteomics. Importance of gel-free proteomics approach, translational proteomics, and INPPO roles were presented and interactively discussed by a group of three invited speakers Drs. Ganesh Kumar Agrawal (Nepal), Randeep Rakwal (Japan), and Antonio Masi (Italy). Given the output of this systematic workshop, it was proposed and thereafter decided to be organized every alternate year; the next workshop will be held in 2015. Furthermore, possibilities on providing advanced training to those students / researchers / teachers with basic knowledge in proteomics theory and experiments at national and international levels were discussed. INPPO is committed to generating next-generation trained manpower in proteomics, and it would only happen by the firm determination of scientists to come forward and do it. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
From a 2DE-gel spot to protein function: lesson learned from HS1 in chronic lymphocytic leukemia.
Apollonio, Benedetta; Bertilaccio, Maria Teresa Sabrina; Restuccia, Umberto; Ranghetti, Pamela; Barbaglio, Federica; Ghia, Paolo; Caligaris-Cappio, Federico; Scielzo, Cristina
2014-10-19
The identification of molecules involved in tumor initiation and progression is fundamental for understanding disease's biology and, as a consequence, for the clinical management of patients. In the present work we will describe an optimized proteomic approach for the identification of molecules involved in the progression of Chronic Lymphocytic Leukemia (CLL). In detail, leukemic cell lysates are resolved by 2-dimensional Electrophoresis (2DE) and visualized as "spots" on the 2DE gels. Comparative analysis of proteomic maps allows the identification of differentially expressed proteins (in terms of abundance and post-translational modifications) that are picked, isolated and identified by Mass Spectrometry (MS). The biological function of the identified candidates can be tested by different assays (i.e. migration, adhesion and F-actin polymerization), that we have optimized for primary leukemic cells.
Combining results of multiple search engines in proteomics.
Shteynberg, David; Nesvizhskii, Alexey I; Moritz, Robert L; Deutsch, Eric W
2013-09-01
A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence from the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; individual engines each provide unique correct identifications among a core set of correlative identifications. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques.
Combining Results of Multiple Search Engines in Proteomics*
Shteynberg, David; Nesvizhskii, Alexey I.; Moritz, Robert L.; Deutsch, Eric W.
2013-01-01
A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence from the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; individual engines each provide unique correct identifications among a core set of correlative identifications. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques. PMID:23720762
Parasites, proteomes and systems: has Descartes' clock run out of time?
Wastling, J M; Armstrong, S D; Krishna, R; Xia, D
2012-08-01
Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types.
The online Tabloid Proteome: an annotated database of protein associations
Turan, Demet; Tavernier, Jan
2018-01-01
Abstract A complete knowledge of the proteome can only be attained by determining the associations between proteins, along with the nature of these associations (e.g. physical contact in protein–protein interactions, participation in complex formation or different roles in the same pathway). Despite extensive efforts in elucidating direct protein interactions, our knowledge on the complete spectrum of protein associations remains limited. We therefore developed a new approach that detects protein associations from identifications obtained after re-processing of large-scale, public mass spectrometry-based proteomics data. Our approach infers protein association based on the co-occurrence of proteins across many different proteomics experiments, and provides information that is almost completely complementary to traditional direct protein interaction studies. We here present a web interface to query and explore the associations derived from this method, called the online Tabloid Proteome. The online Tabloid Proteome also integrates biological knowledge from several existing resources to annotate our derived protein associations. The online Tabloid Proteome is freely available through a user-friendly web interface, which provides intuitive navigation and data exploration options for the user at http://iomics.ugent.be/tabloidproteome. PMID:29040688
Parasites, proteomes and systems: has Descartes’ clock run out of time?
WASTLING, J. M.; ARMSTRONG, S. D.; KRISHNA, R.; XIA, D.
2012-01-01
SUMMARY Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types. PMID:22828391
The Gel Electrophoresis Markup Language (GelML) from the Proteomics Standards Initiative
Gibson, Frank; Hoogland, Christine; Martinez-Bartolomé, Salvador; Medina-Aunon, J. Alberto; Albar, Juan Pablo; Babnigg, Gyorgy; Wipat, Anil; Hermjakob, Henning; Almeida, Jonas S; Stanislaus, Romesh; Paton, Norman W; Jones, Andrew R
2011-01-01
The Human Proteome Organisation’s Proteomics Standards Initiative (HUPO-PSI) has developed the GelML data exchange format for representing gel electrophoresis experiments performed in proteomics investigations. The format closely follows the reporting guidelines for gel electrophoresis, which are part of the Minimum Information About a Proteomics Experiment (MIAPE) set of modules. GelML supports the capture of metadata (such as experimental protocols) and data (such as gel images) resulting from gel electrophoresis so that laboratories can be compliant with the MIAPE Gel Electrophoresis guidelines, while allowing such data sets to be exchanged or downloaded from public repositories. The format is sufficiently flexible to capture data from a broad range of experimental processes, and complements other PSI formats for mass spectrometry data and the results of protein and peptide identifications to capture entire gel-based proteome workflows. GelML has resulted from the open standardisation process of PSI consisting of both public consultation and anonymous review of the specifications. PMID:20677327
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gritsenko, Marina A.; Xu, Zhe; Liu, Tao
Comprehensive, quantitative information on abundances of proteins and their post-translational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labelling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification andmore » quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples, and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.« less
A proteomic approach to obesity and type 2 diabetes
López-Villar, Elena; Martos-Moreno, Gabriel Á; Chowen, Julie A; Okada, Shigeru; Kopchick, John J; Argente, Jesús
2015-01-01
The incidence of obesity and type diabetes 2 has increased dramatically resulting in an increased interest in its biomedical relevance. However, the mechanisms that trigger the development of diabetes type 2 in obese patients remain largely unknown. Scientific, clinical and pharmaceutical communities are dedicating vast resources to unravel this issue by applying different omics tools. During the last decade, the advances in proteomic approaches and the Human Proteome Organization have opened and are opening a new door that may be helpful in the identification of patients at risk and to improve current therapies. Here, we briefly review some of the advances in our understanding of type 2 diabetes that have occurred through the application of proteomics. We also review, in detail, the current improvements in proteomic methodologies and new strategies that could be employed to further advance our understanding of this pathology. By applying these new proteomic advances, novel therapeutic and/or diagnostic protein targets will be discovered in the obesity/Type 2 diabetes area. PMID:25960181
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Zhou; Adams, Rachel M; Chourey, Karuna
2012-01-01
A variety of quantitative proteomics methods have been developed, including label-free, metabolic labeling, and isobaric chemical labeling using iTRAQ or TMT. Here, these methods were compared in terms of the depth of proteome coverage, quantification accuracy, precision, and reproducibility using a high-performance hybrid mass spectrometer, LTQ Orbitrap Velos. Our results show that (1) the spectral counting method provides the deepest proteome coverage for identification, but its quantification performance is worse than labeling-based approaches, especially the quantification reproducibility; (2) metabolic labeling and isobaric chemical labeling are capable of accurate, precise, and reproducible quantification and provide deep proteome coverage for quantification. Isobaricmore » chemical labeling surpasses metabolic labeling in terms of quantification precision and reproducibility; (3) iTRAQ and TMT perform similarly in all aspects compared in the current study using a CID-HCD dual scan configuration. Based on the unique advantages of each method, we provide guidance for selection of the appropriate method for a quantitative proteomics study.« less
Gritsenko, Marina A; Xu, Zhe; Liu, Tao; Smith, Richard D
2016-01-01
Comprehensive, quantitative information on abundances of proteins and their posttranslational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labeling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification and quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.
The gel electrophoresis markup language (GelML) from the Proteomics Standards Initiative.
Gibson, Frank; Hoogland, Christine; Martinez-Bartolomé, Salvador; Medina-Aunon, J Alberto; Albar, Juan Pablo; Babnigg, Gyorgy; Wipat, Anil; Hermjakob, Henning; Almeida, Jonas S; Stanislaus, Romesh; Paton, Norman W; Jones, Andrew R
2010-09-01
The Human Proteome Organisation's Proteomics Standards Initiative has developed the GelML (gel electrophoresis markup language) data exchange format for representing gel electrophoresis experiments performed in proteomics investigations. The format closely follows the reporting guidelines for gel electrophoresis, which are part of the Minimum Information About a Proteomics Experiment (MIAPE) set of modules. GelML supports the capture of metadata (such as experimental protocols) and data (such as gel images) resulting from gel electrophoresis so that laboratories can be compliant with the MIAPE Gel Electrophoresis guidelines, while allowing such data sets to be exchanged or downloaded from public repositories. The format is sufficiently flexible to capture data from a broad range of experimental processes, and complements other PSI formats for MS data and the results of protein and peptide identifications to capture entire gel-based proteome workflows. GelML has resulted from the open standardisation process of PSI consisting of both public consultation and anonymous review of the specifications.
Beyond the proteome: Mass Spectrometry Special Interest Group (MS-SIG) at ISMB/ECCB 2013
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ryu, Soyoung; Payne, Samuel H.; Schaab, Christoph
2014-07-02
Mass spectrometry special interest group (MS-SIG) aims to bring together experts from the global research community to discuss highlights and challenges in the field of mass spectrometry (MS)-based proteomics and computational biology. The rapid echnological developments in MS-based proteomics have enabled the generation of a large amount of meaningful information on hundreds to thousands of proteins simultaneously from a biological sample; however, the complexity of the MS data require sophisticated computational algorithms and software for data analysis and interpretation. This year’s MS-SIG meeting theme was ‘Beyond the Proteome’ with major focuses on improving protein identification/quantification and using proteomics data tomore » solve interesting problems in systems biology and clinical research.« less
Making a protein extract from plant pathogenic fungi for gel- and LC-based proteomics.
Fernández, Raquel González; Redondo, Inmaculada; Jorrin-Novo, Jesus V
2014-01-01
Proteomic technologies have become a successful tool to provide relevant information on fungal biology. In the case of plant pathogenic fungi, this approach would allow a deeper knowledge of the interaction and the biological cycle of the pathogen, as well as the identification of pathogenicity and virulence factors. These two elements open up new possibilities for crop disease diagnosis and environment-friendly crop protection. Phytopathogenic fungi, due to its particular cellular characteristics, can be considered as a recalcitrant biological material, which makes it difficult to obtain quality protein samples for proteomic analysis. This chapter focuses on protein extraction for gel- and LC-based proteomics with specific protocols of our current research with Botrytis cinerea.
Comparative bioinformatics analyses and profiling of lysosome-related organelle proteomes
NASA Astrophysics Data System (ADS)
Hu, Zhang-Zhi; Valencia, Julio C.; Huang, Hongzhan; Chi, An; Shabanowitz, Jeffrey; Hearing, Vincent J.; Appella, Ettore; Wu, Cathy
2007-01-01
Complete and accurate profiling of cellular organelle proteomes, while challenging, is important for the understanding of detailed cellular processes at the organelle level. Mass spectrometry technologies coupled with bioinformatics analysis provide an effective approach for protein identification and functional interpretation of organelle proteomes. In this study, we have compiled human organelle reference datasets from large-scale proteomic studies and protein databases for seven lysosome-related organelles (LROs), as well as the endoplasmic reticulum and mitochondria, for comparative organelle proteome analysis. Heterogeneous sources of human organelle proteins and rodent homologs are mapped to human UniProtKB protein entries based on ID and/or peptide mappings, followed by functional annotation and categorization using the iProXpress proteomic expression analysis system. Cataloging organelle proteomes allows close examination of both shared and unique proteins among various LROs and reveals their functional relevance. The proteomic comparisons show that LROs are a closely related family of organelles. The shared proteins indicate the dynamic and hybrid nature of LROs, while the unique transmembrane proteins may represent additional candidate marker proteins for LROs. This comparative analysis, therefore, provides a basis for hypothesis formulation and experimental validation of organelle proteins and their functional roles.
Wimmer, Helge; Gundacker, Nina C; Griss, Johannes; Haudek, Verena J; Stättner, Stefan; Mohr, Thomas; Zwickl, Hannes; Paulitschke, Verena; Baron, David M; Trittner, Wolfgang; Kubicek, Markus; Bayer, Editha; Slany, Astrid; Gerner, Christopher
2009-06-01
Interpretation of proteome data with a focus on biomarker discovery largely relies on comparative proteome analyses. Here, we introduce a database-assisted interpretation strategy based on proteome profiles of primary cells. Both 2-D-PAGE and shotgun proteomics are applied. We obtain high data concordance with these two different techniques. When applying mass analysis of tryptic spot digests from 2-D gels of cytoplasmic fractions, we typically identify several hundred proteins. Using the same protein fractions, we usually identify more than thousand proteins by shotgun proteomics. The data consistency obtained when comparing these independent data sets exceeds 99% of the proteins identified in the 2-D gels. Many characteristic differences in protein expression of different cells can thus be independently confirmed. Our self-designed SQL database (CPL/MUW - database of the Clinical Proteomics Laboratories at the Medical University of Vienna accessible via www.meduniwien.ac.at/proteomics/database) facilitates (i) quality management of protein identification data, which are based on MS, (ii) the detection of cell type-specific proteins and (iii) of molecular signatures of specific functional cell states. Here, we demonstrate, how the interpretation of proteome profiles obtained from human liver tissue and hepatocellular carcinoma tissue is assisted by the Clinical Proteomics Laboratories at the Medical University of Vienna-database. Therefore, we suggest that the use of reference experiments supported by a tailored database may substantially facilitate data interpretation of proteome profiling experiments.
The role of targeted chemical proteomics in pharmacology
Sutton, Chris W
2012-01-01
Traditionally, proteomics is the high-throughput characterization of the global complement of proteins in a biological system using cutting-edge technologies (robotics and mass spectrometry) and bioinformatics tools (Internet-based search engines and databases). As the field of proteomics has matured, a diverse range of strategies have evolved to answer specific problems. Chemical proteomics is one such direction that provides the means to enrich and detect less abundant proteins (the ‘hidden’ proteome) from complex mixtures of wide dynamic range (the ‘deep’ proteome). In pharmacology, chemical proteomics has been utilized to determine the specificity of drugs and their analogues, for anticipated known targets, only to discover other proteins that bind and could account for side effects observed in preclinical and clinical trials. As a consequence, chemical proteomics provides a valuable accessory in refinement of second- and third-generation drug design for treatment of many diseases. However, determining definitive affinity capture of proteins by a drug immobilized on soft gel chromatography matrices has highlighted some of the challenges that remain to be addressed. Examples of the different strategies that have emerged using well-established drugs against pharmaceutically important enzymes, such as protein kinases, metalloproteases, PDEs, cytochrome P450s, etc., indicate the potential opportunity to employ chemical proteomics as an early-stage screening approach in the identification of new targets. PMID:22074351
Characterization of individual mouse cerebrospinal fluid proteomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, Jeffrey S.; Angel, Thomas E.; Chavkin, Charles
2014-03-20
Analysis of cerebrospinal fluid (CSF) offers key insight into the status of the central nervous system. Characterization of murine CSF proteomes can provide a valuable resource for studying central nervous system injury and disease in animal models. However, the small volume of CSF in mice has thus far limited individual mouse proteome characterization. Through non-terminal CSF extractions in C57Bl/6 mice and high-resolution liquid chromatography-mass spectrometry analysis of individual murine samples, we report the most comprehensive proteome characterization of individual murine CSF to date. Utilizing stringent protein inclusion criteria that required the identification of at least two unique peptides (1% falsemore » discovery rate at the peptide level) we identified a total of 566 unique proteins, including 128 proteins from three individual CSF samples that have been previously identified in brain tissue. Our methods and analysis provide a mechanism for individual murine CSF proteome analysis.« less
Design and analysis issues in quantitative proteomics studies.
Karp, Natasha A; Lilley, Kathryn S
2007-09-01
Quantitative proteomics is the comparison of distinct proteomes which enables the identification of protein species which exhibit changes in expression or post-translational state in response to a given stimulus. Many different quantitative techniques are being utilized and generate large datasets. Independent of the technique used, these large datasets need robust data analysis to ensure valid conclusions are drawn from such studies. Approaches to address the problems that arise with large datasets are discussed to give insight into the types of statistical analyses of data appropriate for the various experimental strategies that can be employed by quantitative proteomic studies. This review also highlights the importance of employing a robust experimental design and highlights various issues surrounding the design of experiments. The concepts and examples discussed within will show how robust design and analysis will lead to confident results that will ensure quantitative proteomics delivers.
Hill, Ryan C.; Wither, Matthew J.; Nemkov, Travis; Barrett, Alexander; D'Alessandro, Angelo; Dzieciatkowska, Monika; Hansen, Kirk C.
2015-01-01
Bone samples from several vertebrates were collected from the Ziegler Reservoir fossil site, in Snowmass Village, Colorado, and processed for proteomics analysis. The specimens come from Pleistocene megafauna Bison latifrons, dating back ∼120,000 years. Proteomics analysis using a simplified sample preparation procedure and tandem mass spectrometry (MS/MS) was applied to obtain protein identifications. Several bioinformatics resources were used to obtain peptide identifications based on sequence homology to extant species with annotated genomes. With the exception of soil sample controls, all samples resulted in confident peptide identifications that mapped to type I collagen. In addition, we analyzed a specimen from the extinct B. latifrons that yielded peptide identifications mapping to over 33 bovine proteins. Our analysis resulted in extensive fibrillar collagen sequence coverage, including the identification of posttranslational modifications. Hydroxylysine glucosylgalactosylation, a modification thought to be involved in collagen fiber formation and bone mineralization, was identified for the first time in an ancient protein dataset. Meta-analysis of data from other studies indicates that this modification may be common in well-preserved prehistoric samples. Additional peptide sequences from extracellular matrix (ECM) and non-ECM proteins have also been identified for the first time in ancient tissue samples. These data provide a framework for analyzing ancient protein signatures in well-preserved fossil specimens, while also contributing novel insights into the molecular basis of organic matter preservation. As such, this analysis has unearthed common posttranslational modifications of collagen that may assist in its preservation over time. The data are available via ProteomeXchange with identifier PXD001827. PMID:25948757
Ni, Mao-Wei; Wang, Lu; Chen, Wei; Mou, Han-Zhou; Zhou, Jie; Zheng, Zhi-Guo
2017-01-30
Mass spectrometry (MS)-based protein identification depends mainly on protein extraction and digestion. Although sodium dodecyl sulfate (SDS) can preclude enzymatic digestion and interfere with MS analysis, it is still the most widely used surfactant in these steps. To overcome these disadvantages, a SDS-compatible proteomic technique for SDS removal prior to MS-based analyses was developed, namely filter-aided sample preparation (FASP). Herein, based on the effectiveness of sodium deoxycholate and a detergent removal spin column, we developed a modified FASP (mFASP) method and compared its overall performance, total number of peptides and proteins identified for shotgun proteomic experiments with that of the FASP method. Identification of 4570 ± 392 and 9139 ± 317 peptides and description of 862 ± 46 and 1377 ± 33 protein groups with two or more peptides from the ovarian cancer cell line A2780 was accomplished by FASP and mFASP methods, respectively. The mFASP method (21.2 ± 0.2%) had higher average peptide to protein coverage than FASP method (13.2 ± 0.5%). More hydrophobic peptides were identified by mFASP than by FASP, as indicated by the GRAVY score distribution. The reported method enables reliable and efficient identification of proteins and peptides in whole-cell extracts containing SDS. The new approach allows for higher throughput (the simultaneous identification of more proteins), a more comprehensive investigation of proteins, and potentially the discovery of new biomarkers. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
USDA-ARS?s Scientific Manuscript database
We have analyzed 26 Shiga toxin-producing Escherichia coli (STEC) strains for Shiga toxin 2 (Stx2) production using matrix-assisted laser desorption/ionization time-of-flight-time-of-flight tandem mass spectrometry (MALDI-TOF-TOF-MS/MS) and top-down proteomic analysis. STEC strains were induced to ...
Elucidating Proteoform Families from Proteoform Intact-Mass and Lysine-Count Measurements
2016-01-01
Proteomics is presently dominated by the “bottom-up” strategy, in which proteins are enzymatically digested into peptides for mass spectrometric identification. Although this approach is highly effective at identifying large numbers of proteins present in complex samples, the digestion into peptides renders it impossible to identify the proteoforms from which they were derived. We present here a powerful new strategy for the identification of proteoforms and the elucidation of proteoform families (groups of related proteoforms) from the experimental determination of the accurate proteoform mass and number of lysine residues contained. Accurate proteoform masses are determined by standard LC–MS analysis of undigested protein mixtures in an Orbitrap mass spectrometer, and the lysine count is determined using the NeuCode isotopic tagging method. We demonstrate the approach in analysis of the yeast proteome, revealing 8637 unique proteoforms and 1178 proteoform families. The elucidation of proteoforms and proteoform families afforded here provides an unprecedented new perspective upon proteome complexity and dynamics. PMID:26941048
Berger, Sebastian T; Ahmed, Saima; Muntel, Jan; Cuevas Polo, Nerea; Bachur, Richard; Kentsis, Alex; Steen, Judith; Steen, Hanno
2015-10-01
We describe a 96-well plate compatible membrane-based proteomic sample processing method, which enables the complete processing of 96 samples (or multiples thereof) within a single workday. This method uses a large-pore hydrophobic PVDF membrane that efficiently adsorbs proteins, resulting in fast liquid transfer through the membrane and significantly reduced sample processing times. Low liquid transfer speeds have prevented the useful 96-well plate implementation of FASP as a widely used membrane-based proteomic sample processing method. We validated our approach on whole-cell lysate and urine and cerebrospinal fluid as clinically relevant body fluids. Without compromising peptide and protein identification, our method uses a vacuum manifold and circumvents the need for digest desalting, making our processing method compatible with standard liquid handling robots. In summary, our new method maintains the strengths of FASP and simultaneously overcomes one of the major limitations of FASP without compromising protein identification and quantification. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Purification and proteomic analysis of plant plasma membranes.
Alexandersson, Erik; Gustavsson, Niklas; Bernfur, Katja; Karlsson, Adine; Kjellbom, Per; Larsson, Christer
2008-01-01
All techniques needed for proteomic analyses of plant plasma membranes are described in detail, from isolation of plasma membranes to protein identification by mass spectrometry (MS). Plasma membranes are isolated by aqueous two-phase partitioning yielding vesicles with a cytoplasmic side-in orientation and a purity of about 95%. These vesicles are turned inside-out by treatment with Brij 58, which removes soluble contaminating proteins enclosed in the vesicles as well as loosely attached proteins. The final plasma membrane preparation thus retains all integral proteins and many peripheral proteins. Proteins are separated by one-dimensional sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE), and protein bands are excised and digested with trypsin. Peptides in tryptic digests are separated by nanoflow liquid chromatography and either fed directly into an ESI-MS or spotted onto matrix-assisted laser desorption ionization (MALDI) plates for analysis with MALDI-MS. Finally, data processing and database searching are used for protein identification to define a plasma membrane proteome.
Spencer, Jean L; Bhatia, Vivek N; Whelan, Stephen A; Costello, Catherine E; McComb, Mark E
2013-12-01
The identification of protein post-translational modifications (PTMs) is an increasingly important component of proteomics and biomarker discovery, but very few tools exist for performing fast and easy characterization of global PTM changes and differential comparison of PTMs across groups of data obtained from liquid chromatography-tandem mass spectrometry experiments. STRAP PTM (Software Tool for Rapid Annotation of Proteins: Post-Translational Modification edition) is a program that was developed to facilitate the characterization of PTMs using spectral counting and a novel scoring algorithm to accelerate the identification of differential PTMs from complex data sets. The software facilitates multi-sample comparison by collating, scoring, and ranking PTMs and by summarizing data visually. The freely available software (beta release) installs on a PC and processes data in protXML format obtained from files parsed through the Trans-Proteomic Pipeline. The easy-to-use interface allows examination of results at protein, peptide, and PTM levels, and the overall design offers tremendous flexibility that provides proteomics insight beyond simple assignment and counting.
Tyan, Yu-Chang; Wu, Hsin-Yi; Lai, Wu-Wei; Su, Wu-Chou; Liao, Pao-Chi
2005-01-01
Pleural effusion, an accumulation of pleural fluid, contains proteins originated from plasma filtrate and, especially when tissues are damaged, parenchyma interstitial spaces of lungs and/or other organs. This study details protein profiles in human pleural effusion from 43 lung adenocarcinoma patients by a two-dimensional nano-high performance liquid chromatography electrospray ionization tandem mass spectrometry (2D nano-HPLC-ESI-MS/MS) system. The experimental results revealed the identification of 1415 unique proteins from human pleural effusion. Among these 124 proteins identified with higher confidence levels, some proteins have not been reported in plasma and may represent proteins specifically present in pleural effusion. These proteins are valuable for mass identification of differentially expressed proteins involved in proteomics database and screening biomarker to further study in human lung adenocarcinoma. The significance of the use of proteomics analysis of human pleural fluid for the search of new lung cancer marker proteins, and for their simultaneous display and analysis in patients suffering from lung disorders has been examined.
Bourchookarn, Apichai; Havanapan, Phattara-Orn; Thongboonkerd, Visith; Krittanai, Chartchai
2008-03-01
A comparative proteomic analysis was employed to identify altered proteins in the yellow head virus (YHV) infected lymphoid organ (LO) of Penaeus monodon. At 24 h post-infection, the infected shrimps showed obvious signs of infection, while the control shrimps remained healthy. Two-dimensional electrophoresis of proteins extracted from the LO revealed significant alterations in abundance of several proteins in the infected group. Protein identification by MALDI-TOF MS and nanoLC-ESI-MS/MS revealed significant increase of transglutaminase, protein disulfide isomerase, ATP synthase beta subunit, V-ATPase subunit A, and hemocyanin fragments. A significant decrease was also identified for Rab GDP-dissociation inhibitor, 6-phosphogluconate dehydrogenase, actin, fast tropomyosin isoform, and hemolymph clottable protein. Some of these altered proteins were further investigated at the mRNA level using real-time RT-PCR, which confirmed the proteomic data. Identification of these altered proteins in the YHV-infected shrimps may provide novel insights into the molecular responses of P. monodon to YHV infection.
Berger, Sebastian T.; Ahmed, Saima; Muntel, Jan; Cuevas Polo, Nerea; Bachur, Richard; Kentsis, Alex; Steen, Judith; Steen, Hanno
2015-01-01
We describe a 96-well plate compatible membrane-based proteomic sample processing method, which enables the complete processing of 96 samples (or multiples thereof) within a single workday. This method uses a large-pore hydrophobic PVDF membrane that efficiently adsorbs proteins, resulting in fast liquid transfer through the membrane and significantly reduced sample processing times. Low liquid transfer speeds have prevented the useful 96-well plate implementation of FASP as a widely used membrane-based proteomic sample processing method. We validated our approach on whole-cell lysate and urine and cerebrospinal fluid as clinically relevant body fluids. Without compromising peptide and protein identification, our method uses a vacuum manifold and circumvents the need for digest desalting, making our processing method compatible with standard liquid handling robots. In summary, our new method maintains the strengths of FASP and simultaneously overcomes one of the major limitations of FASP without compromising protein identification and quantification. PMID:26223766
ProCon - PROteomics CONversion tool.
Mayer, Gerhard; Stephan, Christian; Meyer, Helmut E; Kohl, Michael; Marcus, Katrin; Eisenacher, Martin
2015-11-03
With the growing amount of experimental data produced in proteomics experiments and the requirements/recommendations of journals in the proteomics field to publicly make available data described in papers, a need for long-term storage of proteomics data in public repositories arises. For such an upload one needs proteomics data in a standardized format. Therefore, it is desirable, that the proprietary vendor's software will integrate in the future such an export functionality using the standard formats for proteomics results defined by the HUPO-PSI group. Currently not all search engines and analysis tools support these standard formats. In the meantime there is a need to provide user-friendly free-to-use conversion tools that can convert the data into such standard formats in order to support wet-lab scientists in creating proteomics data files ready for upload into the public repositories. ProCon is such a conversion tool written in Java for conversion of proteomics identification data into standard formats mzIdentML and Pride XML. It allows the conversion of Sequest™/Comet .out files, of search results from the popular and often used ProteomeDiscoverer® 1.x (x=versions 1.1 to1.4) software and search results stored in the LIMS systems ProteinScape® 1.3 and 2.1 into mzIdentML and PRIDE XML. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
Virtual Labs in proteomics: new E-learning tools.
Ray, Sandipan; Koshy, Nicole Rachel; Reddy, Panga Jaipal; Srivastava, Sanjeeva
2012-05-17
Web-based educational resources have gained enormous popularity recently and are increasingly becoming a part of modern educational systems. Virtual Labs are E-learning platforms where learners can gain the experience of practical experimentation without any direct physical involvement on real bench work. They use computerized simulations, models, videos, animations and other instructional technologies to create interactive content. Proteomics being one of the most rapidly growing fields of the biological sciences is now an important part of college and university curriculums. Consequently, many E-learning programs have started incorporating the theoretical and practical aspects of different proteomic techniques as an element of their course work in the form of Video Lectures and Virtual Labs. To this end, recently we have developed a Virtual Proteomics Lab at the Indian Institute of Technology Bombay, which demonstrates different proteomics techniques, including basic and advanced gel and MS-based protein separation and identification techniques, bioinformatics tools and molecular docking methods, and their applications in different biological samples. This Tutorial will discuss the prominent Virtual Labs featuring proteomics content, including the Virtual Proteomics Lab of IIT-Bombay, and E-resources available for proteomics study that are striving to make proteomic techniques and concepts available and accessible to the student and research community. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 14). Details can be found at: http://www.proteomicstutorials.org/. Copyright © 2012 Elsevier B.V. All rights reserved.
MALDI-TOF mass spectrometry proteomic phenotyping of clinically relevant fungi.
Putignani, Lorenza; Del Chierico, Federica; Onori, Manuela; Mancinelli, Livia; Argentieri, Marta; Bernaschi, Paola; Coltella, Luana; Lucignano, Barbara; Pansani, Laura; Ranno, Stefania; Russo, Cristina; Urbani, Andrea; Federici, Giorgio; Menichella, Donato
2011-03-01
Proteomics is particularly suitable for characterising human pathogens with high life cycle complexity, such as fungi. Protein content and expression levels may be affected by growth states and life cycle morphs and correlate to species and strain variation. Identification and typing of fungi by conventional methods are often difficult, time-consuming and frequently, for unusual species, inconclusive. Proteomic phenotypes from MALDI-TOF MS were employed as analytical and typing expression profiling of yeast, yeast-like species and strain variants in order to achieve a microbial proteomics population study. Spectra from 303 clinical isolates were generated and processed by standard pattern matching with a MALDI-TOF Biotyper (MT). Identifications (IDs) were compared to a reference biochemical-based system (Vitek-2) and, when discordant, MT IDs were verified with genotyping IDs, obtained by sequencing the 25-28S rRNA hypervariable D2 region. Spectra were converted into virtual gel-like formats, and hierarchical clustering analysis was performed for 274 Candida profiles to investigate species and strain typing correlation. MT provided 257/303 IDs consistent with Vitek-2 ones. However, amongst 26/303 discordant MT IDs, only 5 appeared "true". No MT identification was achieved for 20/303 isolates for incompleteness of database species variants. Candida spectra clustering agreed with identified species and topology of Candida albicans and Candida parapsilosis specific dendrograms. MT IDs show a high analytical performance and profiling heterogeneity which seems to complement or even outclass existing typing tools. This variability reflects the high biological complexity of yeasts and may be properly exploited to provide epidemiological tracing and infection dispersion patterns.
Grobei, Monica A.; Qeli, Ermir; Brunner, Erich; Rehrauer, Hubert; Zhang, Runxuan; Roschitzki, Bernd; Basler, Konrad; Ahrens, Christian H.; Grossniklaus, Ueli
2009-01-01
Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch requires relevant proteins to be stored in the mature pollen, where they have to retain functionality in a desiccated environment. Using a shotgun proteomics approach, we unambiguously identified ∼3500 proteins in Arabidopsis pollen, including 537 proteins that were not identified in genetic or transcriptomic studies. To generate this comprehensive reference data set, which extends the previously reported pollen proteome by a factor of 13, we developed a novel deterministic peptide classification scheme for protein inference. This generally applicable approach considers the gene model–protein sequence–protein accession relationships. It allowed us to classify and eliminate ambiguities inherently associated with any shotgun proteomics data set, to report a conservative list of protein identifications, and to seamlessly integrate data from previous transcriptomics studies. Manual validation of proteins unambiguously identified by a single, information-rich peptide enabled us to significantly reduce the false discovery rate, while keeping valuable identifications of shorter and lower abundant proteins. Bioinformatic analyses revealed a higher stability of pollen proteins compared to those of other tissues and implied a protein family of previously unknown function in vesicle trafficking. Interestingly, the pollen proteome is most similar to that of seeds, indicating physiological similarities between these developmentally distinct tissues. PMID:19546170
Birth of plant proteomics in India: a new horizon.
Narula, Kanika; Pandey, Aarti; Gayali, Saurabh; Chakraborty, Niranjan; Chakraborty, Subhra
2015-09-08
In the post-genomic era, proteomics is acknowledged as the next frontier for biological research. Although India has a long and distinguished tradition in protein research, the initiation of proteomics studies was a new horizon. Protein research witnessed enormous progress in protein separation, high-resolution refinements, biochemical identification of the proteins, protein-protein interaction, and structure-function analysis. Plant proteomics research, in India, began its journey on investigation of the proteome profiling, complexity analysis, protein trafficking, and biochemical modeling. The research article by Bhushan et al. in 2006 marked the birth of the plant proteomics research in India. Since then plant proteomics studies expanded progressively and are now being carried out in various institutions spread across the country. The compilation presented here seeks to trace the history of development in the area during the past decade based on publications till date. In this review, we emphasize on outcomes of the field providing prospects on proteomic pathway analyses. Finally, we discuss the connotation of strategies and the potential that would provide the framework of plant proteome research. The past decades have seen rapidly growing number of sequenced plant genomes and associated genomic resources. To keep pace with this increasing body of data, India is in the provisional phase of proteomics research to develop a comparative hub for plant proteomes and protein families, but it requires a strong impetus from intellectuals, entrepreneurs, and government agencies. Here, we aim to provide an overview of past, present and future of Indian plant proteomics, which would serve as an evaluation platform for those seeking to incorporate proteomics into their research programs. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.
Madio, Bruno; Undheim, Eivind A B; King, Glenn F
2017-08-23
More than a century of research on sea anemone venoms has shown that they contain a diversity of biologically active proteins and peptides. However, recent omics studies have revealed that much of the venom proteome remains unexplored. We used, for the first time, a combination of proteomic and transcriptomic techniques to obtain a holistic overview of the venom arsenal of the well-studied sea anemone Stichodactyla haddoni. A purely search-based approach to identify putative toxins in a transcriptome from tentacles regenerating after venom extraction identified 508 unique toxin-like transcripts grouped into 63 families. However, proteomic analysis of venom revealed that 52 of these toxin families are likely false positives. In contrast, the combination of transcriptomic and proteomic data enabled positive identification of 23 families of putative toxins, 12 of which have no homology known proteins or peptides. Our data highlight the importance of using proteomics of milked venom to correctly identify venom proteins/peptides, both known and novel, while minimizing false positive identifications from non-toxin homologues identified in transcriptomes of venom-producing tissues. This work lays the foundation for uncovering the role of individual toxins in sea anemone venom and how they contribute to the envenomation of prey, predators, and competitors. Proteomic analysis of milked venom combined with analysis of a tentacle transcriptome revealed the full extent of the venom arsenal of the sea anemone Stichodactyla haddoni. This combined approach led to the discovery of 12 entirely new families of disulfide-rich peptides and proteins in a genus of anemones that have been studied for over a century. Copyright © 2017 Elsevier B.V. All rights reserved.
Quantitative Proteomic Analysis of the Hfq-Regulon in Sinorhizobium meliloti 2011
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing target mRNA genes. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNAs genes and the broadly pleitropic phenotype associated to the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with 15N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq on regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti. PMID:23119037
Quantitative proteomic analysis of the Hfq-regulon in Sinorhizobium meliloti 2011.
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing target mRNA genes. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNAs genes and the broadly pleitropic phenotype associated to the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with (15)N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq on regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti.
Identification and validation nucleolin as a target of curcumol in nasopharyngeal carcinoma cells.
Wang, Juan; Wu, Jiacai; Li, Xumei; Liu, Haowei; Qin, Jianli; Bai, Zhun; Chi, Bixia; Chen, Xu
2018-06-30
Identification of the specific protein target(s) of a drug is a critical step in unraveling its mechanisms of action (MOA) in many natural products. Curcumol, isolated from well known Chinese medicinal plant Curcuma zedoary, has been shown to possess multiple biological activities. It can inhibit nasopharyngeal carcinoma (NPC) proliferation and induce apoptosis, but its target protein(s) in NPC cells remains unclear. In this study, we employed a mass spectrometry-based chemical proteomics approach reveal the possible protein targets of curcumol in NPC cells. Cellular thermal shift assay (CETSA), molecular docking and cell-based assay was used to validate the binding interactions. Chemical proteomics capturing uncovered that NCL is a target of curcumol in NPC cells, Molecular docking showed that curcumol bound to NCL with an -7.8 kcal/mol binding free energy. Cell function analysis found that curcumol's treatment leads to a degradation of NCL in NPC cells, and it showed slight effects on NP69 cells. In conclusion, our results providing evidences that NCL is a target protein of curcumol. We revealed that the anti-cancer effects of curcumol in NPC cells are mediated, at least in part, by NCL inhibition. Many natural products showed high bioactivity, while their mechanisms of action (MOA) are very poor or completely missed. Understanding the MOA of natural drugs can thoroughly exploit their therapeutic potential and minimize their adverse side effects. Identification of the specific protein target(s) of a drug is a critical step in unraveling its MOA. Compound-centric chemical proteomics is a classic chemical proteomics approach which integrates chemical synthesis with cell biology and mass spectrometry (MS) to identify protein targets of natural products determine the drug mechanism of action, describe its toxicity, and figure out the possible cause of off-target. It is an affinity-based chemical proteomics method to identify small molecule-protein interactions through affinity chromatography approach coupled with mass spectrometry, has been conventionally used to identify target proteins and has yielded good results. Curcumol, has shown effective inhibition on Nasopharyngeal Carcinoma (NPC) Cells, interacted with NCL and then initiated the anti-tumor biological effect. This research demonstrated the effectiveness of chemical proteomics approaches in natural drugs molecular target identification, revealing and understanding of the novel mechanism of actions of curcumol is crucial for cancer prevention and treatment in nasopharynx cancer. Copyright © 2018 Elsevier B.V. All rights reserved.
2014-01-01
Background Canine babesiosis is a tick-borne disease that is caused by the haemoprotozoan parasites of the genus Babesia. There are limited data on serum proteomics in dogs, and none of the effect of babesiosis on the serum proteome. The aim of this study was to identify the potential serum biomarkers of babesiosis using proteomic techniques in order to increase our understanding about disease pathogenesis. Results Serum samples were collected from 25 dogs of various breeds and sex with naturally occurring babesiosis caused by B. canis canis. Blood was collected on the day of admission (day 0), and subsequently on the 1st and 6th day of treatment. Two-dimensional electrophoresis (2DE) of pooled serum samples of dogs with naturally occurring babesiosis (day 0, day 1 and day 6) and healthy dogs were run in triplicate. 2DE image analysis showed 64 differentially expressed spots with p ≤ 0.05 and 49 spots with fold change ≥2. Six selected spots were excised manually and subjected to trypsin digest prior to identification by electrospray ionisation mass spectrometry on an Amazon ion trap tandem mass spectrometry (MS/MS). Mass spectrometry data was processed using Data Analysis software and the automated Matrix Science Mascot Daemon server. Protein identifications were assigned using the Mascot search engine to interrogate protein sequences in the NCBI Genbank database. A number of differentially expressed serum proteins involved in inflammation mediated acute phase response, complement and coagulation cascades, apolipoproteins and vitamin D metabolism pathway were identified in dogs with babesiosis. Conclusions Our findings confirmed two dominant pathogenic mechanisms of babesiosis, haemolysis and acute phase response. These results may provide possible serum biomarker candidates for clinical monitoring of babesiosis and this study could serve as the basis for further proteomic investigations in canine babesiosis. PMID:24885808
Wu, Qi; Yuan, Huiming; Zhang, Lihua; Zhang, Yukui
2012-06-20
With the acceleration of proteome research, increasing attention has been paid to multidimensional liquid chromatography-mass spectrometry (MDLC-MS) due to its high peak capacity and separation efficiency. Recently, many efforts have been put to improve MDLC-based strategies including "top-down" and "bottom-up" to enable highly sensitive qualitative and quantitative analysis of proteins, as well as accelerate the whole analytical procedure. Integrated platforms with combination of sample pretreatment, multidimensional separations and identification were also developed to achieve high throughput and sensitive detection of proteomes, facilitating highly accurate and reproducible quantification. This review summarized the recent advances of such techniques and their applications in qualitative and quantitative analysis of proteomes. Copyright © 2012 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Hong; Yang, Yanling; Li, Yuxin
2015-02-06
Development of high resolution liquid chromatography (LC) is essential for improving the sensitivity and throughput of mass spectrometry (MS)-based proteomics. Here we present systematic optimization of a long gradient LC-MS/MS platform to enhance protein identification from a complex mixture. The platform employed an in-house fabricated, reverse phase column (100 μm x 150 cm) coupled with Q Exactive MS. The column was capable of achieving a peak capacity of approximately 700 in a 720 min gradient of 10-45% acetonitrile. The optimal loading level was about 6 micrograms of peptides, although the column allowed loading as many as 20 micrograms. Gas phasemore » fractionation of peptide ions further increased the number of peptide identification by ~10%. Moreover, the combination of basic pH LC pre-fractionation with the long gradient LC-MS/MS platform enabled the identification of 96,127 peptides and 10,544 proteins at 1% protein false discovery rate in a postmortem brain sample of Alzheimer’s disease. As deep RNA sequencing of the same specimen suggested that ~16,000 genes were expressed, current analysis covered more than 60% of the expressed proteome. Further improvement strategies of the LC/LC-MS/MS platform were also discussed.« less
Proteomics and Metabolomics: Two Emerging Areas for Legume Improvement
Ramalingam, Abirami; Kudapa, Himabindu; Pazhamala, Lekha T.; Weckwerth, Wolfram; Varshney, Rajeev K.
2015-01-01
The crop legumes such as chickpea, common bean, cowpea, peanut, pigeonpea, soybean, etc. are important sources of nutrition and contribute to a significant amount of biological nitrogen fixation (>20 million tons of fixed nitrogen) in agriculture. However, the production of legumes is constrained due to abiotic and biotic stresses. It is therefore imperative to understand the molecular mechanisms of plant response to different stresses and identify key candidate genes regulating tolerance which can be deployed in breeding programs. The information obtained from transcriptomics has facilitated the identification of candidate genes for the given trait of interest and utilizing them in crop breeding programs to improve stress tolerance. However, the mechanisms of stress tolerance are complex due to the influence of multi-genes and post-transcriptional regulations. Furthermore, stress conditions greatly affect gene expression which in turn causes modifications in the composition of plant proteomes and metabolomes. Therefore, functional genomics involving various proteomics and metabolomics approaches have been obligatory for understanding plant stress tolerance. These approaches have also been found useful to unravel different pathways related to plant and seed development as well as symbiosis. Proteome and metabolome profiling using high-throughput based systems have been extensively applied in the model legume species, Medicago truncatula and Lotus japonicus, as well as in the model crop legume, soybean, to examine stress signaling pathways, cellular and developmental processes and nodule symbiosis. Moreover, the availability of protein reference maps as well as proteomics and metabolomics databases greatly support research and understanding of various biological processes in legumes. Protein-protein interaction techniques, particularly the yeast two-hybrid system have been advantageous for studying symbiosis and stress signaling in legumes. In this review, several studies on proteomics and metabolomics in model and crop legumes have been discussed. Additionally, applications of advanced proteomics and metabolomics approaches have also been included in this review for future applications in legume research. The integration of these “omics” approaches will greatly support the identification of accurate biomarkers in legume smart breeding programs. PMID:26734026
Kim, Young-Ha; slam, Mohammad Saiful; You, Myung-Jo
2015-01-01
Proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of proteome. For detection of antigens from Haemaphysalis longicornis, 1-dimensional electrophoresis (1-DE) quantitative immunoblotting technique combined with 2-dimensional electrophoresis (2-DE) immunoblotting was used for whole body proteins from unfed and partially fed female ticks. Reactivity bands and 2-DE immunoblotting were performed following 2-DE electrophoresis to identify protein spots. The proteome of the partially fed female had a larger number of lower molecular weight proteins than that of the unfed female tick. The total number of detected spots was 818 for unfed and 670 for partially fed female ticks. The 2-DE immunoblotting identified 10 antigenic spots from unfed females and 8 antigenic spots from partially fed females. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry (MALDI-TOF) of relevant spots identified calreticulin, putative secreted WC salivary protein, and a conserved hypothetical protein from the National Center for Biotechnology Information and Swiss Prot protein sequence databases. These findings indicate that most of the whole body components of these ticks are non-immunogenic. The data reported here will provide guidance in the identification of antigenic proteins to prevent infestation and diseases transmitted by H. longicornis. PMID:25748713
2014-01-01
The molecular mechanisms underlying skeletal muscle aging and associated sarcopenia have been linked to an altered oxidative status of redox-sensitive proteins. Reactive oxygen and reactive nitrogen species (ROS/RNS) generated by contracting skeletal muscle are necessary for optimal protein function, signaling, and adaptation. To investigate the redox proteome of aging gastrocnemius muscles from adult and old male mice, we developed a label-free quantitative proteomic approach that includes a differential cysteine labeling step. The approach allows simultaneous identification of up- and downregulated proteins between samples in addition to the identification and relative quantification of the reversible oxidation state of susceptible redox cysteine residues. Results from muscles of adult and old mice indicate significant changes in the content of chaperone, glucose metabolism, and cytoskeletal regulatory proteins, including Protein DJ-1, cAMP-dependent protein kinase type II, 78 kDa glucose regulated protein, and a reduction in the number of redox-responsive proteins identified in muscle of old mice. Results demonstrate skeletal muscle aging causes a reduction in redox-sensitive proteins involved in the generation of precursor metabolites and energy metabolism, indicating a loss in the flexibility of the redox energy response. Data is available via ProteomeXchange with identifier PXD001054. PMID:25181601
Exploring the context of the lung proteome within the airway mucosa following allergen challenge.
Fehniger, Thomas E; Sato-Folatre, José-Gabriel; Malmström, Johan; Berglund, Magnus; Lindberg, Claes; Brange, Charlotte; Lindberg, Henrik; Marko-Varga, György
2004-01-01
The lung proteome is a dynamic collection of specialized proteins related to pulmonary function. Many cells of different derivations, activation states, and levels of maturity contribute to the changing environment, which produces the lung proteome. Inflammatory cells reacting to environmental challenge, for example from allergens, produce and secrete proteins which have profound effects on both resident and nonresident cells located in airways, alveoli, and the vascular tree which provides blood cells to the parenchyma alveolar bed for gas exchange. In an experimental model of allergic airway inflammation, we have compared control and allergen challenged lung compartments to determine global protein expression patterns using 2D-gel electrophoresis and subsequent spot identification by MS/MS mass spectrometry. We have then specifically isolated the epithelial mucosal layer, which lines conducting airways, from control and allergen challenged lungs, using laser capture technology and performed proteome identification on these selected cell samples. A central component of our investigations has been to contextually relate the histological features of the dynamic pulmonary environment to the changes in protein expression observed following challenge. Our results provide new information of the complexity of the submucosa/epithelium interface and the mechanisms behind the transformation of airway epithelium from normal steady states to functionally activated states.
Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.; ...
2015-11-24
Here, the bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experimentmore » (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.
Here, the bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experimentmore » (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.« less
Lehmann, Sylvain; Hoofnagle, Andrew; Hochstrasser, Denis; Brede, Cato; Glueckmann, Matthias; Cocho, José A; Ceglarek, Uta; Lenz, Christof; Vialaret, Jérôme; Scherl, Alexander; Hirtz, Christophe
2013-05-01
Proteomics studies typically aim to exhaustively detect peptides/proteins in a given biological sample. Over the past decade, the number of publications using proteomics methodologies has exploded. This was made possible due to the availability of high-quality genomic data and many technological advances in the fields of microfluidics and mass spectrometry. Proteomics in biomedical research was initially used in 'functional' studies for the identification of proteins involved in pathophysiological processes, complexes and networks. Improved sensitivity of instrumentation facilitated the analysis of even more complex sample types, including human biological fluids. It is at that point the field of clinical proteomics was born, and its fundamental aim was the discovery and (ideally) validation of biomarkers for the diagnosis, prognosis, or therapeutic monitoring of disease. Eventually, it was recognized that the technologies used in clinical proteomics studies [particularly liquid chromatography-tandem mass spectrometry (LC-MS/MS)] could represent an alternative to classical immunochemical assays. Prior to deploying MS in the measurement of peptides/proteins in the clinical laboratory, it seems likely that traditional proteomics workflows and data management systems will need to adapt to the clinical environment and meet in vitro diagnostic (IVD) regulatory constraints. This defines a new field, as reviewed in this article, that we have termed quantitative Clinical Chemistry Proteomics (qCCP).
Top-down proteomics for the analysis of proteolytic events - Methods, applications and perspectives.
Tholey, Andreas; Becker, Alexander
2017-11-01
Mass spectrometry based proteomics is an indispensable tool for almost all research areas relevant for the understanding of proteolytic processing, ranging from the identification of substrates, products and cleavage sites up to the analysis of structural features influencing protease activity. The majority of methods for these studies are based on bottom-up proteomics performing analysis at peptide level. As this approach is characterized by a number of pitfalls, e.g. loss of molecular information, there is an ongoing effort to establish top-down proteomics, performing separation and MS analysis both at intact protein level. We briefly introduce major approaches of bottom-up proteomics used in the field of protease research and highlight the shortcomings of these methods. We then discuss the present state-of-the-art of top-down proteomics. Together with the discussion of known challenges we show the potential of this approach and present a number of successful applications of top-down proteomics in protease research. This article is part of a Special Issue entitled: Proteolysis as a Regulatory Event in Pathophysiology edited by Stefan Rose-John. Copyright © 2017 Elsevier B.V. All rights reserved.
MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data
Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko
2007-01-01
Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation, 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents significant advance and could be of great interest to the proteomics community. PMID:17567892
Mulvaney, Kathleen M.; Matson, Jacob P.; Siesser, Priscila F.; Tamir, Tigist Y.; Goldfarb, Dennis; Jacobs, Timothy M.; Cloer, Erica W.; Harrison, Joseph S.; Vaziri, Cyrus; Cook, Jeanette G.; Major, Michael B.
2016-01-01
KEAP1 is a substrate adaptor protein for a CUL3-based E3 ubiquitin ligase. Ubiquitylation and degradation of the antioxidant transcription factor NRF2 is considered the primary function of KEAP1; however, few other KEAP1 substrates have been identified. Because KEAP1 is altered in a number of human pathologies and has been proposed as a potential therapeutic target therein, we sought to better understand KEAP1 through systematic identification of its substrates. Toward this goal, we combined parallel affinity capture proteomics and candidate-based approaches. Substrate-trapping proteomics yielded NRF2 and the related transcription factor NRF1 as KEAP1 substrates. Our targeted investigation of KEAP1-interacting proteins revealed MCM3, an essential subunit of the replicative DNA helicase, as a new substrate. We show that MCM3 is ubiquitylated by the KEAP1-CUL3-RBX1 complex in cells and in vitro. Using ubiquitin remnant profiling, we identify the sites of KEAP1-dependent ubiquitylation in MCM3, and these sites are on predicted exposed surfaces of the MCM2–7 complex. Unexpectedly, we determined that KEAP1 does not regulate total MCM3 protein stability or subcellular localization. Our analysis of a KEAP1 targeting motif in MCM3 suggests that MCM3 is a point of direct contact between KEAP1 and the MCM hexamer. Moreover, KEAP1 associates with chromatin in a cell cycle-dependent fashion with kinetics similar to the MCM2–7 complex. KEAP1 is thus poised to affect MCM2–7 dynamics or function rather than MCM3 abundance. Together, these data establish new functions for KEAP1 within the nucleus and identify MCM3 as a novel substrate of the KEAP1-CUL3-RBX1 E3 ligase. PMID:27621311
Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
2012-01-01
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095
Clustering and Network Analysis of Reverse Phase Protein Array Data.
Byron, Adam
2017-01-01
Molecular profiling of proteins and phosphoproteins using a reverse phase protein array (RPPA) platform, with a panel of target-specific antibodies, enables the parallel, quantitative proteomic analysis of many biological samples in a microarray format. Hence, RPPA analysis can generate a high volume of multidimensional data that must be effectively interrogated and interpreted. A range of computational techniques for data mining can be applied to detect and explore data structure and to form functional predictions from large datasets. Here, two approaches for the computational analysis of RPPA data are detailed: the identification of similar patterns of protein expression by hierarchical cluster analysis and the modeling of protein interactions and signaling relationships by network analysis. The protocols use freely available, cross-platform software, are easy to implement, and do not require any programming expertise. Serving as data-driven starting points for further in-depth analysis, validation, and biological experimentation, these and related bioinformatic approaches can accelerate the functional interpretation of RPPA data.
Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
2012-03-01
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Proteomics with Mass Spectrometry Imaging: Beyond Amyloid Typing.
Lavatelli, Francesca; Merlini, Giampaolo
2018-04-01
Detection and typing of amyloid deposits in tissues are two crucial steps in the management of systemic amyloidoses. The presence of amyloid deposits is routinely evaluated through Congo red staining, whereas proteomics is now a mainstay in the identification of the deposited proteins. In article number 1700236, Winter et al. [Proteomics 2017, 17, Issue 22] describe a novel method based on MALDI-MS imaging coupled to ion mobility separation and peptide filtering, to detect the presence of amyloid in histology samples and to identify its composition, while preserving the spatial distribution of proteins in tissues. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Garbis, Spiros D; Roumeliotis, Theodoros I; Tyritzis, Stavros I; Zorpas, Kostas M; Pavlakis, Kitty; Constantinides, Constantinos A
2011-02-01
The current proof-of-principle study was aimed toward development of a novel multidimensional protein identification technology (MudPIT) approach for the in-depth proteome analysis of human serum derived from patients with benign prostate hyperplasia (BPH) using rational chromatographic design principles. This study constituted an extension of our published work relating to the identification and relative quantification of potential clinical biomarkers in BPH and prostate cancer (PCa) tissue specimens. The proposed MudPIT approach encompassed the use of three distinct yet complementary liquid chromatographic chemistries. High-pressure size-exclusion chromatography (SEC) was used for the prefractionation of serum proteins followed by their dialysis exchange and solution phase trypsin proteolysis. The tryptic peptides were then subjected to offline zwitterion-ion hydrophilic interaction chromatography (ZIC-HILIC) fractionation followed by their online analysis with reversed-phase nano-ultraperformance chromatography (RP-nUPLC) hyphenated to nanoelectrospray ionization-tandem mass spectrometry using an ion trap mass analyzer. For the spectral processing, the sequential use of the SpectrumMill, Scaffold, and InsPecT software tools was applied for the tryptic peptide product ion MS(2) spectral processing, false discovery rate (FDR) assessment, validation, and protein identification. This milestone serum analysis study allowed the confident identification of over 1955 proteins (p ≤ 0.05; FDR ≤ 5%) with a broad spectrum of biological and physicochemical properties including secreted, tissue-specific proteins spanning approximately 12 orders of magnitude as they occur in their native abundance levels in the serum matrix. Also encompassed in this proteome was the confident identification of 375 phosphoproteins (p ≤ 0.05; FDR ≤ 5%) with potential importance to cancer biology. To demonstrate the performance characteristics of this novel MudPIT approach, a comparison was made with the proteomes resulting from the immunodepletion of the high abundant albumin and IgG proteins with offline first dimensional tryptic peptide separation with both ZIC-HILIC and strong cation exchange (SCX) chromatography and their subsequent online RP-nUPLC-nESI-MS(2) analysis.
Foster, Joseph M; Moreno, Pablo; Fabregat, Antonio; Hermjakob, Henning; Steinbeck, Christoph; Apweiler, Rolf; Wakelam, Michael J O; Vizcaíno, Juan Antonio
2013-01-01
Protein sequence databases are the pillar upon which modern proteomics is supported, representing a stable reference space of predicted and validated proteins. One example of such resources is UniProt, enriched with both expertly curated and automatic annotations. Taken largely for granted, similar mature resources such as UniProt are not available yet in some other "omics" fields, lipidomics being one of them. While having a seasoned community of wet lab scientists, lipidomics lies significantly behind proteomics in the adoption of data standards and other core bioinformatics concepts. This work aims to reduce the gap by developing an equivalent resource to UniProt called 'LipidHome', providing theoretically generated lipid molecules and useful metadata. Using the 'FASTLipid' Java library, a database was populated with theoretical lipids, generated from a set of community agreed upon chemical bounds. In parallel, a web application was developed to present the information and provide computational access via a web service. Designed specifically to accommodate high throughput mass spectrometry based approaches, lipids are organised into a hierarchy that reflects the variety in the structural resolution of lipid identifications. Additionally, cross-references to other lipid related resources and papers that cite specific lipids were used to annotate lipid records. The web application encompasses a browser for viewing lipid records and a 'tools' section where an MS1 search engine is currently implemented. LipidHome can be accessed at http://www.ebi.ac.uk/apweiler-srv/lipidhome.
Ota, Kazuhisa; Kito, Keiji; Okada, Satoshi; Ito, Takashi
2008-10-01
Ubiquitination plays various critical roles in eukaryotic cellular regulation and is mediated by a cascade of enzymes including ubiquitin protein ligase (E3). The Skp1-Cullin-F-box protein complex comprises the largest E3 family, in each member of which a unique F-box protein binds its targets to define substrate specificity. Although genome sequencing uncovers a growing number of F-box proteins, most of them have remained as "orphans" because of the difficulties in identification of their substrates. To address this issue, we tested a quantitative proteomic approach by combining the stable isotope labeling by amino acids in cell culture (SILAC), parallel affinity purification (PAP) that we had developed for efficient enrichment of ubiquitinated proteins, and mass spectrometry (MS). We applied this SILAC-PAP-MS approach to compare ubiquitinated proteins between yeast cells with and without over-expressed Mdm30p, an F-box protein implicated in mitochondrial morphology. Consequently, we identified the mitochondrial outer membrane protein Mdm34p as a target of Mdm30p. Furthermore, we found that mitochondrial defects induced by deletion of MDM30 are not only recapitulated by a mutant Mdm34p defective in interaction with Mdm30p but alleviated by ubiquitination-mimicking forms of Mdm34p. These results indicate that Mdm34p is a physiologically important target of Mdm30p.
Liquid Chromatography Mass Spectrometry-Based Proteomics: Biological and Technological Aspects
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karpievitch, Yuliya V.; Polpitiya, Ashoka D.; Anderson, Gordon A.
2010-12-01
Mass spectrometry-based proteomics has become the tool of choice for identifying and quantifying the proteome of an organism. Though recent years have seen a tremendous improvement in instrument performance and the computational tools used, significant challenges remain, and there are many opportunities for statisticians to make important contributions. In the most widely used "bottom-up" approach to proteomics, complex mixtures of proteins are first subjected to enzymatic cleavage, the resulting peptide products are separated based on chemical or physical properties and analyzed using a mass spectrometer. The two fundamental challenges in the analysis of bottom-up MS-based proteomics are: (1) Identifying themore » proteins that are present in a sample, and (2) Quantifying the abundance levels of the identified proteins. Both of these challenges require knowledge of the biological and technological context that gives rise to observed data, as well as the application of sound statistical principles for estimation and inference. We present an overview of bottom-up proteomics and outline the key statistical issues that arise in protein identification and quantification.« less
A tutorial for software development in quantitative proteomics using PSI standard formats☆
Gonzalez-Galarza, Faviel F.; Qi, Da; Fan, Jun; Bessant, Conrad; Jones, Andrew R.
2014-01-01
The Human Proteome Organisation — Proteomics Standards Initiative (HUPO-PSI) has been working for ten years on the development of standardised formats that facilitate data sharing and public database deposition. In this article, we review three HUPO-PSI data standards — mzML, mzIdentML and mzQuantML, which can be used to design a complete quantitative analysis pipeline in mass spectrometry (MS)-based proteomics. In this tutorial, we briefly describe the content of each data model, sufficient for bioinformaticians to devise proteomics software. We also provide guidance on the use of recently released application programming interfaces (APIs) developed in Java for each of these standards, which makes it straightforward to read and write files of any size. We have produced a set of example Java classes and a basic graphical user interface to demonstrate how to use the most important parts of the PSI standards, available from http://code.google.com/p/psi-standard-formats-tutorial. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23584085
Trentmann, Oliver; Haferkamp, Ilka
2013-01-01
Vacuoles of plants fulfill various biologically important functions, like turgor generation and maintenance, detoxification, solute sequestration, or protein storage. Different types of plant vacuoles (lytic versus protein storage) are characterized by different functional properties apparently caused by a different composition/abundance and regulation of transport proteins in the surrounding membrane, the tonoplast. Proteome analyses allow the identification of vacuolar proteins and provide an informative basis for assigning observed transport processes to specific carriers or channels. This review summarizes techniques required for vacuolar proteome analyses, like e.g., isolation of the large central vacuole or tonoplast membrane purification. Moreover, an overview about diverse published vacuolar proteome studies is provided. It becomes evident that qualitative proteomes from different plant species represent just the tip of the iceberg. During the past few years, mass spectrometry achieved immense improvement concerning its accuracy, sensitivity, and application. As a consequence, modern tonoplast proteome approaches are suited for detecting alterations in membrane protein abundance in response to changing environmental/physiological conditions and help to clarify the regulation of tonoplast transport processes. PMID:23459586
Schizophrenia proteomics: biomarkers on the path to laboratory medicine?
Lakhan, Shaheen Emmanuel
2006-01-01
Over two million Americans are afflicted with schizophrenia, a debilitating mental health disorder with a unique symptomatic and epidemiological profile. Genomics studies have hinted towards candidate schizophrenia susceptibility chromosomal loci and genes. Modern proteomic tools, particularly mass spectrometry and expression scanning, aim to identify both pathogenic-revealing and diagnostically significant biomarkers. Only a few studies on basic proteomics have been conducted for psychiatric disorders relative to the plethora of cancer specific experiments. One such proteomic utility enables the discovery of proteins and biological marker fingerprinting profiling techniques (SELDI-TOF-MS), and then subjects them to tandem mass spectrometric fragmentation and de novo protein sequencing (MALDI-TOF/TOF-MS) for the accurate identification and characterization of the proteins. Such utilities can explain the pathogenesis of neuro-psychiatric disease, provide more objective testing methods, and further demonstrate a biological basis to mental illness. Although clinical proteomics in schizophrenia have yet to reveal a biomarker with diagnostic specificity, methods that better characterize the disorder using endophenotypes can advance findings. Schizophrenia biomarkers could potentially revolutionize its psychopharmacology, changing it into a more hypothesis and genomic/proteomic-driven science. PMID:16846510
A proteomic approach to obesity and type 2 diabetes.
López-Villar, Elena; Martos-Moreno, Gabriel Á; Chowen, Julie A; Okada, Shigeru; Kopchick, John J; Argente, Jesús
2015-07-01
The incidence of obesity and type diabetes 2 has increased dramatically resulting in an increased interest in its biomedical relevance. However, the mechanisms that trigger the development of diabetes type 2 in obese patients remain largely unknown. Scientific, clinical and pharmaceutical communities are dedicating vast resources to unravel this issue by applying different omics tools. During the last decade, the advances in proteomic approaches and the Human Proteome Organization have opened and are opening a new door that may be helpful in the identification of patients at risk and to improve current therapies. Here, we briefly review some of the advances in our understanding of type 2 diabetes that have occurred through the application of proteomics. We also review, in detail, the current improvements in proteomic methodologies and new strategies that could be employed to further advance our understanding of this pathology. By applying these new proteomic advances, novel therapeutic and/or diagnostic protein targets will be discovered in the obesity/Type 2 diabetes area. © 2015 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
Maryáš, Josef; Faktor, Jakub; Dvořáková, Monika; Struhárová, Iva; Grell, Peter; Bouchal, Pavel
2014-03-01
Metastases are responsible for most of the cases of death in patients with solid tumors. There is thus an urgent clinical need of better understanding the exact molecular mechanisms and finding novel therapeutics targets and biomarkers of metastatic disease of various tumors. Metastases are formed in a complicated biological process called metastatic cascade. Up to now, proteomics has enabled the identification of number of metastasis-associated proteins and potential biomarkers in cancer tissues, microdissected cells, model systems, and secretomes. Expression profiles and biological role of key proteins were confirmed in verification and functional experiments. This communication reviews these observations and analyses the methodological aspects of the proteomics approaches used. Moreover, it reviews contribution of current proteomics in the field of functional characterization and interactome analysis of proteins involved in various events in metastatic cascade. It is evident that ongoing technical progress will further increase proteome coverage and sample capacity of proteomics technologies, giving complex answers to clinical and functional questions asked. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Brandt, Luise Ørsted; Schmidt, Anne Lisbeth; Mannering, Ulla; Sarret, Mathilde; Kelstrup, Christian D.; Olsen, Jesper V.; Cappellini, Enrico
2014-01-01
Denmark has an extraordinarily large and well-preserved collection of archaeological skin garments found in peat bogs, dated to approximately 920 BC – AD 775. These objects provide not only the possibility to study prehistoric skin costume and technologies, but also to investigate the animal species used for the production of skin garments. Until recently, species identification of archaeological skin was primarily performed by light and scanning electron microscopy or the analysis of ancient DNA. However, the efficacy of these methods can be limited due to the harsh, mostly acidic environment of peat bogs leading to morphological and molecular degradation within the samples. We compared species assignment results of twelve archaeological skin samples from Danish bogs using Mass Spectrometry (MS)-based peptide sequencing, against results obtained using light and scanning electron microscopy. While it was difficult to obtain reliable results using microscopy, MS enabled the identification of several species-diagnostic peptides, mostly from collagen and keratins, allowing confident species discrimination even among taxonomically close organisms, such as sheep and goat. Unlike previous MS-based methods, mostly relying on peptide fingerprinting, the shotgun sequencing approach we describe aims to identify the complete extracted ancient proteome, without preselected specific targets. As an example, we report the identification, in one of the samples, of two peptides uniquely assigned to bovine foetal haemoglobin, indicating the production of skin from a calf slaughtered within the first months of its life. We conclude that MS-based peptide sequencing is a reliable method for species identification of samples from bogs. The mass spectrometry proteomics data were deposited in the ProteomeXchange Consortium with the dataset identifier PXD001029. PMID:25260035
The unique peptidome: Taxon-specific tryptic peptides as biomarkers for targeted metaproteomics.
Mesuere, Bart; Van der Jeugt, Felix; Devreese, Bart; Vandamme, Peter; Dawyndt, Peter
2016-09-01
The Unique Peptide Finder (http://unipept.ugent.be/peptidefinder) is an interactive web application to quickly hunt for tryptic peptides that are unique to a particular species, genus, or any other taxon. Biodiversity within the target taxon is represented by a set of proteomes selected from a monthly updated list of complete and nonredundant UniProt proteomes, supplemented with proprietary proteomes loaded into persistent local browser storage. The software computes and visualizes pan and core peptidomes as unions and intersections of tryptic peptides occurring in the selected proteomes. In addition, it also computes and displays unique peptidomes as the set of all tryptic peptides that occur in all selected proteomes but not in any UniProt record not assigned to the target taxon. As a result, the unique peptides can serve as robust biomarkers for the target taxon, for example, in targeted metaproteomics studies. Computations are extremely fast since they are underpinned by the Unipept database, the lowest common ancestor algorithm implemented in Unipept and modern web technologies that facilitate in-browser data storage and parallel processing. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Witzel, Katja; Surabhi, Giridara-Kumar; Jyothsnakumari, Gottimukkala; Sudhakar, Chinta; Matros, Andrea; Mock, Hans-Peter
2007-04-01
This paper describes the application of the recently introduced fluorescence stain Ruthenium(II)-tris-(bathophenanthroline-disulphonate) (RuBP) on a comparative proteome analysis of two phenotypically different barley lines. We carried out an analysis of protein patterns from 2-D gels of the parental lines of the Oregon Wolfe Barley mapping population DOM and REC and stained with either the conventional colloidal Coomassie Brilliant Blue (cCBB) or with the novel RuBP solution. We wished to experimentally verify the usefulness of such a stain in evaluating the complex pattern of a seed proteome, in comparison to the previously used cCBB staining technique. To validate the efficiency of visualization by both stains, we first compared the overall number of detected protein spots. On average, 790 spots were visible by cCBB staining and 1200 spots by RuBP staining. Then, the intensity of a set of spots was assessed, and changes in relative abundance were determined using image analysis software. As expected, staining with RuBP performed better in quantitation in terms of sensitivity and dynamic range. Furthermore, spots from a cultivar-specific region in the protein map were chosen for identification to asses the gain of biological information due to the staining procedure. From this particular region, eight spots were visualized exclusively by RuBP and identification was successful for all spots, proving the ability to identify even very low abundant proteins. Performance in MS analysis was comparable for both protein stains. Proteins were identified by MALDI-TOF MS peptide mass fingerprinting. This approach was not successful for all spots, due to the restricted entry number for barley in the database. Therefore, we subsequently used LC-ESI-Q-TOF MS/MS and de novo sequencing for identification. Because only an insufficient number of proteins from barley is annotated, an EST-based identification strategy was chosen for our experiment. We wished to test whether under these limitations the application of a more sensitive stain would lead to a more advanced proteome approach. In summary, we demonstrate here that the application of RuBP as an economical but reliable and sensitive fluorescence stain is highly suitable for quantitative proteome analysis of plant seeds.
Peptide de novo sequencing of mixture tandem mass spectra
Hotta, Stéphanie Yuki Kolbeck; Verano‐Braga, Thiago; Kjeldsen, Frank
2016-01-01
The impact of mixture spectra deconvolution on the performance of four popular de novo sequencing programs was tested using artificially constructed mixture spectra as well as experimental proteomics data. Mixture fragmentation spectra are recognized as a limitation in proteomics because they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co‐isolation and thus prone to false identifications. The deconvolution approach matched complementary b‐, y‐ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co‐isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced peptides. The improvement was in the range of 20–35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight candidate peptide score distribution and high sensitivity to small changes in the mass spectrum introduced by the employed deconvolution method could explain some of the missing peptide identifications. PMID:27329701
Ferreira, Ana M; Grossmann, Jonas; Fortes, Claudia; Kilminster, Tanya; Scanlon, Tim; Milton, John; Greeff, Johan; Oldham, Chris; Nanni, Paolo; Almeida, André M
2017-05-24
Seasonal Weight Loss (SWL) is one of the most pressing issues in animal production in the tropics and Mediterranean. This work aims to characterize muscle proteome changes as a consequence of SWL in meat producing sheep, using a label-free proteomics approach. We compare three breeds: the Australian Merino (SWL susceptible), the Damara (SWL tolerant) and the Dorper (SWL intermediate tolerance). We identified 668 proteins of the sheep proteome, 95 with differential regulation. Also we observe that the more vulnerable to SWL a breed is, the more differential abundance proteins we find. Protein binding was the most frequently altered molecular function identified. We suggest 6 putative markers for restricted nutritional conditions independently of breed: ferritin heavy-chain; immunoglobulin V lambda chain; transgelin; fatty acid synthase; glutathione S-transferase A2; dihydrodiol dehydrogenase 3-like. Moreover, we suggest as related to SWL tolerance: S100-A10 Serpin A3-5-like and Catalase, subject however to necessary validation assays. The identification of SWL-tolerance related proteins using proteomics will lead to increased stock productivity of relevant interest to animal production, particularly if identified at the muscle level, the tissue of economic importance in meat production. Seasonal Weight Loss (SWL) is the most pressing issue in animal production in the tropics and the Mediterranean. To counter SWL, farmers often use animal breeds that have a natural ability to withstand pasture scarcity. Here we study the sheep muscle proteome at the muscle level, the tissue of economic importance in meat production. Furthermore, the identification of proteins that change their abundance in response to SWL using proteomics can contribute to increased stock productivity of relevant interest to animal production. We identified 668 proteins of the sheep proteome. We demonstrate that the following proteins are affected by restricted nutritional conditions: ferritin heavy chain; immunoglobulin V lambda chain; transgelin; fatty acid synthase; glutathione S-transferase A2; dihydrodiol dehydrogenase 3-like. Furthermore, S100-A10, Serpin A3-5-like and Catalase are proteins that changed their abundance in response to SWL. Nevertheless, it is important to highlight that Catalase values for the merino breed were close to significance and therefore catalase validation is of utmost importance. Copyright © 2017 Elsevier B.V. All rights reserved.
Kedia, Komal; Nichols, Caitlin A; Thulin, Craig D; Graves, Steven W
2015-11-01
Tissue proteomics has relied heavily on two-dimensional gel electrophoresis, for protein separation and quantification, then single protein isolation, trypsin digestion, and mass spectrometric protein identification. Such methods are predominantly used for study of high-abundance, full-length proteins. Tissue peptidomics has recently been developed but is still used to study the most highly abundant species, often resulting in observation and identification of dozens of peptides only. Tissue lipidomics is likewise new, and reported studies are limited. We have developed an "omics" approach that enables over 7,000 low-molecular-weight, low-abundance species to be surveyed and have applied this to human placental tissue. Because the placenta is believed to be involved in complications of pregnancy, its proteomic evaluation is of substantial interest. In previous research on the placental proteome, abundant, high-molecular-weight proteins have been studied. Application of large-scale, global proteomics or peptidomics to the placenta have been limited, and would be challenging owing to the anatomic complexity and broad concentration range of proteins in this tissue. In our approach, involving protein depletion, capillary liquid chromatography, and tandem mass spectrometry, we attempted to identify molecular differences between two regions of the same placenta with only slightly different cellular composition. Our analysis revealed 16 species with statistically significant differences between the two regions. Tandem mass spectrometry enabled successful sequencing, or otherwise enabled chemical characterization, of twelve of these. The successful discovery and identification of regional differences between the expression of low-abundance, low-molecular weight biomolecules reveals the potential of our approach.
Data Independent Acquisition analysis in ProHits 4.0.
Liu, Guomin; Knight, James D R; Zhang, Jian Ping; Tsou, Chih-Chiang; Wang, Jian; Lambert, Jean-Philippe; Larsen, Brett; Tyers, Mike; Raught, Brian; Bandeira, Nuno; Nesvizhskii, Alexey I; Choi, Hyungwon; Gingras, Anne-Claude
2016-10-21
Affinity purification coupled with mass spectrometry (AP-MS) is a powerful technique for the identification and quantification of physical interactions. AP-MS requires careful experimental design, appropriate control selection and quantitative workflows to successfully identify bona fide interactors amongst a large background of contaminants. We previously introduced ProHits, a Laboratory Information Management System for interaction proteomics, which tracks all samples in a mass spectrometry facility, initiates database searches and provides visualization tools for spectral counting-based AP-MS approaches. More recently, we implemented Significance Analysis of INTeractome (SAINT) within ProHits to provide scoring of interactions based on spectral counts. Here, we provide an update to ProHits to support Data Independent Acquisition (DIA) with identification software (DIA-Umpire and MSPLIT-DIA), quantification tools (through DIA-Umpire, or externally via targeted extraction), and assessment of quantitative enrichment (through mapDIA) and scoring of interactions (through SAINT-intensity). With additional improvements, notably support of the iProphet pipeline, facilitated deposition into ProteomeXchange repositories and enhanced export and viewing functions, ProHits 4.0 offers a comprehensive suite of tools to facilitate affinity proteomics studies. It remains challenging to score, annotate and analyze proteomics data in a transparent manner. ProHits was previously introduced as a LIMS to enable storing, tracking and analysis of standard AP-MS data. In this revised version, we expand ProHits to include integration with a number of identification and quantification tools based on Data-Independent Acquisition (DIA). ProHits 4.0 also facilitates data deposition into public repositories, and the transfer of data to new visualization tools. Copyright © 2016 Elsevier B.V. All rights reserved.
Gautam, Poonam; Mushahary, Dolly; Hassan, Wazid; Upadhyay, Santosh Kumar; Madan, Taruna; Sirdeshmukh, Ravi; Sundaram, Curam Sreenivasacharlu; Sarma, Puranam Usha
2016-07-01
Aspergillus fumigatus (A. fumigatus) is a medically important opportunistic fungus that may lead to invasive aspergillosis in humans with weak immune system. Proteomic profiling of this fungus on exposure to itraconazole (ITC), an azole antifungal drug, may lead to identification of its molecular targets and better understanding on the development of drug resistance against ITC in A. fumigatus. Here, proteome analysis was performed using 2-DE followed by mass spectrometric analysis which resulted in identification of a total of 259 unique proteins. Further, proteome profiling of A. fumigatus was carried out on exposure to ITC, 0.154 μg/ml, the minimum inhibitory concentration (MIC50). Image analysis showed altered levels of 175 proteins (66 upregulated and 109 downregulated) of A. fumigatus treated with ITC as compared to the untreated control. Peptide mass fingerprinting led to the identification of 54 proteins (12 up-regulated and 42 down-regulated). The differentially expressed proteins include proteins related to cell stress, carbohydrate metabolism and amino acid metabolism. We also observed four proteins, including nucleotide phosphate kinase (NDK), that are reported to interact with calcineurin, a protein involved in regulation of cell morphology and fungal virulence. Comparison of differentially expressed proteins on exposure to ITC with artemisinin (ART), an antimalarial drug with antifungal activity(1), revealed a total of 26 proteins to be common among them suggesting that common proteins and pathways are targeted by these two antifungal agents. The proteins targeted by ITC may serve as important leads for development of new antifungal drugs. © The Author 2016. Published by Oxford University Press on behalf of The International Society for Human and Animal Mycology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Recent advances in proteomics of cereals.
Bansal, Monika; Sharma, Madhu; Kanwar, Priyanka; Goyal, Aakash
Cereals contribute a major part of human nutrition and are considered as an integral source of energy for human diets. With genomic databases already available in cereals such as rice, wheat, barley, and maize, the focus has now moved to proteome analysis. Proteomics studies involve the development of appropriate databases based on developing suitable separation and purification protocols, identification of protein functions, and can confirm their functional networks based on already available data from other sources. Tremendous progress has been made in the past decade in generating huge data-sets for covering interactions among proteins, protein composition of various organs and organelles, quantitative and qualitative analysis of proteins, and to characterize their modulation during plant development, biotic, and abiotic stresses. Proteomics platforms have been used to identify and improve our understanding of various metabolic pathways. This article gives a brief review of efforts made by different research groups on comparative descriptive and functional analysis of proteomics applications achieved in the cereal science so far.
Comparing Simplification Strategies for the Skeletal Muscle Proteome
Geary, Bethany; Young, Iain S.; Cash, Phillip; Whitfield, Phillip D.; Doherty, Mary K.
2016-01-01
Skeletal muscle is a complex tissue that is dominated by the presence of a few abundant proteins. This wide dynamic range can mask the presence of lower abundance proteins, which can be a confounding factor in large-scale proteomic experiments. In this study, we have investigated a number of pre-fractionation methods, at both the protein and peptide level, for the characterization of the skeletal muscle proteome. The analyses revealed that the use of OFFGEL isoelectric focusing yielded the largest number of protein identifications (>750) compared to alternative gel-based and protein equalization strategies. Further, OFFGEL led to a substantial enrichment of a different sub-population of the proteome. Filter-aided sample preparation (FASP), coupled to peptide-level OFFGEL provided more confidence in the results due to a substantial increase in the number of peptides assigned to each protein. The findings presented here support the use of a multiplexed approach to proteome characterization of skeletal muscle, which has a recognized imbalance in the dynamic range of its protein complement. PMID:28248220
Hair-bundle proteomes of avian and mammalian inner-ear utricles
Wilmarth, Phillip A.; Krey, Jocelyn F.; Shin, Jung-Bum; Choi, Dongseok; David, Larry L.; Barr-Gillespie, Peter G.
2015-01-01
Examination of multiple proteomics datasets within or between species increases the reliability of protein identification. We report here proteomes of inner-ear hair bundles from three species (chick, mouse, and rat), which were collected on LTQ or LTQ Velos ion-trap mass spectrometers; the constituent proteins were quantified using MS2 intensities, which are the summed intensities of all peptide fragmentation spectra matched to a protein. The data are available via ProteomeXchange with identifiers PXD002410 (chick LTQ), PXD002414 (chick Velos), PXD002415 (mouse Velos), and PXD002416 (rat LTQ). The two chick bundle datasets compared favourably to a third, already-described chick bundle dataset, which was quantified using MS1 peak intensities, the summed intensities of peptides identified by high-resolution mass spectrometry (PXD000104; updated analysis in PXD002445). The mouse bundle dataset described here was comparable to a different mouse bundle dataset quantified using MS1 intensities (PXD002167). These six datasets will be useful for identifying the core proteome of vestibular hair bundles. PMID:26645194
de Bernonville, Thomas Dugé; Albenne, Cécile; Arlat, Matthieu; Hoffmann, Laurent; Lauber, Emmanuelle; Jamet, Elisabeth
2014-01-01
Proteomic analysis of xylem sap has recently become a major field of interest to understand several biological questions related to plant development and responses to environmental clues. The xylem sap appears as a dynamic fluid undergoing changes in its proteome upon abiotic and biotic stresses. Unlike cell compartments which are amenable to purification in sufficient amount prior to proteomic analysis, the xylem sap has to be collected in particular conditions to avoid contamination by intracellular proteins and to obtain enough material. A model plant like Arabidopsis thaliana is not suitable for such an analysis because efficient harvesting of xylem sap is difficult. The analysis of the xylem sap proteome also requires specific procedures to concentrate proteins and to focus on proteins predicted to be secreted. Indeed, xylem sap proteins appear to be synthesized and secreted in the root stele or to originate from dying differentiated xylem cells. This chapter describes protocols to collect xylem sap from Brassica species and to prepare total and N-glycoprotein extracts for identification of proteins by mass spectrometry analyses and bioinformatics.
The Pacific Northwest National Laboratory library of bacterial and archaeal proteomic biodiversity
Payne, Samuel H.; Monroe, Matthew E.; Overall, Christopher C.; ...
2015-08-18
This dataset deposition announces the submission to public repositories of the PNNL Biodiversity Library, a large collection of global proteomics data for 112 bacterial and archaeal organisms. The data comprises 35,162 tandem mass spectrometry (MS/MS) datasets from ~10 years of research. All data has been searched, annotated and organized in a consistent manner to promote reuse by the community. Protein identifications were cross-referenced with KEGG functional annotations which allows for pathway oriented investigation. We present the data as a freely available community resource. A variety of data re-use options are described for computational modeling, proteomics assay design and bioengineering. Instrumentmore » data and analysis files are available at ProteomeXchange via the MassIVE partner repository under the identifiers PXD001860 and MSV000079053.« less
The Use of Proteomics in Assisted Reproduction.
Kosteria, Ioanna; Anagnostopoulos, Athanasios K; Kanaka-Gantenbein, Christina; Chrousos, George P; Tsangaris, George T
2017-01-01
Despite the explosive increase in the use of Assisted Reproductive Technologies (ART) over the last 30 years, their success rates remain suboptimal. Proteomics is a rapidly-evolving technology-driven science that has already been widely applied in the exploration of human reproduction and fertility, providing useful insights into its physiology and leading to the identification of numerous proteins that may be potential biomarkers and/or treatment targets of a successful ART pregnancy. Here we present a brief overview of the techniques used in proteomic analyses and attempt a comprehensive presentation of recent data from mass spectrometry-based proteomic studies in humans, regarding all components of ARTs, including the male and female gamete, the derived zygote and embryo, the endometrium and, finally, the ART offspring both pre- and postnatally. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
The Pacific Northwest National Laboratory library of bacterial and archaeal proteomic biodiversity
DOE Office of Scientific and Technical Information (OSTI.GOV)
Payne, Samuel H.; Monroe, Matthew E.; Overall, Christopher C.
This dataset deposition announces the submission to public repositories of the PNNL Biodiversity Library, a large collection of global proteomics data for 112 bacterial and archaeal organisms. The data comprises 35,162 tandem mass spectrometry (MS/MS) datasets from ~10 years of research. All data has been searched, annotated and organized in a consistent manner to promote reuse by the community. Protein identifications were cross-referenced with KEGG functional annotations which allows for pathway oriented investigation. We present the data as a freely available community resource. A variety of data re-use options are described for computational modeling, proteomics assay design and bioengineering. Instrumentmore » data and analysis files are available at ProteomeXchange via the MassIVE partner repository under the identifiers PXD001860 and MSV000079053.« less
Platelet proteomics: from discovery to diagnosis.
Looße, Christina; Swieringa, Frauke; Heemskerk, Johan W M; Sickmann, Albert; Lorenz, Christin
2018-05-22
Platelets are the smallest cells within the circulating blood with key roles in physiological haemostasis and pathological thrombosis regulated by the onset of activating/inhibiting processes via receptor responses and signalling cascades. Areas covered: Proteomics as well as genomic approaches have been fundamental in identifying and quantifying potential targets for future diagnostic strategies in the prevention of bleeding and thrombosis, and uncovering the complexity of platelet functions in health and disease. In this article, we provide a critical overview on current functional tests used in diagnostics and the future perspectives for platelet proteomics in clinical applications. Expert commentary: Proteomics represents a valuable tool for the identification of patients with diverse platelet associated defects. In-depth validation of identified biomarkers, e.g. receptors, signalling proteins, post-translational modifications, in large cohorts is decisive for translation into routine clinical diagnostics.
Stadlmann, Johannes; Hoi, David M; Taubenschmid, Jasmin; Mechtler, Karl; Penninger, Josef M
2018-05-18
SugarQb (www.imba.oeaw.ac.at/sugarqb) is a freely available collection of computational tools for the automated identification of intact glycopeptides from high-resolution HCD MS/MS data-sets in the Proteome Discoverer environment. We report the migration of SugarQb to the latest and free version of Proteome Discoverer 2.1, and apply it to the analysis of PNGase F-resistant N-glycopeptides from mouse embryonic stem cells. The analysis of intact glycopeptides highlights unexpected technical limitations to PNGase F-dependent glycoproteomic workflows at the proteome level, and warrants a critical re-interpretation of seminal data-sets in the context of N-glycosylation-site prediction. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
The landscape of viral proteomics and its potential to impact human health
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oxford, Kristie L.; Wendler, Jason P.; McDermott, Jason E.
2016-05-06
Translating the intimate discourse between viruses and their host cells during infection is a challenging but critical task for development of antiviral interventions and diagnostics. Viruses commandeer cellular processes at every step of their life cycle, altering expression of genes and proteins. Advances in mass spectrometry-based proteomic technologies are enhancing studies of viral pathogenesis by identifying virus-induced changes in the protein repertoire of infected cells or extracellular fluids. Interpretation of proteomics results using knowledge of cellular pathways and networks leads to identification of proteins that influence a range of infection processes, thereby focusing efforts for clinical diagnoses and therapeutics development.more » Herein we discuss applications of global proteomic studies of viral infections with the goal of providing a basis for improved studies that will benefit community-wide data integration and interpretation.« less
Lee, Hangyeore; Mun, Dong-Gi; Bae, Jingi; Kim, Hokeun; Oh, Se Yeon; Park, Young Soo; Lee, Jae-Hyuk; Lee, Sang-Won
2015-08-21
We report a new and simple design of a fully automated dual-online ultra-high pressure liquid chromatography system. The system employs only two nano-volume switching valves (a two-position four port valve and a two-position ten port valve) that direct solvent flows from two binary nano-pumps for parallel operation of two analytical columns and two solid phase extraction (SPE) columns. Despite the simple design, the sDO-UHPLC offers many advantageous features that include high duty cycle, back flushing sample injection for fast and narrow zone sample injection, online desalting, high separation resolution and high intra/inter-column reproducibility. This system was applied to analyze proteome samples not only in high throughput deep proteome profiling experiments but also in high throughput MRM experiments.
Plant proteome analysis: a 2006 update.
Jorrín, Jesús V; Maldonado, Ana M; Castillejo, Ma Angeles
2007-08-01
This 2006 'Plant Proteomics Update' is a continuation of the two previously published in 'Proteomics' by 2004 (Canovas et al., Proteomics 2004, 4, 285-298) and 2006 (Rossignol et al., Proteomics 2006, 6, 5529-5548) and it aims to bring up-to-date the contribution of proteomics to plant biology on the basis of the original research papers published throughout 2006, with references to those appearing last year. According to the published papers and topics addressed, we can conclude that, as observed for the three previous years, there has been a quantitative, but not qualitative leap in plant proteomics. The full potential of proteomics is far from being exploited in plant biology research, especially if compared to other organisms, mainly yeast and humans, and a number of challenges, mainly technological, remain to be tackled. The original papers published last year numbered nearly 100 and deal with the proteome of at least 26 plant species, with a high percentage for Arabidopsis thaliana (28) and rice (11). Scientific objectives ranged from proteomic analysis of organs/tissues/cell suspensions (57) or subcellular fractions (29), to the study of plant development (12), the effect of hormones and signalling molecules (8) and response to symbionts (4) and stresses (27). A small number of contributions have covered PTMs (8) and protein interactions (4). 2-DE (specifically IEF-SDS-PAGE) coupled to MS still constitutes the almost unique platform utilized in plant proteome analysis. The application of gel-free protein separation methods and 'second generation' proteomic techniques such as multidimensional protein identification technology (MudPIT), and those for quantitative proteomics including DIGE, isotope-coded affinity tags (ICAT), iTRAQ and stable isotope labelling by amino acids in cell culture (SILAC) still remains anecdotal. This review is divided into seven sections: Introduction, Methodology, Subcellular proteomes, Development, Responses to biotic and abiotic stresses, PTMs and Protein interactions. Section 8 summarizes the major pitfalls and challenges of plant proteomics.
Method for genetic identification of unknown organisms
Colston, Jr., Billy W.; Fitch, Joseph P.; Hindson, Benjamin J.; Carter, Chance J.; Beer, Neil Reginald
2016-08-23
A method of rapid, genome and proteome based identification of unknown pathogenic or non-pathogenic organisms in a complex sample. The entire sample is analyzed by creating millions of emulsion encapsulated microdroplets, each containing a single pathogenic or non-pathogenic organism sized particle and appropriate reagents for amplification. Following amplification, the amplified product is analyzed.
Dasari, Surendra; Chambers, Matthew C.; Martinez, Misti A.; Carpenter, Kristin L.; Ham, Amy-Joan L.; Vega-Montoto, Lorenzo J.; Tabb, David L.
2012-01-01
Spectral libraries have emerged as a viable alternative to protein sequence databases for peptide identification. These libraries contain previously detected peptide sequences and their corresponding tandem mass spectra (MS/MS). Search engines can then identify peptides by comparing experimental MS/MS scans to those in the library. Many of these algorithms employ the dot product score for measuring the quality of a spectrum-spectrum match (SSM). This scoring system does not offer a clear statistical interpretation and ignores fragment ion m/z discrepancies in the scoring. We developed a new spectral library search engine, Pepitome, which employs statistical systems for scoring SSMs. Pepitome outperformed the leading library search tool, SpectraST, when analyzing data sets acquired on three different mass spectrometry platforms. We characterized the reliability of spectral library searches by confirming shotgun proteomics identifications through RNA-Seq data. Applying spectral library and database searches on the same sample revealed their complementary nature. Pepitome identifications enabled the automation of quality analysis and quality control (QA/QC) for shotgun proteomics data acquisition pipelines. PMID:22217208
Barrera, Alejandro; Alastruey-Izquierdo, Ana; Martín, María J.; Cuesta, Isabel; Vizcaíno, Juan Antonio
2014-01-01
Over the past several years fungal infections have shown an increasing incidence in the susceptible population, and caused high mortality rates. In parallel, multi-resistant fungi are emerging in human infections. Therefore, the identification of new potential antifungal targets is a priority. The first task of this study was to analyse the protein domain and domain architecture content of the 137 fungal proteomes (corresponding to 111 species) available in UniProtKB (UniProt KnowledgeBase) by January 2013. The resulting list of core and exclusive domain and domain architectures is provided in this paper. It delineates the different levels of fungal taxonomic classification: phylum, subphylum, order, genus and species. The analysis highlighted Aspergillus as the most diverse genus in terms of exclusive domain content. In addition, we also investigated which domains could be considered promiscuous in the different organisms. As an application of this analysis, we explored three different ways to detect potential targets for antifungal drugs. First, we compared the domain and domain architecture content of the human and fungal proteomes, and identified those domains and domain architectures only present in fungi. Secondly, we looked for information regarding fungal pathways in public repositories, where proteins containing promiscuous domains could be involved. Three pathways were identified as a result: lovastatin biosynthesis, xylan degradation and biosynthesis of siroheme. Finally, we classified a subset of the studied fungi in five groups depending on their occurrence in clinical samples. We then looked for exclusive domains in the groups that were more relevant clinically and determined which of them had the potential to bind small molecules. Overall, this study provides a comprehensive analysis of the available fungal proteomes and shows three approaches that can be used as a first step in the detection of new antifungal targets. PMID:25033262
Dun, Matthew D.; Chalkley, Robert J.; Faulkner, Sam; Keene, Sheridan; Avery-Kiejda, Kelly A.; Scott, Rodney J.; Falkenby, Lasse G.; Cairns, Murray J.; Larsen, Martin R.; Bradshaw, Ralph A.; Hondermarck, Hubert
2015-01-01
Brain metastases are a devastating consequence of cancer and currently there are no specific biomarkers or therapeutic targets for risk prediction, diagnosis, and treatment. Here the proteome of the brain metastatic breast cancer cell line 231-BR has been compared with that of the parental cell line MDA-MB-231, which is also metastatic but has no organ selectivity. Using SILAC and nanoLC-MS/MS, 1957 proteins were identified in reciprocal labeling experiments and 1584 were quantified in the two cell lines. A total of 152 proteins were confidently determined to be up- or down-regulated by more than twofold in 231-BR. Of note, 112/152 proteins were decreased as compared with only 40/152 that were increased, suggesting that down-regulation of specific proteins is an important part of the mechanism underlying the ability of breast cancer cells to metastasize to the brain. When matched against transcriptomic data, 43% of individual protein changes were associated with corresponding changes in mRNA, indicating that the transcript level is a limited predictor of protein level. In addition, differential miRNA analyses showed that most miRNA changes in 231-BR were up- (36/45) as compared with down-regulations (9/45). Pathway analysis revealed that proteome changes were mostly related to cell signaling and cell cycle, metabolism and extracellular matrix remodeling. The major protein changes in 231-BR were confirmed by parallel reaction monitoring mass spectrometry and consisted in increases (by more than fivefold) in the matrix metalloproteinase-1, ephrin-B1, stomatin, myc target-1, and decreases (by more than 10-fold) in transglutaminase-2, the S100 calcium-binding protein A4, and l-plastin. The clinicopathological significance of these major proteomic changes to predict the occurrence of brain metastases, and their potential value as therapeutic targets, warrants further investigation. PMID:26041846
Barrera, Alejandro; Alastruey-Izquierdo, Ana; Martín, María J; Cuesta, Isabel; Vizcaíno, Juan Antonio
2014-07-01
Over the past several years fungal infections have shown an increasing incidence in the susceptible population, and caused high mortality rates. In parallel, multi-resistant fungi are emerging in human infections. Therefore, the identification of new potential antifungal targets is a priority. The first task of this study was to analyse the protein domain and domain architecture content of the 137 fungal proteomes (corresponding to 111 species) available in UniProtKB (UniProt KnowledgeBase) by January 2013. The resulting list of core and exclusive domain and domain architectures is provided in this paper. It delineates the different levels of fungal taxonomic classification: phylum, subphylum, order, genus and species. The analysis highlighted Aspergillus as the most diverse genus in terms of exclusive domain content. In addition, we also investigated which domains could be considered promiscuous in the different organisms. As an application of this analysis, we explored three different ways to detect potential targets for antifungal drugs. First, we compared the domain and domain architecture content of the human and fungal proteomes, and identified those domains and domain architectures only present in fungi. Secondly, we looked for information regarding fungal pathways in public repositories, where proteins containing promiscuous domains could be involved. Three pathways were identified as a result: lovastatin biosynthesis, xylan degradation and biosynthesis of siroheme. Finally, we classified a subset of the studied fungi in five groups depending on their occurrence in clinical samples. We then looked for exclusive domains in the groups that were more relevant clinically and determined which of them had the potential to bind small molecules. Overall, this study provides a comprehensive analysis of the available fungal proteomes and shows three approaches that can be used as a first step in the detection of new antifungal targets.
Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.
2012-01-01
Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints. PMID:22446687
Pan, Sheng; Rush, John; Peskind, Elaine R; Galasko, Douglas; Chung, Kathryn; Quinn, Joseph; Jankovic, Joseph; Leverenz, James B; Zabetian, Cyrus; Pan, Catherine; Wang, Yan; Oh, Jung Hun; Gao, Jean; Zhang, Jianpeng; Montine, Thomas; Zhang, Jing
2008-02-01
Targeted quantitative proteomics by mass spectrometry aims to selectively detect one or a panel of peptides/proteins in a complex sample and is particularly appealing for novel biomarker verification/validation because it does not require specific antibodies. Here, we demonstrated the application of targeted quantitative proteomics in searching, identifying, and quantifying selected peptides in human cerebrospinal spinal fluid (CSF) using a matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometer (MALDI TOF/TOF)-based platform. The approach involved two major components: the use of isotopic-labeled synthetic peptides as references for targeted identification and quantification and a highly selective mass spectrometric analysis based on the unique characteristics of the MALDI instrument. The platform provides high confidence for targeted peptide detection in a complex system and can potentially be developed into a high-throughput system. Using the liquid chromatography (LC) MALDI TOF/TOF platform and the complementary identification strategy, we were able to selectively identify and quantify a panel of targeted peptides in the whole proteome of CSF without prior depletion of abundant proteins. The effectiveness and robustness of the approach associated with different sample complexity, sample preparation strategies, as well as mass spectrometric quantification were evaluated. Other issues related to chromatography separation and the feasibility for high-throughput analysis were also discussed. Finally, we applied targeted quantitative proteomics to analyze a subset of previously identified candidate markers in CSF samples of patients with Parkinson's disease (PD) at different stages and Alzheimer's disease (AD) along with normal controls.
Parsing and Quantification of Raw Orbitrap Mass Spectrometer Data Using RawQuant.
Kovalchik, Kevin A; Moggridge, Sophie; Chen, David D Y; Morin, Gregg B; Hughes, Christopher S
2018-06-01
Effective analysis of protein samples by mass spectrometry (MS) requires careful selection and optimization of a range of experimental parameters. As the output from the primary detection device, the "raw" MS data file can be used to gauge the success of a given sample analysis. However, the closed-source nature of the standard raw MS file can complicate effective parsing of the data contained within. To ease and increase the range of analyses possible, the RawQuant tool was developed to enable parsing of raw MS files derived from Thermo Orbitrap instruments to yield meta and scan data in an openly readable text format. RawQuant can be commanded to export user-friendly files containing MS 1 , MS 2 , and MS 3 metadata as well as matrices of quantification values based on isobaric tagging approaches. In this study, the utility of RawQuant is demonstrated in several scenarios: (1) reanalysis of shotgun proteomics data for the identification of the human proteome, (2) reanalysis of experiments utilizing isobaric tagging for whole-proteome quantification, and (3) analysis of a novel bacterial proteome and synthetic peptide mixture for assessing quantification accuracy when using isobaric tags. Together, these analyses successfully demonstrate RawQuant for the efficient parsing and quantification of data from raw Thermo Orbitrap MS files acquired in a range of common proteomics experiments. In addition, the individual analyses using RawQuant highlights parametric considerations in the different experimental sets and suggests targetable areas to improve depth of coverage in identification-focused studies and quantification accuracy when using isobaric tags.
Brunet, Marie A; Levesque, Sébastien A; Hunting, Darel J; Cohen, Alan A; Roucou, Xavier
2018-05-01
Technological advances promise unprecedented opportunities for whole exome sequencing and proteomic analyses of populations. Currently, data from genome and exome sequencing or proteomic studies are searched against reference genome annotations. This provides the foundation for research and clinical screening for genetic causes of pathologies. However, current genome annotations substantially underestimate the proteomic information encoded within a gene. Numerous studies have now demonstrated the expression and function of alternative (mainly small, sometimes overlapping) ORFs within mature gene transcripts. This has important consequences for the correlation of phenotypes and genotypes. Most alternative ORFs are not yet annotated because of a lack of evidence, and this absence from databases precludes their detection by standard proteomic methods, such as mass spectrometry. Here, we demonstrate how current approaches tend to overlook alternative ORFs, hindering the discovery of new genetic drivers and fundamental research. We discuss available tools and techniques to improve identification of proteins from alternative ORFs and finally suggest a novel annotation system to permit a more complete representation of the transcriptomic and proteomic information contained within a gene. Given the crucial challenge of distinguishing functional ORFs from random ones, the suggested pipeline emphasizes both experimental data and conservation signatures. The addition of alternative ORFs in databases will render identification less serendipitous and advance the pace of research and genomic knowledge. This review highlights the urgent medical and research need to incorporate alternative ORFs in current genome annotations and thus permit their inclusion in hypotheses and models, which relate phenotypes and genotypes. © 2018 Brunet et al.; Published by Cold Spring Harbor Laboratory Press.
Proteomic analysis of the Theileria annulata schizont
Witschi, M.; Xia, D.; Sanderson, S.; Baumgartner, M.; Wastling, J.M.; Dobbelaere, D.A.E.
2013-01-01
The apicomplexan parasite, Theileria annulata, is the causative agent of tropical theileriosis, a devastating lymphoproliferative disease of cattle. The schizont stage transforms bovine leukocytes and provides an intriguing model to study host/pathogen interactions. The genome of T. annulata has been sequenced and transcriptomic data are rapidly accumulating. In contrast, little is known about the proteome of the schizont, the pathogenic, transforming life cycle stage of the parasite. Using one-dimensional (1-D) gel LC-MS/MS, a proteomic analysis of purified T. annulata schizonts was carried out. In whole parasite lysates, 645 proteins were identified. Proteins with transmembrane domains (TMDs) were under-represented and no proteins with more than four TMDs could be detected. To tackle this problem, Triton X-114 treatment was applied, which facilitates the extraction of membrane proteins, followed by 1-D gel LC-MS/MS. This resulted in the identification of an additional 153 proteins. Half of those had one or more TMD and 30 proteins with more than four TMDs were identified. This demonstrates that Triton X-114 treatment can provide a valuable additional tool for the identification of new membrane proteins in proteomic studies. With two exceptions, all proteins involved in glycolysis and the citric acid cycle were identified. For at least 29% of identified proteins, the corresponding transcripts were not present in the existing expressed sequence tag databases. The proteomics data were integrated into the publicly accessible database resource at EuPathDB (www.eupathdb.org) so that mass spectrometry-based protein expression evidence for T. annulata can be queried alongside transcriptional and other genomics data available for these parasites. PMID:23178997
PROTICdb: a web-based application to store, track, query, and compare plant proteome data.
Ferry-Dumazet, Hélène; Houel, Gwenn; Montalent, Pierre; Moreau, Luc; Langella, Olivier; Negroni, Luc; Vincent, Delphine; Lalanne, Céline; de Daruvar, Antoine; Plomion, Christophe; Zivy, Michel; Joets, Johann
2005-05-01
PROTICdb is a web-based application, mainly designed to store and analyze plant proteome data obtained by two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) and mass spectrometry (MS). The purposes of PROTICdb are (i) to store, track, and query information related to proteomic experiments, i.e., from tissue sampling to protein identification and quantitative measurements, and (ii) to integrate information from the user's own expertise and other sources into a knowledge base, used to support data interpretation (e.g., for the determination of allelic variants or products of post-translational modifications). Data insertion into the relational database of PROTICdb is achieved either by uploading outputs of image analysis and MS identification software, or by filling web forms. 2-D PAGE annotated maps can be displayed, queried, and compared through a graphical interface. Links to external databases are also available. Quantitative data can be easily exported in a tabulated format for statistical analyses. PROTICdb is based on the Oracle or the PostgreSQL Database Management System and is freely available upon request at the following URL: http://moulon.inra.fr/ bioinfo/PROTICdb.
Insights from Proteomic Studies into Plant Somatic Embryogenesis.
Heringer, Angelo Schuabb; Santa-Catarina, Claudete; Silveira, Vanildo
2018-03-01
Somatic embryogenesis is a biotechnological approach mainly used for the clonal propagation of different plants worldwide. In somatic embryogenesis, embryos arise from somatic cells under appropriate culture conditions. This plasticity in plants is a demonstration of true cellular totipotency and is the best approach among the genetic transformation protocols used for plant regeneration. Despite the importance of somatic embryogenesis, knowledge regarding the control of the somatic embryogenesis process is limited. Therefore, the elucidation of both the biochemical and molecular processes is important for understanding the mechanisms by which a single somatic cell becomes a whole plant. Modern proteomic techniques rely on an alternative method for the identification and quantification of proteins with different abundances in embryogenic cell cultures or somatic embryos and enable the identification of specific proteins related to somatic embryogenesis development. This review focuses on somatic embryogenesis studies that use gel-free shotgun proteomic analyses to categorize proteins that could enhance our understanding of particular aspects of the somatic embryogenesis process and identify possible targets for future studies. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Barkla, Bronwyn J.
2016-01-01
Modern day agriculture practice is narrowing the genetic diversity in our food supply. This may compromise the ability to obtain high yield under extreme climactic conditions, threatening food security for a rapidly growing world population. To identify genetic diversity, tolerance mechanisms of cultivars, landraces and wild relatives of major crops can be identified and ultimately exploited for yield improvement. Quantitative proteomics allows for the identification of proteins that may contribute to tolerance mechanisms by directly comparing protein abundance under stress conditions between genotypes differing in their stress responses. In this review, a summary is provided of the data accumulated from quantitative proteomic comparisons of crop genotypes/cultivars which present different stress tolerance responses when exposed to various abiotic stress conditions, including drought, salinity, high/low temperature, nutrient deficiency and UV-B irradiation. This field of research aims to identify molecular features that can be developed as biomarkers for crop improvement, however without accurate phenotyping, careful experimental design, statistical robustness and appropriate biomarker validation and verification it will be challenging to deliver what is promised. PMID:28248236
Barkla, Bronwyn J
2016-09-08
Modern day agriculture practice is narrowing the genetic diversity in our food supply. This may compromise the ability to obtain high yield under extreme climactic conditions, threatening food security for a rapidly growing world population. To identify genetic diversity, tolerance mechanisms of cultivars, landraces and wild relatives of major crops can be identified and ultimately exploited for yield improvement. Quantitative proteomics allows for the identification of proteins that may contribute to tolerance mechanisms by directly comparing protein abundance under stress conditions between genotypes differing in their stress responses. In this review, a summary is provided of the data accumulated from quantitative proteomic comparisons of crop genotypes/cultivars which present different stress tolerance responses when exposed to various abiotic stress conditions, including drought, salinity, high/low temperature, nutrient deficiency and UV-B irradiation. This field of research aims to identify molecular features that can be developed as biomarkers for crop improvement, however without accurate phenotyping, careful experimental design, statistical robustness and appropriate biomarker validation and verification it will be challenging to deliver what is promised.
Nahm, Francis Sahngun; Park, Zee-Yong; Nahm, Sang-Soep; Kim, Yong Chul; Lee, Pyung Bok
2014-01-01
Complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP) model, a novel experimental model of CRPS. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expressions of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation were increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in CPIP group. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.
Biomarkers for AAA: Encouraging steps but clinical relevance still to be delivered.
Htun, Nay Min; Peter, Karlheinz
2014-10-01
Potential biomarkers have been investigated using proteomic studies in a variety of diseases. Some biomarkers have central roles in both diagnosis and monitoring of various disorders in clinical medicine, such as troponins, brain natriuretic peptide, and C-reactive protein. Although several biomarkers have been suggested in human abdominal aortic aneurysm (AAA), reliable markers have been lacking. In this issue, Moxon et al. [Proteomics Clin Appl. 2014, 8, 762-772] undertook a broad and systematic proteomic approach toward identification of biomarkers in a commonly used AAA mouse model (AAA created by angiotensin-II infusion). In this mouse model, apolipoprotein C1 and matrix metalloproteinase-9 were identified as novel biomarkers of stable AAA. This finding represents an important step forward, toward a clinically relevant role of biomarkers in AAA. This also encourages for further studies toward the identification of biomarkers (or their combinations) that can predict AAA progression and rupture, which would represent a major progress in AAA management and would establish an AAA biomarker as a much anticipated clinical tool. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Quantitation of heat-shock proteins in clinical samples using mass spectrometry.
Kaur, Punit; Asea, Alexzander
2011-01-01
Mass spectrometry (MS) is a powerful analytical tool for proteomics research and drug and biomarker discovery. MS enables identification and quantification of known and unknown compounds by revealing their structural and chemical properties. Proper sample preparation for MS-based analysis is a critical step in the proteomics workflow because the quality and reproducibility of sample extraction and preparation for downstream analysis significantly impact the separation and identification capabilities of mass spectrometers. The highly expressed proteins represent potential biomarkers that could aid in diagnosis, therapy, or drug development. Because the proteome is so complex, there is no one standard method for preparing protein samples for MS analysis. Protocols differ depending on the type of sample, source, experiment, and method of analysis. Molecular chaperones play significant roles in almost all biological functions due to their capacity for detecting intracellular denatured/unfolded proteins, initiating refolding or denaturation of such malfolded protein sequences and more recently for their role in the extracellular milieu as chaperokines. In this chapter, we describe the latest techniques for quantitating the expression of molecular chaperones in human clinical samples.
Hao, Piliang; Ren, Yan; Dutta, Bamaprasad; Sze, Siu Kwan
2013-04-26
ERLIC and high-pH RP (Hp-RP) have been reported to be promising alternatives to strong cation exchange (SCX) in proteome fractionation. Here we compared the performance of ERLIC, concatenated ERLIC and concatenated Hp-RP in proteome profiling. The protein identification is comparable in these three strategies, but significantly more unique peptides are identified by the two concatenation methods, resulting in a significant increase of the average protein sequence coverage. The pooling of fractions from spaced intervals results in more uniform distribution of peptides in each fraction compared with the chromatogram-based pooling of adjacent fractions. ERLIC fractionates peptides according to their pI and GRAVY values. These properties remains but becomes less remarkable in concatenated ERLIC. In contrast, the average pI and GRAVY values of the peptides are comparable in each fraction in concatenated Hp-RP. ERLIC performs the best in identifying peptides with pI>9 among the three strategies, while concatenated Hp-RP is good at identifying peptides with pI<4. These advantages are useful when either basic or acidic peptides/proteins are analytical targets. The power of ERLIC in identification of basic peptides seems to be due to their efficient separation from acidic peptides. This study facilitates the choice of proper fractionation strategies based on specific objectives. For in-depth proteomic analysis of a cell, tissue and plasma, multidimensional liquid chromatography (MDLC) is still necessary to reduce sample complexity for improving analytical dynamic range and proteome coverage. This work conducts a direct comparison of three promising first-dimensional proteome fractionation methods. They are comparable in identifying proteins, but concatenated ERLIC and concatenated Hp-RP identify significantly more unique peptides than ERLIC. ERLIC is good at analyzing basic peptides, while concatenated Hp-RP performs the best in analyzing acidic peptides with pI<4. This will facilitate the choice of the proper peptide fractionation strategy based on a specific need. A combination of different fractionation strategies can be used to increase the sequence coverage and number of protein identification due to the complementary effect between different methods. Copyright © 2013 Elsevier B.V. All rights reserved.
Cardiovascular Redox and Ox Stress Proteomics
Kumar, Vikas; Calamaras, Timothy Dean; Haeussler, Dagmar; Colucci, Wilson Steven; Cohen, Richard Alan; McComb, Mark Errol; Pimentel, David
2012-01-01
Abstract Significance: Oxidative post-translational modifications (OPTMs) have been demonstrated as contributing to cardiovascular physiology and pathophysiology. These modifications have been identified using antibodies as well as advanced proteomic methods, and the functional importance of each is beginning to be understood using transgenic and gene deletion animal models. Given that OPTMs are involved in cardiovascular pathology, the use of these modifications as biomarkers and predictors of disease has significant therapeutic potential. Adequate understanding of the chemistry of the OPTMs is necessary to determine what may occur in vivo and which modifications would best serve as biomarkers. Recent Advances: By using mass spectrometry, advanced labeling techniques, and antibody identification, OPTMs have become accessible to a larger proportion of the scientific community. Advancements in instrumentation, database search algorithms, and processing speed have allowed MS to fully expand on the proteome of OPTMs. In addition, the role of enzymatically reversible OPTMs has been further clarified in preclinical models. Critical Issues: The identification of OPTMs suffers from limitations in analytic detection based on the methodology, instrumentation, sample complexity, and bioinformatics. Currently, each type of OPTM requires a specific strategy for identification, and generalized approaches result in an incomplete assessment. Future Directions: Novel types of highly sensitive MS instrumentation that allow for improved separation and detection of modified proteins and peptides have been crucial in the discovery of OPTMs and biomarkers. To further advance the identification of relevant OPTMs in advanced search algorithms, standardized methods for sample processing and depository of MS data will be required. Antioxid. Redox Signal. 17, 1528–1559. PMID:22607061
Advances in targeted proteomics and applications to biomedical research
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shi, Tujin; Song, Ehwang; Nie, Song
Targeted proteomics technique has emerged as a powerful protein quantification tool in systems biology, biomedical research, and increasing for clinical applications. The most widely used targeted proteomics approach, selected reaction monitoring (SRM), also known as multiple reaction monitoring (MRM), can be used for quantification of cellular signaling networks and preclinical verification of candidate protein biomarkers. As an extension to our previous review on advances in SRM sensitivity (Shi et al., Proteomics, 12, 1074–1092, 2012) herein we review recent advances in the method and technology for further enhancing SRM sensitivity (from 2012 to present), and highlighting its broad biomedical applications inmore » human bodily fluids, tissue and cell lines. Furthermore, we also review two recently introduced targeted proteomics approaches, parallel reaction monitoring (PRM) and data-independent acquisition (DIA) with targeted data extraction on fast scanning high-resolution accurate-mass (HR/AM) instruments. Such HR/AM targeted quantification with monitoring all target product ions addresses SRM limitations effectively in specificity and multiplexing; whereas when compared to SRM, PRM and DIA are still in the infancy with a limited number of applications. Thus, for HR/AM targeted quantification we focus our discussion on method development, data processing and analysis, and its advantages and limitations in targeted proteomics. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale quantification of hundreds of target proteins are discussed.« less
ProteoSign: an end-user online differential proteomics statistical analysis platform.
Efstathiou, Georgios; Antonakis, Andreas N; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Divanach, Peter; Trudgian, David C; Thomas, Benjamin; Papanikolaou, Nikolas; Aivaliotis, Michalis; Acuto, Oreste; Iliopoulos, Ioannis
2017-07-03
Profiling of proteome dynamics is crucial for understanding cellular behavior in response to intrinsic and extrinsic stimuli and maintenance of homeostasis. Over the last 20 years, mass spectrometry (MS) has emerged as the most powerful tool for large-scale identification and characterization of proteins. Bottom-up proteomics, the most common MS-based proteomics approach, has always been challenging in terms of data management, processing, analysis and visualization, with modern instruments capable of producing several gigabytes of data out of a single experiment. Here, we present ProteoSign, a freely available web application, dedicated in allowing users to perform proteomics differential expression/abundance analysis in a user-friendly and self-explanatory way. Although several non-commercial standalone tools have been developed for post-quantification statistical analysis of proteomics data, most of them are not end-user appealing as they often require very stringent installation of programming environments, third-party software packages and sometimes further scripting or computer programming. To avoid this bottleneck, we have developed a user-friendly software platform accessible via a web interface in order to enable proteomics laboratories and core facilities to statistically analyse quantitative proteomics data sets in a resource-efficient manner. ProteoSign is available at http://bioinformatics.med.uoc.gr/ProteoSign and the source code at https://github.com/yorgodillo/ProteoSign. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
A New Mass Spectrometry-compatible Degradable Surfactant for Tissue Proteomics
Chang, Ying-Hua; Gregorich, Zachery R.; Chen, Albert J.; Hwang, Leekyoung; Guner, Huseyin; Yu, Deyang; Zhang, Jianyi; Ge, Ying
2015-01-01
Tissue proteomics is increasingly recognized for its role in biomarker discovery and disease mechanism investigation. However, protein solubility remains a significant challenge in mass spectrometry (MS)-based tissue proteomics. Conventional surfactants such as sodium dodecyl sulfate (SDS), the preferred surfactant for protein solubilization, are not compatible with MS. Herein, we have screened a library of surfactant-like compounds and discovered an MS-compatible degradable surfactant (MaSDeS) for tissue proteomics that solubilizes all categories of proteins with performance comparable to SDS. The use of MaSDeS in the tissue extraction significantly improves the total number of protein identifications from commonly used tissues, including tissue from the heart, liver, and lung. Notably, MaSDeS significantly enriches membrane proteins, which are often under-represented in proteomics studies. The acid degradable nature of MaSDeS makes it amenable for high-throughput mass spectrometry-based proteomics. In addition, the thermostability of MaSDeS allows for its use in experiments requiring high temperature to facilitate protein extraction and solubilization. Furthermore, we have shown that MaSDeS outperforms the other MS-compatible surfactants in terms of overall protein solubility and the total number of identified proteins in tissue proteomics. Thus, the use of MaSDeS will greatly advance tissue proteomics and realize its potential in basic biomedical and clinical research. MaSDeS could be utilized in a variety of proteomics studies as well as general biochemical and biological experiments that employ surfactants for protein solubilization. PMID:25589168
Expanding proteome coverage with orthogonal-specificity α-Lytic proteases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meyer, Jesse G.; Kim, Sangtae; Maltby, David A.
2014-03-01
Bottom-up proteomics studies traditionally involve proteome digestion with a single protease, trypsin. However, trypsin alone does not generate peptides that encompass the entire proteome. Alternative proteases have been explored, but most have specificity for charged amino acid side chains. Therefore, additional proteases that improve proteome coverage by cleavage at sequences complimentary to trypsin may increase proteome coverage. We demonstrate the novel application of two proteases for bottom-up proteomics: wild type alpha-lytic protease (WaLP), and an active site mutant of WaLP, M190A alpha-lytic protease (MaLP). We assess several relevant factors including MS/MS fragmentation, peptide length, peptide yield, and protease specificity. Bymore » combining data from separate digestions with trypsin, LysC, WaLP, and MaLP, proteome coverage was increased 101% compared to trypsin digestion alone. To demonstrate how the gained sequence coverage can access additional PTM information, we show identification of a number of novel phosphorylation sites in the S. pombe proteome and include an illustrative example from the protein MPD2, wherein two novel sites are identified, one in a tryptic peptide too short to identify and the other in a sequence devoid of tryptic sites. The specificity of WaLP and MaLP for aliphatic amino acid side chains was particularly valuable for coverage of membrane protein sequences, which increased 350% when the data from trypsin, LysC, WaLP, and MaLP were combined.« less
Van Cutsem, Emmanuel; Simonart, Géraldine; Degand, Hervé; Faber, Anne-Marie; Morsomme, Pierre; Boutry, Marc
2011-02-01
Nicotiana tabacum leaves are covered by trichomes involved in the secretion of large amounts of secondary metabolites, some of which play a major role in plant defense. However, little is known about the metabolic pathways that operate in these structures. We undertook a proteomic analysis of N. tabacum trichomes in order to identify their protein complement. Efficient trichome isolation was obtained by abrading frozen leaves. After homogenization, soluble proteins and a microsomal fraction were prepared by centrifugation. Gel-based and gel-free proteomic analyses were then performed. 2-DE analysis of soluble proteins led to the identification of 1373 protein spots, which were digested and analyzed by MS/MS, leading to 680 unique identifications. Both soluble proteins and microsomal fraction were analyzed by LC MALDI-MS/MS after trypsin digestion, leading to 858 identifications, many of which had not been identified after 2-DE, indicating that the two methods complement each other. Many enzymes putatively involved in secondary metabolism were identified, including enzymes involved in the synthesis of terpenoid precursors and in acyl sugar production. Several transporters were also identified, some of which might be involved in secondary metabolite transport. Various (a)biotic stress response proteins were also detected, supporting the role of trichomes in plant defense. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Pandey, Vishakha; Singh, Manoj; Pandey, Dinesh; Kumar, Anil
2018-05-18
Tilletia indica incites Karnal bunt (KB) disease in wheat. To date, no KB resistant wheat cultivar could be developed due to non-availability of potential biomarkers related to pathogenicity/virulence for screening of resistant wheat genotypes. The present study was carried out to compare the proteomes of T. indica highly (TiK) and low (TiP) virulent isolates. Twenty one protein spots consistently observed as up-regulated/differential in the TiK proteome were selected for identification by MALDI-TOF/TOF. Identified sequences showed homology with fungal proteins playing essential role in plant infection and pathogen survival, including stress response, adhesion, fungal penetration, invasion, colonization, degradation of host cell wall, signal transduction pathway. These results were integrated with T. indica genome sequence for identification of homologs of candidate pathogenicity/virulence related proteins. Protein identified in TiK isolate as malate dehydrogenase that converts malate to oxaloacetate which is precursor of oxalic acid. Oxalic acid is key pathogenicity factor in phytopathogenic fungi. These results were validated by GC-MS based metabolic profiling of T. indica isolates indicating that oxalic acid was exclusively identified in TiK isolate. Thus, integrated omics approaches leads to identification of pathogenicity/virulence factor(s) that would provide insights into pathogenic mechanisms of fungi and aid in devising effective disease management strategies.
Galisson, Frederic; Mahrouche, Louiza; Courcelles, Mathieu; Bonneil, Eric; Meloche, Sylvain; Chelbi-Alix, Mounira K.; Thibault, Pierre
2011-01-01
The small ubiquitin-related modifier (SUMO) is a small group of proteins that are reversibly attached to protein substrates to modify their functions. The large scale identification of protein SUMOylation and their modification sites in mammalian cells represents a significant challenge because of the relatively small number of in vivo substrates and the dynamic nature of this modification. We report here a novel proteomics approach to selectively enrich and identify SUMO conjugates from human cells. We stably expressed different SUMO paralogs in HEK293 cells, each containing a His6 tag and a strategically located tryptic cleavage site at the C terminus to facilitate the recovery and identification of SUMOylated peptides by affinity enrichment and mass spectrometry. Tryptic peptides with short SUMO remnants offer significant advantages in large scale SUMOylome experiments including the generation of paralog-specific fragment ions following CID and ETD activation, and the identification of modified peptides using conventional database search engines such as Mascot. We identified 205 unique protein substrates together with 17 precise SUMOylation sites present in 12 SUMO protein conjugates including three new sites (Lys-380, Lys-400, and Lys-497) on the protein promyelocytic leukemia. Label-free quantitative proteomics analyses on purified nuclear extracts from untreated and arsenic trioxide-treated cells revealed that all identified SUMOylated sites of promyelocytic leukemia were differentially SUMOylated upon stimulation. PMID:21098080
Analysis of secreted proteins from Aspergillus flavus.
Medina, Martha L; Haynes, Paul A; Breci, Linda; Francisco, Wilson A
2005-08-01
MS/MS techniques in proteomics make possible the identification of proteins from organisms with little or no genome sequence information available. Peptide sequences are obtained from tandem mass spectra by matching peptide mass and fragmentation information to protein sequence information from related organisms, including unannotated genome sequence data. This peptide identification data can then be grouped and reconstructed into protein data. In this study, we have used this approach to study protein secretion by Aspergillus flavus, a filamentous fungus for which very little genome sequence information is available. A. flavus is capable of degrading the flavonoid rutin (quercetin 3-O-glycoside), as the only source of carbon via an extracellular enzyme system. In this continuing study, a proteomic analysis was used to identify secreted proteins from A. flavus when grown on rutin. The growth media glucose and potato dextrose were used to identify differentially expressed secreted proteins. The secreted proteins were analyzed by 1- and 2-DE and MS/MS. A total of 51 unique A. flavus secreted proteins were identified from the three growth conditions. Ten proteins were unique to rutin-, five to glucose- and one to potato dextrose-grown A. flavus. Sixteen secreted proteins were common to all three media. Fourteen identifications were of hypothetical proteins or proteins of unknown functions. To our knowledge, this is the first extensive proteomic study conducted to identify the secreted proteins from a filamentous fungus.
Selective staining of proteins with hydrophobic surface sites on a native electrophoretic gel.
Bertsch, Martina; Kassner, Richard J
2003-01-01
Chemical proteomics aims to characterize all of the proteins in the proteome with respect to their function, which is associated with their interaction with other molecules. We propose the identification of a subproteomic library of expressed proteins whose native structures are typified by the presence of hydrophobic surface sites, which are often involved in interactions with small molecules, membrane lipids, and other proteins, pertaining to their functions. We demonstrate that soluble globular proteins with hydrophobic surface sites can be detected selectively by staining on an electrophoretic gel run under nondenaturing conditions. The application of these staining techniques may help elucidate new catalytic, transport, and regulatory functionalities in complex proteomic screenings.
17th Chromosome-Centric Human Proteome Project Symposium in Tehran.
Meyfour, Anna; Pahlavan, Sara; Sobhanian, Hamid; Salekdeh, Ghasem Hosseini
2018-04-01
This report describes the 17th Chromosome-Centric Human Proteome Project which was held in Tehran, Iran, April 27 and 28, 2017. A brief summary of the symposium's talks including new technical and computational approaches for the identification of novel proteins from non-coding genomic regions, physicochemical and biological causes of missing proteins, and the close interactions between Chromosome- and Biology/Disease-driven Human Proteome Project are presented. A synopsis of decisions made on the prospective programs to maintain collaborative works, share resources and information, and establishment of a newly organized working group, the task force for missing protein analysis are discussed. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Introducing the PRIDE Archive RESTful web services.
Reisinger, Florian; del-Toro, Noemi; Ternent, Tobias; Hermjakob, Henning; Vizcaíno, Juan Antonio
2015-07-01
The PRIDE (PRoteomics IDEntifications) database is one of the world-leading public repositories of mass spectrometry (MS)-based proteomics data and it is a founding member of the ProteomeXchange Consortium of proteomics resources. In the original PRIDE database system, users could access data programmatically by accessing the web services provided by the PRIDE BioMart interface. New REST (REpresentational State Transfer) web services have been developed to serve the most popular functionality provided by BioMart (now discontinued due to data scalability issues) and address the data access requirements of the newly developed PRIDE Archive. Using the API (Application Programming Interface) it is now possible to programmatically query for and retrieve peptide and protein identifications, project and assay metadata and the originally submitted files. Searching and filtering is also possible by metadata information, such as sample details (e.g. species and tissues), instrumentation (mass spectrometer), keywords and other provided annotations. The PRIDE Archive web services were first made available in April 2014. The API has already been adopted by a few applications and standalone tools such as PeptideShaker, PRIDE Inspector, the Unipept web application and the Python-based BioServices package. This application is free and open to all users with no login requirement and can be accessed at http://www.ebi.ac.uk/pride/ws/archive/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Proteomics and bioinformatics strategies to design countermeasures against infectious threat agents.
Khan, Akbar S; Mujer, Cesar V; Alefantis, Timothy G; Connolly, Joseph P; Mayr, Ulrike Beate; Walcher, Petra; Lubitz, Werner; Delvecchio, Vito G
2006-01-01
The potential devastation resulting from an intentional outbreak caused by biological warfare agents such as Brucella abortus and Bacillus anthracis underscores the need for next generation vaccines. Proteomics, genomics, and systems biology approaches coupled with the bacterial ghost (BG) vaccine delivery strategy offer an ideal approach for developing safer, cost-effective, and efficacious vaccines for human use in a relatively rapid time frame. Critical to any subunit vaccine development strategy is the identification of a pathogen's proteins with the greatest potential of eliciting a protective immune response. These proteins are collectively referred to as the pathogen's immunome. Proteomics provides high-resolution identification of these immunogenic proteins using standard proteomic technologies, Western blots probed with antisera from infected patients, and the pathogen's sequenced and annotated genome. Selected immunoreactive proteins can be then cloned and expressed in nonpathogenic Gram-negative bacteria. Subsequently, a temperature shift or chemical induction process is initiated to induce expression of the PhiX174 E-lysis gene, whose protein product forms an E tunnel between the inner and outer membrane of the bacteria, expelling all intracellular contents. The BG vaccine system is a proven strategy developed for many different pathogens and tested in a complete array of animal models. The BG vaccine system also has great potential for producing multiagent vaccines for protection to multiple species in a single formulation.
Bacterial membrane proteomics.
Poetsch, Ansgar; Wolters, Dirk
2008-10-01
About one quarter to one third of all bacterial genes encode proteins of the inner or outer bacterial membrane. These proteins perform essential physiological functions, such as the import or export of metabolites, the homeostasis of metal ions, the extrusion of toxic substances or antibiotics, and the generation or conversion of energy. The last years have witnessed completion of a plethora of whole-genome sequences of bacteria important for biotechnology or medicine, which is the foundation for proteome and other functional genome analyses. In this review, we discuss the challenges in membrane proteome analysis, starting from sample preparation and leading to MS-data analysis and quantification. The current state of available proteomics technologies as well as their advantages and disadvantages will be described with a focus on shotgun proteomics. Then, we will briefly introduce the most abundant proteins and protein families present in bacterial membranes before bacterial membrane proteomics studies of the last years will be presented. It will be shown how these works enlarged our knowledge about the physiological adaptations that take place in bacteria during fine chemical production, bioremediation, protein overexpression, and during infections. Furthermore, several examples from literature demonstrate the suitability of membrane proteomics for the identification of antigens and different pathogenic strains, as well as the elucidation of membrane protein structure and function.
Cell death proteomics database: consolidating proteomics data on cell death.
Arntzen, Magnus Ø; Bull, Vibeke H; Thiede, Bernd
2013-05-03
Programmed cell death is a ubiquitous process of utmost importance for the development and maintenance of multicellular organisms. More than 10 different types of programmed cell death forms have been discovered. Several proteomics analyses have been performed to gain insight in proteins involved in the different forms of programmed cell death. To consolidate these studies, we have developed the cell death proteomics (CDP) database, which comprehends data from apoptosis, autophagy, cytotoxic granule-mediated cell death, excitotoxicity, mitotic catastrophe, paraptosis, pyroptosis, and Wallerian degeneration. The CDP database is available as a web-based database to compare protein identifications and quantitative information across different experimental setups. The proteomics data of 73 publications were integrated and unified with protein annotations from UniProt-KB and gene ontology (GO). Currently, more than 6,500 records of more than 3,700 proteins are included in the CDP. Comparing apoptosis and autophagy using overrepresentation analysis of GO terms, the majority of enriched processes were found in both, but also some clear differences were perceived. Furthermore, the analysis revealed differences and similarities of the proteome between autophagosomal and overall autophagy. The CDP database represents a useful tool to consolidate data from proteome analyses of programmed cell death and is available at http://celldeathproteomics.uio.no.
Analyzing large-scale proteomics projects with latent semantic indexing.
Klie, Sebastian; Martens, Lennart; Vizcaíno, Juan Antonio; Côté, Richard; Jones, Phil; Apweiler, Rolf; Hinneburg, Alexander; Hermjakob, Henning
2008-01-01
Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.
Progress on the HUPO Draft Human Proteome: 2017 Metrics of the Human Proteome Project.
Omenn, Gilbert S; Lane, Lydie; Lundberg, Emma K; Overall, Christopher M; Deutsch, Eric W
2017-12-01
The Human Proteome Organization (HUPO) Human Proteome Project (HPP) continues to make progress on its two overall goals: (1) completing the protein parts list, with an annual update of the HUPO draft human proteome, and (2) making proteomics an integrated complement to genomics and transcriptomics throughout biomedical and life sciences research. neXtProt version 2017-01-23 has 17 008 confident protein identifications (Protein Existence [PE] level 1) that are compliant with the HPP Guidelines v2.1 ( https://hupo.org/Guidelines ), up from 13 664 in 2012-12 and 16 518 in 2016-04. Remaining to be found by mass spectrometry and other methods are 2579 "missing proteins" (PE2+3+4), down from 2949 in 2016. PeptideAtlas 2017-01 has 15 173 canonical proteins, accounting for nearly all of the 15 290 PE1 proteins based on MS data. These resources have extensive data on PTMs, single amino acid variants, and splice isoforms. The Human Protein Atlas v16 has 10 492 highly curated protein entries with tissue and subcellular spatial localization of proteins and transcript expression. Organ-specific popular protein lists have been generated for broad use in quantitative targeted proteomics using SRM-MS or DIA-SWATH-MS studies of biology and disease.
Couto, Narciso; Schooling, Sarah R; Dutcher, John R; Barber, Jill
2015-10-02
In the present work, two different proteomic platforms, gel-based and gel-free, were used to map the matrix and outer membrane vesicle exoproteomes of Pseudomonas aeruginosa PAO1 biofilms. These two proteomic strategies allowed us a confident identification of 207 and 327 proteins from enriched outer membrane vesicles and whole matrix isolated from biofilms. Because of the physicochemical characteristics of these subproteomes, the two strategies showed complementarity, and thus, the most comprehensive analysis of P. aeruginosa exoproteome to date was achieved. Under our conditions, outer membrane vesicles contribute approximately 20% of the whole matrix proteome, demonstrating that membrane vesicles are an important component of the matrix. The proteomic profiles were analyzed in terms of their biological context, namely, a biofilm. Accordingly relevant metabolic processes involved in cellular adaptation to the biofilm lifestyle as well as those related to P. aeruginosa virulence capabilities were a key feature of the analyses. The diversity of the matrix proteome corroborates the idea of high heterogeneity within the biofilm; cells can display different levels of metabolism and can adapt to local microenvironments making this proteomic analysis challenging. In addition to analyzing our own primary data, we extend the analysis to published data by other groups in order to deepen our understanding of the complexity inherent within biofilm populations.
Proteomics of Skeletal Muscle: Focus on Insulin Resistance and Exercise Biology
Deshmukh, Atul S.
2016-01-01
Skeletal muscle is the largest tissue in the human body and plays an important role in locomotion and whole body metabolism. It accounts for ~80% of insulin stimulated glucose disposal. Skeletal muscle insulin resistance, a primary feature of Type 2 diabetes, is caused by a decreased ability of muscle to respond to circulating insulin. Physical exercise improves insulin sensitivity and whole body metabolism and remains one of the most promising interventions for the prevention of Type 2 diabetes. Insulin resistance and exercise adaptations in skeletal muscle might be a cause, or consequence, of altered protein expressions profiles and/or their posttranslational modifications (PTMs). Mass spectrometry (MS)-based proteomics offer enormous promise for investigating the molecular mechanisms underlying skeletal muscle insulin resistance and exercise-induced adaptation; however, skeletal muscle proteomics are challenging. This review describes the technical limitations of skeletal muscle proteomics as well as emerging developments in proteomics workflow with respect to samples preparation, liquid chromatography (LC), MS and computational analysis. These technologies have not yet been fully exploited in the field of skeletal muscle proteomics. Future studies that involve state-of-the-art proteomics technology will broaden our understanding of exercise-induced adaptations as well as molecular pathogenesis of insulin resistance. This could lead to the identification of new therapeutic targets. PMID:28248217
Matallana-Surget, Sabine; Derock, Jérémy; Leroy, Baptiste; Badri, Hanène; Deschoenmaeker, Frédéric; Wattiez, Ruddy
2014-01-01
The filamentous cyanobacterium Arthrospira platensis has a long history of use as a food supply and it has been used by the European Space Agency in the MELiSSA project, an artificial microecosystem which supports life during long-term manned space missions. This study assesses progress in the field of cyanobacterial shotgun proteomics and light/dark diurnal cycles by focusing on Arthrospira platensis. Several fractionation workflows including gel-free and gel-based protein/peptide fractionation procedures were used and combined with LC-MS/MS analysis, enabling the overall identification of 1306 proteins, which represents 21% coverage of the theoretical proteome. A total of 30 proteins were found to be significantly differentially regulated under light/dark growth transition. Interestingly, most of the proteins showing differential abundance were related to photosynthesis, the Calvin cycle and translation processes. A novel aspect and major achievement of this work is the successful improvement of the cyanobacterial proteome coverage using a 3D LC-MS/MS approach, based on an immobilized metal affinity chromatography, a suitable tool that enabled us to eliminate the most abundant protein, the allophycocyanin. We also demonstrated that cell growth follows a light/dark cycle in A. platensis. This preliminary proteomic study has highlighted new characteristics of the Arthrospira platensis proteome in terms of diurnal regulation.
Matallana-Surget, Sabine; Derock, Jérémy; Leroy, Baptiste; Badri, Hanène; Deschoenmaeker, Frédéric; Wattiez, Ruddy
2014-01-01
The filamentous cyanobacterium Arthrospira platensis has a long history of use as a food supply and it has been used by the European Space Agency in the MELiSSA project, an artificial microecosystem which supports life during long-term manned space missions. This study assesses progress in the field of cyanobacterial shotgun proteomics and light/dark diurnal cycles by focusing on Arthrospira platensis. Several fractionation workflows including gel-free and gel-based protein/peptide fractionation procedures were used and combined with LC-MS/MS analysis, enabling the overall identification of 1306 proteins, which represents 21% coverage of the theoretical proteome. A total of 30 proteins were found to be significantly differentially regulated under light/dark growth transition. Interestingly, most of the proteins showing differential abundance were related to photosynthesis, the Calvin cycle and translation processes. A novel aspect and major achievement of this work is the successful improvement of the cyanobacterial proteome coverage using a 3D LC-MS/MS approach, based on an immobilized metal affinity chromatography, a suitable tool that enabled us to eliminate the most abundant protein, the allophycocyanin. We also demonstrated that cell growth follows a light/dark cycle in A. platensis. This preliminary proteomic study has highlighted new characteristics of the Arthrospira platensis proteome in terms of diurnal regulation. PMID:24914774
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-01-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose function indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7)1 not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. PMID:27215607
Xu, Ruilian; Tang, Jun; Deng, Quantong; He, Wan; Sun, Xiujie; Xia, Ligang; Cheng, Zhiqiang; He, Lisheng; You, Shuyuan; Hu, Jintao; Fu, Yuxiang; Zhu, Jian; Chen, Yixin; Gao, Weina; He, An; Guo, Zhengyu; Lin, Lin; Li, Hua; Hu, Chaofeng; Tian, Ruijun
2018-05-01
Increasing attention has been focused on cell type proteome profiling for understanding the heterogeneous multicellular microenvironment in tissue samples. However, current cell type proteome profiling methods need large amounts of starting materials which preclude their application to clinical tumor specimens with limited access. Here, by seamlessly combining laser capture microdissection and integrated proteomics sample preparation technology SISPROT, specific cell types in tumor samples could be precisely dissected with single cell resolution and processed for high-sensitivity proteome profiling. Sample loss and contamination due to the multiple transfer steps are significantly reduced by the full integration and noncontact design. H&E staining dyes which are necessary for cell type investigation could be selectively removed by the unique two-stage design of the spintip device. This easy-to-use proteome profiling technology achieved high sensitivity with the identification of more than 500 proteins from only 0.1 mm 2 and 10 μm thickness colon cancer tissue section. The first cell type proteome profiling of four cell types from one colon tumor and surrounding normal tissue, including cancer cells, enterocytes, lymphocytes, and smooth muscle cells, was obtained. 5271, 4691, 4876, and 2140 protein groups were identified, respectively, from tissue section of only 5 mm 2 and 10 μm thickness. Furthermore, spatially resolved proteome distribution profiles of enterocytes, lymphocytes, and smooth muscle cells on the same tissue slices and across four consecutive sections with micrometer distance were successfully achieved. This fully integrated proteomics technology, termed LCM-SISPROT, is therefore promising for spatial-resolution cell type proteome profiling of tumor microenvironment with a minute amount of clinical starting materials.
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes.
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-08-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose function indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7)(1) not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Comprehensive Identification of Proteins from MALDI Imaging*
Maier, Stefan K.; Hahne, Hannes; Gholami, Amin Moghaddas; Balluff, Benjamin; Meding, Stephan; Schoene, Cédrik; Walch, Axel K.; Kuster, Bernhard
2013-01-01
Matrix-assisted laser desorption/ionization imaging mass spectrometry (MALDI IMS) is a powerful tool for the visualization of proteins in tissues and has demonstrated considerable diagnostic and prognostic value. One main challenge is that the molecular identity of such potential biomarkers mostly remains unknown. We introduce a generic method that removes this issue by systematically identifying the proteins embedded in the MALDI matrix using a combination of bottom-up and top-down proteomics. The analyses of ten human tissues lead to the identification of 1400 abundant and soluble proteins constituting the set of proteins detectable by MALDI IMS including >90% of all IMS biomarkers reported in the literature. Top-down analysis of the matrix proteome identified 124 mostly N- and C-terminally fragmented proteins indicating considerable protein processing activity in tissues. All protein identification data from this study as well as the IMS literature has been deposited into MaTisse, a new publically available database, which we anticipate will become a valuable resource for the IMS community. PMID:23782541
USDA-ARS?s Scientific Manuscript database
Matrix-assisted laser desorption/ionization time-of-flight-time-of-flight mass spectrometry(MALDI-TOF-TOF-MS)has provided new capabilities for the rapid identification of digested and non-digested proteins. The tandem (MS/MS) capability of TOF-TOF instruments allows precursor ion selection/isolation...
Hamzeiy, Hamid; Cox, Jürgen
2017-02-01
Computational workflows for mass spectrometry-based shotgun proteomics and untargeted metabolomics share many steps. Despite the similarities, untargeted metabolomics is lagging behind in terms of reliable fully automated quantitative data analysis. We argue that metabolomics will strongly benefit from the adaptation of successful automated proteomics workflows to metabolomics. MaxQuant is a popular platform for proteomics data analysis and is widely considered to be superior in achieving high precursor mass accuracies through advanced nonlinear recalibration, usually leading to five to ten-fold better accuracy in complex LC-MS/MS runs. This translates to a sharp decrease in the number of peptide candidates per measured feature, thereby strongly improving the coverage of identified peptides. We argue that similar strategies can be applied to untargeted metabolomics, leading to equivalent improvements in metabolite identification. Copyright © 2016 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Huang, Rongrong; Chen, Zhongsi; He, Lei; He, Nongyue; Xi, Zhijiang; Li, Zhiyang; Deng, Yan; Zeng, Xin
2017-01-01
There is a critical need for the discovery of novel biomarkers for early detection and targeted therapy of cancer, a major cause of deaths worldwide. In this respect, proteomic technologies, such as mass spectrometry (MS), enable the identification of pathologically significant proteins in various types of samples. MS is capable of high-throughput profiling of complex biological samples including blood, tissues, urine, milk, and cells. MS-assisted proteomics has contributed to the development of cancer biomarkers that may form the foundation for new clinical tests. It can also aid in elucidating the molecular mechanisms underlying cancer. In this review, we discuss MS principles and instrumentation as well as approaches in MS-based proteomics, which have been employed in the development of potential biomarkers. Furthermore, the challenges in validation of MS biomarkers for their use in clinical practice are also reviewed. PMID:28912895
Assessment of Sample Preparation Bias in Mass Spectrometry-Based Proteomics.
Klont, Frank; Bras, Linda; Wolters, Justina C; Ongay, Sara; Bischoff, Rainer; Halmos, Gyorgy B; Horvatovich, Péter
2018-04-17
For mass spectrometry-based proteomics, the selected sample preparation strategy is a key determinant for information that will be obtained. However, the corresponding selection is often not based on a fit-for-purpose evaluation. Here we report a comparison of in-gel (IGD), in-solution (ISD), on-filter (OFD), and on-pellet digestion (OPD) workflows on the basis of targeted (QconCAT-multiple reaction monitoring (MRM) method for mitochondrial proteins) and discovery proteomics (data-dependent acquisition, DDA) analyses using three different human head and neck tissues (i.e., nasal polyps, parotid gland, and palatine tonsils). Our study reveals differences between the sample preparation methods, for example, with respect to protein and peptide losses, quantification variability, protocol-induced methionine oxidation, and asparagine/glutamine deamidation as well as identification of cysteine-containing peptides. However, none of the methods performed best for all types of tissues, which argues against the existence of a universal sample preparation method for proteome analysis.
Banfi, Cristina; Baetta, Roberta; Gianazza, Erica; Tremoli, Elena
2017-06-01
Proteomic-based techniques provide a powerful tool for identifying the full spectrum of protein targets of a drug, elucidating its mechanism(s) of action, and identifying biomarkers of its efficacy and safety. Herein, we outline the technological advancements in the field, and illustrate the contribution of proteomics to the definition of the pharmacological profile of statins, which represent the cornerstone of the prevention and treatment of cardiovascular diseases (CVDs). Statins act by inhibiting 3-hydroxy-3-methyl-glutaryl-coenzyme A (HMG-CoA) reductase, thus reducing cholesterol biosynthesis and consequently enhancing the clearance of low-density lipoproteins from the blood; however, HMG-CoA reductase inhibition can result in a multitude of additional effects beyond lipid lowering, known as 'pleiotropic effects'. The case of statins highlights the unique contribution of proteomics to the target profiling of a drug molecule. Copyright © 2017 Elsevier Ltd. All rights reserved.
Nogueira, Fábio C. S.; Farias, Andreza R. B.; Teixeira, Fabiano M.; Domont, Gilberto B.; Campos, Francisco A. P.
2018-01-01
Label-free quantitative proteome analysis of extrafloral (EFN) and floral nectar (FN) from castor (Ricinus communis) plants resulted in the identification of 72 and 37 proteins, respectively. Thirty proteins were differentially accumulated between EFN and FN, and 24 of these were more abundant in the EFN. In addition to proteins involved in maintaining the nectar pathogen free such as chitinases and glucan 1,3-beta-glucosidase, both proteomes share an array of peptidases, lipases, carbohydrases, and nucleases. A total of 39 of the identified proteins, comprising different classes of hydrolases, were found to have biochemical matching partners in the exudates of at least five genera of carnivorous plants, indicating the EFN and FN possess a potential to digest biological material from microbial, animal or plant origin equivalent to the exudates of carnivorous plants. PMID:29755492
Anthelmintic metabolism in parasitic helminths: proteomic insights.
Brophy, Peter M; MacKintosh, Neil; Morphew, Russell M
2012-08-01
Anthelmintics are the cornerstone of parasitic helminth control. Surprisingly, understanding of the biochemical pathways used by parasitic helminths to detoxify anthelmintics is fragmented, despite the increasing global threat of anthelmintic resistance within the ruminant and equine industries. Reductionist biochemistry has likely over-estimated the enzymatic role of glutathione transferases in anthelmintic metabolism and neglected the potential role of the cytochrome P-450 superfamily (CYPs). Proteomic technologies offers the opportunity to support genomics, reverse genetics and pharmacokinetics, and provide an integrated insight into both the cellular mechanisms underpinning response to anthelmintics and also the identification of biomarker panels for monitoring the development of anthelmintic resistance. To date, there have been limited attempts to include proteomics in anthelmintic metabolism studies. Optimisations of membrane, post-translational modification and interaction proteomic technologies in helminths are needed to especially study Phase I CYPs and Phase III ABC transporter pumps for anthelmintics and their metabolites.
Nogueira, Fábio C S; Farias, Andreza R B; Teixeira, Fabiano M; Domont, Gilberto B; Campos, Francisco A P
2018-01-01
Label-free quantitative proteome analysis of extrafloral (EFN) and floral nectar (FN) from castor ( Ricinus communis ) plants resulted in the identification of 72 and 37 proteins, respectively. Thirty proteins were differentially accumulated between EFN and FN, and 24 of these were more abundant in the EFN. In addition to proteins involved in maintaining the nectar pathogen free such as chitinases and glucan 1,3-beta-glucosidase, both proteomes share an array of peptidases, lipases, carbohydrases, and nucleases. A total of 39 of the identified proteins, comprising different classes of hydrolases, were found to have biochemical matching partners in the exudates of at least five genera of carnivorous plants, indicating the EFN and FN possess a potential to digest biological material from microbial, animal or plant origin equivalent to the exudates of carnivorous plants.
Advances in Proteomics of Mycobacterium leprae.
Parkash, O; Singh, B P
2012-04-01
Although Mycobacterium leprae was the first bacterial pathogen identified causing human disease, it remains one of the few that is non-cultivable. Understanding the biology of M. leprae is one of the primary challenges in current leprosy research. Genomics has been extremely valuable, nonetheless, functional proteins are ultimately responsible for controlling most aspects of cellular functions, which in turn could facilitate parasitizing the host. Furthermore, bacterial proteins provide targets for most of the vaccines and immunodiagnostic tools. Better understanding of the proteomics of M. leprae could also help in developing new drugs against M. leprae. During the past nearly 15 years, there have been several developments towards the identification of M. leprae proteins employing contemporary proteomics tools. In this review, we discuss the knowledge gained on the biology and pathogenesis of M. leprae from current proteomic studies. © 2012 The Authors. Scandinavian Journal of Immunology © 2012 Blackwell Publishing Ltd.
[Progress in the spectral library based protein identification strategy].
Yu, Derui; Ma, Jie; Xie, Zengyan; Bai, Mingze; Zhu, Yunping; Shu, Kunxian
2018-04-25
Exponential growth of the mass spectrometry (MS) data is exhibited when the mass spectrometry-based proteomics has been developing rapidly. It is a great challenge to develop some quick, accurate and repeatable methods to identify peptides and proteins. Nowadays, the spectral library searching has become a mature strategy for tandem mass spectra based proteins identification in proteomics, which searches the experiment spectra against a collection of confidently identified MS/MS spectra that have been observed previously, and fully utilizes the abundance in the spectrum, peaks from non-canonical fragment ions, and other features. This review provides an overview of the implement of spectral library search strategy, and two key steps, spectral library construction and spectral library searching comprehensively, and discusses the progress and challenge of the library search strategy.
Probing the Proteome on Earth and Beyond
NASA Astrophysics Data System (ADS)
Ostrom, P.
2008-12-01
Less than a decade ago, protein sequencing was the bane of paleobiology. Since that time researchers have completely sequenced proteins in >50 Ka fossils, been dazzled by reports of collagen peptides in dinosaur bones, and witnessed the development of phylogenetic trees from ancient protein sequences. Enlisting proteomics as biosignature is now in our grasp. In this talk the pitfalls and challenges of mass spectrometric approaches to protein sequencing will be illustrated and phylogenetic applications will be discussed. Work on extinct organisms at Michigan State University, University of Michigan and York University will provide a vantage point to assess methodologies, explore diagenetic alterations, evaluate mass spectra and illustrate issues associated with data base searching. Challenges encountered in the study of paleoproteomics, such as the absence of sequences for extinct organisms in commercially available databases, protein diagenesis and low concentrations of target are parallel to those that will be encountered when protein sequencing is extended to extreme and extraterrestrial environments. Thus, lessons learned from interrogating the ancient proteome are important and necessary step in developing proteomics as a biosignature tools.
Lawless, Craig; Hubbard, Simon J.; Fan, Jun; Bessant, Conrad; Hermjakob, Henning; Jones, Andrew R.
2012-01-01
Abstract New methods for performing quantitative proteome analyses based on differential labeling protocols or label-free techniques are reported in the literature on an almost monthly basis. In parallel, a correspondingly vast number of software tools for the analysis of quantitative proteomics data has also been described in the literature and produced by private companies. In this article we focus on the review of some of the most popular techniques in the field and present a critical appraisal of several software packages available to process and analyze the data produced. We also describe the importance of community standards to support the wide range of software, which may assist researchers in the analysis of data using different platforms and protocols. It is intended that this review will serve bench scientists both as a useful reference and a guide to the selection and use of different pipelines to perform quantitative proteomics data analysis. We have produced a web-based tool (http://www.proteosuite.org/?q=other_resources) to help researchers find appropriate software for their local instrumentation, available file formats, and quantitative methodology. PMID:22804616
ProGeRF: Proteome and Genome Repeat Finder Utilizing a Fast Parallel Hash Function
Moraes, Walas Jhony Lopes; Rodrigues, Thiago de Souza; Bartholomeu, Daniella Castanheira
2015-01-01
Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing-exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be efficient, fast, and accurate and primarily user-friendly web tool allowing many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity. PMID:25811026
PROTEOMICS OF THE AMNIOTIC FLUID IN ASSESSMENT OF THE PLACENTA – RELEVANCE FOR PRETERM BIRTH
Buhimschi, Irina A.; Buhimschi, Catalin S.
2008-01-01
Proteomics is the study of expressed proteins and has emerged as a complement to genomic research. The major advantage of proteomics over DNA-RNA based technologies is that it more closely relates to phenotype and not the source code. Proteomics thus holds the promise of providing direct insight into the true mechanisms of human disease. Historically, examination of the placenta was the first modality to subclassify pathogenetical entities responsible for preterm birth. Because placenta is a key pathophysiological participant in several major obstetrical syndromes (preterm birth, preeclampsia, intrauterine growth restriction) identification of relevant biomarkers of placental function can profoundly impact on the prediction of fetal outcome and treatment efficacy. Proteomics is a young science and studies that associate proteomic patterns with long-term outcome require follow-up of children up to school age. In the interim, placental pathological footprints of cellular injury can be useful as intermediate outcomes. Furthermore, knowledge of the identity of the dys-regulated proteins may provide the necessary insight into novel pathophysiological pathways and unravel possible targets for therapeutic intervention that could not have been envisioned through hypothesis-driven approaches. PMID:18191197
Proteomics in Heart Failure: Top-down or Bottom-up?
Gregorich, Zachery R.; Chang, Ying-Hua; Ge, Ying
2014-01-01
Summary The pathophysiology of heart failure (HF) is diverse, owing to multiple etiologies and aberrations in a number of cellular processes. Therefore, it is essential to understand how defects in the molecular pathways that mediate cellular responses to internal and external stressors function as a system to drive the HF phenotype. Mass spectrometry (MS)-based proteomics strategies have great potential for advancing our understanding of disease mechanisms at the systems level because proteins are the effector molecules for all cell functions and, thus, are directly responsible for determining cell phenotype. Two MS-based proteomics strategies exist: peptide-based bottom-up and protein-based top-down proteomics—each with its own unique strengths and weaknesses for interrogating the proteome. In this review, we will discuss the advantages and disadvantages of bottom-up and top-down MS for protein identification, quantification, and the analysis of post-translational modifications, as well as highlight how both of these strategies have contributed to our understanding of the molecular and cellular mechanisms underlying HF. Additionally, the challenges associated with both proteomics approaches will be discussed and insights will be offered regarding the future of MS-based proteomics in HF research. PMID:24619480
Zhang, Yixiang; Gao, Peng; Xing, Zhuo; Jin, Shumei; Chen, Zhide; Liu, Lantao; Constantino, Nasie; Wang, Xinwang; Shi, Weibing; Yuan, Joshua S.; Dai, Susie Y.
2013-01-01
High abundance proteins like ribulose-1,5-bisphosphate carboxylase oxygenase (Rubisco) impose a consistent challenge for the whole proteome characterization using shot-gun proteomics. To address this challenge, we developed and evaluated Polyethyleneimine Assisted Rubisco Cleanup (PARC) as a new method by combining both abundant protein removal and fractionation. The new approach was applied to a plant insect interaction study to validate the platform and investigate mechanisms for plant defense against herbivorous insects. Our results indicated that PARC can effectively remove Rubisco, improve the protein identification, and discover almost three times more differentially regulated proteins. The significantly enhanced shot-gun proteomics performance was translated into in-depth proteomic and molecular mechanisms for plant insect interaction, where carbon re-distribution was used to play an essential role. Moreover, the transcriptomic validation also confirmed the reliability of PARC analysis. Finally, functional studies were carried out for two differentially regulated genes as revealed by PARC analysis. Insect resistance was induced by over-expressing either jacalin-like or cupin-like genes in rice. The results further highlighted that PARC can serve as an effective strategy for proteomics analysis and gene discovery. PMID:23943779
Proteomics Insights into Autophagy.
Cudjoe, Emmanuel K; Saleh, Tareq; Hawkridge, Adam M; Gewirtz, David A
2017-10-01
Autophagy, a conserved cellular process by which cells recycle their contents either to maintain basal homeostasis or in response to external stimuli, has for the past two decades become one of the most studied physiological processes in cell biology. The 2016 Nobel Prize in Medicine and Biology awarded to Dr. Ohsumi Yoshinori, one of the first scientists to characterize this cellular mechanism, attests to its importance. The induction and consequent completion of the process of autophagy results in wide ranging changes to the cellular proteome as well as the secretome. MS-based proteomics affords the ability to measure, in an unbiased manner, the ubiquitous changes that occur when autophagy is initiated and progresses in the cell. The continuous improvements and advances in mass spectrometers, especially relating to ionization sources and detectors, coupled with advances in proteomics experimental design, has made it possible to study autophagy, among other process, in great detail. Innovative labeling strategies and protein separation techniques as well as complementary methods including immuno-capture/blotting/staining have been used in proteomics studies to provide more specific protein identification. In this review, we will discuss recent advances in proteomics studies focused on autophagy. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
UNiquant, a program for quantitative proteomics analysis using stable isotope labeling.
Huang, Xin; Tolmachev, Aleksey V; Shen, Yulei; Liu, Miao; Huang, Lin; Zhang, Zhixin; Anderson, Gordon A; Smith, Richard D; Chan, Wing C; Hinrichs, Steven H; Fu, Kai; Ding, Shi-Jian
2011-03-04
Stable isotope labeling (SIL) methods coupled with nanoscale liquid chromatography and high resolution tandem mass spectrometry are increasingly useful for elucidation of the proteome-wide differences between multiple biological samples. Development of more effective programs for the sensitive identification of peptide pairs and accurate measurement of the relative peptide/protein abundance are essential for quantitative proteomic analysis. We developed and evaluated the performance of a new program, termed UNiquant, for analyzing quantitative proteomics data using stable isotope labeling. UNiquant was compared with two other programs, MaxQuant and Mascot Distiller, using SILAC-labeled complex proteome mixtures having either known or unknown heavy/light ratios. For the SILAC-labeled Jeko-1 cell proteome digests with known heavy/light ratios (H/L = 1:1, 1:5, and 1:10), UNiquant quantified a similar number of peptide pairs as MaxQuant for the H/L = 1:1 and 1:5 mixtures. In addition, UNiquant quantified significantly more peptides than MaxQuant and Mascot Distiller in the H/L = 1:10 mixtures. UNiquant accurately measured relative peptide/protein abundance without the need for postmeasurement normalization of peptide ratios, which is required by the other programs.
UNiquant, a Program for Quantitative Proteomics Analysis Using Stable Isotope Labeling
Huang, Xin; Tolmachev, Aleksey V.; Shen, Yulei; Liu, Miao; Huang, Lin; Zhang, Zhixin; Anderson, Gordon A.; Smith, Richard D.; Chan, Wing C.; Hinrichs, Steven H.; Fu, Kai; Ding, Shi-Jian
2011-01-01
Stable isotope labeling (SIL) methods coupled with nanoscale liquid chromatography and high resolution tandem mass spectrometry are increasingly useful for elucidation of the proteome-wide differences between multiple biological samples. Development of more effective programs for the sensitive identification of peptide pairs and accurate measurement of the relative peptide/protein abundance are essential for quantitative proteomic analysis. We developed and evaluated the performance of a new program, termed UNiquant, for analyzing quantitative proteomics data using stable isotope labeling. UNiquant was compared with two other programs, MaxQuant and Mascot Distiller, using SILAC-labeled complex proteome mixtures having either known or unknown heavy/light ratios. For the SILAC-labeled Jeko-1 cell proteome digests with known heavy/light ratios (H/L = 1:1, 1:5, and 1:10), UNiquant quantified a similar number of peptide pairs as MaxQuant for the H/L = 1:1 and 1:5 mixtures. In addition, UNiquant quantified significantly more peptides than MaxQuant and Mascot Distiller in the H/L = 1:10 mixtures. UNiquant accurately measured relative peptide/protein abundance without the need for post-measurement normalization of peptide ratios, which is required by the other programs. PMID:21158445
Sloothaak, J; Odoni, D I; de Graaff, L H; Martins Dos Santos, V A P; Schaap, P J; Tamayo-Ramos, J A
2015-01-01
The development of biological processes that replace the existing petrochemical-based industry is one of the biggest challenges in biotechnology. Aspergillus niger is one of the main industrial producers of lignocellulolytic enzymes, which are used in the conversion of lignocellulosic feedstocks into fermentable sugars. Both the hydrolytic enzymes responsible for lignocellulose depolymerisation and the molecular mechanisms controlling their expression have been well described, but little is known about the transport systems for sugar uptake in A. niger. Understanding the transportome of A. niger is essential to achieve further improvements at strain and process design level. Therefore, this study aims to identify and classify A. niger sugar transporters, using newly developed tools for in silico and in vivo analysis of its membrane-associated proteome. In the present research work, a hidden Markov model (HMM), that shows a good performance in the identification and segmentation of functionally validated glucose transporters, was constructed. The model (HMMgluT) was used to analyse the A. niger membrane-associated proteome response to high and low glucose concentrations at a low pH. By combining the abundance patterns of the proteins found in the A. niger plasmalemma proteome with their HMMgluT scores, two new putative high-affinity glucose transporters, denoted MstG and MstH, were identified. MstG and MstH were functionally validated and biochemically characterised by heterologous expression in a S. cerevisiae glucose transport null mutant. They were shown to be a high-affinity glucose transporter (K m = 0.5 ± 0.04 mM) and a very high-affinity glucose transporter (K m = 0.06 ± 0.005 mM), respectively. This study, focusing for the first time on the membrane-associated proteome of the industrially relevant organism A. niger, shows the global response of the transportome to the availability of different glucose concentrations. Analysis of the A. niger transportome with the newly developed HMMgluT showed to be an efficient approach for the identification and classification of new glucose transporters.
Paasch, Uwe; Heidenreich, Falk; Pursche, Theresia; Kuhlisch, Eberhard; Kettner, Karina; Grunewald, Sonja; Kratzsch, Jürgen; Dittmar, Gunnar; Glander, Hans-Jürgen; Hoflack, Bernard; Kriegel, Thomas M
2011-08-01
Metabolic disorders like diabetes mellitus and obesity may compromise the fertility of men and women. To unveil disease-associated proteomic changes potentially affecting male fertility, the proteomes of sperm cells from type-1 diabetic, type-2 diabetic, non-diabetic obese and clinically healthy individuals were comparatively analyzed by difference gel electrophoresis. The adaptation of a general protein extraction procedure to the solubilization of proteins from sperm cells allowed for the resolution of 3187 fluorescent spots in the difference gel electrophoresis image of the master gel, which contained the entirety of solubilized sperm proteins. Comparison of the pathological and reference proteomes by applying an average abundance ratio setting of 1.6 and a p ≤ 0.05 criterion resulted in the identification of 79 fluorescent spots containing proteins that were present at significantly changed levels in the sperm cells. Biometric evaluation of the fluorescence data followed by mass spectrometric protein identification revealed altered levels of 12, 71, and 13 protein species in the proteomes of the type-1 diabetic, type-2 diabetic, and non-diabetic obese patients, respectively, with considerably enhanced amounts of the same set of one molecular form of semenogelin-1, one form of clusterin, and two forms of lactotransferrin in each group of pathologic samples. Remarkably, β-galactosidase-1-like protein was the only protein that was detected at decreased levels in all three pathologic situations. The former three proteins are part of the eppin (epididymal proteinase inhibitor) protein complex, which is thought to fulfill fertilization-related functions, such as ejaculate sperm protection, motility regulation and gain of competence for acrosome reaction, whereas the putative role of the latter protein to function as a glycosyl hydrolase during sperm maturation remains to be explored at the protein/enzyme level. The strikingly similar differences detected in the three groups of pathological sperm proteomes reflect a disease-associated enhanced formation of predominantly proteolytically modified forms of three eppin protein complex components, possibly as a response to enduring hyperglycemia and enhanced oxidative stress.
The secrets of Oriental panacea: Panax ginseng.
Colzani, Mara; Altomare, Alessandra; Caliendo, Matteo; Aldini, Giancarlo; Righetti, Pier Giorgio; Fasoli, Elisa
2016-01-01
The Panax ginseng root proteome has been investigated via capture with combinatorial peptide ligand libraries (CPLL) at three different pH values. Proteomic characterization by SDS-PAGE and nLC–MS/MS analysis, via LTQ-Orbitrap XL, led to the identification of a total of 207 expressed proteins. This quite large number of identifications was achieved by consulting two different plant databases: P. ginseng and Arabidopsis thaliana. The major groups of identified proteins were associated to structural species (19.2%), oxidoreductase (19.5%), dehydrogenases (7.6%) and synthases (9.0%). For the first time, an exploration of protein–protein interactions was performed by merging all recognized proteins and building an interactomic map, characterized by 196 nodes and 1554 interactions. Finally a peptidomic analysis was developed combining different in-silico enzymatic digestions to simulate the human gastrointestinal process: from 661 generated peptides, 95 were identified as possible bioactives and in particular 6 of them were characterized by antimicrobial activity. The present report offers new insight for future investigations focused on elucidation of biological properties of P. ginseng proteome and peptidome. Ginseng is a traditional oriental herbal remedy whose use is very diffused in all the world for its numerous pharmacological effects. However, the exact mechanism of action of ginseng components, both ginsenosides and proteins, is still unidentified. So the common use of ginseng requires strict investigations to assess both its efficiency and its safety. Although many reports have been published regarding the pharmacological effects of ginseng, little is known about the biochemical pathways of root. Proteomics analysis could be useful to elucidate the physiological pathways. In this manuscript, an integrated approach to proteomics and peptidomics will usher in exploration of Panax ginseng proteins and proteolytic peptides, obtained by in-silico gastrointestinal digestion, characterized by antimicrobial action. The present research would pave the way for better knowledge of metabolic functions connected with ginseng proteome and provide with new information necessary to understand better antimicrobial activity of P. ginseng.
Sedeek, Khalid E M; Qi, Weihong; Schauer, Monica A; Gupta, Alok K; Poveda, Lucy; Xu, Shuqing; Liu, Zhong-Jian; Grossniklaus, Ueli; Schiestl, Florian P; Schlüter, Philipp M
2013-01-01
Sexually deceptive orchids of the genus Ophrys mimic the mating signals of their pollinator females to attract males as pollinators. This mode of pollination is highly specific and leads to strong reproductive isolation between species. This study aims to identify candidate genes responsible for pollinator attraction and reproductive isolation between three closely related species, O. exaltata, O. sphegodes and O. garganica. Floral traits such as odour, colour and morphology are necessary for successful pollinator attraction. In particular, different odour hydrocarbon profiles have been linked to differences in specific pollinator attraction among these species. Therefore, the identification of genes involved in these traits is important for understanding the molecular basis of pollinator attraction by sexually deceptive orchids. We have created floral reference transcriptomes and proteomes for these three Ophrys species using a combination of next-generation sequencing (454 and Solexa), Sanger sequencing, and shotgun proteomics (tandem mass spectrometry). In total, 121 917 unique transcripts and 3531 proteins were identified. This represents the first orchid proteome and transcriptome from the orchid subfamily Orchidoideae. Proteome data revealed proteins corresponding to 2644 transcripts and 887 proteins not observed in the transcriptome. Candidate genes for hydrocarbon and anthocyanin biosynthesis were represented by 156 and 61 unique transcripts in 20 and 7 genes classes, respectively. Moreover, transcription factors putatively involved in the regulation of flower odour, colour and morphology were annotated, including Myb, MADS and TCP factors. Our comprehensive data set generated by combining transcriptome and proteome technologies allowed identification of candidate genes for pollinator attraction and reproductive isolation among sexually deceptive orchids. This includes genes for hydrocarbon and anthocyanin biosynthesis and regulation, and the development of floral morphology. These data will serve as an invaluable resource for research in orchid floral biology, enabling studies into the molecular mechanisms of pollinator attraction and speciation.
Sedeek, Khalid E. M.; Qi, Weihong; Schauer, Monica A.; Gupta, Alok K.; Poveda, Lucy; Xu, Shuqing; Liu, Zhong-Jian; Grossniklaus, Ueli; Schiestl, Florian P.; Schlüter, Philipp M.
2013-01-01
Background Sexually deceptive orchids of the genus Ophrys mimic the mating signals of their pollinator females to attract males as pollinators. This mode of pollination is highly specific and leads to strong reproductive isolation between species. This study aims to identify candidate genes responsible for pollinator attraction and reproductive isolation between three closely related species, O. exaltata, O. sphegodes and O. garganica. Floral traits such as odour, colour and morphology are necessary for successful pollinator attraction. In particular, different odour hydrocarbon profiles have been linked to differences in specific pollinator attraction among these species. Therefore, the identification of genes involved in these traits is important for understanding the molecular basis of pollinator attraction by sexually deceptive orchids. Results We have created floral reference transcriptomes and proteomes for these three Ophrys species using a combination of next-generation sequencing (454 and Solexa), Sanger sequencing, and shotgun proteomics (tandem mass spectrometry). In total, 121 917 unique transcripts and 3531 proteins were identified. This represents the first orchid proteome and transcriptome from the orchid subfamily Orchidoideae. Proteome data revealed proteins corresponding to 2644 transcripts and 887 proteins not observed in the transcriptome. Candidate genes for hydrocarbon and anthocyanin biosynthesis were represented by 156 and 61 unique transcripts in 20 and 7 genes classes, respectively. Moreover, transcription factors putatively involved in the regulation of flower odour, colour and morphology were annotated, including Myb, MADS and TCP factors. Conclusion Our comprehensive data set generated by combining transcriptome and proteome technologies allowed identification of candidate genes for pollinator attraction and reproductive isolation among sexually deceptive orchids. This includes genes for hydrocarbon and anthocyanin biosynthesis and regulation, and the development of floral morphology. These data will serve as an invaluable resource for research in orchid floral biology, enabling studies into the molecular mechanisms of pollinator attraction and speciation. PMID:23734209
Liu, Suli; Im, Hogune; Bairoch, Amos; Cristofanilli, Massimo; Chen, Rui; Deutsch, Eric W; Dalton, Stephen; Fenyo, David; Fanayan, Susan; Gates, Chris; Gaudet, Pascale; Hincapie, Marina; Hanash, Samir; Kim, Hoguen; Jeong, Seul-Ki; Lundberg, Emma; Mias, George; Menon, Rajasree; Mu, Zhaomei; Nice, Edouard; Paik, Young-Ki; Uhlen, Mathias; Wells, Lance; Wu, Shiaw-Lin; Yan, Fangfei; Zhang, Fan; Zhang, Yue; Snyder, Michael; Omenn, Gilbert S; Beavis, Ronald C; Hancock, William S
2013-01-04
We report progress assembling the parts list for chromosome 17 and illustrate the various processes that we have developed to integrate available data from diverse genomic and proteomic knowledge bases. As primary resources, we have used GPMDB, neXtProt, PeptideAtlas, Human Protein Atlas (HPA), and GeneCards. All sites share the common resource of Ensembl for the genome modeling information. We have defined the chromosome 17 parts list with the following information: 1169 protein-coding genes, the numbers of proteins confidently identified by various experimental approaches as documented in GPMDB, neXtProt, PeptideAtlas, and HPA, examples of typical data sets obtained by RNASeq and proteomic studies of epithelial derived tumor cell lines (disease proteome) and a normal proteome (peripheral mononuclear cells), reported evidence of post-translational modifications, and examples of alternative splice variants (ASVs). We have constructed a list of the 59 "missing" proteins as well as 201 proteins that have inconclusive mass spectrometric (MS) identifications. In this report we have defined a process to establish a baseline for the incorporation of new evidence on protein identification and characterization as well as related information from transcriptome analyses. This initial list of "missing" proteins that will guide the selection of appropriate samples for discovery studies as well as antibody reagents. Also we have illustrated the significant diversity of protein variants (including post-translational modifications, PTMs) using regions on chromosome 17 that contain important oncogenes. We emphasize the need for mandated deposition of proteomics data in public databases, the further development of improved PTM, ASV, and single nucleotide variant (SNV) databases, and the construction of Web sites that can integrate and regularly update such information. In addition, we describe the distribution of both clustered and scattered sets of protein families on the chromosome. Since chromosome 17 is rich in cancer-associated genes, we have focused the clustering of cancer-associated genes in such genomic regions and have used the ERBB2 amplicon as an example of the value of a proteogenomic approach in which one integrates transcriptomic with proteomic information and captures evidence of coexpression through coordinated regulation.
Unbiased and targeted mass spectrometry for the HDL proteome.
Singh, Sasha A; Aikawa, Masanori
2017-02-01
Mass spectrometry is an ever evolving technology that is equipped with a variety of tools for protein research. Some lipoprotein studies, especially those pertaining to HDL biology, have been exploiting the versatility of mass spectrometry to understand HDL function through its proteome. Despite the role of mass spectrometry in advancing research as a whole, however, the technology remains obscure to those without hands on experience, but still wishing to understand it. In this review, we walk the reader through the coevolution of common mass spectrometry workflows and HDL research, starting from the basic unbiased mass spectrometry methods used to profile the HDL proteome to the most recent targeted methods that have enabled an unprecedented view of HDL metabolism. Unbiased global proteomics have demonstrated that the HDL proteome is organized into subgroups across the HDL size fractions providing further evidence that HDL functional heterogeneity is in part governed by its varying protein constituents. Parallel reaction monitoring, a novel targeted mass spectrometry method, was used to monitor the metabolism of HDL apolipoproteins in humans and revealed that apolipoproteins contained within the same HDL size fraction exhibit diverse metabolic properties. Mass spectrometry provides a variety of tools and strategies to facilitate understanding, through its proteins, the complex biology of HDL.
2017-05-19
LightCycler® 96 desktop software. Positive and negative samples were identified using the “ Qualitative Detection” analysis function using the default...Institute of Infectious Diseases, Fort Detrick, MD 21702, United States A R T I C L E I N F O Keywords: West Nile virus Virus inactivation Sample buffer... samples using a commercially available SDS- PAGE sample buffer for proteomic studies. Using this method, we demonstrate its utility by identification
Rapid development of Proteomic applications with the AIBench framework.
López-Fernández, Hugo; Reboiro-Jato, Miguel; Glez-Peña, Daniel; Méndez Reboredo, José R; Santos, Hugo M; Carreira, Ricardo J; Capelo-Martínez, José L; Fdez-Riverola, Florentino
2011-09-15
In this paper we present two case studies of Proteomics applications development using the AIBench framework, a Java desktop application framework mainly focused in scientific software development. The applications presented in this work are Decision Peptide-Driven, for rapid and accurate protein quantification, and Bacterial Identification, for Tuberculosis biomarker search and diagnosis. Both tools work with mass spectrometry data, specifically with MALDI-TOF spectra, minimizing the time required to process and analyze the experimental data. Copyright 2011 The Author(s). Published by Journal of Integrative Bioinformatics.
Mukherjee, Ashis K; Kalita, Bhargab; Mackessy, Stephen P
2016-07-20
To address the dearth of knowledge on the biochemical composition of Pakistan Russell's Viper (Daboia russelii russelii) venom (RVV), the venom proteome has been analyzed and several biochemical and pharmacological properties of the venom were investigated. SDS-PAGE (reduced) analysis indicated that proteins/peptides in the molecular mass range of ~56.0-105.0kDa, 31.6-51.0kDa, 15.6-30.0kDa, 9.0-14.2kDa and 5.6-7.2kDa contribute approximately 9.8%, 12.1%, 13.4%, 34.1% and 30.5%, respectively of Pakistan RVV. Proteomics analysis of gel-filtration peaks of RVV resulted in identification of 75 proteins/peptides which belong to 14 distinct snake venom protein families. Phospholipases A2 (32.8%), Kunitz type serine protease inhibitors (28.4%), and snake venom metalloproteases (21.8%) comprised the majority of Pakistan RVV proteins, while 11 additional families accounted for 6.5-0.2%. Occurrence of aminotransferase, endo-β-glycosidase, and disintegrins is reported for the first time in RVV. Several of RVV proteins/peptides share significant sequence homology across Viperidae subfamilies. Pakistan RVV was well recognized by both the polyvalent (PAV) and monovalent (MAV) antivenom manufactured in India; nonetheless, immunological cross-reactivity determined by ELISA and neutralization of pro-coagulant/anticoagulant activity of RVV and its fractions by MAV surpassed that of PAV. The study establishes the proteome profile of the Pakistan RVV, thereby indicating the presence of diverse proteins and peptides that play a significant role in the pathophysiology of RVV bite. Further, the proteomic findings will contribute to understand the variation in venom composition owing to different geographical location and identification of pharmacologically important proteins in Pakistan RVV. Copyright © 2016. Published by Elsevier B.V.
A practical data processing workflow for multi-OMICS projects.
Kohl, Michael; Megger, Dominik A; Trippler, Martin; Meckel, Hagen; Ahrens, Maike; Bracht, Thilo; Weber, Frank; Hoffmann, Andreas-Claudius; Baba, Hideo A; Sitek, Barbara; Schlaak, Jörg F; Meyer, Helmut E; Stephan, Christian; Eisenacher, Martin
2014-01-01
Multi-OMICS approaches aim on the integration of quantitative data obtained for different biological molecules in order to understand their interrelation and the functioning of larger systems. This paper deals with several data integration and data processing issues that frequently occur within this context. To this end, the data processing workflow within the PROFILE project is presented, a multi-OMICS project that aims on identification of novel biomarkers and the development of new therapeutic targets for seven important liver diseases. Furthermore, a software called CrossPlatformCommander is sketched, which facilitates several steps of the proposed workflow in a semi-automatic manner. Application of the software is presented for the detection of novel biomarkers, their ranking and annotation with existing knowledge using the example of corresponding Transcriptomics and Proteomics data sets obtained from patients suffering from hepatocellular carcinoma. Additionally, a linear regression analysis of Transcriptomics vs. Proteomics data is presented and its performance assessed. It was shown, that for capturing profound relations between Transcriptomics and Proteomics data, a simple linear regression analysis is not sufficient and implementation and evaluation of alternative statistical approaches are needed. Additionally, the integration of multivariate variable selection and classification approaches is intended for further development of the software. Although this paper focuses only on the combination of data obtained from quantitative Proteomics and Transcriptomics experiments, several approaches and data integration steps are also applicable for other OMICS technologies. Keeping specific restrictions in mind the suggested workflow (or at least parts of it) may be used as a template for similar projects that make use of different high throughput techniques. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Jami, Mohammad-Saeid; García-Estrada, Carlos; Barreiro, Carlos; Cuadrado, Abel-Alberto; Salehi-Najafabadi, Zahra; Martín, Juan-Francisco
2010-01-01
The filamentous fungus Penicillium chrysogenum is well-known by its ability to synthesize β-lactam antibiotics as well as other secondary metabolites. Like other filamentous fungi, this microorganism is an excellent host for secretion of extracellular proteins because of the high capacity of its protein secretion machinery. In this work, we have characterized the extracellular proteome reference map of P. chrysogenum Wisconsin 54–1255 by two-dimensional gel electrophoresis. This method allowed the correct identification of 279 spots by peptide mass fingerprinting and tandem MS. These 279 spots included 328 correctly identified proteins, which corresponded to 131 different proteins and their isoforms. One hundred and two proteins out of 131 were predicted to contain either classical or nonclassical secretion signal peptide sequences, providing evidence of the authentic extracellular location of these proteins. Proteins with higher representation in the extracellular proteome were those involved in plant cell wall degradation (polygalacturonase, pectate lyase, and glucan 1,3-β-glucosidase), utilization of nutrients (extracellular acid phosphatases and 6-hydroxy-d-nicotine oxidase), and stress response (catalase R). This filamentous fungus also secretes enzymes specially relevant for food industry, such as sulfydryl oxidase, dihydroxy-acid dehydratase, or glucoamylase. The identification of several antigens in the extracellular proteome also highlights the importance of this microorganism as one of the main indoor allergens. Comparison of the extracellular proteome among three strains of P. chrysogenum, the wild-type NRRL 1951, the Wis 54–1255 (an improved, moderate penicillin producer), and the AS-P-78 (a penicillin high-producer), provided important insights to consider improved strains of this filamentous fungus as versatile cell-factories of interest, beyond antibiotic production, for other aspects of white biotechnology. PMID:20823121
A Mouse to Human Search for Plasma Proteome Changes Associated with Pancreatic Tumor Development
Faca, Vitor M; Song, Kenneth S; Wang, Hong; Zhang, Qing; Krasnoselsky, Alexei L; Newcomb, Lisa F; Plentz, Ruben R; Gurumurthy, Sushma; Redston, Mark S; Pitteri, Sharon J; Pereira-Faca, Sandra R; Ireton, Renee C; Katayama, Hiroyuki; Glukhova, Veronika; Phanstiel, Douglas; Brenner, Dean E; Anderson, Michelle A; Misek, David; Scholler, Nathalie; Urban, Nicole D; Barnett, Matt J; Edelstein, Cim; Goodman, Gary E; Thornquist, Mark D; McIntosh, Martin W; DePinho, Ronald A; Bardeesy, Nabeel; Hanash, Samir M
2008-01-01
Background The complexity and heterogeneity of the human plasma proteome have presented significant challenges in the identification of protein changes associated with tumor development. Refined genetically engineered mouse (GEM) models of human cancer have been shown to faithfully recapitulate the molecular, biological, and clinical features of human disease. Here, we sought to exploit the merits of a well-characterized GEM model of pancreatic cancer to determine whether proteomics technologies allow identification of protein changes associated with tumor development and whether such changes are relevant to human pancreatic cancer. Methods and Findings Plasma was sampled from mice at early and advanced stages of tumor development and from matched controls. Using a proteomic approach based on extensive protein fractionation, we confidently identified 1,442 proteins that were distributed across seven orders of magnitude of abundance in plasma. Analysis of proteins chosen on the basis of increased levels in plasma from tumor-bearing mice and corroborating protein or RNA expression in tissue documented concordance in the blood from 30 newly diagnosed patients with pancreatic cancer relative to 30 control specimens. A panel of five proteins selected on the basis of their increased level at an early stage of tumor development in the mouse was tested in a blinded study in 26 humans from the CARET (Carotene and Retinol Efficacy Trial) cohort. The panel discriminated pancreatic cancer cases from matched controls in blood specimens obtained between 7 and 13 mo prior to the development of symptoms and clinical diagnosis of pancreatic cancer. Conclusions Our findings indicate that GEM models of cancer, in combination with in-depth proteomic analysis, provide a useful strategy to identify candidate markers applicable to human cancer with potential utility for early detection. PMID:18547137
Poli, Maurizio; Ori, Alessandro; Child, Tim; Jaroudi, Souraya; Spath, Katharina; Beck, Martin; Wells, Dagan
2015-11-01
The use of in vitro fertilization (IVF) has revolutionized the treatment of infertility and is now responsible for 1-5% of all births in industrialized countries. During IVF, it is typical for patients to generate multiple embryos. However, only a small proportion of them possess the genetic and metabolic requirements needed in order to produce a healthy pregnancy. The identification of the embryo with the greatest developmental capacity represents a major challenge for fertility clinics. Current methods for the assessment of embryo competence are proven inefficient, and the inadvertent transfer of non-viable embryos is the principal reason why most IVF treatments (approximately two-thirds) end in failure. In this study, we investigate how the application of proteomic measurements could improve success rates in clinical embryology. We describe a procedure that allows the identification and quantification of proteins of embryonic origin, present in attomole concentrations in the blastocoel, the enclosed fluid-filled cavity that forms within 5-day-old human embryos. By using targeted proteomics, we demonstrate the feasibility of quantifying multiple proteins in samples derived from single blastocoels and that such measurements correlate with aspects of embryo viability, such as chromosomal (ploidy) status. This study illustrates the potential of high-sensitivity proteomics to measure clinically relevant biomarkers in minute samples and, more specifically, suggests that key aspects of embryo competence could be measured using a proteomic-based strategy, with negligible risk of harm to the living embryo. Our work paves the way for the development of "next-generation" embryo competence assessment strategies, based on functional proteomics. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.
Identifier mapping performance for integrating transcriptomics and proteomics experimental results
2011-01-01
Background Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. Results We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. Conclusions The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging. PMID:21619611
Brusniak, Mi-Youn; Bodenmiller, Bernd; Campbell, David; Cooke, Kelly; Eddes, James; Garbutt, Andrew; Lau, Hollis; Letarte, Simon; Mueller, Lukas N; Sharma, Vagisha; Vitek, Olga; Zhang, Ning; Aebersold, Ruedi; Watts, Julian D
2008-01-01
Background Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. Conclusion The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field. PMID:19087345
Ortseifen, Vera; Stolze, Yvonne; Maus, Irena; Sczyrba, Alexander; Bremges, Andreas; Albaum, Stefan P; Jaenicke, Sebastian; Fracowiak, Jochen; Pühler, Alfred; Schlüter, Andreas
2016-08-10
To study the metaproteome of a biogas-producing microbial community, fermentation samples were taken from an agricultural biogas plant for microbial cell and protein extraction and corresponding metagenome analyses. Based on metagenome sequence data, taxonomic community profiling was performed to elucidate the composition of bacterial and archaeal sub-communities. The community's cytosolic metaproteome was represented in a 2D-PAGE approach. Metaproteome databases for protein identification were compiled based on the assembled metagenome sequence dataset for the biogas plant analyzed and non-corresponding biogas metagenomes. Protein identification results revealed that the corresponding biogas protein database facilitated the highest identification rate followed by other biogas-specific databases, whereas common public databases yielded insufficient identification rates. Proteins of the biogas microbiome identified as highly abundant were assigned to the pathways involved in methanogenesis, transport and carbon metabolism. Moreover, the integrated metagenome/-proteome approach enabled the examination of genetic-context information for genes encoding identified proteins by studying neighboring genes on the corresponding contig. Exemplarily, this approach led to the identification of a Methanoculleus sp. contig encoding 16 methanogenesis-related gene products, three of which were also detected as abundant proteins within the community's metaproteome. Thus, metagenome contigs provide additional information on the genetic environment of identified abundant proteins. Copyright © 2016 Elsevier B.V. All rights reserved.
Yu, Wen; Taylor, J Alex; Davis, Michael T; Bonilla, Leo E; Lee, Kimberly A; Auger, Paul L; Farnsworth, Chris C; Welcher, Andrew A; Patterson, Scott D
2010-03-01
Despite recent advances in qualitative proteomics, the automatic identification of peptides with optimal sensitivity and accuracy remains a difficult goal. To address this deficiency, a novel algorithm, Multiple Search Engines, Normalization and Consensus is described. The method employs six search engines and a re-scoring engine to search MS/MS spectra against protein and decoy sequences. After the peptide hits from each engine are normalized to error rates estimated from the decoy hits, peptide assignments are then deduced using a minimum consensus model. These assignments are produced in a series of progressively relaxed false-discovery rates, thus enabling a comprehensive interpretation of the data set. Additionally, the estimated false-discovery rate was found to have good concordance with the observed false-positive rate calculated from known identities. Benchmarking against standard proteins data sets (ISBv1, sPRG2006) and their published analysis, demonstrated that the Multiple Search Engines, Normalization and Consensus algorithm consistently achieved significantly higher sensitivity in peptide identifications, which led to increased or more robust protein identifications in all data sets compared with prior methods. The sensitivity and the false-positive rate of peptide identification exhibit an inverse-proportional and linear relationship with the number of participating search engines.
Peptide de novo sequencing of mixture tandem mass spectra.
Gorshkov, Vladimir; Hotta, Stéphanie Yuki Kolbeck; Verano-Braga, Thiago; Kjeldsen, Frank
2016-09-01
The impact of mixture spectra deconvolution on the performance of four popular de novo sequencing programs was tested using artificially constructed mixture spectra as well as experimental proteomics data. Mixture fragmentation spectra are recognized as a limitation in proteomics because they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co-isolation and thus prone to false identifications. The deconvolution approach matched complementary b-, y-ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co-isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced peptides. The improvement was in the range of 20-35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight candidate peptide score distribution and high sensitivity to small changes in the mass spectrum introduced by the employed deconvolution method could explain some of the missing peptide identifications. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomics of ovarian cancer: functional insights and clinical applications
Elzek, Mohamed A.; Rodland, Karin D.
2015-03-04
In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicability of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification ofmore » aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. In conclusion, we propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs.« less
Saliva Proteomics Analysis Offers Insights on Type 1 Diabetes Pathology in a Pediatric Population
Pappa, Eftychia; Vastardis, Heleni; Mermelekas, George; Gerasimidi-Vazeou, Andriani; Zoidakis, Jerome; Vougas, Konstantinos
2018-01-01
The composition of the salivary proteome is affected by pathological conditions. We analyzed by high resolution mass spectrometry approaches saliva samples collected from children and adolescents with type 1 diabetes and healthy controls. The list of more than 2000 high confidence protein identifications constitutes a comprehensive characterization of the salivary proteome. Patients with good glycemic regulation and healthy individuals have comparable proteomic profiles. In contrast, a significant number of differentially expressed proteins were identified in the saliva of patients with poor glycemic regulation compared to patients with good glycemic control and healthy children. These proteins are involved in biological processes relevant to diabetic pathology such as endothelial damage and inflammation. Moreover, a putative preventive therapeutic approach was identified based on bioinformatic analysis of the deregulated salivary proteins. Thus, thorough characterization of saliva proteins in diabetic pediatric patients established a connection between molecular changes and disease pathology. This proteomic and bioinformatic approach highlights the potential of salivary diagnostics in diabetes pathology and opens the way for preventive treatment of the disease. PMID:29755368
Muraeva, O A; Maltseva, A L; Mikhailova, N A; Granovitch, A I
2015-01-01
Salinity is one of the most important abiotic environmental factors affecting marine animals. If salinity deviate from optimum, adaptive mechanisms switch on to maintain organism's physiological activity. In this study, the reaction of the snails Littorina saxatilis from natural habitats and in response to experimental salinity decreasing was analyzed on proteomic level. The isolation of all snails inside their shells and gradually declining mortality was observed under acute experimental salinity decrease (down to 10 per hundred). Proteomic changes were evaluated in the surviving experimental mollusks compared to control individual using differential 2D gel-electrophoresis (DIGE) and subsequent LC-MS/MS-identification of proteins. Approximately 10% of analyzed proteins underwent up- or down regulation during the experiment. Proteins of folding, antioxidant response, intercellular matrix, cell adhesion, cell signaling and metabolic enzymes were identified among them. Proteome changes observed in experimental hypoosmotic stress partially reproduced in the proteomes of mollusks that live in conditions of natural freshening (estuaries). Possible mechanisms involved in the adaptation process of L. saxatilis individuals to hypo-osmotic stress are discussed.
The speciation of the proteome
Jungblut, Peter R; Holzhütter, Hermann G; Apweiler, Rolf; Schlüter, Hartmut
2008-01-01
Introduction In proteomics a paradox situation developed in the last years. At one side it is basic knowledge that proteins are post-translationally modified and occur in different isoforms. At the other side the protein expression concept disclaims post-translational modifications by connecting protein names directly with function. Discussion Optimal proteome coverage is today reached by bottom-up liquid chromatography/mass spectrometry. But quantification at the peptide level in shotgun or bottom-up approaches by liquid chromatography and mass spectrometry is completely ignoring that a special peptide may exist in an unmodified form and in several-fold modified forms. The acceptance of the protein species concept is a basic prerequisite for meaningful quantitative analyses in functional proteomics. In discovery approaches only top-down analyses, separating the protein species before digestion, identification and quantification by two-dimensional gel electrophoresis or protein liquid chromatography, allow the correlation between changes of a biological situation and function. Conclusion To obtain biological relevant information kinetics and systems biology have to be performed at the protein species level, which is the major challenge in proteomics today. PMID:18638390
Miao, Jun; Chen, Zhao; Wang, Zenglei; Shrestha, Sony; Li, Xiaolian; Li, Runze; Cui, Liwang
2017-04-01
The gametocytes of the malaria parasites are obligate for perpetuating the parasite's life cycle through mosquitoes, but the sex-specific biology of gametocytes is poorly understood. We generated a transgenic line in the human malaria parasite Plasmodium falciparum , which allowed us to accurately separate male and female gametocytes by flow cytometry. In-depth analysis of the proteomes by liquid chromatography-tandem mass spectrometry identified 1244 and 1387 proteins in mature male and female gametocytes, respectively. GFP-tagging of nine selected proteins confirmed their sex-partitions to be agreeable with the results from the proteomic analysis. The sex-specific proteomes showed significant differences that are consistent with the divergent functions of the two sexes. Although the male-specific proteome (119 proteins) is enriched in proteins associated with the flagella and genome replication, the female-specific proteome (262 proteins) is more abundant in proteins involved in metabolism, translation and organellar functions. Compared with the Plasmodium berghei sex-specific proteomes, this study revealed both extensive conservation and considerable divergence between these two species, which reflect the disparities between the two species in proteins involved in cytoskeleton, lipid metabolism and protein degradation. Comparison with three sex-specific proteomes allowed us to obtain high-confidence lists of 73 and 89 core male- and female-specific/biased proteins conserved in Plasmodium The identification of sex-specific/biased proteomes in Plasmodium lays a solid foundation for understanding the molecular mechanisms underlying the unique sex-specific biology in this early-branching eukaryote. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Miao, Jun; Chen, Zhao; Wang, Zenglei; Shrestha, Sony; Li, Xiaolian; Li, Runze; Cui, Liwang
2017-01-01
The gametocytes of the malaria parasites are obligate for perpetuating the parasite's life cycle through mosquitoes, but the sex-specific biology of gametocytes is poorly understood. We generated a transgenic line in the human malaria parasite Plasmodium falciparum, which allowed us to accurately separate male and female gametocytes by flow cytometry. In-depth analysis of the proteomes by liquid chromatography-tandem mass spectrometry identified 1244 and 1387 proteins in mature male and female gametocytes, respectively. GFP-tagging of nine selected proteins confirmed their sex-partitions to be agreeable with the results from the proteomic analysis. The sex-specific proteomes showed significant differences that are consistent with the divergent functions of the two sexes. Although the male-specific proteome (119 proteins) is enriched in proteins associated with the flagella and genome replication, the female-specific proteome (262 proteins) is more abundant in proteins involved in metabolism, translation and organellar functions. Compared with the Plasmodium berghei sex-specific proteomes, this study revealed both extensive conservation and considerable divergence between these two species, which reflect the disparities between the two species in proteins involved in cytoskeleton, lipid metabolism and protein degradation. Comparison with three sex-specific proteomes allowed us to obtain high-confidence lists of 73 and 89 core male- and female-specific/biased proteins conserved in Plasmodium. The identification of sex-specific/biased proteomes in Plasmodium lays a solid foundation for understanding the molecular mechanisms underlying the unique sex-specific biology in this early-branching eukaryote. PMID:28126901
Ararso, Zewdu; Ma, Chuan; Qi, Yuping; Feng, Mao; Han, Bin; Hu, Han; Meng, Lifeng; Li, Jianke
2018-01-05
Hemolymph is vital for the immunity of honeybees and offers a way to investigate their physiological status. To gain novel insight into the functionality and molecular details of the hemolymph in driving increased Royal Jelly (RJ) production, we characterized and compared hemolymph proteomes across the larval and adult ages of Italian bees (ITbs) and Royal Jelly bees (RJbs), a stock selected from ITbs for increasing RJ output. Unprecedented in-depth proteome was attained with the identification of 3394 hemolymph proteins in both bee lines. The changes in proteome support the general function of hemolymph to drive development and immunity across different ages. However, age-specific proteome settings have adapted to prime the distinct physiology for larvae and adult bees. In larvae, the proteome is thought to drive temporal immunity, rapid organogenesis, and reorganization of larval structures. In adults, the proteome plays key roles in prompting tissue development and immune defense in newly emerged bees, in gland maturity in nurse bees, and in carbohydrate energy production in forager bees. Between larval and adult samples of the same age, RJbs and ITbs have tailored distinct hemolymph proteome programs to drive their physiology. In particular, in day 4 larvae and nurse bees, a large number of highly abundant proteins are enriched in protein synthesis and energy metabolism in RJbs. This implies that they have adapted their proteome to initiate different developmental trajectories and high RJ secretion in response to selection for enhanced RJ production. Our hitherto unexplored in-depth proteome coverage provides novel insight into molecular details that drive hemolymph function and high RJ production by RJbs.
Neural Stem Cells (NSCs) and Proteomics*
Shoemaker, Lorelei D.; Kornblum, Harley I.
2016-01-01
Neural stem cells (NSCs) can self-renew and give rise to the major cell types of the CNS. Studies of NSCs include the investigation of primary, CNS-derived cells as well as animal and human embryonic stem cell (ESC)-derived and induced pluripotent stem cell (iPSC)-derived sources. NSCs provide a means with which to study normal neural development, neurodegeneration, and neurological disease and are clinically relevant sources for cellular repair to the damaged and diseased CNS. Proteomics studies of NSCs have the potential to delineate molecules and pathways critical for NSC biology and the means by which NSCs can participate in neural repair. In this review, we provide a background to NSC biology, including the means to obtain them and the caveats to these processes. We then focus on advances in the proteomic interrogation of NSCs. This includes the analysis of posttranslational modifications (PTMs); approaches to analyzing different proteomic compartments, such the secretome; as well as approaches to analyzing temporal differences in the proteome to elucidate mechanisms of differentiation. We also discuss some of the methods that will undoubtedly be useful in the investigation of NSCs but which have not yet been applied to the field. While many proteomics studies of NSCs have largely catalogued the proteome or posttranslational modifications of specific cellular states, without delving into specific functions, some have led to understandings of functional processes or identified markers that could not have been identified via other means. Many challenges remain in the field, including the precise identification and standardization of NSCs used for proteomic analyses, as well as how to translate fundamental proteomics studies to functional biology. The next level of investigation will require interdisciplinary approaches, combining the skills of those interested in the biochemistry of proteomics with those interested in modulating NSC function. PMID:26494823
Time-resolved Global and Chromatin Proteomics during Herpes Simplex Virus Type 1 (HSV-1) Infection*
Kulej, Katarzyna; Avgousti, Daphne C.; Sidoli, Simone; Herrmann, Christin; Della Fera, Ashley N.; Kim, Eui Tae; Garcia, Benjamin A.; Weitzman, Matthew D.
2017-01-01
Herpes simplex virus (HSV-1) lytic infection results in global changes to the host cell proteome and the proteins associated with host chromatin. We present a system level characterization of proteome dynamics during infection by performing a multi-dimensional analysis during HSV-1 lytic infection of human foreskin fibroblast (HFF) cells. Our study includes identification and quantification of the host and viral proteomes, phosphoproteomes, chromatin bound proteomes and post-translational modifications (PTMs) on cellular histones during infection. We analyzed proteomes across six time points of virus infection (0, 3, 6, 9, 12 and 15 h post-infection) and clustered trends in abundance using fuzzy c-means. Globally, we accurately quantified more than 4000 proteins, 200 differently modified histone peptides and 9000 phosphorylation sites on cellular proteins. In addition, we identified 67 viral proteins and quantified 571 phosphorylation events (465 with high confidence site localization) on viral proteins, which is currently the most comprehensive map of HSV-1 phosphoproteome. We investigated chromatin bound proteins by proteomic analysis of the high-salt chromatin fraction and identified 510 proteins that were significantly different in abundance during infection. We found 53 histone marks significantly regulated during virus infection, including a steady increase of histone H3 acetylation (H3K9ac and H3K14ac). Our data provide a resource of unprecedented depth for human and viral proteome dynamics during infection. Collectively, our results indicate that the proteome composition of the chromatin of HFF cells is highly affected during HSV-1 infection, and that phosphorylation events are abundant on viral proteins. We propose that our epi-proteomics approach will prove to be important in the characterization of other model infectious systems that involve changes to chromatin composition. PMID:28179408
Final Report: Proteomic study of brassinosteroid responses in Arabidopsis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Zhiyong; Burlingame, Alma
2017-11-29
The steroid hormone brassinosteroid (BR) is a major growth-promoting phytohormone. The specific aim of the current project is to identify BR-regulated proteins and characterize their functions in various aspects of plant growth, development, and adaptation. Our research has significantly advanced our understanding of how BR signal is transduced from the receptor at the cell surface to changes of nuclear gene expression and other cellular responses such as vesicle trafficking, as well as developmental transitions such as seed germination and flowering. We have also developed effective proteomic methods for quantitative analysis of protein phosphorylation and for identification of glycosylated proteins. Throughmore » this DOE funding, we have performed several proteomic experiments and made major discoveries.« less
Cologna, Nicholas de Mojana di; Gómez-Mendoza, Diana Paola; Zanoelo, Fabiana Fonseca; Giannesi, Giovana Cristina; Guimarães, Nelciele Cavalieri de Alencar; Moreira, Leonora Rios de Souza; Filho, Edivaldo Ximenes Ferreira; Ricart, Carlos André Ornelas
2018-02-01
Filamentous fungal secretomes comprise highly dynamic sets of proteins, including multiple carbohydrate active enzymes (CAZymes) which are able to hydrolyze plant biomass polysaccharides into products of biotechnological interest such as fermentable sugars. In recent years, proteomics has been used to identify and quantify enzymatic and non-enzymatic polypeptides present in secretomes of several fungi species. The resulting data have widened the scientific understanding of the way filamentous fungi perform biomass degradation and offered novel perspectives for biotechnological applications. The present review discusses proteomics approaches that have been applied to the study of fungal secretomes, focusing on two of the most studied filamentous fungi genera: Trichoderma and Aspergillus. Copyright © 2017 Elsevier Inc. All rights reserved.
Della Valle, Maria Cecilia; Sleat, David E; Sohar, Istvan; Wen, Ting; Pintar, John E; Jadot, Michel; Lobel, Peter
2006-11-17
Most newly synthesized soluble lysosomal proteins are delivered to the lysosome via the mannose 6-phosphate (Man-6-P)-targeting pathway. The presence of the Man-6-P post-translational modification allows these proteins to be affinity-purified on immobilized Man-6-P receptors. This approach has formed the basis for a number of proteomic studies that identified multiple as yet uncharacterized Man-6-P glycoproteins that may represent new lysosomal proteins. Although the presence of Man-6-P is suggestive of lysosomal function, the subcellular localization of such candidates requires experimental verification. Here, we have investigated one such candidate, ependymin-related protein (EPDR). EPDR is a protein of unknown function with some sequence similarity to ependymin, a fish protein thought to play a role in memory consolidation and learning. Using classical subcellular fractionation on rat brain, EPDR co-distributes with lysosomal proteins, but there is significant overlap between lysosomal and mitochondrial markers. For more definitive localization, we have developed a novel approach based upon a selective buoyant density shift of the brain lysosomes in a mutant mouse lacking NPC2, a lysosomal protein involved in lipid transport. EPDR, in parallel with lysosomal markers, shows this density shift in gradient centrifugation experiments comparing mutant and wild type mice. This approach, combined with morphological analyses, demonstrates that EPDR resides in the lysosome. In addition, the lipidosis-induced density shift approach represents a valuable tool for identification and validation of both luminal and membrane lysosomal proteins that should be applicable to high throughput proteomic studies.
Strategies for the enrichment and identification of basic proteins in proteome projects.
Bae, Soo-Han; Harris, Andrew G; Hains, Peter G; Chen, Hong; Garfin, David E; Hazell, Stuart L; Paik, Young-Ki; Walsh, Bradley J; Cordwell, Stuart J
2003-05-01
Two-dimensional gel electrophoresis (2-DE) is currently the method of choice for separating complex mixtures of proteins for visual comparison in proteome analysis. This technology, however, is biased against certain classes of proteins including low abundance and hydrophobic proteins. Proteins with extremely alkaline isoelectric points (pI) are often very poorly represented using 2-DE technology, even when complex mixtures are separated using commercially available pH 6-11 or pH 7-10 immobilized pH gradients. The genome of the human gut pathogen, Helicobacter pylori, is dominated by genes encoding basic proteins, and is therefore a useful model for examining methodology suitable for separating such proteins. H. pylori proteins were separated on pH 6-11 and novel pH 9-12 immobilized pH gradients and 65 protein spots were subjected to matrix-assisted laser desorption/ionization-time of flight mass spectrometry, leading to the identification of 49 unique proteins. No proteins were characterized with a theoretical pI of greater than 10.23. A second approach to examine extremely alkaline proteins (pI > 9.0) utilized a prefractionation isoelectric focusing. Proteins were separated into two fractions using Gradiflow technology, and the extremely basic fraction subjected to both sodium dodecyl sulphate-polyacrylamide gel electrophoresis and liquid chromatography (LC) - tandem mass spectrometry post-tryptic digest, allowing the identification of 17 and 13 proteins, respectively. Gradiflow separations were highly specific for proteins with pI > 9.0, however, a single LC separation only allowed the identification of peptides from highly abundant proteins. These methods and those encompassing multiple LC 'dimensions' may be a useful complement to 2-DE for 'near-to-total' proteome coverage in the alkaline pH range.
Successive bacterial colonisation of pork and its implications for forensic investigations.
Handke, Jessica; Procopio, Noemi; Buckley, Michael; van der Meer, Dieudonne; Williams, Graham; Carr, Martin; Williams, Anna
2017-12-01
Bacteria are considered one of the major driving forces of the mammalian decomposition process and have only recently been recognised as forensic tools. At this point, little is known about their potential use as 'post-mortem clocks'. This study aimed to establish the proof of concept for using bacterial identification as post-mortem interval (PMI) indicators, using a multi-omics approach. Pieces of pork were placed in the University's outdoor facility and surface swabs were taken at regular intervals up to 60 days. Terminal restriction fragment length polymorphism (T-RFLP) of the 16S rDNA was used to identify bacterial taxa. It succeeded in detecting two out of three key contributors involved in decomposition and represents the first study to reveal Vibrionaceae as abundant on decomposing pork. However, a high fraction of present bacterial taxa could not be identified by T-RFLP. Proteomic analyses were also performed at selected time points, and they partially succeeded in the identification of precise strains, subspecies and species of bacteria that colonized the body after different PMIs. T-RFLP is incapable of reliably and fully identifying bacterial taxa, whereas proteomics could help in the identification of specific strains of bacteria. Nevertheless, microbial identification by next generation sequencing might be used as PMI clock in future investigations and in conjunction with information provided by forensic entomologists. To the best of our knowledge, this work represents the first attempt to find a cheaper and easily accessible, culture-independent alternative to high-throughput techniques to establish a 'microbial clock', in combination with proteomic strategies to address this issue. Copyright © 2017 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Su, Dian; Gaffrey, Matthew J.; Guo, Jia
2014-02-11
Protein S-glutathionylation (SSG) is an important regulatory posttranslational modification of protein cysteine (Cys) thiol redox switches, yet the role of specific cysteine residues as targets of modification is poorly understood. We report a novel quantitative mass spectrometry (MS)-based proteomic method for site-specific identification and quantification of S-glutathionylation across different conditions. Briefly, this approach consists of initial blocking of free thiols by alkylation, selective reduction of glutathionylated thiols and enrichment using thiol affinity resins, followed by on-resin tryptic digestion and isobaric labeling with iTRAQ (isobaric tags for relative and absolute quantitation) for MS-based identification and quantification. The overall approach was validatedmore » by application to RAW 264.7 mouse macrophages treated with different doses of diamide to induce glutathionylation. A total of 1071 Cys-sites from 690 proteins were identified in response to diamide treatment, with ~90% of the sites displaying >2-fold increases in SSG-modification compared to controls.. This approach was extended to identify potential SSG modified Cys-sites in response to H2O2, an endogenous oxidant produced by activated macrophages and many pathophysiological stimuli. The results revealed 364 Cys-sites from 265 proteins that were sensitive to S-glutathionylation in response to H2O2 treatment. These proteins covered a range of molecular types and molecular functions with free radical scavenging, and cell death and survival included as the most significantly enriched functional categories. Overall the results demonstrate that our approach is effective for site-specific identification and quantification of S-glutathionylated proteins. The analytical strategy also provides a unique approach to determining the major pathways and cell processes most susceptible to glutathionylation at a proteome-wide scale.« less
Woo, Jongmin; Han, Dohyun; Wang, Joseph Injae; Park, Joonho; Kim, Hyunsoo; Kim, Youngsoo
2017-09-01
The development of systematic proteomic quantification techniques in systems biology research has enabled one to perform an in-depth analysis of cellular systems. We have developed a systematic proteomic approach that encompasses the spectrum from global to targeted analysis on a single platform. We have applied this technique to an activated microglia cell system to examine changes in the intracellular and extracellular proteomes. Microglia become activated when their homeostatic microenvironment is disrupted. There are varying degrees of microglial activation, and we chose to focus on the proinflammatory reactive state that is induced by exposure to such stimuli as lipopolysaccharide (LPS) and interferon-gamma (IFN-γ). Using an improved shotgun proteomics approach, we identified 5497 proteins in the whole-cell proteome and 4938 proteins in the secretome that were associated with the activation of BV2 mouse microglia by LPS or IFN-γ. Of the differentially expressed proteins in stimulated microglia, we classified pathways that were related to immune-inflammatory responses and metabolism. Our label-free parallel reaction monitoring (PRM) approach made it possible to comprehensively measure the hyper-multiplex quantitative value of each protein by high-resolution mass spectrometry. Over 450 peptides that corresponded to pathway proteins and direct or indirect interactors via the STRING database were quantified by label-free PRM in a single run. Moreover, we performed a longitudinal quantification of secreted proteins during microglial activation, in which neurotoxic molecules that mediate neuronal cell loss in the brain are released. These data suggest that latent pathways that are associated with neurodegenerative diseases can be discovered by constructing and analyzing a pathway network model of proteins. Furthermore, this systematic quantification platform has tremendous potential for applications in large-scale targeted analyses. The proteomics data for discovery and label-free PRM analysis have been deposited to the ProteomeXchange Consortium with identifiers
Durighello, Emie; Christie-Oleza, Joseph Alexander; Armengaud, Jean
2014-01-01
Bacteria from the Roseobacter clade are abundant in surface marine ecosystems as over 10% of bacterial cells in the open ocean and 20% in coastal waters belong to this group. In order to document how these marine bacteria interact with their environment, we analyzed the exoproteome of Phaeobacter strain DSM 17395. We grew the strain in marine medium, collected the exoproteome and catalogued its content with high-throughput nanoLC-MS/MS shotgun proteomics. The major component represented 60% of the total protein content but was refractory to either classical proteomic identification or proteogenomics. We de novo sequenced this abundant protein with high-resolution tandem mass spectra which turned out being the 53 kDa RTX-toxin ZP_02147451. It comprised a peptidase M10 serralysin domain. We explained its recalcitrance to trypsin proteolysis and proteomic identification by its unusual low number of basic residues. We found this is a conserved trait in RTX-toxins from Roseobacter strains which probably explains their persistence in the harsh conditions around bacteria. Comprehensive analysis of exoproteomes from environmental bacteria should take into account this proteolytic recalcitrance. PMID:24586966
Durighello, Emie; Christie-Oleza, Joseph Alexander; Armengaud, Jean
2014-01-01
Bacteria from the Roseobacter clade are abundant in surface marine ecosystems as over 10% of bacterial cells in the open ocean and 20% in coastal waters belong to this group. In order to document how these marine bacteria interact with their environment, we analyzed the exoproteome of Phaeobacter strain DSM 17395. We grew the strain in marine medium, collected the exoproteome and catalogued its content with high-throughput nanoLC-MS/MS shotgun proteomics. The major component represented 60% of the total protein content but was refractory to either classical proteomic identification or proteogenomics. We de novo sequenced this abundant protein with high-resolution tandem mass spectra which turned out being the 53 kDa RTX-toxin ZP_02147451. It comprised a peptidase M10 serralysin domain. We explained its recalcitrance to trypsin proteolysis and proteomic identification by its unusual low number of basic residues. We found this is a conserved trait in RTX-toxins from Roseobacter strains which probably explains their persistence in the harsh conditions around bacteria. Comprehensive analysis of exoproteomes from environmental bacteria should take into account this proteolytic recalcitrance.
Combining Search Engines for Comparative Proteomics
Tabb, David
2012-01-01
Many proteomics laboratories have found spectral counting to be an ideal way to recognize biomarkers that differentiate cohorts of samples. This approach assumes that proteins that differ in quantity between samples will generate different numbers of identifiable tandem mass spectra. Increasingly, researchers are employing multiple search engines to maximize the identifications generated from data collections. This talk evaluates four strategies to combine information from multiple search engines in comparative proteomics. The “Count Sum” model pools the spectra across search engines. The “Vote Counting” model combines the judgments from each search engine by protein. Two other models employ parametric and non-parametric analyses of protein-specific p-values from different search engines. We evaluated the four strategies in two different data sets. The ABRF iPRG 2009 study generated five LC-MS/MS analyses of “red” E. coli and five analyses of “yellow” E. coli. NCI CPTAC Study 6 generated five concentrations of Sigma UPS1 spiked into a yeast background. All data were identified with X!Tandem, Sequest, MyriMatch, and TagRecon. For both sample types, “Vote Counting” appeared to manage the diverse identification sets most effectively, yielding heightened discrimination as more search engines were added.
Display technologies: application for the discovery of drug and gene delivery agents
Sergeeva, Anna; Kolonin, Mikhail G.; Molldrem, Jeffrey J.; Pasqualini, Renata; Arap, Wadih
2007-01-01
Recognition of molecular diversity of cell surface proteomes in disease is essential for the development of targeted therapies. Progress in targeted therapeutics requires establishing effective approaches for high-throughput identification of agents specific for clinically relevant cell surface markers. Over the past decade, a number of platform strategies have been developed to screen polypeptide libraries for ligands targeting receptors selectively expressed in the context of various cell surface proteomes. Streamlined procedures for identification of ligand-receptor pairs that could serve as targets in disease diagnosis, profiling, imaging and therapy have relied on the display technologies, in which polypeptides with desired binding profiles can be serially selected, in a process called biopanning, based on their physical linkage with the encoding nucleic acid. These technologies include virus/phage display, cell display, ribosomal display, mRNA display and covalent DNA display (CDT), with phage display being by far the most utilized. The scope of this review is the recent advancements in the display technologies with a particular emphasis on molecular mapping of cell surface proteomes with peptide phage display. Prospective applications of targeted compounds derived from display libraries in the discovery of targeted drugs and gene therapy vectors are discussed. PMID:17123658
Proteomic characterization of hempseed (Cannabis sativa L.).
Aiello, Gilda; Fasoli, Elisa; Boschin, Giovanna; Lammi, Carmen; Zanoni, Chiara; Citterio, Attilio; Arnoldi, Anna
2016-09-16
This paper presents an investigation on hempseed proteome. The experimental approach, based on combinatorial peptide ligand libraries (CPLLs), SDS-PAGE separation, nLC-ESI-MS/MS identification, and database search, permitted identifying in total 181 expressed proteins. This very large number of identifications was achieved by searching in two databases: Cannabis sativa L. (56 gene products identified) and Arabidopsis thaliana (125 gene products identified). By performing a protein-protein association network analysis using the STRING software, it was possible to build the first interactomic map of all detected proteins, characterized by 137 nodes and 410 interactions. Finally, a Gene Ontology analysis of the identified species permitted to classify their molecular functions: the great majority is involved in the seed metabolic processes (41%), responses to stimulus (8%), and biological process (7%). Hempseed is an underexploited non-legume protein-rich seed. Although its protein is well known for its digestibility, essential amino acid composition, and useful techno-functional properties, a comprehensive proteome characterization is still lacking. The objective of this work was to fill this knowledge gap and provide information useful for a better exploitation of this seed in different food products. Copyright © 2016 Elsevier B.V. All rights reserved.
Study of cellular oncometabolism via multidimensional protein identification technology.
Aukim-Hastie, Claire; Garbis, Spiros D
2014-01-01
Cellular proteomics is becoming a widespread clinical application, matching the definition of bench-to-bedside translation. Among various fields of investigation, this approach can be applied to the study of the metabolic alterations that accompany oncogenesis and tumor progression, which are globally referred to as oncometabolism. Here, we describe a multidimensional protein identification technology (MuDPIT)-based strategy that can be employed to study the cellular proteome of malignant cells and tissues. This method has previously been shown to be compatible with the reproducible, in-depth analysis of up to a thousand proteins in clinical samples. The possibility to employ this technique to study clinical specimens demonstrates its robustness. MuDPIT is advantageous as compared to other approaches because it is direct, highly sensitive, and reproducible, it provides high resolution with ultra-high mass accuracy, it allows for relative quantifications, and it is compatible with multiplexing (thus limiting costs).This method enables the direct assessment of the proteomic profile of neoplastic cells and tissues and could be employed in the near future as a high-throughput, rapid, quantitative, and cost-effective screening platform for clinical samples. © 2014 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sharma, Ritin; Dill, Brian; Chourey, Karuna
2012-01-01
The expanding use of surfactants for proteome sample preparations has prompted the need to systematically optimize the application and removal of these MS-deleterious agents prior to proteome measurements. Here we compare four different detergent clean-up methods (Trichloroacetic acid (TCA) precipitation, Chloroform/Methanol/Water (CMW) extraction, commercial detergent removal spin column method (DRS) and filter-aided sample preparation(FASP)) with respect to varying amounts of protein biomass in the samples, and provide efficiency benchmarks with respect to protein, peptide, and spectral identifications for each method. Our results show that for protein limited samples, FASP outperforms the other three clean-up methods, while at high protein amountmore » all the methods are comparable. This information was used in a dual strategy of comparing molecular weight based fractionated and unfractionated lysates from three increasingly complex samples (Escherichia coli, a five microbial isolate mixture, and a natural microbial community groundwater sample), which were all lysed with SDS and cleaned up using FASP. The two approaches complemented each other by enhancing the number of protein identifications by 8%-25% across the three samples and provided broad pathway coverage.« less
Proteomic Analysis of Pachytene Spermatocytes of Sterile Hybrid Male Mice.
Wang, Lu; Guo, Yueshuai; Liu, Wenjing; Zhao, Weidong; Song, Gendi; Zhou, Tao; Huang, Hefeng; Guo, Xuejiang; Sun, Fei
2016-09-01
Incompatibilities in interspecific hybrids, such as reduced hybrid fertility and lethality, are common features resulting from reproductive isolation that lead to speciation. Subspecies crosses of house mice produce offspring in which one sex is infertile or absent, yet the molecular mechanisms of hybrid sterility are poorly understood. In this study, we observed extensive asynapsis of chromosomes and disturbance of the sex body in pachytene spermatocytes of sterile F1 males (PWK/Ph female × C57BL/6J male). We report the high-confidence identification of 4005 proteins in the pachytene spermatocytes of fertile F1 males (PWK/Ph male × C57BL/6J female) and sterile F1 males (PWK/Ph female × C57BL/6J male), of which 215 were upregulated and 381 were downregulated. Bioinformatics analysis of the proteome led to the identification of 43 and 59 proteins known to be essential for male meiosis and spermatogenesis in mice, respectively. Characterization of the proteome of pachytene spermatocytes associated with hybrid male sterility provides an inventory of proteins that is useful for understanding meiosis and the mechanisms of hybrid male infertility. © 2016 by the Society for the Study of Reproduction, Inc.
Comprehensive proteomic analysis of Penicillium verrucosum.
Nöbauer, Katharina; Hummel, Karin; Mayrhofer, Corina; Ahrens, Maike; Setyabudi, Francis M C; Schmidt-Heydt, Markus; Eisenacher, Martin; Razzazi-Fazeli, Ebrahim
2017-05-01
Mass spectrometric identification of proteins in species lacking validated sequence information is a major problem in veterinary science. In the present study, we used ochratoxin A producing Penicillium verrucosum to identify and quantitatively analyze proteins of an organism with yet no protein information available. The work presented here aimed to provide a comprehensive protein identification of P. verrucosum using shotgun proteomics. We were able to identify 3631 proteins in an "ab initio" translated database from DNA sequences of P. verrucosum. Additionally, a sequential window acquisition of all theoretical fragment-ion spectra analysis was done to find differentially regulated proteins at two different time points of the growth curve. We compared the proteins at the beginning (day 3) and at the end of the log phase (day 12). © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Visualization of LC-MS/MS proteomics data in MaxQuant.
Tyanova, Stefka; Temu, Tikira; Carlson, Arthur; Sinitcyn, Pavel; Mann, Matthias; Cox, Juergen
2015-04-01
Modern software platforms enable the analysis of shotgun proteomics data in an automated fashion resulting in high quality identification and quantification results. Additional understanding of the underlying data can be gained with the help of advanced visualization tools that allow for easy navigation through large LC-MS/MS datasets potentially consisting of terabytes of raw data. The updated MaxQuant version has a map navigation component that steers the users through mass and retention time-dependent mass spectrometric signals. It can be used to monitor a peptide feature used in label-free quantification over many LC-MS runs and visualize it with advanced 3D graphic models. An expert annotation system aids the interpretation of the MS/MS spectra used for the identification of these peptide features. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Enrichment of low-molecular-weight proteins from biofluids for biomarker discovery.
Chertov, Oleg; Simpson, John T; Biragyn, Arya; Conrads, Thomas P; Veenstra, Timothy D; Fisher, Robert J
2005-01-01
The dramatic progress in mass spectrometry-based methods of protein identification has triggered a new quest for disease-associated biomarkers. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and its variant surface-enhanced laser desorption/ionization mass spectrometry, provide effective means to explore the less studied information slice of the human serum proteome -- low-molecular-weight proteins and peptides. These low-molecular-weight proteins and peptides are promising for the detection of important biomarkers. Due to the significant experimental problems imposed by high-abundance and high-molecular-weight proteins, it is important to effectively remove these species prior to mass spectrometry analysis of the low-molecular-weight serum and plasma proteomes. In this review, the advantages afforded by recently introduced methods for prefractionation of serum, as they pertain to the detection and identification of biomarkers, will be discussed.
MALDI FTICR IMS of Intact Proteins: Using Mass Accuracy to Link Protein Images with Proteomics Data
NASA Astrophysics Data System (ADS)
Spraggins, Jeffrey M.; Rizzo, David G.; Moore, Jessica L.; Rose, Kristie L.; Hammer, Neal D.; Skaar, Eric P.; Caprioli, Richard M.
2015-06-01
MALDI imaging mass spectrometry is a highly sensitive and selective tool used to visualize biomolecules in tissue. However, identification of detected proteins remains a difficult task. Indirect identification strategies have been limited by insufficient mass accuracy to confidently link ion images to proteomics data. Here, we demonstrate the capabilities of MALDI FTICR MS for imaging intact proteins. MALDI FTICR IMS provides an unprecedented combination of mass resolving power (~75,000 at m/z 5000) and accuracy (<5ppm) for proteins up to ~12kDa, enabling identification based on correlation with LC-MS/MS proteomics data. Analysis of rat brain tissue was performed as a proof-of-concept highlighting the capabilities of this approach by imaging and identifying a number of proteins including N-terminally acetylated thymosin β4 ( m/z 4,963.502, 0.6ppm) and ATP synthase subunit ɛ ( m/z 5,636.074, -2.3ppm). MALDI FTICR IMS was also used to differentiate a series of oxidation products of S100A8 ( m/z 10,164.03, -2.1ppm), a subunit of the heterodimer calprotectin, in kidney tissue from mice infected with Staphylococcus aureus. S100A8 - M37O/C42O3 ( m/z 10228.00, -2.6ppm) was found to co-localize with bacterial microcolonies at the center of infectious foci. The ability of MALDI FTICR IMS to distinguish S100A8 modifications is critical to understanding calprotectin's roll in nutritional immunity.
Shen, Shichen; Sheng, Quanhu; Shyr, Yu; Qu, Jun
2016-01-01
The recently-introduced Orbitrap Fusion mass spectrometry permits various types of MS2 acquisition methods. To date, these different MS2 strategies and the optimal data interpretation approach for each have not been adequately evaluated. This study comprehensively investigated the four MS2 strategies: HCD-OT (higher-energy-collisional-dissociation with Orbitrap detection), HCD-IT (HCD with ion trap, IT), CID-IT (collision-induced-dissociation with IT) and CID-OT on Orbitrap Fusion. To achieve extensive comparison and identify the optimal data interpretation method for each technique, several search engines (SEQUEST and Mascot) and post-processing methods (score-based, PeptideProphet, and Percolator) were assessed for all techniques for the analysis of a human cell proteome. It was found that divergent conclusions could be made from the same dataset when different data interpretation approaches were used and therefore requiring a relatively fair comparison among techniques. Percolator was chosen for comparison of techniques because it performs the best among all search engines and MS2 strategies. For the analysis of human cell proteome using individual MS2 strategies, the highest number of identifications was achieved by HCD-OT, followed by HCD-IT and CID-IT. Based on these results, we concluded that a relatively fair platform for data interpretation is necessary to avoid divergent conclusions from the same dataset, and HCD-OT and HCD-IT may be preferable for protein/peptide identification using Orbitrap Fusion. PMID:27472422
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger.
Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J
2009-02-04
Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method.
Lee, Hyoung-Joo; Jeong, Seul-Ki; Na, Keun; Lee, Min Jung; Lee, Sun Hee; Lim, Jong-Sun; Cha, Hyun-Jeong; Cho, Jin-Young; Kwon, Ja-Young; Kim, Hoguen; Song, Si Young; Yoo, Jong Shin; Park, Young Mok; Kim, Hail; Hancock, William S; Paik, Young-Ki
2013-06-07
As a starting point of the Chromosome-Centric Human Proteome Project (C-HPP), we established strategies of genome-wide proteomic analysis, including protein identification, quantitation of disease-specific proteins, and assessment of post-translational modifications, using paired human placental tissues from healthy and preeclampsia patients. This analysis resulted in identification of 4239 unique proteins with high confidence (two or more unique peptides with a false discovery rate less than 1%), covering 21% of approximately 20, 059 (Ensembl v69, Oct 2012) human proteins, among which 28 proteins exhibited differentially expressed preeclampsia-specific proteins. When these proteins are assigned to all human chromosomes, the pattern of the newly identified placental protein population is proportional to that of the gene count distribution of each chromosome. We also identified 219 unique N-linked glycopeptides, 592 unique phosphopeptides, and 66 chromosome 13-specific proteins. In particular, protein evidence of 14 genes previously known to be specifically up-regulated in human placenta was verified by mass spectrometry. With respect to the functional implication of these proteins, 38 proteins were found to be involved in regulatory factor biosynthesis or the immune system in the placenta, but the molecular mechanism of these proteins during pregnancy warrants further investigation. As far as we know, this work produced the highest number of proteins identified in the placenta and will be useful for annotating and mapping all proteins encoded in the human genome.
Comparative proteomic assessment of matrisome enrichment methodologies
Krasny, Lukas; Paul, Angela; Wai, Patty; Howard, Beatrice A.; Natrajan, Rachael C.; Huang, Paul H.
2016-01-01
The matrisome is a complex and heterogeneous collection of extracellular matrix (ECM) and ECM-associated proteins that play important roles in tissue development and homeostasis. While several strategies for matrisome enrichment have been developed, it is currently unknown how the performance of these different methodologies compares in the proteomic identification of matrisome components across multiple tissue types. In the present study, we perform a comparative proteomic assessment of two widely used decellularisation protocols and two extraction methods to characterise the matrisome in four murine organs (heart, mammary gland, lung and liver). We undertook a systematic evaluation of the performance of the individual methods on protein yield, matrisome enrichment capability and the ability to isolate core matrisome and matrisome-associated components. Our data find that sodium dodecyl sulphate (SDS) decellularisation leads to the highest matrisome enrichment efficiency, while the extraction protocol that comprises chemical and trypsin digestion of the ECM fraction consistently identifies the highest number of matrisomal proteins across all types of tissue examined. Matrisome enrichment had a clear benefit over non-enriched tissue for the comprehensive identification of matrisomal components in murine liver and heart. Strikingly, we find that all four matrisome enrichment methods led to significant losses in the soluble matrisome-associated proteins across all organs. Our findings highlight the multiple factors (including tissue type, matrisome class of interest and desired enrichment purity) that influence the choice of enrichment methodology, and we anticipate that these data will serve as a useful guide for the design of future proteomic studies of the matrisome. PMID:27589945
Gonzalez-Begne, Mireya; Lu, Bingwen; Liao, Lujian; Xu, Tao; Bedi, Gurrinder; Melvin, James E.; Yates, John R.
2011-01-01
In-depth analysis of the salivary proteome is fundamental to understanding the functions of salivary proteins in the oral cavity and to reveal disease biomarkers involved in different pathophysiological conditions, with the ultimate goal of improving patient diagnosis and prognosis. Submandibular and sublingual glands contribute saliva rich in glycoproteins to the total saliva output, making them valuable sources for glycoproteomic analysis. Lectin-affinity chromatography coupled to mass spectrometry-based shotgun proteomics was used to explore the submandibular/sublingual (SM/SL) saliva glycoproteome. A total of 262 N- and O-linked glycoproteins were identified by multidimensional protein identification technology (MudPIT). Only 38 were previously described in SM and SL salivas from the human salivary N-linked glycoproteome, while 224 were unique. Further comparison analysis with SM/SL saliva of the human saliva proteome, revealed 125 glycoproteins not formerly reported in this secretion. KEGG pathway analyses demonstrated that many of these glycoproteins are involved in processes such as complement and coagulation cascades, cell communication, glycosphingolipid biosynthesis neo-lactoseries, O-glycan biosynthesis, glycan structures-biosynthesis 2, starch and sucrose metabolism, peptidoglycan biosynthesis or others pathways. In summary, lectin-affinity chromatography coupled to MudPIT mass spectrometry identified many novel glycoproteins in SM/SL saliva. These new additions to the salivary proteome may prove to be a critical step for providing reliable biomarkers in the diagnosis of a myriad of oral and systemic diseases. PMID:21936497
A novel algorithm for validating peptide identification from a shotgun proteomics search engine.
Jian, Ling; Niu, Xinnan; Xia, Zhonghang; Samir, Parimal; Sumanasekera, Chiranthani; Mu, Zheng; Jennings, Jennifer L; Hoek, Kristen L; Allos, Tara; Howard, Leigh M; Edwards, Kathryn M; Weil, P Anthony; Link, Andrew J
2013-03-01
Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.
Schlautman, Joshua D; Rozek, Wojciech; Stetler, Robert; Mosley, R Lee; Gendelman, Howard E; Ciborowski, Pawel
2008-01-01
Background The ProteomeLab™ PF 2D platform is a relatively new approach to global protein profiling. Herein, it was used for investigation of plasma proteome changes in amyotrophic lateral sclerosis (ALS) patients before and during immunization with glatiramer acetate (GA) in a clinical trial. Results The experimental design included immunoaffinity depletion of 12 most abundant proteins from plasma samples with the ProteomeLab™ IgY-12 LC10 column kit as first dimension separation, also referred to as immuno-partitioning. Second and third dimension separations of the enriched proteome were performed on the PF 2D platform utilizing 2D isoelectric focusing and RP-HPLC with the resulting fractions collected for analysis. 1D gel electrophoresis was added as a fourth dimension when sufficient protein was available. Protein identification from collected fractions was performed using nano-LC-MS/MS approach. Analysis of differences in the resulting two-dimensional maps of fractions obtained from the PF 2D and the ability to identify proteins from these fractions allowed sensitivity threshold measurements. Masked proteins in the PF 2D fractions are discussed. Conclusion We offer some insight into the strengths and limitations of this emerging proteomic platform. PMID:18789151
The emergence of top-down proteomics in clinical research
2013-01-01
Proteomic technology has advanced steadily since the development of 'soft-ionization' techniques for mass-spectrometry-based molecular identification more than two decades ago. Now, the large-scale analysis of proteins (proteomics) is a mainstay of biological research and clinical translation, with researchers seeking molecular diagnostics, as well as protein-based markers for personalized medicine. Proteomic strategies using the protease trypsin (known as bottom-up proteomics) were the first to be developed and optimized and form the dominant approach at present. However, researchers are now beginning to understand the limitations of bottom-up techniques, namely the inability to characterize and quantify intact protein molecules from a complex mixture of digested peptides. To overcome these limitations, several laboratories are taking a whole-protein-based approach, in which intact protein molecules are the analytical targets for characterization and quantification. We discuss these top-down techniques and how they have been applied to clinical research and are likely to be applied in the near future. Given the recent improvements in mass-spectrometry-based proteomics and stronger cooperation between researchers, clinicians and statisticians, both peptide-based (bottom-up) strategies and whole-protein-based (top-down) strategies are set to complement each other and help researchers and clinicians better understand and detect complex disease phenotypes. PMID:23806018
Urinary proteomics in renal pathophysiology: Impact of proteinuria.
Sancho-Martínez, Sandra M; Prieto-García, Laura; Blanco-Gozalo, Víctor; Fontecha-Barriuso, Miguel; López-Novoa, José M; López-Hernández, Francisco J
2015-06-01
Urinary differential proteomics is used to study renal pathophysiological mechanisms, find novel markers of biological processes and renal diseases, and stratify patients according to proteomic profiles. The proteomic procedure determines the pathophysiological meaning and clinical relevance of results. Urine samples for differential proteomic studies are usually normalized by protein content, regardless of its pathophysiological characteristics. In the field of nephrology, this approach translates into the comparison of a different fraction of the total daily urine output between proteinuric and nonproteinuric samples. Accordingly, alterations in the level of specific proteins found by this method reflect the relative presence of individual proteins in the urine; but they do not necessarily show alterations in their daily excretion, which is a key parameter for the understanding of the pathophysiological meaning of urinary components. For renal pathophysiology studies and clinical biomarker identification or determination, an alternative proteomic concept providing complementary information is based on sample normalization by daily urine output, which directly informs on changes in the daily excretion of individual proteins. This is clinically important because daily excretion (rather than absolute or relative concentration) is the only self-normalized way to evaluate the real meaning of urinary parameters, which is also independent of urine concentration. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Nuez-Ortín, Waldo G; Carter, Chris G; Nichols, Peter D; Wilson, Richard
2016-07-01
Understanding diet- and environmentally induced physiological changes in fish larvae is a major goal for the aquaculture industry. Proteomic analysis of whole fish larvae comprising multiple tissues offers considerable potential but is challenging due to the very large dynamic range of protein abundance. To extend the coverage of the larval phase of the Atlantic salmon (Salmo salar) proteome, we applied a two-step sequential extraction (SE) method, based on differential protein solubility, using a nondenaturing buffer containing 150 mM NaCl followed by a denaturing buffer containing 7 M urea and 2 M thiourea. Extracts prepared using SE and one-step direct extraction were characterized via label-free shotgun proteomics using nanoLC-MS/MS (LTQ-Orbitrap). SE partitioned the proteins into two fractions of approximately equal amounts, but with very distinct protein composition, leading to identification of ∼40% more proteins than direct extraction. This fractionation strategy enabled the most detailed characterization of the salmon larval proteome to date and provides a platform for greater understanding of physiological changes in whole fish larvae. The MS data are available via the ProteomeXchange Consortium PRIDE partner repository, dataset PXD003366. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Tan, Meng-Kwang Marcus; Lim, Hui-Jun; Bennett, Eric J.; Shi, Yang; Harper, J. Wade
2014-01-01
Modular Cullin-RING E3 ubiquitin ligases (CRLs) use substrate binding adaptor proteins to specify target ubiquitylation. Many of the ~200 human CRL adaptor proteins remain poorly studied due to a shortage of efficient methods to identify biologically relevant substrates. Here, we report the development of Parallel Adaptor Capture (PAC) proteomics, and its use to systematically identify candidate targets for the leucine-rich repeat family of F-box proteins (FBXLs) that function with SKP1-CUL1-F-box protein (SCF) E3s. In validation experiments, we identify the unstudied F-box protein FBXL17 as a regulator of the NFR2 oxidative stress pathway. We demonstrate that FBXL17 controls the transcription of the NRF2 target HMOX1 via turnover of the transcriptional repressor BACH1 in the absence or presence of extrinsic oxidative stress. This work identifies a role for SCFFBXL17 in controlling the threshold for NRF2-dependent gene activation and provides a framework for elucidating the functions of CRL adaptor proteins. PMID:24035498
Tan, Meng-Kwang Marcus; Lim, Hui-Jun; Bennett, Eric J; Shi, Yang; Harper, J Wade
2013-10-10
Modular cullin-RING E3 ubiquitin ligases (CRLs) use substrate binding adaptor proteins to specify target ubiquitylation. Many of the ~200 human CRL adaptor proteins remain poorly studied due to a shortage of efficient methods to identify biologically relevant substrates. Here, we report the development of parallel adaptor capture (PAC) proteomics and its use to systematically identify candidate targets for the leucine-rich repeat family of F-box proteins (FBXLs) that function with SKP1-CUL1-F-box protein (SCF) E3s. In validation experiments, we identify the unstudied F-box protein FBXL17 as a regulator of the NFR2 oxidative stress pathway. We demonstrate that FBXL17 controls the transcription of the NRF2 target HMOX1 via turnover of the transcriptional repressor BACH1 in the absence or presence of extrinsic oxidative stress. This work identifies a role for SCF(FBXL17) in controlling the threshold for NRF2-dependent gene activation and provides a framework for elucidating the functions of CRL adaptor proteins. Copyright © 2013 Elsevier Inc. All rights reserved.
Kockmann, Tobias; Trachsel, Christian; Panse, Christian; Wahlander, Asa; Selevsek, Nathalie; Grossmann, Jonas; Wolski, Witold E; Schlapbach, Ralph
2016-08-01
Quantitative mass spectrometry is a rapidly evolving methodology applied in a large number of omics-type research projects. During the past years, new designs of mass spectrometers have been developed and launched as commercial systems while in parallel new data acquisition schemes and data analysis paradigms have been introduced. Core facilities provide access to such technologies, but also actively support the researchers in finding and applying the best-suited analytical approach. In order to implement a solid fundament for this decision making process, core facilities need to constantly compare and benchmark the various approaches. In this article we compare the quantitative accuracy and precision of current state of the art targeted proteomics approaches single reaction monitoring (SRM), parallel reaction monitoring (PRM) and data independent acquisition (DIA) across multiple liquid chromatography mass spectrometry (LC-MS) platforms, using a readily available commercial standard sample. All workflows are able to reproducibly generate accurate quantitative data. However, SRM and PRM workflows show higher accuracy and precision compared to DIA approaches, especially when analyzing low concentrated analytes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Ecological adaptation of diverse honey bee (Apis mellifera) populations.
Parker, Robert; Melathopoulos, Andony P; White, Rick; Pernal, Stephen F; Guarna, M Marta; Foster, Leonard J
2010-06-15
Honey bees are complex eusocial insects that provide a critical contribution to human agricultural food production. Their natural migration has selected for traits that increase fitness within geographical areas, but in parallel their domestication has selected for traits that enhance productivity and survival under local conditions. Elucidating the biochemical mechanisms of these local adaptive processes is a key goal of evolutionary biology. Proteomics provides tools unique among the major 'omics disciplines for identifying the mechanisms employed by an organism in adapting to environmental challenges. Through proteome profiling of adult honey bee midgut from geographically dispersed, domesticated populations combined with multiple parallel statistical treatments, the data presented here suggest some of the major cellular processes involved in adapting to different climates. These findings provide insight into the molecular underpinnings that may confer an advantage to honey bee populations. Significantly, the major energy-producing pathways of the mitochondria, the organelle most closely involved in heat production, were consistently higher in bees that had adapted to colder climates. In opposition, up-regulation of protein metabolism capacity, from biosynthesis to degradation, had been selected for in bees from warmer climates. Overall, our results present a proteomic interpretation of expression polymorphisms between honey bee ecotypes and provide insight into molecular aspects of local adaptation or selection with consequences for honey bee management and breeding. The implications of our findings extend beyond apiculture as they underscore the need to consider the interdependence of animal populations and their agro-ecological context.
PAnalyzer: a software tool for protein inference in shotgun proteomics.
Prieto, Gorka; Aloria, Kerman; Osinalde, Nerea; Fullaondo, Asier; Arizmendi, Jesus M; Matthiesen, Rune
2012-11-05
Protein inference from peptide identifications in shotgun proteomics must deal with ambiguities that arise due to the presence of peptides shared between different proteins, which is common in higher eukaryotes. Recently data independent acquisition (DIA) approaches have emerged as an alternative to the traditional data dependent acquisition (DDA) in shotgun proteomics experiments. MSE is the term used to name one of the DIA approaches used in QTOF instruments. MSE data require specialized software to process acquired spectra and to perform peptide and protein identifications. However the software available at the moment does not group the identified proteins in a transparent way by taking into account peptide evidence categories. Furthermore the inspection, comparison and report of the obtained results require tedious manual intervention. Here we report a software tool to address these limitations for MSE data. In this paper we present PAnalyzer, a software tool focused on the protein inference process of shotgun proteomics. Our approach considers all the identified proteins and groups them when necessary indicating their confidence using different evidence categories. PAnalyzer can read protein identification files in the XML output format of the ProteinLynx Global Server (PLGS) software provided by Waters Corporation for their MSE data, and also in the mzIdentML format recently standardized by HUPO-PSI. Multiple files can also be read simultaneously and are considered as technical replicates. Results are saved to CSV, HTML and mzIdentML (in the case of a single mzIdentML input file) files. An MSE analysis of a real sample is presented to compare the results of PAnalyzer and ProteinLynx Global Server. We present a software tool to deal with the ambiguities that arise in the protein inference process. Key contributions are support for MSE data analysis by ProteinLynx Global Server and technical replicates integration. PAnalyzer is an easy to use multiplatform and free software tool.
PAnalyzer: A software tool for protein inference in shotgun proteomics
2012-01-01
Background Protein inference from peptide identifications in shotgun proteomics must deal with ambiguities that arise due to the presence of peptides shared between different proteins, which is common in higher eukaryotes. Recently data independent acquisition (DIA) approaches have emerged as an alternative to the traditional data dependent acquisition (DDA) in shotgun proteomics experiments. MSE is the term used to name one of the DIA approaches used in QTOF instruments. MSE data require specialized software to process acquired spectra and to perform peptide and protein identifications. However the software available at the moment does not group the identified proteins in a transparent way by taking into account peptide evidence categories. Furthermore the inspection, comparison and report of the obtained results require tedious manual intervention. Here we report a software tool to address these limitations for MSE data. Results In this paper we present PAnalyzer, a software tool focused on the protein inference process of shotgun proteomics. Our approach considers all the identified proteins and groups them when necessary indicating their confidence using different evidence categories. PAnalyzer can read protein identification files in the XML output format of the ProteinLynx Global Server (PLGS) software provided by Waters Corporation for their MSE data, and also in the mzIdentML format recently standardized by HUPO-PSI. Multiple files can also be read simultaneously and are considered as technical replicates. Results are saved to CSV, HTML and mzIdentML (in the case of a single mzIdentML input file) files. An MSE analysis of a real sample is presented to compare the results of PAnalyzer and ProteinLynx Global Server. Conclusions We present a software tool to deal with the ambiguities that arise in the protein inference process. Key contributions are support for MSE data analysis by ProteinLynx Global Server and technical replicates integration. PAnalyzer is an easy to use multiplatform and free software tool. PMID:23126499
Examining Troughs in the Mass Distribution of All Theoretically Possible Tryptic Peptides
Nefedov, Alexey V.; Mitra, Indranil; Brasier, Allan R.; Sadygov, Rovshan G.
2011-01-01
This work describes the mass distribution of all theoretically possibly tryptic peptides made of 20 amino acids, up to the mass of 3 kDa, with resolution of 0.001 Da. We characterize regions between the peaks of the distribution, including gaps (forbidden zones) and low-populated areas (quiet zones). We show how the gaps shrink over the mass range, and when they completely disappear. We demonstrate that peptide compositions in quiet zones are less diverse than those in the peaks of the distribution, and that by eliminating certain types of unrealistic compositions the gaps in the distribution may be increased. The mass distribution is generated using a parallel implementation of a recursive procedure that enumerates all amino acid compositions. It allows us to enumerate all compositions of tryptic peptides below 3 kDa in 48 minutes using a computer cluster with 12 Intel Xeon X5650 CPUs (72 cores). The results of this work can be used to facilitate protein identification and mass defect labeling in mass spectrometry-based proteomics experiments. PMID:21780838
Zhang, Zhenbin; Dovichi, Norman J
2018-02-25
The effects of MS1 injection time, MS2 injection time, dynamic exclusion time, intensity threshold, and isolation width were investigated on the numbers of peptide and protein identifications for single-shot bottom-up proteomics analysis using CZE-MS/MS analysis of a Xenopus laevis tryptic digest. An electrokinetically pumped nanospray interface was used to couple a linear-polyacrylamide coated capillary to a Q Exactive HF mass spectrometer. A sensitive method that used a 1.4 Th isolation width, 60,000 MS2 resolution, 110 ms MS2 injection time, and a top 7 fragmentation produced the largest number of identifications when the CZE loading amount was less than 100 ng. A programmable autogain control method (pAGC) that used a 1.4 Th isolation width, 15,000 MS2 resolution, 110 ms MS2 injection time, and top 10 fragmentation produced the largest number of identifications for CZE loading amounts greater than 100 ng; 7218 unique peptides and 1653 protein groups were identified from 200 ng by using the pAGC method. The effect of mass spectrometer conditions on the performance of UPLC-MS/MS was also investigated. A fast method that used a 1.4 Th isolation width, 30,000 MS2 resolution, 45 ms MS2 injection time, and top 12 fragmentation produced the largest number of identifications for 200 ng UPLC loading amount (6025 unique peptides and 1501 protein groups). This is the first report where the identification number for CZE surpasses that of the UPLC at the 200 ng loading level. However, more peptides (11476) and protein groups (2378) were identified by using UPLC-MS/MS when the sample loading amount was increased to 2 μg with the fast method. To exploit the fast scan speed of the Q-Exactive HF mass spectrometer, higher sample loading amounts are required for single-shot bottom-up proteomics analysis using CZE-MS/MS. Copyright © 2017 Elsevier B.V. All rights reserved.
DeNovoGUI: An Open Source Graphical User Interface for de Novo Sequencing of Tandem Mass Spectra
2013-01-01
De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com. PMID:24295440
DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra.
Muth, Thilo; Weilnböck, Lisa; Rapp, Erdmann; Huber, Christian G; Martens, Lennart; Vaudel, Marc; Barsnes, Harald
2014-02-07
De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com .
Silva, Wanderson M; Carvalho, Rodrigo D; Soares, Siomar C; Bastos, Isabela Fs; Folador, Edson L; Souza, Gustavo Hmf; Le Loir, Yves; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco
2014-12-04
Corynebacterium pseudotuberculosis biovar ovis is a facultative intracellular pathogen, and the etiological agent of caseous lymphadenitis in small ruminants. During the infection process, the bacterium is subjected to several stress conditions, including nitrosative stress, which is caused by nitric oxide (NO). In silico analysis of the genome of C. pseudotuberculosis ovis 1002 predicted several genes that could influence the resistance of this pathogen to nitrosative stress. Here, we applied high-throughput proteomics using high definition mass spectrometry to characterize the functional genome of C. pseudotuberculosis ovis 1002 in the presence of NO-donor Diethylenetriamine/nitric oxide adduct (DETA/NO), with the aim of identifying proteins involved in nitrosative stress resistance. We characterized 835 proteins, representing approximately 41% of the predicted proteome of C. pseudotuberculosis ovis 1002, following exposure to nitrosative stress. In total, 102 proteins were exclusive to the proteome of DETA/NO-induced cells, and a further 58 proteins were differentially regulated between the DETA/NO and control conditions. An interactomic analysis of the differential proteome of C. pseudotuberculosis in response to nitrosative stress was also performed. Our proteomic data set suggested the activation of both a general stress response and a specific nitrosative stress response, as well as changes in proteins involved in cellular metabolism, detoxification, transcriptional regulation, and DNA synthesis and repair. Our proteomic analysis validated previously-determined in silico data for C. pseudotuberculosis ovis 1002. In addition, proteomic screening performed in the presence of NO enabled the identification of a set of factors that can influence the resistance and survival of C. pseudotuberculosis during exposure to nitrosative stress.
Proteomics of drug resistance in Candida glabrata biofilms.
Seneviratne, C Jayampath; Wang, Yu; Jin, Lijian; Abiko, Y; Samaranayake, Lakshman P
2010-04-01
Candida glabrata is a fungal pathogen that causes a variety of mucosal and systemic infections among compromised patient populations with higher mortality rates. Previous studies have shown that biofilm mode of the growth of the fungus is highly resistant to antifungal agents compared with the free-floating or planktonic mode of growth. Therefore, in the present study, we used 2-D DIGE to evaluate the differential proteomic profiles of C. glabrata under planktonic and biofilm modes of growth. Candida glabrata biofilms were developed on polystyrene surfaces and age-matched planktonic cultures were obtained in parallel. Initially, biofilm architecture, viability, and antifungal susceptibility were evaluated. Differentially expressed proteins more than 1.5-fold in DIGE analysis were subjected to MS/MS. The transcriptomic regulation of these biomarkers was evaluated by quantitative real-time PCR. Candida glabrata biofilms were highly resistant to the antifungals and biocides compared with the planktonic mode of growth. Candida glabrata biofilm proteome when compared with its planktonic proteome showed upregulation of stress response proteins, while glycolysis enzymes were downregulated. Similar trend could be observed at transcriptomic level. In conclusion, C. glabrata biofilms possess higher amount of stress response proteins, which may potentially contribute to the higher antifungal resistance seen in C. glabrata biofilms.
Eckhard, Ulrich; Huesgen, Pitter F; Schilling, Oliver; Bellac, Caroline L; Butler, Georgina S; Cox, Jennifer H; Dufour, Antoine; Goebeler, Verena; Kappelhoff, Reinhild; Auf dem Keller, Ulrich; Klein, Theo; Lange, Philipp F; Marino, Giada; Morrison, Charlotte J; Prudova, Anna; Rodriguez, David; Starr, Amanda E; Wang, Yili; Overall, Christopher M
2016-06-01
The data described provide a comprehensive resource for the family-wide active site specificity portrayal of the human matrix metalloproteinase family. We used the high-throughput proteomic technique PICS (Proteomic Identification of protease Cleavage Sites) to comprehensively assay 9 different MMPs. We identified more than 4300 peptide cleavage sites, spanning both the prime and non-prime sides of the scissile peptide bond allowing detailed subsite cooperativity analysis. The proteomic cleavage data were expanded by kinetic analysis using a set of 6 quenched-fluorescent peptide substrates designed using these results. These datasets represent one of the largest specificity profiling efforts with subsequent structural follow up for any protease family and put the spotlight on the specificity similarities and differences of the MMP family. A detailed analysis of this data may be found in Eckhard et al. (2015) [1]. The raw mass spectrometry data and the corresponding metadata have been deposited in PRIDE/ProteomeXchange with the accession number PXD002265.
State-of-the-art nanoplatform-integrated MALDI-MS impacting resolutions in urinary proteomics.
Gopal, Judy; Muthu, Manikandan; Chun, Se-Chul; Wu, Hui-Fen
2015-06-01
Urine proteomics has become a subject of interest, since it has led to a number of breakthroughs in disease diagnostics. Urine contains information not only from the kidney and the urinary tract but also from other organs, thus urinary proteome analysis allows for identification of biomarkers for both urogenital and systemic diseases. The following review gives a brief overview of the analytical techniques that have been in practice for urinary proteomics. MALDI-MS technique and its current application status in this area of clinical research have been discussed. The review comments on the challenges facing the conventional MALDI-MS technique and the upgradation of this technique with the introduction of nanotechnology. This review projects nano-based techniques such as nano-MALDI-MS, surface-assisted laser desorption/ionization, and nanostructure-initiator MS as the platforms that have the potential in trafficking MALDI-MS from the lab to the bedside. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Elzek, Mohamed A.; Rodland, Karin D.
In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicability of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification ofmore » aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. In conclusion, we propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs.« less
Mass spectrometry-based proteomics: basic principles and emerging technologies and directions.
Van Riper, Susan K; de Jong, Ebbing P; Carlis, John V; Griffin, Timothy J
2013-01-01
As the main catalytic and structural molecules within living systems, proteins are the most likely biomolecules to be affected by radiation exposure. Proteomics, the comprehensive characterization of proteins within complex biological samples, is therefore a research approach ideally suited to assess the effects of radiation exposure on cells and tissues. For comprehensive characterization of proteomes, an analytical platform capable of quantifying protein abundance, identifying post-translation modifications and revealing members of protein complexes on a system-wide level is necessary. Mass spectrometry (MS), coupled with technologies for sample fractionation and automated data analysis, provides such a versatile and powerful platform. In this chapter we offer a view on the current state of MS-proteomics, and focus on emerging technologies within three areas: (1) New instrumental methods; (2) New computational methods for peptide identification; and (3) Label-free quantification. These emerging technologies should be valuable for researchers seeking to better understand biological effects of radiation on living systems.
Interaction Analysis through Proteomic Phage Display
2014-01-01
Phage display is a powerful technique for profiling specificities of peptide binding domains. The method is suited for the identification of high-affinity ligands with inhibitor potential when using highly diverse combinatorial peptide phage libraries. Such experiments further provide consensus motifs for genome-wide scanning of ligands of potential biological relevance. A complementary but considerably less explored approach is to display expression products of genomic DNA, cDNA, open reading frames (ORFs), or oligonucleotide libraries designed to encode defined regions of a target proteome on phage particles. One of the main applications of such proteomic libraries has been the elucidation of antibody epitopes. This review is focused on the use of proteomic phage display to uncover protein-protein interactions of potential relevance for cellular function. The method is particularly suited for the discovery of interactions between peptide binding domains and their targets. We discuss the largely unexplored potential of this method in the discovery of domain-motif interactions of potential biological relevance. PMID:25295249
Kling, Peter; Förlin, Lars
2009-10-01
Proteomic effect screening in zebrafish liver cells was performed to generate hypotheses regarding single and mixed exposure to the BFRs HBCD and TBBPA. Responses at sublethal exposure were analysed by two-dimensional gel electrophoresis followed by MALDI-TOF and FT-ICR protein identification. Mixing of HBCD and TBBPA at sublethal doses of individual substances seemed to increase toxicity. Proteomic analyses revealed distinct exposure-specific and overlapping responses suggesting novel mechanisms with regard to HBCD and TBBPA exposure. While distinct HBCD responses were related to decreased protein metabolism, TBBPA revealed effects related to protein folding and NADPH production. Overlapping responses suggest increased gluconeogenesis (GAPDH and aldolase) while distinct mixture effects suggest a pronounced NADPH production and changes in proteins related to cell cycle control (prohibitin and crk-like oncogene). We conclude that mixtures containing HBCD and TBBPA may result in unexpected effects highlighting proteomics as a sensitive tool for detecting and hypothesis generation of mixture effects.
Nakamura, Tatsuji; Kuromitsu, Junro; Oda, Yoshiya
2008-03-01
Two-dimensional liquid-chromatographic (LC) separation followed by mass spectrometric (MS) analysis was examined for the identification of peptides in complex mixtures as an alternative to widely used two-dimensional gel electrophoresis followed by MS analysis for use in proteomics. The present method involves the off-line coupling of a narrow-bore, polymer-based, reversed-phase column using an acetonitrile gradient in an alkaline mobile phase in the first dimension with octadecylsilanized silica (ODS)-based nano-LC/MS in the second dimension. After the first separation, successive fractions were acidified and dried off-line, then loaded on the second dimension column. Both columns separate peptides according to hydrophobicity under different pH conditions, but more peptides were identified than with the conventional technique for shotgun proteomics, that is, the combination of a strong cation exchange column with an ODS column, and the system was robust because no salts were included in the mobile phases. The suitability of the method for proteomics measurements was evaluated.
NASA Astrophysics Data System (ADS)
Li, Huilin; Nguyen, Hong Hanh; Ogorzalek Loo, Rachel R.; Campuzano, Iain D. G.; Loo, Joseph A.
2018-02-01
Mass spectrometry (MS) has become a crucial technique for the analysis of protein complexes. Native MS has traditionally examined protein subunit arrangements, while proteomics MS has focused on sequence identification. These two techniques are usually performed separately without taking advantage of the synergies between them. Here we describe the development of an integrated native MS and top-down proteomics method using Fourier-transform ion cyclotron resonance (FTICR) to analyse macromolecular protein complexes in a single experiment. We address previous concerns of employing FTICR MS to measure large macromolecular complexes by demonstrating the detection of complexes up to 1.8 MDa, and we demonstrate the efficacy of this technique for direct acquirement of sequence to higher-order structural information with several large complexes. We then summarize the unique functionalities of different activation/dissociation techniques. The platform expands the ability of MS to integrate proteomics and structural biology to provide insights into protein structure, function and regulation.
Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L.; Dianes, José A.; del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W.; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio
2016-01-01
Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra. PMID:27493588
Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L; Dianes, José A; Del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio
2016-08-01
Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra.
Wu, Si; Brown, Roslyn N.; Payne, Samuel H.; ...
2013-01-01
The periplasm of Gram-negative bacteria is a dynamic and physiologically important subcellular compartment where the constant exposure to potential environmental insults amplifies the need for proper protein folding and modifications. Top-down proteomics analysis of the periplasmic fraction at the intact protein level provides unrestricted characterization and annotation of the periplasmic proteome, including the post-translational modifications (PTMs) on these proteins. Here, we used single-dimension ultra-high pressure liquid chromatography coupled with the Fourier transform mass spectrometry (FTMS) to investigate the intact periplasmic proteome of Novosphingobium aromaticivorans . Our top-down analysis provided the confident identification of 55 proteins in the periplasm and characterizedmore » their PTMs including signal peptide removal, N-terminal methionine excision, acetylation, glutathionylation, pyroglutamate, and disulfide bond formation. This study provides the first experimental evidence for the expression and periplasmic localization of many hypothetical and uncharacterized proteins and the first unrestrictive, large-scale data on PTMs in the bacterial periplasm.« less
Zhu, Ying; Piehowski, Paul D; Zhao, Rui; Chen, Jing; Shen, Yufeng; Moore, Ronald J; Shukla, Anil K; Petyuk, Vladislav A; Campbell-Thompson, Martha; Mathews, Clayton E; Smith, Richard D; Qian, Wei-Jun; Kelly, Ryan T
2018-02-28
Nanoscale or single-cell technologies are critical for biomedical applications. However, current mass spectrometry (MS)-based proteomic approaches require samples comprising a minimum of thousands of cells to provide in-depth profiling. Here, we report the development of a nanoPOTS (nanodroplet processing in one pot for trace samples) platform for small cell population proteomics analysis. NanoPOTS enhances the efficiency and recovery of sample processing by downscaling processing volumes to <200 nL to minimize surface losses. When combined with ultrasensitive liquid chromatography-MS, nanoPOTS allows identification of ~1500 to ~3000 proteins from ~10 to ~140 cells, respectively. By incorporating the Match Between Runs algorithm of MaxQuant, >3000 proteins are consistently identified from as few as 10 cells. Furthermore, we demonstrate quantification of ~2400 proteins from single human pancreatic islet thin sections from type 1 diabetic and control donors, illustrating the application of nanoPOTS for spatially resolved proteome measurements from clinical tissues.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Ying; Piehowski, Paul D.; Zhao, Rui
Nanoscale or single-cell technologies are critical for biomedical applications. However, current mass spectrometry (MS)-based proteomic approaches require samples comprising a minimum of thousands of cells to provide in-depth profiling. Here in this paper, we report the development of a nanoPOTS (nanodroplet processing in one pot for trace samples) platform for small cell population proteomics analysis. NanoPOTS enhances the efficiency and recovery of sample processing by downscaling processing volumes to <200 nL to minimize surface losses. When combined with ultrasensitive liquid chromatography-MS, nanoPOTS allows identification of ~1500 to ~3000 proteins from ~10 to ~140 cells, respectively. By incorporating the Match Betweenmore » Runs algorithm of MaxQuant, >3000 proteins are consistently identified from as few as 10 cells. Furthermore, we demonstrate quantification of ~2400 proteins from single human pancreatic islet thin sections from type 1 diabetic and control donors, illustrating the application of nanoPOTS for spatially resolved proteome measurements from clinical tissues.« less
Zhu, Ying; Piehowski, Paul D.; Zhao, Rui; ...
2018-02-28
Nanoscale or single-cell technologies are critical for biomedical applications. However, current mass spectrometry (MS)-based proteomic approaches require samples comprising a minimum of thousands of cells to provide in-depth profiling. Here in this paper, we report the development of a nanoPOTS (nanodroplet processing in one pot for trace samples) platform for small cell population proteomics analysis. NanoPOTS enhances the efficiency and recovery of sample processing by downscaling processing volumes to <200 nL to minimize surface losses. When combined with ultrasensitive liquid chromatography-MS, nanoPOTS allows identification of ~1500 to ~3000 proteins from ~10 to ~140 cells, respectively. By incorporating the Match Betweenmore » Runs algorithm of MaxQuant, >3000 proteins are consistently identified from as few as 10 cells. Furthermore, we demonstrate quantification of ~2400 proteins from single human pancreatic islet thin sections from type 1 diabetic and control donors, illustrating the application of nanoPOTS for spatially resolved proteome measurements from clinical tissues.« less
Kusebauch, Ulrike; Deutsch, Eric W.; Campbell, David S.; Sun, Zhi; Farrah, Terry; Moritz, Robert L.
2014-01-01
PeptideAtlas, SRMAtlas and PASSEL are web-accessible resources to support discovery and targeted proteomics research. PeptideAtlas is a multi-species compendium of shotgun proteomic data provided by the scientific community, SRMAtlas is a resource of high-quality, complete proteome SRM assays generated in a consistent manner for the targeted identification and quantification of proteins, and PASSEL is a repository that compiles and represents selected reaction monitoring data, all in an easy to use interface. The databases are generated from native mass spectrometry data files that are analyzed in a standardized manner including statistical validation of the results. Each resource offers search functionalities and can be queried by user defined constraints; the query results are provided in tables or are graphically displayed. PeptideAtlas, SRMAtlas and PASSEL are publicly available freely via the website http://www.peptideatlas.org. In this protocol, we describe the use of these resources, we highlight how to submit, search, collate and download data. PMID:24939129
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kolker, Eugene
Our project focused primarily on analysis of different types of data produced by global high-throughput technologies, data integration of gene annotation, and gene and protein expression information, as well as on getting a better functional annotation of Shewanella genes. Specifically, four of our numerous major activities and achievements include the development of: statistical models for identification and expression proteomics, superior to currently available approaches (including our own earlier ones); approaches to improve gene annotations on the whole-organism scale; standards for annotation, transcriptomics and proteomics approaches; and generalized approaches for data integration of gene annotation, gene and protein expression information.
Proteomic Profiling of Skin Fibroblasts as a Model of Schizophrenia.
Wang, Lan; Rahmoune, Hassan; Guest, Paul C
2017-01-01
Since many aspects of schizophrenia are also manifested at the peripheral level in proliferating cell types, this chapter describes the analysis of skin fibroblasts biopsied from living patients. The method focuses on cell culture and sample preparation for characterization of the model. The resulting cell extracts can be analysed by any number of proteomic techniques for identification of biomarker candidates. This approach could help to elucidate the molecular mechanisms associated with the pathophysiology of schizophrenia and provide a useful model for a new target and drug discovery.
Palmblad, Magnus; van der Burgt, Yuri E M; Dalebout, Hans; Derks, Rico J E; Schoenmaker, Bart; Deelder, André M
2009-05-02
Accurate mass determination enhances peptide identification in mass spectrometry based proteomics. We here describe the combination of two previously published open source software tools to improve mass measurement accuracy in Fourier transform ion cyclotron resonance mass spectrometry (FTICRMS). The first program, msalign, aligns one MS/MS dataset with one FTICRMS dataset. The second software, recal2, uses peptides identified from the MS/MS data for automated internal calibration of the FTICR spectra, resulting in sub-ppm mass measurement errors.
Vellaichamy, Adaikkalam; Tran, John C.; Catherman, Adam D.; Lee, Ji Eun; Kellie, John F.; Sweet, Steve M.M.; Zamdborg, Leonid; Thomas, Paul M.; Ahlf, Dorothy R.; Durbin, Kenneth R.; Valaskovic, Gary A.; Kelleher, Neil L.
2010-01-01
Despite the availability of ultra-high resolution mass spectrometers, methods for separation and detection of intact proteins for proteome-scale analyses are still in a developmental phase. Here we report robust protocols for on-line LC-MS to drive high-throughput top-down proteomics in a fashion similar to bottom-up. Comparative work on protein standards showed that a polymeric stationary phase led to superior sensitivity over a silica-based medium in reversed-phase nanocapillary-LC, with detection of proteins >50 kDa routinely accomplished in the linear ion trap of a hybrid Fourier-Transform mass spectrometer. Protein identification was enabled by nozzle-skimmer dissociation (NSD) and detection of fragment ions with <5 ppm mass accuracy for highly-specific database searching using custom software. This overall approach led to identification of proteins up to 80 kDa, with 10-60 proteins identified in single LC-MS runs of samples from yeast and human cell lines pre-fractionated by their molecular weight using a gel-based sieving system. PMID:20073486
Basophile: Accurate Fragment Charge State Prediction Improves Peptide Identification Rates
Wang, Dong; Dasari, Surendra; Chambers, Matthew C.; ...
2013-03-07
In shotgun proteomics, database search algorithms rely on fragmentation models to predict fragment ions that should be observed for a given peptide sequence. The most widely used strategy (Naive model) is oversimplified, cleaving all peptide bonds with equal probability to produce fragments of all charges below that of the precursor ion. More accurate models, based on fragmentation simulation, are too computationally intensive for on-the-fly use in database search algorithms. We have created an ordinal-regression-based model called Basophile that takes fragment size and basic residue distribution into account when determining the charge retention during CID/higher-energy collision induced dissociation (HCD) of chargedmore » peptides. This model improves the accuracy of predictions by reducing the number of unnecessary fragments that are routinely predicted for highly-charged precursors. Basophile increased the identification rates by 26% (on average) over the Naive model, when analyzing triply-charged precursors from ion trap data. Basophile achieves simplicity and speed by solving the prediction problem with an ordinal regression equation, which can be incorporated into any database search software for shotgun proteomic identification.« less
A rapid method for preparation of the cerebrospinal fluid proteome.
Larssen, Eivind; Brede, Cato; Hjelle, Anne Bjørnstad; Øysaed, Kjell Birger; Tjensvoll, Anne Bolette; Omdal, Roald; Ruoff, Peter
2015-01-01
The cerebrospinal fluid (CSF) proteome is of great interest for investigation of diseases and conditions involving the CNS. However, the presence of high-abundance proteins (HAPs) can interfere with the detection of low-abundance proteins, potentially hindering the discovery of new biomarkers. Therefore, an assessment of the CSF subproteome composition requires depletion strategies. Existing methods are time consuming, often involving multistep protocols. Here, we present a rapid, accurate, and reproducible method for preparing the CSF proteome, which allows the identification of a high number of proteins. This method involves acetonitrile (ACN) precipitation for depleting HAPs, followed by immediate trypsination. As an example, we demonstrate that this method allows discrimination between multiple sclerosis patients and healthy subjects. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DelVecchio, Vito G; Wagner, Mary Ann; Eschenbrenner, Michel; Horn, Troy A; Kraycer, Jo Ann; Estock, Frank; Elzer, Phil; Mujer, Cesar V
2002-12-20
The proteomes of selected Brucella spp. have been extensively analyzed by utilizing current proteomic technology involving 2-DE and MALDI-MS. In Brucella melitensis, more than 500 proteins were identified. The rapid and large-scale identification of proteins in this organism was accomplished by using the annotated B. melitensis genome which is now available in the GenBank. Coupled with new and powerful tools for data analysis, differentially expressed proteins were identified and categorized into several classes. A global overview of protein expression patterns emerged, thereby facilitating the simultaneous analysis of different metabolic pathways in B. melitensis. Such a global characterization would not have been possible by using time consuming and traditional biochemical approaches. The era of post-genomic technology offers new and exciting opportunities to understand the complete biology of different Brucella species.
Ultraweak photon emission and proteomics analyses in soybean under abiotic stress.
Komatsu, Setsuko; Kamal, Abu Hena Mostafa; Makino, Takahiro; Hossain, Zahed
2014-07-01
Biophotons are ultraweak photon emissions that are closely related to various biological activities and processes. In mammals, biophoton emissions originate from oxidative bursts in immunocytes during immunological responses. Biophotons emitted from plant organs provide novel information about the physiological state of plant under in vivo condition. In this review, the principles and recent advances in the measurement of biophoton emissions in plants are described. Furthermore, examples of biophoton emission and proteomics in soybean under abiotic stress are reviewed and discussed. Finally, this review suggests that the application of proteomics should provide a better interpretation of plant response to biophoton emission and allow the identification of genes that will allow the screening of crops able to produce maximal yields, even in stressful environments. Copyright © 2014 Elsevier B.V. All rights reserved.
Advances in targeted proteomics and applications to biomedical research
Shi, Tujin; Song, Ehwang; Nie, Song; Rodland, Karin D.; Liu, Tao; Qian, Wei-Jun; Smith, Richard D.
2016-01-01
Targeted proteomics technique has emerged as a powerful protein quantification tool in systems biology, biomedical research, and increasing for clinical applications. The most widely used targeted proteomics approach, selected reaction monitoring (SRM), also known as multiple reaction monitoring (MRM), can be used for quantification of cellular signaling networks and preclinical verification of candidate protein biomarkers. As an extension to our previous review on advances in SRM sensitivity herein we review recent advances in the method and technology for further enhancing SRM sensitivity (from 2012 to present), and highlighting its broad biomedical applications in human bodily fluids, tissue and cell lines. Furthermore, we also review two recently introduced targeted proteomics approaches, parallel reaction monitoring (PRM) and data-independent acquisition (DIA) with targeted data extraction on fast scanning high-resolution accurate-mass (HR/AM) instruments. Such HR/AM targeted quantification with monitoring all target product ions addresses SRM limitations effectively in specificity and multiplexing; whereas when compared to SRM, PRM and DIA are still in the infancy with a limited number of applications. Thus, for HR/AM targeted quantification we focus our discussion on method development, data processing and analysis, and its advantages and limitations in targeted proteomics. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale quantification of hundreds of target proteins are discussed. PMID:27302376
Rajjou, Loïc; Belghazi, Maya; Huguet, Romain; Robin, Caroline; Moreau, Adrien; Job, Claudette; Job, Dominique
2006-07-01
The influence of salicylic acid (SA) on elicitation of defense mechanisms in Arabidopsis (Arabidopsis thaliana) seeds and seedlings was assessed by physiological measurements combined with global expression profiling (proteomics). Parallel experiments were carried out using the NahG transgenic plants expressing the bacterial gene encoding SA hydroxylase, which cannot accumulate the active form of this plant defense elicitor. SA markedly improved germination under salt stress. Proteomic analyses disclosed a specific accumulation of protein spots regulated by SA as inferred by silver-nitrate staining of two-dimensional gels, detection of carbonylated (oxidized) proteins, and neosynthesized proteins with [35S]-methionine. The combined results revealed several processes potentially affected by SA. This molecule enhanced the reinduction of the late maturation program during early stages of germination, thereby allowing the germinating seeds to reinforce their capacity to mount adaptive responses in environmental water stress. Other processes affected by SA concerned the quality of protein translation, the priming of seed metabolism, the synthesis of antioxidant enzymes, and the mobilization of seed storage proteins. All the observed effects are likely to improve seed vigor. Another aspect revealed by this study concerned the oxidative stress entailed by SA in germinating seeds, as inferred from a characterization of the carbonylated (oxidized) proteome. Finally, the proteomic data revealed a close interplay between abscisic signaling and SA elicitation of seed vigor.
In situ imaging and proteome profiling indicate andrographolide is a highly promiscuous compound
NASA Astrophysics Data System (ADS)
Li, Lin; Wijaya, Hadhi; Samanta, Sanjay; Lam, Yulin; Yao, Shao Q.
2015-06-01
Natural products represent an enormous source of pharmacologically useful compounds, and are often used as the starting point in modern drug discovery. Many biologically interesting natural products are however not being pursued as potential drug candidates, partly due to a lack of well-defined mechanism-of-action. Traditional in vitro methods for target identification of natural products based on affinity protein enrichment from crude cellular lysates cannot faithfully recapitulate protein-drug interactions in living cells. Reported herein are dual-purpose probes inspired by the natural product andrographolide, capable of both reaction-based, real-time bioimaging and in situ proteome profiling/target identification in live mammalian cells. Our results confirm that andrographolide is a highly promiscuous compound and engaged in covalent interactions with numerous previously unknown cellular targets in cell type-specific manner. We caution its potential therapeutic effects should be further investigated in detail.
Murugaiyan, Jayaseelan; Eravci, Murat; Weise, Christoph; Roesler, Uwe
2017-06-01
Here, we provide the dataset associated with our research article 'label-free quantitative proteomic analysis of harmless and pathogenic strains of infectious microalgae, Prototheca spp.' (Murugaiyan et al., 2017) [1]. This dataset describes liquid chromatography-mass spectrometry (LC-MS)-based protein identification and quantification of a non-infectious strain, Prototheca zopfii genotype 1 and two strains associated with severe and mild infections, respectively, P. zopfii genotype 2 and Prototheca blaschkeae . Protein identification and label-free quantification was carried out by analysing MS raw data using the MaxQuant-Andromeda software suit. The expressional level differences of the identified proteins among the strains were computed using Perseus software and the results were presented in [1]. This DiB provides the MaxQuant output file and raw data deposited in the PRIDE repository with the dataset identifier PXD005305.
Tools for phospho- and glycoproteomics of plasma membranes.
Wiśniewski, Jacek R
2011-07-01
Analysis of plasma membrane proteins and their posttranslational modifications is considered as important for identification of disease markers and targets for drug treatment. Due to their insolubility in water, studying of plasma membrane proteins using mass spectrometry has been difficult for a long time. Recent technological developments in sample preparation together with important improvements in mass spectrometric analysis have facilitated analysis of these proteins and their posttranslational modifications. Now, large scale proteomic analyses allow identification of thousands of membrane proteins from minute amounts of sample. Optimized protocols for affinity enrichment of phosphorylated and glycosylated peptides have set new dimensions in the depth of characterization of these posttranslational modifications of plasma membrane proteins. Here, I summarize recent advances in proteomic technology for the characterization of the cell surface proteins and their modifications. In the focus are approaches allowing large scale mapping rather than analytical methods suitable for studying individual proteins or non-complex mixtures.
Enhanced Missing Proteins Detection in NCI60 Cell Lines Using an Integrative Search Engine Approach.
Guruceaga, Elizabeth; Garin-Muga, Alba; Prieto, Gorka; Bejarano, Bartolomé; Marcilla, Miguel; Marín-Vicente, Consuelo; Perez-Riverol, Yasset; Casal, J Ignacio; Vizcaíno, Juan Antonio; Corrales, Fernando J; Segura, Victor
2017-12-01
The Human Proteome Project (HPP) aims deciphering the complete map of the human proteome. In the past few years, significant efforts of the HPP teams have been dedicated to the experimental detection of the missing proteins, which lack reliable mass spectrometry evidence of their existence. In this endeavor, an in depth analysis of shotgun experiments might represent a valuable resource to select a biological matrix in design validation experiments. In this work, we used all the proteomic experiments from the NCI60 cell lines and applied an integrative approach based on the results obtained from Comet, Mascot, OMSSA, and X!Tandem. This workflow benefits from the complementarity of these search engines to increase the proteome coverage. Five missing proteins C-HPP guidelines compliant were identified, although further validation is needed. Moreover, 165 missing proteins were detected with only one unique peptide, and their functional analysis supported their participation in cellular pathways as was also proposed in other studies. Finally, we performed a combined analysis of the gene expression levels and the proteomic identifications from the common cell lines between the NCI60 and the CCLE project to suggest alternatives for further validation of missing protein observations.
Hiller, Karsten; Grote, Andreas; Maneck, Matthias; Münch, Richard; Jahn, Dieter
2006-10-01
After the publication of JVirGel 1.0 in 2003 we got many requests and suggestions from the proteomics community to further improve the performance of the software and to add additional useful new features. The integration of the PrediSi algorithm for the prediction of signal peptides for the Sec-dependent protein export into JVirGel 2.0 allows the exclusion of most exported preproteins from calculated proteomic maps and provides the basis for the calculation of Sec-based secretomes. A tool for the identification of transmembrane helices carrying proteins (JCaMelix) and the prediction of the corresponding membrane proteome was added. Finally, in order to directly compare experimental and calculated proteome data, a function to overlay and evaluate predicted and experimental two-dimensional gels was included. JVirGel 2.0 is freely available as precompiled package for the installation on Windows or Linux operating systems. Furthermore, there is a completely platform-independent Java version available for download. Additionally, we provide a Java Server Pages based version of JVirGel 2.0 which can be operated in nearly all web browsers. All versions are accessible at http://www.jvirgel.de
Enhanced Missing Proteins Detection in NCI60 Cell Lines Using an Integrative Search Engine Approach
2017-01-01
The Human Proteome Project (HPP) aims deciphering the complete map of the human proteome. In the past few years, significant efforts of the HPP teams have been dedicated to the experimental detection of the missing proteins, which lack reliable mass spectrometry evidence of their existence. In this endeavor, an in depth analysis of shotgun experiments might represent a valuable resource to select a biological matrix in design validation experiments. In this work, we used all the proteomic experiments from the NCI60 cell lines and applied an integrative approach based on the results obtained from Comet, Mascot, OMSSA, and X!Tandem. This workflow benefits from the complementarity of these search engines to increase the proteome coverage. Five missing proteins C-HPP guidelines compliant were identified, although further validation is needed. Moreover, 165 missing proteins were detected with only one unique peptide, and their functional analysis supported their participation in cellular pathways as was also proposed in other studies. Finally, we performed a combined analysis of the gene expression levels and the proteomic identifications from the common cell lines between the NCI60 and the CCLE project to suggest alternatives for further validation of missing protein observations. PMID:28960077
Islam, Mohammad T; Garg, Gagan; Hancock, William S; Risk, Brian A; Baker, Mark S; Ranganathan, Shoba
2014-01-03
The chromosome-centric human proteome project (C-HPP) aims to define the complete set of proteins encoded in each human chromosome. The neXtProt database (September 2013) lists 20,128 proteins for the human proteome, of which 3831 human proteins (∼19%) are considered "missing" according to the standard metrics table (released September 27, 2013). In support of the C-HPP initiative, we have extended the annotation strategy developed for human chromosome 7 "missing" proteins into a semiautomated pipeline to functionally annotate the "missing" human proteome. This pipeline integrates a suite of bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology, and biochemical pathways. From sequential BLAST searches, we have primarily identified homologues from reviewed nonhuman mammalian proteins with protein evidence for 1271 (33.2%) "missing" proteins, followed by 703 (18.4%) homologues from reviewed nonhuman mammalian proteins and subsequently 564 (14.7%) homologues from reviewed human proteins. Functional annotations for 1945 (50.8%) "missing" proteins were also determined. To accelerate the identification of "missing" proteins from proteomics studies, we generated proteotypic peptides in silico. Matching these proteotypic peptides to ENCODE proteogenomic data resulted in proteomic evidence for 107 (2.8%) of the 3831 "missing proteins, while evidence from a recent membrane proteomic study supported the existence for another 15 "missing" proteins. The chromosome-wise functional annotation of all "missing" proteins is freely available to the scientific community through our web server (http://biolinfo.org/protannotator).
Standardized protocols for quality control of MRM-based plasma proteomic workflows.
Percy, Andrew J; Chambers, Andrew G; Smith, Derek S; Borchers, Christoph H
2013-01-04
Mass spectrometry (MS)-based proteomics is rapidly emerging as a viable technology for the identification and quantitation of biological samples, such as human plasma--the most complex yet commonly employed biofluid in clinical analyses. The transition from a qualitative to quantitative science is required if proteomics is going to successfully make the transition to a clinically useful technique. MS, however, has been criticized for a lack of reproducibility and interlaboratory transferability. Currently, the MS and plasma proteomics communities lack standardized protocols and reagents to ensure that high-quality quantitative data can be accurately and precisely reproduced by laboratories across the world using different MS technologies. Toward addressing this issue, we have developed standard protocols for multiple reaction monitoring (MRM)-based assays with customized isotopically labeled internal standards for quality control of the sample preparation workflow and the MS platform in quantitative plasma proteomic analyses. The development of reference standards and their application to a single MS platform is discussed herein, along with the results from intralaboratory tests. The tests highlighted the importance of the reference standards in assessing the efficiency and reproducibility of the entire bottom-up proteomic workflow and revealed errors related to the sample preparation and performance quality and deficits of the MS and LC systems. Such evaluations are necessary if MRM-based quantitative plasma proteomics is to be used in verifying and validating putative disease biomarkers across different research laboratories and eventually in clinical laboratories.
Krüger, Thomas; Luo, Ting; Schmidt, Hella; Shopova, Iordana; Kniemeyer, Olaf
2015-12-14
Opportunistic human pathogenic fungi including the saprotrophic mold Aspergillus fumigatus and the human commensal Candida albicans can cause severe fungal infections in immunocompromised or critically ill patients. The first line of defense against opportunistic fungal pathogens is the innate immune system. Phagocytes such as macrophages, neutrophils and dendritic cells are an important pillar of the innate immune response and have evolved versatile defense strategies against microbial pathogens. On the other hand, human-pathogenic fungi have sophisticated virulence strategies to counteract the innate immune defense. In this context, proteomic approaches can provide deeper insights into the molecular mechanisms of the interaction of host immune cells with fungal pathogens. This is crucial for the identification of both diagnostic biomarkers for fungal infections and therapeutic targets. Studying host-fungal interactions at the protein level is a challenging endeavor, yet there are few studies that have been undertaken. This review draws attention to proteomic techniques and their application to fungal pathogens and to challenges, difficulties, and limitations that may arise in the course of simultaneous dual proteome analysis of host immune cells interacting with diverse morphotypes of fungal pathogens. On this basis, we discuss strategies to overcome these multifaceted experimental and analytical challenges including the viability of immune cells during co-cultivation, the increased and heterogeneous protein complexity of the host proteome dynamically interacting with the fungal proteome, and the demands on normalization strategies in terms of relative quantitative proteome analysis.
Valeja, Santosh G; Xiu, Lichen; Gregorich, Zachery R; Guner, Huseyin; Jin, Song; Ge, Ying
2015-01-01
To address the complexity of the proteome in mass spectrometry (MS)-based top-down proteomics, multidimensional liquid chromatography (MDLC) strategies that can effectively separate proteins with high resolution and automation are highly desirable. Although various MDLC methods that can effectively separate peptides from protein digests exist, very few MDLC strategies, primarily consisting of 2DLC, are available for intact protein separation, which is insufficient to address the complexity of the proteome. We recently demonstrated that hydrophobic interaction chromatography (HIC) utilizing a MS-compatible salt can provide high resolution separation of intact proteins for top-down proteomics. Herein, we have developed a novel 3DLC strategy by coupling HIC with ion exchange chromatography (IEC) and reverse phase chromatography (RPC) for intact protein separation. We demonstrated that a 3D (IEC-HIC-RPC) approach greatly outperformed the conventional 2D IEC-RPC approach. For the same IEC fraction (out of 35 fractions) from a crude HEK 293 cell lysate, a total of 640 proteins were identified in the 3D approach (corresponding to 201 nonredundant proteins) as compared to 47 in the 2D approach, whereas simply prolonging the gradients in RPC in the 2D approach only led to minimal improvement in protein separation and identifications. Therefore, this novel 3DLC method has great potential for effective separation of intact proteins to achieve deep proteome coverage in top-down proteomics.
Zhao, Yan; Chang, Cheng; Qin, Peibin; Cao, Qichen; Tian, Fang; Jiang, Jing; Li, Xianyu; Yu, Wenfeng; Zhu, Yunping; He, Fuchu; Ying, Wantao; Qian, Xiaohong
2016-01-21
Human plasma is a readily available clinical sample that reflects the status of the body in normal physiological and disease states. Although the wide dynamic range and immense complexity of plasma proteins are obstacles, comprehensive proteomic analysis of human plasma is necessary for biomarker discovery and further verification. Various methods such as immunodepletion, protein equalization and hyper fractionation have been applied to reduce the influence of high-abundance proteins (HAPs) and to reduce the high level of complexity. However, the depth at which the human plasma proteome has been explored in a relatively short time frame has been limited, which impedes the transfer of proteomic techniques to clinical research. Development of an optimal strategy is expected to improve the efficiency of human plasma proteome profiling. Here, five three-dimensional strategies combining HAP depletion (the 1st dimension) and protein fractionation (the 2nd dimension), followed by LC-MS/MS analysis (the 3rd dimension) were developed and compared for human plasma proteome profiling. Pros and cons of the five strategies are discussed for two issues: HAP depletion and complexity reduction. Strategies A and B used proteome equalization and tandem Seppro IgY14 immunodepletion, respectively, as the first dimension. Proteome equalization (strategy A) was biased toward the enrichment of basic and low-molecular weight proteins and had limited ability to enrich low-abundance proteins. By tandem removal of HAPs (strategy B), the efficiency of HAP depletion was significantly increased, whereas more off-target proteins were subtracted simultaneously. In the comparison of complexity reduction, strategy D involved a deglycosylation step before high-pH RPLC separation. However, the increase in sequence coverage did not increase the protein number as expected. Strategy E introduced SDS-PAGE separation of proteins, and the results showed oversampling of HAPs and identification of fewer proteins. Strategy C combined single Seppro IgY14 immunodepletion, high-pH RPLC fractionation and LC-MS/MS analysis. It generated the largest dataset, containing 1544 plasma protein groups and 258 newly identified proteins in a 30-h-machine-time analysis, making it the optimum three-dimensional strategy in our study. Further analysis of the integrated data from the five strategies showed identical distribution patterns in terms of sequence features and GO functional analysis with the 1929-plasma-protein dataset, further supporting the reliability of our plasma protein identifications. The characterization of 20 cytokines in the concentration range from sub-nanograms/milliliter to micrograms/milliliter demonstrated the sensitivity of the current strategies. Copyright © 2015 Elsevier B.V. All rights reserved.
Global Reprogramming of Host SUMOylation during Influenza Virus Infection
Domingues, Patricia; Golebiowski, Filip; Tatham, Michael H.; Lopes, Antonio M.; Taggart, Aislynn; Hay, Ronald T.; Hale, Benjamin G.
2015-01-01
Summary Dynamic nuclear SUMO modifications play essential roles in orchestrating cellular responses to proteotoxic stress, DNA damage, and DNA virus infection. Here, we describe a non-canonical host SUMOylation response to the nuclear-replicating RNA pathogen, influenza virus, and identify viral RNA polymerase activity as a major contributor to SUMO proteome remodeling. Using quantitative proteomics to compare stress-induced SUMOylation responses, we reveal that influenza virus infection triggers unique re-targeting of SUMO to 63 host proteins involved in transcription, mRNA processing, RNA quality control, and DNA damage repair. This is paralleled by widespread host deSUMOylation. Depletion screening identified ten virus-induced SUMO targets as potential antiviral factors, including C18orf25 and the SMC5/6 and PAF1 complexes. Mechanistic studies further uncovered a role for SUMOylation of the PAF1 complex component, parafibromin (CDC73), in potentiating antiviral gene expression. Our global characterization of influenza virus-triggered SUMO redistribution provides a proteomic resource to understand host nuclear SUMOylation responses to infection. PMID:26549460
Olinares, Paul Dominic B.; Ponnala, Lalit; van Wijk, Klaas J.
2010-01-01
To characterize MDa-sized macromolecular chloroplast stroma protein assemblies and to extend coverage of the chloroplast stroma proteome, we fractionated soluble chloroplast stroma in the non-denatured state by size exclusion chromatography with a size separation range up to ∼5 MDa. To maximize protein complex stability and resolution of megadalton complexes, ionic strength and composition were optimized. Subsequent high accuracy tandem mass spectrometry analysis (LTQ-Orbitrap) identified 1081 proteins across the complete native mass range. Protein complexes and assembly states above 0.8 MDa were resolved using hierarchical clustering, and protein heat maps were generated from normalized protein spectral counts for each of the size exclusion chromatography fractions; this complemented previous analysis of stromal complexes up to 0.8 MDa (Peltier, J. B., Cai, Y., Sun, Q., Zabrouskov, V., Giacomelli, L., Rudella, A., Ytterberg, A. J., Rutschow, H., and van Wijk, K. J. (2006) The oligomeric stromal proteome of Arabidopsis thaliana chloroplasts. Mol. Cell. Proteomics 5, 114–133). This combined experimental and bioinformatics analyses resolved chloroplast ribosomes in different assembly and functional states (e.g. 30, 50, and 70 S), which enabled the identification of plastid homologues of prokaryotic ribosome assembly factors as well as proteins involved in co-translational modifications, targeting, and folding. The roles of these ribosome-associating proteins will be discussed. Known RNA splice factors (e.g. CAF1/WTF1/RNC1) as well as uncharacterized proteins with RNA-binding domains (pentatricopeptide repeat, RNA recognition motif, and chloroplast ribosome maturation), RNases, and DEAD box helicases were found in various sized complexes. Chloroplast DNA (>3 MDa) was found in association with the complete heteromeric plastid-encoded DNA polymerase complex, and a dozen other DNA-binding proteins, e.g. DNA gyrase, topoisomerase, and various DNA repair enzymes. The heteromeric ≥5-MDa pyruvate dehydrogenase complex and the 0.8–1-MDa acetyl-CoA carboxylase complex associated with uncharacterized biotin carboxyl carrier domain proteins constitute the entry point to fatty acid metabolism in leaves; we suggest that their large size relates to the need for metabolic channeling. Protein annotations and identification data are available through the Plant Proteomics Database, and mass spectrometry data are available through Proteomics Identifications database. PMID:20423899
Di Girolamo, Francesco; Righetti, Pier Giorgio; Soste, Martin; Feng, Yuehan; Picotti, Paola
2013-08-26
Systems biology studies require the capability to quantify with high precision proteins spanning a broad range of abundances across multiple samples. However, the broad range of protein expression in cells often precludes the detection of low-abundance proteins. Different sample processing techniques can be applied to increase proteome coverage. Among these, combinatorial (hexa)peptide ligand libraries (CPLLs) bound to solid matrices have been used to specifically capture and detect low-abundance proteins in complex samples. To assess whether CPLL capture can be applied in systems biology studies involving the precise quantitation of proteins across a multitude of samples, we evaluated its performance across the whole range of protein abundances in Saccharomyces cerevisiae. We used selected reaction monitoring assays for a set of target proteins covering a broad abundance range to quantitatively evaluate the precision of the approach and its capability to detect low-abundance proteins. Replicated CPLL-isolates showed an average variability of ~10% in the amount of the isolated proteins. The high reproducibility of the technique was not dependent on the abundance of the protein or the amount of beads used for the capture. However, the protein-to-bead ratio affected the enrichment of specific proteins. We did not observe a normalization effect of CPLL beads on protein abundances. However, CPLLs enriched for and depleted specific sets of proteins and thus changed the abundances of proteins from a whole proteome extract. This allowed the identification of ~400 proteins otherwise undetected in an untreated sample, under the experimental conditions used. CPLL capture is thus a useful tool to increase protein identifications in proteomic experiments, but it should be coupled to the analysis of untreated samples, to maximize proteome coverage. Our data also confirms that CPLL capture is reproducible and can be confidently used in quantitative proteomic experiments. Combinatorial hexapeptide ligand libraries (CPLLs) bound to solid matrices have been proposed to specifically capture and detect low-abundance proteins in complex samples. To assess whether the CPLL capture can be confidently applied in systems biology studies involving the precise quantitation of proteins across a broad range of abundances and a multitude of samples, we evaluated its reproducibility and performance features. Using selected reaction monitoring assays for proteins covering the whole range of abundances we show that the technique is reproducible and compatible with quantitative proteomic studies. However, the protein-to-bead ratio affects the enrichment of specific proteins and CPLLs depleted specific sets of proteins from a whole proteome extract. Our results suggest that CPLL-based analyses should be coupled to the analysis of untreated samples, to maximize proteome coverage. Overall, our data confirms the applicability of CPLLs in systems biology research and guides the correct use of this technique. Copyright © 2013 Elsevier B.V. All rights reserved.
2011-01-01
Background Despite the successful eradication of smallpox by the WHO-led vaccination programme, pox virus infections remain a considerable health threat. The possible use of smallpox as a bioterrorism agent as well as the continuous occurrence of zoonotic pox virus infections document the relevance to deepen the understanding for virus host interactions. Since the permissiveness of pox infections is independent of hosts surface receptors, but correlates with the ability of the virus to infiltrate the antiviral host response, it directly depends on the hosts proteome set. In this report the proteome of HEK293 cells infected with Vaccinia Virus strain IHD-W was analyzed by 2-dimensional gel electrophoresis and MALDI-PSD-TOF MS in a bottom-up approach. Results The cellular and viral proteomes of VACV IHD-W infected HEK293 cells, UV-inactivated VACV IHD-W-treated as well as non-infected cells were compared. Derivatization of peptides with 4-sulfophenyl isothiocyanate (SPITC) carried out on ZipTipμ-C18 columns enabled protein identification via the peptides' primary sequence, providing improved s/n ratios as well as signal intensities of the PSD spectra. The expression of more than 24 human proteins was modulated by the viral infection. Effects of UV-inactivated and infectious viruses on the hosts' proteome concerning energy metabolism and proteins associated with gene expression and protein-biosynthesis were quite similar. These effects might therefore be attributed to virus entry and virion proteins. However, the modulation of proteins involved in apoptosis was clearly correlated to infectious viruses. Conclusions The proteome analysis of infected cells provides insight into apoptosis modulation, regulation of cellular gene expression and the regulation of energy metabolism. The confidence of protein identifications was clearly improved by the peptides' derivatization with SPITC on a solid phase support. Some of the identified proteins have not been described in the context of poxvirus infections before and need to be further characterised to identify their meaning for apoptosis modulation and pathogenesis. PMID:21806805
Bromilow, Sophie; Gethings, Lee A; Buckley, Mike; Bromley, Mike; Shewry, Peter R; Langridge, James I; Clare Mills, E N
2017-06-23
The unique physiochemical properties of wheat gluten enable a diverse range of food products to be manufactured. However, gluten triggers coeliac disease, a condition which is treated using a gluten-free diet. Analytical methods are required to confirm if foods are gluten-free, but current immunoassay-based methods can unreliable and proteomic methods offer an alternative but require comprehensive and well annotated sequence databases which are lacking for gluten. A manually a curated database (GluPro V1.0) of gluten proteins, comprising 630 discrete unique full length protein sequences has been compiled. It is representative of the different types of gliadin and glutenin components found in gluten. An in silico comparison of their coeliac toxicity was undertaken by analysing the distribution of coeliac toxic motifs. This demonstrated that whilst the α-gliadin proteins contained more toxic motifs, these were distributed across all gluten protein sub-types. Comparison of annotations observed using a discovery proteomics dataset acquired using ion mobility MS/MS showed that more reliable identifications were obtained using the GluPro V1.0 database compared to the complete reviewed Viridiplantae database. This highlights the value of a curated sequence database specifically designed to support the proteomic workflows and the development of methods to detect and quantify gluten. We have constructed the first manually curated open-source wheat gluten protein sequence database (GluPro V1.0) in a FASTA format to support the application of proteomic methods for gluten protein detection and quantification. We have also analysed the manually verified sequences to give the first comprehensive overview of the distribution of sequences able to elicit a reaction in coeliac disease, the prevalent form of gluten intolerance. Provision of this database will improve the reliability of gluten protein identification by proteomic analysis, and aid the development of targeted mass spectrometry methods in line with Codex Alimentarius Commission requirements for foods designed to meet the needs of gluten intolerant individuals. Copyright © 2017. Published by Elsevier B.V.
Xue, Ting; Liu, Ping; Zhou, Yong; Liu, Kun; Yang, Li; Moritz, Robert L; Yan, Wei; Xu, Lisa X
2016-01-01
Cryo-thermal therapy has been emerged as a promising novel therapeutic strategy for advanced breast cancer, triggering higher incidence of tumor regression and enhanced remission of metastasis than routine treatments. To better understand its anti-tumor mechanism, we utilized a spontaneous metastatic mouse model and quantitative proteomics to compare N-glycoproteome changes in 94 serum samples with and without treatment. We quantified 231 highly confident N-glycosylated proteins using iTRAQ shotgun proteomics. Among them, 53 showed significantly discriminated regulatory patterns over the time course, in which the acute phase response emerged as the most enhanced pathway. The anti-tumor feature of the acute response was further investigated using parallel reaction monitoring target proteomics and flow cytometry on 23 of the 53 significant proteins. We found that cryo-thermal therapy reset the tumor chronic inflammation to an "acute" phenotype, with up-regulation of acute phase proteins including IL-6 as a key regulator. The IL-6 mediated "acute" phenotype transformed IL-4 and Treg-promoting ICOSL expression to Th1-promoting IFN-γ and IL-12 production, augmented complement system activation and CD86(+)MHCII(+) dendritic cells maturation and enhanced the proliferation of Th1 memory cells. In addition, we found an increased production of tumor progression and metastatic inhibitory proteins under such "acute" environment, favoring the anti-metastatic effect. Moreover, cryo-thermal on tumors induced the strongest "acute" response compared to cryo/hyperthermia alone or cryo-thermal on healthy tissues, accompanying by the most pronounced anti-tumor immunological effect. In summary, we demonstrated that cryo-thermal therapy induced, IL-6 mediated "acute" microenvironment shifted the tumor chronic microenvironment from Th2 immunosuppressive and pro-tumorigenic to Th1 immunostimulatory and tumoricidal state. Moreover, the magnitude of "acute" and "danger" signals play a key role in determining the efficacy of anti-tumor activity.
A combinatorial perspective of the protein inference problem.
Yang, Chao; He, Zengyou; Yu, Weichuan
2013-01-01
In a shotgun proteomics experiment, proteins are the most biologically meaningful output. The success of proteomics studies depends on the ability to accurately and efficiently identify proteins. Many methods have been proposed to facilitate the identification of proteins from peptide identification results. However, the relationship between protein identification and peptide identification has not been thoroughly explained before. In this paper, we devote ourselves to a combinatorial perspective of the protein inference problem. We employ combinatorial mathematics to calculate the conditional protein probabilities (protein probability means the probability that a protein is correctly identified) under three assumptions, which lead to a lower bound, an upper bound, and an empirical estimation of protein probabilities, respectively. The combinatorial perspective enables us to obtain an analytical expression for protein inference. Our method achieves comparable results with ProteinProphet in a more efficient manner in experiments on two data sets of standard protein mixtures and two data sets of real samples. Based on our model, we study the impact of unique peptides and degenerate peptides (degenerate peptides are peptides shared by at least two proteins) on protein probabilities. Meanwhile, we also study the relationship between our model and ProteinProphet. We name our program ProteinInfer. Its Java source code, our supplementary document and experimental results are available at: >http://bioinformatics.ust.hk/proteininfer.
Yang, Chunguang G; Granite, Stephen J; Van Eyk, Jennifer E; Winslow, Raimond L
2006-11-01
Protein identification using MS is an important technique in proteomics as well as a major generator of proteomics data. We have designed the protein identification data object model (PDOM) and developed a parser based on this model to facilitate the analysis and storage of these data. The parser works with HTML or XML files saved or exported from MASCOT MS/MS ions search in peptide summary report or MASCOT PMF search in protein summary report. The program creates PDOM objects, eliminates redundancy in the input file, and has the capability to output any PDOM object to a relational database. This program facilitates additional analysis of MASCOT search results and aids the storage of protein identification information. The implementation is extensible and can serve as a template to develop parsers for other search engines. The parser can be used as a stand-alone application or can be driven by other Java programs. It is currently being used as the front end for a system that loads HTML and XML result files of MASCOT searches into a relational database. The source code is freely available at http://www.ccbm.jhu.edu and the program uses only free and open-source Java libraries.
Kou, Qiang; Wu, Si; Tolic, Nikola; Paša-Tolic, Ljiljana; Liu, Yunlong; Liu, Xiaowen
2017-05-01
Although proteomics has rapidly developed in the past decade, researchers are still in the early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential to mapping proteoforms to their biological functions as well as discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a 'bird's eye view' of intact proteoforms. The combinatorial explosion of various alterations on a protein may result in billions of possible proteoforms, making proteoform identification a challenging computational problem. We propose a new data structure, called the mass graph, for efficient representation of proteoforms and design mass graph alignment algorithms. We developed TopMG, a mass graph-based software tool for proteoform identification by top-down mass spectrometry. Experiments on top-down mass spectrometry datasets showed that TopMG outperformed existing methods in identifying complex proteoforms. http://proteomics.informatics.iupui.edu/software/topmg/. xwliu@iupui.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Comparative proteomic assessment of matrisome enrichment methodologies.
Krasny, Lukas; Paul, Angela; Wai, Patty; Howard, Beatrice A; Natrajan, Rachael C; Huang, Paul H
2016-11-01
The matrisome is a complex and heterogeneous collection of extracellular matrix (ECM) and ECM-associated proteins that play important roles in tissue development and homeostasis. While several strategies for matrisome enrichment have been developed, it is currently unknown how the performance of these different methodologies compares in the proteomic identification of matrisome components across multiple tissue types. In the present study, we perform a comparative proteomic assessment of two widely used decellularisation protocols and two extraction methods to characterise the matrisome in four murine organs (heart, mammary gland, lung and liver). We undertook a systematic evaluation of the performance of the individual methods on protein yield, matrisome enrichment capability and the ability to isolate core matrisome and matrisome-associated components. Our data find that sodium dodecyl sulphate (SDS) decellularisation leads to the highest matrisome enrichment efficiency, while the extraction protocol that comprises chemical and trypsin digestion of the ECM fraction consistently identifies the highest number of matrisomal proteins across all types of tissue examined. Matrisome enrichment had a clear benefit over non-enriched tissue for the comprehensive identification of matrisomal components in murine liver and heart. Strikingly, we find that all four matrisome enrichment methods led to significant losses in the soluble matrisome-associated proteins across all organs. Our findings highlight the multiple factors (including tissue type, matrisome class of interest and desired enrichment purity) that influence the choice of enrichment methodology, and we anticipate that these data will serve as a useful guide for the design of future proteomic studies of the matrisome. © 2016 The Author(s).
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger
Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J
2009-01-01
Background Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). Results 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. Conclusion This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method. PMID:19193216
González, Fermín E; Chernobrovkin, Alexey; Pereda, Cristián; García-Salum, Tamara; Tittarelli, Andrés; López, Mercedes N; Salazar-Onfray, Flavio; Zubarev, Roman A
2018-01-01
Autologous dendritic cells (DCs) loaded with cancer cell-derived lysates have become a promising tool in cancer immunotherapy. During the last decade, we demonstrated that vaccination of advanced melanoma patients with autologous tumor antigen presenting cells (TAPCells) loaded with an allogeneic heat shock- (HS-) conditioned melanoma cell-derived lysate (called TRIMEL) is able to induce an antitumor immune response associated with a prolonged patient survival. TRIMEL provides not only a broad spectrum of potential melanoma-associated antigens but also danger signals that are crucial in the induction of a committed mature DC phenotype. However, potential changes induced by heat conditioning on the proteome of TRIMEL are still unknown. The identification of newly or differentially expressed proteins under defined stress conditions is relevant for understanding the lysate immunogenicity. Here, we characterized the proteomic profile of TRIMEL in response to HS treatment. A quantitative label-free proteome analysis of over 2800 proteins was performed, with 91 proteins that were found to be regulated by HS treatment: 18 proteins were overexpressed and 73 underexpressed. Additionally, 32 proteins were only identified in the HS-treated TRIMEL and 26 in non HS-conditioned samples. One protein from the overexpressed group and two proteins from the HS-exclusive group were previously described as potential damage-associated molecular patterns (DAMPs). Some of the HS-induced proteins, such as haptoglobin, could be also considered as DAMPs and candidates for further immunological analysis in the establishment of new putative danger signals with immunostimulatory functions.
Chernobrovkin, Alexey; Pereda, Cristián; García-Salum, Tamara; Tittarelli, Andrés; López, Mercedes N.; Salazar-Onfray, Flavio
2018-01-01
Autologous dendritic cells (DCs) loaded with cancer cell-derived lysates have become a promising tool in cancer immunotherapy. During the last decade, we demonstrated that vaccination of advanced melanoma patients with autologous tumor antigen presenting cells (TAPCells) loaded with an allogeneic heat shock- (HS-) conditioned melanoma cell-derived lysate (called TRIMEL) is able to induce an antitumor immune response associated with a prolonged patient survival. TRIMEL provides not only a broad spectrum of potential melanoma-associated antigens but also danger signals that are crucial in the induction of a committed mature DC phenotype. However, potential changes induced by heat conditioning on the proteome of TRIMEL are still unknown. The identification of newly or differentially expressed proteins under defined stress conditions is relevant for understanding the lysate immunogenicity. Here, we characterized the proteomic profile of TRIMEL in response to HS treatment. A quantitative label-free proteome analysis of over 2800 proteins was performed, with 91 proteins that were found to be regulated by HS treatment: 18 proteins were overexpressed and 73 underexpressed. Additionally, 32 proteins were only identified in the HS-treated TRIMEL and 26 in non HS-conditioned samples. One protein from the overexpressed group and two proteins from the HS-exclusive group were previously described as potential damage-associated molecular patterns (DAMPs). Some of the HS-induced proteins, such as haptoglobin, could be also considered as DAMPs and candidates for further immunological analysis in the establishment of new putative danger signals with immunostimulatory functions. PMID:29744371
Dalle-Donne, Isabella; Colombo, Graziano; Gornati, Rosalba; Garavaglia, Maria L; Portinaro, Nicola; Giustarini, Daniela; Bernardini, Giovanni; Rossi, Ranieri; Milzani, Aldo
2017-03-10
Oxidative stress is one mechanism whereby tobacco smoking affects human health, as reflected by increased levels of several biomarkers of oxidative stress/damage isolated from tissues and biological fluids of active and passive smokers. Many investigations of cigarette smoke (CS)-induced oxidative stress/damage have been carried out in mammalian animal and cellular models of exposure to CS. Animal models allow the investigation of many parameters that are similar to those measured in human smokers. In vitro cell models may provide new information on molecular and functional differences between cells of smokers and nonsmokers. Recent Advances: Over the past decade or so, a growing number of researches highlighted that CS induces protein carbonylation in different tissues and body fluids of smokers as well as in in vivo and in vitro models of exposure to CS. We review recent findings on protein carbonylation in smokers and models thereof, focusing on redox proteomic studies. We also discuss the relevance and limitations of these models of exposure to CS and critically assess the congruence between the smoker's condition and laboratory models. The identification of protein targets is crucial for understanding the mechanism(s) by which carbonylated proteins accumulate and potentially affect cellular functions. Recent progress in redox proteomics allows the enrichment, identification, and characterization of specific oxidative protein modifications, including carbonylation. Therefore, redox proteomics can be a powerful tool to gain new insights into the onset and/or progression of CS-related diseases and to develop strategies to prevent and/or treat them. Antioxid. Redox Signal. 26, 406-426.
Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.
Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali
2018-04-01
Indian sandalwood ( Santalum album ) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.
Quantitative proteomic analysis of bacterial enzymes released in cheese during ripening.
Jardin, Julien; Mollé, Daniel; Piot, Michel; Lortal, Sylvie; Gagnaire, Valérie
2012-04-02
Due to increasingly available bacterial genomes in databases, proteomic tools have recently been used to screen proteins expressed by micro-organisms in food in order to better understand their metabolism in situ. While the main objective is the systematic identification of proteins, the next step will be to bridge the gap between identification and quantification of these proteins. For that purpose, a new mass spectrometry-based approach was applied, using isobaric tagging reagent for quantitative proteomic analysis (iTRAQ), which are amine specific and yield labelled peptides identical in mass. Experimental Swiss-type cheeses were manufactured from microfiltered milk using Streptococcus thermophilus ITG ST20 and Lactobacillus helveticus ITG LH1 as lactic acid starters. At three ripening times (7, 20 and 69 days), cheese aqueous phases were extracted and enriched in bacterial proteins by fractionation. Each sample, standardised in protein amount prior to proteomic analyses, was: i) analysed by 2D-electrophoresis for qualitative analysis and ii) submitted to trypsinolysis, and labelled with specific iTRAQ tag, one per ripening time. The three labelled samples were mixed together and analysed by nano-LC coupled on-line with ESI-QTOF mass spectrometer. Thirty proteins, both from bacterial or bovine origin, were identified and efficiently quantified. The free bacterial proteins detected were enzymes from the central carbon metabolism as well as stress proteins. Depending on the protein considered, the quantity of these proteins in the cheese aqueous extract increased from 2.5 to 20 fold in concentration from day 7 to day 69 of ripening. Copyright © 2012 Elsevier B.V. All rights reserved.
Mahajan, Shikha; Manetsch, Roman; Merkler, David J.; Stevens Jr., Stanley M.
2015-01-01
Proteomics is a powerful approach used for investigating the complex molecular mechanisms of disease pathogenesis and progression. An important challenge in modern protein profiling approaches involves targeting of specific protein activities in order to identify altered molecular processes associated with disease pathophysiology. Adenosine-binding proteins represent an important subset of the proteome where aberrant expression or activity changes of these proteins have been implicated in numerous human diseases. Herein, we describe an affinity-based approach for the enrichment of adenosine-binding proteins from a complex cell proteome. A novel N 6-biotinylated-8-azido-adenosine probe (AdoR probe) was synthesized, which contains a reactive group that forms a covalent bond with the target proteins, as well as a biotin tag for affinity enrichment using avidin chromatography. Probe specificity was confirmed with protein standards prior to further evaluation in a complex protein mixture consisting of a lysate derived from mouse neuroblastoma N18TG2 cells. Protein identification and relative quantitation using mass spectrometry allowed for the identification of small variations in abundance of nucleoside- and nucleotide-binding proteins in these samples where a significant enrichment of AdoR-binding proteins in the labeled proteome from the neuroblastoma cells was observed. The results from this study demonstrate the utility of this method to enrich for nucleoside- and nucleotide-binding proteins in a complex protein mixture, pointing towards a unique set of proteins that can be examined in the context of further understanding mechanisms of disease, or fundamental biological processes in general. PMID:25671571
Proteomic analysis of Rhodotorula mucilaginosa: dealing with the issues of a non-conventional yeast.
Addis, Maria Filippa; Tanca, Alessandro; Landolfo, Sara; Abbondio, Marcello; Cutzu, Raffaela; Biosa, Grazia; Pagnozzi, Daniela; Uzzau, Sergio; Mannazzu, Ilaria
2016-08-01
Red yeasts ascribed to the species Rhodotorula mucilaginosa are gaining increasing attention, due to their numerous biotechnological applications, spanning carotenoid production, liquid bioremediation, heavy metal biotransformation and antifungal and plant growth-promoting actions, but also for their role as opportunistic pathogens. Nevertheless, their characterization at the 'omic' level is still scarce. Here, we applied different proteomic workflows to R. mucilaginosa with the aim of assessing their potential in generating information on proteins and functions of biotechnological interest, with a particular focus on the carotenogenic pathway. After optimization of protein extraction, we tested several gel-based (including 2D-DIGE) and gel-free sample preparation techniques, followed by tandem mass spectrometry analysis. Contextually, we evaluated different bioinformatic strategies for protein identification and interpretation of the biological significance of the dataset. When 2D-DIGE analysis was applied, not all spots returned a unambiguous identification and no carotenogenic enzymes were identified, even upon the application of different database search strategies. Then, the application of shotgun proteomic workflows with varying levels of sensitivity provided a picture of the information depth that can be reached with different analytical resources, and resulted in a plethora of information on R. mucilaginosa metabolism. However, also in these cases no proteins related to the carotenogenic pathway were identified, thus indicating that further improvements in sequence databases and functional annotations are strictly needed for increasing the outcome of proteomic analysis of this and other non-conventional yeasts. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Russ, Holger A; Landsman, Limor; Moss, Christopher L; Higdon, Roger; Greer, Renee L; Kaihara, Kelly; Salamon, Randy; Kolker, Eugene; Hebrok, Matthias
2016-01-01
Current approaches in human embryonic stem cell (hESC) to pancreatic beta cell differentiation have largely been based on knowledge gained from developmental studies of the epithelial pancreas, while the potential roles of other supporting tissue compartments have not been fully explored. One such tissue is the pancreatic mesenchyme that supports epithelial organogenesis throughout embryogenesis. We hypothesized that detailed characterization of the pancreatic mesenchyme might result in the identification of novel factors not used in current differentiation protocols. Supplementing existing hESC differentiation conditions with such factors might create a more comprehensive simulation of normal development in cell culture. To validate our hypothesis, we took advantage of a novel transgenic mouse model to isolate the pancreatic mesenchyme at distinct embryonic and postnatal stages for subsequent proteomic analysis. Refined sample preparation and analysis conditions across four embryonic and prenatal time points resulted in the identification of 21,498 peptides with high-confidence mapping to 1,502 proteins. Expression analysis of pancreata confirmed the presence of three potentially important factors in cell differentiation: Galectin-1 (LGALS1), Neuroplastin (NPTN), and the Laminin α-2 subunit (LAMA2). Two of the three factors (LGALS1 and LAMA2) increased expression of pancreatic progenitor transcript levels in a published hESC to beta cell differentiation protocol. In addition, LAMA2 partially blocks cell culture induced beta cell dedifferentiation. Summarily, we provide evidence that proteomic analysis of supporting tissues such as the pancreatic mesenchyme allows for the identification of potentially important factors guiding hESC to pancreas differentiation.
Russ, Holger A.; Landsman, Limor; Moss, Christopher L.; Higdon, Roger; Greer, Renee L.; Kaihara, Kelly; Salamon, Randy; Kolker, Eugene; Hebrok, Matthias
2016-01-01
Current approaches in human embryonic stem cell (hESC) to pancreatic beta cell differentiation have largely been based on knowledge gained from developmental studies of the epithelial pancreas, while the potential roles of other supporting tissue compartments have not been fully explored. One such tissue is the pancreatic mesenchyme that supports epithelial organogenesis throughout embryogenesis. We hypothesized that detailed characterization of the pancreatic mesenchyme might result in the identification of novel factors not used in current differentiation protocols. Supplementing existing hESC differentiation conditions with such factors might create a more comprehensive simulation of normal development in cell culture. To validate our hypothesis, we took advantage of a novel transgenic mouse model to isolate the pancreatic mesenchyme at distinct embryonic and postnatal stages for subsequent proteomic analysis. Refined sample preparation and analysis conditions across four embryonic and prenatal time points resulted in the identification of 21,498 peptides with high-confidence mapping to 1,502 proteins. Expression analysis of pancreata confirmed the presence of three potentially important factors in cell differentiation: Galectin-1 (LGALS1), Neuroplastin (NPTN), and the Laminin α-2 subunit (LAMA2). Two of the three factors (LGALS1 and LAMA2) increased expression of pancreatic progenitor transcript levels in a published hESC to beta cell differentiation protocol. In addition, LAMA2 partially blocks cell culture induced beta cell dedifferentiation. Summarily, we provide evidence that proteomic analysis of supporting tissues such as the pancreatic mesenchyme allows for the identification of potentially important factors guiding hESC to pancreas differentiation. PMID:26681951
Tyrosine-Nitrated Proteins: Proteomic and Bioanalytical Aspects.
Batthyány, Carlos; Bartesaghi, Silvina; Mastrogiovanni, Mauricio; Lima, Analía; Demicheli, Verónica; Radi, Rafael
2017-03-01
"Nitroproteomic" is under active development, as 3-nitrotyrosine in proteins constitutes a footprint left by the reactions of nitric oxide-derived oxidants that are usually associated to oxidative stress conditions. Moreover, protein tyrosine nitration can cause structural and functional changes, which may be of pathophysiological relevance for human disease conditions. Biological protein tyrosine nitration is a free radical process involving the intermediacy of tyrosyl radicals; in spite of being a nonenzymatic process, nitration is selectively directed toward a limited subset of tyrosine residues. Precise identification and quantitation of 3-nitrotyrosine in proteins has represented a "tour de force" for researchers. Recent Advances: A small number of proteins are preferential targets of nitration (usually less than 100 proteins per proteome), contrasting with the large number of proteins modified by other post-translational modifications such as phosphorylation, acetylation, and, notably, S-nitrosation. Proteomic approaches have revealed key features of tyrosine nitration both in vivo and in vitro, including selectivity, site specificity, and effects in protein structure and function. Identification of 3-nitrotyrosine-containing proteins and mapping nitrated residues is challenging, due to low abundance of this oxidative modification in biological samples and its unfriendly behavior in mass spectrometry (MS)-based technologies, that is, MALDI, electrospray ionization, and collision-induced dissociation. The use of (i) classical two-dimensional electrophoresis with immunochemical detection of nitrated proteins followed by protein ID by regular MS/MS in combination with (ii) immuno-enrichment of tyrosine-nitrated peptides and (iii) identification of nitrated peptides by a MIDAS™ experiment is arising as a potent methodology to unambiguously map and quantitate tyrosine-nitrated proteins in vivo. Antioxid. Redox Signal. 26, 313-328.
Time-resolved Global and Chromatin Proteomics during Herpes Simplex Virus Type 1 (HSV-1) Infection.
Kulej, Katarzyna; Avgousti, Daphne C; Sidoli, Simone; Herrmann, Christin; Della Fera, Ashley N; Kim, Eui Tae; Garcia, Benjamin A; Weitzman, Matthew D
2017-04-01
Herpes simplex virus (HSV-1) lytic infection results in global changes to the host cell proteome and the proteins associated with host chromatin. We present a system level characterization of proteome dynamics during infection by performing a multi-dimensional analysis during HSV-1 lytic infection of human foreskin fibroblast (HFF) cells. Our study includes identification and quantification of the host and viral proteomes, phosphoproteomes, chromatin bound proteomes and post-translational modifications (PTMs) on cellular histones during infection. We analyzed proteomes across six time points of virus infection (0, 3, 6, 9, 12 and 15 h post-infection) and clustered trends in abundance using fuzzy c-means. Globally, we accurately quantified more than 4000 proteins, 200 differently modified histone peptides and 9000 phosphorylation sites on cellular proteins. In addition, we identified 67 viral proteins and quantified 571 phosphorylation events (465 with high confidence site localization) on viral proteins, which is currently the most comprehensive map of HSV-1 phosphoproteome. We investigated chromatin bound proteins by proteomic analysis of the high-salt chromatin fraction and identified 510 proteins that were significantly different in abundance during infection. We found 53 histone marks significantly regulated during virus infection, including a steady increase of histone H3 acetylation (H3K9ac and H3K14ac). Our data provide a resource of unprecedented depth for human and viral proteome dynamics during infection. Collectively, our results indicate that the proteome composition of the chromatin of HFF cells is highly affected during HSV-1 infection, and that phosphorylation events are abundant on viral proteins. We propose that our epi-proteomics approach will prove to be important in the characterization of other model infectious systems that involve changes to chromatin composition. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Neural Stem Cells (NSCs) and Proteomics.
Shoemaker, Lorelei D; Kornblum, Harley I
2016-02-01
Neural stem cells (NSCs) can self-renew and give rise to the major cell types of the CNS. Studies of NSCs include the investigation of primary, CNS-derived cells as well as animal and human embryonic stem cell (ESC)-derived and induced pluripotent stem cell (iPSC)-derived sources. NSCs provide a means with which to study normal neural development, neurodegeneration, and neurological disease and are clinically relevant sources for cellular repair to the damaged and diseased CNS. Proteomics studies of NSCs have the potential to delineate molecules and pathways critical for NSC biology and the means by which NSCs can participate in neural repair. In this review, we provide a background to NSC biology, including the means to obtain them and the caveats to these processes. We then focus on advances in the proteomic interrogation of NSCs. This includes the analysis of posttranslational modifications (PTMs); approaches to analyzing different proteomic compartments, such the secretome; as well as approaches to analyzing temporal differences in the proteome to elucidate mechanisms of differentiation. We also discuss some of the methods that will undoubtedly be useful in the investigation of NSCs but which have not yet been applied to the field. While many proteomics studies of NSCs have largely catalogued the proteome or posttranslational modifications of specific cellular states, without delving into specific functions, some have led to understandings of functional processes or identified markers that could not have been identified via other means. Many challenges remain in the field, including the precise identification and standardization of NSCs used for proteomic analyses, as well as how to translate fundamental proteomics studies to functional biology. The next level of investigation will require interdisciplinary approaches, combining the skills of those interested in the biochemistry of proteomics with those interested in modulating NSC function. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Emerging techniques for the discovery and validation of therapeutic targets for skeletal diseases.
Cho, Christine H; Nuttall, Mark E
2002-12-01
Advances in genomics and proteomics have revolutionised the drug discovery process and target validation. Identification of novel therapeutic targets for chronic skeletal diseases is an extremely challenging process based on the difficulty of obtaining high-quality human diseased versus normal tissue samples. The quality of tissue and genomic information obtained from the sample is critical to identifying disease-related genes. Using a genomics-based approach, novel genes or genes with similar homology to existing genes can be identified from cDNA libraries generated from normal versus diseased tissue. High-quality cDNA libraries are prepared from uncontaminated homogeneous cell populations harvested from tissue sections of interest. Localised gene expression analysis and confirmation are obtained through in situ hybridisation or immunohistochemical studies. Cells overexpressing the recombinant protein are subsequently designed for primary cell-based high-throughput assays that are capable of screening large compound banks for potential hits. Afterwards, secondary functional assays are used to test promising compounds. The same overexpressing cells are used in the secondary assay to test protein activity and functionality as well as screen for small-molecule agonists or antagonists. Once a hit is generated, a structure-activity relationship of the compound is optimised for better oral bioavailability and pharmacokinetics allowing the compound to progress into development. Parallel efforts from proteomics, as well as genetics/transgenics, bioinformatics and combinatorial chemistry, and improvements in high-throughput automation technologies, allow the drug discovery process to meet the demands of the medicinal market. This review discusses and illustrates how different approaches are incorporated into the discovery and validation of novel targets and, consequently, the development of potentially therapeutic agents in the areas of osteoporosis and osteoarthritis. While current treatments exist in the form of hormone replacement therapy, antiresorptive and anabolic agents for osteoporosis, there are no disease-modifying therapies for the treatment of the most common human joint disease, osteoarthritis. A massive market potential for improved options with better safety and efficacy still remains. Therefore, the application of genomics and proteomics for both diseases should provide much needed novel therapeutic approaches to treating these major world health problems.
Welkie, David; Zhang, Xiaohui; Markillie, Meng Lye; Taylor, Ronald; Orr, Galya; Jacobs, Jon; Bhide, Ketaki; Thimmapuram, Jyothi; Gritsenko, Marina; Mitchell, Hugh; Smith, Richard D; Sherman, Louis A
2014-12-29
Cyanothece sp. PCC 7822 is an excellent cyanobacterial model organism with great potential to be applied as a biocatalyst for the production of high value compounds. Like other unicellular diazotrophic cyanobacterial species, it has a tightly regulated metabolism synchronized to the light-dark cycle. Utilizing transcriptomic and proteomic methods, we quantified the relationships between transcription and translation underlying central and secondary metabolism in response to nitrogen free, 12 hour light and 12 hour dark conditions. By combining mass-spectrometry based proteomics and RNA-sequencing transcriptomics, we quantitatively measured a total of 6766 mRNAs and 1322 proteins at four time points across a 24 hour light-dark cycle. Photosynthesis, nitrogen fixation, and carbon storage relevant genes were expressed during the preceding light or dark period, concurrent with measured nitrogenase activity in the late light period. We describe many instances of disparity in peak mRNA and protein abundances, and strong correlation of light dependent expression of both antisense and CRISPR-related gene expression. The proteins for nitrogenase and the pentose phosphate pathway were highest in the dark, whereas those for glycolysis and the TCA cycle were more prominent in the light. Interestingly, one copy of the psbA gene encoding the photosystem II (PSII) reaction center protein D1 (psbA4) was highly upregulated only in the dark. This protein likely cannot catalyze O2 evolution and so may be used by the cell to keep PSII intact during N2 fixation. The CRISPR elements were found exclusively at the ends of the large plasmid and we speculate that their presence is crucial to the maintenance of this plasmid. This investigation of parallel transcriptional and translational activity within Cyanothece sp. PCC 7822 provided quantitative information on expression levels of metabolic pathways relevant to engineering efforts. The identification of expression patterns for both mRNA and protein affords a basis for improving biofuel production in this strain and for further genetic manipulations. Expression analysis of the genes encoded on the 6 plasmids provided insight into the possible acquisition and maintenance of some of these extra-chromosomal elements.
Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.
Keich, Uri; Kertesz-Farkas, Attila; Noble, William Stafford
2015-08-07
Interpreting the potentially vast number of hypotheses generated by a shotgun proteomics experiment requires a valid and accurate procedure for assigning statistical confidence estimates to identified tandem mass spectra. Despite the crucial role such procedures play in most high-throughput proteomics experiments, the scientific literature has not reached a consensus about the best confidence estimation methodology. In this work, we evaluate, using theoretical and empirical analysis, four previously proposed protocols for estimating the false discovery rate (FDR) associated with a set of identified tandem mass spectra: two variants of the target-decoy competition protocol (TDC) of Elias and Gygi and two variants of the separate target-decoy search protocol of Käll et al. Our analysis reveals significant biases in the two separate target-decoy search protocols. Moreover, the one TDC protocol that provides an unbiased FDR estimate among the target PSMs does so at the cost of forfeiting a random subset of high-scoring spectrum identifications. We therefore propose the mix-max procedure to provide unbiased, accurate FDR estimates in the presence of well-calibrated scores. The method avoids biases associated with the two separate target-decoy search protocols and also avoids the propensity for target-decoy competition to discard a random subset of high-scoring target identifications.
Peddinti, Divyaswetha; Nanduri, Bindu; Kaya, Abdullah; Feugang, Jean M; Burgess, Shane C; Memili, Erdogan
2008-02-22
Male infertility is a major problem for mammalian reproduction. However, molecular details including the underlying mechanisms of male fertility are still not known. A thorough understanding of these mechanisms is essential for obtaining consistently high reproductive efficiency and to ensure lower cost and time-loss by breeder. Using high and low fertility bull spermatozoa, here we employed differential detergent fractionation multidimensional protein identification technology (DDF-Mud PIT) and identified 125 putative biomarkers of fertility. We next used quantitative Systems Biology modeling and canonical protein interaction pathways and networks to show that high fertility spermatozoa differ from low fertility spermatozoa in four main ways. Compared to sperm from low fertility bulls, sperm from high fertility bulls have higher expression of proteins involved in: energy metabolism, cell communication, spermatogenesis, and cell motility. Our data also suggests a hypothesis that low fertility sperm DNA integrity may be compromised because cell cycle: G2/M DNA damage checkpoint regulation was most significant signaling pathway identified in low fertility spermatozoa. This is the first comprehensive description of the bovine spermatozoa proteome. Comparative proteomic analysis of high fertility and low fertility bulls, in the context of protein interaction networks identified putative molecular markers associated with high fertility phenotype.
Ura, Blendi; Scrimin, Federica; Arrigoni, Giorgio; Franchin, Cinzia; Monasta, Lorenzo; Ricci, Giuseppe
2016-04-09
Uterine leiomyoma is the most common benign smooth muscle cell tumor of the uterus. Proteomics is a powerful tool for the analysis of complex mixtures of proteins. In our study, we focused on proteins that were upregulated in the leiomyoma compared to the myometrium. Paired samples of eight leiomyomas and adjacent myometrium were obtained and submitted to two-dimensional gel electrophoresis (2-DE) and mass spectrometry for protein identification and to Western blotting for 2-DE data validation. The comparison between the patterns revealed 24 significantly upregulated (p < 0.05) protein spots, 12 of which were found to be associated with the metabolic processes of the leiomyoma and not with the normal myometrium. The overexpression of seven proteins involved in the metabolic processes of the leiomyoma was further validated by Western blotting and 2D Western blotting. Four of these proteins have never been associated with the leiomyoma before. The 2-DE approach coupled with mass spectrometry, which is among the methods of choice for comparative proteomic studies, identified a number of proteins overexpressed in the leiomyoma and involved in several biological processes, including metabolic processes. A better understanding of the mechanism underlying the overexpression of these proteins may be important for therapeutic purposes.
Ura, Blendi; Scrimin, Federica; Arrigoni, Giorgio; Franchin, Cinzia; Monasta, Lorenzo; Ricci, Giuseppe
2016-01-01
Uterine leiomyoma is the most common benign smooth muscle cell tumor of the uterus. Proteomics is a powerful tool for the analysis of complex mixtures of proteins. In our study, we focused on proteins that were upregulated in the leiomyoma compared to the myometrium. Paired samples of eight leiomyomas and adjacent myometrium were obtained and submitted to two-dimensional gel electrophoresis (2-DE) and mass spectrometry for protein identification and to Western blotting for 2-DE data validation. The comparison between the patterns revealed 24 significantly upregulated (p < 0.05) protein spots, 12 of which were found to be associated with the metabolic processes of the leiomyoma and not with the normal myometrium. The overexpression of seven proteins involved in the metabolic processes of the leiomyoma was further validated by Western blotting and 2D Western blotting. Four of these proteins have never been associated with the leiomyoma before. The 2-DE approach coupled with mass spectrometry, which is among the methods of choice for comparative proteomic studies, identified a number of proteins overexpressed in the leiomyoma and involved in several biological processes, including metabolic processes. A better understanding of the mechanism underlying the overexpression of these proteins may be important for therapeutic purposes. PMID:27070597
Proteomic Analysis of the Endosperm Ontogeny of Jatropha curcas L. Seeds.
Shah, Mohibullah; Soares, Emanoella L; Carvalho, Paulo C; Soares, Arlete A; Domont, Gilberto B; Nogueira, Fábio C S; Campos, Francisco A P
2015-06-05
Seeds of Jatropha curcas L. represent a potential source of raw material for the production of biodiesel. However, this use is hampered by the lack of basic information on the biosynthetic pathways associated with synthesis of toxic diterpenes, fatty acids, and triacylglycerols, as well as the pattern of deposition of storage proteins during seed development. In this study, we performed an in-depth proteome analysis of the endosperm isolated from five developmental stages which resulted in the identification of 1517, 1256, 1033, 752, and 307 proteins, respectively, summing up 1760 different proteins. Proteins with similar label free quantitation expression pattern were grouped into five clusters. The biological significance of these identifications is discussed with special focus on the analysis of seed storage proteins, proteins involved in the metabolism of fatty acids, carbohydrates, toxic components and proteolytic processing. Although several enzymes belonging to the biosynthesis of diterpenoid precursors were identified, we were unable to find any terpene synthase/cyclase, indicating that the synthesis of phorbol esters, the main toxic diterpenes, does not occur in seeds. The strategy used enabled us to provide a first in depth proteome analysis of the developing endosperm of this biodiesel plant, providing an important glimpse into the enzymatic machinery devoted to the production of C and N sources to sustain seed development.
Sanmartín, Esther; Arboleya, Juan Carlos; Iloro, Ibon; Escuredo, Kepa; Elortza, Felix; Moreno, F Javier
2012-09-15
Proteomic approaches have been used to identify the main proteins present in processing by-products generated by the canning tuna-industry, as well as in by-products derived from filleting of skeletal red muscle of fresh tuna. Following fractionation by using an ammonium sulphate precipitation method, three proteins (tropomyosin, haemoglobin and the stress-shock protein ubiquitin) were identified in the highly heterogeneous and heat-treated material discarded by the canning-industry. Additionally, this fractionation method was successful to obtain tropomyosin of high purity from the heterogeneous starting material. By-products from skeletal red muscle of fresh tuna were efficiently fractionated to sarcoplasmic and myofibrillar fractions, prior to the identification based mainly on the combined searching of the peptide mass fingerprint (MALDI-TOF) and peptide fragment fingerprinting (MALDI LIFT-TOF/TOF) spectra of fifteen bands separated by 1D SDS-PAGE. Thus, the sarcoplasmic fraction contained myoglobin and several enzymes that are essential for efficient energy production, whereas the myofibrillar fraction had important contractile proteins, such as actin, tropomyosin, myosin or an isoform of the enzyme creatine kinase. Application of proteomic technologies has revealed new knowledge on the composition of important by-products from tuna species, enabling a better evaluation of their potential applications. Copyright © 2012 Elsevier Ltd. All rights reserved.
Wedge, David C; Krishna, Ritesh; Blackhurst, Paul; Siepen, Jennifer A; Jones, Andrew R; Hubbard, Simon J
2011-04-01
Confident identification of peptides via tandem mass spectrometry underpins modern high-throughput proteomics. This has motivated considerable recent interest in the postprocessing of search engine results to increase confidence and calculate robust statistical measures, for example through the use of decoy databases to calculate false discovery rates (FDR). FDR-based analyses allow for multiple testing and can assign a single confidence value for both sets and individual peptide spectrum matches (PSMs). We recently developed an algorithm for combining the results from multiple search engines, integrating FDRs for sets of PSMs made by different search engine combinations. Here we describe a web-server and a downloadable application that makes this routinely available to the proteomics community. The web server offers a range of outputs including informative graphics to assess the confidence of the PSMs and any potential biases. The underlying pipeline also provides a basic protein inference step, integrating PSMs into protein ambiguity groups where peptides can be matched to more than one protein. Importantly, we have also implemented full support for the mzIdentML data standard, recently released by the Proteomics Standards Initiative, providing users with the ability to convert native formats to mzIdentML files, which are available to download.
Wedge, David C; Krishna, Ritesh; Blackhurst, Paul; Siepen, Jennifer A; Jones, Andrew R.; Hubbard, Simon J.
2013-01-01
Confident identification of peptides via tandem mass spectrometry underpins modern high-throughput proteomics. This has motivated considerable recent interest in the post-processing of search engine results to increase confidence and calculate robust statistical measures, for example through the use of decoy databases to calculate false discovery rates (FDR). FDR-based analyses allow for multiple testing and can assign a single confidence value for both sets and individual peptide spectrum matches (PSMs). We recently developed an algorithm for combining the results from multiple search engines, integrating FDRs for sets of PSMs made by different search engine combinations. Here we describe a web-server, and a downloadable application, which makes this routinely available to the proteomics community. The web server offers a range of outputs including informative graphics to assess the confidence of the PSMs and any potential biases. The underlying pipeline provides a basic protein inference step, integrating PSMs into protein ambiguity groups where peptides can be matched to more than one protein. Importantly, we have also implemented full support for the mzIdentML data standard, recently released by the Proteomics Standards Initiative, providing users with the ability to convert native formats to mzIdentML files, which are available to download. PMID:21222473
NASA Astrophysics Data System (ADS)
Pfammatter, Sibylle; Bonneil, Eric; McManus, Francis P.; Thibault, Pierre
2018-04-01
The small ubiquitin-like modifier (SUMO) is a member of the family of ubiquitin-like modifiers (UBLs) and is involved in important cellular processes, including DNA damage response, meiosis and cellular trafficking. The large-scale identification of SUMO peptides in a site-specific manner is challenging not only because of the low abundance and dynamic nature of this modification, but also due to the branched structure of the corresponding peptides that further complicate their identification using conventional search engines. Here, we exploited the unusual structure of SUMO peptides to facilitate their separation by high-field asymmetric waveform ion mobility spectrometry (FAIMS) and increase the coverage of SUMO proteome analysis. Upon trypsin digestion, branched peptides contain a SUMO remnant side chain and predominantly form triply protonated ions that facilitate their gas-phase separation using FAIMS. We evaluated the mobility characteristics of synthetic SUMO peptides and further demonstrated the application of FAIMS to profile the changes in protein SUMOylation of HEK293 cells following heat shock, a condition known to affect this modification. FAIMS typically provided a 10-fold improvement of detection limit of SUMO peptides, and enabled a 36% increase in SUMO proteome coverage compared to the same LC-MS/MS analyses performed without FAIMS. [Figure not available: see fulltext.
Fonseca, Cátia; Planchon, Sébastien; Serra, Tânia; Chander, Subhash; Saibo, Nelson J M; Renaut, Jenny; Oliveira, M Margarida; Batista, Rita
2015-01-01
Identification of differences between genetically modified plants and their original counterparts plays a central role in risk assessment strategy. Our main goal was to better understand the relevance of transgene presence, genetic, and epigenetic changes induced by transgene insertion, and in vitro culture in putative unintended differences between a transgenic and its comparator. Thus, we have used multiplex fluorescence 2DE coupled with MS to characterize the proteome of three different rice lines (Oryza sativa L. ssp. japonica cv. Nipponbare): a control conventional line (C), an Agrobacterium-transformed transgenic line (Ta) and a negative segregant (NSb). We observed that Ta and NSb appeared identical (with only one spot differentially abundant--fold difference ≥ 1.5), contrasting with the control (49 spots with fold difference ≥ 1.5, in both Ta and NSb vs. control). Given that in vitro culture was the only event in common between Ta and NSb, we hypothesize that in vitro culture stress was the most relevant condition contributing for the observed proteomic differences. MS protein identification support our hypothesis, indicating that Ta and NSb lines adjusted their metabolic pathways and altered the abundance of several stress related proteins in order to cope with in vitro culture. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
David, Matthieu; Fertin, Guillaume; Rogniaux, Hélène; Tessier, Dominique
2017-08-04
The analysis of discovery proteomics experiments relies on algorithms that identify peptides from their tandem mass spectra. The almost exhaustive interpretation of these spectra remains an unresolved issue. At present, an important number of missing interpretations is probably due to peptides displaying post-translational modifications and variants that yield spectra that are particularly difficult to interpret. However, the emergence of a new generation of mass spectrometers that provide high fragment ion accuracy has paved the way for more efficient algorithms. We present a new software, SpecOMS, that can handle the computational complexity of pairwise comparisons of spectra in the context of large volumes. SpecOMS can compare a whole set of experimental spectra generated by a discovery proteomics experiment to a whole set of theoretical spectra deduced from a protein database in a few minutes on a standard workstation. SpecOMS can ingeniously exploit those capabilities to improve the peptide identification process, allowing strong competition between all possible peptides for spectrum interpretation. Remarkably, this software resolves the drawbacks (i.e., efficiency problems and decreased sensitivity) that usually accompany open modification searches. We highlight this promising approach using results obtained from the analysis of a public human data set downloaded from the PRIDE (PRoteomics IDEntification) database.
Peddinti, Divyaswetha; Nanduri, Bindu; Kaya, Abdullah; Feugang, Jean M; Burgess, Shane C; Memili, Erdogan
2008-01-01
Background Male infertility is a major problem for mammalian reproduction. However, molecular details including the underlying mechanisms of male fertility are still not known. A thorough understanding of these mechanisms is essential for obtaining consistently high reproductive efficiency and to ensure lower cost and time-loss by breeder. Results Using high and low fertility bull spermatozoa, here we employed differential detergent fractionation multidimensional protein identification technology (DDF-Mud PIT) and identified 125 putative biomarkers of fertility. We next used quantitative Systems Biology modeling and canonical protein interaction pathways and networks to show that high fertility spermatozoa differ from low fertility spermatozoa in four main ways. Compared to sperm from low fertility bulls, sperm from high fertility bulls have higher expression of proteins involved in: energy metabolism, cell communication, spermatogenesis, and cell motility. Our data also suggests a hypothesis that low fertility sperm DNA integrity may be compromised because cell cycle: G2/M DNA damage checkpoint regulation was most significant signaling pathway identified in low fertility spermatozoa. Conclusion This is the first comprehensive description of the bovine spermatozoa proteome. Comparative proteomic analysis of high fertility and low fertility bulls, in the context of protein interaction networks identified putative molecular markers associated with high fertility phenotype. PMID:18294385
2013-01-01
Due to its compatibility and orthogonality to reversed phase (RP) liquid chromatography (LC) separation, ion exchange chromatography, and mainly strong cation exchange (SCX), has often been the first choice in multidimensional LC experiments in proteomics. Here, we have tested the ability of three strong anion exchanger (SAX) columns differing in their hydrophobicity to fractionate RAW264.7 macrophage cell lysate. IonPac AS24, a strong anion exchange material with ultralow hydrophobicity, demonstrated to be superior to other materials by fractionation and separation of tryptic peptides from both a mixture of 6 proteins as well as mouse cell lysate. The chromatography displayed very high orthogonality and high robustness depending on the hydrophilicity of column chemistry, which we termed hydrophilic strong anion exchange (hSAX). Mass spectrometry analysis of 34 SAX fractions from RAW264.7 macrophage cell lysate digest resulted in an identification of 9469 unique proteins and 126318 distinct peptides in one week of instrument time. Moreover, when compared to an optimized high pH/low pH RP separation approach, the method presented here raised the identification of proteins and peptides by 10 and 28%, respectively. This novel hSAX approach provides robust, reproducible, and highly orthogonal separation of complex protein digest samples for deep coverage proteome analysis. PMID:23294059
Zhang, Bo; Pirmoradian, Mohammad; Chernobrovkin, Alexey; Zubarev, Roman A.
2014-01-01
Based on conventional data-dependent acquisition strategy of shotgun proteomics, we present a new workflow DeMix, which significantly increases the efficiency of peptide identification for in-depth shotgun analysis of complex proteomes. Capitalizing on the high resolution and mass accuracy of Orbitrap-based tandem mass spectrometry, we developed a simple deconvolution method of “cloning” chimeric tandem spectra for cofragmented peptides. Additional to a database search, a simple rescoring scheme utilizes mass accuracy and converts the unwanted cofragmenting events into a surprising advantage of multiplexing. With the combination of cloning and rescoring, we obtained on average nine peptide-spectrum matches per second on a Q-Exactive workbench, whereas the actual MS/MS acquisition rate was close to seven spectra per second. This efficiency boost to 1.24 identified peptides per MS/MS spectrum enabled analysis of over 5000 human proteins in single-dimensional LC-MS/MS shotgun experiments with an only two-hour gradient. These findings suggest a change in the dominant “one MS/MS spectrum - one peptide” paradigm for data acquisition and analysis in shotgun data-dependent proteomics. DeMix also demonstrated higher robustness than conventional approaches in terms of lower variation among the results of consecutive LC-MS/MS runs. PMID:25100859
Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics
2016-01-01
Interpreting the potentially vast number of hypotheses generated by a shotgun proteomics experiment requires a valid and accurate procedure for assigning statistical confidence estimates to identified tandem mass spectra. Despite the crucial role such procedures play in most high-throughput proteomics experiments, the scientific literature has not reached a consensus about the best confidence estimation methodology. In this work, we evaluate, using theoretical and empirical analysis, four previously proposed protocols for estimating the false discovery rate (FDR) associated with a set of identified tandem mass spectra: two variants of the target-decoy competition protocol (TDC) of Elias and Gygi and two variants of the separate target-decoy search protocol of Käll et al. Our analysis reveals significant biases in the two separate target-decoy search protocols. Moreover, the one TDC protocol that provides an unbiased FDR estimate among the target PSMs does so at the cost of forfeiting a random subset of high-scoring spectrum identifications. We therefore propose the mix-max procedure to provide unbiased, accurate FDR estimates in the presence of well-calibrated scores. The method avoids biases associated with the two separate target-decoy search protocols and also avoids the propensity for target-decoy competition to discard a random subset of high-scoring target identifications. PMID:26152888
Tu, Chengjian; Sheng, Quanhu; Li, Jun; Ma, Danjun; Shen, Xiaomeng; Wang, Xue; Shyr, Yu; Yi, Zhengping; Qu, Jun
2015-11-06
The two key steps for analyzing proteomic data generated by high-resolution MS are database searching and postprocessing. While the two steps are interrelated, studies on their combinatory effects and the optimization of these procedures have not been adequately conducted. Here, we investigated the performance of three popular search engines (SEQUEST, Mascot, and MS Amanda) in conjunction with five filtering approaches, including respective score-based filtering, a group-based approach, local false discovery rate (LFDR), PeptideProphet, and Percolator. A total of eight data sets from various proteomes (e.g., E. coli, yeast, and human) produced by various instruments with high-accuracy survey scan (MS1) and high- or low-accuracy fragment ion scan (MS2) (LTQ-Orbitrap, Orbitrap-Velos, Orbitrap-Elite, Q-Exactive, Orbitrap-Fusion, and Q-TOF) were analyzed. It was found combinations involving Percolator achieved markedly more peptide and protein identifications at the same FDR level than the other 12 combinations for all data sets. Among these, combinations of SEQUEST-Percolator and MS Amanda-Percolator provided slightly better performances for data sets with low-accuracy MS2 (ion trap or IT) and high accuracy MS2 (Orbitrap or TOF), respectively, than did other methods. For approaches without Percolator, SEQUEST-group performs the best for data sets with MS2 produced by collision-induced dissociation (CID) and IT analysis; Mascot-LFDR gives more identifications for data sets generated by higher-energy collisional dissociation (HCD) and analyzed in Orbitrap (HCD-OT) and in Orbitrap Fusion (HCD-IT); MS Amanda-Group excels for the Q-TOF data set and the Orbitrap Velos HCD-OT data set. Therefore, if Percolator was not used, a specific combination should be applied for each type of data set. Moreover, a higher percentage of multiple-peptide proteins and lower variation of protein spectral counts were observed when analyzing technical replicates using Percolator-associated combinations; therefore, Percolator enhanced the reliability for both identification and quantification. The analyses were performed using the specific programs embedded in Proteome Discoverer, Scaffold, and an in-house algorithm (BuildSummary). These results provide valuable guidelines for the optimal interpretation of proteomic results and the development of fit-for-purpose protocols under different situations.
Lo, Andy; Weiner, Joel H; Li, Liang
2013-09-17
Due to limited sample amounts, instrument time considerations, and reagent costs, only a small number of replicate experiments are typically performed for quantitative proteome analyses. Generation of reproducible data that can be readily assessed for consistency within a small number of datasets is critical for accurate quantification. We report our investigation of a strategy using reciprocal isotope labeling of two comparative samples as a tool for determining proteome changes. Reciprocal labeling was evaluated to determine the internal consistency of quantified proteome changes from Escherichia coli grown under aerobic and anaerobic conditions. Qualitatively, the peptide overlap between replicate analyses of the same sample and reverse labeled samples were found to be within 8%. Quantitatively, reciprocal analyses showed only a slight increase in average overall inconsistency when compared with replicate analyses (1.29 vs. 1.24-fold difference). Most importantly, reverse labeling was successfully used to identify spurious values resulting from incorrect peptide identifications and poor peak fitting. After removal of 5% of the peptide data with low reproducibility, a total of 275 differentially expressed proteins (>1.50-fold difference) were consistently identified and were then subjected to bioinformatics analysis. General considerations and guidelines for reciprocal labeling experimental design and biological significance of obtained results are discussed. Copyright © 2013 Elsevier B.V. All rights reserved.
Kitata, Reta Birhanu; Dimayacyac-Esleta, Baby Rorielyn T.; Choong, Wai-Kok; Tsai, Chia-Feng; Lin, Tai-Du; Tsou, Chih-Chiang; Weng, Shao-Hsing; Chen, Yi-Ju; Yang, Pan-Chyr; Arco, Susan D.; Nesvizhskii, Alexey I.; Sung, Ting-Yi; Chen, Yu-Ju
2016-01-01
Despite significant efforts in the past decade towards complete mapping of the human proteome, 3564 proteins (neXtProt, 09-2014) are still “missing proteins”. Over one-third of these missing proteins are annotated as membrane proteins, owing to their relatively challenging accessibility with standard shotgun proteomics. Using non-small cell lung cancer (NSCLC) as a model study, we aim to mine missing proteins from disease-associated membrane proteome, which may be still largely under-represented. To increase identification coverage, we employed Hp-RP StageTip pre-fractionation of membrane-enriched samples from 11 NSCLC cell lines. Analysis of membrane samples from 20 pairs of tumor and adjacent normal lung tissue were incorporated to include physiologically expressed membrane proteins. Using multiple search engines (X!Tandem, Comet and Mascot) and stringent evaluation of FDR (MAYU and PeptideShaker), we identified 7702 proteins (66% membrane proteins) and 178 missing proteins (74 membrane proteins) with PSM-, peptide-, and protein-level FDR of 1%. Through multiple reaction monitoring (MRM) using synthetic peptides, we provided additional evidences for 8 missing proteins including 7 with transmembrane helix domains (TMH). This study demonstrates that mining missing proteins focused on cancer membrane sub-proteome can greatly contribute to map the whole human proteome. All data were deposited into ProteomeXchange with the identifier PXD002224. PMID:26202522
A Comprehensive Guide for Performing Sample Preparation and Top-Down Protein Analysis
Padula, Matthew P.; Berry, Iain J.; O′Rourke, Matthew B.; Raymond, Benjamin B.A.; Santos, Jerran; Djordjevic, Steven P.
2017-01-01
Methodologies for the global analysis of proteins in a sample, or proteome analysis, have been available since 1975 when Patrick O′Farrell published the first paper describing two-dimensional gel electrophoresis (2D-PAGE). This technique allowed the resolution of single protein isoforms, or proteoforms, into single ‘spots’ in a polyacrylamide gel, allowing the quantitation of changes in a proteoform′s abundance to ascertain changes in an organism′s phenotype when conditions change. In pursuit of the comprehensive profiling of the proteome, significant advances in technology have made the identification and quantitation of intact proteoforms from complex mixtures of proteins more routine, allowing analysis of the proteome from the ‘Top-Down’. However, the number of proteoforms detected by Top-Down methodologies such as 2D-PAGE or mass spectrometry has not significantly increased since O’Farrell’s paper when compared to Bottom-Up, peptide-centric techniques. This article explores and explains the numerous methodologies and technologies available to analyse the proteome from the Top-Down with a strong emphasis on the necessity to analyse intact proteoforms as a better indicator of changes in biology and phenotype. We arrive at the conclusion that the complete and comprehensive profiling of an organism′s proteome is still, at present, beyond our reach but the continuing evolution of protein fractionation techniques and mass spectrometry brings comprehensive Top-Down proteome profiling closer. PMID:28387712
A Comprehensive Guide for Performing Sample Preparation and Top-Down Protein Analysis.
Padula, Matthew P; Berry, Iain J; O Rourke, Matthew B; Raymond, Benjamin B A; Santos, Jerran; Djordjevic, Steven P
2017-04-07
Methodologies for the global analysis of proteins in a sample, or proteome analysis, have been available since 1975 when Patrick O'Farrell published the first paper describing two-dimensional gel electrophoresis (2D-PAGE). This technique allowed the resolution of single protein isoforms, or proteoforms, into single 'spots' in a polyacrylamide gel, allowing the quantitation of changes in a proteoform's abundance to ascertain changes in an organism's phenotype when conditions change. In pursuit of the comprehensive profiling of the proteome, significant advances in technology have made the identification and quantitation of intact proteoforms from complex mixtures of proteins more routine, allowing analysis of the proteome from the 'Top-Down'. However, the number of proteoforms detected by Top-Down methodologies such as 2D-PAGE or mass spectrometry has not significantly increased since O'Farrell's paper when compared to Bottom-Up, peptide-centric techniques. This article explores and explains the numerous methodologies and technologies available to analyse the proteome from the Top-Down with a strong emphasis on the necessity to analyse intact proteoforms as a better indicator of changes in biology and phenotype. We arrive at the conclusion that the complete and comprehensive profiling of an organism's proteome is still, at present, beyond our reach but the continuing evolution of protein fractionation techniques and mass spectrometry brings comprehensive Top-Down proteome profiling closer.
Runau, Franscois; Arshad, Ali; Isherwood, John; Norris, Leonie; Howells, Lynne; Metcalfe, Matthew; Dennison, Ashley
2015-06-01
Pancreatic cancer is a disease with a significantly poor prognosis. Despite modern advances in other medical, surgical, and oncologic therapy, the outcome from pancreatic cancer has improved little over the last 40 years. To improve the management of this difficult disease, trials investigating the use of dietary and parenteral fish oils rich in omega-3 (ω-3) fatty acids, exhibiting proven anti-inflammatory and anticarcinogenic properties, have revealed favorable results in pancreatic cancers. Proteomics is the large-scale study of proteins that attempts to characterize the complete set of proteins encoded by the genome of an organism and that, with the use of sensitive mass spectrometric-based techniques, has allowed high-throughput analysis of the proteome to aid identification of putative biomarkers pertinent to given disease states. These biomarkers provide useful insight into potentially discovering new markers for early detection or elucidating the efficacy of treatment on pancreatic cancers. Here, our review identifies potential proteomic-based biomarkers in pancreatic cancer relating to apoptosis, cell proliferation, angiogenesis, and metabolic regulation in clinical studies. We also reviewed proteomic biomarkers from the administration of ω-3 fatty acids that act on similar anticarcinogenic pathways as above and reflect that proteomic studies on the effect of ω-3 fatty acids in pancreatic cancer will yield favorable results. © 2015 American Society for Parenteral and Enteral Nutrition.
Confetti: A Multiprotease Map of the HeLa Proteome for Comprehensive Proteomics*
Guo, Xiaofeng; Trudgian, David C.; Lemoff, Andrew; Yadavalli, Sivaramakrishna; Mirzaei, Hamid
2014-01-01
Bottom-up proteomics largely relies on tryptic peptides for protein identification and quantification. Tryptic digestion often provides limited coverage of protein sequence because of issues such as peptide length, ionization efficiency, and post-translational modification colocalization. Unfortunately, a region of interest in a protein, for example, because of proximity to an active site or the presence of important post-translational modifications, may not be covered by tryptic peptides. Detection limits, quantification accuracy, and isoform differentiation can also be improved with greater sequence coverage. Selected reaction monitoring (SRM) would also greatly benefit from being able to identify additional targetable sequences. In an attempt to improve protein sequence coverage and to target regions of proteins that do not generate useful tryptic peptides, we deployed a multiprotease strategy on the HeLa proteome. First, we used seven commercially available enzymes in single, double, and triple enzyme combinations. A total of 48 digests were performed. 5223 proteins were detected by analyzing the unfractionated cell lysate digest directly; with 42% mean sequence coverage. Additional strong-anion exchange fractionation of the most complementary digests permitted identification of over 3000 more proteins, with improved mean sequence coverage. We then constructed a web application (https://proteomics.swmed.edu/confetti) that allows the community to examine a target protein or protein isoform in order to discover the enzyme or combination of enzymes that would yield peptides spanning a certain region of interest in the sequence. Finally, we examined the use of nontryptic digests for SRM. From our strong-anion exchange fractionation data, we were able to identify three or more proteotypic SRM candidates within a single digest for 6056 genes. Surprisingly, in 25% of these cases the digest producing the most observable proteotypic peptides was neither trypsin nor Lys-C. SRM analysis of Asp-N versus tryptic peptides for eight proteins determined that Asp-N yielded higher signal in five of eight cases. PMID:24696503
Kültz, Dietmar; Li, Johnathon; Gardell, Alison; Sacchi, Romina
2013-01-01
A two-tiered label-free quantitative (LFQ) proteomics workflow was used to elucidate how salinity affects the molecular phenotype, i.e. proteome, of gills from a cichlid fish, the euryhaline tilapia (Oreochromis mossambicus). The workflow consists of initial global profiling of relative tryptic peptide abundances in treated versus control samples followed by targeted identification (by MS/MS) and quantitation (by chromatographic peak area integration) of validated peptides for each protein of interest. Fresh water acclimated tilapia were independently exposed in separate experiments to acute short-term (34 ppt) and gradual long-term (70 ppt, 90 ppt) salinity stress followed by molecular phenotyping of the gill proteome. The severity of salinity stress can be deduced with high technical reproducibility from the initial global label-free quantitative profiling step alone at both peptide and protein levels. However, an accurate regulation ratio can only be determined by targeted label-free quantitative profiling because not all peptides used for protein identification are also valid for quantitation. Of the three salinity challenges, gradual acclimation to 90 ppt has the most pronounced effect on gill molecular phenotype. Known salinity effects on tilapia gills, including an increase in the size and number of mitochondria-rich ionocytes, activities of specific ion transporters, and induction of specific molecular chaperones are reflected in the regulation of abundances of the corresponding proteins. Moreover, specific protein isoforms that are responsive to environmental salinity change are resolved and it is revealed that salinity effects on the mitochondrial proteome are nonuniform. Furthermore, protein NDRG1 has been identified as a novel key component of molecular phenotype restructuring during salinity-induced gill remodeling. In conclusion, besides confirming known effects of salinity on gills of euryhaline fish, molecular phenotyping reveals novel insight into proteome changes that underlie the remodeling of tilapia gill epithelium in response to environmental salinity change. PMID:24065692
Morphinome Database - The database of proteins altered by morphine administration - An update.
Bodzon-Kulakowska, Anna; Padrtova, Tereza; Drabik, Anna; Ner-Kluza, Joanna; Antolak, Anna; Kulakowski, Konrad; Suder, Piotr
2018-04-13
Morphine is considered a gold standard in pain treatment. Nevertheless, its use could be associated with severe side effects, including drug addiction. Thus, it is very important to understand the molecular mechanism of morphine action in order to develop new methods of pain therapy, or at least to attenuate the side effects of opioids usage. Proteomics allows for the indication of proteins involved in certain biological processes, but the number of items identified in a single study is usually overwhelming. Thus, researchers face the difficult problem of choosing the proteins which are really important for the investigated processes and worth further studies. Therefore, based on the 29 published articles, we created a database of proteins regulated by morphine administration - The Morphinome Database (addiction-proteomics.org). This web tool allows for indicating proteins that were identified during different proteomics studies. Moreover, the collection and organization of such a vast amount of data allows us to find the same proteins that were identified in various studies and to create their ranking, based on the frequency of their identification. STRING and KEGG databases indicated metabolic pathways which those molecules are involved in. This means that those molecular pathways seem to be strongly affected by morphine administration and could be important targets for further investigations. The data about proteins identified by different proteomics studies of molecular changes caused by morphine administration (29 published articles) were gathered in the Morphinome Database. Unification of those data allowed for the identification of proteins that were indicated several times by distinct proteomics studies, which means that they seem to be very well verified and important for the entire process. Those proteins might be now considered promising aims for more detailed studies of their role in the molecular mechanism of morphine action. Copyright © 2018. Published by Elsevier B.V.
Quantitative Proteomics Identifies Activation of Hallmark Pathways of Cancer in Patient Melanoma.
Byrum, Stephanie D; Larson, Signe K; Avaritt, Nathan L; Moreland, Linley E; Mackintosh, Samuel G; Cheung, Wang L; Tackett, Alan J
2013-03-01
Molecular pathways regulating melanoma initiation and progression are potential targets of therapeutic development for this aggressive cancer. Identification and molecular analysis of these pathways in patients has been primarily restricted to targeted studies on individual proteins. Here, we report the most comprehensive analysis of formalin-fixed paraffin-embedded human melanoma tissues using quantitative proteomics. From 61 patient samples, we identified 171 proteins varying in abundance among benign nevi, primary melanoma, and metastatic melanoma. Seventy-three percent of these proteins were validated by immunohistochemistry staining of malignant melanoma tissues from the Human Protein Atlas database. Our results reveal that molecular pathways involved with tumor cell proliferation, motility, and apoptosis are mis-regulated in melanoma. These data provide the most comprehensive proteome resource on patient melanoma and reveal insight into the molecular mechanisms driving melanoma progression.
Hanson, Andrea J; Paszczynski, Andrzej J; Coats, Erik R
2016-03-01
The production of polyhydroxyalkanoates (PHA; bioplastics) from waste or surplus feedstocks using mixed microbial consortia (MMC) and aerobic dynamic feeding (ADF) is a growing field within mixed culture biotechnology. This study aimed to optimize a 2DE workflow to investigate the proteome dynamics of an MMC synthesizing PHA from fermented dairy manure. To mitigate the challenges posed to effective 2DE by this complex sample matrix, the bacterial biomass was purified using Accudenz gradient centrifugation (AGC) before protein extraction. The optimized 2DE method yielded high-quality gels suitable for quantitative comparative analysis and subsequent protein identification by LC-MS/MS. The optimized 2DE method could be adapted to other proteomic investigations involving MMC in complex organic or environmental matrices. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
High-field asymmetric waveform ion mobility spectrometry for mass spectrometry-based proteomics.
Swearingen, Kristian E; Moritz, Robert L
2012-10-01
High-field asymmetric waveform ion mobility spectrometry (FAIMS) is an atmospheric pressure ion mobility technique that separates gas-phase ions by their behavior in strong and weak electric fields. FAIMS is easily interfaced with electrospray ionization and has been implemented as an additional separation mode between liquid chromatography (LC) and mass spectrometry (MS) in proteomic studies. FAIMS separation is orthogonal to both LC and MS and is used as a means of on-line fractionation to improve the detection of peptides in complex samples. FAIMS improves dynamic range and concomitantly the detection limits of ions by filtering out chemical noise. FAIMS can also be used to remove interfering ion species and to select peptide charge states optimal for identification by tandem MS. Here, the authors review recent developments in LC-FAIMS-MS and its application to MS-based proteomics.
NASA Astrophysics Data System (ADS)
Jabbour, Rabih E.; Wade, Mary; Deshpande, Samir V.; McCubbin, Patrick; Snyder, A. Peter; Bevilacqua, Vicky
2012-06-01
Mass spectrometry based proteomic approaches are showing promising capabilities in addressing various biological and biochemical issues. Outer membrane proteins (OMPs) are often associated with virulence in gram-negative pathogens and could prove to be excellent model biomarkers for strain level differentiation among bacteria. Whole cells and OMP extracts were isolated from pathogenic and non-pathogenic strains of Francisella tularensis, Burkholderia thailandensis, and Burkholderia mallei. OMP extracts were compared for their ability to differentiate and delineate the correct database organism to an experimental sample and for the degree of dissimilarity to the nearest-neighbor database strains. This study addresses the comparative experimental proteome analyses of OMPs vs. whole cell lysates on the strain-level discrimination among gram negative pathogenic and non-pathogenic strains.
Reddy, Panga Jaipal; Sinha, Sneha; Ray, Sandipan; Sathe, Gajanan J.; Chatterjee, Aditi; Prasad, T. S. Keshava; Dhali, Snigdha; Srikanth, Rapole; Panda, Dulal; Srivastava, Sanjeeva
2015-01-01
Curcumin is a natural dietary compound with antimicrobial activity against various gram positive and negative bacteria. This study aims to investigate the proteome level alterations in Bacillus subtilis due to curcumin treatment and identification of its molecular/cellular targets to understand the mechanism of action. We have performed a comprehensive proteomic analysis of B. subtilis AH75 strain at different time intervals of curcumin treatment (20, 60 and 120 min after the drug exposure, three replicates) to compare the protein expression profiles using two complementary quantitative proteomic techniques, 2D-DIGE and iTRAQ. To the best of our knowledge, this is the first comprehensive longitudinal investigation describing the effect of curcumin treatment on B. subtilis proteome. The proteomics analysis revealed several interesting targets such UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1, putative septation protein SpoVG and ATP-dependent Clp protease proteolytic subunit. Further, in silico pathway analysis using DAVID and KOBAS has revealed modulation of pathways related to the fatty acid metabolism and cell wall synthesis, which are crucial for cell viability. Our findings revealed that curcumin treatment lead to inhibition of the cell wall and fatty acid synthesis in addition to differential expression of many crucial proteins involved in modulation of bacterial metabolism. Findings obtained from proteomics analysis were further validated using 5-cyano-2,3-ditolyl tetrazolium chloride (CTC) assay for respiratory activity, resazurin assay for metabolic activity and membrane integrity assay by potassium and inorganic phosphate leakage measurement. The gene expression analysis of selected cell wall biosynthesis enzymes has strengthened the proteomics findings and indicated the major effect of curcumin on cell division. PMID:25874956
The Escherichia coli Peripheral Inner Membrane Proteome*
Papanastasiou, Malvina; Orfanoudaki, Georgia; Koukaki, Marina; Kountourakis, Nikos; Sardis, Marios Frantzeskos; Aivaliotis, Michalis; Karamanou, Spyridoula; Economou, Anastassios
2013-01-01
Biological membranes are essential for cell viability. Their functional characteristics strongly depend on their protein content, which consists of transmembrane (integral) and peripherally associated membrane proteins. Both integral and peripheral inner membrane proteins mediate a plethora of biological processes. Whereas transmembrane proteins have characteristic hydrophobic stretches and can be predicted using bioinformatics approaches, peripheral inner membrane proteins are hydrophilic, exist in equilibria with soluble pools, and carry no discernible membrane targeting signals. We experimentally determined the cytoplasmic peripheral inner membrane proteome of the model organism Escherichia coli using a multidisciplinary approach. Initially, we extensively re-annotated the theoretical proteome regarding subcellular localization using literature searches, manual curation, and multi-combinatorial bioinformatics searches of the available databases. Next we used sequential biochemical fractionations coupled to direct identification of individual proteins and protein complexes using high resolution mass spectrometry. We determined that the proposed cytoplasmic peripheral inner membrane proteome occupies a previously unsuspected ∼19% of the basic E. coli BL21(DE3) proteome, and the detected peripheral inner membrane proteome occupies ∼25% of the estimated expressed proteome of this cell grown in LB medium to mid-log phase. This value might increase when fleeting interactions, not studied here, are taken into account. Several proteins previously regarded as exclusively cytoplasmic bind membranes avidly. Many of these proteins are organized in functional or/and structural oligomeric complexes that bind to the membrane with multiple interactions. Identified proteins cover the full spectrum of biological activities, and more than half of them are essential. Our data suggest that the cytoplasmic proteome displays remarkably dynamic and extensive communication with biological membrane surfaces that we are only beginning to decipher. PMID:23230279
Reddy, Panga Jaipal; Sinha, Sneha; Ray, Sandipan; Sathe, Gajanan J; Chatterjee, Aditi; Prasad, T S Keshava; Dhali, Snigdha; Srikanth, Rapole; Panda, Dulal; Srivastava, Sanjeeva
2015-01-01
Curcumin is a natural dietary compound with antimicrobial activity against various gram positive and negative bacteria. This study aims to investigate the proteome level alterations in Bacillus subtilis due to curcumin treatment and identification of its molecular/cellular targets to understand the mechanism of action. We have performed a comprehensive proteomic analysis of B. subtilis AH75 strain at different time intervals of curcumin treatment (20, 60 and 120 min after the drug exposure, three replicates) to compare the protein expression profiles using two complementary quantitative proteomic techniques, 2D-DIGE and iTRAQ. To the best of our knowledge, this is the first comprehensive longitudinal investigation describing the effect of curcumin treatment on B. subtilis proteome. The proteomics analysis revealed several interesting targets such UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1, putative septation protein SpoVG and ATP-dependent Clp protease proteolytic subunit. Further, in silico pathway analysis using DAVID and KOBAS has revealed modulation of pathways related to the fatty acid metabolism and cell wall synthesis, which are crucial for cell viability. Our findings revealed that curcumin treatment lead to inhibition of the cell wall and fatty acid synthesis in addition to differential expression of many crucial proteins involved in modulation of bacterial metabolism. Findings obtained from proteomics analysis were further validated using 5-cyano-2,3-ditolyl tetrazolium chloride (CTC) assay for respiratory activity, resazurin assay for metabolic activity and membrane integrity assay by potassium and inorganic phosphate leakage measurement. The gene expression analysis of selected cell wall biosynthesis enzymes has strengthened the proteomics findings and indicated the major effect of curcumin on cell division.
Stable isotope dimethyl labelling for quantitative proteomics and beyond
Hsu, Jue-Liang; Chen, Shu-Hui
2016-01-01
Stable-isotope reductive dimethylation, a cost-effective, simple, robust, reliable and easy-to- multiplex labelling method, is widely applied to quantitative proteomics using liquid chromatography-mass spectrometry. This review focuses on biological applications of stable-isotope dimethyl labelling for a large-scale comparative analysis of protein expression and post-translational modifications based on its unique properties of the labelling chemistry. Some other applications of the labelling method for sample preparation and mass spectrometry-based protein identification and characterization are also summarized. This article is part of the themed issue ‘Quantitative mass spectrometry’. PMID:27644970
Accurate proteome-wide protein quantification from high-resolution 15N mass spectra
2011-01-01
In quantitative mass spectrometry-based proteomics, the metabolic incorporation of a single source of 15N-labeled nitrogen has many advantages over using stable isotope-labeled amino acids. However, the lack of a robust computational framework for analyzing the resulting spectra has impeded wide use of this approach. We have addressed this challenge by introducing a new computational methodology for analyzing 15N spectra in which quantification is integrated with identification. Application of this method to an Escherichia coli growth transition reveals significant improvement in quantification accuracy over previous methods. PMID:22182234
PROTEOMIC ANALYSIS OF ALLERGENS FROM METARHIZIUM ANISOPLIAE
Introduction
The goal of this project is the identification and characterization of allergens from the fungus Metarhizium anisopliae, using mass spectrometry (MS). The US EPA, under the "Children at Risk" program, is currently addressing the problem of indoor fungal bioaer...
PROTEOMIC ANALYSIS OF ALLERGENS FROM METARHIZIUM ANISOPLIEA
The goal of this project is the identification and characterization of allergens from the fungus M. Anisopliae, using mass spectrometry (MS). The US EPA, under the "Children at Risk" program, is currently addressing the problem of indoor fungal bioaerosol contamination. One of ...
Rapid Detection & Identification of Bacillus Species using MALDI-TOF/TOF and Biomarker Database
2006-06-01
rRNA sequence analysis. Multilocus enzyme electrophoresis ( MEE ) and comparative DNA sequence analysis suggest that they may represent a single species...adaptation of the MEE method [63] but with greater discrimination [64]. All of these new PCR-based subtyping methods are certainly superior and more...Demirev, P.A., Lin, J.S., Pineda , F.J., and Fenselau, C. (2001). Bioinformatics and mass spectrometry for microorganism identification: proteome-wide
The Human Skeletal Muscle Proteome Project: a reappraisal of the current literature
Gonzalez‐Freire, Marta; Semba, Richard D.; Ubaida‐Mohien, Ceereena; Fabbri, Elisa; Scalzo, Paul; Højlund, Kurt; Dufresne, Craig; Lyashkov, Alexey
2016-01-01
Abstract Skeletal muscle is a large organ that accounts for up to half the total mass of the human body. A progressive decline in muscle mass and strength occurs with ageing and in some individuals configures the syndrome of ‘sarcopenia’, a condition that impairs mobility, challenges autonomy, and is a risk factor for mortality. The mechanisms leading to sarcopenia as well as myopathies are still little understood. The Human Skeletal Muscle Proteome Project was initiated with the aim to characterize muscle proteins and how they change with ageing and disease. We conducted an extensive review of the literature and analysed publically available protein databases. A systematic search of peer‐reviewed studies was performed using PubMed. Search terms included ‘human’, ‘skeletal muscle’, ‘proteome’, ‘proteomic(s)’, and ‘mass spectrometry’, ‘liquid chromatography‐mass spectrometry (LC‐MS/MS)’. A catalogue of 5431 non‐redundant muscle proteins identified by mass spectrometry‐based proteomics from 38 peer‐reviewed scientific publications from 2002 to November 2015 was created. We also developed a nosology system for the classification of muscle proteins based on localization and function. Such inventory of proteins should serve as a useful background reference for future research on changes in muscle proteome assessed by quantitative mass spectrometry‐based proteomic approaches that occur with ageing and diseases. This classification and compilation of the human skeletal muscle proteome can be used for the identification and quantification of proteins in skeletal muscle to discover new mechanisms for sarcopenia and specific muscle diseases that can be targeted for the prevention and treatment. PMID:27897395
2012-01-01
Background Accurate diagnostic and monitoring tools for ulcerative colitis (UC) are missing. Our aim was to describe the proteomic profile of UC and search for markers associated with disease exacerbation. Therefore, we aimed to characterize specific proteins associated with inflamed colon mucosa from patients with acute UC using mass spectrometry-based proteomic analysis. Methods Biopsies were sampled from rectum, sigmoid colon and left colonic flexure from twenty patients with active proctosigmoiditis and from four healthy controls for proteomics and histology. Proteomic profiles of whole colonic biopsies were characterized using 2D-gel electrophoresis, and peptide mass fingerprinting using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) was applied for identification of differently expressed protein spots. Results A total of 597 spots were annotated by image analysis and 222 of these had a statistically different protein level between inflamed and non-inflamed tissue in the patient group. Principal component analysis clearly grouped non-inflamed samples separately from the inflamed samples indicating that the proteomic signature of colon mucosa with acute UC is strong. Totally, 43 individual protein spots were identified, including proteins involved in energy metabolism (triosephosphate isomerase, glycerol-3-phosphate-dehydrogenase, alpha enolase and L-lactate dehydrogenase B-chain) and in oxidative stress (superoxide dismutase, thioredoxins and selenium binding protein). Conclusions A distinct proteomic profile of inflamed tissue in UC patients was found. Specific proteins involved in energy metabolism and oxidative stress were identified as potential candidate markers for UC. PMID:22726388
Practical and Efficient Searching in Proteomics: A Cross Engine Comparison
Paulo, Joao A.
2014-01-01
Background Analysis of large datasets produced by mass spectrometry-based proteomics relies on database search algorithms to sequence peptides and identify proteins. Several such scoring methods are available, each based on different statistical foundations and thereby not producing identical results. Here, the aim is to compare peptide and protein identifications using multiple search engines and examine the additional proteins gained by increasing the number of technical replicate analyses. Methods A HeLa whole cell lysate was analyzed on an Orbitrap mass spectrometer for 10 technical replicates. The data were combined and searched using Mascot, SEQUEST, and Andromeda. Comparisons were made of peptide and protein identifications among the search engines. In addition, searches using each engine were performed with incrementing number of technical replicates. Results The number and identity of peptides and proteins differed across search engines. For all three search engines, the differences in proteins identifications were greater than the differences in peptide identifications indicating that the major source of the disparity may be at the protein inference grouping level. The data also revealed that analysis of 2 technical replicates can increase protein identifications by up to 10-15%, while a third replicate results in an additional 4-5%. Conclusions The data emphasize two practical methods of increasing the robustness of mass spectrometry data analysis. The data show that 1) using multiple search engines can expand the number of identified proteins (union) and validate protein identifications (intersection), and 2) analysis of 2 or 3 technical replicates can substantially expand protein identifications. Moreover, information can be extracted from a dataset by performing database searching with different engines and performing technical repeats, which requires no additional sample preparation and effectively utilizes research time and effort. PMID:25346847
Khatun, Jainab; Hamlett, Eric; Giddings, Morgan C
2008-03-01
The identification of peptides by tandem mass spectrometry (MS/MS) is a central method of proteomics research, but due to the complexity of MS/MS data and the large databases searched, the accuracy of peptide identification algorithms remains limited. To improve the accuracy of identification we applied a machine-learning approach using a hidden Markov model (HMM) to capture the complex and often subtle links between a peptide sequence and its MS/MS spectrum. Our model, HMM_Score, represents ion types as HMM states and calculates the maximum joint probability for a peptide/spectrum pair using emission probabilities from three factors: the amino acids adjacent to each fragmentation site, the mass dependence of ion types and the intensity dependence of ion types. The Viterbi algorithm is used to calculate the most probable assignment between ion types in a spectrum and a peptide sequence, then a correction factor is added to account for the propensity of the model to favor longer peptides. An expectation value is calculated based on the model score to assess the significance of each peptide/spectrum match. We trained and tested HMM_Score on three data sets generated by two different mass spectrometer types. For a reference data set recently reported in the literature and validated using seven identification algorithms, HMM_Score produced 43% more positive identification results at a 1% false positive rate than the best of two other commonly used algorithms, Mascot and X!Tandem. HMM_Score is a highly accurate platform for peptide identification that works well for a variety of mass spectrometer and biological sample types. The program is freely available on ProteomeCommons via an OpenSource license. See http://bioinfo.unc.edu/downloads/ for the download link.
Practical and Efficient Searching in Proteomics: A Cross Engine Comparison.
Paulo, Joao A
2013-10-01
Analysis of large datasets produced by mass spectrometry-based proteomics relies on database search algorithms to sequence peptides and identify proteins. Several such scoring methods are available, each based on different statistical foundations and thereby not producing identical results. Here, the aim is to compare peptide and protein identifications using multiple search engines and examine the additional proteins gained by increasing the number of technical replicate analyses. A HeLa whole cell lysate was analyzed on an Orbitrap mass spectrometer for 10 technical replicates. The data were combined and searched using Mascot, SEQUEST, and Andromeda. Comparisons were made of peptide and protein identifications among the search engines. In addition, searches using each engine were performed with incrementing number of technical replicates. The number and identity of peptides and proteins differed across search engines. For all three search engines, the differences in proteins identifications were greater than the differences in peptide identifications indicating that the major source of the disparity may be at the protein inference grouping level. The data also revealed that analysis of 2 technical replicates can increase protein identifications by up to 10-15%, while a third replicate results in an additional 4-5%. The data emphasize two practical methods of increasing the robustness of mass spectrometry data analysis. The data show that 1) using multiple search engines can expand the number of identified proteins (union) and validate protein identifications (intersection), and 2) analysis of 2 or 3 technical replicates can substantially expand protein identifications. Moreover, information can be extracted from a dataset by performing database searching with different engines and performing technical repeats, which requires no additional sample preparation and effectively utilizes research time and effort.