Halligan, Brian D.; Geiger, Joey F.; Vallejos, Andrew K.; Greene, Andrew S.; Twigger, Simon N.
2009-01-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step by step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center website (http://proteomics.mcw.edu/vipdac). PMID:19358578
Halligan, Brian D; Geiger, Joey F; Vallejos, Andrew K; Greene, Andrew S; Twigger, Simon N
2009-06-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step-by-step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center Web site ( http://proteomics.mcw.edu/vipdac ).
Beyond the proteome: Mass Spectrometry Special Interest Group (MS-SIG) at ISMB/ECCB 2013
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ryu, Soyoung; Payne, Samuel H.; Schaab, Christoph
2014-07-02
Mass spectrometry special interest group (MS-SIG) aims to bring together experts from the global research community to discuss highlights and challenges in the field of mass spectrometry (MS)-based proteomics and computational biology. The rapid echnological developments in MS-based proteomics have enabled the generation of a large amount of meaningful information on hundreds to thousands of proteins simultaneously from a biological sample; however, the complexity of the MS data require sophisticated computational algorithms and software for data analysis and interpretation. This year’s MS-SIG meeting theme was ‘Beyond the Proteome’ with major focuses on improving protein identification/quantification and using proteomics data tomore » solve interesting problems in systems biology and clinical research.« less
Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline*
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W.; Moritz, Robert L.
2015-01-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. PMID:25418363
Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W; Moritz, Robert L
2015-02-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O
2015-08-25
Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; Ebrahim, Ali; Saunders, Michael A.; Palsson, Bernhard O.
2016-01-01
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models. PMID:27857205
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; ...
2016-11-18
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
This research project combines the use of whole organism endpoints, genomic, proteomic and metabolomic approaches, and computational modeling in a systems biology approach to 1) identify molecular indicators of exposure and biomarkers of effect to EDCs representing several modes/...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
Current algorithmic solutions for peptide-based proteomics data generation and identification.
Hoopmann, Michael R; Moritz, Robert L
2013-02-01
Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics. Copyright © 2012 Elsevier Ltd. All rights reserved.
Proteomics and Systems Biology: Current and Future Applications in the Nutritional Sciences1
Moore, J. Bernadette; Weeks, Mark E.
2011-01-01
In the last decade, advances in genomics, proteomics, and metabolomics have yielded large-scale datasets that have driven an interest in global analyses, with the objective of understanding biological systems as a whole. Systems biology integrates computational modeling and experimental biology to predict and characterize the dynamic properties of biological systems, which are viewed as complex signaling networks. Whereas the systems analysis of disease-perturbed networks holds promise for identification of drug targets for therapy, equally the identified critical network nodes may be targeted through nutritional intervention in either a preventative or therapeutic fashion. As such, in the context of the nutritional sciences, it is envisioned that systems analysis of normal and nutrient-perturbed signaling networks in combination with knowledge of underlying genetic polymorphisms will lead to a future in which the health of individuals will be improved through predictive and preventative nutrition. Although high-throughput transcriptomic microarray data were initially most readily available and amenable to systems analysis, recent technological and methodological advances in MS have contributed to a linear increase in proteomic investigations. It is now commonplace for combined proteomic technologies to generate complex, multi-faceted datasets, and these will be the keystone of future systems biology research. This review will define systems biology, outline current proteomic methodologies, highlight successful applications of proteomics in nutrition research, and discuss the challenges for future applications of systems biology approaches in the nutritional sciences. PMID:22332076
Single-molecule protein sequencing through fingerprinting: computational assessment
NASA Astrophysics Data System (ADS)
Yao, Yao; Docter, Margreet; van Ginkel, Jetty; de Ridder, Dick; Joo, Chirlmin
2015-10-01
Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences.
MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data
Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko
2007-01-01
Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation, 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents significant advance and could be of great interest to the proteomics community. PMID:17567892
Computational clustering for viral reference proteomes
Chen, Chuming; Huang, Hongzhan; Mazumder, Raja; Natale, Darren A.; McGarvey, Peter B.; Zhang, Jian; Polson, Shawn W.; Wang, Yuqi; Wu, Cathy H.
2016-01-01
Motivation: The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies. Results: We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt’s curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs. Availability and implementation: http://proteininformationresource.org/rps/viruses/ Contact: chenc@udel.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153712
Mass spectrometry-based proteomics: basic principles and emerging technologies and directions.
Van Riper, Susan K; de Jong, Ebbing P; Carlis, John V; Griffin, Timothy J
2013-01-01
As the main catalytic and structural molecules within living systems, proteins are the most likely biomolecules to be affected by radiation exposure. Proteomics, the comprehensive characterization of proteins within complex biological samples, is therefore a research approach ideally suited to assess the effects of radiation exposure on cells and tissues. For comprehensive characterization of proteomes, an analytical platform capable of quantifying protein abundance, identifying post-translation modifications and revealing members of protein complexes on a system-wide level is necessary. Mass spectrometry (MS), coupled with technologies for sample fractionation and automated data analysis, provides such a versatile and powerful platform. In this chapter we offer a view on the current state of MS-proteomics, and focus on emerging technologies within three areas: (1) New instrumental methods; (2) New computational methods for peptide identification; and (3) Label-free quantification. These emerging technologies should be valuable for researchers seeking to better understand biological effects of radiation on living systems.
Deutsch, Eric W.; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L.
2015-01-01
Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include mass spectrometry to define protein sequence, protein:protein interactions, and protein post-translational modifications. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative mass spectrometry proteomics. It supports all major operating systems and instrument vendors via open data formats. Here we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of tandem mass spectrometry datasets, as well as some major upcoming features. PMID:25631240
Deutsch, Eric W; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L
2015-08-01
Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include MS to define protein sequence, protein:protein interactions, and protein PTMs. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative MS proteomics. It supports all major operating systems and instrument vendors via open data formats. Here, we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of MS/MS datasets, as well as some major upcoming features. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A community proposal to integrate proteomics activities in ELIXIR.
Vizcaíno, Juan Antonio; Walzer, Mathias; Jiménez, Rafael C; Bittremieux, Wout; Bouyssié, David; Carapito, Christine; Corrales, Fernando; Ferro, Myriam; Heck, Albert J R; Horvatovich, Peter; Hubalek, Martin; Lane, Lydie; Laukens, Kris; Levander, Fredrik; Lisacek, Frederique; Novak, Petr; Palmblad, Magnus; Piovesan, Damiano; Pühler, Alfred; Schwämmle, Veit; Valkenborg, Dirk; van Rijswijk, Merlijn; Vondrasek, Jiri; Eisenacher, Martin; Martens, Lennart; Kohlbacher, Oliver
2017-01-01
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on 'The Future of Proteomics in ELIXIR' that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR's existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper.
A community proposal to integrate proteomics activities in ELIXIR
Vizcaíno, Juan Antonio; Walzer, Mathias; Jiménez, Rafael C.; Bittremieux, Wout; Bouyssié, David; Carapito, Christine; Corrales, Fernando; Ferro, Myriam; Heck, Albert J.R.; Horvatovich, Peter; Hubalek, Martin; Lane, Lydie; Laukens, Kris; Levander, Fredrik; Lisacek, Frederique; Novak, Petr; Palmblad, Magnus; Piovesan, Damiano; Pühler, Alfred; Schwämmle, Veit; Valkenborg, Dirk; van Rijswijk, Merlijn; Vondrasek, Jiri; Eisenacher, Martin; Martens, Lennart; Kohlbacher, Oliver
2017-01-01
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on ‘The Future of Proteomics in ELIXIR’ that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR’s existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper. PMID:28713550
Micro computed tomography (CT) scanned anatomical gateway to insect pest bioinformatics
USDA-ARS?s Scientific Manuscript database
An international collaboration to establish an interactive Digital Video Library for a Systems Biology Approach to study the Asian citrus Psyllid and psyllid genomics/proteomics interactions is demonstrated. Advances in micro-CT, digital computed tomography (CT) scan uses X-rays to make detailed pic...
This week, we are excited to announce the launch of the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) Proteogenomics Computational DREAM Challenge. The aim of this Challenge is to encourage the generation of computational methods for extracting information from the cancer proteome and for linking those data to genomic and transcriptomic information. The specific goals are to predict proteomic and phosphoproteomic data from other multiple data types including transcriptomics and genetics.
ProCon - PROteomics CONversion tool.
Mayer, Gerhard; Stephan, Christian; Meyer, Helmut E; Kohl, Michael; Marcus, Katrin; Eisenacher, Martin
2015-11-03
With the growing amount of experimental data produced in proteomics experiments and the requirements/recommendations of journals in the proteomics field to publicly make available data described in papers, a need for long-term storage of proteomics data in public repositories arises. For such an upload one needs proteomics data in a standardized format. Therefore, it is desirable, that the proprietary vendor's software will integrate in the future such an export functionality using the standard formats for proteomics results defined by the HUPO-PSI group. Currently not all search engines and analysis tools support these standard formats. In the meantime there is a need to provide user-friendly free-to-use conversion tools that can convert the data into such standard formats in order to support wet-lab scientists in creating proteomics data files ready for upload into the public repositories. ProCon is such a conversion tool written in Java for conversion of proteomics identification data into standard formats mzIdentML and Pride XML. It allows the conversion of Sequest™/Comet .out files, of search results from the popular and often used ProteomeDiscoverer® 1.x (x=versions 1.1 to1.4) software and search results stored in the LIMS systems ProteinScape® 1.3 and 2.1 into mzIdentML and PRIDE XML. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
Computer applications making rapid advances in high throughput microbial proteomics (HTMP).
Anandkumar, Balakrishna; Haga, Steve W; Wu, Hui-Fen
2014-02-01
The last few decades have seen the rise of widely-available proteomics tools. From new data acquisition devices, such as MALDI-MS and 2DE to new database searching softwares, these new products have paved the way for high throughput microbial proteomics (HTMP). These tools are enabling researchers to gain new insights into microbial metabolism, and are opening up new areas of study, such as protein-protein interactions (interactomics) discovery. Computer software is a key part of these emerging fields. This current review considers: 1) software tools for identifying the proteome, such as MASCOT or PDQuest, 2) online databases of proteomes, such as SWISS-PROT, Proteome Web, or the Proteomics Facility of the Pathogen Functional Genomics Resource Center, and 3) software tools for applying proteomic data, such as PSI-BLAST or VESPA. These tools allow for research in network biology, protein identification, functional annotation, target identification/validation, protein expression, protein structural analysis, metabolic pathway engineering and drug discovery.
Systematic Proteomic Approach to Characterize the Impacts of ...
Chemical interactions have posed a big challenge in toxicity characterization and human health risk assessment of environmental mixtures. To characterize the impacts of chemical interactions on protein and cytotoxicity responses to environmental mixtures, we established a systems biology approach integrating proteomics, bioinformatics, statistics, and computational toxicology to measure expression or phosphorylation levels of 21 critical toxicity pathway regulators and 445 downstream proteins in human BEAS-28 cells treated with 4 concentrations of nickel, 2 concentrations each of cadmium and chromium, as well as 12 defined binary and 8 defined ternary mixtures of these metals in vitro. Multivariate statistical analysis and mathematical modeling of the metal-mediated proteomic response patterns showed a high correlation between changes in protein expression or phosphorylation and cellular toxic responses to both individual metals and metal mixtures. Of the identified correlated proteins, only a small set of proteins including HIF-1a is likely to be responsible for selective cytotoxic responses to different metals and metals mixtures. Furthermore, support vector machine learning was utilized to computationally predict protein responses to uncharacterized metal mixtures using experimentally generated protein response profiles corresponding to known metal mixtures. This study provides a novel proteomic approach for characterization and prediction of toxicities of
Cloud Computing for Protein-Ligand Binding Site Comparison
2013-01-01
The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery. PMID:23762824
Cloud computing for protein-ligand binding site comparison.
Hung, Che-Lun; Hua, Guan-Jie
2013-01-01
The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.
Brusniak, Mi-Youn; Bodenmiller, Bernd; Campbell, David; Cooke, Kelly; Eddes, James; Garbutt, Andrew; Lau, Hollis; Letarte, Simon; Mueller, Lukas N; Sharma, Vagisha; Vitek, Olga; Zhang, Ning; Aebersold, Ruedi; Watts, Julian D
2008-01-01
Background Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. Conclusion The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field. PMID:19087345
2011-01-01
Background Since its inception, proteomics has essentially operated in a discovery mode with the goal of identifying and quantifying the maximal number of proteins in a sample. Increasingly, proteomic measurements are also supporting hypothesis-driven studies, in which a predetermined set of proteins is consistently detected and quantified in multiple samples. Selected reaction monitoring (SRM) is a targeted mass spectrometric technique that supports the detection and quantification of specific proteins in complex samples at high sensitivity and reproducibility. Here, we describe ATAQS, an integrated software platform that supports all stages of targeted, SRM-based proteomics experiments including target selection, transition optimization and post acquisition data analysis. This software will significantly facilitate the use of targeted proteomic techniques and contribute to the generation of highly sensitive, reproducible and complete datasets that are particularly critical for the discovery and validation of targets in hypothesis-driven studies in systems biology. Result We introduce a new open source software pipeline, ATAQS (Automated and Targeted Analysis with Quantitative SRM), which consists of a number of modules that collectively support the SRM assay development workflow for targeted proteomic experiments (project management and generation of protein, peptide and transitions and the validation of peptide detection by SRM). ATAQS provides a flexible pipeline for end-users by allowing the workflow to start or end at any point of the pipeline, and for computational biologists, by enabling the easy extension of java algorithm classes for their own algorithm plug-in or connection via an external web site. This integrated system supports all steps in a SRM-based experiment and provides a user-friendly GUI that can be run by any operating system that allows the installation of the Mozilla Firefox web browser. Conclusions Targeted proteomics via SRM is a powerful new technique that enables the reproducible and accurate identification and quantification of sets of proteins of interest. ATAQS is the first open-source software that supports all steps of the targeted proteomics workflow. ATAQS also provides software API (Application Program Interface) documentation that enables the addition of new algorithms to each of the workflow steps. The software, installation guide and sample dataset can be found in http://tools.proteomecenter.org/ATAQS/ATAQS.html PMID:21414234
The amino acid's backup bone - storage solutions for proteomics facilities.
Meckel, Hagen; Stephan, Christian; Bunse, Christian; Krafzik, Michael; Reher, Christopher; Kohl, Michael; Meyer, Helmut Erich; Eisenacher, Martin
2014-01-01
Proteomics methods, especially high-throughput mass spectrometry analysis have been continually developed and improved over the years. The analysis of complex biological samples produces large volumes of raw data. Data storage and recovery management pose substantial challenges to biomedical or proteomic facilities regarding backup and archiving concepts as well as hardware requirements. In this article we describe differences between the terms backup and archive with regard to manual and automatic approaches. We also introduce different storage concepts and technologies from transportable media to professional solutions such as redundant array of independent disks (RAID) systems, network attached storages (NAS) and storage area network (SAN). Moreover, we present a software solution, which we developed for the purpose of long-term preservation of large mass spectrometry raw data files on an object storage device (OSD) archiving system. Finally, advantages, disadvantages, and experiences from routine operations of the presented concepts and technologies are evaluated and discussed. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013. Published by Elsevier B.V.
Targeted proteomics identifies liquid-biopsy signatures for extracapsular prostate cancer
Kim, Yunee; Jeon, Jouhyun; Mejia, Salvador; Yao, Cindy Q; Ignatchenko, Vladimir; Nyalwidhe, Julius O; Gramolini, Anthony O; Lance, Raymond S; Troyer, Dean A; Drake, Richard R; Boutros, Paul C; Semmes, O. John; Kislinger, Thomas
2016-01-01
Biomarkers are rapidly gaining importance in personalized medicine. Although numerous molecular signatures have been developed over the past decade, there is a lack of overlap and many biomarkers fail to validate in independent patient cohorts and hence are not useful for clinical application. For these reasons, identification of novel and robust biomarkers remains a formidable challenge. We combine targeted proteomics with computational biology to discover robust proteomic signatures for prostate cancer. Quantitative proteomics conducted in expressed prostatic secretions from men with extraprostatic and organ-confined prostate cancers identified 133 differentially expressed proteins. Using synthetic peptides, we evaluate them by targeted proteomics in a 74-patient cohort of expressed prostatic secretions in urine. We quantify a panel of 34 candidates in an independent 207-patient cohort. We apply machine-learning approaches to develop clinical predictive models for prostate cancer diagnosis and prognosis. Our results demonstrate that computationally guided proteomics can discover highly accurate non-invasive biomarkers. PMID:27350604
Computational Omics Pre-Awardees | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce the pre-awardees of the Computational Omics solicitation. Working with NVIDIA Foundation's Compute the Cure initiative and Leidos Biomedical Research Inc., the NCI, through this solicitation, seeks to leverage computational efforts to provide tools for the mining and interpretation of large-scale publicly available ‘omics’ datasets.
The November 1, 2017 issue of Cancer Research is dedicated to a collection of computational resource papers in genomics, proteomics, animal models, imaging, and clinical subjects for non-bioinformaticists looking to incorporate computing tools into their work. Scientists at Pacific Northwest National Laboratory have developed P-MartCancer, an open, web-based interactive software tool that enables statistical analyses of peptide or protein data generated from mass-spectrometry (MS)-based global proteomics experiments.
Marc Snir | Argonne National Laboratory
Molecular biology Proteomics Environmental science & technology Air quality Atmospheric & climate , H.S., Jr., Demonstrating the scalability of a molecular dynamics application on a Petaflop computer Transformations IGSBInstitute for Genomics and Systems Biology IMEInstitute for Molecular Engineering JCESRJoint
Integrating Omics Technologies to Study Pulmonary Physiology and Pathology at the Systems Level
Pathak, Ravi Ramesh; Davé, Vrushank
2014-01-01
Assimilation and integration of “omics” technologies, including genomics, epigenomics, proteomics, and metabolomics has readily altered the landscape of medical research in the last decade. The vast and complex nature of omics data can only be interpreted by linking molecular information at the organismic level, forming the foundation of systems biology. Research in pulmonary biology/medicine has necessitated integration of omics, network, systems and computational biology data to differentially diagnose, interpret, and prognosticate pulmonary diseases, facilitating improvement in therapy and treatment modalities. This review describes how to leverage this emerging technology in understanding pulmonary diseases at the systems level –called a “systomic” approach. Considering the operational wholeness of cellular and organ systems, diseased genome, proteome, and the metabolome needs to be conceptualized at the systems level to understand disease pathogenesis and progression. Currently available omics technology and resources require a certain degree of training and proficiency in addition to dedicated hardware and applications, making them relatively less user friendly for the pulmonary biologist and clinicians. Herein, we discuss the various strategies, computational tools and approaches required to study pulmonary diseases at the systems level for biomedical scientists and clinical researchers. PMID:24802001
Yang, Shuai; Zhang, Xinlei; Diao, Lihong; Guo, Feifei; Wang, Dan; Liu, Zhongyang; Li, Honglei; Zheng, Junjie; Pan, Jingshan; Nice, Edouard C; Li, Dong; He, Fuchu
2015-09-04
The Chromosome-centric Human Proteome Project (C-HPP) aims to catalog genome-encoded proteins using a chromosome-by-chromosome strategy. As the C-HPP proceeds, the increasing requirement for data-intensive analysis of the MS/MS data poses a challenge to the proteomic community, especially small laboratories lacking computational infrastructure. To address this challenge, we have updated the previous CAPER browser into a higher version, CAPER 3.0, which is a scalable cloud-based system for data-intensive analysis of C-HPP data sets. CAPER 3.0 uses cloud computing technology to facilitate MS/MS-based peptide identification. In particular, it can use both public and private cloud, facilitating the analysis of C-HPP data sets. CAPER 3.0 provides a graphical user interface (GUI) to help users transfer data, configure jobs, track progress, and visualize the results comprehensively. These features enable users without programming expertise to easily conduct data-intensive analysis using CAPER 3.0. Here, we illustrate the usage of CAPER 3.0 with four specific mass spectral data-intensive problems: detecting novel peptides, identifying single amino acid variants (SAVs) derived from known missense mutations, identifying sample-specific SAVs, and identifying exon-skipping events. CAPER 3.0 is available at http://prodigy.bprc.ac.cn/caper3.
Simulation of two dimensional electrophoresis and tandem mass spectrometry for teaching proteomics.
Fisher, Amanda; Sekera, Emily; Payne, Jill; Craig, Paul
2012-01-01
In proteomics, complex mixtures of proteins are separated (usually by chromatography or electrophoresis) and identified by mass spectrometry. We have created 2DE Tandem MS, a computer program designed for use in the biochemistry, proteomics, or bioinformatics classroom. It contains two simulations-2D electrophoresis and tandem mass spectrometry. The two simulations are integrated together and are designed to teach the concept of proteome analysis of prokaryotic and eukaryotic organisms. 2DE-Tandem MS can be used as a freestanding simulation, or in conjunction with a wet lab, to introduce proteomics in the undergraduate classroom. 2DE Tandem MS is a free program available on Sourceforge at https://sourceforge.net/projects/jbf/. It was developed using Java Swing and functions in Mac OSX, Windows, and Linux, ensuring that every student sees a consistent and informative graphical user interface no matter the computer platform they choose. Java must be installed on the host computer to run 2DE Tandem MS. Example classroom exercises are provided in the Supporting Information. Copyright © 2012 Wiley Periodicals, Inc.
Andromeda: a peptide search engine integrated into the MaxQuant environment.
Cox, Jürgen; Neuhauser, Nadin; Michalski, Annette; Scheltema, Richard A; Olsen, Jesper V; Mann, Matthias
2011-04-01
A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
High throughput profile-profile based fold recognition for the entire human proteome.
McGuffin, Liam J; Smith, Richard T; Bryson, Kevin; Sørensen, Søren-Aksel; Jones, David T
2006-06-07
In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power. In this paper we describe and benchmark the JYDE (Job Yield Distribution Environment) system, which is a meta-scheduler designed to work above cluster schedulers, such as Sun Grid Engine (SGE) or Condor. We demonstrate the ability of JYDE to distribute the load of genomic-scale fold recognition across multiple independent Grid domains. We use the most recent profile-profile version of our mGenTHREADER software in order to annotate the latest version of the Human proteome against the latest sequence and structure databases in as short a time as possible. We show that our JYDE system is able to scale to large numbers of intensive fold recognition jobs running across several independent computer clusters. Using our JYDE system we have been able to annotate 99.9% of the protein sequences within the Human proteome in less than 24 hours, by harnessing over 500 CPUs from 3 independent Grid domains. This study clearly demonstrates the feasibility of carrying out on demand high quality structural annotations for the proteomes of major eukaryotic organisms. Specifically, we have shown that it is now possible to provide complete regular updates of profile-profile based fold recognition models for entire eukaryotic proteomes, through the use of Grid middleware such as JYDE.
Integration of cardiac proteome biology and medicine by a specialized knowledgebase.
Zong, Nobel C; Li, Haomin; Li, Hua; Lam, Maggie P Y; Jimenez, Rafael C; Kim, Christina S; Deng, Ning; Kim, Allen K; Choi, Jeong Ho; Zelaya, Ivette; Liem, David; Meyer, David; Odeberg, Jacob; Fang, Caiyun; Lu, Hao-Jie; Xu, Tao; Weiss, James; Duan, Huilong; Uhlen, Mathias; Yates, John R; Apweiler, Rolf; Ge, Junbo; Hermjakob, Henning; Ping, Peipei
2013-10-12
Omics sciences enable a systems-level perspective in characterizing cardiovascular biology. Integration of diverse proteomics data via a computational strategy will catalyze the assembly of contextualized knowledge, foster discoveries through multidisciplinary investigations, and minimize unnecessary redundancy in research efforts. The goal of this project is to develop a consolidated cardiac proteome knowledgebase with novel bioinformatics pipeline and Web portals, thereby serving as a new resource to advance cardiovascular biology and medicine. We created Cardiac Organellar Protein Atlas Knowledgebase (COPaKB; www.HeartProteome.org), a centralized platform of high-quality cardiac proteomic data, bioinformatics tools, and relevant cardiovascular phenotypes. Currently, COPaKB features 8 organellar modules, comprising 4203 LC-MS/MS experiments from human, mouse, drosophila, and Caenorhabditis elegans, as well as expression images of 10,924 proteins in human myocardium. In addition, the Java-coded bioinformatics tools provided by COPaKB enable cardiovascular investigators in all disciplines to retrieve and analyze pertinent organellar protein properties of interest. COPaKB provides an innovative and interactive resource that connects research interests with the new biological discoveries in protein sciences. With an array of intuitive tools in this unified Web server, nonproteomics investigators can conveniently collaborate with proteomics specialists to dissect the molecular signatures of cardiovascular phenotypes.
The unique peptidome: Taxon-specific tryptic peptides as biomarkers for targeted metaproteomics.
Mesuere, Bart; Van der Jeugt, Felix; Devreese, Bart; Vandamme, Peter; Dawyndt, Peter
2016-09-01
The Unique Peptide Finder (http://unipept.ugent.be/peptidefinder) is an interactive web application to quickly hunt for tryptic peptides that are unique to a particular species, genus, or any other taxon. Biodiversity within the target taxon is represented by a set of proteomes selected from a monthly updated list of complete and nonredundant UniProt proteomes, supplemented with proprietary proteomes loaded into persistent local browser storage. The software computes and visualizes pan and core peptidomes as unions and intersections of tryptic peptides occurring in the selected proteomes. In addition, it also computes and displays unique peptidomes as the set of all tryptic peptides that occur in all selected proteomes but not in any UniProt record not assigned to the target taxon. As a result, the unique peptides can serve as robust biomarkers for the target taxon, for example, in targeted metaproteomics studies. Computations are extremely fast since they are underpinned by the Unipept database, the lowest common ancestor algorithm implemented in Unipept and modern web technologies that facilitate in-browser data storage and parallel processing. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework
2012-01-01
Background For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. Results We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. Conclusion The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources. PMID:23216909
Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework.
Lewis, Steven; Csordas, Attila; Killcoyne, Sarah; Hermjakob, Henning; Hoopmann, Michael R; Moritz, Robert L; Deutsch, Eric W; Boyle, John
2012-12-05
For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources.
The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce the opening of the leaderboard to its Proteogenomics Computational DREAM Challenge. The leadership board remains open for submissions during September 25, 2017 through October 8, 2017, with the Challenge expected to run until November 17, 2017.
Choi, Hyungwon; Kim, Sinae; Fermin, Damian; Tsou, Chih-Chiang; Nesvizhskii, Alexey I
2015-11-03
We introduce QPROT, a statistical framework and computational tool for differential protein expression analysis using protein intensity data. QPROT is an extension of the QSPEC suite, originally developed for spectral count data, adapted for the analysis using continuously measured protein-level intensity data. QPROT offers a new intensity normalization procedure and model-based differential expression analysis, both of which account for missing data. Determination of differential expression of each protein is based on the standardized Z-statistic based on the posterior distribution of the log fold change parameter, guided by the false discovery rate estimated by a well-known Empirical Bayes method. We evaluated the classification performance of QPROT using the quantification calibration data from the clinical proteomic technology assessment for cancer (CPTAC) study and a recently published Escherichia coli benchmark dataset, with evaluation of FDR accuracy in the latter. QPROT is a statistical framework with computational software tool for comparative quantitative proteomics analysis. It features various extensions of QSPEC method originally built for spectral count data analysis, including probabilistic treatment of missing values in protein intensity data. With the increasing popularity of label-free quantitative proteomics data, the proposed method and accompanying software suite will be immediately useful for many proteomics laboratories. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015 Elsevier B.V. All rights reserved.
Simulation of Two Dimensional Electrophoresis and Tandem Mass Spectrometry for Teaching Proteomics
ERIC Educational Resources Information Center
Fisher, Amanda; Sekera, Emily; Payne, Jill; Craig, Paul
2012-01-01
In proteomics, complex mixtures of proteins are separated (usually by chromatography or electrophoresis) and identified by mass spectrometry. We have created 2DE Tandem MS, a computer program designed for use in the biochemistry, proteomics, or bioinformatics classroom. It contains two simulations--2D electrophoresis and tandem mass spectrometry.…
Hiller, Karsten; Grote, Andreas; Maneck, Matthias; Münch, Richard; Jahn, Dieter
2006-10-01
After the publication of JVirGel 1.0 in 2003 we got many requests and suggestions from the proteomics community to further improve the performance of the software and to add additional useful new features. The integration of the PrediSi algorithm for the prediction of signal peptides for the Sec-dependent protein export into JVirGel 2.0 allows the exclusion of most exported preproteins from calculated proteomic maps and provides the basis for the calculation of Sec-based secretomes. A tool for the identification of transmembrane helices carrying proteins (JCaMelix) and the prediction of the corresponding membrane proteome was added. Finally, in order to directly compare experimental and calculated proteome data, a function to overlay and evaluate predicted and experimental two-dimensional gels was included. JVirGel 2.0 is freely available as precompiled package for the installation on Windows or Linux operating systems. Furthermore, there is a completely platform-independent Java version available for download. Additionally, we provide a Java Server Pages based version of JVirGel 2.0 which can be operated in nearly all web browsers. All versions are accessible at http://www.jvirgel.de
compomics-utilities: an open-source Java library for computational proteomics.
Barsnes, Harald; Vaudel, Marc; Colaert, Niklaas; Helsens, Kenny; Sickmann, Albert; Berven, Frode S; Martens, Lennart
2011-03-08
The growing interest in the field of proteomics has increased the demand for software tools and applications that process and analyze the resulting data. And even though the purpose of these tools can vary significantly, they usually share a basic set of features, including the handling of protein and peptide sequences, the visualization of (and interaction with) spectra and chromatograms, and the parsing of results from various proteomics search engines. Developers typically spend considerable time and effort implementing these support structures, which detracts from working on the novel aspects of their tool. In order to simplify the development of proteomics tools, we have implemented an open-source support library for computational proteomics, called compomics-utilities. The library contains a broad set of features required for reading, parsing, and analyzing proteomics data. compomics-utilities is already used by a long list of existing software, ensuring library stability and continued support and development. As a user-friendly, well-documented and open-source library, compomics-utilities greatly simplifies the implementation of the basic features needed in most proteomics tools. Implemented in 100% Java, compomics-utilities is fully portable across platforms and architectures. Our library thus allows the developers to focus on the novel aspects of their tools, rather than on the basic functions, which can contribute substantially to faster development, and better tools for proteomics.
Thermosensitivity of growth is determined by chaperone-mediated proteome reallocation
Chen, Ke; Gao, Ye; Mih, Nathan; O’Brien, Edward J.; Yang, Laurence; Palsson, Bernhard O.
2017-01-01
Maintenance of a properly folded proteome is critical for bacterial survival at notably different growth temperatures. Understanding the molecular basis of thermoadaptation has progressed in two main directions, the sequence and structural basis of protein thermostability and the mechanistic principles of protein quality control assisted by chaperones. Yet we do not fully understand how structural integrity of the entire proteome is maintained under stress and how it affects cellular fitness. To address this challenge, we reconstruct a genome-scale protein-folding network for Escherichia coli and formulate a computational model, FoldME, that provides statistical descriptions of multiscale cellular response consistent with many datasets. FoldME simulations show (i) that the chaperones act as a system when they respond to unfolding stress rather than achieving efficient folding of any single component of the proteome, (ii) how the proteome is globally balanced between chaperones for folding and the complex machinery synthesizing the proteins in response to perturbation, (iii) how this balancing determines growth rate dependence on temperature and is achieved through nonspecific regulation, and (iv) how thermal instability of the individual protein affects the overall functional state of the proteome. Overall, these results expand our view of cellular regulation, from targeted specific control mechanisms to global regulation through a web of nonspecific competing interactions that modulate the optimal reallocation of cellular resources. The methodology developed in this study enables genome-scale integration of environment-dependent protein properties and a proteome-wide study of cellular stress responses. PMID:29073085
The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce that teams led by Jaewoo Kang (Korea University), and Yuanfang Guan with Hongyang Li (University of Michigan) as the best performers of the NCI-CPTAC DREAM Proteogenomics Computational Challenge. Over 500 participants from 20 countries registered for the Challenge, which offered $25,000 in cash awards contributed by the NVIDIA Foundation through its Compute the Cure initiative.
Controlled vocabularies and ontologies in proteomics: Overview, principles and practice☆
Mayer, Gerhard; Jones, Andrew R.; Binz, Pierre-Alain; Deutsch, Eric W.; Orchard, Sandra; Montecchi-Palazzi, Luisa; Vizcaíno, Juan Antonio; Hermjakob, Henning; Oveillero, David; Julian, Randall; Stephan, Christian; Meyer, Helmut E.; Eisenacher, Martin
2014-01-01
This paper focuses on the use of controlled vocabularies (CVs) and ontologies especially in the area of proteomics, primarily related to the work of the Proteomics Standards Initiative (PSI). It describes the relevant proteomics standard formats and the ontologies used within them. Software and tools for working with these ontology files are also discussed. The article also examines the “mapping files” used to ensure correct controlled vocabulary terms that are placed within PSI standards and the fulfillment of the MIAPE (Minimum Information about a Proteomics Experiment) requirements. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23429179
Computer aided manual validation of mass spectrometry-based proteomic data.
Curran, Timothy G; Bryson, Bryan D; Reigelhaupt, Michael; Johnson, Hannah; White, Forest M
2013-06-15
Advances in mass spectrometry-based proteomic technologies have increased the speed of analysis and the depth provided by a single analysis. Computational tools to evaluate the accuracy of peptide identifications from these high-throughput analyses have not kept pace with technological advances; currently the most common quality evaluation methods are based on statistical analysis of the likelihood of false positive identifications in large-scale data sets. While helpful, these calculations do not consider the accuracy of each identification, thus creating a precarious situation for biologists relying on the data to inform experimental design. Manual validation is the gold standard approach to confirm accuracy of database identifications, but is extremely time-intensive. To palliate the increasing time required to manually validate large proteomic datasets, we provide computer aided manual validation software (CAMV) to expedite the process. Relevant spectra are collected, catalogued, and pre-labeled, allowing users to efficiently judge the quality of each identification and summarize applicable quantitative information. CAMV significantly reduces the burden associated with manual validation and will hopefully encourage broader adoption of manual validation in mass spectrometry-based proteomics. Copyright © 2013 Elsevier Inc. All rights reserved.
Expert system for computer-assisted annotation of MS/MS spectra.
Neuhauser, Nadin; Michalski, Annette; Cox, Jürgen; Mann, Matthias
2012-11-01
An important step in mass spectrometry (MS)-based proteomics is the identification of peptides by their fragment spectra. Regardless of the identification score achieved, almost all tandem-MS (MS/MS) spectra contain remaining peaks that are not assigned by the search engine. These peaks may be explainable by human experts but the scale of modern proteomics experiments makes this impractical. In computer science, Expert Systems are a mature technology to implement a list of rules generated by interviews with practitioners. We here develop such an Expert System, making use of literature knowledge as well as a large body of high mass accuracy and pure fragmentation spectra. Interestingly, we find that even with high mass accuracy data, rule sets can quickly become too complex, leading to over-annotation. Therefore we establish a rigorous false discovery rate, calculated by random insertion of peaks from a large collection of other MS/MS spectra, and use it to develop an optimized knowledge base. This rule set correctly annotates almost all peaks of medium or high abundance. For high resolution HCD data, median intensity coverage of fragment peaks in MS/MS spectra increases from 58% by search engine annotation alone to 86%. The resulting annotation performance surpasses a human expert, especially on complex spectra such as those of larger phosphorylated peptides. Our system is also applicable to high resolution collision-induced dissociation data. It is available both as a part of MaxQuant and via a webserver that only requires an MS/MS spectrum and the corresponding peptides sequence, and which outputs publication quality, annotated MS/MS spectra (www.biochem.mpg.de/mann/tools/). It provides expert knowledge to beginners in the field of MS-based proteomics and helps advanced users to focus on unusual and possibly novel types of fragment ions.
Expert System for Computer-assisted Annotation of MS/MS Spectra*
Neuhauser, Nadin; Michalski, Annette; Cox, Jürgen; Mann, Matthias
2012-01-01
An important step in mass spectrometry (MS)-based proteomics is the identification of peptides by their fragment spectra. Regardless of the identification score achieved, almost all tandem-MS (MS/MS) spectra contain remaining peaks that are not assigned by the search engine. These peaks may be explainable by human experts but the scale of modern proteomics experiments makes this impractical. In computer science, Expert Systems are a mature technology to implement a list of rules generated by interviews with practitioners. We here develop such an Expert System, making use of literature knowledge as well as a large body of high mass accuracy and pure fragmentation spectra. Interestingly, we find that even with high mass accuracy data, rule sets can quickly become too complex, leading to over-annotation. Therefore we establish a rigorous false discovery rate, calculated by random insertion of peaks from a large collection of other MS/MS spectra, and use it to develop an optimized knowledge base. This rule set correctly annotates almost all peaks of medium or high abundance. For high resolution HCD data, median intensity coverage of fragment peaks in MS/MS spectra increases from 58% by search engine annotation alone to 86%. The resulting annotation performance surpasses a human expert, especially on complex spectra such as those of larger phosphorylated peptides. Our system is also applicable to high resolution collision-induced dissociation data. It is available both as a part of MaxQuant and via a webserver that only requires an MS/MS spectrum and the corresponding peptides sequence, and which outputs publication quality, annotated MS/MS spectra (www.biochem.mpg.de/mann/tools/). It provides expert knowledge to beginners in the field of MS-based proteomics and helps advanced users to focus on unusual and possibly novel types of fragment ions. PMID:22888147
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stekhoven, Daniel J.; Omasits, Ulrich; Quebatte, Maxime
2014-03-01
Proteomics data provide unique insights into biological systems, including the predominant subcellular localization (SCL) of proteins, which can reveal important clues about their functions. Here we analyzed data of a complete prokaryotic proteome expressed under two conditions mimicking interaction of the emerging pathogen Bartonella henselae with its mammalian host. Normalized spectral count data from cytoplasmic, total membrane, inner and outer membrane fractions allowed us to identify the predominant SCL for 82% of the identified proteins. The spectral count proportion of total membrane versus cytoplasmic fractions indicated the propensity of cytoplasmic proteins to co-fractionate with the inner membrane, and enabled usmore » to distinguish cytoplasmic, peripheral innermembrane and bona fide inner membrane proteins. Principal component analysis and k-nearest neighbor classification training on selected marker proteins or predominantly localized proteins, allowed us to determine an extensive catalog of at least 74 expressed outer membrane proteins, and to extend the SCL assignment to 94% of the identified proteins, including 18% where in silico methods gave no prediction. Suitable experimental proteomics data combined with straightforward computational approaches can thus identify the predominant SCL on a proteome-wide scale. Finally, we present a conceptual approach to identify proteins potentially changing their SCL in a condition-dependent fashion.« less
Cloud CPFP: a shotgun proteomics data analysis pipeline using cloud and high performance computing.
Trudgian, David C; Mirzaei, Hamid
2012-12-07
We have extended the functionality of the Central Proteomics Facilities Pipeline (CPFP) to allow use of remote cloud and high performance computing (HPC) resources for shotgun proteomics data processing. CPFP has been modified to include modular local and remote scheduling for data processing jobs. The pipeline can now be run on a single PC or server, a local cluster, a remote HPC cluster, and/or the Amazon Web Services (AWS) cloud. We provide public images that allow easy deployment of CPFP in its entirety in the AWS cloud. This significantly reduces the effort necessary to use the software, and allows proteomics laboratories to pay for compute time ad hoc, rather than obtaining and maintaining expensive local server clusters. Alternatively the Amazon cloud can be used to increase the throughput of a local installation of CPFP as necessary. We demonstrate that cloud CPFP allows users to process data at higher speed than local installations but with similar cost and lower staff requirements. In addition to the computational improvements, the web interface to CPFP is simplified, and other functionalities are enhanced. The software is under active development at two leading institutions and continues to be released under an open-source license at http://cpfp.sourceforge.net.
Hamzeiy, Hamid; Cox, Jürgen
2017-02-01
Computational workflows for mass spectrometry-based shotgun proteomics and untargeted metabolomics share many steps. Despite the similarities, untargeted metabolomics is lagging behind in terms of reliable fully automated quantitative data analysis. We argue that metabolomics will strongly benefit from the adaptation of successful automated proteomics workflows to metabolomics. MaxQuant is a popular platform for proteomics data analysis and is widely considered to be superior in achieving high precursor mass accuracies through advanced nonlinear recalibration, usually leading to five to ten-fold better accuracy in complex LC-MS/MS runs. This translates to a sharp decrease in the number of peptide candidates per measured feature, thereby strongly improving the coverage of identified peptides. We argue that similar strategies can be applied to untargeted metabolomics, leading to equivalent improvements in metabolite identification. Copyright © 2016 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Design and Initial Characterization of the SC-200 Proteomics Standard Mixture
Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald
2011-01-01
Abstract High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels. PMID:21250827
Design and initial characterization of the SC-200 proteomics standard mixture.
Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald; Kolker, Eugene
2011-01-01
High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels.
2005-01-01
proteomic gel analyses. The research group has explored the use of chemodescriptors calculated using high-level ab initio quantum chemical basis sets...descriptors that characterize the entire proteomics map, local descriptors that characterize a subset of the proteins present in the gel, and spectrum...techniques for analyzing the full set of proteins present in a proteomics map. 14. SUBJECT TERMS 1S. NUMBER OF PAGES Topological indices
Employee Spotlight: Clarence Chang | Argonne National Laboratory
batteries --Electricity transmission --Smart Grid Environment -Biology --Computational biology --Environmental biology ---Metagenomics ---Terrestrial ecology --Molecular biology ---Clinical proteomics and biomarker discovery ---Interventional biology ---Proteomics --Structural biology -Environmental science &
PatternLab for proteomics 4.0: A one-stop shop for analyzing shotgun proteomic data
Carvalho, Paulo C; Lima, Diogo B; Leprevost, Felipe V; Santos, Marlon D M; Fischer, Juliana S G; Aquino, Priscila F; Moresco, James J; Yates, John R; Barbosa, Valmir C
2017-01-01
PatternLab for proteomics is an integrated computational environment that unifies several previously published modules for analyzing shotgun proteomic data. PatternLab contains modules for formatting sequence databases, performing peptide spectrum matching, statistically filtering and organizing shotgun proteomic data, extracting quantitative information from label-free and chemically labeled data, performing statistics for differential proteomics, displaying results in a variety of graphical formats, performing similarity-driven studies with de novo sequencing data, analyzing time-course experiments, and helping with the understanding of the biological significance of data in the light of the Gene Ontology. Here we describe PatternLab for proteomics 4.0, which closely knits together all of these modules in a self-contained environment, covering the principal aspects of proteomic data analysis as a freely available and easily installable software package. All updates to PatternLab, as well as all new features added to it, have been tested over the years on millions of mass spectra. PMID:26658470
Appel, R D; Palagi, P M; Walther, D; Vargas, J R; Sanchez, J C; Ravier, F; Pasquali, C; Hochstrasser, D F
1997-12-01
Although two-dimensional electrophoresis (2-DE) computer analysis software packages have existed ever since 2-DE technology was developed, it is only now that the hardware and software technology allows large-scale studies to be performed on low-cost personal computers or workstations, and that setting up a 2-DE computer analysis system in a small laboratory is no longer considered a luxury. After a first attempt in the seventies and early eighties to develop 2-DE analysis software systems on hardware that had poor or even no graphical capabilities, followed in the late eighties by a wave of innovative software developments that were possible thanks to new graphical interface standards such as XWindows, a third generation of 2-DE analysis software packages has now come to maturity. It can be run on a variety of low-cost, general-purpose personal computers, thus making the purchase of a 2-DE analysis system easily attainable for even the smallest laboratory that is involved in proteome research. Melanie II 2-D PAGE, developed at the University Hospital of Geneva, is such a third-generation software system for 2-DE analysis. Based on unique image processing algorithms, this user-friendly object-oriented software package runs on multiple platforms, including Unix, MS-Windows 95 and NT, and Power Macintosh. It provides efficient spot detection and quantitation, state-of-the-art image comparison, statistical data analysis facilities, and is Internet-ready. Linked to proteome databases such as those available on the World Wide Web, it represents a valuable tool for the "Virtual Lab" of the post-genome area.
77 FR 67381 - Government-Owned Inventions; Availability for Licensing
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-09
.... ``Computational and Experimental RNA Nanoparticle Design,'' in Automation in Genomics and Proteomics: An... and Experimental RNA Nanoparticle Design,'' in Automation in Genomics and Proteomics: An Engineering... Development Stage: Prototype Pre-clinical In vitro data available Inventors: Robert J. Crouch and Yutaka...
Shteynberg, David; Deutsch, Eric W.; Lam, Henry; Eng, Jimmy K.; Sun, Zhi; Tasman, Natalie; Mendoza, Luis; Moritz, Robert L.; Aebersold, Ruedi; Nesvizhskii, Alexey I.
2011-01-01
The combination of tandem mass spectrometry and sequence database searching is the method of choice for the identification of peptides and the mapping of proteomes. Over the last several years, the volume of data generated in proteomic studies has increased dramatically, which challenges the computational approaches previously developed for these data. Furthermore, a multitude of search engines have been developed that identify different, overlapping subsets of the sample peptides from a particular set of tandem mass spectrometry spectra. We present iProphet, the new addition to the widely used open-source suite of proteomic data analysis tools Trans-Proteomics Pipeline. Applied in tandem with PeptideProphet, it provides more accurate representation of the multilevel nature of shotgun proteomic data. iProphet combines the evidence from multiple identifications of the same peptide sequences across different spectra, experiments, precursor ion charge states, and modified states. It also allows accurate and effective integration of the results from multiple database search engines applied to the same data. The use of iProphet in the Trans-Proteomics Pipeline increases the number of correctly identified peptides at a constant false discovery rate as compared with both PeptideProphet and another state-of-the-art tool Percolator. As the main outcome, iProphet permits the calculation of accurate posterior probabilities and false discovery rate estimates at the level of sequence identical peptide identifications, which in turn leads to more accurate probability estimates at the protein level. Fully integrated with the Trans-Proteomics Pipeline, it supports all commonly used MS instruments, search engines, and computer platforms. The performance of iProphet is demonstrated on two publicly available data sets: data from a human whole cell lysate proteome profiling experiment representative of typical proteomic data sets, and from a set of Streptococcus pyogenes experiments more representative of organism-specific composite data sets. PMID:21876204
Han, Mee-Jung; Yun, Hongseok; Lee, Jeong Wook; Lee, Yu Hyun; Lee, Sang Yup; Yoo, Jong-Shin; Kim, Jin Young; Kim, Jihyun F; Hur, Cheol-Goo
2011-04-01
Escherichia coli K-12 and B strains have most widely been employed for scientific studies as well as industrial applications. Recently, the complete genome sequences of two representative descendants of E. coli B strains, REL606 and BL21(DE3), have been determined. Here, we report the subproteome reference maps of E. coli B REL606 by analyzing cytoplasmic, periplasmic, inner and outer membrane, and extracellular proteomes based on the genome information using experimental and computational approaches. Among the total of 3487 spots, 651 proteins including 410 non-redundant proteins were identified and characterized by 2-DE and LC-MS/MS; they include 440 cytoplasmic, 45 periplasmic, 50 inner membrane, 61 outer membrane, and 55 extracellular proteins. In addition, subcellular localizations of all 4205 ORFs of E. coli B were predicted by combined computational prediction methods. The subcellular localizations of 1812 (43.09%) proteins of currently unknown function were newly assigned. The results of computational prediction were also compared with the experimental results, showing that overall precision and recall were 92.16 and 92.16%, respectively. This work represents the most comprehensive analyses of the subproteomes of E. coli B, and will be useful as a reference for proteome profiling studies under various conditions. The complete proteome data are available online (http://ecolib.kaist.ac.kr). Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Chen, Xiang; Velliste, Meel; Murphy, Robert F.
2010-01-01
Proteomics, the large scale identification and characterization of many or all proteins expressed in a given cell type, has become a major area of biological research. In addition to information on protein sequence, structure and expression levels, knowledge of a protein’s subcellular location is essential to a complete understanding of its functions. Currently subcellular location patterns are routinely determined by visual inspection of fluorescence microscope images. We review here research aimed at creating systems for automated, systematic determination of location. These employ numerical feature extraction from images, feature reduction to identify the most useful features, and various supervised learning (classification) and unsupervised learning (clustering) methods. These methods have been shown to perform significantly better than human interpretation of the same images. When coupled with technologies for tagging large numbers of proteins and high-throughput microscope systems, the computational methods reviewed here enable the new subfield of location proteomics. This subfield will make critical contributions in two related areas. First, it will provide structured, high-resolution information on location to enable Systems Biology efforts to simulate cell behavior from the gene level on up. Second, it will provide tools for Cytomics projects aimed at characterizing the behaviors of all cell types before, during and after the onset of various diseases. PMID:16752421
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brown, Roslyn N.; Sanford, James A.; Park, Jea H.
Towards developing a systems-level pathobiological understanding of Salmonella enterica, we performed a subcellular proteomic analysis of this pathogen grown under standard laboratory and infection-mimicking conditions in vitro. Analysis of proteins from cytoplasmic, inner membrane, periplasmic, and outer membrane fractions yielded coverage of over 30% of the theoretical proteome. Confident subcellular location could be assigned to over 1000 proteins, with good agreement between experimentally observed location and predicted/known protein properties. Comparison of protein location under the different environmental conditions provided insight into dynamic protein localization and possible moonlighting (multiple function) activities. Notable examples of dynamic localization were the response regulators ofmore » two-component regulatory systems (e.g., ArcB, PhoQ). The DNA-binding protein Dps that is generally regarded as cytoplasmic was significantly enriched in the outer membrane for all growth conditions examined, suggestive of moonlighting activities. These observations imply the existence of unknown transport mechanisms and novel functions for a subset of Salmonella proteins. Overall, this work provides a catalog of experimentally verified subcellular protein location for Salmonella and a framework for further investigations using computational modeling.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lopez-Ferrer, Daniel; Petritis, Konstantinos; Robinson, Errol W.
2011-02-01
Integrated top-down bottom-up proteomics combined with online digestion has great potential to improve the characterization of protein isoforms in biological systems and is amendable to highthroughput proteomics experiments. Bottom-up proteomics ultimately provides the peptide sequences derived from the tandem MS analyses of peptides after the proteome has been digested. Top-down proteomics conversely entails the MS analyses of intact proteins for more effective characterization of genetic variations and/or post-translational modifications (PTMs). Herein, we describe recent efforts towards efficient integration of bottom-up and top-down LCMS based proteomic strategies. Since most proteomic platforms (i.e. LC systems) operate in acidic environments, we exploited themore » compatibility of the pepsin (i.e. the enzyme’s natural acidic activity) for the integration of bottom-up and top-down proteomics. Pressure enhanced pepsin digestions were successfully performed and characterized with several standard proteins in either an offline mode using a Barocycler or an online mode using a modified high pressure LC system referred to as a fast online digestion system (FOLDS). FOLDS was tested using pepsin and a whole microbial proteome, and the results compared against traditional trypsin digestions on the same platform. Additionally, FOLDS was integrated with a RePlay configuration to demonstrate an ultra-rapid integrated bottom-up top-down proteomic strategy employing a standard mixture of proteins and a monkey pox virus proteome.« less
Systems Proteomics for Translational Network Medicine
Arrell, D. Kent; Terzic, Andre
2012-01-01
Universal principles underlying network science, and their ever-increasing applications in biomedicine, underscore the unprecedented capacity of systems biology based strategies to synthesize and resolve massive high throughput generated datasets. Enabling previously unattainable comprehension of biological complexity, systems approaches have accelerated progress in elucidating disease prediction, progression, and outcome. Applied to the spectrum of states spanning health and disease, network proteomics establishes a collation, integration, and prioritization algorithm to guide mapping and decoding of proteome landscapes from large-scale raw data. Providing unparalleled deconvolution of protein lists into global interactomes, integrative systems proteomics enables objective, multi-modal interpretation at molecular, pathway, and network scales, merging individual molecular components, their plurality of interactions, and functional contributions for systems comprehension. As such, network systems approaches are increasingly exploited for objective interpretation of cardiovascular proteomics studies. Here, we highlight network systems proteomic analysis pipelines for integration and biological interpretation through protein cartography, ontological categorization, pathway and functional enrichment and complex network analysis. PMID:22896016
Recent developments in structural proteomics for protein structure determination.
Liu, Hsuan-Liang; Hsu, Jyh-Ping
2005-05-01
The major challenges in structural proteomics include identifying all the proteins on the genome-wide scale, determining their structure-function relationships, and outlining the precise three-dimensional structures of the proteins. Protein structures are typically determined by experimental approaches such as X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. However, the knowledge of three-dimensional space by these techniques is still limited. Thus, computational methods such as comparative and de novo approaches and molecular dynamic simulations are intensively used as alternative tools to predict the three-dimensional structures and dynamic behavior of proteins. This review summarizes recent developments in structural proteomics for protein structure determination; including instrumental methods such as X-ray crystallography and NMR spectroscopy, and computational methods such as comparative and de novo structure prediction and molecular dynamics simulations.
Enhancement of Environmental Hazard Degradation in the Presence of Lignin: a Proteomics Study
Sun, Su; Xie, Shangxian; Cheng, Yanbing; ...
2017-09-12
Proteomics studies of fungal systems have progressed dramatically based on the availability of more fungal genome sequences in recent years. Different proteomics strategies have been applied toward characterization of fungal proteome and revealed important gene functions and proteome dynamics. Presented here is the application of shot-gun proteomic technology to study the bio-remediation of environmental hazards by white-rot fungus. Lignin, a naturally abundant component of the plant biomass, is discovered to promote the degradation of Azo dye by white-rot fungus Irpex lacteus CD2 in the lignin/dye/fungus system. Shotgun proteomics technique was used to understand degradation mechanism at the protein level formore » the lignin/dye/fungus system. Our proteomics study can identify about two thousand proteins (one third of the predicted white-rot fungal proteome) in a single experiment, as one of the most powerful proteomics platforms to study the fungal system to date. The study shows a significant enrichment of oxidoreduction functional category under the dye/lignin combined treatment. An in vitro validation is performed and supports our hypothesis that the synergy of Fenton reaction and manganese peroxidase might play an important role in DR5B dye degradation. The results could guide the development of effective bioremediation strategies and efficient lignocellulosic biomass conversion.« less
Enhancement of Environmental Hazard Degradation in the Presence of Lignin: a Proteomics Study.
Sun, Su; Xie, Shangxian; Cheng, Yanbing; Yu, Hongbo; Zhao, Honglu; Li, Muzi; Li, Xiaotong; Zhang, Xiaoyu; Yuan, Joshua S; Dai, Susie Y
2017-09-12
Proteomics studies of fungal systems have progressed dramatically based on the availability of more fungal genome sequences in recent years. Different proteomics strategies have been applied toward characterization of fungal proteome and revealed important gene functions and proteome dynamics. Presented here is the application of shot-gun proteomic technology to study the bio-remediation of environmental hazards by white-rot fungus. Lignin, a naturally abundant component of the plant biomass, is discovered to promote the degradation of Azo dye by white-rot fungus Irpex lacteus CD2 in the lignin/dye/fungus system. Shotgun proteomics technique was used to understand degradation mechanism at the protein level for the lignin/dye/fungus system. Our proteomics study can identify about two thousand proteins (one third of the predicted white-rot fungal proteome) in a single experiment, as one of the most powerful proteomics platforms to study the fungal system to date. The study shows a significant enrichment of oxidoreduction functional category under the dye/lignin combined treatment. An in vitro validation is performed and supports our hypothesis that the synergy of Fenton reaction and manganese peroxidase might play an important role in DR5B dye degradation. The results could guide the development of effective bioremediation strategies and efficient lignocellulosic biomass conversion.
Enhancement of Environmental Hazard Degradation in the Presence of Lignin: a Proteomics Study
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sun, Su; Xie, Shangxian; Cheng, Yanbing
Proteomics studies of fungal systems have progressed dramatically based on the availability of more fungal genome sequences in recent years. Different proteomics strategies have been applied toward characterization of fungal proteome and revealed important gene functions and proteome dynamics. Presented here is the application of shot-gun proteomic technology to study the bio-remediation of environmental hazards by white-rot fungus. Lignin, a naturally abundant component of the plant biomass, is discovered to promote the degradation of Azo dye by white-rot fungus Irpex lacteus CD2 in the lignin/dye/fungus system. Shotgun proteomics technique was used to understand degradation mechanism at the protein level formore » the lignin/dye/fungus system. Our proteomics study can identify about two thousand proteins (one third of the predicted white-rot fungal proteome) in a single experiment, as one of the most powerful proteomics platforms to study the fungal system to date. The study shows a significant enrichment of oxidoreduction functional category under the dye/lignin combined treatment. An in vitro validation is performed and supports our hypothesis that the synergy of Fenton reaction and manganese peroxidase might play an important role in DR5B dye degradation. The results could guide the development of effective bioremediation strategies and efficient lignocellulosic biomass conversion.« less
2013-01-01
Background The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing - the matching of peptide measurements across samples. Results We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Conclusions Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods. PMID:24341404
Benjamin, Ashlee M; Thompson, J Will; Soderblom, Erik J; Geromanos, Scott J; Henao, Ricardo; Kraus, Virginia B; Moseley, M Arthur; Lucas, Joseph E
2013-12-16
The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing--the matching of peptide measurements across samples. We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods.
Pressurized Pepsin Digestion in Proteomics
López-Ferrer, Daniel; Petritis, Konstantinos; Robinson, Errol W.; Hixson, Kim K.; Tian, Zhixin; Lee, Jung Hwa; Lee, Sang-Won; Tolić, Nikola; Weitz, Karl K.; Belov, Mikhail E.; Smith, Richard D.; Paša-Tolić, Ljiljana
2011-01-01
Integrated top-down bottom-up proteomics combined with on-line digestion has great potential to improve the characterization of protein isoforms in biological systems and is amendable to high throughput proteomics experiments. Bottom-up proteomics ultimately provides the peptide sequences derived from the tandem MS analyses of peptides after the proteome has been digested. Top-down proteomics conversely entails the MS analyses of intact proteins for more effective characterization of genetic variations and/or post-translational modifications. Herein, we describe recent efforts toward efficient integration of bottom-up and top-down LC-MS-based proteomics strategies. Since most proteomics separations utilize acidic conditions, we exploited the compatibility of pepsin (where the optimal digestion conditions are at low pH) for integration into bottom-up and top-down proteomics work flows. Pressure-enhanced pepsin digestions were successfully performed and characterized with several standard proteins in either an off-line mode using a Barocycler or an on-line mode using a modified high pressure LC system referred to as a fast on-line digestion system (FOLDS). FOLDS was tested using pepsin and a whole microbial proteome, and the results were compared against traditional trypsin digestions on the same platform. Additionally, FOLDS was integrated with a RePlay configuration to demonstrate an ultrarapid integrated bottom-up top-down proteomics strategy using a standard mixture of proteins and a monkey pox virus proteome. PMID:20627868
Computational approaches to protein inference in shotgun proteomics
2012-01-01
Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in a sample has been recognized as an important computational task. This task generally consists of (1) assigning experimental tandem mass spectra to peptides derived from a protein database, and (2) mapping assigned peptides to proteins and quantifying the confidence of identified proteins. Protein identification is fundamentally a statistical inference problem with a number of methods proposed to address its challenges. In this review we categorize current approaches into rule-based, combinatorial optimization and probabilistic inference techniques, and present them using integer programing and Bayesian inference frameworks. We also discuss the main challenges of protein identification and propose potential solutions with the goal of spurring innovative research in this area. PMID:23176300
Affordable proteomics: the two-hybrid systems.
Gillespie, Marc
2003-06-01
Numerous proteomic methodologies exist, but most require a heavy investment in expertise and technology. This puts these approaches out of reach for many laboratories and small companies, rarely allowing proteomics to be used as a pilot approach for biomarker or target identification. Two proteomic approaches, 2D gel electrophoresis and the two-hybrid systems, are currently available to most researchers. The two-hybrid systems, though accommodating to large-scale experiments, were originally designed as practical screens, that by comparison to current proteomics tools were small-scale, affordable and technically feasible. The screens rapidly generated data, identifying protein interactions that were previously uncharacterized. The foundation for a two-hybrid proteomic investigation can be purchased as separate kits from a number of companies. The true power of the technique lies not in its affordability, but rather in its portability. The two-hybrid system puts proteomics back into laboratories where the output of the screens can be evaluated by researchers with experience in the particular fields of basic research, cancer biology, toxicology or drug development.
FunRich proteomics software analysis, let the fun begin!
Benito-Martin, Alberto; Peinado, Héctor
2015-08-01
Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Parasites, proteomes and systems: has Descartes' clock run out of time?
Wastling, J M; Armstrong, S D; Krishna, R; Xia, D
2012-08-01
Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types.
Parasites, proteomes and systems: has Descartes’ clock run out of time?
WASTLING, J. M.; ARMSTRONG, S. D.; KRISHNA, R.; XIA, D.
2012-01-01
SUMMARY Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types. PMID:22828391
Cohen Freue, Gabriela V.; Meredith, Anna; Smith, Derek; Bergman, Axel; Sasaki, Mayu; Lam, Karen K. Y.; Hollander, Zsuzsanna; Opushneva, Nina; Takhar, Mandeep; Lin, David; Wilson-McManus, Janet; Balshaw, Robert; Keown, Paul A.; Borchers, Christoph H.; McManus, Bruce; Ng, Raymond T.; McMaster, W. Robert
2013-01-01
Recent technical advances in the field of quantitative proteomics have stimulated a large number of biomarker discovery studies of various diseases, providing avenues for new treatments and diagnostics. However, inherent challenges have limited the successful translation of candidate biomarkers into clinical use, thus highlighting the need for a robust analytical methodology to transition from biomarker discovery to clinical implementation. We have developed an end-to-end computational proteomic pipeline for biomarkers studies. At the discovery stage, the pipeline emphasizes different aspects of experimental design, appropriate statistical methodologies, and quality assessment of results. At the validation stage, the pipeline focuses on the migration of the results to a platform appropriate for external validation, and the development of a classifier score based on corroborated protein biomarkers. At the last stage towards clinical implementation, the main aims are to develop and validate an assay suitable for clinical deployment, and to calibrate the biomarker classifier using the developed assay. The proposed pipeline was applied to a biomarker study in cardiac transplantation aimed at developing a minimally invasive clinical test to monitor acute rejection. Starting with an untargeted screening of the human plasma proteome, five candidate biomarker proteins were identified. Rejection-regulated proteins reflect cellular and humoral immune responses, acute phase inflammatory pathways, and lipid metabolism biological processes. A multiplex multiple reaction monitoring mass-spectrometry (MRM-MS) assay was developed for the five candidate biomarkers and validated by enzyme-linked immune-sorbent (ELISA) and immunonephelometric assays (INA). A classifier score based on corroborated proteins demonstrated that the developed MRM-MS assay provides an appropriate methodology for an external validation, which is still in progress. Plasma proteomic biomarkers of acute cardiac rejection may offer a relevant post-transplant monitoring tool to effectively guide clinical care. The proposed computational pipeline is highly applicable to a wide range of biomarker proteomic studies. PMID:23592955
Computational Systems Biology in Cancer: Modeling Methods and Applications
Materi, Wayne; Wishart, David S.
2007-01-01
In recent years it has become clear that carcinogenesis is a complex process, both at the molecular and cellular levels. Understanding the origins, growth and spread of cancer, therefore requires an integrated or system-wide approach. Computational systems biology is an emerging sub-discipline in systems biology that utilizes the wealth of data from genomic, proteomic and metabolomic studies to build computer simulations of intra and intercellular processes. Several useful descriptive and predictive models of the origin, growth and spread of cancers have been developed in an effort to better understand the disease and potential therapeutic approaches. In this review we describe and assess the practical and theoretical underpinnings of commonly-used modeling approaches, including ordinary and partial differential equations, petri nets, cellular automata, agent based models and hybrid systems. A number of computer-based formalisms have been implemented to improve the accessibility of the various approaches to researchers whose primary interest lies outside of model development. We discuss several of these and describe how they have led to novel insights into tumor genesis, growth, apoptosis, vascularization and therapy. PMID:19936081
HTAPP: High-Throughput Autonomous Proteomic Pipeline
Yu, Kebing; Salomon, Arthur R.
2011-01-01
Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic datasets is critically important. The high-throughput autonomous proteomic pipeline (HTAPP) described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is comprised of software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of HTAPP focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples. PMID:20336676
Havugimana, Pierre C; Hu, Pingzhao; Emili, Andrew
2017-10-01
Elucidation of the networks of physical (functional) interactions present in cells and tissues is fundamental for understanding the molecular organization of biological systems, the mechanistic basis of essential and disease-related processes, and for functional annotation of previously uncharacterized proteins (via guilt-by-association or -correlation). After a decade in the field, we felt it timely to document our own experiences in the systematic analysis of protein interaction networks. Areas covered: Researchers worldwide have contributed innovative experimental and computational approaches that have driven the rapidly evolving field of 'functional proteomics'. These include mass spectrometry-based methods to characterize macromolecular complexes on a global-scale and sophisticated data analysis tools - most notably machine learning - that allow for the generation of high-quality protein association maps. Expert commentary: Here, we recount some key lessons learned, with an emphasis on successful workflows, and challenges, arising from our own and other groups' ongoing efforts to generate, interpret and report proteome-scale interaction networks in increasingly diverse biological contexts.
The MaxQuant computational platform for mass spectrometry-based shotgun proteomics.
Tyanova, Stefka; Temu, Tikira; Cox, Juergen
2016-12-01
MaxQuant is one of the most frequently used platforms for mass-spectrometry (MS)-based proteomics data analysis. Since its first release in 2008, it has grown substantially in functionality and can be used in conjunction with more MS platforms. Here we present an updated protocol covering the most important basic computational workflows, including those designed for quantitative label-free proteomics, MS1-level labeling and isobaric labeling techniques. This protocol presents a complete description of the parameters used in MaxQuant, as well as of the configuration options of its integrated search engine, Andromeda. This protocol update describes an adaptation of an existing protocol that substantially modifies the technique. Important concepts of shotgun proteomics and their implementation in MaxQuant are briefly reviewed, including different quantification strategies and the control of false-discovery rates (FDRs), as well as the analysis of post-translational modifications (PTMs). The MaxQuant output tables, which contain information about quantification of proteins and PTMs, are explained in detail. Furthermore, we provide a short version of the workflow that is applicable to data sets with simple and standard experimental designs. The MaxQuant algorithms are efficiently parallelized on multiple processors and scale well from desktop computers to servers with many cores. The software is written in C# and is freely available at http://www.maxquant.org.
A comprehensive and scalable database search system for metaproteomics.
Chatterjee, Sandip; Stupp, Gregory S; Park, Sung Kyu Robin; Ducom, Jean-Christophe; Yates, John R; Su, Andrew I; Wolan, Dennis W
2016-08-16
Mass spectrometry-based shotgun proteomics experiments rely on accurate matching of experimental spectra against a database of protein sequences. Existing computational analysis methods are limited in the size of their sequence databases, which severely restricts the proteomic sequencing depth and functional analysis of highly complex samples. The growing amount of public high-throughput sequencing data will only exacerbate this problem. We designed a broadly applicable metaproteomic analysis method (ComPIL) that addresses protein database size limitations. Our approach to overcome this significant limitation in metaproteomics was to design a scalable set of sequence databases assembled for optimal library querying speeds. ComPIL was integrated with a modified version of the search engine ProLuCID (termed "Blazmass") to permit rapid matching of experimental spectra. Proof-of-principle analysis of human HEK293 lysate with a ComPIL database derived from high-quality genomic libraries was able to detect nearly all of the same peptides as a search with a human database (~500x fewer peptides in the database), with a small reduction in sensitivity. We were also able to detect proteins from the adenovirus used to immortalize these cells. We applied our method to a set of healthy human gut microbiome proteomic samples and showed a substantial increase in the number of identified peptides and proteins compared to previous metaproteomic analyses, while retaining a high degree of protein identification accuracy and allowing for a more in-depth characterization of the functional landscape of the samples. The combination of ComPIL with Blazmass allows proteomic searches to be performed with database sizes much larger than previously possible. These large database searches can be applied to complex meta-samples with unknown composition or proteomic samples where unexpected proteins may be identified. The protein database, proteomic search engine, and the proteomic data files for the 5 microbiome samples characterized and discussed herein are open source and available for use and additional analysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Matzke, Melissa M.; Brown, Joseph N.; Gritsenko, Marina A.
2013-02-01
Liquid chromatography coupled with mass spectrometry (LC-MS) is widely used to identify and quantify peptides in complex biological samples. In particular, label-free shotgun proteomics is highly effective for the identification of peptides and subsequently obtaining a global protein profile of a sample. As a result, this approach is widely used for discovery studies. Typically, the objective of these discovery studies is to identify proteins that are affected by some condition of interest (e.g. disease, exposure). However, for complex biological samples, label-free LC-MS proteomics experiments measure peptides and do not directly yield protein quantities. Thus, protein quantification must be inferred frommore » one or more measured peptides. In recent years, many computational approaches to relative protein quantification of label-free LC-MS data have been published. In this review, we examine the most commonly employed quantification approaches to relative protein abundance from peak intensity values, evaluate their individual merits, and discuss challenges in the use of the various computational approaches.« less
Applying systems biology methods to the study of human physiology in extreme environments
2013-01-01
Systems biology is defined in this review as ‘an iterative process of computational model building and experimental model revision with the aim of understanding or simulating complex biological systems’. We propose that, in practice, systems biology rests on three pillars: computation, the omics disciplines and repeated experimental perturbation of the system of interest. The number of ethical and physiologically relevant perturbations that can be used in experiments on healthy humans is extremely limited and principally comprises exercise, nutrition, infusions (e.g. Intralipid), some drugs and altered environment. Thus, we argue that systems biology and environmental physiology are natural symbionts for those interested in a system-level understanding of human biology. However, despite excellent progress in high-altitude genetics and several proteomics studies, systems biology research into human adaptation to extreme environments is in its infancy. A brief description and overview of systems biology in its current guise is given, followed by a mini review of computational methods used for modelling biological systems. Special attention is given to high-altitude research, metabolic network reconstruction and constraint-based modelling. PMID:23849719
Unexpected features of the dark proteome.
Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S; Buckley, Michael J; Tabor, Bruce; Signal, Beth; Gloss, Brian S; Hammang, Christopher J; Rost, Burkhard; Schafferhans, Andrea; O'Donoghue, Seán I
2015-12-29
We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology.
Computation as the mechanistic bridge between precision medicine and systems therapeutics.
Hansen, J; Iyengar, R
2013-01-01
Over the past 50 years, like molecular cell biology, medicine and pharmacology have been driven by a reductionist approach. The focus on individual genes and cellular components as disease loci and drug targets has been a necessary step in understanding the basic mechanisms underlying tissue/organ physiology and drug action. Recent progress in genomics and proteomics, as well as advances in other technologies that enable large-scale data gathering and computational approaches, is providing new knowledge of both normal and disease states. Systems-biology approaches enable integration of knowledge from different types of data for precision medicine and systems therapeutics. In this review, we describe recent studies that contribute to these emerging fields and discuss how together these fields can lead to a mechanism-based therapy for individual patients.
Computation as the Mechanistic Bridge Between Precision Medicine and Systems Therapeutics
Hansen, J; Iyengar, R
2014-01-01
Over the past 50 years, like molecular cell biology, medicine and pharmacology have been driven by a reductionist approach. The focus on individual genes and cellular components as disease loci and drug targets has been a necessary step in understanding the basic mechanisms underlying tissue/organ physiology and drug action. Recent progress in genomics and proteomics, as well as advances in other technologies that enable large-scale data gathering and computational approaches, is providing new knowledge of both normal and disease states. Systems-biology approaches enable integration of knowledge from different types of data for precision medicine and systems therapeutics. In this review, we describe recent studies that contribute to these emerging fields and discuss how together these fields can lead to a mechanism-based therapy for individual patients. PMID:23212109
QCloud: A cloud-based quality control system for mass spectrometry-based proteomics laboratories
Chiva, Cristina; Olivella, Roger; Borràs, Eva; Espadas, Guadalupe; Pastor, Olga; Solé, Amanda
2018-01-01
The increasing number of biomedical and translational applications in mass spectrometry-based proteomics poses new analytical challenges and raises the need for automated quality control systems. Despite previous efforts to set standard file formats, data processing workflows and key evaluation parameters for quality control, automated quality control systems are not yet widespread among proteomics laboratories, which limits the acquisition of high-quality results, inter-laboratory comparisons and the assessment of variability of instrumental platforms. Here we present QCloud, a cloud-based system to support proteomics laboratories in daily quality assessment using a user-friendly interface, easy setup, automated data processing and archiving, and unbiased instrument evaluation. QCloud supports the most common targeted and untargeted proteomics workflows, it accepts data formats from different vendors and it enables the annotation of acquired data and reporting incidences. A complete version of the QCloud system has successfully been developed and it is now open to the proteomics community (http://qcloud.crg.eu). QCloud system is an open source project, publicly available under a Creative Commons License Attribution-ShareAlike 4.0. PMID:29324744
Accurate proteome-wide protein quantification from high-resolution 15N mass spectra
2011-01-01
In quantitative mass spectrometry-based proteomics, the metabolic incorporation of a single source of 15N-labeled nitrogen has many advantages over using stable isotope-labeled amino acids. However, the lack of a robust computational framework for analyzing the resulting spectra has impeded wide use of this approach. We have addressed this challenge by introducing a new computational methodology for analyzing 15N spectra in which quantification is integrated with identification. Application of this method to an Escherichia coli growth transition reveals significant improvement in quantification accuracy over previous methods. PMID:22182234
Proteome | Office of Cancer Clinical Proteomics Research
A proteome is the entire complement of proteins, including modifications made to a particular set of proteins, produced by an organism or a cellular system. This will vary with time and distinct requirements such as growth conditions and stresses, and thus is highly dynamic and spatial. Proteomics is the study of the proteome.
Comparative proteome analysis of laboratory grown Brucella abortus 2308 and Brucella melitensis 16M.
Eschenbrenner, Michel; Horn, Troy A; Wagner, Mary Ann; Mujer, Cesar V; Miller-Scandle, Tabbi L; DelVecchio, Vito G
2006-07-01
Brucella species are pathogenic agents that cause brucellosis, a debilitating zoonotic disease that affects a large variety of domesticated animals and humans. Brucella melitensis and Brucella abortus are considered major health threats because of their highly infectious nature and worldwide occurrence. The availability of the annotated genomes for these two species has allowed a comparative proteomics study of laboratory grown B. melitensis 16M and B. abortus 2308 by two-dimensional (2-D) gel electrophoresis and peptide mass fingerprinting. Computer-assisted analysis of the different 2-D gel images of strains 16M and 2308 revealed significant quantitative and qualitative differences in their protein expression patterns. Proteins involved in membrane transport, particularly the high affinity amino acids binding proteins, and those involved in Sec-dependent secretion systems related to type IV and type V secretion systems, were differentially expressed. Differential expression of these proteins may be responsible for conferring specific host preference in the two strains 2308 and 16M.
Yates, John R
2015-11-01
Advances in computer technology and software have driven developments in mass spectrometry over the last 50 years. Computers and software have been impactful in three areas: the automation of difficult calculations to aid interpretation, the collection of data and control of instruments, and data interpretation. As the power of computers has grown, so too has the utility and impact on mass spectrometers and their capabilities. This has been particularly evident in the use of tandem mass spectrometry data to search protein and nucleotide sequence databases to identify peptide and protein sequences. This capability has driven the development of many new approaches to study biological systems, including the use of "bottom-up shotgun proteomics" to directly analyze protein mixtures. Graphical Abstract ᅟ.
Computational Omics Funding Opportunity | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) and the NVIDIA Foundation are pleased to announce funding opportunities in the fight against cancer. Each organization has launched a request for proposals (RFP) that will collectively fund up to $2 million to help to develop a new generation of data-intensive scientific tools to find new ways to treat cancer.
Mapping the Small Molecule Interactome by Mass Spectrometry.
Flaxman, Hope A; Woo, Christina M
2018-01-16
Mapping small molecule interactions throughout the proteome provides the critical structural basis for functional analysis of their impact on biochemistry. However, translation of mass spectrometry-based proteomics methods to directly profile the interaction between a small molecule and the whole proteome is challenging because of the substoichiometric nature of many interactions, the diversity of covalent and noncovalent interactions involved, and the subsequent computational complexity associated with their spectral assignment. Recent advances in chemical proteomics have begun fill this gap to provide a structural basis for the breadth of small molecule-protein interactions in the whole proteome. Innovations enabling direct characterization of the small molecule interactome include faster, more sensitive instrumentation coupled to chemical conjugation, enrichment, and labeling methods that facilitate detection and assignment. These methods have started to measure molecular interaction hotspots due to inherent differences in local amino acid reactivity and binding affinity throughout the proteome. Measurement of the small molecule interactome is producing structural insights and methods for probing and engineering protein biochemistry. Direct structural characterization of the small molecule interactome is a rapidly emerging area pushing new frontiers in biochemistry at the interface of small molecules and the proteome.
Deshmukh, Rupesh K; Sonah, Humira; Bélanger, Richard R
2016-01-01
Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is offered as a resource for AQP research.
Deshmukh, Rupesh K.; Sonah, Humira; Bélanger, Richard R.
2016-01-01
Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is offered as a resource for AQP research. PMID:28066459
Workflow based framework for life science informatics.
Tiwari, Abhishek; Sekhar, Arvind K T
2007-10-01
Workflow technology is a generic mechanism to integrate diverse types of available resources (databases, servers, software applications and different services) which facilitate knowledge exchange within traditionally divergent fields such as molecular biology, clinical research, computational science, physics, chemistry and statistics. Researchers can easily incorporate and access diverse, distributed tools and data to develop their own research protocols for scientific analysis. Application of workflow technology has been reported in areas like drug discovery, genomics, large-scale gene expression analysis, proteomics, and system biology. In this article, we have discussed the existing workflow systems and the trends in applications of workflow based systems.
Recent Advance in Applications of Proteomics Technologies on Traditional Chinese Medicine Research
Zhu, Fangshi; Liu, Xuan; Li, Qi; Su, Shi-bing
2015-01-01
Proteomics technology, a major component of system biology, has gained comprehensive attention in the area of medical diagnosis, drug development, and mechanism research. On the holistic and systemic theory, proteomics has a convergence with traditional Chinese medicine (TCM). In this review, we discussed the applications of proteomic technologies in diseases-TCM syndrome combination researches. We also introduced the proteomic studies on the in vivo and in vitro effects and underlying mechanisms of TCM treatments using Chinese herbal medicine (CHM), Chinese herbal formula (CHF), and acupuncture. Furthermore, the combined studies of proteomics with other “-omics” technologies in TCM were also discussed. In summary, this report presents an overview of the recent advances in the application of proteomic technologies in TCM studies and sheds a light on the future global and further research on TCM. PMID:26557869
A Computer-Aided Diagnosis System for Breast Cancer Combining Mammography and Proteomics
2007-05-01
findings in both Data sets C and M. The likelihood ratio is the probability of the features un- der the malignant case divided by the probability of...likelihood ratio value as a classification decision variable, the probabilities of detection and false alarm are calculated as follows: Pdfusion...lowered the fused classifier’s performance to near chance levels. A genetic algorithm searched over the likelihood- ratio thresh- old values for each
Proteome complexity and the forces that drive proteome imbalance.
Harper, J Wade; Bennett, Eric J
2016-09-15
The cellular proteome is a complex microcosm of structural and regulatory networks that requires continuous surveillance and modification to meet the dynamic needs of the cell. It is therefore crucial that the protein flux of the cell remains in balance to ensure proper cell function. Genetic alterations that range from chromosome imbalance to oncogene activation can affect the speed, fidelity and capacity of protein biogenesis and degradation systems, which often results in proteome imbalance. An improved understanding of the causes and consequences of proteome imbalance is helping to reveal how these systems can be targeted to treat diseases such as cancer.
Liquid Chromatography Mass Spectrometry-Based Proteomics: Biological and Technological Aspects
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karpievitch, Yuliya V.; Polpitiya, Ashoka D.; Anderson, Gordon A.
2010-12-01
Mass spectrometry-based proteomics has become the tool of choice for identifying and quantifying the proteome of an organism. Though recent years have seen a tremendous improvement in instrument performance and the computational tools used, significant challenges remain, and there are many opportunities for statisticians to make important contributions. In the most widely used "bottom-up" approach to proteomics, complex mixtures of proteins are first subjected to enzymatic cleavage, the resulting peptide products are separated based on chemical or physical properties and analyzed using a mass spectrometer. The two fundamental challenges in the analysis of bottom-up MS-based proteomics are: (1) Identifying themore » proteins that are present in a sample, and (2) Quantifying the abundance levels of the identified proteins. Both of these challenges require knowledge of the biological and technological context that gives rise to observed data, as well as the application of sound statistical principles for estimation and inference. We present an overview of bottom-up proteomics and outline the key statistical issues that arise in protein identification and quantification.« less
A tutorial for software development in quantitative proteomics using PSI standard formats☆
Gonzalez-Galarza, Faviel F.; Qi, Da; Fan, Jun; Bessant, Conrad; Jones, Andrew R.
2014-01-01
The Human Proteome Organisation — Proteomics Standards Initiative (HUPO-PSI) has been working for ten years on the development of standardised formats that facilitate data sharing and public database deposition. In this article, we review three HUPO-PSI data standards — mzML, mzIdentML and mzQuantML, which can be used to design a complete quantitative analysis pipeline in mass spectrometry (MS)-based proteomics. In this tutorial, we briefly describe the content of each data model, sufficient for bioinformaticians to devise proteomics software. We also provide guidance on the use of recently released application programming interfaces (APIs) developed in Java for each of these standards, which makes it straightforward to read and write files of any size. We have produced a set of example Java classes and a basic graphical user interface to demonstrate how to use the most important parts of the PSI standards, available from http://code.google.com/p/psi-standard-formats-tutorial. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23584085
Proteomics of the Human Placenta: Promises and Realities
Robinson, J.M.; Ackerman, W.E.; Kniss, D.A.; Takizawa, T.; Vandré, D.D.
2015-01-01
Proteomics is an area of study that sets as its ultimate goal the global analysis of all of the proteins expressed in a biological system of interest. However, technical limitations currently hamper proteome-wide analyses of complex systems. In a more practical sense, a desired outcome of proteomics research is the translation of large protein data sets into formats that provide meaningful information regarding clinical conditions (e.g., biomarkers to serve as diagnostic and/or prognostic indicators of disease). Herein, we discuss placental proteomics by describing existing studies, pointing out their strengths and weaknesses. In so doing, we strive to inform investigators interested in this area of research about the current gap between hyperbolic promises and realities. Additionally, we discuss the utility of proteomics in discovery-based research, particularly as regards the capacity to unearth novel insights into placental biology. Importantly, when considering under studied systems such as the human placenta and diseases associated with abnormalities in placental function, proteomics can serve as a robust ‘shortcut’ to obtaining information unlikely to be garnered using traditional approaches. PMID:18222537
Systems biology approaches to understand the effects of nutrition and promote health.
Badimon, Lina; Vilahur, Gemma; Padro, Teresa
2017-01-01
Within the last years the implementation of systems biology in nutritional research has emerged as a powerful tool to understand the mechanisms by which dietary components promote health and prevent disease as well as to identify the biologically active molecules involved in such effects. Systems biology, by combining several '-omics' disciplines (mainly genomics/transcriptomics, proteomics and metabolomics), creates large data sets that upon computational integration provide in silico predictive networks that allow a more extensive analysis of the individual response to a nutritional intervention and provide a more global comprehensive understanding of how diet may influence health and disease. Numerous studies have demonstrated that diet and particularly bioactive food components play a pivotal role in helping to counteract environmental-related oxidative damage. Oxidative stress is considered to be strongly implicated in ageing and the pathophysiology of numerous diseases including neurodegenerative disease, cancers, metabolic disorders and cardiovascular diseases. In the following review we will provide insights into the role of systems biology in nutritional research and focus on transcriptomic, proteomic and metabolomics studies that have demonstrated the ability of functional foods and their bioactive components to fight against oxidative damage and contribute to health benefits. © 2016 The British Pharmacological Society.
Proteome complexity and the forces that drive proteome imbalance
Harper, J. Wade; Bennett, Eric J.
2016-01-01
Summary The cellular proteome is a complex microcosm of structural and regulatory networks that requires continuous surveillance and modification to meet the dynamic needs of the cell. It is therefore crucial that the protein flux of the cell remains in balance to ensure proper cell function. Genetic alterations that range from chromosome imbalance to oncogene activation can affect the speed, fidelity and capacity of protein biogenesis and degradation systems, which often results in proteome imbalance. An improved understanding of the causes and consequences of proteome imbalance is helping to reveal how these systems can be targeted to treat diseases such as cancer. PMID:27629639
Stable isotope labelling methods in mass spectrometry-based quantitative proteomics.
Chahrour, Osama; Cobice, Diego; Malone, John
2015-09-10
Mass-spectrometry based proteomics has evolved as a promising technology over the last decade and is undergoing a dramatic development in a number of different areas, such as; mass spectrometric instrumentation, peptide identification algorithms and bioinformatic computational data analysis. The improved methodology allows quantitative measurement of relative or absolute protein amounts, which is essential for gaining insights into their functions and dynamics in biological systems. Several different strategies involving stable isotopes label (ICAT, ICPL, IDBEST, iTRAQ, TMT, IPTL, SILAC), label-free statistical assessment approaches (MRM, SWATH) and absolute quantification methods (AQUA) are possible, each having specific strengths and weaknesses. Inductively coupled plasma mass spectrometry (ICP-MS), which is still widely recognised as elemental detector, has recently emerged as a complementary technique to the previous methods. The new application area for ICP-MS is targeting the fast growing field of proteomics related research, allowing absolute protein quantification using suitable elemental based tags. This document describes the different stable isotope labelling methods which incorporate metabolic labelling in live cells, ICP-MS based detection and post-harvest chemical label tagging for protein quantification, in addition to summarising their pros and cons. Copyright © 2015 Elsevier B.V. All rights reserved.
ProteoSign: an end-user online differential proteomics statistical analysis platform.
Efstathiou, Georgios; Antonakis, Andreas N; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Divanach, Peter; Trudgian, David C; Thomas, Benjamin; Papanikolaou, Nikolas; Aivaliotis, Michalis; Acuto, Oreste; Iliopoulos, Ioannis
2017-07-03
Profiling of proteome dynamics is crucial for understanding cellular behavior in response to intrinsic and extrinsic stimuli and maintenance of homeostasis. Over the last 20 years, mass spectrometry (MS) has emerged as the most powerful tool for large-scale identification and characterization of proteins. Bottom-up proteomics, the most common MS-based proteomics approach, has always been challenging in terms of data management, processing, analysis and visualization, with modern instruments capable of producing several gigabytes of data out of a single experiment. Here, we present ProteoSign, a freely available web application, dedicated in allowing users to perform proteomics differential expression/abundance analysis in a user-friendly and self-explanatory way. Although several non-commercial standalone tools have been developed for post-quantification statistical analysis of proteomics data, most of them are not end-user appealing as they often require very stringent installation of programming environments, third-party software packages and sometimes further scripting or computer programming. To avoid this bottleneck, we have developed a user-friendly software platform accessible via a web interface in order to enable proteomics laboratories and core facilities to statistically analyse quantitative proteomics data sets in a resource-efficient manner. ProteoSign is available at http://bioinformatics.med.uoc.gr/ProteoSign and the source code at https://github.com/yorgodillo/ProteoSign. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Efficient visualization of high-throughput targeted proteomics experiments: TAPIR.
Röst, Hannes L; Rosenberger, George; Aebersold, Ruedi; Malmström, Lars
2015-07-15
Targeted mass spectrometry comprises a set of powerful methods to obtain accurate and consistent protein quantification in complex samples. To fully exploit these techniques, a cross-platform and open-source software stack based on standardized data exchange formats is required. We present TAPIR, a fast and efficient Python visualization software for chromatograms and peaks identified in targeted proteomics experiments. The input formats are open, community-driven standardized data formats (mzML for raw data storage and TraML encoding the hierarchical relationships between transitions, peptides and proteins). TAPIR is scalable to proteome-wide targeted proteomics studies (as enabled by SWATH-MS), allowing researchers to visualize high-throughput datasets. The framework integrates well with existing automated analysis pipelines and can be extended beyond targeted proteomics to other types of analyses. TAPIR is available for all computing platforms under the 3-clause BSD license at https://github.com/msproteomicstools/msproteomicstools. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis.
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V Lila; Karagouni, Amalia D; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality.
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V. Lila; Karagouni, Amalia D.; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality. PMID:22267904
P2P proteomics -- data sharing for enhanced protein identification
2012-01-01
Background In order to tackle the important and challenging problem in proteomics of identifying known and new protein sequences using high-throughput methods, we propose a data-sharing platform that uses fully distributed P2P technologies to share specifications of peer-interaction protocols and service components. By using such a platform, information to be searched is no longer centralised in a few repositories but gathered from experiments in peer proteomics laboratories, which can subsequently be searched by fellow researchers. Methods The system distributively runs a data-sharing protocol specified in the Lightweight Communication Calculus underlying the system through which researchers interact via message passing. For this, researchers interact with the system through particular components that link to database querying systems based on BLAST and/or OMSSA and GUI-based visualisation environments. We have tested the proposed platform with data drawn from preexisting MS/MS data reservoirs from the 2006 ABRF (Association of Biomolecular Resource Facilities) test sample, which was extensively tested during the ABRF Proteomics Standards Research Group 2006 worldwide survey. In particular we have taken the data available from a subset of proteomics laboratories of Spain's National Institute for Proteomics, ProteoRed, a network for the coordination, integration and development of the Spanish proteomics facilities. Results and Discussion We performed queries against nine databases including seven ProteoRed proteomics laboratories, the NCBI Swiss-Prot database and the local database of the CSIC/UAB Proteomics Laboratory. A detailed analysis of the results indicated the presence of a protein that was supported by other NCBI matches and highly scored matches in several proteomics labs. The analysis clearly indicated that the protein was a relatively high concentrated contaminant that could be present in the ABRF sample. This fact is evident from the information that could be derived from the proposed P2P proteomics system, however it is not straightforward to arrive to the same conclusion by conventional means as it is difficult to discard organic contamination of samples. The actual presence of this contaminant was only stated after the ABRF study of all the identifications reported by the laboratories. PMID:22293032
Marx, Harald; Lemeer, Simone; Schliep, Jan Erik; Matheron, Lucrece; Mohammed, Shabaz; Cox, Jürgen; Mann, Matthias; Heck, Albert J R; Kuster, Bernhard
2013-06-01
We present a peptide library and data resource of >100,000 synthetic, unmodified peptides and their phosphorylated counterparts with known sequences and phosphorylation sites. Analysis of the library by mass spectrometry yielded a data set that we used to evaluate the merits of different search engines (Mascot and Andromeda) and fragmentation methods (beam-type collision-induced dissociation (HCD) and electron transfer dissociation (ETD)) for peptide identification. We also compared the sensitivities and accuracies of phosphorylation-site localization tools (Mascot Delta Score, PTM score and phosphoRS), and we characterized the chromatographic behavior of peptides in the library. We found that HCD identified more peptides and phosphopeptides than did ETD, that phosphopeptides generally eluted later from reversed-phase columns and were easier to identify than unmodified peptides and that current computational tools for proteomics can still be substantially improved. These peptides and spectra will facilitate the development, evaluation and improvement of experimental and computational proteomic strategies, such as separation techniques and the prediction of retention times and fragmentation patterns.
A practical data processing workflow for multi-OMICS projects.
Kohl, Michael; Megger, Dominik A; Trippler, Martin; Meckel, Hagen; Ahrens, Maike; Bracht, Thilo; Weber, Frank; Hoffmann, Andreas-Claudius; Baba, Hideo A; Sitek, Barbara; Schlaak, Jörg F; Meyer, Helmut E; Stephan, Christian; Eisenacher, Martin
2014-01-01
Multi-OMICS approaches aim on the integration of quantitative data obtained for different biological molecules in order to understand their interrelation and the functioning of larger systems. This paper deals with several data integration and data processing issues that frequently occur within this context. To this end, the data processing workflow within the PROFILE project is presented, a multi-OMICS project that aims on identification of novel biomarkers and the development of new therapeutic targets for seven important liver diseases. Furthermore, a software called CrossPlatformCommander is sketched, which facilitates several steps of the proposed workflow in a semi-automatic manner. Application of the software is presented for the detection of novel biomarkers, their ranking and annotation with existing knowledge using the example of corresponding Transcriptomics and Proteomics data sets obtained from patients suffering from hepatocellular carcinoma. Additionally, a linear regression analysis of Transcriptomics vs. Proteomics data is presented and its performance assessed. It was shown, that for capturing profound relations between Transcriptomics and Proteomics data, a simple linear regression analysis is not sufficient and implementation and evaluation of alternative statistical approaches are needed. Additionally, the integration of multivariate variable selection and classification approaches is intended for further development of the software. Although this paper focuses only on the combination of data obtained from quantitative Proteomics and Transcriptomics experiments, several approaches and data integration steps are also applicable for other OMICS technologies. Keeping specific restrictions in mind the suggested workflow (or at least parts of it) may be used as a template for similar projects that make use of different high throughput techniques. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Proteomics of Skeletal Muscle: Focus on Insulin Resistance and Exercise Biology
Deshmukh, Atul S.
2016-01-01
Skeletal muscle is the largest tissue in the human body and plays an important role in locomotion and whole body metabolism. It accounts for ~80% of insulin stimulated glucose disposal. Skeletal muscle insulin resistance, a primary feature of Type 2 diabetes, is caused by a decreased ability of muscle to respond to circulating insulin. Physical exercise improves insulin sensitivity and whole body metabolism and remains one of the most promising interventions for the prevention of Type 2 diabetes. Insulin resistance and exercise adaptations in skeletal muscle might be a cause, or consequence, of altered protein expressions profiles and/or their posttranslational modifications (PTMs). Mass spectrometry (MS)-based proteomics offer enormous promise for investigating the molecular mechanisms underlying skeletal muscle insulin resistance and exercise-induced adaptation; however, skeletal muscle proteomics are challenging. This review describes the technical limitations of skeletal muscle proteomics as well as emerging developments in proteomics workflow with respect to samples preparation, liquid chromatography (LC), MS and computational analysis. These technologies have not yet been fully exploited in the field of skeletal muscle proteomics. Future studies that involve state-of-the-art proteomics technology will broaden our understanding of exercise-induced adaptations as well as molecular pathogenesis of insulin resistance. This could lead to the identification of new therapeutic targets. PMID:28248217
The Pacific Northwest National Laboratory library of bacterial and archaeal proteomic biodiversity
Payne, Samuel H.; Monroe, Matthew E.; Overall, Christopher C.; ...
2015-08-18
This dataset deposition announces the submission to public repositories of the PNNL Biodiversity Library, a large collection of global proteomics data for 112 bacterial and archaeal organisms. The data comprises 35,162 tandem mass spectrometry (MS/MS) datasets from ~10 years of research. All data has been searched, annotated and organized in a consistent manner to promote reuse by the community. Protein identifications were cross-referenced with KEGG functional annotations which allows for pathway oriented investigation. We present the data as a freely available community resource. A variety of data re-use options are described for computational modeling, proteomics assay design and bioengineering. Instrumentmore » data and analysis files are available at ProteomeXchange via the MassIVE partner repository under the identifiers PXD001860 and MSV000079053.« less
The Pacific Northwest National Laboratory library of bacterial and archaeal proteomic biodiversity
DOE Office of Scientific and Technical Information (OSTI.GOV)
Payne, Samuel H.; Monroe, Matthew E.; Overall, Christopher C.
This dataset deposition announces the submission to public repositories of the PNNL Biodiversity Library, a large collection of global proteomics data for 112 bacterial and archaeal organisms. The data comprises 35,162 tandem mass spectrometry (MS/MS) datasets from ~10 years of research. All data has been searched, annotated and organized in a consistent manner to promote reuse by the community. Protein identifications were cross-referenced with KEGG functional annotations which allows for pathway oriented investigation. We present the data as a freely available community resource. A variety of data re-use options are described for computational modeling, proteomics assay design and bioengineering. Instrumentmore » data and analysis files are available at ProteomeXchange via the MassIVE partner repository under the identifiers PXD001860 and MSV000079053.« less
Stadlmann, Johannes; Hoi, David M; Taubenschmid, Jasmin; Mechtler, Karl; Penninger, Josef M
2018-05-18
SugarQb (www.imba.oeaw.ac.at/sugarqb) is a freely available collection of computational tools for the automated identification of intact glycopeptides from high-resolution HCD MS/MS data-sets in the Proteome Discoverer environment. We report the migration of SugarQb to the latest and free version of Proteome Discoverer 2.1, and apply it to the analysis of PNGase F-resistant N-glycopeptides from mouse embryonic stem cells. The analysis of intact glycopeptides highlights unexpected technical limitations to PNGase F-dependent glycoproteomic workflows at the proteome level, and warrants a critical re-interpretation of seminal data-sets in the context of N-glycosylation-site prediction. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Gadher, Suresh Jivan; Drahos, László; Vékey, Károly; Kovarova, Hana
2017-07-01
The Central and Eastern European Proteomic Conference (CEEPC) proudly celebrated its 10th Anniversary with an exciting scientific program inclusive of proteome, proteomics and systems biology in Budapest, Hungary. Since 2007, CEEPC has represented 'state-of the-art' proteomics in and around Central and Eastern Europe and these series of conferences have become a well-recognized event in the proteomic calendar. Fresher challenges and global healthcare issues such as ageing and chronic diseases are driving clinical and scientific research towards regenerative, reparative and personalized medicine. To this end, proteomics may enable diverse intertwining research fields to reach their end goals. CEEPC will endeavor to facilitate these goals.
Integrated network analysis and effective tools in plant systems biology
Fukushima, Atsushi; Kanaya, Shigehiko; Nishida, Kozo
2014-01-01
One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms. PMID:25408696
Structural systems pharmacology: a new frontier in discovering novel drug targets.
Tan, Hepan; Ge, Xiaoxia; Xie, Lei
2013-08-01
The modern target-based drug discovery process, characterized by the one-drug-one-gene paradigm, has been of limited success. In contrast, phenotype-based screening produces thousands of active compounds but gives no hint as to what their molecular targets are or which ones merit further research. This presents a question: What is a suitable target for an efficient and safe drug? In this paper, we argue that target selection should take into account the proteome-wide energetic and kinetic landscape of drug-target interactions, as well as their cellular and organismal consequences. We propose a new paradigm of structural systems pharmacology to deconvolute the molecular targets of successful drugs as well as to identify druggable targets and their drug-like binders. Here we face two major challenges in structural systems pharmacology: How do we characterize and analyze the structural and energetic origins of drug-target interactions on a proteome scale? How do we correlate the dynamic molecular interactions to their in vivo activity? We will review recent advances in developing new computational tools for biophysics, bioinformatics, chemoinformatics, and systems biology related to the identification of genome-wide target profiles. We believe that the integration of these tools will realize structural systems pharmacology, enabling us to both efficiently develop effective therapeutics for complex diseases and combat drug resistance.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ansong, Charles; Tolic, Nikola; Purvine, Samuel O.
Complete and accurate genome annotation is crucial for comprehensive and systematic studies of biological systems. For example systems biology-oriented genome scale modeling efforts greatly benefit from accurate annotation of protein-coding genes to develop proper functioning models. However, determining protein-coding genes for most new genomes is almost completely performed by inference, using computational predictions with significant documented error rates (> 15%). Furthermore, gene prediction programs provide no information on biologically important post-translational processing events critical for protein function. With the ability to directly measure peptides arising from expressed proteins, mass spectrometry-based proteomics approaches can be used to augment and verify codingmore » regions of a genomic sequence and importantly detect post-translational processing events. In this study we utilized “shotgun” proteomics to guide accurate primary genome annotation of the bacterial pathogen Salmonella Typhimurium 14028 to facilitate a systems-level understanding of Salmonella biology. The data provides protein-level experimental confirmation for 44% of predicted protein-coding genes, suggests revisions to 48 genes assigned incorrect translational start sites, and uncovers 13 non-annotated genes missed by gene prediction programs. We also present a comprehensive analysis of post-translational processing events in Salmonella, revealing a wide range of complex chemical modifications (70 distinct modifications) and confirming more than 130 signal peptide and N-terminal methionine cleavage events in Salmonella. This study highlights several ways in which proteomics data applied during the primary stages of annotation can improve the quality of genome annotations, especially with regards to the annotation of mature protein products.« less
SELDI PROTEINCHIP-BASED LIVER BIOMARKERS IN FUNGICIDE EXPOSED ZEBRAFISH
The research presented here is part of a three-phased small fish computational toxicology project using a combination of 1) whole organism endpoints, 2) genomic, proteomic, and metabolomic approaches, and 3) computational modeling to (a) identify new molecular biomarkers of expos...
Nyman, Tuula A; Lorey, Martina B; Cypryk, Wojciech; Matikainen, Sampsa
2017-05-01
The immune system is our defense system against microbial infections and tissue injury, and understanding how it works in detail is essential for developing drugs for different diseases. Mass spectrometry-based proteomics can provide in-depth information on the molecular mechanisms involved in immune responses. Areas covered: Summarized are the key immunology findings obtained with MS-based proteomics in the past five years, with a focus on inflammasome activation, global protein secretion, mucosal immunology, immunopeptidome and T cells. Special focus is on extracellular vesicle-mediated protein secretion and its role in immune responses. Expert commentary: Proteomics is an essential part of modern omics-scale immunology research. To date, MS-based proteomics has been used in immunology to study protein expression levels, their subcellular localization, secretion, post-translational modifications, and interactions in immune cells upon activation by different stimuli. These studies have made major contributions to understanding the molecular mechanisms involved in innate and adaptive immune responses. New developments in proteomics offer constantly novel possibilities for exploring the immune system. Examples of these techniques include mass cytometry and different MS-based imaging approaches which can be widely used in immunology.
The Proteome Folding Project: Proteome-scale prediction of structure and function
Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard
2011-01-01
The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995
Unexpected features of the dark proteome
Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S.; Buckley, Michael J.; Tabor, Bruce; Signal, Beth; Gloss, Brian S.; Hammang, Christopher J.; Rost, Burkhard; Schafferhans, Andrea
2015-01-01
We surveyed the “dark” proteome–that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44–54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology. PMID:26578815
Assay Portal | Office of Cancer Clinical Proteomics Research
The CPTAC Assay Portal serves as a centralized public repository of "fit-for-purpose," multiplexed quantitative mass spectrometry-based proteomic targeted assays. Targeted proteomic assays eliminate issues that are commonly observed using conventional protein detection systems.
17th Chromosome-Centric Human Proteome Project Symposium in Tehran.
Meyfour, Anna; Pahlavan, Sara; Sobhanian, Hamid; Salekdeh, Ghasem Hosseini
2018-04-01
This report describes the 17th Chromosome-Centric Human Proteome Project which was held in Tehran, Iran, April 27 and 28, 2017. A brief summary of the symposium's talks including new technical and computational approaches for the identification of novel proteins from non-coding genomic regions, physicochemical and biological causes of missing proteins, and the close interactions between Chromosome- and Biology/Disease-driven Human Proteome Project are presented. A synopsis of decisions made on the prospective programs to maintain collaborative works, share resources and information, and establishment of a newly organized working group, the task force for missing protein analysis are discussed. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Managing expectations when publishing tools and methods for computational proteomics.
Martens, Lennart; Kohlbacher, Oliver; Weintraub, Susan T
2015-05-01
Computational tools are pivotal in proteomics because they are crucial for identification, quantification, and statistical assessment of data. The gateway to finding the best choice of a tool or approach for a particular problem is frequently journal articles, yet there is often an overwhelming variety of options that makes it hard to decide on the best solution. This is particularly difficult for nonexperts in bioinformatics. The maturity, reliability, and performance of tools can vary widely because publications may appear at different stages of development. A novel idea might merit early publication despite only offering proof-of-principle, while it may take years before a tool can be considered mature, and by that time it might be difficult for a new publication to be accepted because of a perceived lack of novelty. After discussions with members of the computational mass spectrometry community, we describe here proposed recommendations for organization of informatics manuscripts as a way to set the expectations of readers (and reviewers) through three different manuscript types that are based on existing journal designations. Brief Communications are short reports describing novel computational approaches where the implementation is not necessarily production-ready. Research Articles present both a novel idea and mature implementation that has been suitably benchmarked. Application Notes focus on a mature and tested tool or concept and need not be novel but should offer advancement from improved quality, ease of use, and/or implementation. Organizing computational proteomics contributions into these three manuscript types will facilitate the review process and will also enable readers to identify the maturity and applicability of the tool for their own workflows.
Sma3s: A universal tool for easy functional annotation of proteomes and transcriptomes.
Casimiro-Soriguer, Carlos S; Muñoz-Mérida, Antonio; Pérez-Pulido, Antonio J
2017-06-01
The current cheapening of next-generation sequencing has led to an enormous growth in the number of sequenced genomes and transcriptomes, allowing wet labs to get the sequences from their organisms of study. To make the most of these data, one of the first things that should be done is the functional annotation of the protein-coding genes. But it used to be a slow and tedious step that can involve the characterization of thousands of sequences. Sma3s is an accurate computational tool for annotating proteins in an unattended way. Now, we have developed a completely new version, which includes functionalities that will be of utility for fundamental and applied science. Currently, the results provide functional categories such as biological processes, which become useful for both characterizing particular sequence datasets and comparing results from different projects. But one of the most important implemented innovations is that it has now low computational requirements, and the complete annotation of a simple proteome or transcriptome usually takes around 24 hours in a personal computer. Sma3s has been tested with a large amount of complete proteomes and transcriptomes, and it has demonstrated its potential in health science and other specific projects. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
MannDB: A microbial annotation database for protein characterization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, C; Lam, M; Smith, J
2006-05-19
MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-sourcemore » tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports.« less
Medina-Aunon, J. Alberto; Martínez-Bartolomé, Salvador; López-García, Miguel A.; Salazar, Emilio; Navajas, Rosana; Jones, Andrew R.; Paradela, Alberto; Albar, Juan P.
2011-01-01
The development of the HUPO-PSI's (Proteomics Standards Initiative) standard data formats and MIAPE (Minimum Information About a Proteomics Experiment) guidelines should improve proteomics data sharing within the scientific community. Proteomics journals have encouraged the use of these standards and guidelines to improve the quality of experimental reporting and ease the evaluation and publication of manuscripts. However, there is an evident lack of bioinformatics tools specifically designed to create and edit standard file formats and reports, or embed them within proteomics workflows. In this article, we describe a new web-based software suite (The ProteoRed MIAPE web toolkit) that performs several complementary roles related to proteomic data standards. First, it can verify that the reports fulfill the minimum information requirements of the corresponding MIAPE modules, highlighting inconsistencies or missing information. Second, the toolkit can convert several XML-based data standards directly into human readable MIAPE reports stored within the ProteoRed MIAPE repository. Finally, it can also perform the reverse operation, allowing users to export from MIAPE reports into XML files for computational processing, data sharing, or public database submission. The toolkit is thus the first application capable of automatically linking the PSI's MIAPE modules with the corresponding XML data exchange standards, enabling bidirectional conversions. This toolkit is freely available at http://www.proteored.org/MIAPE/. PMID:21983993
Weston, Andrea D; Hood, Leroy
2004-01-01
The emergence of systems biology is bringing forth a new set of challenges for advancing science and technology. Defining ways of studying biological systems on a global level, integrating large and disparate data types, and dealing with the infrastructural changes necessary to carry out systems biology, are just a few of the extraordinary tasks of this growing discipline. Despite these challenges, the impact of systems biology will be far-reaching, and significant progress has already been made. Moving forward, the issue of how to use systems biology to improve the health of individuals must be a priority. It is becoming increasingly apparent that the field of systems biology and one of its important disciplines, proteomics, will have a major role in creating a predictive, preventative, and personalized approach to medicine. In this review, we define systems biology, discuss the current capabilities of proteomics and highlight some of the necessary milestones for moving systems biology and proteomics into mainstream health care.
Top-down Proteomics in Health and Disease: Challenges and Opportunities
Gregorich, Zachery R.; Ge, Ying
2014-01-01
Proteomics is essential for deciphering how molecules interact as a system and for understanding the functions of cellular systems in human disease; however, the unique characteristics of the human proteome, which include a high dynamic range of protein expression and extreme complexity due to a plethora of post-translational modifications (PTMs) and sequence variations, make such analyses challenging. An emerging “top-down” mass spectrometry (MS)-based proteomics approach, which provides a “bird’s eye” view of all proteoforms, has unique advantages for the assessment of PTMs and sequence variations. Recently, a number of studies have showcased the potential of top-down proteomics for unraveling of disease mechanisms and discovery of new biomarkers. Nevertheless, the top-down approach still faces significant challenges in terms of protein solubility, separation, and the detection of large intact proteins, as well as the under-developed data analysis tools. Consequently, new technological developments are urgently needed to advance the field of top-down proteomics. Herein, we intend to provide an overview of the recent applications of top-down proteomics in biomedical research. Moreover, we will outline the challenges and opportunities facing top-down proteomics strategies aimed at understanding and diagnosing human diseases. PMID:24723472
Ji, Zhiwei; Wang, Bing; Yan, Ke; Dong, Ligang; Meng, Guanmin; Shi, Lei
2017-12-21
In recent years, the integration of 'omics' technologies, high performance computation, and mathematical modeling of biological processes marks that the systems biology has started to fundamentally impact the way of approaching drug discovery. The LINCS public data warehouse provides detailed information about cell responses with various genetic and environmental stressors. It can be greatly helpful in developing new drugs and therapeutics, as well as improving the situations of lacking effective drugs, drug resistance and relapse in cancer therapies, etc. In this study, we developed a Ternary status based Integer Linear Programming (TILP) method to infer cell-specific signaling pathway network and predict compounds' treatment efficacy. The novelty of our study is that phosphor-proteomic data and prior knowledge are combined for modeling and optimizing the signaling network. To test the power of our approach, a generic pathway network was constructed for a human breast cancer cell line MCF7; and the TILP model was used to infer MCF7-specific pathways with a set of phosphor-proteomic data collected from ten representative small molecule chemical compounds (most of them were studied in breast cancer treatment). Cross-validation indicated that the MCF7-specific pathway network inferred by TILP were reliable predicting a compound's efficacy. Finally, we applied TILP to re-optimize the inferred cell-specific pathways and predict the outcomes of five small compounds (carmustine, doxorubicin, GW-8510, daunorubicin, and verapamil), which were rarely used in clinic for breast cancer. In the simulation, the proposed approach facilitates us to identify a compound's treatment efficacy qualitatively and quantitatively, and the cross validation analysis indicated good accuracy in predicting effects of five compounds. In summary, the TILP model is useful for discovering new drugs for clinic use, and also elucidating the potential mechanisms of a compound to targets.
A Computational Algorithm for Functional Clustering of Proteome Dynamics During Development
Wang, Yaqun; Wang, Ningtao; Hao, Han; Guo, Yunqian; Zhen, Yan; Shi, Jisen; Wu, Rongling
2014-01-01
Phenotypic traits, such as seed development, are a consequence of complex biochemical interactions among genes, proteins and metabolites, but the underlying mechanisms that operate in a coordinated and sequential manner remain elusive. Here, we address this issue by developing a computational algorithm to monitor proteome changes during the course of trait development. The algorithm is built within the mixture-model framework in which each mixture component is modeled by a specific group of proteins that display a similar temporal pattern of expression in trait development. A nonparametric approach based on Legendre orthogonal polynomials was used to fit dynamic changes of protein expression, increasing the power and flexibility of protein clustering. By analyzing a dataset of proteomic dynamics during early embryogenesis of the Chinese fir, the algorithm has successfully identified several distinct types of proteins that coordinate with each other to determine seed development in this forest tree commercially and environmentally important to China. The algorithm will find its immediate applications for the characterization of mechanistic underpinnings for any other biological processes in which protein abundance plays a key role. PMID:24955031
Turewicz, Michael; Kohl, Michael; Ahrens, Maike; Mayer, Gerhard; Uszkoreit, Julian; Naboulsi, Wael; Bracht, Thilo; Megger, Dominik A; Sitek, Barbara; Marcus, Katrin; Eisenacher, Martin
2017-11-10
The analysis of high-throughput mass spectrometry-based proteomics data must address the specific challenges of this technology. To this end, the comprehensive proteomics workflow offered by the de.NBI service center BioInfra.Prot provides indispensable components for the computational and statistical analysis of this kind of data. These components include tools and methods for spectrum identification and protein inference, protein quantification, expression analysis as well as data standardization and data publication. All particular methods of the workflow which address these tasks are state-of-the-art or cutting edge. As has been shown in previous publications, each of these methods is adequate to solve its specific task and gives competitive results. However, the methods included in the workflow are continuously reviewed, updated and improved to adapt to new scientific developments. All of these particular components and methods are available as stand-alone BioInfra.Prot services or as a complete workflow. Since BioInfra.Prot provides manifold fast communication channels to get access to all components of the workflow (e.g., via the BioInfra.Prot ticket system: bioinfraprot@rub.de) users can easily benefit from this service and get support by experts. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Dinakaran, Vasudevan; Sridhar, Jayavel; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash
2016-01-01
Brucella spp. are facultative intracellular pathogens that cause brucellosis in various mammals including humans. Brucella survive inside the host cells by forming vacuoles and subverting host defence systems. This study was aimed to predict the secretion systems and the secretomes of Brucella spp. from 39 complete genome sequences available in the databases. Furthermore, an attempt was made to identify the type IV secretion effectors and their interactions with host proteins. We predicted the secretion systems of Brucella by the KEGG pathway and SecReT4. Brucella secretomes and type IV effectors (T4SEs) were predicted through genome-wide screening using JVirGel and S4TE, respectively. Protein-protein interactions of Brucella T4SEs with their hosts were analyzed by HPIDB 2.0. Genes coding for Sec and Tat pathways of secretion and type I (T1SS), type IV (T4SS) and type V (T5SS) secretion systems were identified and they are conserved in all the species of Brucella. In addition to the well-known VirB operon coding for the type IV secretion system (T4SS), we have identified the presence of additional genes showing homology with T4SS of other organisms. On the whole, 10.26 to 14.94% of total proteomes were found to be either secreted (secretome) or membrane associated (membrane proteome). Approximately, 1.7 to 3.0% of total proteomes were identified as type IV secretion effectors (T4SEs). Prediction of protein-protein interactions showed 29 and 36 host-pathogen specific interactions between Bos taurus (cattle)-B. abortus and Ovis aries (sheep)-B. melitensis, respectively. Functional characterization of the predicted T4SEs and their interactions with their respective hosts may reveal the secrets of host specificity of Brucella.
CEBS object model for systems biology data, SysBio-OM.
Xirasagar, Sandhya; Gustafson, Scott; Merrick, B Alex; Tomer, Kenneth B; Stasiewicz, Stanley; Chan, Denny D; Yost, Kenneth J; Yates, John R; Sumner, Susan; Xiao, Nianqing; Waters, Michael D
2004-09-01
To promote a systems biology approach to understanding the biological effects of environmental stressors, the Chemical Effects in Biological Systems (CEBS) knowledge base is being developed to house data from multiple complex data streams in a systems friendly manner that will accommodate extensive querying from users. Unified data representation via a single object model will greatly aid in integrating data storage and management, and facilitate reuse of software to analyze and display data resulting from diverse differential expression or differential profile technologies. Data streams include, but are not limited to, gene expression analysis (transcriptomics), protein expression and protein-protein interaction analysis (proteomics) and changes in low molecular weight metabolite levels (metabolomics). To enable the integration of microarray gene expression, proteomics and metabolomics data in the CEBS system, we designed an object model, Systems Biology Object Model (SysBio-OM). The model is comprehensive and leverages other open source efforts, namely the MicroArray Gene Expression Object Model (MAGE-OM) and the Proteomics Experiment Data Repository (PEDRo) object model. SysBio-OM is designed by extending MAGE-OM to represent protein expression data elements (including those from PEDRo), protein-protein interaction and metabolomics data. SysBio-OM promotes the standardization of data representation and data quality by facilitating the capture of the minimum annotation required for an experiment. Such standardization refines the accuracy of data mining and interpretation. The open source SysBio-OM model, which can be implemented on varied computing platforms is presented here. A universal modeling language depiction of the entire SysBio-OM is available at http://cebs.niehs.nih.gov/SysBioOM/. The Rational Rose object model package is distributed under an open source license that permits unrestricted academic and commercial use and is available at http://cebs.niehs.nih.gov/cebsdownloads. The database and interface are being built to implement the model and will be available for public use at http://cebs.niehs.nih.gov.
Computational Methods to Predict Protein Interaction Partners
NASA Astrophysics Data System (ADS)
Valencia, Alfonso; Pazos, Florencio
In the new paradigm for studying biological phenomena represented by Systems Biology, cellular components are not considered in isolation but as forming complex networks of relationships. Protein interaction networks are among the first objects studied from this new point of view. Deciphering the interactome (the whole network of interactions for a given proteome) has been shown to be a very complex task. Computational techniques for detecting protein interactions have become standard tools for dealing with this problem, helping and complementing their experimental counterparts. Most of these techniques use genomic or sequence features intuitively related with protein interactions and are based on "first principles" in the sense that they do not involve training with examples. There are also other computational techniques that use other sources of information (i.e. structural information or even experimental data) or are based on training with examples.
Informatics and computational strategies for the study of lipids.
Yetukuri, Laxman; Ekroos, Kim; Vidal-Puig, Antonio; Oresic, Matej
2008-02-01
Recent advances in mass spectrometry (MS)-based techniques for lipidomic analysis have empowered us with the tools that afford studies of lipidomes at the systems level. However, these techniques pose a number of challenges for lipidomic raw data processing, lipid informatics, and the interpretation of lipidomic data in the context of lipid function and structure. Integration of lipidomic data with other systemic levels, such as genomic or proteomic, in the context of molecular pathways and biophysical processes provides a basis for the understanding of lipid function at the systems level. The present report, based on the limited literature, is an update on a young but rapidly emerging field of lipid informatics and related pathway reconstruction strategies.
Proteomic approaches to study the pig intestinal system.
Soler, Laura; Niewold, Theo A; Moreno, Ángela; Garrido, Juan Jose
2014-03-01
One of the major challenges in pig production is managing digestive health to maximize feed conversion and growth rates, but also to minimize treatment costs and to warrant public health. There is a great interest in the development of useful tools for intestinal health monitoring and the investigation of possible prophylactic/ therapeutic intervention pathways. A great variety of in vivo and in vitro intestinal models of study have been developed in the recent years. The understanding of such a complex system as the intestinal system (IS), and the study of its physiology and pathology is not an easy task. Analysis of such a complex system requires the use of systems biology techniques, like proteomics. However, for a correct interpretation of results and to maximize analysis performance, a careful selection of the IS model of study and proteomic platform is required. The study of the IS system is especially important in the pig, a species whose farming requires a very careful management of husbandry procedures regarding feeding and nutrition. The incorrect management of the pig digestive system leads directly to economic losses related suboptimal growth and feed utilization and/or the appearance of intestinal infections, in particular diarrhea. Furthermore, this species is the most suitable experimental model for human IS studies. Proteomics has risen as one of the most promising approaches to study the pig IS. In this review, we describe the most useful models of IS research in porcine and the different proteomic platforms available. An overview of the recent findings in pig IS proteomics is also provided.
Hur, Manhoi; Campbell, Alexis Ann; Almeida-de-Macedo, Marcia; Li, Ling; Ransom, Nick; Jose, Adarsh; Crispin, Matt; Nikolau, Basil J; Wurtele, Eve Syrkin
2013-04-01
Discovering molecular components and their functionality is key to the development of hypotheses concerning the organization and regulation of metabolic networks. The iterative experimental testing of such hypotheses is the trajectory that can ultimately enable accurate computational modelling and prediction of metabolic outcomes. This information can be particularly important for understanding the biology of natural products, whose metabolism itself is often only poorly defined. Here, we describe factors that must be in place to optimize the use of metabolomics in predictive biology. A key to achieving this vision is a collection of accurate time-resolved and spatially defined metabolite abundance data and associated metadata. One formidable challenge associated with metabolite profiling is the complexity and analytical limits associated with comprehensively determining the metabolome of an organism. Further, for metabolomics data to be efficiently used by the research community, it must be curated in publicly available metabolomics databases. Such databases require clear, consistent formats, easy access to data and metadata, data download, and accessible computational tools to integrate genome system-scale datasets. Although transcriptomics and proteomics integrate the linear predictive power of the genome, the metabolome represents the nonlinear, final biochemical products of the genome, which results from the intricate system(s) that regulate genome expression. For example, the relationship of metabolomics data to the metabolic network is confounded by redundant connections between metabolites and gene-products. However, connections among metabolites are predictable through the rules of chemistry. Therefore, enhancing the ability to integrate the metabolome with anchor-points in the transcriptome and proteome will enhance the predictive power of genomics data. We detail a public database repository for metabolomics, tools and approaches for statistical analysis of metabolomics data, and methods for integrating these datasets with transcriptomic data to create hypotheses concerning specialized metabolisms that generate the diversity in natural product chemistry. We discuss the importance of close collaborations among biologists, chemists, computer scientists and statisticians throughout the development of such integrated metabolism-centric databases and software.
Hur, Manhoi; Campbell, Alexis Ann; Almeida-de-Macedo, Marcia; Li, Ling; Ransom, Nick; Jose, Adarsh; Crispin, Matt; Nikolau, Basil J.
2013-01-01
Discovering molecular components and their functionality is key to the development of hypotheses concerning the organization and regulation of metabolic networks. The iterative experimental testing of such hypotheses is the trajectory that can ultimately enable accurate computational modelling and prediction of metabolic outcomes. This information can be particularly important for understanding the biology of natural products, whose metabolism itself is often only poorly defined. Here, we describe factors that must be in place to optimize the use of metabolomics in predictive biology. A key to achieving this vision is a collection of accurate time-resolved and spatially defined metabolite abundance data and associated metadata. One formidable challenge associated with metabolite profiling is the complexity and analytical limits associated with comprehensively determining the metabolome of an organism. Further, for metabolomics data to be efficiently used by the research community, it must be curated in publically available metabolomics databases. Such databases require clear, consistent formats, easy access to data and metadata, data download, and accessible computational tools to integrate genome system-scale datasets. Although transcriptomics and proteomics integrate the linear predictive power of the genome, the metabolome represents the nonlinear, final biochemical products of the genome, which results from the intricate system(s) that regulate genome expression. For example, the relationship of metabolomics data to the metabolic network is confounded by redundant connections between metabolites and gene-products. However, connections among metabolites are predictable through the rules of chemistry. Therefore, enhancing the ability to integrate the metabolome with anchor-points in the transcriptome and proteome will enhance the predictive power of genomics data. We detail a public database repository for metabolomics, tools and approaches for statistical analysis of metabolomics data, and methods for integrating these dataset with transcriptomic data to create hypotheses concerning specialized metabolism that generates the diversity in natural product chemistry. We discuss the importance of close collaborations among biologists, chemists, computer scientists and statisticians throughout the development of such integrated metabolism-centric databases and software. PMID:23447050
Özdemir, Vural; Dove, Edward S; Gürsoy, Ulvi K; Şardaş, Semra; Yıldırım, Arif; Yılmaz, Şenay Görücü; Ömer Barlas, I; Güngör, Kıvanç; Mete, Alper; Srivastava, Sanjeeva
2017-01-01
No field in science and medicine today remains untouched by Big Data, and psychiatry is no exception. Proteomics is a Big Data technology and a next generation biomarker, supporting novel system diagnostics and therapeutics in psychiatry. Proteomics technology is, in fact, much older than genomics and dates to the 1970s, well before the launch of the international Human Genome Project. While the genome has long been framed as the master or "elite" executive molecule in cell biology, the proteome by contrast is humble. Yet the proteome is critical for life-it ensures the daily functioning of cells and whole organisms. In short, proteins are the blue-collar workers of biology, the down-to-earth molecules that we cannot live without. Since 2010, proteomics has found renewed meaning and international attention with the launch of the Human Proteome Project and the growing interest in Big Data technologies such as proteomics. This article presents an interdisciplinary technology foresight analysis and conceptualizes the terms "environtome" and "social proteome". We define "environtome" as the entire complement of elements external to the human host, from microbiome, ambient temperature and weather conditions to government innovation policies, stock market dynamics, human values, political power and social norms that collectively shape the human host spatially and temporally. The "social proteome" is the subset of the environtome that influences the transition of proteomics technology to innovative applications in society. The social proteome encompasses, for example, new reimbursement schemes and business innovation models for proteomics diagnostics that depart from the "once-a-life-time" genotypic tests and the anticipated hype attendant to context and time sensitive proteomics tests. Building on the "nesting principle" for governance of complex systems as discussed by Elinor Ostrom, we propose here a 3-tiered organizational architecture for Big Data science such as proteomics. The proposed nested governance structure is comprised of (a) scientists, (b) ethicists, and (c) scholars in the nascent field of "ethics-of-ethics", and aims to cultivate a robust social proteome for personalized medicine. Ostrom often noted that such nested governance designs offer assurance that political power embedded in innovation processes is distributed evenly and is not concentrated disproportionately in a single overbearing stakeholder or person. We agree with this assessment and conclude by underscoring the synergistic value of social and biological proteomes to realize the full potentials of proteomics science for personalized medicine in psychiatry in the present era of Big Data.
Morris, Jeffrey S
2012-01-01
In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational aspects of comparative proteomic studies, and summarizes contributions I along with numerous collaborators have made. First, there is an overview of comparative proteomics technologies, followed by a discussion of important experimental design and preprocessing issues that must be considered before statistical analysis can be done. Next, the two key approaches to analyzing proteomics data, feature extraction and functional modeling, are described. Feature extraction involves detection and quantification of discrete features like peaks or spots that theoretically correspond to different proteins in the sample. After an overview of the feature extraction approach, specific methods for mass spectrometry ( Cromwell ) and 2D gel electrophoresis ( Pinnacle ) are described. The functional modeling approach involves modeling the proteomic data in their entirety as functions or images. A general discussion of the approach is followed by the presentation of a specific method that can be applied, wavelet-based functional mixed models, and its extensions. All methods are illustrated by application to two example proteomic data sets, one from mass spectrometry and one from 2D gel electrophoresis. While the specific methods presented are applied to two specific proteomic technologies, MALDI-TOF and 2D gel electrophoresis, these methods and the other principles discussed in the paper apply much more broadly to other expression proteomics technologies.
2012-01-01
Background Biomarker panels derived separately from genomic and proteomic data and with a variety of computational methods have demonstrated promising classification performance in various diseases. An open question is how to create effective proteo-genomic panels. The framework of ensemble classifiers has been applied successfully in various analytical domains to combine classifiers so that the performance of the ensemble exceeds the performance of individual classifiers. Using blood-based diagnosis of acute renal allograft rejection as a case study, we address the following question in this paper: Can acute rejection classification performance be improved by combining individual genomic and proteomic classifiers in an ensemble? Results The first part of the paper presents a computational biomarker development pipeline for genomic and proteomic data. The pipeline begins with data acquisition (e.g., from bio-samples to microarray data), quality control, statistical analysis and mining of the data, and finally various forms of validation. The pipeline ensures that the various classifiers to be combined later in an ensemble are diverse and adequate for clinical use. Five mRNA genomic and five proteomic classifiers were developed independently using single time-point blood samples from 11 acute-rejection and 22 non-rejection renal transplant patients. The second part of the paper examines five ensembles ranging in size from two to 10 individual classifiers. Performance of ensembles is characterized by area under the curve (AUC), sensitivity, and specificity, as derived from the probability of acute rejection for individual classifiers in the ensemble in combination with one of two aggregation methods: (1) Average Probability or (2) Vote Threshold. One ensemble demonstrated superior performance and was able to improve sensitivity and AUC beyond the best values observed for any of the individual classifiers in the ensemble, while staying within the range of observed specificity. The Vote Threshold aggregation method achieved improved sensitivity for all 5 ensembles, but typically at the cost of decreased specificity. Conclusion Proteo-genomic biomarker ensemble classifiers show promise in the diagnosis of acute renal allograft rejection and can improve classification performance beyond that of individual genomic or proteomic classifiers alone. Validation of our results in an international multicenter study is currently underway. PMID:23216969
Günther, Oliver P; Chen, Virginia; Freue, Gabriela Cohen; Balshaw, Robert F; Tebbutt, Scott J; Hollander, Zsuzsanna; Takhar, Mandeep; McMaster, W Robert; McManus, Bruce M; Keown, Paul A; Ng, Raymond T
2012-12-08
Biomarker panels derived separately from genomic and proteomic data and with a variety of computational methods have demonstrated promising classification performance in various diseases. An open question is how to create effective proteo-genomic panels. The framework of ensemble classifiers has been applied successfully in various analytical domains to combine classifiers so that the performance of the ensemble exceeds the performance of individual classifiers. Using blood-based diagnosis of acute renal allograft rejection as a case study, we address the following question in this paper: Can acute rejection classification performance be improved by combining individual genomic and proteomic classifiers in an ensemble? The first part of the paper presents a computational biomarker development pipeline for genomic and proteomic data. The pipeline begins with data acquisition (e.g., from bio-samples to microarray data), quality control, statistical analysis and mining of the data, and finally various forms of validation. The pipeline ensures that the various classifiers to be combined later in an ensemble are diverse and adequate for clinical use. Five mRNA genomic and five proteomic classifiers were developed independently using single time-point blood samples from 11 acute-rejection and 22 non-rejection renal transplant patients. The second part of the paper examines five ensembles ranging in size from two to 10 individual classifiers. Performance of ensembles is characterized by area under the curve (AUC), sensitivity, and specificity, as derived from the probability of acute rejection for individual classifiers in the ensemble in combination with one of two aggregation methods: (1) Average Probability or (2) Vote Threshold. One ensemble demonstrated superior performance and was able to improve sensitivity and AUC beyond the best values observed for any of the individual classifiers in the ensemble, while staying within the range of observed specificity. The Vote Threshold aggregation method achieved improved sensitivity for all 5 ensembles, but typically at the cost of decreased specificity. Proteo-genomic biomarker ensemble classifiers show promise in the diagnosis of acute renal allograft rejection and can improve classification performance beyond that of individual genomic or proteomic classifiers alone. Validation of our results in an international multicenter study is currently underway.
MPA Portable: A Stand-Alone Software Package for Analyzing Metaproteome Samples on the Go.
Muth, Thilo; Kohrs, Fabian; Heyer, Robert; Benndorf, Dirk; Rapp, Erdmann; Reichl, Udo; Martens, Lennart; Renard, Bernhard Y
2018-01-02
Metaproteomics, the mass spectrometry-based analysis of proteins from multispecies samples faces severe challenges concerning data analysis and results interpretation. To overcome these shortcomings, we here introduce the MetaProteomeAnalyzer (MPA) Portable software. In contrast to the original server-based MPA application, this newly developed tool no longer requires computational expertise for installation and is now independent of any relational database system. In addition, MPA Portable now supports state-of-the-art database search engines and a convenient command line interface for high-performance data processing tasks. While search engine results can easily be combined to increase the protein identification yield, an additional two-step workflow is implemented to provide sufficient analysis resolution for further postprocessing steps, such as protein grouping as well as taxonomic and functional annotation. Our new application has been developed with a focus on intuitive usability, adherence to data standards, and adaptation to Web-based workflow platforms. The open source software package can be found at https://github.com/compomics/meta-proteome-analyzer .
Mammary molecular portraits reveal lineage-specific features and progenitor cell vulnerabilities.
Casey, Alison E; Sinha, Ankit; Singhania, Rajat; Livingstone, Julie; Waterhouse, Paul; Tharmapalan, Pirashaanthy; Cruickshank, Jennifer; Shehata, Mona; Drysdale, Erik; Fang, Hui; Kim, Hyeyeon; Isserlin, Ruth; Bailey, Swneke; Medina, Tiago; Deblois, Genevieve; Shiah, Yu-Jia; Barsyte-Lovejoy, Dalia; Hofer, Stefan; Bader, Gary; Lupien, Mathieu; Arrowsmith, Cheryl; Knapp, Stefan; De Carvalho, Daniel; Berman, Hal; Boutros, Paul C; Kislinger, Thomas; Khokha, Rama
2018-06-19
The mammary epithelium depends on specific lineages and their stem and progenitor function to accommodate hormone-triggered physiological demands in the adult female. Perturbations of these lineages underpin breast cancer risk, yet our understanding of normal mammary cell composition is incomplete. Here, we build a multimodal resource for the adult gland through comprehensive profiling of primary cell epigenomes, transcriptomes, and proteomes. We define systems-level relationships between chromatin-DNA-RNA-protein states, identify lineage-specific DNA methylation of transcription factor binding sites, and pinpoint proteins underlying progesterone responsiveness. Comparative proteomics of estrogen and progesterone receptor-positive and -negative cell populations, extensive target validation, and drug testing lead to discovery of stem and progenitor cell vulnerabilities. Top epigenetic drugs exert cytostatic effects; prevent adult mammary cell expansion, clonogenicity, and mammopoiesis; and deplete stem cell frequency. Select drugs also abrogate human breast progenitor cell activity in normal and high-risk patient samples. This integrative computational and functional study provides fundamental insight into mammary lineage and stem cell biology. © 2018 Casey et al.
Assembling proteomics data as a prerequisite for the analysis of large scale experiments
Schmidt, Frank; Schmid, Monika; Thiede, Bernd; Pleißner, Klaus-Peter; Böhme, Martina; Jungblut, Peter R
2009-01-01
Background Despite the complete determination of the genome sequence of a huge number of bacteria, their proteomes remain relatively poorly defined. Beside new methods to increase the number of identified proteins new database applications are necessary to store and present results of large- scale proteomics experiments. Results In the present study, a database concept has been developed to address these issues and to offer complete information via a web interface. In our concept, the Oracle based data repository system SQL-LIMS plays the central role in the proteomics workflow and was applied to the proteomes of Mycobacterium tuberculosis, Helicobacter pylori, Salmonella typhimurium and protein complexes such as 20S proteasome. Technical operations of our proteomics labs were used as the standard for SQL-LIMS template creation. By means of a Java based data parser, post-processed data of different approaches, such as LC/ESI-MS, MALDI-MS and 2-D gel electrophoresis (2-DE), were stored in SQL-LIMS. A minimum set of the proteomics data were transferred in our public 2D-PAGE database using a Java based interface (Data Transfer Tool) with the requirements of the PEDRo standardization. Furthermore, the stored proteomics data were extractable out of SQL-LIMS via XML. Conclusion The Oracle based data repository system SQL-LIMS played the central role in the proteomics workflow concept. Technical operations of our proteomics labs were used as standards for SQL-LIMS templates. Using a Java based parser, post-processed data of different approaches such as LC/ESI-MS, MALDI-MS and 1-DE and 2-DE were stored in SQL-LIMS. Thus, unique data formats of different instruments were unified and stored in SQL-LIMS tables. Moreover, a unique submission identifier allowed fast access to all experimental data. This was the main advantage compared to multi software solutions, especially if personnel fluctuations are high. Moreover, large scale and high-throughput experiments must be managed in a comprehensive repository system such as SQL-LIMS, to query results in a systematic manner. On the other hand, these database systems are expensive and require at least one full time administrator and specialized lab manager. Moreover, the high technical dynamics in proteomics may cause problems to adjust new data formats. To summarize, SQL-LIMS met the requirements of proteomics data handling especially in skilled processes such as gel-electrophoresis or mass spectrometry and fulfilled the PSI standardization criteria. The data transfer into a public domain via DTT facilitated validation of proteomics data. Additionally, evaluation of mass spectra by post-processing using MS-Screener improved the reliability of mass analysis and prevented storage of data junk. PMID:19166578
Linking the proteins--elucidation of proteome-scale networks using mass spectrometry.
Pflieger, Delphine; Gonnet, Florence; de la Fuente van Bentem, Sergio; Hirt, Heribert; de la Fuente, Alberto
2011-01-01
Proteomes are intricate. Typically, thousands of proteins interact through physical association and post-translational modifications (PTMs) to give rise to the emergent functions of cells. Understanding these functions requires one to study proteomes as "systems" rather than collections of individual protein molecules. The abstraction of the interacting proteome to "protein networks" has recently gained much attention, as networks are effective representations, that lose specific molecular details, but provide the ability to see the proteome as a whole. Mostly two aspects of the proteome have been represented by network models: proteome-wide physical protein-protein-binding interactions organized into Protein Interaction Networks (PINs), and proteome-wide PTM relations organized into Protein Signaling Networks (PSNs). Mass spectrometry (MS) techniques have been shown to be essential to reveal both of these aspects on a proteome-wide scale. Techniques such as affinity purification followed by MS have been used to elucidate protein-protein interactions, and MS-based quantitative phosphoproteomics is critical to understand the structure and dynamics of signaling through the proteome. We here review the current state-of-the-art MS-based analytical pipelines for the purpose to characterize proteome-scale networks. Copyright © 2010 Wiley Periodicals, Inc.
Proteomics: Protein Identification Using Online Databases
ERIC Educational Resources Information Center
Eurich, Chris; Fields, Peter A.; Rice, Elizabeth
2012-01-01
Proteomics is an emerging area of systems biology that allows simultaneous study of thousands of proteins expressed in cells, tissues, or whole organisms. We have developed this activity to enable high school or college students to explore proteomic databases using mass spectrometry data files generated from yeast proteins in a college laboratory…
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23467006
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Webb-Robertson, Bobbie-Jo M.; Wiberg, Holli K.; Matzke, Melissa M.; ...
2015-04-09
In this review, we apply selected imputation strategies to label-free liquid chromatography–mass spectrometry (LC–MS) proteomics datasets to evaluate the accuracy with respect to metrics of variance and classification. We evaluate several commonly used imputation approaches for individual merits and discuss the caveats of each approach with respect to the example LC–MS proteomics data. In general, local similarity-based approaches, such as the regularized expectation maximization and least-squares adaptive algorithms, yield the best overall performances with respect to metrics of accuracy and robustness. However, no single algorithm consistently outperforms the remaining approaches, and in some cases, performing classification without imputation sometimes yieldedmore » the most accurate classification. Thus, because of the complex mechanisms of missing data in proteomics, which also vary from peptide to protein, no individual method is a single solution for imputation. In summary, on the basis of the observations in this review, the goal for imputation in the field of computational proteomics should be to develop new approaches that work generically for this data type and new strategies to guide users in the selection of the best imputation for their dataset and analysis objectives.« less
Nanoliter-Scale Oil-Air-Droplet Chip-Based Single Cell Proteomic Analysis.
Li, Zi-Yi; Huang, Min; Wang, Xiu-Kun; Zhu, Ying; Li, Jin-Song; Wong, Catherine C L; Fang, Qun
2018-04-17
Single cell proteomic analysis provides crucial information on cellular heterogeneity in biological systems. Herein, we describe a nanoliter-scale oil-air-droplet (OAD) chip for achieving multistep complex sample pretreatment and injection for single cell proteomic analysis in the shotgun mode. By using miniaturized stationary droplet microreaction and manipulation techniques, our system allows all sample pretreatment and injection procedures to be performed in a nanoliter-scale droplet with minimum sample loss and a high sample injection efficiency (>99%), thus substantially increasing the analytical sensitivity for single cell samples. We applied the present system in the proteomic analysis of 100 ± 10, 50 ± 5, 10, and 1 HeLa cell(s), and protein IDs of 1360, 612, 192, and 51 were identified, respectively. The OAD chip-based system was further applied in single mouse oocyte analysis, with 355 protein IDs identified at the single oocyte level, which demonstrated its special advantages of high enrichment of sequence coverage, hydrophobic proteins, and enzymatic digestion efficiency over the traditional in-tube system.
Gadher, Suresh Jivan; Marczak, Łukasz; Łuczak, Magdalena; Stobiecki, Maciej; Widlak, Piotr; Kovarova, Hana
2016-01-01
Every year since 2007, the Central and Eastern European Proteomic Conference (CEEPC) has excelled in representing state-of-the-art proteomics in and around Central and Eastern Europe, and linking it to international institutions worldwide. Its mission remains to contribute to all approaches of proteomics including traditional and often-revisited methodologies as well as the latest technological achievements in clinical, quantitative and structural proteomics with a view to systems biology of a variety of processes. The 9th CEEPC was held from June 15th to 18th, 2015, at the Institute of Bioorganic Chemistry, Polish Academy of Sciences in Poznań, Poland. The scientific program stimulated exchange of proteomic knowledge whilst the spectacular venue of the conference allowed participants to enjoy the cobblestoned historical city of Poznań.
Characterization of individual mouse cerebrospinal fluid proteomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, Jeffrey S.; Angel, Thomas E.; Chavkin, Charles
2014-03-20
Analysis of cerebrospinal fluid (CSF) offers key insight into the status of the central nervous system. Characterization of murine CSF proteomes can provide a valuable resource for studying central nervous system injury and disease in animal models. However, the small volume of CSF in mice has thus far limited individual mouse proteome characterization. Through non-terminal CSF extractions in C57Bl/6 mice and high-resolution liquid chromatography-mass spectrometry analysis of individual murine samples, we report the most comprehensive proteome characterization of individual murine CSF to date. Utilizing stringent protein inclusion criteria that required the identification of at least two unique peptides (1% falsemore » discovery rate at the peptide level) we identified a total of 566 unique proteins, including 128 proteins from three individual CSF samples that have been previously identified in brain tissue. Our methods and analysis provide a mechanism for individual murine CSF proteome analysis.« less
An automated method for detecting alternatively spliced protein domains.
Coelho, Vitor; Sammeth, Michael
2018-06-01
Alternative splicing (AS) has been demonstrated to play a role in shaping eukaryotic gene diversity at the transcriptional level. However, the impact of AS on the proteome is still controversial. Studies that seek to explore the effect of AS at the proteomic level are hampered by technical difficulties in the cumbersome process of casting forth and back between genome, transcriptome and proteome space coordinates, and the naïve prediction of protein domains in the presence of AS suffers many redundant sequence scans that emerge from constitutively spliced regions that are shared between alternative products of a gene. We developed the AstaFunk pipeline that computes for every generic transcriptome all domains that are altered by AS events in a systematic and efficient manner. In a nutshell, our method employs Viterbi dynamic programming, which guarantees to find all score-optimal hits of the domains under consideration, while complementary optimisations at different levels avoid redundant and other irrelevant computations. We evaluate AstaFunk qualitatively and quantitatively using RNAseq in well-studied genes with AS, and on large-scale employing entire transcriptomes. Our study confirms complementary reports that the effect of most AS events on the proteome seems to be rather limited, but our results also pinpoint several cases where AS could have a major impact on the function of a protein domain. The JAVA implementation of AstaFunk is available as an open source project on http://astafunk.sammeth.net. micha@sammeth.net. Supplementary data are available at Bioinformatics online.
Helsens, Kenny; Colaert, Niklaas; Barsnes, Harald; Muth, Thilo; Flikka, Kristian; Staes, An; Timmerman, Evy; Wortelkamp, Steffi; Sickmann, Albert; Vandekerckhove, Joël; Gevaert, Kris; Martens, Lennart
2010-03-01
MS-based proteomics produces large amounts of mass spectra that require processing, identification and possibly quantification before interpretation can be undertaken. High-throughput studies require automation of these various steps, and management of the data in association with the results obtained. We here present ms_lims (http://genesis.UGent.be/ms_lims), a freely available, open-source system based on a central database to automate data management and processing in MS-driven proteomics analyses.
Human liver proteome project: plan, progress, and perspectives.
He, Fuchu
2005-12-01
The Human Liver Proteome Project is the first initiative of the human proteome project for human organs/tissues and aims at writing a modern Prometheus myth. Its global scientific objectives are to reveal the "solar system" of the human liver proteome, expression profiles, modification profiles, a protein linkage (protein-protein interaction) map, and a proteome localization map, and to define an ORFeome, physiome, and pathome. Since it was first proposed in April 2002, the Human Liver Proteome Project has attracted more than 100 laboratories from all over the world. In the ensuing 3 years, we set up a management infrastructure, identified reference laboratories, confirmed standard operating procedures, initiated international research collaborations, and finally achieved the first set of expression profile data.
Time-resolved Global and Chromatin Proteomics during Herpes Simplex Virus Type 1 (HSV-1) Infection*
Kulej, Katarzyna; Avgousti, Daphne C.; Sidoli, Simone; Herrmann, Christin; Della Fera, Ashley N.; Kim, Eui Tae; Garcia, Benjamin A.; Weitzman, Matthew D.
2017-01-01
Herpes simplex virus (HSV-1) lytic infection results in global changes to the host cell proteome and the proteins associated with host chromatin. We present a system level characterization of proteome dynamics during infection by performing a multi-dimensional analysis during HSV-1 lytic infection of human foreskin fibroblast (HFF) cells. Our study includes identification and quantification of the host and viral proteomes, phosphoproteomes, chromatin bound proteomes and post-translational modifications (PTMs) on cellular histones during infection. We analyzed proteomes across six time points of virus infection (0, 3, 6, 9, 12 and 15 h post-infection) and clustered trends in abundance using fuzzy c-means. Globally, we accurately quantified more than 4000 proteins, 200 differently modified histone peptides and 9000 phosphorylation sites on cellular proteins. In addition, we identified 67 viral proteins and quantified 571 phosphorylation events (465 with high confidence site localization) on viral proteins, which is currently the most comprehensive map of HSV-1 phosphoproteome. We investigated chromatin bound proteins by proteomic analysis of the high-salt chromatin fraction and identified 510 proteins that were significantly different in abundance during infection. We found 53 histone marks significantly regulated during virus infection, including a steady increase of histone H3 acetylation (H3K9ac and H3K14ac). Our data provide a resource of unprecedented depth for human and viral proteome dynamics during infection. Collectively, our results indicate that the proteome composition of the chromatin of HFF cells is highly affected during HSV-1 infection, and that phosphorylation events are abundant on viral proteins. We propose that our epi-proteomics approach will prove to be important in the characterization of other model infectious systems that involve changes to chromatin composition. PMID:28179408
Cloud parallel processing of tandem mass spectrometry based proteomics data.
Mohammed, Yassene; Mostovenko, Ekaterina; Henneman, Alex A; Marissen, Rob J; Deelder, André M; Palmblad, Magnus
2012-10-05
Data analysis in mass spectrometry based proteomics struggles to keep pace with the advances in instrumentation and the increasing rate of data acquisition. Analyzing this data involves multiple steps requiring diverse software, using different algorithms and data formats. Speed and performance of the mass spectral search engines are continuously improving, although not necessarily as needed to face the challenges of acquired big data. Improving and parallelizing the search algorithms is one possibility; data decomposition presents another, simpler strategy for introducing parallelism. We describe a general method for parallelizing identification of tandem mass spectra using data decomposition that keeps the search engine intact and wraps the parallelization around it. We introduce two algorithms for decomposing mzXML files and recomposing resulting pepXML files. This makes the approach applicable to different search engines, including those relying on sequence databases and those searching spectral libraries. We use cloud computing to deliver the computational power and scientific workflow engines to interface and automate the different processing steps. We show how to leverage these technologies to achieve faster data analysis in proteomics and present three scientific workflows for parallel database as well as spectral library search using our data decomposition programs, X!Tandem and SpectraST.
Quantitative proteomics in Giardia duodenalis-Achievements and challenges.
Emery, Samantha J; Lacey, Ernest; Haynes, Paul A
2016-08-01
Giardia duodenalis (syn. G. lamblia and G. intestinalis) is a protozoan parasite of vertebrates and a major contributor to the global burden of diarrheal diseases and gastroenteritis. The publication of multiple genome sequences in the G. duodenalis species complex has provided important insights into parasite biology, and made post-genomic technologies, including proteomics, significantly more accessible. The aims of proteomics are to identify and quantify proteins present in a cell, and assign functions to them within the context of dynamic biological systems. In Giardia, proteomics in the post-genomic era has transitioned from reliance on gel-based systems to utilisation of a diverse array of techniques based on bottom-up LC-MS/MS technologies. Together, these have generated crucial foundations for subcellular proteomes, elucidated intra- and inter-assemblage isolate variation, and identified pathways and markers in differentiation, host-parasite interactions and drug resistance. However, in Giardia, proteomics remains an emerging field, with considerable shortcomings evident from the published research. These include a bias towards assemblage A, a lack of emphasis on quantitative analytical techniques, and limited information on post-translational protein modifications. Additionally, there are multiple areas of research for which proteomic data is not available to add value to published transcriptomic data. The challenge of amalgamating data in the systems biology paradigm necessitates the further generation of large, high-quality quantitative datasets to accurately model parasite biology. This review surveys the current proteomic research available for Giardia and evaluates their technical and quantitative approaches, while contextualising their biological insights into parasite pathology, isolate variation and eukaryotic evolution. Finally, we propose areas of priority for the generation of future proteomic data to explore fundamental questions in Giardia, including the analysis of post-translational modifications, and the design of MS-based assays for validation of differentially expressed proteins in large datasets. Copyright © 2016 Elsevier B.V. All rights reserved.
Woo, Jongmin; Han, Dohyun; Wang, Joseph Injae; Park, Joonho; Kim, Hyunsoo; Kim, Youngsoo
2017-09-01
The development of systematic proteomic quantification techniques in systems biology research has enabled one to perform an in-depth analysis of cellular systems. We have developed a systematic proteomic approach that encompasses the spectrum from global to targeted analysis on a single platform. We have applied this technique to an activated microglia cell system to examine changes in the intracellular and extracellular proteomes. Microglia become activated when their homeostatic microenvironment is disrupted. There are varying degrees of microglial activation, and we chose to focus on the proinflammatory reactive state that is induced by exposure to such stimuli as lipopolysaccharide (LPS) and interferon-gamma (IFN-γ). Using an improved shotgun proteomics approach, we identified 5497 proteins in the whole-cell proteome and 4938 proteins in the secretome that were associated with the activation of BV2 mouse microglia by LPS or IFN-γ. Of the differentially expressed proteins in stimulated microglia, we classified pathways that were related to immune-inflammatory responses and metabolism. Our label-free parallel reaction monitoring (PRM) approach made it possible to comprehensively measure the hyper-multiplex quantitative value of each protein by high-resolution mass spectrometry. Over 450 peptides that corresponded to pathway proteins and direct or indirect interactors via the STRING database were quantified by label-free PRM in a single run. Moreover, we performed a longitudinal quantification of secreted proteins during microglial activation, in which neurotoxic molecules that mediate neuronal cell loss in the brain are released. These data suggest that latent pathways that are associated with neurodegenerative diseases can be discovered by constructing and analyzing a pathway network model of proteins. Furthermore, this systematic quantification platform has tremendous potential for applications in large-scale targeted analyses. The proteomics data for discovery and label-free PRM analysis have been deposited to the ProteomeXchange Consortium with identifiers
Micro-separation toward systems biology.
Liu, Bi-Feng; Xu, Bo; Zhang, Guisen; Du, Wei; Luo, Qingming
2006-02-17
Current biology is experiencing transformation in logic or philosophy that forces us to reevaluate the concept of cell, tissue or entire organism as a collection of individual components. Systems biology that aims at understanding biological system at the systems level is an emerging research area, which involves interdisciplinary collaborations of life sciences, computational and mathematical sciences, systems engineering, and analytical technology, etc. For analytical chemistry, developing innovative methods to meet the requirement of systems biology represents new challenges as also opportunities and responsibility. In this review, systems biology-oriented micro-separation technologies are introduced for comprehensive profiling of genome, proteome and metabolome, characterization of biomolecules interaction and single cell analysis such as capillary electrophoresis, ultra-thin layer gel electrophoresis, micro-column liquid chromatography, and their multidimensional combinations, parallel integrations, microfabricated formats, and nano technology involvement. Future challenges and directions are also suggested.
Albaum, Stefan P; Neuweger, Heiko; Fränzel, Benjamin; Lange, Sita; Mertens, Dominik; Trötschel, Christian; Wolters, Dirk; Kalinowski, Jörn; Nattkemper, Tim W; Goesmann, Alexander
2009-12-01
The goal of present -omics sciences is to understand biological systems as a whole in terms of interactions of the individual cellular components. One of the main building blocks in this field of study is proteomics where tandem mass spectrometry (LC-MS/MS) in combination with isotopic labelling techniques provides a common way to obtain a direct insight into regulation at the protein level. Methods to identify and quantify the peptides contained in a sample are well established, and their output usually results in lists of identified proteins and calculated relative abundance values. The next step is to move ahead from these abstract lists and apply statistical inference methods to compare measurements, to identify genes that are significantly up- or down-regulated, or to detect clusters of proteins with similar expression profiles. We introduce the Rich Internet Application (RIA) Qupe providing comprehensive data management and analysis functions for LC-MS/MS experiments. Starting with the import of mass spectra data the system guides the experimenter through the process of protein identification by database search, the calculation of protein abundance ratios, and in particular, the statistical evaluation of the quantification results including multivariate analysis methods such as analysis of variance or hierarchical cluster analysis. While a data model to store these results has been developed, a well-defined programming interface facilitates the integration of novel approaches. A compute cluster is utilized to distribute computationally intensive calculations, and a web service allows to interchange information with other -omics software applications. To demonstrate that Qupe represents a step forward in quantitative proteomics analysis an application study on Corynebacterium glutamicum has been carried out. Qupe is implemented in Java utilizing Hibernate, Echo2, R and the Spring framework. We encourage the usage of the RIA in the sense of the 'software as a service' concept, maintained on our servers and accessible at the following location: http://qupe.cebitec.uni-bielefeld.de. Supplementary data are available at Bioinformatics online.
Zanivan, Sara; Maione, Federica; Hein, Marco Y; Hernández-Fernaud, Juan Ramon; Ostasiewicz, Pawel; Giraudo, Enrico; Mann, Matthias
2013-12-01
Proteomics has been successfully used for cell culture on dishes, but more complex cellular systems have proven to be challenging and so far poorly approached with proteomics. Because of the complexity of the angiogenic program, we still do not have a complete understanding of the molecular mechanisms involved in this process, and there have been no in depth quantitative proteomic studies. Plating endothelial cells on matrigel recapitulates aspects of vessel growth, and here we investigate this mechanism by using a spike-in SILAC quantitative proteomic approach. By comparing proteomic changes in primary human endothelial cells morphogenesis on matrigel to general adhesion mechanisms in cells spreading on culture dish, we pinpoint pathways and proteins modulated by endothelial cells. The cell-extracellular matrix adhesion proteome depends on the adhesion substrate, and a detailed proteomic profile of the extracellular matrix secreted by endothelial cells identified CLEC14A as a matrix component, which binds to MMRN2. We verify deregulated levels of these proteins during tumor angiogenesis in models of multistage carcinogenesis. This is the most in depth quantitative proteomic study of endothelial cell morphogenesis, which shows the potential of applying high accuracy quantitative proteomics to in vitro models of vessel growth to shed new light on mechanisms that accompany pathological angiogenesis. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD000359.
Clinical veterinary proteomics: Techniques and approaches to decipher the animal plasma proteome.
Ghodasara, P; Sadowski, P; Satake, N; Kopp, S; Mills, P C
2017-12-01
Over the last two decades, technological advancements in the field of proteomics have advanced our understanding of the complex biological systems of living organisms. Techniques based on mass spectrometry (MS) have emerged as powerful tools to contextualise existing genomic information and to create quantitative protein profiles from plasma, tissues or cell lines of various species. Proteomic approaches have been used increasingly in veterinary science to investigate biological processes responsible for growth, reproduction and pathological events. However, the adoption of proteomic approaches by veterinary investigators lags behind that of researchers in the human medical field. Furthermore, in contrast to human proteomics studies, interpretation of veterinary proteomic data is difficult due to the limited protein databases available for many animal species. This review article examines the current use of advanced proteomics techniques for evaluation of animal health and welfare and covers the current status of clinical veterinary proteomics research, including successful protein identification and data interpretation studies. It includes a description of an emerging tool, sequential window acquisition of all theoretical fragment ion mass spectra (SWATH-MS), available on selected mass spectrometry instruments. This newly developed data acquisition technique combines advantages of discovery and targeted proteomics approaches, and thus has the potential to advance the veterinary proteomics field by enhancing identification and reproducibility of proteomics data. Copyright © 2017 Elsevier Ltd. All rights reserved.
[Methods of quantitative proteomics].
Kopylov, A T; Zgoda, V G
2007-01-01
In modern science proteomic analysis is inseparable from other fields of systemic biology. Possessing huge resources quantitative proteomics operates colossal information on molecular mechanisms of life. Advances in proteomics help researchers to solve complex problems of cell signaling, posttranslational modification, structure and functional homology of proteins, molecular diagnostics etc. More than 40 various methods have been developed in proteomics for quantitative analysis of proteins. Although each method is unique and has certain advantages and disadvantages all these use various isotope labels (tags). In this review we will consider the most popular and effective methods employing both chemical modifications of proteins and also metabolic and enzymatic methods of isotope labeling.
A systems biology-led insight into the role of the proteome in neurodegenerative diseases.
Fasano, Mauro; Monti, Chiara; Alberio, Tiziana
2016-09-01
Multifactorial disorders are the result of nonlinear interactions of several factors; therefore, a reductionist approach does not appear to be appropriate. Proteomics is a global approach that can be efficiently used to investigate pathogenetic mechanisms of neurodegenerative diseases. Here, we report a general introduction about the systems biology approach and mechanistic insights recently obtained by over-representation analysis of proteomics data of cellular and animal models of Alzheimer's disease, Parkinson's disease and other neurodegenerative disorders, as well as of affected human tissues. Expert commentary: As an inductive method, proteomics is based on unbiased observations that further require validation of generated hypotheses. Pathway databases and over-representation analysis tools allow researchers to assign an expectation value to pathogenetic mechanisms linked to neurodegenerative diseases. The systems biology approach based on omics data may be the key to unravel the complex mechanisms underlying neurodegeneration.
High-Throughput Cloning and Expression Library Creation for Functional Proteomics
Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua
2013-01-01
The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particular important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047
High-throughput cloning and expression library creation for functional proteomics.
Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua
2013-05-01
The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single-gene experiments, creating the need for fast, flexible, and reliable cloning systems. These collections of ORF clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial, we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator(TM) DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This tutorial is part of the International Proteomics Tutorial Programme (IPTP12). © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Karim, Mohammad R; Petering, David H
2017-04-19
Nitric oxide (NO) is both an important regulatory molecule in biological systems and a toxic xenobiotic. Its oxidation products react with sulfhydryl groups and either nitrosylate or oxidize them. The aerobic reaction of NO supplied by diethylamine NONOate (DEA-NO) with pig kidney LLC-PK 1 cells and Zn-proteins within the isolated proteome was examined with three fluorescent zinc sensors, zinquin (ZQ), TSQ, and FluoZin-3 (FZ-3). Observations of Zn 2+ labilization from Zn-proteins depended on the specific sensor used. Upon cellular exposure to DEA-NO, ZQ sequestered about 13% of the proteomic Zn 2+ as Zn(ZQ) 2 and additional Zn 2+ as proteome·Zn-ZQ ternary complexes. TSQ, a sensor structurally related to ZQ with lower affinity for Zn 2+ , did not form Zn(TSQ) 2 . Instead, Zn 2+ mobilized by DEA-NO was exclusively bound as proteome·Zn-TSQ adducts. Analogous reactions of proteome with ZQ or TSQ in vitro displayed qualitatively similar products. Titration of native proteome with Zn 2+ in the presence of ZQ resulted in the sole formation of proteome·Zn-ZQ species. This result suggested that sulfhydryl groups are involved in non-specific proteomic binding of mobile Zn 2+ and that the appearance of Zn(ZQ) 2 after exposure of cells and proteome to DEA-NO resulted from a reduction in proteomic sulfhydryl ligands, favoring the formation of Zn(ZQ) 2 instead of proteome·Zn-ZQ. With the third sensor, FluoZin-3, neither Zn-FZ-3 nor proteome·Zn-FZ-3 was detected during the reaction of proteome with DEA-NO. Instead, it reacted independently with DEA-NO with a modest enhancement of fluorescence.
Anticoagulants and the propagation phase of thrombin generation.
Orfeo, Thomas; Gissel, Matthew; Butenas, Saulius; Undas, Anetta; Brummel-Ziedins, Kathleen E; Mann, Kenneth G
2011-01-01
The view that clot time-based assays do not provide a sufficient assessment of an individual's hemostatic competence, especially in the context of anticoagulant therapy, has provoked a search for new metrics, with significant focus directed at techniques that define the propagation phase of thrombin generation. Here we use our deterministic mathematical model of tissue-factor initiated thrombin generation in combination with reconstructions using purified protein components to characterize how the interplay between anticoagulant mechanisms and variable composition of the coagulation proteome result in differential regulation of the propagation phase of thrombin generation. Thrombin parameters were extracted from computationally derived thrombin generation profiles generated using coagulation proteome factor data from warfarin-treated individuals (N = 54) and matching groups of control individuals (N = 37). A computational clot time prolongation value (cINR) was devised that correlated with their actual International Normalized Ratio (INR) values, with differences between individual INR and cINR values shown to derive from the insensitivity of the INR to tissue factor pathway inhibitor (TFPI). The analysis suggests that normal range variation in TFPI levels could be an important contributor to the failure of the INR to adequately reflect the anticoagulated state in some individuals. Warfarin-induced changes in thrombin propagation phase parameters were then compared to those induced by unfractionated heparin, fondaparinux, rivaroxaban, and a reversible thrombin inhibitor. Anticoagulants were assessed at concentrations yielding equivalent cINR values, with each anticoagulant evaluated using 32 unique coagulation proteome compositions. The analyses showed that no anticoagulant recapitulated all features of warfarin propagation phase dynamics; differences in propagation phase effects suggest that anticoagulants that selectively target fXa or thrombin may provoke fewer bleeding episodes. More generally, the study shows that computational modeling of the response of core elements of the coagulation proteome to a physiologically relevant tissue factor stimulus may improve the monitoring of a broad range of anticoagulants.
CMG-biotools, a free workbench for basic comparative microbial genomics.
Vesth, Tammi; Lagesen, Karin; Acar, Öncel; Ussery, David
2013-01-01
Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weckwerth, Wolfram; Baginsky, Sacha; Van Wijk, Klass
2009-12-01
In the past 10 years, we have witnessed remarkable advances in the field of plant molecular biology. The rapid development of proteomic technologies and the speed with which these techniques have been applied to the field have altered our perception of how we can analyze proteins in complex systems. At nearly the same time, the availability of the complete genome for the model plant Arabidopsis thaliana was released; this effort provides an unsurpassed resource for the identification of proteins when researchers use MS to analyze plant samples. Recognizing the growth in this area, the Multinational Arabidopsis Steering Committee (MASC) establishedmore » a subcommittee for A. thaliana proteomics in 2006 with the objective of consolidating databases, technique standards, and experimentally validated candidate genes and functions. Since the establishment of the Multinational Arabidopsis Steering Subcommittee for Proteomics (MASCP), many new approaches and resources have become available. Recently, the subcommittee established a webpage to consolidate this information (www.masc-proteomics.org). It includes links to plant proteomic databases, general information about proteomic techniques, meeting information, a summary of proteomic standards, and other relevant resources. Altogether, this website provides a useful resource for the Arabidopsis proteomics community. In the future, the website will host discussions and investigate the cross-linking of databases. The subcommittee members have extensive experience in arabidopsis proteomics and collectively have produced some of the most extensive proteomics data sets for this model plant (Table S1 in the Supporting Information has a list of resources). The largest collection of proteomics data from a single study in A. thaliana was assembled into an accessible database (AtProteome; http://fgcz-atproteome.unizh.ch/index.php) and was recently published by the Baginsky lab.1 The database provides links to major Arabidopsis online resources, and raw data have been deposited in PRIDE and PRIDE BioMart. Included in this database is an Arabidopsis proteome map that provides evidence for the expression of {approx}50% of all predicted gene models, including several alternative gene models that are not represented in The Arabidopsis Information Resource (TAIR) protein database. A set of organ-specific biomarkers is provided, as well as organ-specific proteotypic peptides for 4105 proteins that can be used to facilitate targeted quantitative proteomic surveys. In the future, the AtProteome database will be linked to additional existing resources developed by MASCP members, such as PPDB, ProMEX, and SUBA. The most comprehensive study on the Arabidopsis chloroplast proteome, which includes information on chloroplast sorting signals, posttranslational modifications (PTMs), and protein abundances (analyzed by high-accuracy MS [Orbitrap]), was recently published by the van Wijk lab.2 These and previous data are available via the plant proteome database (PPDB; http://ppdb.tc.cornell.edu) for A. thaliana and maize. PPDB provides genome-wide experimental and functional characterization of the A. thaliana and maize proteomes, including PTMs and subcellular localization information, with an emphasis on leaf and plastid proteins. Maize and Arabidopsis proteome entries are directly linked via internal BLAST alignments within PPDB. Direct links for each protein to TAIR, SUBA, ProMEX, and other resources are also provided.« less
Wang, Degeng
2008-01-01
Discrepancy between the abundance of cognate protein and RNA molecules is frequently observed. A theoretical understanding of this discrepancy remains elusive, and it is frequently described as surprises and/or technical difficulties in the literature. Protein and RNA represent different steps of the multi-stepped cellular genetic information flow process, in which they are dynamically produced and degraded. This paper explores a comparison with a similar process in computers - multi-step information flow from storage level to the execution level. Functional similarities can be found in almost every facet of the retrieval process. Firstly, common architecture is shared, as the ribonome (RNA space) and the proteome (protein space) are functionally similar to the computer primary memory and the computer cache memory respectively. Secondly, the retrieval process functions, in both systems, to support the operation of dynamic networks – biochemical regulatory networks in cells and, in computers, the virtual networks (of CPU instructions) that the CPU travels through while executing computer programs. Moreover, many regulatory techniques are implemented in computers at each step of the information retrieval process, with a goal of optimizing system performance. Cellular counterparts can be easily identified for these regulatory techniques. In other words, this comparative study attempted to utilize theoretical insight from computer system design principles as catalysis to sketch an integrative view of the gene expression process, that is, how it functions to ensure efficient operation of the overall cellular regulatory network. In context of this bird’s-eye view, discrepancy between protein and RNA abundance became a logical observation one would expect. It was suggested that this discrepancy, when interpreted in the context of system operation, serves as a potential source of information to decipher regulatory logics underneath biochemical network operation. PMID:18757239
Wang, Degeng
2008-12-01
Discrepancy between the abundance of cognate protein and RNA molecules is frequently observed. A theoretical understanding of this discrepancy remains elusive, and it is frequently described as surprises and/or technical difficulties in the literature. Protein and RNA represent different steps of the multi-stepped cellular genetic information flow process, in which they are dynamically produced and degraded. This paper explores a comparison with a similar process in computers-multi-step information flow from storage level to the execution level. Functional similarities can be found in almost every facet of the retrieval process. Firstly, common architecture is shared, as the ribonome (RNA space) and the proteome (protein space) are functionally similar to the computer primary memory and the computer cache memory, respectively. Secondly, the retrieval process functions, in both systems, to support the operation of dynamic networks-biochemical regulatory networks in cells and, in computers, the virtual networks (of CPU instructions) that the CPU travels through while executing computer programs. Moreover, many regulatory techniques are implemented in computers at each step of the information retrieval process, with a goal of optimizing system performance. Cellular counterparts can be easily identified for these regulatory techniques. In other words, this comparative study attempted to utilize theoretical insight from computer system design principles as catalysis to sketch an integrative view of the gene expression process, that is, how it functions to ensure efficient operation of the overall cellular regulatory network. In context of this bird's-eye view, discrepancy between protein and RNA abundance became a logical observation one would expect. It was suggested that this discrepancy, when interpreted in the context of system operation, serves as a potential source of information to decipher regulatory logics underneath biochemical network operation.
Stekhoven, Daniel J; Omasits, Ulrich; Quebatte, Maxime; Dehio, Christoph; Ahrens, Christian H
2014-03-17
Proteomics data provide unique insights into biological systems, including the predominant subcellular localization (SCL) of proteins, which can reveal important clues about their functions. Here we analyzed data of a complete prokaryotic proteome expressed under two conditions mimicking interaction of the emerging pathogen Bartonella henselae with its mammalian host. Normalized spectral count data from cytoplasmic, total membrane, inner and outer membrane fractions allowed us to identify the predominant SCL for 82% of the identified proteins. The spectral count proportion of total membrane versus cytoplasmic fractions indicated the propensity of cytoplasmic proteins to co-fractionate with the inner membrane, and enabled us to distinguish cytoplasmic, peripheral inner membrane and bona fide inner membrane proteins. Principal component analysis and k-nearest neighbor classification training on selected marker proteins or predominantly localized proteins, allowed us to determine an extensive catalog of at least 74 expressed outer membrane proteins, and to extend the SCL assignment to 94% of the identified proteins, including 18% where in silico methods gave no prediction. Suitable experimental proteomics data combined with straightforward computational approaches can thus identify the predominant SCL on a proteome-wide scale. Finally, we present a conceptual approach to identify proteins potentially changing their SCL in a condition-dependent fashion. The work presented here describes the first prokaryotic proteome-wide subcellular localization (SCL) dataset for the emerging pathogen B. henselae (Bhen). The study indicates that suitable subcellular fractionation experiments combined with straight-forward computational analysis approaches assessing the proportion of spectral counts observed in different subcellular fractions are powerful for determining the predominant SCL of a large percentage of the experimentally observed proteins. This includes numerous cases where in silico prediction methods do not provide any prediction. Avoiding a treatment with harsh conditions, cytoplasmic proteins tend to co-fractionate with proteins of the inner membrane fraction, indicative of close functional interactions. The spectral count proportion (SCP) of total membrane versus cytoplasmic fractions allowed us to obtain a good indication about the relative proximity of individual protein complex members to the inner membrane. Using principal component analysis and k-nearest neighbor approaches, we were able to extend the percentage of proteins with a predominant experimental localization to over 90% of all expressed proteins and identified a set of at least 74 outer membrane (OM) proteins. In general, OM proteins represent a rich source of candidates for the development of urgently needed new therapeutics in combat of resurgence of infectious disease and multi-drug resistant bacteria. Finally, by comparing the data from two infection biology relevant conditions, we conceptually explore methods to identify and visualize potential candidates that may partially change their SCL in these different conditions. The data are made available to researchers as a SCL compendium for Bhen and as an assistance in further improving in silico SCL prediction algorithms. Copyright © 2014 Elsevier B.V. All rights reserved.
Ferrante, Michele; Blackwell, Kim T.; Migliore, Michele; Ascoli, Giorgio A.
2012-01-01
The identification and characterization of potential pharmacological targets in neurology and psychiatry is a fundamental problem at the intersection between medicinal chemistry and the neurosciences. Exciting new techniques in proteomics and genomics have fostered rapid progress, opening numerous questions as to the functional consequences of ligand binding at the systems level. Psycho- and neuro-active drugs typically work in nerve cells by affecting one or more aspects of electrophysiological activity. Thus, an integrated understanding of neuropharmacological agents requires bridging the gap between their molecular mechanisms and the biophysical determinants of neuronal function. Computational neuroscience and bioinformatics can play a major role in this functional connection. Robust quantitative models exist describing all major active membrane properties under endogenous and exogenous chemical control. These include voltage-dependent ionic channels (sodium, potassium, calcium, etc.), synaptic receptor channels (e.g. glutamatergic, GABAergic, cholinergic), and G protein coupled signaling pathways (protein kinases, phosphatases, and other enzymatic cascades). This brief review of neuromolecular medicine from the computational perspective provides compelling examples of how simulations can elucidate, explain, and predict the effect of chemical agonists, antagonists, and modulators in the nervous system. PMID:18855673
Extracellular proteome analysis of Leptospira interrogans serovar Lai.
Zeng, Lingbing; Zhang, Yunyi; Zhu, Yongzhang; Yin, Haidi; Zhuang, Xuran; Zhu, Weinan; Guo, Xiaokui; Qin, Jinhong
2013-10-01
Abstract Leptospirosis is one of the most important zoonoses. Leptospira interrogans serovar Lai is a pathogenic spirochete that is responsible for leptospirosis. Extracellular proteins play an important role in the pathogenicity of this bacterium. In this study, L. interrogans serovar Lai was grown in protein-free medium; the supernatant was collected and subsequently analyzed as the extracellular proteome. A total of 66 proteins with more than two unique peptides were detected by MS/MS, and 33 of these were predicted to be extracellular proteins by a combination of bioinformatics analyses, including Psortb, cello, SoSuiGramN and SignalP. Comparisons of the transcriptional levels of these 33 genes between in vivo and in vitro conditions revealed that 15 genes were upregulated and two genes were downregulated in vivo compared to in vitro. A BLAST search for the components of secretion system at the genomic and proteomic levels revealed the presence of the complete type I secretion system and type II secretion system in this strain. Moreover, this strain also exhibits complete Sec translocase and Tat translocase systems. The extracellular proteome analysis of L. interrogans will supplement the previously generated whole proteome data and provide more information for studying the functions of specific proteins in the infection process and for selecting candidate molecules for vaccines or diagnostic tools for leptospirosis.
PrePhyloPro: phylogenetic profile-based prediction of whole proteome linkages
Niu, Yulong; Liu, Chengcheng; Moghimyfiroozabad, Shayan; Yang, Yi
2017-01-01
Direct and indirect functional links between proteins as well as their interactions as part of larger protein complexes or common signaling pathways may be predicted by analyzing the correlation of their evolutionary patterns. Based on phylogenetic profiling, here we present a highly scalable and time-efficient computational framework for predicting linkages within the whole human proteome. We have validated this method through analysis of 3,697 human pathways and molecular complexes and a comparison of our results with the prediction outcomes of previously published co-occurrency model-based and normalization methods. Here we also introduce PrePhyloPro, a web-based software that uses our method for accurately predicting proteome-wide linkages. We present data on interactions of human mitochondrial proteins, verifying the performance of this software. PrePhyloPro is freely available at http://prephylopro.org/phyloprofile/. PMID:28875072
Cambiaghi, Alice; Díaz, Ramón; Martinez, Julia Bauzá; Odena, Antonia; Brunelli, Laura; Caironi, Pietro; Masson, Serge; Baselli, Giuseppe; Ristagno, Giuseppe; Gattinoni, Luciano; de Oliveira, Eliandre; Pastorelli, Roberta; Ferrario, Manuela
2018-04-27
In this work, we examined plasma metabolome, proteome and clinical features in patients with severe septic shock enrolled in the multicenter ALBIOS study. The objective was to identify changes in the levels of metabolites involved in septic shock progression and to integrate this information with the variation occurring in proteins and clinical data. Mass spectrometry-based targeted metabolomics and untargeted proteomics allowed us to quantify absolute metabolites concentration and relative proteins abundance. We computed the ratio D7/D1 to take into account their variation from day 1 (D1) to day 7 (D7) after shock diagnosis. Patients were divided into two groups according to 28-day mortality. Three different elastic net logistic regression models were built: one on metabolites only, one on metabolites and proteins and one to integrate metabolomics and proteomics data with clinical parameters. Linear discriminant analysis and Partial least squares Discriminant Analysis were also implemented. All the obtained models correctly classified the observations in the testing set. By looking at the variable importance (VIP) and the selected features, the integration of metabolomics with proteomics data showed the importance of circulating lipids and coagulation cascade in septic shock progression, thus capturing a further layer of biological information complementary to metabolomics information.
Marine proteomics: a critical assessment of an emerging technology.
Slattery, Marc; Ankisetty, Sridevi; Corrales, Jone; Marsh-Hunkin, K Erica; Gochfeld, Deborah J; Willett, Kristine L; Rimoldi, John M
2012-10-26
The application of proteomics to marine sciences has increased in recent years because the proteome represents the interface between genotypic and phenotypic variability and, thus, corresponds to the broadest possible biomarker for eco-physiological responses and adaptations. Likewise, proteomics can provide important functional information regarding biosynthetic pathways, as well as insights into mechanism of action, of novel marine natural products. The goal of this review is to (1) explore the application of proteomics methodologies to marine systems, (2) assess the technical approaches that have been used, and (3) evaluate the pros and cons of this proteomic research, with the intent of providing a critical analysis of its future roles in marine sciences. To date, proteomics techniques have been utilized to investigate marine microbe, plant, invertebrate, and vertebrate physiology, developmental biology, seafood safety, susceptibility to disease, and responses to environmental change. However, marine proteomics studies often suffer from poor experimental design, sample processing/optimization difficulties, and data analysis/interpretation issues. Moreover, a major limitation is the lack of available annotated genomes and proteomes for most marine organisms, including several "model species". Even with these challenges in mind, there is no doubt that marine proteomics is a rapidly expanding and powerful integrative molecular research tool from which our knowledge of the marine environment, and the natural products from this resource, will be significantly expanded.
Knowledge Translation: Moving Proteomics Science to Innovation in Society.
Holmes, Christina; McDonald, Fiona; Jones, Mavis; Graham, Janice
2016-06-01
Proteomics is one of the pivotal next-generation biotechnologies in the current "postgenomics" era. Little is known about the ways in which innovative proteomics science is navigating the complex socio-political space between laboratory and society. It cannot be assumed that the trajectory between proteomics laboratory and society is linear and unidirectional. Concerned about public accountability and hopes for knowledge-based innovations, funding agencies and citizens increasingly expect that emerging science and technologies, such as proteomics, are effectively translated and disseminated as innovation in society. Here, we describe translation strategies promoted in the knowledge translation (KT) and science communication literatures and examine the use of these strategies within the field of proteomics. Drawing on data generated from qualitative interviews with proteomics scientists and ethnographic observation of international proteomics conferences over a 5-year period, we found that proteomics science incorporates a variety of KT strategies to reach knowledge users outside the field. To attain the full benefit of KT, however, proteomics scientists must challenge their own normative assumptions and approaches to innovation dissemination-beyond the current paradigm relying primarily on publication for one's scientific peers within one's field-and embrace the value of broader (interdisciplinary) KT strategies in promoting the uptake of their research. Notably, the Human Proteome Organization (HUPO) is paying increasing attention to a broader range of KT strategies, including targeted dissemination, integrated KT, and public outreach. We suggest that increasing the variety of KT strategies employed by proteomics scientists is timely and would serve well the omics system sciences community.
GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data.
Carvalho, Paulo C; Fischer, Juliana Sg; Chen, Emily I; Domont, Gilberto B; Carvalho, Maria Gc; Degrave, Wim M; Yates, John R; Barbosa, Valmir C
2009-02-24
Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. Here we present a new algorithm, termed GO Explorer (GOEx), that leverages the gene ontology (GO) to aid in the interpretation of proteomic data. GOEx stands out because it combines data from protein fold changes with GO over-representation statistics to help draw conclusions. Moreover, it is tightly integrated within the PatternLab for Proteomics project and, thus, lies within a complete computational environment that provides parsers and pattern recognition tools designed for spectral counting. GOEx offers three independent methods to query data: an interactive directed acyclic graph, a specialist mode where key words can be searched, and an automatic search. Its usefulness is demonstrated by applying it to help interpret the effects of perillyl alcohol, a natural chemotherapeutic agent, on glioblastoma multiform cell lines (A172). We used a new multi-surfactant shotgun proteomic strategy and identified more than 2600 proteins; GOEx pinpointed key sets of differentially expressed proteins related to cell cycle, alcohol catabolism, the Ras pathway, apoptosis, and stress response, to name a few. GOEx facilitates organism-specific studies by leveraging GO and providing a rich graphical user interface. It is a simple to use tool, specialized for biologists who wish to analyze spectral counting data from shotgun proteomics. GOEx is available at http://pcarvalho.com/patternlab.
Proteomic approaches to understanding the role of the cytoskeleton in host-defense mechanisms
Radulovic, Marko; Godovac-Zimmermann, Jasminka
2014-01-01
The cytoskeleton is a cellular scaffolding system whose functions include maintenance of cellular shape, enabling cellular migration, division, intracellular transport, signaling and membrane organization. In addition, in immune cells, the cytoskeleton is essential for phagocytosis. Following the advances in proteomics technology over the past two decades, cytoskeleton proteome analysis in resting and activated immune cells has emerged as a possible powerful approach to expand our understanding of cytoskeletal composition and function. However, so far there have only been a handful of studies of the cytoskeleton proteome in immune cells. This article considers promising proteomics strategies that could augment our understanding of the role of the cytoskeleton in host-defense mechanisms. PMID:21329431
Proteomic approaches in cancer risk and response assessment.
Petricoin, Emanuel F; Liotta, Lance A
2004-02-01
Proteomics is more than just a list-generating exercise where increases or decreases in protein expression are identified. Proteomic technologies will ultimately characterize information-flow through the protein circuitry that interconnects the extracellular microenvironment to the serum or plasma macroenvironment through intracellular signaling systems and their control of gene transcription. The nature of this information can be a cause or a consequence of disease processes and how patients respond to therapy. Analysis of human cancer as a model for how proteomics can have an impact at the bedside can take advantage of several promising new proteomic technologies. These technologies are being developed for early detection and risk assessment, therapeutic targeting and patient-tailored therapy.
Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus
2014-01-01
The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Albeit quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868
USDA-ARS?s Scientific Manuscript database
In addition to microarray technology, which provides a robust method to study protein function in a rapid, economical, and proteome-wide fashion, plasmid-based functional proteomics is an important technology for rapidly obtaining large quantities of protein and determining protein function across a...
Proteomics in Heart Failure: Top-down or Bottom-up?
Gregorich, Zachery R.; Chang, Ying-Hua; Ge, Ying
2014-01-01
Summary The pathophysiology of heart failure (HF) is diverse, owing to multiple etiologies and aberrations in a number of cellular processes. Therefore, it is essential to understand how defects in the molecular pathways that mediate cellular responses to internal and external stressors function as a system to drive the HF phenotype. Mass spectrometry (MS)-based proteomics strategies have great potential for advancing our understanding of disease mechanisms at the systems level because proteins are the effector molecules for all cell functions and, thus, are directly responsible for determining cell phenotype. Two MS-based proteomics strategies exist: peptide-based bottom-up and protein-based top-down proteomics—each with its own unique strengths and weaknesses for interrogating the proteome. In this review, we will discuss the advantages and disadvantages of bottom-up and top-down MS for protein identification, quantification, and the analysis of post-translational modifications, as well as highlight how both of these strategies have contributed to our understanding of the molecular and cellular mechanisms underlying HF. Additionally, the challenges associated with both proteomics approaches will be discussed and insights will be offered regarding the future of MS-based proteomics in HF research. PMID:24619480
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Wiberg, Holli K.; Matzke, Melissa M.
In this review, we apply selected imputation strategies to label-free liquid chromatography–mass spectrometry (LC–MS) proteomics datasets to evaluate the accuracy with respect to metrics of variance and classification. We evaluate several commonly used imputation approaches for individual merits and discuss the caveats of each approach with respect to the example LC–MS proteomics data. In general, local similarity-based approaches, such as the regularized expectation maximization and least-squares adaptive algorithms, yield the best overall performances with respect to metrics of accuracy and robustness. However, no single algorithm consistently outperforms the remaining approaches, and in some cases, performing classification without imputation sometimes yieldedmore » the most accurate classification. Thus, because of the complex mechanisms of missing data in proteomics, which also vary from peptide to protein, no individual method is a single solution for imputation. In summary, on the basis of the observations in this review, the goal for imputation in the field of computational proteomics should be to develop new approaches that work generically for this data type and new strategies to guide users in the selection of the best imputation for their dataset and analysis objectives.« less
Alkylation Damage by Lipid Electrophiles Targets Functional Protein Systems*
Codreanu, Simona G.; Ullery, Jody C.; Zhu, Jing; Tallman, Keri A.; Beavers, William N.; Porter, Ned A.; Marnett, Lawrence J.; Zhang, Bing; Liebler, Daniel C.
2014-01-01
Protein alkylation by reactive electrophiles contributes to chemical toxicities and oxidative stress, but the functional impact of alkylation damage across proteomes is poorly understood. We used Click chemistry and shotgun proteomics to profile the accumulation of proteome damage in human cells treated with lipid electrophile probes. Protein target profiles revealed three damage susceptibility classes, as well as proteins that were highly resistant to alkylation. Damage occurred selectively across functional protein interaction networks, with the most highly alkylation-susceptible proteins mapping to networks involved in cytoskeletal regulation. Proteins with lower damage susceptibility mapped to networks involved in protein synthesis and turnover and were alkylated only at electrophile concentrations that caused significant toxicity. Hierarchical susceptibility of proteome systems to alkylation may allow cells to survive sublethal damage while protecting critical cell functions. PMID:24429493
Plant proteomics in India and Nepal: current status and challenges ahead.
Deswal, Renu; Gupta, Ravi; Dogra, Vivek; Singh, Raksha; Abat, Jasmeet Kaur; Sarkar, Abhijit; Mishra, Yogesh; Rai, Vandana; Sreenivasulu, Yelam; Amalraj, Ramesh Sundar; Raorane, Manish; Chaudhary, Ram Prasad; Kohli, Ajay; Giri, Ashok Prabhakar; Chakraborty, Niranjan; Zargar, Sajad Majeed; Agrawal, Vishwanath Prasad; Agrawal, Ganesh Kumar; Job, Dominique; Renaut, Jenny; Rakwal, Randeep
2013-10-01
Plant proteomics has made tremendous contributions in understanding the complex processes of plant biology. Here, its current status in India and Nepal is discussed. Gel-based proteomics is predominantly utilized on crops and non-crops to analyze majorly abiotic (49 %) and biotic (18 %) stress, development (11 %) and post-translational modifications (7 %). Rice is the most explored system (36 %) with major focus on abiotic mainly dehydration (36 %) stress. In spite of expensive proteomics setup and scarcity of trained workforce, output in form of publications is encouraging. To boost plant proteomics in India and Nepal, researchers have discussed ground level issues among themselves and with the International Plant Proteomics Organization (INPPO) to act in priority on concerns like food security. Active collaboration may help in translating this knowledge to fruitful applications.
Gadher, Suresh; Bhide, Mangesh; Kovarova, Hana
2018-05-01
The Central and Eastern European Proteomic Conference (CEEPC) successfully launched its second decade of proteomics in Košice, Slovakia with a program of systems biology, cellular, clinical, veterinary and sports proteomics. Whilst many conferences are struggling to attract participants, CEEPC with its outstanding track record and unique 'family - feel' packaged with excellent ambiance is thriving and bringing together proteomics experts from academia, industry, scientific specialties, clinics and precision medicine communities interested in resolving mysteries about protein functionalities in health and disease. CEEPC is also renowned for addressing humanitarian global healthcare issues, may it be ageing, chronic diseases or global epidemics. This year CEEPC intertwined with Košice Peace Marathon's pursuit of excellence in sports and initiatives including sports medicine and global peace.
Proteomics and plant disease: advances in combating a major threat to the global food supply.
Rampitsch, Christof; Bykova, Natalia V
2012-02-01
The study of plant disease and immunity is benefiting tremendously from proteomics. Parallel streams of research from model systems, from pathogens in vitro and from the relevant pathogen-crop interactions themselves have begun to reveal a model of how plants succumb to invading pathogens and how they defend themselves without the benefit of a circulating immune system. In this review, we discuss the contribution of proteomics to these advances, drawing mainly on examples from crop-fungus interactions, from Arabidopsis-bacteria interactions, from elicitor-based model systems and from pathogen studies, to highlight also the important contribution of non-crop systems to advancing crop protection. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.
Dowsey, Andrew W; Dunn, Michael J; Yang, Guang-Zhong
2008-04-01
The quest for high-throughput proteomics has revealed a number of challenges in recent years. Whilst substantial improvements in automated protein separation with liquid chromatography and mass spectrometry (LC/MS), aka 'shotgun' proteomics, have been achieved, large-scale open initiatives such as the Human Proteome Organization (HUPO) Brain Proteome Project have shown that maximal proteome coverage is only possible when LC/MS is complemented by 2D gel electrophoresis (2-DE) studies. Moreover, both separation methods require automated alignment and differential analysis to relieve the bioinformatics bottleneck and so make high-throughput protein biomarker discovery a reality. The purpose of this article is to describe a fully automatic image alignment framework for the integration of 2-DE into a high-throughput differential expression proteomics pipeline. The proposed method is based on robust automated image normalization (RAIN) to circumvent the drawbacks of traditional approaches. These use symbolic representation at the very early stages of the analysis, which introduces persistent errors due to inaccuracies in modelling and alignment. In RAIN, a third-order volume-invariant B-spline model is incorporated into a multi-resolution schema to correct for geometric and expression inhomogeneity at multiple scales. The normalized images can then be compared directly in the image domain for quantitative differential analysis. Through evaluation against an existing state-of-the-art method on real and synthetically warped 2D gels, the proposed analysis framework demonstrates substantial improvements in matching accuracy and differential sensitivity. High-throughput analysis is established through an accelerated GPGPU (general purpose computation on graphics cards) implementation. Supplementary material, software and images used in the validation are available at http://www.proteomegrid.org/rain/.
MASH Suite Pro: A Comprehensive Software Tool for Top-Down Proteomics*
Cai, Wenxuan; Guner, Huseyin; Gregorich, Zachery R.; Chen, Albert J.; Ayaz-Guner, Serife; Peng, Ying; Valeja, Santosh G.; Liu, Xiaowen; Ge, Ying
2016-01-01
Top-down mass spectrometry (MS)-based proteomics is arguably a disruptive technology for the comprehensive analysis of all proteoforms arising from genetic variation, alternative splicing, and posttranslational modifications (PTMs). However, the complexity of top-down high-resolution mass spectra presents a significant challenge for data analysis. In contrast to the well-developed software packages available for data analysis in bottom-up proteomics, the data analysis tools in top-down proteomics remain underdeveloped. Moreover, despite recent efforts to develop algorithms and tools for the deconvolution of top-down high-resolution mass spectra and the identification of proteins from complex mixtures, a multifunctional software platform, which allows for the identification, quantitation, and characterization of proteoforms with visual validation, is still lacking. Herein, we have developed MASH Suite Pro, a comprehensive software tool for top-down proteomics with multifaceted functionality. MASH Suite Pro is capable of processing high-resolution MS and tandem MS (MS/MS) data using two deconvolution algorithms to optimize protein identification results. In addition, MASH Suite Pro allows for the characterization of PTMs and sequence variations, as well as the relative quantitation of multiple proteoforms in different experimental conditions. The program also provides visualization components for validation and correction of the computational outputs. Furthermore, MASH Suite Pro facilitates data reporting and presentation via direct output of the graphics. Thus, MASH Suite Pro significantly simplifies and speeds up the interpretation of high-resolution top-down proteomics data by integrating tools for protein identification, quantitation, characterization, and visual validation into a customizable and user-friendly interface. We envision that MASH Suite Pro will play an integral role in advancing the burgeoning field of top-down proteomics. PMID:26598644
Uddin, Reaz; Jamil, Faiza
2018-06-01
Pseudomonas aeruginosa is an opportunistic gram-negative bacterium that has the capability to acquire resistance under hostile conditions and become a threat worldwide. It is involved in nosocomial infections. In the current study, potential novel drug targets against P. aeruginosa have been identified using core proteomic analysis and Protein-Protein Interactions (PPIs) studies. The non-redundant reference proteome of 68 strains having complete genome and latest assembly version of P. aeruginosa were downloaded from ftp NCBI RefSeq server in October 2016. The standalone CD-HIT tool was used to cluster ortholog proteins (having >=80% amino acid identity) present in all strains. The pan-proteome was clustered in 12,380 Clusters of Orthologous Proteins (COPs). By using in-house shell scripts, 3252 common COPs were extracted out and designated as clusters of core proteome. The core proteome of PAO1 strain was selected by fetching PAO1's proteome from common COPs. As a result, 1212 proteins were shortlisted that are non-homologous to the human but essential for the survival of the pathogen. Among these 1212 proteins, 321 proteins are conserved hypothetical proteins. Considering their potential as drug target, those 321 hypothetical proteins were selected and their probable functions were characterized. Based on the druggability criteria, 18 proteins were shortlisted. The interacting partners were identified by investigating the PPIs network using STRING v10 database. Subsequently, 8 proteins were shortlisted as 'hub proteins' and proposed as potential novel drug targets against P. aeruginosa. The study is interesting for the scientific community working to identify novel drug targets against MDR pathogens particularly P. aeruginosa. Copyright © 2018 Elsevier Ltd. All rights reserved.
The Need for Integrated Approaches in Metabolic Engineering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lechner, Anna; Brunk, Elizabeth; Keasling, Jay D.
This review highlights state-of-the-art procedures for heterologous small-molecule biosynthesis, the associated bottlenecks, and new strategies that have the potential to accelerate future accomplishments in metabolic engineering. We emphasize that a combination of different approaches over multiple time and size scales must b e considered for successful pathway engineering in a heterologous host. We have classified these optimization procedures based on the "system" that is being manipulated: transcriptome, translatome, proteome, or reactome. By bridging multiple disciplines, including molecular biology, biochemistry, biophysics, and computational sciences, we can create an integral framework for the discovery and implementation of novel biosynthetic production routes.
An insight into cyanobacterial genomics--a perspective.
Lakshmi, Palaniswamy Thanga Velan
2007-05-20
At the turn of the millennium, cyanobacteria deserve attention to be reviewed to understand the past, present and future. The advent of post genomic research, which encompasses functional genomics, structural genomics, transcriptomics, pharmacogenomics, proteomics and metabolomics that allows a systematic wide approach for biological system studies. Thus by exploiting genomic and associated protein information through computational analyses, the fledging information that are generated by biotechnological analyses, could be well extrapolated to fill in the lacuna of scarce information on cyanobacteria and as an effort this paper attempts to highlights the perspectives available and awakens researcher to concentrate in the field of cyanobacterial informatics.
Time-resolved Global and Chromatin Proteomics during Herpes Simplex Virus Type 1 (HSV-1) Infection.
Kulej, Katarzyna; Avgousti, Daphne C; Sidoli, Simone; Herrmann, Christin; Della Fera, Ashley N; Kim, Eui Tae; Garcia, Benjamin A; Weitzman, Matthew D
2017-04-01
Herpes simplex virus (HSV-1) lytic infection results in global changes to the host cell proteome and the proteins associated with host chromatin. We present a system level characterization of proteome dynamics during infection by performing a multi-dimensional analysis during HSV-1 lytic infection of human foreskin fibroblast (HFF) cells. Our study includes identification and quantification of the host and viral proteomes, phosphoproteomes, chromatin bound proteomes and post-translational modifications (PTMs) on cellular histones during infection. We analyzed proteomes across six time points of virus infection (0, 3, 6, 9, 12 and 15 h post-infection) and clustered trends in abundance using fuzzy c-means. Globally, we accurately quantified more than 4000 proteins, 200 differently modified histone peptides and 9000 phosphorylation sites on cellular proteins. In addition, we identified 67 viral proteins and quantified 571 phosphorylation events (465 with high confidence site localization) on viral proteins, which is currently the most comprehensive map of HSV-1 phosphoproteome. We investigated chromatin bound proteins by proteomic analysis of the high-salt chromatin fraction and identified 510 proteins that were significantly different in abundance during infection. We found 53 histone marks significantly regulated during virus infection, including a steady increase of histone H3 acetylation (H3K9ac and H3K14ac). Our data provide a resource of unprecedented depth for human and viral proteome dynamics during infection. Collectively, our results indicate that the proteome composition of the chromatin of HFF cells is highly affected during HSV-1 infection, and that phosphorylation events are abundant on viral proteins. We propose that our epi-proteomics approach will prove to be important in the characterization of other model infectious systems that involve changes to chromatin composition. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Tiered Human Integrated Sequence Search Databases for Shotgun Proteomics.
Deutsch, Eric W; Sun, Zhi; Campbell, David S; Binz, Pierre-Alain; Farrah, Terry; Shteynberg, David; Mendoza, Luis; Omenn, Gilbert S; Moritz, Robert L
2016-11-04
The results of analysis of shotgun proteomics mass spectrometry data can be greatly affected by the selection of the reference protein sequence database against which the spectra are matched. For many species there are multiple sources from which somewhat different sequence sets can be obtained. This can lead to confusion about which database is best in which circumstances-a problem especially acute in human sample analysis. All sequence databases are genome-based, with sequences for the predicted gene and their protein translation products compiled. Our goal is to create a set of primary sequence databases that comprise the union of sequences from many of the different available sources and make the result easily available to the community. We have compiled a set of four sequence databases of varying sizes, from a small database consisting of only the ∼20,000 primary isoforms plus contaminants to a very large database that includes almost all nonredundant protein sequences from several sources. This set of tiered, increasingly complete human protein sequence databases suitable for mass spectrometry proteomics sequence database searching is called the Tiered Human Integrated Search Proteome set. In order to evaluate the utility of these databases, we have analyzed two different data sets, one from the HeLa cell line and the other from normal human liver tissue, with each of the four tiers of database complexity. The result is that approximately 0.8%, 1.1%, and 1.5% additional peptides can be identified for Tiers 2, 3, and 4, respectively, as compared with the Tier 1 database, at substantially increasing computational cost. This increase in computational cost may be worth bearing if the identification of sequence variants or the discovery of sequences that are not present in the reviewed knowledge base entries is an important goal of the study. We find that it is useful to search a data set against a simpler database, and then check the uniqueness of the discovered peptides against a more complex database. We have set up an automated system that downloads all the source databases on the first of each month and automatically generates a new set of search databases and makes them available for download at http://www.peptideatlas.org/thisp/ .
Tiered Human Integrated Sequence Search Databases for Shotgun Proteomics
Deutsch, Eric W.; Sun, Zhi; Campbell, David S.; Binz, Pierre-Alain; Farrah, Terry; Shteynberg, David; Mendoza, Luis; Omenn, Gilbert S.; Moritz, Robert L.
2016-01-01
The results of analysis of shotgun proteomics mass spectrometry data can be greatly affected by the selection of the reference protein sequence database against which the spectra are matched. For many species there are multiple sources from which somewhat different sequence sets can be obtained. This can lead to confusion about which database is best in which circumstances – a problem especially acute in human sample analysis. All sequence databases are genome-based, with sequences for the predicted gene and their protein translation products compiled. Our goal is to create a set of primary sequence databases that comprise the union of sequences from many of the different available sources and make the result easily available to the community. We have compiled a set of four sequence databases of varying sizes, from a small database consisting of only the ~20,000 primary isoforms plus contaminants to a very large database that includes almost all non-redundant protein sequences from several sources. This set of tiered, increasingly complete human protein sequence databases suitable for mass spectrometry proteomics sequence database searching is called the Tiered Human Integrated Search Proteome set. In order to evaluate the utility of these databases, we have analyzed two different data sets, one from the HeLa cell line and the other from normal human liver tissue, with each of the four tiers of database complexity. The result is that approximately 0.8%, 1.1%, and 1.5% additional peptides can be identified for Tiers 2, 3, and 4, respectively, as compared with the Tier 1 database, at substantially increasing computational cost. This increase in computational cost may be worth bearing if the identification of sequence variants or the discovery of sequences that are not present in the reviewed knowledge base entries is an important goal of the study. We find that it is useful to search a data set against a simpler database, and then check the uniqueness of the discovered peptides against a more complex database. We have set up an automated system that downloads all the source databases on the first of each month and automatically generates a new set of search databases and makes them available for download at http://www.peptideatlas.org/thisp/. PMID:27577934
Quantitative body fluid proteomics in medicine - A focus on minimal invasiveness.
Csősz, Éva; Kalló, Gergő; Márkus, Bernadett; Deák, Eszter; Csutak, Adrienne; Tőzsér, József
2017-02-05
Identification of new biomarkers specific for various pathological conditions is an important field in medical sciences. Body fluids have emerging potential in biomarker studies especially those which are continuously available and can be collected by non-invasive means. Changes in the protein composition of body fluids such as tears, saliva, sweat, etc. may provide information on both local and systemic conditions of medical relevance. In this review, our aim is to discuss the quantitative proteomics techniques used in biomarker studies, and to present advances in quantitative body fluid proteomics of non-invasively collectable body fluids with relevance to biomarker identification. The advantages and limitations of the widely used quantitative proteomics techniques are also presented. Based on the reviewed literature, we suggest an ideal pipeline for body fluid analyses aiming at biomarkers discoveries: starting from identification of biomarker candidates by shotgun quantitative proteomics or protein arrays, through verification of potential biomarkers by targeted mass spectrometry, to the antibody-based validation of biomarkers. The importance of body fluids as a rich source of biomarkers is discussed. Quantitative proteomics is a challenging part of proteomics applications. The body fluids collected by non-invasive means have high relevance in medicine; they are good sources for biomarkers used in establishing the diagnosis, follow up of disease progression and predicting high risk groups. The review presents the most widely used quantitative proteomics techniques in body fluid analysis and lists the potential biomarkers identified in tears, saliva, sweat, nasal mucus and urine for local and systemic diseases. Copyright © 2016 Elsevier B.V. All rights reserved.
Scheltema, Richard A; Mann, Matthias
2012-06-01
With the advent of high-throughput mass spectrometry (MS)-based proteomics, the magnitude and complexity of the performed experiments has increased dramatically. Likewise, investments in chromatographic and MS instrumentation are a large proportion of the budget of proteomics laboratories. Guarding measurement quality and maximizing uptime of the LC-MS/MS systems therefore requires constant care despite automated workflows. We describe a real-time surveillance system, called SprayQc, that continuously monitors the status of the peripheral equipment to ensure that operational parameters are within an acceptable range. SprayQc is composed of multiple plug-in software components that use computer vision to analyze electrospray conditions, monitor the chromatographic device for stable backpressure, interact with a column oven to control pressure by temperature, and ensure that the mass spectrometer is still acquiring data. Action is taken when a failure condition has been detected, such as stopping the column oven and the LC flow, as well as automatically notifying the appropriate operator. Additionally, all defined metrics can be recorded synchronized on retention time with the MS acquisition file, allowing for later inspection and providing valuable information for optimization. SprayQc has been extensively tested in our laboratory, supports third-party plug-in development, and is freely available for download from http://sourceforge.org/projects/sprayqc .
Elamin, Ashraf; Titz, Bjoern; Dijon, Sophie; Merg, Celine; Geertz, Marcel; Schneider, Thomas; Martin, Florian; Schlage, Walter K; Frentzel, Stefan; Talamo, Fabio; Phillips, Blaine; Veljkovic, Emilija; Ivanov, Nikolai V; Vanscheeuwijck, Patrick; Peitsch, Manuel C; Hoeng, Julia
2016-08-11
Smoking is associated with several serious diseases, such as lung cancer and chronic obstructive pulmonary disease (COPD). Within our systems toxicology framework, we are assessing whether potential modified risk tobacco products (MRTP) can reduce smoking-related health risks compared to conventional cigarettes. In this article, we evaluated to what extent 2D-PAGE/MALDI MS/MS (2D-PAGE) can complement the iTRAQ LC-MS/MS results from a previously reported mouse inhalation study, in which we assessed a prototypic MRTP (pMRTP). Selected differentially expressed proteins identified by both LC-MS/MS and 2D-PAGE approaches were further verified using reverse-phase protein microarrays. LC-MS/MS captured the effects of cigarette smoke (CS) on the lung proteome more comprehensively than 2D-PAGE. However, an integrated analysis of both proteomics data sets showed that 2D-PAGE data complement the LC-MS/MS results by supporting the overall trend of lower effects of pMRTP aerosol than CS on the lung proteome. Biological effects of CS exposure supported by both methods included increases in immune-related, surfactant metabolism, proteasome, and actin cytoskeleton protein clusters. Overall, while 2D-PAGE has its value, especially as a complementary method for the analysis of effects on intact proteins, LC-MS/MS approaches will likely be the method of choice for proteome analysis in systems toxicology investigations. Quantitative proteomics is anticipated to play a growing role within systems toxicology assessment frameworks in the future. To further understand how different proteomics technologies can contribute to toxicity assessment, we conducted a quantitative proteomics analysis using 2D-PAGE and isobaric tag-based LC-MS/MS approaches and compared the results produced from the 2 approaches. Using a prototypic modified risk tobacco product (pMRTP) as our test item, we show compared with cigarette smoke, how 2D-PAGE results can complement and support LC-MS/MS data, demonstrating the much lower effects of pMRTP aerosol than cigarette smoke on the mouse lung proteome. The combined analysis of 2D-PAGE and LC-MS/MS data identified an effect of cigarette smoke on the proteasome and actin cytoskeleton in the lung. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Lavallée-Adam, Mathieu; Rauniyar, Navin; McClatchy, Daniel B; Yates, John R
2014-12-05
The majority of large-scale proteomics quantification methods yield long lists of quantified proteins that are often difficult to interpret and poorly reproduced. Computational approaches are required to analyze such intricate quantitative proteomics data sets. We propose a statistical approach to computationally identify protein sets (e.g., Gene Ontology (GO) terms) that are significantly enriched with abundant proteins with reproducible quantification measurements across a set of replicates. To this end, we developed PSEA-Quant, a protein set enrichment analysis algorithm for label-free and label-based protein quantification data sets. It offers an alternative approach to classic GO analyses, models protein annotation biases, and allows the analysis of samples originating from a single condition, unlike analogous approaches such as GSEA and PSEA. We demonstrate that PSEA-Quant produces results complementary to GO analyses. We also show that PSEA-Quant provides valuable information about the biological processes involved in cystic fibrosis using label-free protein quantification of a cell line expressing a CFTR mutant. Finally, PSEA-Quant highlights the differences in the mechanisms taking place in the human, rat, and mouse brain frontal cortices based on tandem mass tag quantification. Our approach, which is available online, will thus improve the analysis of proteomics quantification data sets by providing meaningful biological insights.
2015-01-01
The majority of large-scale proteomics quantification methods yield long lists of quantified proteins that are often difficult to interpret and poorly reproduced. Computational approaches are required to analyze such intricate quantitative proteomics data sets. We propose a statistical approach to computationally identify protein sets (e.g., Gene Ontology (GO) terms) that are significantly enriched with abundant proteins with reproducible quantification measurements across a set of replicates. To this end, we developed PSEA-Quant, a protein set enrichment analysis algorithm for label-free and label-based protein quantification data sets. It offers an alternative approach to classic GO analyses, models protein annotation biases, and allows the analysis of samples originating from a single condition, unlike analogous approaches such as GSEA and PSEA. We demonstrate that PSEA-Quant produces results complementary to GO analyses. We also show that PSEA-Quant provides valuable information about the biological processes involved in cystic fibrosis using label-free protein quantification of a cell line expressing a CFTR mutant. Finally, PSEA-Quant highlights the differences in the mechanisms taking place in the human, rat, and mouse brain frontal cortices based on tandem mass tag quantification. Our approach, which is available online, will thus improve the analysis of proteomics quantification data sets by providing meaningful biological insights. PMID:25177766
Mei, Suyu; Zhu, Hao
2015-01-26
Protein-protein interaction (PPI) prediction is generally treated as a problem of binary classification wherein negative data sampling is still an open problem to be addressed. The commonly used random sampling is prone to yield less representative negative data with considerable false negatives. Meanwhile rational constraints are seldom exerted on model selection to reduce the risk of false positive predictions for most of the existing computational methods. In this work, we propose a novel negative data sampling method based on one-class SVM (support vector machine, SVM) to predict proteome-wide protein interactions between HTLV retrovirus and Homo sapiens, wherein one-class SVM is used to choose reliable and representative negative data, and two-class SVM is used to yield proteome-wide outcomes as predictive feedback for rational model selection. Computational results suggest that one-class SVM is more suited to be used as negative data sampling method than two-class PPI predictor, and the predictive feedback constrained model selection helps to yield a rational predictive model that reduces the risk of false positive predictions. Some predictions have been validated by the recent literature. Lastly, gene ontology based clustering of the predicted PPI networks is conducted to provide valuable cues for the pathogenesis of HTLV retrovirus.
Vijay, Sonam
2014-01-01
Salivary gland proteins of Anopheles mosquitoes offer attractive targets to understand interactions with sporozoites, blood feeding behavior, homeostasis, and immunological evaluation of malaria vectors and parasite interactions. To date limited studies have been carried out to elucidate salivary proteins of An. stephensi salivary glands. The aim of the present study was to provide detailed analytical attributives of functional salivary gland proteins of urban malaria vector An. stephensi. A proteomic approach combining one-dimensional electrophoresis (1DE), ion trap liquid chromatography mass spectrometry (LC/MS/MS), and computational bioinformatic analysis was adopted to provide the first direct insight into identification and functional characterization of known salivary proteins and novel salivary proteins of An. stephensi. Computational studies by online servers, namely, MASCOT and OMSSA algorithms, identified a total of 36 known salivary proteins and 123 novel proteins analysed by LC/MS/MS. This first report describes a baseline proteomic catalogue of 159 salivary proteins belonging to various categories of signal transduction, regulation of blood coagulation cascade, and various immune and energy pathways of An. stephensi sialotranscriptome by mass spectrometry. Our results may serve as basis to provide a putative functional role of proteins in concept of blood feeding, biting behavior, and other aspects of vector-parasite host interactions for parasite development in anopheline mosquitoes. PMID:25126571
Vijay, Sonam; Rawat, Manmeet; Sharma, Arun
2014-01-01
Salivary gland proteins of Anopheles mosquitoes offer attractive targets to understand interactions with sporozoites, blood feeding behavior, homeostasis, and immunological evaluation of malaria vectors and parasite interactions. To date limited studies have been carried out to elucidate salivary proteins of An. stephensi salivary glands. The aim of the present study was to provide detailed analytical attributives of functional salivary gland proteins of urban malaria vector An. stephensi. A proteomic approach combining one-dimensional electrophoresis (1DE), ion trap liquid chromatography mass spectrometry (LC/MS/MS), and computational bioinformatic analysis was adopted to provide the first direct insight into identification and functional characterization of known salivary proteins and novel salivary proteins of An. stephensi. Computational studies by online servers, namely, MASCOT and OMSSA algorithms, identified a total of 36 known salivary proteins and 123 novel proteins analysed by LC/MS/MS. This first report describes a baseline proteomic catalogue of 159 salivary proteins belonging to various categories of signal transduction, regulation of blood coagulation cascade, and various immune and energy pathways of An. stephensi sialotranscriptome by mass spectrometry. Our results may serve as basis to provide a putative functional role of proteins in concept of blood feeding, biting behavior, and other aspects of vector-parasite host interactions for parasite development in anopheline mosquitoes.
Khorsandi, Shirin Elizabeth; Salehi, Siamak; Cortes, Miriam; Vilca-Melendez, Hector; Menon, Krishna; Srinivasan, Parthi; Prachalias, Andreas; Jassem, Wayel; Heaton, Nigel
2018-02-15
Mitochondria have their own genomic, transcriptomic and proteomic machinery but are unable to be autonomous, needing both nuclear and mitochondrial genomes. The aim of this work was to use computational biology to explore the involvement of Mitochondrial microRNAs (MitomiRs) and their interactions with the mitochondrial proteome in a clinical model of primary non function (PNF) of the donor after cardiac death (DCD) liver. Archival array data on the differential expression of miRNA in DCD PNF was re-analyzed using a number of publically available computational algorithms. 10 MitomiRs were identified of importance in DCD PNF, 7 with predicted interaction of their seed sequence with the mitochondrial transcriptome that included both coding, and non coding areas of the hypervariability region 1 (HVR1) and control region. Considering miRNA regulation of the nuclear encoded mitochondrial proteome, 7 hypothetical small proteins were identified with homolog function that ranged from co-factor for formation of ATP Synthase, REDOX balance and an importin/exportin protein. In silico, unconventional seed interactions, both non canonical and alternative seed sites, appear to be of greater importance in MitomiR regulation of the mitochondrial genome. Additionally, a number of novel small proteins of relevance in transplantation have been identified which need further characterization.
The Modular Organization of Protein Interactions in Escherichia coli
Peregrín-Alvarez, José M.; Xiong, Xuejian; Su, Chong; Parkinson, John
2009-01-01
Escherichia coli serves as an excellent model for the study of fundamental cellular processes such as metabolism, signalling and gene expression. Understanding the function and organization of proteins within these processes is an important step towards a ‘systems’ view of E. coli. Integrating experimental and computational interaction data, we present a reliable network of 3,989 functional interactions between 1,941 E. coli proteins (∼45% of its proteome). These were combined with a recently generated set of 3,888 high-quality physical interactions between 918 proteins and clustered to reveal 316 discrete modules. In addition to known protein complexes (e.g., RNA and DNA polymerases), we identified modules that represent biochemical pathways (e.g., nitrate regulation and cell wall biosynthesis) as well as batteries of functionally and evolutionarily related processes. To aid the interpretation of modular relationships, several case examples are presented, including both well characterized and novel biochemical systems. Together these data provide a global view of the modular organization of the E. coli proteome and yield unique insights into structural and evolutionary relationships in bacterial networks. PMID:19798435
Bioinformatics in proteomics: application, terminology, and pitfalls.
Wiemer, Jan C; Prokudin, Alexander
2004-01-01
Bioinformatics applies data mining, i.e., modern computer-based statistics, to biomedical data. It leverages on machine learning approaches, such as artificial neural networks, decision trees and clustering algorithms, and is ideally suited for handling huge data amounts. In this article, we review the analysis of mass spectrometry data in proteomics, starting with common pre-processing steps and using single decision trees and decision tree ensembles for classification. Special emphasis is put on the pitfall of overfitting, i.e., of generating too complex single decision trees. Finally, we discuss the pros and cons of the two different decision tree usages.
EPA'S TOXICOGENOMICS PARTNERSHIPS ACROSS GOVERNMENT, ACADEMIA AND INDUSTRY
Genomics, proteomics and metabonomics technologies are transforming the science of toxicology, and concurrent advances in computing and informatics are providing management and analysis solutions for this onslaught of toxicogenomic data. EPA has been actively developing an intra...
Computer Simulation of the Virulome of Bacillus anthracis Using Proteomics
2006-07-31
hypothetical protein gi|47526566 spermidine /putrescine ABC transporter, spermidine /putrescine-binding protein gi|47526625 oligoendopeptidase F, putative gi...glutamyl-trna(gln) amidotransferase, a subunit x gi|50196927 aspartate aminotransferase x gi|50196970 spermidine synthase x
Mass spectrometry based proteomics: existing capabilities and future directions
Angel, Thomas E.; Aryal, Uma K.; Hengel, Shawna M.; Baker, Erin S.; Kelly, Ryan T.; Robinson, Errol W.; Smith, Richard D.
2012-01-01
Mass spectrometry (MS)-based proteomics is emerging as a broadly effective means for identification, characterization, and quantification of proteins that are integral components of the processes essential for life. Characterization of proteins at the proteome and sub-proteome (e.g., the phosphoproteome, proteoglycome, or degradome/peptidome) levels provides a foundation for understanding fundamental aspects of biology. Emerging technologies such as ion mobility separations coupled with MS and microchip-based-proteome measurements combined with MS instrumentation and chromatographic separation techniques, such as nanoscale reversed phase liquid chromatography and capillary electrophoresis, show great promise for both broad undirected and targeted highly sensitive measurements. MS-based proteomics is increasingly contribute to our understanding of the dynamics, interactions, and roles that proteins and peptides play, advancing our understanding of biology on a systems wide level for a wide range of applications including investigations of microbial communities, bioremediation, and human health. PMID:22498958
Quantitative proteomics in biological research.
Wilm, Matthias
2009-10-01
Proteomics has enabled the direct investigation of biological material, at first through the analysis of individual proteins, then of lysates from cell cultures, and finally of extracts from tissues and biopsies from entire organisms. Its latest manifestation - quantitative proteomics - allows deeper insight into biological systems. This article reviews the different methods used to extract quantitative information from mass spectra. It follows the technical developments aimed toward global proteomics, the attempt to characterize every expressed protein in a cell by at least one peptide. When applications of the technology are discussed, the focus is placed on yeast biology. In particular, differential quantitative proteomics, the comparison between an experiment and its control, is very discriminating for proteins involved in the process being studied. When trying to understand biological processes on a molecular level, differential quantitative proteomics tends to give a clearer picture than global transcription analyses. As a result, MS has become an even more indispensable tool for biochemically motivated biological research.
Proteomic analysis of albumin and globulin fractions of pea (Pisum sativum L.) seeds.
Dziuba, Jerzy; Szerszunowicz, Iwona; Nałęcz, Dorota; Dziuba, Marta
2014-01-01
Proteomic analysis is emerging as a highly useful tool in food research, including studies of food allergies. Two-dimensional gel electrophoresis involving isoelectric focusing and sodium dodecyl sulfate polyacrylamide gel electrophoresis is the most effective method of separating hundreds or even thousands of proteins. In this study, albumin and globulin tractions of pea seeds cv. Ramrod were subjected to proteomic analysis. Selected potentially alergenic proteins were identified based on their molecular weights and isoelectric points. Pea seeds (Pisum sativum L.) cv. Ramrod harvested over a period of two years (Plant Breeding Station in Piaski-Szelejewo) were used in the experiment. The isolated albumins, globulins and legumin and vicilin fractions of globulins were separated by two-dimensional gel electrophoresis. Proteomic images were analysed in the ImageMaster 2D Platinum program with the use of algorithms from the Melanie application. The relative content, isoelectric points and molecular weights were computed for all identified proteins. Electrophoregrams were analysed by matching spot positions from three independent replications. The proteomes of albumins, globulins and legumin and vicilin fractions of globulins produced up to several hundred spots (proteins). Spots most characteristic of a given fraction were identified by computer analysis and spot matching. The albumin proteome accumulated spots of relatively high intensity over a broad range of pi values of ~4.2-8.1 in 3 molecular weight (MW) ranges: I - high molecular-weight albumins with MW of ~50-110 kDa, II - average molecular-weight albumins with MW of ~20-35 kDa, and III - low molecular-weight albumins with MW of ~13-17 kDa. 2D gel electrophoregrams revealed the presence of 81 characteristic spots, including 24 characteristic of legumin and 14 - of vicilin. Two-dimensional gel electrophoresis proved to be a useful tool for identifying pea proteins. Patterns of spots with similar isoelectric points and different molecular weights or spots with different isoelectric points and similar molecular weights play an important role in proteome analysis. The regions characteristic of albumin, globulin and legumin and vicilin fractions of globulin with typical MW and pi values were identified as the results of performed 2D electrophoretic separations of pea proteins. 2D gel electrophoresis of albumins and the vicilin fraction of globulins revealed the presence of 4 and 2 spots, respectively, representing potentially allergenic proteins. They probably corresponded to vicilin fragments synthesized during post-translational modification of the analysed protein.
Virtual Labs in proteomics: new E-learning tools.
Ray, Sandipan; Koshy, Nicole Rachel; Reddy, Panga Jaipal; Srivastava, Sanjeeva
2012-05-17
Web-based educational resources have gained enormous popularity recently and are increasingly becoming a part of modern educational systems. Virtual Labs are E-learning platforms where learners can gain the experience of practical experimentation without any direct physical involvement on real bench work. They use computerized simulations, models, videos, animations and other instructional technologies to create interactive content. Proteomics being one of the most rapidly growing fields of the biological sciences is now an important part of college and university curriculums. Consequently, many E-learning programs have started incorporating the theoretical and practical aspects of different proteomic techniques as an element of their course work in the form of Video Lectures and Virtual Labs. To this end, recently we have developed a Virtual Proteomics Lab at the Indian Institute of Technology Bombay, which demonstrates different proteomics techniques, including basic and advanced gel and MS-based protein separation and identification techniques, bioinformatics tools and molecular docking methods, and their applications in different biological samples. This Tutorial will discuss the prominent Virtual Labs featuring proteomics content, including the Virtual Proteomics Lab of IIT-Bombay, and E-resources available for proteomics study that are striving to make proteomic techniques and concepts available and accessible to the student and research community. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 14). Details can be found at: http://www.proteomicstutorials.org/. Copyright © 2012 Elsevier B.V. All rights reserved.
Application of proteomics to ecology and population biology.
Karr, T L
2008-02-01
Proteomics is a relatively new scientific discipline that merges protein biochemistry, genome biology and bioinformatics to determine the spatial and temporal expression of proteins in cells, tissues and whole organisms. There has been very little application of proteomics to the fields of behavioral genetics, evolution, ecology and population dynamics, and has only recently been effectively applied to the closely allied fields of molecular evolution and genetics. However, there exists considerable potential for proteomics to impact in areas related to functional ecology; this review will introduce the general concepts and methodologies that define the field of proteomics and compare and contrast the advantages and disadvantages with other methods. Examples of how proteomics can aid, complement and indeed extend the study of functional ecology will be discussed including the main tool of ecological studies, population genetics with an emphasis on metapopulation structure analysis. Because proteomic analyses provide a direct measure of gene expression, it obviates some of the limitations associated with other genomic approaches, such as microarray and EST analyses. Likewise, in conjunction with associated bioinformatics and molecular evolutionary tools, proteomics can provide the foundation of a systems-level integration approach that can enhance ecological studies. It can be envisioned that proteomics will provide important new information on issues specific to metapopulation biology and adaptive processes in nature. A specific example of the application of proteomics to sperm ageing is provided to illustrate the potential utility of the approach.
Machine learning applications in proteomics research: how the past can boost the future.
Kelchtermans, Pieter; Bittremieux, Wout; De Grave, Kurt; Degroeve, Sven; Ramon, Jan; Laukens, Kris; Valkenborg, Dirk; Barsnes, Harald; Martens, Lennart
2014-03-01
Machine learning is a subdiscipline within artificial intelligence that focuses on algorithms that allow computers to learn solving a (complex) problem from existing data. This ability can be used to generate a solution to a particularly intractable problem, given that enough data are available to train and subsequently evaluate an algorithm on. Since MS-based proteomics has no shortage of complex problems, and since publicly available data are becoming available in ever growing amounts, machine learning is fast becoming a very popular tool in the field. We here therefore present an overview of the different applications of machine learning in proteomics that together cover nearly the entire wet- and dry-lab workflow, and that address key bottlenecks in experiment planning and design, as well as in data processing and analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Time, space, and disorder in the expanding proteome universe.
Minde, David-Paul; Dunker, A Keith; Lilley, Kathryn S
2017-04-01
Proteins are highly dynamic entities. Their myriad functions require specific structures, but proteins' dynamic nature ranges all the way from the local mobility of their amino acid constituents to mobility within and well beyond single cells. A truly comprehensive view of the dynamic structural proteome includes: (i) alternative sequences, (ii) alternative conformations, (iii) alternative interactions with a range of biomolecules, (iv) cellular localizations, (v) alternative behaviors in different cell types. While these aspects have traditionally been explored one protein at a time, we highlight recently emerging global approaches that accelerate comprehensive insights into these facets of the dynamic nature of protein structure. Computational tools that integrate and expand on multiple orthogonal data types promise to enable the transition from a disjointed list of static snapshots to a structurally explicit understanding of the dynamics of cellular mechanisms. © 2017 The Authors. Proteomics Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Vrana, Julie A.; Theis, Jason D.; Dasari, Surendra; Mereuta, Oana M.; Dispenzieri, Angela; Zeldenrust, Steven R.; Gertz, Morie A.; Kurtin, Paul J.; Grogg, Karen L.; Dogan, Ahmet
2014-01-01
Examination of abdominal subcutaneous fat aspirates is a practical, sensitive and specific method for the diagnosis of systemic amyloidosis. Here we describe the development and implementation of a clinical assay using mass spectrometry-based proteomics to type amyloidosis in subcutaneous fat aspirates. First, we validated the assay comparing amyloid-positive (n=43) and -negative (n=26) subcutaneous fat aspirates. The assay classified amyloidosis with 88% sensitivity and 96% specificity. We then implemented the assay as a clinical test, and analyzed 366 amyloid-positive subcutaneous fat aspirates in a 4-year period as part of routine clinical care. The assay had a sensitivity of 90%, and diverse amyloid types, including immunoglobulin light chain (74%), transthyretin (13%), serum amyloid A (%1), gelsolin (1%), and lysozyme (1%), were identified. Using bioinformatics, we identified a universal amyloid proteome signature, which has high sensitivity and specificity for amyloidosis similar to that of Congo red staining. We curated proteome databases which included variant proteins associated with systemic amyloidosis, and identified clonotypic immunoglobulin variable gene usage in immunoglobulin light chain amyloidosis, and the variant peptides in hereditary transthyretin amyloidosis. In conclusion, mass spectrometry-based proteomic analysis of subcutaneous fat aspirates offers a powerful tool for the diagnosis and typing of systemic amyloidosis. The assay reveals the underlying pathogenesis by identifying variable gene usage in immunoglobulin light chains and the variant peptides in hereditary amyloidosis. PMID:24747948
The role of targeted chemical proteomics in pharmacology
Sutton, Chris W
2012-01-01
Traditionally, proteomics is the high-throughput characterization of the global complement of proteins in a biological system using cutting-edge technologies (robotics and mass spectrometry) and bioinformatics tools (Internet-based search engines and databases). As the field of proteomics has matured, a diverse range of strategies have evolved to answer specific problems. Chemical proteomics is one such direction that provides the means to enrich and detect less abundant proteins (the ‘hidden’ proteome) from complex mixtures of wide dynamic range (the ‘deep’ proteome). In pharmacology, chemical proteomics has been utilized to determine the specificity of drugs and their analogues, for anticipated known targets, only to discover other proteins that bind and could account for side effects observed in preclinical and clinical trials. As a consequence, chemical proteomics provides a valuable accessory in refinement of second- and third-generation drug design for treatment of many diseases. However, determining definitive affinity capture of proteins by a drug immobilized on soft gel chromatography matrices has highlighted some of the challenges that remain to be addressed. Examples of the different strategies that have emerged using well-established drugs against pharmaceutically important enzymes, such as protein kinases, metalloproteases, PDEs, cytochrome P450s, etc., indicate the potential opportunity to employ chemical proteomics as an early-stage screening approach in the identification of new targets. PMID:22074351
Proteomic profiles in hyperandrogenic syndromes.
Misiti, S; Stigliano, A; Borro, M; Gentile, G; Michienzi, S; Cerquetti, L; Bucci, B; Argese, N; Brunetti, E; Simmaco, M; Toscano, V
2010-03-01
Polycystic ovary syndrome (PCOS) and congenital adrenal hyperplasia (CAH) represent the most common causes of hyperandrogenism. Although the etiopathogeneses of these syndromes are different, they share many clinical and biochemical signs, such as hirsutism, acne, and chronic anovulation. Experimental data have shown that peripheral T-lymphocytes function as molecular sensors, being able to record molecular signals either at staminal and mature cell levels, or hormones at systemic levels. Twenty PCOS women and 10 CAH with 21-hydroxylase deficiency, aged between 18-35 yr, were studied. T-cells purified from all patients and 20 healthy donors have been analyzed by 2-dimensional gel electrophoresis. Silver-stained proteomic map of each patient was compared with a control map obtained by pooling protein samples of the 20 healthy subjects. Spots of interest were identified by peptide mass fingerprint. Computer analysis evidenced several peptidic spots significantly modulated in all patients examined. Some proteins were modulated in both syndromes, others only in PCOS or in CAH. These proteins are involved in many physiological processes as the functional state of immune system, the regulation of the cytoskeleton structure, the oxidative stress, the coagulation process, and the insulin resistance. Identification of the physiological function of these proteins could help to understand ethiopathogenetic mechanisms of hyperandrogenic syndromes and its complications.
Vivek-Ananth, R P; Mohanraj, Karthikeyan; Vandanashree, Muralidharan; Jhingran, Anupam; Craig, James P; Samal, Areejit
2018-04-26
Aspergillus fumigatus and multiple other Aspergillus species cause a wide range of lung infections, collectively termed aspergillosis. Aspergilli are ubiquitous in environment with healthy immune systems routinely eliminating inhaled conidia, however, Aspergilli can become an opportunistic pathogen in immune-compromised patients. The aspergillosis mortality rate and emergence of drug-resistance reveals an urgent need to identify novel targets. Secreted and cell membrane proteins play a critical role in fungal-host interactions and pathogenesis. Using a computational pipeline integrating data from high-throughput experiments and bioinformatic predictions, we have identified secreted and cell membrane proteins in ten Aspergillus species known to cause aspergillosis. Small secreted and effector-like proteins similar to agents of fungal-plant pathogenesis were also identified within each secretome. A comparison with humans revealed that at least 70% of Aspergillus secretomes have no sequence similarity with the human proteome. An analysis of antigenic qualities of Aspergillus proteins revealed that the secretome is significantly more antigenic than cell membrane proteins or the complete proteome. Finally, overlaying an expression dataset, four A. fumigatus proteins upregulated during infection and with available structures, were found to be structurally similar to known drug target proteins in other organisms, and were able to dock in silico with the respective drug.
Noukakis, Dimitrios; Gadola, Stephan; Stöcklin, Reto
2005-08-01
How close are we to using proteomics tools in the every day practice of physicians? What are the socio-economical issues our health care system may face with the advent of biomarkers for early diagnosis? How to get the specialists from the various disciplines integrated in proteomics to establish a common understanding of the clinical issues and develop the necessary standards (methods, biochemicals and IT)? These were the kind of questions a panel of specialists tried to answer during the roundtable discussion that took place in Bern during the Swiss Proteomics Society 2004 congress.
Multidimensional proteomics for cell biology.
Larance, Mark; Lamond, Angus I
2015-05-01
The proteome is a dynamic system in which each protein has interconnected properties - dimensions - that together contribute to the phenotype of a cell. Measuring these properties has proved challenging owing to their diversity and dynamic nature. Advances in mass spectrometry-based proteomics now enable the measurement of multiple properties for thousands of proteins, including their abundance, isoform expression, turnover rate, subcellular localization, post-translational modifications and interactions. Complementing these experimental developments are new data analysis, integration and visualization tools as well as data-sharing resources. Together, these advances in the multidimensional analysis of the proteome are transforming our understanding of various cellular and physiological processes.
The quest of the human proteome and the missing proteins: digging deeper.
Reddy, Panga Jaipal; Ray, Sandipan; Srivastava, Sanjeeva
2015-05-01
Given the diverse range of transcriptional and post-transcriptional mechanisms of gene regulation, the estimates of the human proteome is likely subject to scientific surprises as the field of proteomics has gained momentum worldwide. In this regard, the establishment of the "Human Proteome Draft" using high-resolution mass spectrometry (MS), tissue microarrays, and immunohistochemistry by three independent research groups (laboratories of Pandey, Kuster, and Uhlen) accelerated the pace of proteomics research. The Chromosome Centric Human Proteome Project (C-HPP) has taken initiative towards the completion of the Human Proteome Project (HPP) so as to understand the proteomics correlates of common complex human diseases and biological diversity, not to mention person-to-person and population differences in response to drugs, nutrition, vaccines, and other health interventions and host-environment interactions. Although high-resolution MS-based and antibody microarray approaches have shown enormous promises, we are still unable to map the whole human proteome due to the presence of numerous "missing proteins." In December 2014, at the Indian Institute of Technology Bombay, Mumbai the 6(th) Annual Meeting of the Proteomics Society, India (PSI) and the International Proteomics Conference was held. As part of this interdisciplinary summit, a panel discussion session on "The Quest of the Human Proteome and Missing Proteins" was organized. Eminent scientists in the field of proteomics and systems biology, including Akhilesh Pandey, Gilbert S. Omenn, Mark S. Baker, and Robert L. Mortiz, shed light on different aspects of the human proteome drafts and missing proteins. Importantly, the possible reasons for the "missing proteins" in shotgun MS workflow were identified and debated by experts as low tissue expression, lack of enzymatic digestion site, or protein lost during extraction, among other contributing factors. To capture the missing proteins, the experts' collective view was to study the wider tissue range with multiple digesting enzymes and follow targeted proteomics workflow in particular. On the innovation trajectory from the proteomics laboratory to novel proteomics diagnostics and therapeutics in society, we will also need new conceptual frames for translation science and innovation strategy in proteomics. These will embody both technical as well as rigorous social science and humanities considerations to understand the correlates of the proteome from cell to society.
2011-01-01
Background Labeling whole Arabidopsis (Arabidopsis thaliana) plants to high enrichment with 13C for proteomics and metabolomics applications would facilitate experimental approaches not possible by conventional methods. Such a system would use the plant's native capacity for carbon fixation to ubiquitously incorporate 13C from 13CO2 gas. Because of the high cost of 13CO2 it is critical that the design conserve the labeled gas. Results A fully enclosed automated plant growth enclosure has been designed and assembled where the system simultaneously monitors humidity, temperature, pressure and 13CO2 concentration with continuous adjustment of humidity, pressure and 13CO2 levels controlled by a computer running LabView software. The enclosure is mounted on a movable cart for mobility among growth environments. Arabidopsis was grown in the enclosure for up to 8 weeks and obtained on average >95 atom% enrichment for small metabolites, such as amino acids and >91 atom% for large metabolites, including proteins and peptides. Conclusion The capability of this labeling system for isotope dilution experiments was demonstrated by evaluation of amino acid turnover using GC-MS as well as protein turnover using LC-MS/MS. Because this 'open source' Arabidopsis 13C-labeling growth environment was built using readily available materials and software, it can be adapted easily to accommodate many different experimental designs. PMID:21310072
Comparative Testis Tissue Proteomics Using 2-Dye Versus 3-Dye DIGE Analysis.
Holland, Ashling
2018-01-01
Comparative tissue proteomics aims to analyze alterations of the proteome in response to a stimulus. Two-dimensional difference gel electrophoresis (2D-DIGE) is a modified and advanced form of 2D gel electrophoresis. DIGE is a powerful biochemical method that compares two or three protein samples on the same analytical gel, and can be used to establish differentially expressed protein levels between healthy normal and diseased pathological tissue sample groups. Minimal DIGE labeling can be used via a 2-dye system with Cy3 and Cy5 or a 3-dye system with Cy2, Cy3, and Cy5 to fluorescently label samples with CyDye flours pre-electrophoresis. DIGE circumvents gel-to-gel variability by multiplexing samples to a single gel and through the use of a pooled internal standard for normalization. This form of quantitative high-resolution proteomics facilitates the comparative analysis and evaluation of tissue protein compositions. Comparing tissue groups under different conditions is crucially important for advancing the biomedical field by characterization of cellular processes, understanding pathophysiological development and tissue biomarker discovery. This chapter discusses 2D-DIGE as a comparative tissue proteomic technique and describes in detail the experimental steps required for comparative proteomic analysis employing both options of 2-dye and 3-dye DIGE minimal labeling.
Morisawa, Hiraku; Hirota, Mikako; Toda, Tosifusa
2006-01-01
Background In the post-genome era, most research scientists working in the field of proteomics are confronted with difficulties in management of large volumes of data, which they are required to keep in formats suitable for subsequent data mining. Therefore, a well-developed open source laboratory information management system (LIMS) should be available for their proteomics research studies. Results We developed an open source LIMS appropriately customized for 2-D gel electrophoresis-based proteomics workflow. The main features of its design are compactness, flexibility and connectivity to public databases. It supports the handling of data imported from mass spectrometry software and 2-D gel image analysis software. The LIMS is equipped with the same input interface for 2-D gel information as a clickable map on public 2DPAGE databases. The LIMS allows researchers to follow their own experimental procedures by reviewing the illustrations of 2-D gel maps and well layouts on the digestion plates and MS sample plates. Conclusion Our new open source LIMS is now available as a basic model for proteome informatics, and is accessible for further improvement. We hope that many research scientists working in the field of proteomics will evaluate our LIMS and suggest ways in which it can be improved. PMID:17018156
The Pig PeptideAtlas: A resource for systems biology in animal production and biomedicine.
Hesselager, Marianne O; Codrea, Marius C; Sun, Zhi; Deutsch, Eric W; Bennike, Tue B; Stensballe, Allan; Bundgaard, Louise; Moritz, Robert L; Bendixen, Emøke
2016-02-01
Biological research of Sus scrofa, the domestic pig, is of immediate relevance for food production sciences, and for developing pig as a model organism for human biomedical research. Publicly available data repositories play a fundamental role for all biological sciences, and protein data repositories are in particular essential for the successful development of new proteomic methods. Cumulative proteome data repositories, including the PeptideAtlas, provide the means for targeted proteomics, system-wide observations, and cross-species observational studies, but pigs have so far been underrepresented in existing repositories. We here present a significantly improved build of the Pig PeptideAtlas, which includes pig proteome data from 25 tissues and three body fluid types mapped to 7139 canonical proteins. The content of the Pig PeptideAtlas reflects actively ongoing research within the veterinary proteomics domain, and this article demonstrates how the expression of isoform-unique peptides can be observed across distinct tissues and body fluids. The Pig PeptideAtlas is a unique resource for use in animal proteome research, particularly biomarker discovery and for preliminary design of SRM assays, which are equally important for progress in research that supports farm animal production and veterinary health, as for developing pig models with relevance to human health research. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The Pig PeptideAtlas: a resource for systems biology in animal production and biomedicine
Hesselager, Marianne O.; Codrea, Marius C.; Sun, Zhi; Deutsch, Eric W.; Bennike, Tue B.; Stensballe, Allan; Bundgaard, Louise; Moritz, Robert L.; Bendixen, Emøke
2016-01-01
Biological research of Sus scrofa, the domestic pig, is of immediate relevance for food production sciences, and for developing pig as a model organism for human biomedical research. Publicly available data repositories play a fundamental role for all biological sciences, and protein data repositories are in particular essential for the successful development of new proteomic methods. Cumulative proteome data repositories, including the PeptideAtlas, provide the means for targeted proteomics, system wide observations, and cross species observational studies, but pigs have so far been underrepresented in existing repositories. We here present a significantly improved build of the Pig PeptideAtlas, which includes pig proteome data from 25 tissues and three body fluid types mapped to 7139 canonical proteins. The content of the Pig PeptideAtlas reflects actively ongoing research within the veterinary proteomics domain, and this manuscript demonstrates how the expression of isoform-unique peptides can be observed across distinct tissues and body fluids. The Pig PeptideAtlas is a unique resource for use in animal proteome research, particularly biomarker discovery and for preliminary design of SRM assays, which are equally important for progress in research that supports farm animal production and veterinary health, as for developing pig models with relevance to human health research. PMID:26699206
Detergents: Friends not foes for high-performance membrane proteomics toward precision medicine.
Zhang, Xi
2017-02-01
Precision medicine, particularly therapeutics, emphasizes the atomic-precise, dynamic, and systems visualization of human membrane proteins and their endogenous modifiers. For years, bottom-up proteomics has grappled with removing and avoiding detergents, yet faltered at the therapeutic-pivotal membrane proteins, which have been tackled by classical approaches and are known for decades refractory to single-phase aqueous or organic denaturants. Hydrophobicity and aggregation commonly challenge tissue and cell lysates, biofluids, and enriched samples. Frequently, expected membrane proteins and peptides are not identified by shotgun bottom-up proteomics, let alone robust quantitation. This review argues the cause of this proteomic crisis is not detergents per se, but the choice of detergents. Recently, inclusion of compatible detergents for membrane protein extraction and digestion has revealed stark improvements in both quantitative and structural proteomics. This review analyzes detergent properties behind recent proteomic advances, and proposes that rational use of detergents may reconcile outstanding membrane proteomics dilemmas, enabling ultradeep coverage and minimal artifacts for robust protein and endogenous PTM measurements. The simplicity of detergent tools confers bottom-up membrane proteomics the sophistication toward precision medicine. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Comparative bioinformatics analyses and profiling of lysosome-related organelle proteomes
NASA Astrophysics Data System (ADS)
Hu, Zhang-Zhi; Valencia, Julio C.; Huang, Hongzhan; Chi, An; Shabanowitz, Jeffrey; Hearing, Vincent J.; Appella, Ettore; Wu, Cathy
2007-01-01
Complete and accurate profiling of cellular organelle proteomes, while challenging, is important for the understanding of detailed cellular processes at the organelle level. Mass spectrometry technologies coupled with bioinformatics analysis provide an effective approach for protein identification and functional interpretation of organelle proteomes. In this study, we have compiled human organelle reference datasets from large-scale proteomic studies and protein databases for seven lysosome-related organelles (LROs), as well as the endoplasmic reticulum and mitochondria, for comparative organelle proteome analysis. Heterogeneous sources of human organelle proteins and rodent homologs are mapped to human UniProtKB protein entries based on ID and/or peptide mappings, followed by functional annotation and categorization using the iProXpress proteomic expression analysis system. Cataloging organelle proteomes allows close examination of both shared and unique proteins among various LROs and reveals their functional relevance. The proteomic comparisons show that LROs are a closely related family of organelles. The shared proteins indicate the dynamic and hybrid nature of LROs, while the unique transmembrane proteins may represent additional candidate marker proteins for LROs. This comparative analysis, therefore, provides a basis for hypothesis formulation and experimental validation of organelle proteins and their functional roles.
Comparative and Quantitative Global Proteomics Approaches: An Overview
Deracinois, Barbara; Flahaut, Christophe; Duban-Deweer, Sophie; Karamanos, Yannis
2013-01-01
Proteomics became a key tool for the study of biological systems. The comparison between two different physiological states allows unravelling the cellular and molecular mechanisms involved in a biological process. Proteomics can confirm the presence of proteins suggested by their mRNA content and provides a direct measure of the quantity present in a cell. Global and targeted proteomics strategies can be applied. Targeted proteomics strategies limit the number of features that will be monitored and then optimise the methods to obtain the highest sensitivity and throughput for a huge amount of samples. The advantage of global proteomics strategies is that no hypothesis is required, other than a measurable difference in one or more protein species between the samples. Global proteomics methods attempt to separate quantify and identify all the proteins from a given sample. This review highlights only the different techniques of separation and quantification of proteins and peptides, in view of a comparative and quantitative global proteomics analysis. The in-gel and off-gel quantification of proteins will be discussed as well as the corresponding mass spectrometry technology. The overview is focused on the widespread techniques while keeping in mind that each approach is modular and often recovers the other. PMID:28250403
Al Kindi, Mahmood A; Colella, Alex D; Chataway, Tim K; Jackson, Michael W; Wang, Jing J; Gordon, Tom P
2016-04-01
The structures of epitopes bound by autoantibodies against RNA-protein complexes have been well-defined over several decades, but little is known of the clonality, immunoglobulin (Ig) variable (V) gene usage and mutational status of the autoantibodies themselves at the level of the secreted (serum) proteome. A novel proteomic workflow is presented based on affinity purification of specific Igs from serum, high-resolution two-dimensional gel electrophoresis, and de novo and database-driven sequencing of V-region proteins by mass spectrometry. Analysis of anti-Ro52/Ro60/La proteomes in primary Sjögren's syndrome (SS) and anti-Sm and anti-ribosomal P proteomes in systemic lupus erythematosus (SLE) has revealed that these antibody responses are dominated by restricted sets of public (shared) clonotypes, consistent with common pathways of production across unrelated individuals. The discovery of shared sets of specific V-region peptides can be exploited for diagnostic biomarkers in targeted mass spectrometry platforms and for tracking and removal of pathogenic clones. Copyright © 2016 Elsevier B.V. All rights reserved.
Weckwerth, Wolfram; Wienkoop, Stefanie; Hoehenwarter, Wolfgang; Egelhofer, Volker; Sun, Xiaoliang
2014-01-01
Genome sequencing and systems biology are revolutionizing life sciences. Proteomics emerged as a fundamental technique of this novel research area as it is the basis for gene function analysis and modeling of dynamic protein networks. Here a complete proteomics platform suited for functional genomics and systems biology is presented. The strategy includes MAPA (mass accuracy precursor alignment; http://www.univie.ac.at/mosys/software.html ) as a rapid exploratory analysis step; MASS WESTERN for targeted proteomics; COVAIN ( http://www.univie.ac.at/mosys/software.html ) for multivariate statistical analysis, data integration, and data mining; and PROMEX ( http://www.univie.ac.at/mosys/databases.html ) as a database module for proteogenomics and proteotypic peptides for targeted analysis. Moreover, the presented platform can also be utilized to integrate metabolomics and transcriptomics data for the analysis of metabolite-protein-transcript correlations and time course analysis using COVAIN. Examples for the integration of MAPA and MASS WESTERN data, proteogenomic and metabolic modeling approaches for functional genomics, phosphoproteomics by integration of MOAC (metal-oxide affinity chromatography) with MAPA, and the integration of metabolomics, transcriptomics, proteomics, and physiological data using this platform are presented. All software and step-by-step tutorials for data processing and data mining can be downloaded from http://www.univie.ac.at/mosys/software.html.
Proteomic and metallomic strategies for understanding the mode of action of anticancer metallodrugs.
Gabbiani, Chiara; Magherini, Francesca; Modesti, Alessandra; Messori, Luigi
2010-05-01
Since the discovery of cisplatin and its introduction in the clinics, metal compounds have been intensely investigated in view of their possible application in cancer therapy. In this frame, a deeper understanding of their mode of action, still rather obscure, might turn crucial for the design and the obtainment of new and better anticancer agents. Due to the extreme complexity of the biological systems, it is now widely accepted that innovative and information-rich methods are absolutely needed to afford such a goal. Recently, both proteomic and metallomic strategies were successfully implemented for the elucidation of specific mechanistic features of anticancer metallodrugs within an innovative "Systems Biology" perspective. Particular attention was paid to the following issues: i) proteomic studies of the molecular basis of platinum resistance; ii) proteomic analysis of cellular responses to cytotoxic metallodrugs; iii) metallomic studies of the transformation and fate of metallodrugs in cellular systems. Notably, those pioneering studies, that are reviewed here, allowed a significant progress in the understanding of the molecular mechanisms of metal based drugs at the cellular level. A further extension of those studies and a closer integration of proteomic and metallomic strategies and technologies might realistically lead to rapid and significant advancements in the mechanistic knowledge of anticancer metallodrugs.
Lam, Maggie P Y; Venkatraman, Vidya; Xing, Yi; Lau, Edward; Cao, Quan; Ng, Dominic C M; Su, Andrew I; Ge, Junbo; Van Eyk, Jennifer E; Ping, Peipei
2016-11-04
Amidst the proteomes of human tissues lie subsets of proteins that are closely involved in conserved pathophysiological processes. Much of biomedical research concerns interrogating disease signature proteins and defining their roles in disease mechanisms. With advances in proteomics technologies, it is now feasible to develop targeted proteomics assays that can accurately quantify protein abundance as well as their post-translational modifications; however, with rapidly accumulating number of studies implicating proteins in diseases, current resources are insufficient to target every protein without judiciously prioritizing the proteins with high significance and impact for assay development. We describe here a data science method to prioritize and expedite assay development on high-impact proteins across research fields by leveraging the biomedical literature record to rank and normalize proteins that are popularly and preferentially published by biomedical researchers. We demonstrate this method by finding priority proteins across six major physiological systems (cardiovascular, cerebral, hepatic, renal, pulmonary, and intestinal). The described method is data-driven and builds upon the collective knowledge of previous publications referenced on PubMed to lend objectivity to target selection. The method and resulting popular protein lists may also be useful for exploring biological processes associated with various physiological systems and research topics, in addition to benefiting ongoing efforts to facilitate the broad translation of proteomics technologies.
Proteomics data exchange and storage: the need for common standards and public repositories.
Jiménez, Rafael C; Vizcaíno, Juan Antonio
2013-01-01
Both the existence of data standards and public databases or repositories have been key factors behind the development of the existing "omics" approaches. In this book chapter we first review the main existing mass spectrometry (MS)-based proteomics resources: PRIDE, PeptideAtlas, GPMDB, and Tranche. Second, we report on the current status of the different proteomics data standards developed by the Proteomics Standards Initiative (PSI): the formats mzML, mzIdentML, mzQuantML, TraML, and PSI-MI XML are then reviewed. Finally, we present an easy way to query and access MS proteomics data in the PRIDE database, as a representative of the existing repositories, using the workflow management system (WMS) tool Taverna. Two different publicly available workflows are explained and described.
Head and neck cancer: proteomic advances and biomarker achievements.
Rezende, Taia Maria Berto; de Souza Freire, Mirna; Franco, Octávio Luiz
2010-11-01
Tumors of the head and neck comprise an important neoplasia group, the incidence of which is increasing in many parts of the world. Recent advances in diagnostic and therapeutic techniques for these lesions have yielded novel molecular targets, uncovered signal pathway dominance, and advanced early cancer detection. Proteomics is a powerful tool for investigating the distribution of proteins and small molecules within biological systems through the analysis of different types of samples. The proteomic profiles of different types of cancer have been studied, and this has provided remarkable advances in cancer understanding. This review covers recent advances for head and neck cancer; it encompasses the risk factors, pathogenesis, proteomic tools that can help in understanding cancer, and new proteomic findings in this type of cancer. Copyright © 2010 American Cancer Society.
van Herwijnen, Martijn J.C.; Zonneveld, Marijke I.; Goerdayal, Soenita; Nolte – 't Hoen, Esther N.M.; Garssen, Johan; Stahl, Bernd; Maarten Altelaar, A.F.; Redegeld, Frank A.; Wauben, Marca H.M.
2016-01-01
Breast milk contains several macromolecular components with distinctive functions, whereby milk fat globules and casein micelles mainly provide nutrition to the newborn, and whey contains molecules that can stimulate the newborn's developing immune system and gastrointestinal tract. Although extracellular vesicles (EV) have been identified in breast milk, their physiological function and composition has not been addressed in detail. EV are submicron sized vehicles released by cells for intercellular communication via selectively incorporated lipids, nucleic acids, and proteins. Because of the difficulty in separating EV from other milk components, an in-depth analysis of the proteome of human milk-derived EV is lacking. In this study, an extensive LC-MS/MS proteomic analysis was performed of EV that had been purified from breast milk of seven individual donors using a recently established, optimized density-gradient-based EV isolation protocol. A total of 1963 proteins were identified in milk-derived EV, including EV-associated proteins like CD9, Annexin A5, and Flotillin-1, with a remarkable overlap between the different donors. Interestingly, 198 of the identified proteins are not present in the human EV database Vesiclepedia, indicating that milk-derived EV harbor proteins not yet identified in EV of different origin. Similarly, the proteome of milk-derived EV was compared with that of other milk components. For this, data from 38 published milk proteomic studies were combined in order to construct the total milk proteome, which consists of 2698 unique proteins. Remarkably, 633 proteins identified in milk-derived EV have not yet been identified in human milk to date. Interestingly, these novel proteins include proteins involved in regulation of cell growth and controlling inflammatory signaling pathways, suggesting that milk-derived EVs could support the newborn's developing gastrointestinal tract and immune system. Overall, this study provides an expansion of the whole milk proteome and illustrates that milk-derived EV are macromolecular components with a unique functional proteome. PMID:27601599
A tutorial in displaying mass spectrometry-based proteomic data using heat maps.
Key, Melissa
2012-01-01
Data visualization plays a critical role in interpreting experimental results of proteomic experiments. Heat maps are particularly useful for this task, as they allow us to find quantitative patterns across proteins and biological samples simultaneously. The quality of a heat map can be vastly improved by understanding the options available to display and organize the data in the heat map. This tutorial illustrates how to optimize heat maps for proteomics data by incorporating known characteristics of the data into the image. First, the concepts used to guide the creating of heat maps are demonstrated. Then, these concepts are applied to two types of analysis: visualizing spectral features across biological samples, and presenting the results of tests of statistical significance. For all examples we provide details of computer code in the open-source statistical programming language R, which can be used for biologists and clinicians with little statistical background. Heat maps are a useful tool for presenting quantitative proteomic data organized in a matrix format. Understanding and optimizing the parameters used to create the heat map can vastly improve both the appearance and the interoperation of heat map data.
Hoehenwarter, Wolfgang; Larhlimi, Abdelhalim; Hummel, Jan; Egelhofer, Volker; Selbig, Joachim; van Dongen, Joost T; Wienkoop, Stefanie; Weckwerth, Wolfram
2011-07-01
Mass Accuracy Precursor Alignment is a fast and flexible method for comparative proteome analysis that allows the comparison of unprecedented numbers of shotgun proteomics analyses on a personal computer in a matter of hours. We compared 183 LC-MS analyses and more than 2 million MS/MS spectra and could define and separate the proteomic phenotypes of field grown tubers of 12 tetraploid cultivars of the crop plant Solanum tuberosum. Protein isoforms of patatin as well as other major gene families such as lipoxygenase and cysteine protease inhibitor that regulate tuber development were found to be the primary source of variability between the cultivars. This suggests that differentially expressed protein isoforms modulate genotype specific tuber development and the plant phenotype. We properly assigned the measured abundance of tryptic peptides to different protein isoforms that share extensive stretches of primary structure and thus inferred their abundance. Peptides unique to different protein isoforms were used to classify the remaining peptides assigned to the entire subset of isoforms based on a common abundance profile using multivariate statistical procedures. We identified nearly 4000 proteins which we used for quantitative functional annotation making this the most extensive study of the tuber proteome to date.
Rudnick, Paul A.; Clauser, Karl R.; Kilpatrick, Lisa E.; Tchekhovskoi, Dmitrii V.; Neta, Pedatsur; Blonder, Nikša; Billheimer, Dean D.; Blackman, Ronald K.; Bunk, David M.; Cardasis, Helene L.; Ham, Amy-Joan L.; Jaffe, Jacob D.; Kinsinger, Christopher R.; Mesri, Mehdi; Neubert, Thomas A.; Schilling, Birgit; Tabb, David L.; Tegeler, Tony J.; Vega-Montoto, Lorenzo; Variyath, Asokan Mulayath; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Carr, Steven A.; Fisher, Susan J.; Gibson, Bradford W.; Paulovich, Amanda G.; Regnier, Fred E.; Rodriguez, Henry; Spiegelman, Cliff; Tempst, Paul; Liebler, Daniel C.; Stein, Stephen E.
2010-01-01
A major unmet need in LC-MS/MS-based proteomics analyses is a set of tools for quantitative assessment of system performance and evaluation of technical variability. Here we describe 46 system performance metrics for monitoring chromatographic performance, electrospray source stability, MS1 and MS2 signals, dynamic sampling of ions for MS/MS, and peptide identification. Applied to data sets from replicate LC-MS/MS analyses, these metrics displayed consistent, reasonable responses to controlled perturbations. The metrics typically displayed variations less than 10% and thus can reveal even subtle differences in performance of system components. Analyses of data from interlaboratory studies conducted under a common standard operating procedure identified outlier data and provided clues to specific causes. Moreover, interlaboratory variation reflected by the metrics indicates which system components vary the most between laboratories. Application of these metrics enables rational, quantitative quality assessment for proteomics and other LC-MS/MS analytical applications. PMID:19837981
Lee, Hangyeore; Mun, Dong-Gi; Bae, Jingi; Kim, Hokeun; Oh, Se Yeon; Park, Young Soo; Lee, Jae-Hyuk; Lee, Sang-Won
2015-08-21
We report a new and simple design of a fully automated dual-online ultra-high pressure liquid chromatography system. The system employs only two nano-volume switching valves (a two-position four port valve and a two-position ten port valve) that direct solvent flows from two binary nano-pumps for parallel operation of two analytical columns and two solid phase extraction (SPE) columns. Despite the simple design, the sDO-UHPLC offers many advantageous features that include high duty cycle, back flushing sample injection for fast and narrow zone sample injection, online desalting, high separation resolution and high intra/inter-column reproducibility. This system was applied to analyze proteome samples not only in high throughput deep proteome profiling experiments but also in high throughput MRM experiments.
Advances in Proteomics Data Analysis and Display Using an Accurate Mass and Time Tag Approach
Zimmer, Jennifer S.D.; Monroe, Matthew E.; Qian, Wei-Jun; Smith, Richard D.
2007-01-01
Proteomics has recently demonstrated utility in understanding cellular processes on the molecular level as a component of systems biology approaches and for identifying potential biomarkers of various disease states. The large amount of data generated by utilizing high efficiency (e.g., chromatographic) separations coupled to high mass accuracy mass spectrometry for high-throughput proteomics analyses presents challenges related to data processing, analysis, and display. This review focuses on recent advances in nanoLC-FTICR-MS-based proteomics approaches and the accompanying data processing tools that have been developed to display and interpret the large volumes of data being produced. PMID:16429408
Kolbert, Zsuzsanna; Feigl, Gábor; Bordé, Ádám; Molnár, Árpád; Erdei, László
2017-04-01
Nitric oxide (NO) and related molecules (reactive nitrogen species) regulate diverse physiological processes mainly through posttranslational modifications such as protein tyrosine nitration (PTN). PTN is a covalent and specific modification of tyrosine (Tyr) residues resulting in altered protein structure and function. In the last decade, great efforts have been made to reveal candidate proteins, target Tyr residues and functional consequences of nitration in plants. This review intends to evaluate the accumulated knowledge about the biochemical mechanism, the structural and functional consequences and the selectivity of plants' protein nitration and also about the decomposition or conversion of nitrated proteins. At the same time, this review emphasizes yet unanswered or uncertain questions such as the reversibility/irreversibility of tyrosine nitration, the involvement of proteasomes in the removal of nitrated proteins or the effect of nitration on Tyr phosphorylation. The different NO producing systems of algae and higher plants raise the possibility of diversely regulated protein nitration. Therefore studying PTN from an evolutionary point of view would enrich our present understanding with novel aspects. Plant proteomic research can be promoted by the application of computational prediction tools such as GPS-YNO 2 and iNitro-Tyr software. Using the reference Arabidopsis proteome, Authors performed in silico analysis of tyrosine nitration in order to characterize plant tyrosine nitroproteome. Nevertheless, based on the common results of the present prediction and previous experiments the most likely nitrated proteins were selected thus recommending candidates for detailed future research. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
EPA SCIENCE FORUM - EPA'S TOXICOGENOMICS PARTNERSHIPS ACROSS GOVERNMENT, ACADEMIA AND INDUSTRY
Over the past decade genomics, proteomics and metabonomics technologies have transformed the science of toxicology, and concurrent advances in computing and informatics have provided management and analysis solutions for this onslaught of toxicogenomic data. EPA has been actively...
Evolving Relevance of Neuroproteomics in Alzheimer's Disease.
Lista, Simone; Zetterberg, Henrik; O'Bryant, Sid E; Blennow, Kaj; Hampel, Harald
2017-01-01
Substantial progress in the understanding of the biology of Alzheimer's disease (AD) has been achieved over the past decades. The early detection and diagnosis of AD and other age-related neurodegenerative diseases, however, remain a challenging scientific frontier. Therefore, the comprehensive discovery (relating to all individual, converging or diverging biochemical disease mechanisms), development, validation, and qualification of standardized biological markers with diagnostic and prognostic functions with a precise performance profile regarding specificity, sensitivity, and positive and negative predictive value are warranted.Methodological innovations in the area of exploratory high-throughput technologies, such as sequencing, microarrays, and mass spectrometry-based analyses of proteins/peptides, have led to the generation of large global molecular datasets from a multiplicity of biological systems, such as biological fluids, cells, tissues, and organs. Such methodological progress has shifted the attention to the execution of hypothesis-independent comprehensive exploratory analyses (opposed to the classical hypothesis-driven candidate approach), with the aim of fully understanding the biological systems in physiology and disease as a whole. The systems biology paradigm integrates experimental biology with accurate and rigorous computational modelling to describe and foresee the dynamic features of biological systems. The use of dynamically evolving technological platforms, including mass spectrometry, in the area of proteomics has enabled to rush the process of biomarker discovery and validation for refining significantly the diagnosis of AD. Currently, proteomics-which is part of the systems biology paradigm-is designated as one of the dominant matured sciences needed for the effective exploratory discovery of prospective biomarker candidates expected to play an effective role in aiding the early detection, diagnosis, prognosis, and therapy development in AD.
Jelonek, Karol; Pietrowska, Monika; Widlak, Piotr
2017-07-01
Blood is the most common replacement tissue used to study systemic responses of organisms to different types of pathological conditions and environmental insults. Local irradiation during cancer radiotherapy induces whole body responses that can be observed at the blood proteome and metabolome levels. Hence, comparative blood proteomics and metabolomics are emerging approaches used in the discovery of radiation biomarkers. These techniques enable the simultaneous measurement of hundreds of molecules and the identification of sets of components that can discriminate different physiological states of the human body. Radiation-induced changes are affected by the dose and volume of irradiated tissues; hence, the molecular composition of blood is a hypothetical source of biomarkers for dose assessment and the prediction and monitoring of systemic responses to radiation. This review aims to provide a comprehensive overview on the available evidence regarding molecular responses to ionizing radiation detected at the level of the human blood proteome and metabolome. It focuses on patients exposed to radiation during cancer radiotherapy and emphasizes effects related to radiation-induced toxicity and inflammation. Systemic responses to radiation detected at the blood proteome and metabolome levels are primarily related to the intensity of radiation-induced toxicity, including inflammatory responses. Thus, several inflammation-associated molecules can be used to monitor or even predict radiation-induced toxicity. However, these abundant molecular features have a rather limited applicability as universal biomarkers for dose assessment, reflecting the individual predisposition of the immune system and tissue-specific mechanisms involved in radiation-induced damage.
Mishra, Bud; Daruwala, Raoul-Sam; Zhou, Yi; Ugel, Nadia; Policriti, Alberto; Antoniotti, Marco; Paxia, Salvatore; Rejali, Marc; Rudra, Archisman; Cherepinsky, Vera; Silver, Naomi; Casey, William; Piazza, Carla; Simeoni, Marta; Barbano, Paolo; Spivak, Marina; Feng, Jiawu; Gill, Ofer; Venkatesh, Mysore; Cheng, Fang; Sun, Bing; Ioniata, Iuliana; Anantharaman, Thomas; Hubbard, E Jane Albert; Pnueli, Amir; Harel, David; Chandru, Vijay; Hariharan, Ramesh; Wigler, Michael; Park, Frank; Lin, Shih-Chieh; Lazebnik, Yuri; Winkler, Franz; Cantor, Charles R; Carbone, Alessandra; Gromov, Mikhael
2003-01-01
We collaborate in a research program aimed at creating a rigorous framework, experimental infrastructure, and computational environment for understanding, experimenting with, manipulating, and modifying a diverse set of fundamental biological processes at multiple scales and spatio-temporal modes. The novelty of our research is based on an approach that (i) requires coevolution of experimental science and theoretical techniques and (ii) exploits a certain universality in biology guided by a parsimonious model of evolutionary mechanisms operating at the genomic level and manifesting at the proteomic, transcriptomic, phylogenic, and other higher levels. Our current program in "systems biology" endeavors to marry large-scale biological experiments with the tools to ponder and reason about large, complex, and subtle natural systems. To achieve this ambitious goal, ideas and concepts are combined from many different fields: biological experimentation, applied mathematical modeling, computational reasoning schemes, and large-scale numerical and symbolic simulations. From a biological viewpoint, the basic issues are many: (i) understanding common and shared structural motifs among biological processes; (ii) modeling biological noise due to interactions among a small number of key molecules or loss of synchrony; (iii) explaining the robustness of these systems in spite of such noise; and (iv) cataloging multistatic behavior and adaptation exhibited by many biological processes.
Will Quantitative Proteomics Redefine Some of the Key Concepts in Skeletal Muscle Physiology?
Gizak, Agnieszka; Rakus, Dariusz
2016-01-11
Molecular and cellular biology methodology is traditionally based on the reasoning called "the mechanistic explanation". In practice, this means identifying and selecting correlations between biological processes which result from our manipulation of a biological system. In theory, a successful application of this approach requires precise knowledge about all parameters of a studied system. However, in practice, due to the systems' complexity, this requirement is rarely, if ever, accomplished. Typically, it is limited to a quantitative or semi-quantitative measurements of selected parameters (e.g., concentrations of some metabolites), and a qualitative or semi-quantitative description of expression/post-translational modifications changes within selected proteins. A quantitative proteomics approach gives a possibility of quantitative characterization of the entire proteome of a biological system, in the context of the titer of proteins as well as their post-translational modifications. This enables not only more accurate testing of novel hypotheses but also provides tools that can be used to verify some of the most fundamental dogmas of modern biology. In this short review, we discuss some of the consequences of using quantitative proteomics to verify several key concepts in skeletal muscle physiology.
Marondedze, Claudius; Wong, Aloysius; Groen, Arnoud; Serrano, Natalia; Jankovic, Boris; Lilley, Kathryn; Gehring, Christoph; Thomas, Ludivine
2014-12-31
The study of proteomes provides new insights into stimulus-specific responses of protein synthesis and turnover, and the role of post-translational modifications at the systems level. Due to the diverse chemical nature of proteins and shortcomings in the analytical techniques used in their study, only a partial display of the proteome is achieved in any study, and this holds particularly true for plant proteomes. Here we show that different solubilization and separation methods have profound effects on the resulting proteome. In particular, we observed that the type of detergents employed in the solubilization buffer preferentially enriches proteins in different functional categories. These include proteins with a role in signaling, transport, response to temperature stimuli and metabolism. This data may offer a functional bias on comparative analysis studies. In order to obtain a broader coverage, we propose a two-step solubilization protocol with first a detergent-free buffer and then a second step utilizing a combination of two detergents to solubilize proteins.
Exploring the Arabidopsis Proteome: Influence of Protein Solubilization Buffers on Proteome Coverage
Marondedze, Claudius; Wong, Aloysius; Groen, Arnoud; Serrano, Natalia; Jankovic, Boris; Lilley, Kathryn; Gehring, Christoph; Thomas, Ludivine
2014-01-01
The study of proteomes provides new insights into stimulus-specific responses of protein synthesis and turnover, and the role of post-translational modifications at the systems level. Due to the diverse chemical nature of proteins and shortcomings in the analytical techniques used in their study, only a partial display of the proteome is achieved in any study, and this holds particularly true for plant proteomes. Here we show that different solubilization and separation methods have profound effects on the resulting proteome. In particular, we observed that the type of detergents employed in the solubilization buffer preferentially enriches proteins in different functional categories. These include proteins with a role in signaling, transport, response to temperature stimuli and metabolism. This data may offer a functional bias on comparative analysis studies. In order to obtain a broader coverage, we propose a two-step solubilization protocol with first a detergent-free buffer and then a second step utilizing a combination of two detergents to solubilize proteins. PMID:25561235
Jiang, Xiaogang; Feng, Shun; Tian, Ruijun; Han, Guanghui; Jiang, Xinning; Ye, Mingliang; Zou, Hanfa
2007-02-01
An approach was developed to automate sample introduction for nanoflow LC-MS/MS (microLC-MS/MS) analysis using a strong cation exchange (SCX) trap column. The system consisted of a 100 microm id x 2 cm SCX trap column and a 75 microm id x 12 cm C18 RP analytical column. During the sample loading step, the flow passing through the SCX trap column was directed to waste for loading a large volume of sample at high flow rate. Then the peptides bound on the SCX trap column were eluted onto the RP analytical column by a high salt buffer followed by RP chromatographic separation of the peptides at nanoliter flow rate. It was observed that higher performance of separation could be achieved with the system using SCX trap column than with the system using C18 trap column. The high proteomic coverage using this approach was demonstrated in the analysis of tryptic digest of BSA and yeast cell lysate. In addition, this system was also applied to two-dimensional separation of tryptic digest of human hepatocellular carcinoma cell line SMMC-7721 for large scale proteome analysis. This system was fully automated and required minimum changes on current microLC-MS/MS system. This system represented a promising platform for routine proteome analysis.
Nesvizhskii, Alexey I.
2010-01-01
This manuscript provides a comprehensive review of the peptide and protein identification process using tandem mass spectrometry (MS/MS) data generated in shotgun proteomic experiments. The commonly used methods for assigning peptide sequences to MS/MS spectra are critically discussed and compared, from basic strategies to advanced multi-stage approaches. A particular attention is paid to the problem of false-positive identifications. Existing statistical approaches for assessing the significance of peptide to spectrum matches are surveyed, ranging from single-spectrum approaches such as expectation values to global error rate estimation procedures such as false discovery rates and posterior probabilities. The importance of using auxiliary discriminant information (mass accuracy, peptide separation coordinates, digestion properties, and etc.) is discussed, and advanced computational approaches for joint modeling of multiple sources of information are presented. This review also includes a detailed analysis of the issues affecting the interpretation of data at the protein level, including the amplification of error rates when going from peptide to protein level, and the ambiguities in inferring the identifies of sample proteins in the presence of shared peptides. Commonly used methods for computing protein-level confidence scores are discussed in detail. The review concludes with a discussion of several outstanding computational issues. PMID:20816881
Characterization, design, and function of the mitochondrial proteome: from organs to organisms.
Lotz, Christopher; Lin, Amanda J; Black, Caitlin M; Zhang, Jun; Lau, Edward; Deng, Ning; Wang, Yueju; Zong, Nobel C; Choi, Jeong H; Xu, Tao; Liem, David A; Korge, Paavo; Weiss, James N; Hermjakob, Henning; Yates, John R; Apweiler, Rolf; Ping, Peipei
2014-02-07
Mitochondria are a common energy source for organs and organisms; their diverse functions are specialized according to the unique phenotypes of their hosting environment. Perturbation of mitochondrial homeostasis accompanies significant pathological phenotypes. However, the connections between mitochondrial proteome properties and function remain to be experimentally established on a systematic level. This uncertainty impedes the contextualization and translation of proteomic data to the molecular derivations of mitochondrial diseases. We present a collection of mitochondrial features and functions from four model systems, including two cardiac mitochondrial proteomes from distinct genomes (human and mouse), two unique organ mitochondrial proteomes from identical genetic codons (mouse heart and mouse liver), as well as a relevant metazoan out-group (drosophila). The data, composed of mitochondrial protein abundance and their biochemical activities, capture the core functionalities of these mitochondria. This investigation allowed us to redefine the core mitochondrial proteome from organs and organisms, as well as the relevant contributions from genetic information and hosting milieu. Our study has identified significant enrichment of disease-associated genes and their products. Furthermore, correlational analyses suggest that mitochondrial proteome design is primarily driven by cellular environment. Taken together, these results connect proteome feature with mitochondrial function, providing a prospective resource for mitochondrial pathophysiology and developing novel therapeutic targets in medicine.
Lehmann, Sylvain; Hoofnagle, Andrew; Hochstrasser, Denis; Brede, Cato; Glueckmann, Matthias; Cocho, José A; Ceglarek, Uta; Lenz, Christof; Vialaret, Jérôme; Scherl, Alexander; Hirtz, Christophe
2013-05-01
Proteomics studies typically aim to exhaustively detect peptides/proteins in a given biological sample. Over the past decade, the number of publications using proteomics methodologies has exploded. This was made possible due to the availability of high-quality genomic data and many technological advances in the fields of microfluidics and mass spectrometry. Proteomics in biomedical research was initially used in 'functional' studies for the identification of proteins involved in pathophysiological processes, complexes and networks. Improved sensitivity of instrumentation facilitated the analysis of even more complex sample types, including human biological fluids. It is at that point the field of clinical proteomics was born, and its fundamental aim was the discovery and (ideally) validation of biomarkers for the diagnosis, prognosis, or therapeutic monitoring of disease. Eventually, it was recognized that the technologies used in clinical proteomics studies [particularly liquid chromatography-tandem mass spectrometry (LC-MS/MS)] could represent an alternative to classical immunochemical assays. Prior to deploying MS in the measurement of peptides/proteins in the clinical laboratory, it seems likely that traditional proteomics workflows and data management systems will need to adapt to the clinical environment and meet in vitro diagnostic (IVD) regulatory constraints. This defines a new field, as reviewed in this article, that we have termed quantitative Clinical Chemistry Proteomics (qCCP).
Lee, Hangyeore; Mun, Dong-Gi; So, Jeong Eun; Bae, Jingi; Kim, Hokeun; Masselon, Christophe; Lee, Sang-Won
2016-12-06
Proteomics aims to achieve complete profiling of the protein content and protein modifications in cells, tissues, and biofluids and to quantitatively determine changes in their abundances. This information serves to elucidate cellular processes and signaling pathways and to identify candidate protein biomarkers and/or therapeutic targets. Analyses must therefore be both comprehensive and efficient. Here, we present a novel online two-dimensional reverse-phase/reverse-phase liquid chromatography separation platform, which is based on a newly developed online noncontiguous fractionating and concatenating device (NCFC fractionator). In bottom-up proteomics analyses of a complex proteome, this system provided significantly improved exploitation of the separation space of the two RPs, considerably increasing the numbers of peptides identified compared to a contiguous 2D-RP/RPLC method. The fully automated online 2D-NCFC-RP/RPLC system bypassed a number of labor-intensive manual processes required with the previously described offline 2D-NCFC RP/RPLC method, and thus, it offers minimal sample loss in a context of highly reproducible 2D-RP/RPLC experiments.
Wiśniewski, Jacek R; Mann, Matthias
2016-07-01
Proteomics and other protein-based analysis methods such as Western blotting all face the challenge of discriminating changes in the levels of proteins of interest from inadvertent changes in the amount loaded for analysis. Mass-spectrometry-based proteomics can now estimate the relative and absolute amounts of thousands of proteins across diverse biological systems. We reasoned that this new technology could prove useful for selection of very stably expressed proteins that could serve as better loading controls than those traditionally employed. Large-scale proteomic analyses of SDS lysates of cultured cells and tissues revealed deglycase DJ-1 as the protein with the lowest variability in abundance among different cell types in human, mouse, and amphibian cells. The protein constitutes 0.069 ± 0.017% of total cellular protein and occurs at a specific concentration of 34.6 ± 8.7 pmol/mg of total protein. Since DJ-1 is ubiquitous and therefore easily detectable with several peptides, it can be helpful in normalization of proteomic data sets. In addition, DJ-1 appears to be an advantageous loading control for Western blot that is superior to those used commonly used, allowing comparisons between tissues and cells originating from evolutionarily distant vertebrate species. Notably, this is not possible by the detection and quantitation of housekeeping proteins, which are often used in the Western blot technique. The approach introduced here can be applied to select the most appropriate loading controls for MS-based proteomics or Western blotting in any biological system.
Tetrazine ligation for chemical proteomics.
Kang, Kyungtae; Park, Jongmin; Kim, Eunha
2016-01-01
Determining small molecule-target protein interaction is essential for the chemical proteomics. One of the most important keys to explore biological system in chemical proteomics field is finding first-class molecular tools. Chemical probes can provide great spatiotemporal control to elucidate biological functions of proteins as well as for interrogating biological pathways. The invention of bioorthogonal chemistry has revolutionized the field of chemical biology by providing superior chemical tools and has been widely used for investigating the dynamics and function of biomolecules in live condition. Among 20 different bioorthogonal reactions, tetrazine ligation has been spotlighted as the most advanced bioorthogonal chemistry because of their extremely faster kinetics and higher specificity than others. Therefore, tetrazine ligation has a tremendous potential to enhance the proteomic research. This review highlights the current status of tetrazine ligation reaction as a molecular tool for the chemical proteomics.
Proteomic contributions to our understanding of vaccine and immune responses
Galassie, Allison C.; Link, Andrew J.
2015-01-01
Vaccines are one of the greatest public health successes; yet, due to the empirical nature of vaccine design, we have an incomplete understanding of how the genes and proteins induced by vaccines contribute to the development of both protective innate and adaptive immune responses. While the advent of genomics has enabled new vaccine development and facilitated understanding of the immune response, proteomics identifies potentially new vaccine antigens with increasing speed and sensitivity. In addition, as proteomics is complementary to transcriptomic approaches, a combination of both approaches provides a more comprehensive view of the immune response after vaccination via systems vaccinology. This review details the advances that proteomic strategies have made in vaccine development and reviews how proteomics contributes to the development of a more complete understanding of human vaccines and immune responses. PMID:26172619
Unraveling snake venom complexity with 'omics' approaches: challenges and perspectives.
Zelanis, André; Tashima, Alexandre Keiji
2014-09-01
The study of snake venom proteomes (venomics) has been experiencing a burst of reports, however the comprehensive knowledge of the dynamic range of proteins present within a single venom, the set of post-translational modifications (PTMs) as well as the lack of a comprehensive database related to venom proteins are among the main challenges in venomics research. The phenotypic plasticity in snake venom proteomes together with their inherent toxin proteoform diversity, points out to the use of integrative analysis in order to better understand their actual complexity. In this regard, such a systems venomics task should encompass the integration of data from transcriptomic and proteomic studies (specially the venom gland proteome), the identification of biological PTMs, and the estimation of artifactual proteomes and peptidomes generated by sample handling procedures. Copyright © 2014 Elsevier Ltd. All rights reserved.
2016 update of the PRIDE database and its related tools
Vizcaíno, Juan Antonio; Csordas, Attila; del-Toro, Noemi; Dianes, José A.; Griss, Johannes; Lavidas, Ilias; Mayer, Gerhard; Perez-Riverol, Yasset; Reisinger, Florian; Ternent, Tobias; Xu, Qing-Wei; Wang, Rui; Hermjakob, Henning
2016-01-01
The PRoteomics IDEntifications (PRIDE) database is one of the world-leading data repositories of mass spectrometry (MS)-based proteomics data. Since the beginning of 2014, PRIDE Archive (http://www.ebi.ac.uk/pride/archive/) is the new PRIDE archival system, replacing the original PRIDE database. Here we summarize the developments in PRIDE resources and related tools since the previous update manuscript in the Database Issue in 2013. PRIDE Archive constitutes a complete redevelopment of the original PRIDE, comprising a new storage backend, data submission system and web interface, among other components. PRIDE Archive supports the most-widely used PSI (Proteomics Standards Initiative) data standard formats (mzML and mzIdentML) and implements the data requirements and guidelines of the ProteomeXchange Consortium. The wide adoption of ProteomeXchange within the community has triggered an unprecedented increase in the number of submitted data sets (around 150 data sets per month). We outline some statistics on the current PRIDE Archive data contents. We also report on the status of the PRIDE related stand-alone tools: PRIDE Inspector, PRIDE Converter 2 and the ProteomeXchange submission tool. Finally, we will give a brief update on the resources under development ‘PRIDE Cluster’ and ‘PRIDE Proteomes’, which provide a complementary view and quality-scored information of the peptide and protein identification data available in PRIDE Archive. PMID:26527722
Nicolaou, Orthodoxia; Kousios, Andreas; Hadjisavvas, Andreas; Lauwerys, Bernard; Sokratous, Kleitos; Kyriacou, Kyriacos
2017-05-01
Advances in mass spectrometry technologies have created new opportunities for discovering novel protein biomarkers in systemic lupus erythematosus (SLE). We performed a systematic review of published reports on proteomic biomarkers identified in SLE patients using mass spectrometry-based proteomics and highlight their potential disease association and clinical utility. Two electronic databases, MEDLINE and EMBASE, were systematically searched up to July 2015. The methodological quality of studies included in the review was performed according to Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines. Twenty-five studies were included in the review, identifying 241 SLE candidate proteomic biomarkers related to various aspects of the disease including disease diagnosis and activity or pinpointing specific organ involvement. Furthermore, 13 of the 25 studies validated their results for a selected number of biomarkers in an independent cohort, resulting in the validation of 28 candidate biomarkers. It is noteworthy that 11 candidate biomarkers were identified in more than one study. A significant number of potential proteomic biomarkers that are related to a number of aspects of SLE have been identified using mass spectrometry proteomic approaches. However, further studies are required to assess the utility of these biomarkers in routine clinical practice. © 2016 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
Koczorowska, Maria Magdalena; Friedemann, Charlotte; Geiger, Klaus; Follo, Marie; Biniossek, Martin Lothar; Schilling, Oliver
2017-12-01
Solid tumors contain various components that together form the tumor microenvironment. Cancer associated fibroblasts (CAFs) are capable of secreting and responding to signaling molecules and growth factors. Due to their role in tumor development, CAFs are considered as potential therapeutic targets. A prominent tumor-associated signaling molecule is transforming growth factor β (TGFβ), an inducer of epithelial-to-mesenchymal transition (EMT). The differential action of TGFβ on CAFs and ETCs (epithelial tumor cells) has recently gained interest. Here, we aimed to investigate the effects of TGFβ on CAFs and ETCs at the proteomic level. We established a 2D co-culture system of differentially fluorescently labeled CAFs and ETCs and stimulated this co-culture system with TGFβ. The respective cell types were separated using FACS and subjected to quantitative analyses of individual proteomes using mass spectrometry. We found that TGFβ treatment had a strong impact on the proteome composition of CAFs, whereas ETCs responded only marginally to TGFβ. Quantitative proteomic analyses of the different cell types revealed up-regulation of extracellular matrix (ECM) proteins in TGFβ treated CAFs. In addition, we found that the TGFβ treated CAFs exhibited increased N-cadherin levels. From our data we conclude that CAFs respond to TGFβ treatment by changing their proteome composition, while ETCs appear to be rather resilient.
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-01-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose function indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7)1 not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. PMID:27215607
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes.
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-08-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose function indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7)(1) not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
The emerging CHO systems biology era: harnessing the 'omics revolution for biotechnology.
Kildegaard, Helene Faustrup; Baycin-Hizal, Deniz; Lewis, Nathan E; Betenbaugh, Michael J
2013-12-01
Chinese hamster ovary (CHO) cells are the primary factories for biopharmaceuticals because of their capacity to correctly fold and post-translationally modify recombinant proteins compatible with humans. New opportunities are arising to enhance these cell factories, especially since the CHO-K1 cell line was recently sequenced. Now, the CHO systems biology era is underway. Critical 'omics data sets, including proteomics, transcriptomics, metabolomics, fluxomics, and glycomics, are emerging, allowing the elucidation of the molecular basis of CHO cell physiology. The incorporation of these data sets into mathematical models that describe CHO phenotypes will provide crucial biotechnology insights. As 'omics technologies and computational systems biology mature, genome-scale approaches will lead to major innovations in cell line development and metabolic engineering, thereby improving protein production and bioprocessing. Copyright © 2013 Elsevier Ltd. All rights reserved.
Zhang, Yaoyang; Xu, Tao; Shan, Bing; Hart, Jonathan; Aslanian, Aaron; Han, Xuemei; Zong, Nobel; Li, Haomin; Choi, Howard; Wang, Dong; Acharya, Lipi; Du, Lisa; Vogt, Peter K; Ping, Peipei; Yates, John R
2015-11-03
Shotgun proteomics generates valuable information from large-scale and target protein characterizations, including protein expression, protein quantification, protein post-translational modifications (PTMs), protein localization, and protein-protein interactions. Typically, peptides derived from proteolytic digestion, rather than intact proteins, are analyzed by mass spectrometers because peptides are more readily separated, ionized and fragmented. The amino acid sequences of peptides can be interpreted by matching the observed tandem mass spectra to theoretical spectra derived from a protein sequence database. Identified peptides serve as surrogates for their proteins and are often used to establish what proteins were present in the original mixture and to quantify protein abundance. Two major issues exist for assigning peptides to their originating protein. The first issue is maintaining a desired false discovery rate (FDR) when comparing or combining multiple large datasets generated by shotgun analysis and the second issue is properly assigning peptides to proteins when homologous proteins are present in the database. Herein we demonstrate a new computational tool, ProteinInferencer, which can be used for protein inference with both small- or large-scale data sets to produce a well-controlled protein FDR. In addition, ProteinInferencer introduces confidence scoring for individual proteins, which makes protein identifications evaluable. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
An introduction to statistical process control in research proteomics.
Bramwell, David
2013-12-16
Statistical process control is a well-established and respected method which provides a general purpose, and consistent framework for monitoring and improving the quality of a process. It is routinely used in many industries where the quality of final products is critical and is often required in clinical diagnostic laboratories [1,2]. To date, the methodology has been little utilised in research proteomics. It has been shown to be capable of delivering quantitative QC procedures for qualitative clinical assays [3] making it an ideal methodology to apply to this area of biological research. To introduce statistical process control as an objective strategy for quality control and show how it could be used to benefit proteomics researchers and enhance the quality of the results they generate. We demonstrate that rules which provide basic quality control are easy to derive and implement and could have a major impact on data quality for many studies. Statistical process control is a powerful tool for investigating and improving proteomics research work-flows. The process of characterising measurement systems and defining control rules forces the exploration of key questions that can lead to significant improvements in performance. This work asserts that QC is essential to proteomics discovery experiments. Every experimenter must know the current capabilities of their measurement system and have an objective means for tracking and ensuring that performance. Proteomic analysis work-flows are complicated and multi-variate. QC is critical for clinical chemistry measurements and huge strides have been made in ensuring the quality and validity of results in clinical biochemistry labs. This work introduces some of these QC concepts and works to bridge their use from single analyte QC to applications in multi-analyte systems. This article is part of a Special Issue entitled: Standardization and Quality Control in Proteomics. Copyright © 2013 The Author. Published by Elsevier B.V. All rights reserved.
Thiele, H.; Glandorf, J.; Koerting, G.; Reidegeld, K.; Blüggel, M.; Meyer, H.; Stephan, C.
2007-01-01
In today’s proteomics research, various techniques and instrumentation bioinformatics tools are necessary to manage the large amount of heterogeneous data with an automatic quality control to produce reliable and comparable results. Therefore a data-processing pipeline is mandatory for data validation and comparison in a data-warehousing system. The proteome bioinformatics platform ProteinScape has been proven to cover these needs. The reprocessing of HUPO BPP participants’ MS data was done within ProteinScape. The reprocessed information was transferred into the global data repository PRIDE. ProteinScape as a data-warehousing system covers two main aspects: archiving relevant data of the proteomics workflow and information extraction functionality (protein identification, quantification and generation of biological knowledge). As a strategy for automatic data validation, different protein search engines are integrated. Result analysis is performed using a decoy database search strategy, which allows the measurement of the false-positive identification rate. Peptide identifications across different workflows, different MS techniques, and different search engines are merged to obtain a quality-controlled protein list. The proteomics identifications database (PRIDE), as a public data repository, is an archiving system where data are finally stored and no longer changed by further processing steps. Data submission to PRIDE is open to proteomics laboratories generating protein and peptide identifications. An export tool has been developed for transferring all relevant HUPO BPP data from ProteinScape into PRIDE using the PRIDE.xml format. The EU-funded ProDac project will coordinate the development of software tools covering international standards for the representation of proteomics data. The implementation of data submission pipelines and systematic data collection in public standards–compliant repositories will cover all aspects, from the generation of MS data in each laboratory to the conversion of all the annotating information and identifications to a standardized format. Such datasets can be used in the course of publishing in scientific journals.
Pietrowska, M; Marczak, L; Polanska, J; Nowicka, E; Behrent, K; Tarnawski, R; Stobiecki, M; Polanski, A; Widlak, P
2010-01-01
Mass spectrometry-based analysis of the serum proteome allows identifying multi-peptide patterns/signatures specific for blood of cancer patients, thus having high potential value for cancer diagnostics. However, because of problems with optimization and standardization of experimental and computational design, none of identified proteome patterns/signatures was approved for diagnostics in clinical practice as yet. Here we compared two methods of serum sample preparation for mass spectrometry-based proteome pattern analysis aimed to identify biomarkers that could be used in early detection of breast cancer patients. Blood samples were collected in a group of 92 patients diagnosed at early (I and II) stages of the disease before the start of therapy, and in a group of age-matched healthy controls (104 women). Serum specimens were purified and analyzed using MALDI-ToF spectrometry, either directly or after membrane filtration (50 kDa cut-off) to remove albumin and other large serum proteins. Mass spectra of the low-molecular-weight fraction (2-10 kDa) of the serum proteome were resolved using the Gaussian mixture decomposition, and identified spectral components were used to build classifiers that differentiated samples from breast cancer patients and healthy persons. Mass spectra of complete serum and membrane-filtered albumin-depleted samples have apparently different structure and peaks specific for both types of samples could be identified. The optimal classifier built for the complete serum specimens consisted of 8 spectral components, and had 81% specificity and 72% sensitivity, while that built for the membrane-filtered samples consisted of 4 components, and had 80% specificity and 81% sensitivity. We concluded that pre-processing of samples to remove albumin might be recommended before MALDI-ToF mass spectrometric analysis of the low-molecular-weight components of human serum Keywords: albumin removal; breast cancer; clinical proteomics; mass spectrometry; pattern analysis; serum proteome.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huang, Eric L.; Orsat, Valerie; Shah, Manesh B
2012-01-01
System biology and bioprocess technology can be better understood using shotgun proteomics as a monitoring system during the fermentation. We demonstrated a shotgun proteomic method to monitor the temporal yeast proteome in early, middle and late exponential phases. Our study identified a total of 1389 proteins combining all 2D-LC-MS/MS runs. The temporal Saccharomyces cerevisiae proteome was enriched with proteolysis, radical detoxification, translation, one-carbon metabolism, glycolysis and TCA cycle. Heat shock proteins and proteins associated with oxidative stress response were found throughout the exponential phase. The most abundant proteins observed were translation elongation factors, ribosomal proteins, chaperones and glycolytic enzymes. Themore » high abundance of the H-protein of the glycine decarboxylase complex (Gcv3p) indicated the availability of glycine in the environment. We observed differentially expressed proteins and the induced proteins at mid-exponential phase were involved in ribosome biogenesis, mitochondria DNA binding/replication and transcriptional activator. Induction of tryptophan synthase (Trp5p) indicated the abundance of tryptophan during the fermentation. As fermentation progressed toward late exponential phase, a decrease in cell proliferation was implied from the repression of ribosomal proteins, transcription coactivators, methionine aminopeptidase and translation-associated proteins.« less
Decoding 2D-PAGE complex maps: relevance to proteomics.
Pietrogrande, Maria Chiara; Marchetti, Nicola; Dondi, Francesco; Righetti, Pier Giorgio
2006-03-20
This review describes two mathematical approaches useful for decoding the complex signal of 2D-PAGE maps of protein mixtures. These methods are helpful for interpreting the large amount of data of each 2D-PAGE map by extracting all the analytical information hidden therein by spot overlapping. Here the basic theory and application to 2D-PAGE maps are reviewed: the means for extracting information from the experimental data and their relevance to proteomics are discussed. One method is based on the quantitative theory of statistical model of peak overlapping (SMO) using the spot experimental data (intensity and spatial coordinates). The second method is based on the study of the 2D-autocovariance function (2D-ACVF) computed on the experimental digitised map. They are two independent methods that are able to extract equal and complementary information from the 2D-PAGE map. Both methods permit to obtain fundamental information on the sample complexity and the separation performance and to single out ordered patterns present in spot positions: the availability of two independent procedures to compute the same separation parameters is a powerful tool to estimate the reliability of the obtained results. The SMO procedure is an unique tool to quantitatively estimate the degree of spot overlapping present in the map, while the 2D-ACVF method is particularly powerful in simply singling out the presence of order in the spot position from the complexity of the whole 2D map, i.e., spot trains. The procedures were validated by extensive numerical computation on computer-generated maps describing experimental 2D-PAGE gels of protein mixtures. Their applicability to real samples was tested on reference maps obtained from literature sources. The review describes the most relevant information for proteomics: sample complexity, separation performance, overlapping extent, identification of spot trains related to post-translational modifications (PTMs).
Wang, Guanghui; Wu, Wells W; Zeng, Weihua; Chou, Chung-Lin; Shen, Rong-Fong
2006-05-01
A critical step in protein biomarker discovery is the ability to contrast proteomes, a process referred generally as quantitative proteomics. While stable-isotope labeling (e.g., ICAT, 18O- or 15N-labeling, or AQUA) remains the core technology used in mass spectrometry-based proteomic quantification, increasing efforts have been directed to the label-free approach that relies on direct comparison of peptide peak areas between LC-MS runs. This latter approach is attractive to investigators for its simplicity as well as cost effectiveness. In the present study, the reproducibility and linearity of using a label-free approach to highly complex proteomes were evaluated. Various amounts of proteins from different proteomes were subjected to repeated LC-MS analyses using an ion trap or Fourier transform mass spectrometer. Highly reproducible data were obtained between replicated runs, as evidenced by nearly ideal Pearson's correlation coefficients (for ion's peak areas or retention time) and average peak area ratios. In general, more than 50% and nearly 90% of the peptide ion ratios deviated less than 10% and 20%, respectively, from the average in duplicate runs. In addition, the multiplicity ratios of the amounts of proteins used correlated nicely with the observed averaged ratios of peak areas calculated from detected peptides. Furthermore, the removal of abundant proteins from the samples led to an improvement in reproducibility and linearity. A computer program has been written to automate the processing of data sets from experiments with groups of multiple samples for statistical analysis. Algorithms for outlier-resistant mean estimation and for adjusting statistical significance threshold in multiplicity of testing were incorporated to minimize the rate of false positives. The program was applied to quantify changes in proteomes of parental and p53-deficient HCT-116 human cells and found to yield reproducible results. Overall, this study demonstrates an alternative approach that allows global quantification of differentially expressed proteins in complex proteomes. The utility of this method to biomarker discovery is likely to synergize with future improvements in the detecting sensitivity of mass spectrometers.
CMG-Biotools, a Free Workbench for Basic Comparative Microbial Genomics
Vesth, Tammi; Lagesen, Karin; Acar, Öncel; Ussery, David
2013-01-01
Background Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. Results The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. Conclusion This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training. PMID:23577086
Terfve, Camille; Sabidó, Eduard; Wu, Yibo; Gonçalves, Emanuel; Choi, Meena; Vaga, Stefania; Vitek, Olga; Saez-Rodriguez, Julio; Aebersold, Ruedi
2017-02-03
Advances in mass spectrometry have made the quantitative measurement of proteins across multiple samples a reality, allowing for the study of complex biological systems such as the metabolic syndrome. Although the deregulation of lipid metabolism and increased hepatic storage of triacylglycerides are known to play a part in the onset of the metabolic syndrome, its molecular basis and dependency on dietary and genotypic factors are poorly characterized. Here, we used an experimental design with two different mouse strains and dietary and metabolic perturbations to generate a compendium of quantitative proteome data using three mass spectrometric techniques. The data reproduce known properties of the metabolic system and indicate differential molecular adaptation of the two mouse strains to perturbations, contributing to a better understanding of the metabolic syndrome. We show that high-quality, high-throughput proteomic data sets provide an unbiased broad overview of the behavior of complex systems after perturbation.
Liu, Kehui; Zhang, Jiyang; Fu, Bin; Xie, Hongwei; Wang, Yingchun; Qian, Xiaohong
2014-07-01
Precise protein quantification is essential in comparative proteomics. Currently, quantification bias is inevitable when using proteotypic peptide-based quantitative proteomics strategy for the differences in peptides measurability. To improve quantification accuracy, we proposed an "empirical rule for linearly correlated peptide selection (ERLPS)" in quantitative proteomics in our previous work. However, a systematic evaluation on general application of ERLPS in quantitative proteomics under diverse experimental conditions needs to be conducted. In this study, the practice workflow of ERLPS was explicitly illustrated; different experimental variables, such as, different MS systems, sample complexities, sample preparations, elution gradients, matrix effects, loading amounts, and other factors were comprehensively investigated to evaluate the applicability, reproducibility, and transferability of ERPLS. The results demonstrated that ERLPS was highly reproducible and transferable within appropriate loading amounts and linearly correlated response peptides should be selected for each specific experiment. ERLPS was used to proteome samples from yeast to mouse and human, and in quantitative methods from label-free to O18/O16-labeled and SILAC analysis, and enabled accurate measurements for all proteotypic peptide-based quantitative proteomics over a large dynamic range. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zervantonakis, Ioannis K; Iavarone, Claudia; Chen, Hsing-Yu; Selfors, Laura M; Palakurthi, Sangeetha; Liu, Joyce F; Drapkin, Ronny; Matulonis, Ursula; Leverson, Joel D; Sampath, Deepak; Mills, Gordon B; Brugge, Joan S
2017-08-28
The lack of effective chemotherapies for high-grade serous ovarian cancers (HGS-OvCa) has motivated a search for alternative treatment strategies. Here, we present an unbiased systems-approach to interrogate a panel of 14 well-annotated HGS-OvCa patient-derived xenografts for sensitivity to PI3K and PI3K/mTOR inhibitors and uncover cell death vulnerabilities. Proteomic analysis reveals that PI3K/mTOR inhibition in HGS-OvCa patient-derived xenografts induces both pro-apoptotic and anti-apoptotic signaling responses that limit cell killing, but also primes cells for inhibitors of anti-apoptotic proteins. In-depth quantitative analysis of BCL-2 family proteins and other apoptotic regulators, together with computational modeling and selective anti-apoptotic protein inhibitors, uncovers new mechanistic details about apoptotic regulators that are predictive of drug sensitivity (BIM, caspase-3, BCL-X L ) and resistance (MCL-1, XIAP). Our systems-approach presents a strategy for systematic analysis of the mechanisms that limit effective tumor cell killing and the identification of apoptotic vulnerabilities to overcome drug resistance in ovarian and other cancers.High-grade serous ovarian cancers (HGS-OvCa) frequently develop chemotherapy resistance. Here, the authors through a systematic analysis of proteomic and drug response data of 14 HGS-OvCa PDXs demonstrate that targeting apoptosis regulators can improve response of these tumors to inhibitors of the PI3K/mTOR pathway.
2017-01-01
The changes of protein expression that are monitored in proteomic experiments are a type of biological transformation that also involves changes in chemical composition. Accompanying the myriad molecular-level interactions that underlie any proteomic transformation, there is an overall thermodynamic potential that is sensitive to microenvironmental conditions, including local oxidation and hydration potential. Here, up- and down-expressed proteins identified in 71 comparative proteomics studies were analyzed using the average oxidation state of carbon (ZC) and water demand per residue (\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{upgreek} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} }{}${\\overline{n}}_{{\\mathrm{H}}_{2}\\mathrm{O}}$\\end{document}n¯H2O), calculated using elemental abundances and stoichiometric reactions to form proteins from basis species. Experimental lowering of oxygen availability (hypoxia) or water activity (hyperosmotic stress) generally results in decreased ZC or \\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{upgreek} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} }{}${\\overline{n}}_{{\\mathrm{H}}_{2}\\mathrm{O}}$\\end{document}n¯H2O of up-expressed compared to down-expressed proteins. This correspondence of chemical composition with experimental conditions provides evidence for attraction of the proteomes to a low-energy state. An opposite compositional change, toward higher average oxidation or hydration state, is found for proteomic transformations in colorectal and pancreatic cancer, and in two experiments for adipose-derived stem cells. Calculations of chemical affinity were used to estimate the thermodynamic potentials for proteomic transformations as a function of fugacity of O2 and activity of H2O, which serve as scales of oxidation and hydration potential. Diagrams summarizing the relative potential for formation of up- and down-expressed proteins have predicted equipotential lines that cluster around particular values of oxygen fugacity and water activity for similar datasets. The changes in chemical composition of proteomes are likely linked with reactions among other cellular molecules. A redox balance calculation indicates that an increase in the lipid to protein ratio in cancer cells by 20% over hypoxic cells would generate a large enough electron sink for oxidation of the cancer proteomes. The datasets and computer code used here are made available in a new R package, canprot. PMID:28603672
Why proteomics is not the new genomics and the future of mass spectrometry in cell biology.
Sidoli, Simone; Kulej, Katarzyna; Garcia, Benjamin A
2017-01-02
Mass spectrometry (MS) is an essential part of the cell biologist's proteomics toolkit, allowing analyses at molecular and system-wide scales. However, proteomics still lag behind genomics in popularity and ease of use. We discuss key differences between MS-based -omics and other booming -omics technologies and highlight what we view as the future of MS and its role in our increasingly deep understanding of cell biology. © 2017 Sidoli et al.
2011-01-01
The 2011 International Conference on Bioinformatics (InCoB) conference, which is the annual scientific conference of the Asia-Pacific Bioinformatics Network (APBioNet), is hosted by Kuala Lumpur, Malaysia, is co-organized with the first ISCB-Asia conference of the International Society for Computational Biology (ISCB). InCoB and the sequencing of the human genome are both celebrating their tenth anniversaries and InCoB’s goalposts for the next decade, implementing standards in bioinformatics and globally distributed computational networks, will be discussed and adopted at this conference. Of the 49 manuscripts (selected from 104 submissions) accepted to BMC Genomics and BMC Bioinformatics conference supplements, 24 are featured in this issue, covering software tools, genome/proteome analysis, systems biology (networks, pathways, bioimaging) and drug discovery and design. PMID:22372736
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pincus, David; Ryan, Christopher J.; Smith, Richard D.
2013-03-12
Cell signaling systems transmit information by post-translationally modifying signaling proteins, often via phosphorylation. While thousands of sites of phosphorylation have been identified in proteomic studies, the vast majority of sites have no known function. Assigning functional roles to the catalog of uncharacterized phosphorylation sites is a key research challenge. Here we present a general approach to address this challenge and apply it to a prototypical signaling pathway, the pheromone response pathway in Saccharomyces cerevisiae. The pheromone pathway includes a mitogen activated protein kinase (MAPK) cascade activated by a G-protein coupled receptor (GPCR). We used mass spectrometry-based proteomics to identify sitesmore » whose phosphorylation changed when the system was active, and evolutionary conservation to assign priority to a list of candidate MAPK regulatory sites. We made targeted alterations in those sites, and measured the effects of the mutations on pheromone pathway output in single cells. Our work identified six new sites that quantitatively tuned system output. We developed simple computational models to find system architectures that recapitulated the quantitative phenotypes of the mutants. Our results identify a number of regulated phosphorylation events that contribute to adjust the input-output relationship of this model eukaryotic signaling system. We believe this combined approach constitutes a general means not only to reveal modification sites required to turn a pathway on and off, but also those required for more subtle quantitative effects that tune pathway output. Our results further suggest that relatively small quantitative influences from individual regulatory phosphorylation events endow signaling systems with plasticity that evolution may exploit to quantitatively tailor signaling outcomes.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Chin-Rang
Astronauts and workers in nuclear plants who repeatedly exposed to low doses of ionizing radiation (IR, <10 cGy) are likely to incur specific changes in signal transduction and gene expression in various tissues of their body. Remarkable advances in high throughput genomics and proteomics technologies enable researchers to broaden their focus from examining single gene/protein kinetics to better understanding global gene/protein expression profiling and biological pathway analyses, namely Systems Biology. An ultimate goal of systems biology is to develop dynamic mathematical models of interacting biological systems capable of simulating living systems in a computer. This Glue Grant is to complementmore » Dr. Boothman’s existing DOE grant (No. DE-FG02-06ER64186) entitled “The IGF1/IGF-1R-MAPK-Secretory Clusterin (sCLU) Pathway: Mediator of a Low Dose IR-Inducible Bystander Effect” to develop sensitive and quantitative proteomic technology that suitable for low dose radiobiology researches. An improved version of quantitative protein array platform utilizing linear Quantum dot signaling for systematically measuring protein levels and phosphorylation states for systems biology modeling is presented. The signals are amplified by a confocal laser Quantum dot scanner resulting in ~1000-fold more sensitivity than traditional Western blots and show the good linearity that is impossible for the signals of HRP-amplification. Therefore this improved protein array technology is suitable to detect weak responses of low dose radiation. Software is developed to facilitate the quantitative readout of signaling network activities. Kinetics of EGFRvIII mutant signaling was analyzed to quantify cross-talks between EGFR and other signaling pathways.« less
Comparative proteome analysis reveals pathogen specific outer membrane proteins of Leptospira.
Dhandapani, Gunasekaran; Sikha, Thoduvayil; Rana, Aarti; Brahma, Rahul; Akhter, Yusuf; Gopalakrishnan Madanan, Madathiparambil
2018-04-10
Proteomes of pathogenic Leptospira interrogans and L. borgpetersenii and the saprophytic L. biflexa were filtered through computational tools to identify Outer Membrane Proteins (OMPs) that satisfy the required biophysical parameters for their presence on the outer membrane. A total of 133, 130, and 144 OMPs were identified in L. interrogans, L. borgpetersenii, and L. biflexa, respectively, which forms approximately 4% of proteomes. A holistic analysis of transporting and pathogenic characteristics of OMPs together with Clusters of Orthologous Groups (COGs) among the OMPs and their distribution across 3 species was made and put forward a set of 21 candidate OMPs specific to pathogenic leptospires. It is also found that proteins homologous to the candidate OMPs were also present in other pathogenic species of leptospires. Six OMPs from L. interrogans and 2 from L. borgpetersenii observed to have similar COGs while those were not found in any intermediate or saprophytic forms. These OMPs appears to have role in infection and pathogenesis and useful for anti-leptospiral strategies. © 2018 Wiley Periodicals, Inc.
Building high-quality assay libraries for targeted analysis of SWATH MS data.
Schubert, Olga T; Gillet, Ludovic C; Collins, Ben C; Navarro, Pedro; Rosenberger, George; Wolski, Witold E; Lam, Henry; Amodei, Dario; Mallick, Parag; MacLean, Brendan; Aebersold, Ruedi
2015-03-01
Targeted proteomics by selected/multiple reaction monitoring (S/MRM) or, on a larger scale, by SWATH (sequential window acquisition of all theoretical spectra) MS (mass spectrometry) typically relies on spectral reference libraries for peptide identification. Quality and coverage of these libraries are therefore of crucial importance for the performance of the methods. Here we present a detailed protocol that has been successfully used to build high-quality, extensive reference libraries supporting targeted proteomics by SWATH MS. We describe each step of the process, including data acquisition by discovery proteomics, assertion of peptide-spectrum matches (PSMs), generation of consensus spectra and compilation of MS coordinates that uniquely define each targeted peptide. Crucial steps such as false discovery rate (FDR) control, retention time normalization and handling of post-translationally modified peptides are detailed. Finally, we show how to use the library to extract SWATH data with the open-source software Skyline. The protocol takes 2-3 d to complete, depending on the extent of the library and the computational resources available.
Fischer, Martina; Jehmlich, Nico; Rose, Laura; Koch, Sophia; Laue, Michael; Renard, Bernhard Y.; Schmidt, Frank; Heuer, Dagmar
2015-01-01
Chlamydia trachomatis is an important human pathogen that replicates inside the infected host cell in a unique vacuole, the inclusion. The formation of this intracellular bacterial niche is essential for productive Chlamydia infections. Despite its importance for Chlamydia biology, a holistic view on the protein composition of the inclusion, including its membrane, is currently missing. Here we describe the host cell-derived proteome of isolated C. trachomatis inclusions by quantitative proteomics. Computational analysis indicated that the inclusion is a complex intracellular trafficking platform that interacts with host cells’ antero- and retrograde trafficking pathways. Furthermore, the inclusion is highly enriched for sorting nexins of the SNX-BAR retromer, a complex essential for retrograde trafficking. Functional studies showed that in particular, SNX5 controls the C. trachomatis infection and that retrograde trafficking is essential for infectious progeny formation. In summary, these findings suggest that C. trachomatis hijacks retrograde pathways for effective infection. PMID:26042774
A novel strategy for global analysis of the dynamic thiol redox proteome.
Martínez-Acedo, Pablo; Núñez, Estefanía; Gómez, Francisco J Sánchez; Moreno, Margoth; Ramos, Elena; Izquierdo-Álvarez, Alicia; Miró-Casas, Elisabet; Mesa, Raquel; Rodriguez, Patricia; Martínez-Ruiz, Antonio; Dorado, David Garcia; Lamas, Santiago; Vázquez, Jesús
2012-09-01
Nitroxidative stress in cells occurs mainly through the action of reactive nitrogen and oxygen species (RNOS) on protein thiol groups. Reactive nitrogen and oxygen species-mediated protein modifications are associated with pathophysiological states, but can also convey physiological signals. Identification of Cys residues that are modified by oxidative stimuli still poses technical challenges and these changes have never been statistically analyzed from a proteome-wide perspective. Here we show that GELSILOX, a method that combines a robust proteomics protocol with a new computational approach that analyzes variance at the peptide level, allows a simultaneous analysis of dynamic alterations in the redox state of Cys sites and of protein abundance. GELSILOX permits the characterization of the major endothelial redox targets of hydrogen peroxide in endothelial cells and reveals that hypoxia induces a significant increase in the status of oxidized thiols. GELSILOX also detected thiols that are redox-modified by ischemia-reperfusion in heart mitochondria and demonstrated that these alterations are abolished in ischemia-preconditioned animals.
Ichibangase, Tomoko; Sugawara, Yasuhiro; Yamabe, Akio; Koshiyama, Akiyo; Yoshimura, Akari; Enomoto, Takemi; Imai, Kazuhiro
2012-01-01
Systems biology aims to understand biological phenomena in terms of complex biological and molecular interactions, and thus proteomics plays an important role in elucidating protein networks. However, many proteomic methods have suffered from their high variability, resulting in only showing altered protein names. Here, we propose a strategy for elucidating cellular protein networks based on an FD-LC-MS/MS proteomic method. The strategy permits reproducible relative quantitation of differences in protein levels between different cell populations and allows for integration of the data with those obtained through other methods. We demonstrate the validity of the approach through a comparison of differential protein expression in normal and conditional superoxide dismutase 1 gene knockout cells and believe that beginning with an FD-LC-MS/MS proteomic approach will enable researchers to elucidate protein networks more easily and comprehensively. PMID:23029042
Linkage Of Exposure And Effects Using Genomics, Proteomics, And Metabolomics In Small Fish Models
Poster for the BOSC Computational Toxicology Research Program review. Knowledge of possible toxic mechanisms/modes of action (MOA) of chemicals can provide valuable insights as to appropriate methods for assessing exposure and effects, thereby reducing uncertainties related to e...
EMERGING MOLECULAR COMPUTATIONAL APPROACHES FOR CROSS-SPECIES EXTRAPOLATIONS: A WORKSHOP SUMMARY
Advances in molecular technology have led to the elucidation of full genomic sequences of several multicellular organisms, ranging from nematodes to man. The related molecular field of proteomics and metabolomics are now beginning to advance rapidly as well. In addition, advances...
Biomedical data integration in computational drug design and bioinformatics.
Seoane, Jose A; Aguiar-Pulido, Vanessa; Munteanu, Cristian R; Rivero, Daniel; Rabunal, Juan R; Dorado, Julian; Pazos, Alejandro
2013-03-01
In recent years, in the post genomic era, more and more data is being generated by biological high throughput technologies, such as proteomics and transcriptomics. This omics data can be very useful, but the real challenge is to analyze all this data, as a whole, after integrating it. Biomedical data integration enables making queries to different, heterogeneous and distributed biomedical data sources. Data integration solutions can be very useful not only in the context of drug design, but also in biomedical information retrieval, clinical diagnosis, system biology, etc. In this review, we analyze the most common approaches to biomedical data integration, such as federated databases, data warehousing, multi-agent systems and semantic technology, as well as the solutions developed using these approaches in the past few years.
The Perseus computational platform for comprehensive analysis of (prote)omics data.
Tyanova, Stefka; Temu, Tikira; Sinitcyn, Pavel; Carlson, Arthur; Hein, Marco Y; Geiger, Tamar; Mann, Matthias; Cox, Jürgen
2016-09-01
A main bottleneck in proteomics is the downstream biological analysis of highly multivariate quantitative protein abundance data generated using mass-spectrometry-based analysis. We developed the Perseus software platform (http://www.perseus-framework.org) to support biological and biomedical researchers in interpreting protein quantification, interaction and post-translational modification data. Perseus contains a comprehensive portfolio of statistical tools for high-dimensional omics data analysis covering normalization, pattern recognition, time-series analysis, cross-omics comparisons and multiple-hypothesis testing. A machine learning module supports the classification and validation of patient groups for diagnosis and prognosis, and it also detects predictive protein signatures. Central to Perseus is a user-friendly, interactive workflow environment that provides complete documentation of computational methods used in a publication. All activities in Perseus are realized as plugins, and users can extend the software by programming their own, which can be shared through a plugin store. We anticipate that Perseus's arsenal of algorithms and its intuitive usability will empower interdisciplinary analysis of complex large data sets.
On the importance of mathematical methods for analysis of MALDI-imaging mass spectrometry data.
Trede, Dennis; Kobarg, Jan Hendrik; Oetjen, Janina; Thiele, Herbert; Maass, Peter; Alexandrov, Theodore
2012-03-21
In the last decade, matrix-assisted laser desorption/ionization (MALDI) imaging mass spectrometry (IMS), also called as MALDI-imaging, has proven its potential in proteomics and was successfully applied to various types of biomedical problems, in particular to histopathological label-free analysis of tissue sections. In histopathology, MALDI-imaging is used as a general analytic tool revealing the functional proteomic structure of tissue sections, and as a discovery tool for detecting new biomarkers discriminating a region annotated by an experienced histologist, in particular, for cancer studies. A typical MALDI-imaging data set contains 10⁸ to 10⁹ intensity values occupying more than 1 GB. Analysis and interpretation of such huge amount of data is a mathematically, statistically and computationally challenging problem. In this paper we overview some computational methods for analysis of MALDI-imaging data sets. We discuss the importance of data preprocessing, which typically includes normalization, baseline removal and peak picking, and hightlight the importance of image denoising when visualizing IMS data.
On the Importance of Mathematical Methods for Analysis of MALDI-Imaging Mass Spectrometry Data.
Trede, Dennis; Kobarg, Jan Hendrik; Oetjen, Janina; Thiele, Herbert; Maass, Peter; Alexandrov, Theodore
2012-03-01
In the last decade, matrix-assisted laser desorption/ionization (MALDI) imaging mass spectrometry (IMS), also called as MALDI-imaging, has proven its potential in proteomics and was successfully applied to various types of biomedical problems, in particular to histopathological label-free analysis of tissue sections. In histopathology, MALDI-imaging is used as a general analytic tool revealing the functional proteomic structure of tissue sections, and as a discovery tool for detecting new biomarkers discriminating a region annotated by an experienced histologist, in particular, for cancer studies. A typical MALDI-imaging data set contains 108 to 109 intensity values occupying more than 1 GB. Analysis and interpretation of such huge amount of data is a mathematically, statistically and computationally challenging problem. In this paper we overview some computational methods for analysis of MALDI-imaging data sets. We discuss the importance of data preprocessing, which typically includes normalization, baseline removal and peak picking, and hightlight the importance of image denoising when visualizing IMS data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ryu, Jae San; Shary, Semarjit; Houtman, Carl J.
2011-11-01
Abstract Brown rot basidiomycetes have an important ecological role in lignocellulose recycling and are notable for their rapid degradation of wood polymers via oxidative and hydrolytic mechanisms. However, most of these fungi apparently lack processive (exo-acting) cellulases, such as cellobiohydrolases, which are generally required for efficient cellulolysis. The recent sequencing of the Postia placenta genome now permits a proteomic approach to this longstanding conundrum. We grew P. placenta on solid aspen wood, extracted proteins from the biodegrading substrate, and analyzed tryptic digests by shotgun liquid chromatography-tandem mass spectrometry. Comparison of the data with the predicted P. placenta proteome revealed themore » presence of 34 likely glycoside hydrolases, but only four of these-two in glycoside hydrolase family 5, one in family 10, and one in family 12-have sequences that suggested possible activity on cellulose. We expressed these enzymes heterologously and determined that they all exhibited endoglucanase activity on phosphoric acid-swollen cellulose. They also slowly hydrolyzed filter paper, a more crystalline substrate, but the soluble/insoluble reducing sugar ratios they produced classify them as nonprocessive. Computer simulations indicated that these enzymes produced soluble/insoluble ratios on reduced phosphoric acid-swollen cellulose that were higher than expected for random hydrolysis, which suggests that they could possess limited exo activity, but they are at best 10-fold less processive than cellobiohydrolases. It appears likely that P. placenta employs a combination of oxidative mechanisms and endo-acting cellulases to degrade cellulose efficiently in the absence of a significant processive component.« less
Di Girolamo, Francesco; Righetti, Pier Giorgio; Soste, Martin; Feng, Yuehan; Picotti, Paola
2013-08-26
Systems biology studies require the capability to quantify with high precision proteins spanning a broad range of abundances across multiple samples. However, the broad range of protein expression in cells often precludes the detection of low-abundance proteins. Different sample processing techniques can be applied to increase proteome coverage. Among these, combinatorial (hexa)peptide ligand libraries (CPLLs) bound to solid matrices have been used to specifically capture and detect low-abundance proteins in complex samples. To assess whether CPLL capture can be applied in systems biology studies involving the precise quantitation of proteins across a multitude of samples, we evaluated its performance across the whole range of protein abundances in Saccharomyces cerevisiae. We used selected reaction monitoring assays for a set of target proteins covering a broad abundance range to quantitatively evaluate the precision of the approach and its capability to detect low-abundance proteins. Replicated CPLL-isolates showed an average variability of ~10% in the amount of the isolated proteins. The high reproducibility of the technique was not dependent on the abundance of the protein or the amount of beads used for the capture. However, the protein-to-bead ratio affected the enrichment of specific proteins. We did not observe a normalization effect of CPLL beads on protein abundances. However, CPLLs enriched for and depleted specific sets of proteins and thus changed the abundances of proteins from a whole proteome extract. This allowed the identification of ~400 proteins otherwise undetected in an untreated sample, under the experimental conditions used. CPLL capture is thus a useful tool to increase protein identifications in proteomic experiments, but it should be coupled to the analysis of untreated samples, to maximize proteome coverage. Our data also confirms that CPLL capture is reproducible and can be confidently used in quantitative proteomic experiments. Combinatorial hexapeptide ligand libraries (CPLLs) bound to solid matrices have been proposed to specifically capture and detect low-abundance proteins in complex samples. To assess whether the CPLL capture can be confidently applied in systems biology studies involving the precise quantitation of proteins across a broad range of abundances and a multitude of samples, we evaluated its reproducibility and performance features. Using selected reaction monitoring assays for proteins covering the whole range of abundances we show that the technique is reproducible and compatible with quantitative proteomic studies. However, the protein-to-bead ratio affects the enrichment of specific proteins and CPLLs depleted specific sets of proteins from a whole proteome extract. Our results suggest that CPLL-based analyses should be coupled to the analysis of untreated samples, to maximize proteome coverage. Overall, our data confirms the applicability of CPLLs in systems biology research and guides the correct use of this technique. Copyright © 2013 Elsevier B.V. All rights reserved.
Takemori, Nobuaki; Takemori, Ayako; Tanaka, Yuki; Endo, Yaeta; Hurst, Jane L.; Gómez-Baena, Guadalupe; Harman, Victoria M.; Beynon, Robert J.
2017-01-01
A major challenge in proteomics is the absolute accurate quantification of large numbers of proteins. QconCATs, artificial proteins that are concatenations of multiple standard peptides, are well established as an efficient means to generate standards for proteome quantification. Previously, QconCATs have been expressed in bacteria, but we now describe QconCAT expression in a robust, cell-free system. The new expression approach rescues QconCATs that previously were unable to be expressed in bacteria and can reduce the incidence of proteolytic damage to QconCATs. Moreover, it is possible to cosynthesize QconCATs in a highly-multiplexed translation reaction, coexpressing tens or hundreds of QconCATs simultaneously. By obviating bacterial culture and through the gain of high level multiplexing, it is now possible to generate tens of thousands of standard peptides in a matter of weeks, rendering absolute quantification of a complex proteome highly achievable in a reproducible, broadly deployable system. PMID:29055021
Progress toward a reduced phage genetic code.
Yao, Anzhi; Reed, Sean A; Koh, Minseob; Yu, Chenguang; Luo, Xiaozhou; Mehta, Angad P; Schultz, Peter G
2018-03-26
All known living organisms use at least 20 amino acids as the basic building blocks of life. Efforts to reduce the number of building blocks in a replicating system to below the 20 canonical amino acids have not been successful to date. In this work, we use filamentous phage as a model system to investigate the feasibility of removing methionine (Met) from the proteome. We show that all 24 elongation Met sites in the M13 phage genome can be replaced by other canonical amino acids. Most of these changes involve substitution of methionine by leucine (Leu), but in some cases additional compensatory mutations are required. Combining Met substituted sites in the proteome generally led to lower viability/infectivity of the mutant phages, which remains the major challenge in eliminating all methionines from the phage proteome. To date a total of 15 (out of all 24) elongation Mets have been simultaneously deleted from the M13 proteome, providing a useful foundation for future efforts to minimize the genetic code. Copyright © 2018. Published by Elsevier Ltd.
Musi, Valeria; Birdsall, Berry; Fernandez-Ballester, Gregorio; Guerrini, Remo; Salvatori, Severo; Serrano, Luis; Pastore, Annalisa
2006-04-01
SH3 domains are small protein modules that are involved in protein-protein interactions in several essential metabolic pathways. The availability of the complete genome and the limited number of clearly identifiable SH3 domains make the yeast Saccharomyces cerevisae an ideal proteomic-based model system to investigate the structural rules dictating the SH3-mediated protein interactions and to develop new tools to assist these studies. In the present work, we have determined the solution structure of the SH3 domain from Myo3 and modeled by homology that of the highly homologous Myo5, two myosins implicated in actin polymerization. We have then implemented an integrated approach that makes use of experimental and computational methods to characterize their binding properties. While accommodating their targets in the classical groove, the two domains have selectivity in both orientation and sequence specificity of the target peptides. From our study, we propose a consensus sequence that may provide a useful guideline to identify new natural partners and suggest a strategy of more general applicability that may be of use in other structural proteomic studies.
Proteomics of industrial fungi: trends and insights for biotechnology.
de Oliveira, José Miguel P Ferreira; de Graaff, Leo H
2011-01-01
Filamentous fungi are widely known for their industrial applications, namely, the production of food-processing enzymes and metabolites such as antibiotics and organic acids. In the past decade, the full genome sequencing of filamentous fungi increased the potential to predict encoded proteins enormously, namely, hydrolytic enzymes or proteins involved in the biosynthesis of metabolites of interest. The integration of genome sequence information with possible phenotypes requires, however, the knowledge of all the proteins in the cell in a system-wise manner, given by proteomics. This review summarises the progress of proteomics and its importance for the study of biotechnological processes in filamentous fungi. A major step forward in proteomics was to couple protein separation with high-resolution mass spectrometry, allowing accurate protein quantification. Despite the fact that most fungal proteomic studies have been focused on proteins from mycelial extracts, many proteins are related to processes which are compartmentalised in the fungal cell, e.g. β-lactam antibiotic production in the microbody. For the study of such processes, a targeted approach is required, e.g. by organelle proteomics. Typical workflows for sample preparation in fungal organelle proteomics are discussed, including homogenisation and sub-cellular fractionation. Finally, examples are presented of fungal organelle proteomic studies, which have enlarged the knowledge on areas of interest to biotechnology, such as protein secretion, energy production or antibiotic biosynthesis.
Proteomics of industrial fungi: trends and insights for biotechnology
de Oliveira, José Miguel P. Ferreira
2010-01-01
Filamentous fungi are widely known for their industrial applications, namely, the production of food-processing enzymes and metabolites such as antibiotics and organic acids. In the past decade, the full genome sequencing of filamentous fungi increased the potential to predict encoded proteins enormously, namely, hydrolytic enzymes or proteins involved in the biosynthesis of metabolites of interest. The integration of genome sequence information with possible phenotypes requires, however, the knowledge of all the proteins in the cell in a system-wise manner, given by proteomics. This review summarises the progress of proteomics and its importance for the study of biotechnological processes in filamentous fungi. A major step forward in proteomics was to couple protein separation with high-resolution mass spectrometry, allowing accurate protein quantification. Despite the fact that most fungal proteomic studies have been focused on proteins from mycelial extracts, many proteins are related to processes which are compartmentalised in the fungal cell, e.g. β-lactam antibiotic production in the microbody. For the study of such processes, a targeted approach is required, e.g. by organelle proteomics. Typical workflows for sample preparation in fungal organelle proteomics are discussed, including homogenisation and sub-cellular fractionation. Finally, examples are presented of fungal organelle proteomic studies, which have enlarged the knowledge on areas of interest to biotechnology, such as protein secretion, energy production or antibiotic biosynthesis. PMID:20922379
Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.
Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin
2014-06-13
Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858bp, including 26,413 clusters and 27,794 singletons. A total of 42,515(78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10(-5), of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741(43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Lenčo, Juraj; Vajrychová, Marie; Pimková, Kristýna; Prokšová, Magdaléna; Benková, Markéta; Klimentová, Jana; Tambor, Vojtěch; Soukup, Ondřej
2018-04-17
Due to its sensitivity and productivity, bottom-up proteomics based on liquid chromatography-mass spectrometry (LC-MS) has become the core approach in the field. The de facto standard LC-MS platform for proteomics operates at sub-μL/min flow rates, and nanospray is required for efficiently introducing peptides into a mass spectrometer. Although this is almost a "dogma", this view is being reconsidered in light of developments in highly efficient chromatographic columns, and especially with the introduction of exceptionally sensitive MS instruments. Although conventional-flow LC-MS platforms have recently penetrated targeted proteomics successfully, their possibilities in discovery-oriented proteomics have not yet been thoroughly explored. Our objective was to determine what are the extra costs and what optimization and adjustments to a conventional-flow LC-MS system must be undertaken to identify a comparable number of proteins as can be identified on a nanoLC-MS system. We demonstrate that the amount of a complex tryptic digest needed for comparable proteome coverage can be roughly 5-fold greater, providing the column dimensions are properly chosen, extra-column peak dispersion is minimized, column temperature and flow rate are set to levels appropriate for peptide separation, and the composition of mobile phases is fine-tuned. Indeed, we identified 2 835 proteins from 2 μg of HeLa cells tryptic digest separated during a 60 min gradient at 68 μL/min on a 1.0 mm × 250 mm column held at 55 °C and using an aqua-acetonitrile mobile phases containing 0.1% formic acid, 0.4% acetic acid, and 3% dimethyl sulfoxide. Our results document that conventional-flow LC-MS is an attractive alternative for bottom-up exploratory proteomics.
van Herwijnen, Martijn J C; Zonneveld, Marijke I; Goerdayal, Soenita; Nolte-'t Hoen, Esther N M; Garssen, Johan; Stahl, Bernd; Maarten Altelaar, A F; Redegeld, Frank A; Wauben, Marca H M
2016-11-01
Breast milk contains several macromolecular components with distinctive functions, whereby milk fat globules and casein micelles mainly provide nutrition to the newborn, and whey contains molecules that can stimulate the newborn's developing immune system and gastrointestinal tract. Although extracellular vesicles (EV) have been identified in breast milk, their physiological function and composition has not been addressed in detail. EV are submicron sized vehicles released by cells for intercellular communication via selectively incorporated lipids, nucleic acids, and proteins. Because of the difficulty in separating EV from other milk components, an in-depth analysis of the proteome of human milk-derived EV is lacking. In this study, an extensive LC-MS/MS proteomic analysis was performed of EV that had been purified from breast milk of seven individual donors using a recently established, optimized density-gradient-based EV isolation protocol. A total of 1963 proteins were identified in milk-derived EV, including EV-associated proteins like CD9, Annexin A5, and Flotillin-1, with a remarkable overlap between the different donors. Interestingly, 198 of the identified proteins are not present in the human EV database Vesiclepedia, indicating that milk-derived EV harbor proteins not yet identified in EV of different origin. Similarly, the proteome of milk-derived EV was compared with that of other milk components. For this, data from 38 published milk proteomic studies were combined in order to construct the total milk proteome, which consists of 2698 unique proteins. Remarkably, 633 proteins identified in milk-derived EV have not yet been identified in human milk to date. Interestingly, these novel proteins include proteins involved in regulation of cell growth and controlling inflammatory signaling pathways, suggesting that milk-derived EVs could support the newborn's developing gastrointestinal tract and immune system. Overall, this study provides an expansion of the whole milk proteome and illustrates that milk-derived EV are macromolecular components with a unique functional proteome. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
To label or not to label: applications of quantitative proteomics in neuroscience research.
Filiou, Michaela D; Martins-de-Souza, Daniel; Guest, Paul C; Bahn, Sabine; Turck, Christoph W
2012-02-01
Proteomics has provided researchers with a sophisticated toolbox of labeling-based and label-free quantitative methods. These are now being applied in neuroscience research where they have already contributed to the elucidation of fundamental mechanisms and the discovery of candidate biomarkers. In this review, we evaluate and compare labeling-based and label-free quantitative proteomic techniques for applications in neuroscience research. We discuss the considerations required for the analysis of brain and central nervous system specimens, the experimental design of quantitative proteomic workflows as well as the feasibility, advantages, and disadvantages of the available techniques for neuroscience-oriented questions. Furthermore, we assess the use of labeled standards as internal controls for comparative studies in humans and review applications of labeling-based and label-free mass spectrometry approaches in relevant model organisms and human subjects. Providing a comprehensive guide of feasible and meaningful quantitative proteomic methodologies for neuroscience research is crucial not only for overcoming current limitations but also for gaining useful insights into brain function and translating proteomics from bench to bedside. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sanchez-Lucas, Rosa; Mehta, Angela; Valledor, Luis; Cabello-Hurtado, Francisco; Romero-Rodrıguez, M Cristina; Simova-Stoilova, Lyudmila; Demir, Sekvan; Rodriguez-de-Francisco, Luis E; Maldonado-Alconada, Ana M; Jorrin-Prieto, Ana L; Jorrín-Novo, Jesus V
2016-03-01
The present review is an update of the previous one published in Proteomics 2015 Reviews special issue [Jorrin-Novo, J. V. et al., Proteomics 2015, 15, 1089-1112] covering the July 2014-2015 period. It has been written on the bases of the publications that appeared in Proteomics journal during that period and the most relevant ones that have been published in other high-impact journals. Methodological advances and the contribution of the field to the knowledge of plant biology processes and its translation to agroforestry and environmental sectors will be discussed. This review has been organized in four blocks, with a starting general introduction (literature survey) followed by sections focusing on the methodology (in vitro, in vivo, wet, and dry), proteomics integration with other approaches (systems biology and proteogenomics), biological information, and knowledge (cell communication, receptors, and signaling), ending with a brief mention of some other biological and translational topics to which proteomics has made some contribution. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Maryáš, Josef; Faktor, Jakub; Dvořáková, Monika; Struhárová, Iva; Grell, Peter; Bouchal, Pavel
2014-03-01
Metastases are responsible for most of the cases of death in patients with solid tumors. There is thus an urgent clinical need of better understanding the exact molecular mechanisms and finding novel therapeutics targets and biomarkers of metastatic disease of various tumors. Metastases are formed in a complicated biological process called metastatic cascade. Up to now, proteomics has enabled the identification of number of metastasis-associated proteins and potential biomarkers in cancer tissues, microdissected cells, model systems, and secretomes. Expression profiles and biological role of key proteins were confirmed in verification and functional experiments. This communication reviews these observations and analyses the methodological aspects of the proteomics approaches used. Moreover, it reviews contribution of current proteomics in the field of functional characterization and interactome analysis of proteins involved in various events in metastatic cascade. It is evident that ongoing technical progress will further increase proteome coverage and sample capacity of proteomics technologies, giving complex answers to clinical and functional questions asked. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
2015-01-01
Background Meningitis is the inflammation of the meninges in response to infection or chemical agents. While aseptic meningitis, most frequently caused by enteroviruses, is usually benign with a self-limiting course, bacterial meningitis remains associated with high morbidity and mortality rates, despite advances in antimicrobial therapy and intensive care. Fast and accurate differential diagnosis is crucial for assertive choice of the appropriate therapeutic approach for each form of meningitis. Methods We used 2D-PAGE and mass spectrometry to identify the cerebrospinal fluid proteome specifically related to the host response to pneumococcal, meningococcal, and enteroviral meningitis. The disease-specific proteome signatures were inspected by pathway analysis. Results Unique cerebrospinal fluid proteome signatures were found to the three aetiological forms of meningitis investigated, and a qualitative predictive model with four protein markers was developed for the differential diagnosis of these diseases. Nevertheless, pathway analysis of the disease-specific proteomes unveiled that Kallikrein-kinin system may play a crucial role in the pathophysiological mechanisms leading to brain damage in bacterial meningitis. Proteins taking part in this cellular process are proposed as putative targets to novel adjunctive therapies. Conclusions Comparative proteomics of cerebrospinal fluid disclosed candidate biomarkers, which were combined in a qualitative and sequential predictive model with potential to improve the differential diagnosis of pneumococcal, meningococcal and enteroviral meningitis. Moreover, we present the first evidence of the possible implication of Kallikrein-kinin system in the pathophysiology of bacterial meningitis. PMID:26040285
Cordeiro, Ana Paula; Silva Pereira, Rosiane Aparecida; Chapeaurouge, Alex; Coimbra, Clarice Semião; Perales, Jonas; Oliveira, Guilherme; Sanchez Candiani, Talitah Michel; Coimbra, Roney Santos
2015-01-01
Meningitis is the inflammation of the meninges in response to infection or chemical agents. While aseptic meningitis, most frequently caused by enteroviruses, is usually benign with a self-limiting course, bacterial meningitis remains associated with high morbidity and mortality rates, despite advances in antimicrobial therapy and intensive care. Fast and accurate differential diagnosis is crucial for assertive choice of the appropriate therapeutic approach for each form of meningitis. We used 2D-PAGE and mass spectrometry to identify the cerebrospinal fluid proteome specifically related to the host response to pneumococcal, meningococcal, and enteroviral meningitis. The disease-specific proteome signatures were inspected by pathway analysis. Unique cerebrospinal fluid proteome signatures were found to the three aetiological forms of meningitis investigated, and a qualitative predictive model with four protein markers was developed for the differential diagnosis of these diseases. Nevertheless, pathway analysis of the disease-specific proteomes unveiled that Kallikrein-kinin system may play a crucial role in the pathophysiological mechanisms leading to brain damage in bacterial meningitis. Proteins taking part in this cellular process are proposed as putative targets to novel adjunctive therapies. Comparative proteomics of cerebrospinal fluid disclosed candidate biomarkers, which were combined in a qualitative and sequential predictive model with potential to improve the differential diagnosis of pneumococcal, meningococcal and enteroviral meningitis. Moreover, we present the first evidence of the possible implication of Kallikrein-kinin system in the pathophysiology of bacterial meningitis.
ProteoWizard: open source software for rapid proteomics tools development.
Kessner, Darren; Chambers, Matt; Burke, Robert; Agus, David; Mallick, Parag
2008-11-01
The ProteoWizard software project provides a modular and extensible set of open-source, cross-platform tools and libraries. The tools perform proteomics data analyses; the libraries enable rapid tool creation by providing a robust, pluggable development framework that simplifies and unifies data file access, and performs standard proteomics and LCMS dataset computations. The library contains readers and writers of the mzML data format, which has been written using modern C++ techniques and design principles and supports a variety of platforms with native compilers. The software has been specifically released under the Apache v2 license to ensure it can be used in both academic and commercial projects. In addition to the library, we also introduce a rapidly growing set of companion tools whose implementation helps to illustrate the simplicity of developing applications on top of the ProteoWizard library. Cross-platform software that compiles using native compilers (i.e. GCC on Linux, MSVC on Windows and XCode on OSX) is available for download free of charge, at http://proteowizard.sourceforge.net. This website also provides code examples, and documentation. It is our hope the ProteoWizard project will become a standard platform for proteomics development; consequently, code use, contribution and further development are strongly encouraged.
Proteomic Analysis of Male-Fertility Restoration in CMS Onion
USDA-ARS?s Scientific Manuscript database
The production of hybrid-onion seed is dependent on cytoplasmic-genic male sterility (CMS) systems. For the most commonly used CMS, male-sterile (S) cytoplasm interacts with a dominant allele at one nuclear male-fertility restoration locus (Ms) to condition male fertility. We are using proteomics ...
Qeli, Ermir; Omasits, Ulrich; Goetze, Sandra; Stekhoven, Daniel J; Frey, Juerg E; Basler, Konrad; Wollscheid, Bernd; Brunner, Erich; Ahrens, Christian H
2014-08-28
The in silico prediction of the best-observable "proteotypic" peptides in mass spectrometry-based workflows is a challenging problem. Being able to accurately predict such peptides would enable the informed selection of proteotypic peptides for targeted quantification of previously observed and non-observed proteins for any organism, with a significant impact for clinical proteomics and systems biology studies. Current prediction algorithms rely on physicochemical parameters in combination with positive and negative training sets to identify those peptide properties that most profoundly affect their general detectability. Here we present PeptideRank, an approach that uses learning to rank algorithm for peptide detectability prediction from shotgun proteomics data, and that eliminates the need to select a negative dataset for the training step. A large number of different peptide properties are used to train ranking models in order to predict a ranking of the best-observable peptides within a protein. Empirical evaluation with rank accuracy metrics showed that PeptideRank complements existing prediction algorithms. Our results indicate that the best performance is achieved when it is trained on organism-specific shotgun proteomics data, and that PeptideRank is most accurate for short to medium-sized and abundant proteins, without any loss in prediction accuracy for the important class of membrane proteins. Targeted proteomics approaches have been gaining a lot of momentum and hold immense potential for systems biology studies and clinical proteomics. However, since only very few complete proteomes have been reported to date, for a considerable fraction of a proteome there is no experimental proteomics evidence that would allow to guide the selection of the best-suited proteotypic peptides (PTPs), i.e. peptides that are specific to a given proteoform and that are repeatedly observed in a mass spectrometer. We describe a novel, rank-based approach for the prediction of the best-suited PTPs for targeted proteomics applications. By building on methods developed in the field of information retrieval (e.g. web search engines like Google's PageRank), we circumvent the delicate step of selecting positive and negative training sets and at the same time also more closely reflect the experimentalist´s need for selecting e.g. the 5 most promising peptides for targeting a protein of interest. This approach allows to predict PTPs for not yet observed proteins or for organisms without prior experimental proteomics data such as many non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Systems medicine: a new approach to clinical practice.
Cardinal-Fernández, Pablo; Nin, Nicolás; Ruíz-Cabello, Jesús; Lorente, José A
2014-10-01
Most respiratory diseases are considered complex diseases as their susceptibility and outcomes are determined by the interaction between host-dependent factors (genetic factors, comorbidities, etc.) and environmental factors (exposure to microorganisms or allergens, treatments received, etc.) The reductionist approach in the study of diseases has been of fundamental importance for the understanding of the different components of a system. Systems biology or systems medicine is a complementary approach aimed at analyzing the interactions between the different components within one organizational level (genome, transcriptome, proteome), and then between the different levels. Systems medicine is currently used for the interpretation and understanding of the pathogenesis and pathophysiology of different diseases, biomarker discovery, design of innovative therapeutic targets, and the drawing up of computational models for different biological processes. In this review we discuss the most relevant concepts of the theory underlying systems medicine, as well as its applications in the various biological processes in humans. Copyright © 2013 SEPAR. Published by Elsevier Espana. All rights reserved.
Systems Approaches to Biology and Disease Enable Translational Systems Medicine
Hood, Leroy; Tian, Qiang
2012-01-01
The development and application of systems strategies to biology and disease are transforming medical research and clinical practice in an unprecedented rate. In the foreseeable future, clinicians, medical researchers, and ultimately the consumers and patients will be increasingly equipped with a deluge of personal health information, e.g., whole genome sequences, molecular profiling of diseased tissues, and periodic multi-analyte blood testing of biomarker panels for disease and wellness. The convergence of these practices will enable accurate prediction of disease susceptibility and early diagnosis for actionable preventive schema and personalized treatment regimes tailored to each individual. It will also entail proactive participation from all major stakeholders in the health care system. We are at the dawn of predictive, preventive, personalized, and participatory (P4) medicine, the fully implementation of which requires marrying basic and clinical researches through advanced systems thinking and the employment of high-throughput technologies in genomics, proteomics, nanofluidics, single-cell analysis, and computation strategies in a highly-orchestrated discipline we termed translational systems medicine. PMID:23084773
Computational Prediction of Protein-Protein Interactions
Ehrenberger, Tobias; Cantley, Lewis C.; Yaffe, Michael B.
2015-01-01
The prediction of protein-protein interactions and kinase-specific phosphorylation sites on individual proteins is critical for correctly placing proteins within signaling pathways and networks. The importance of this type of annotation continues to increase with the continued explosion of genomic and proteomic data, particularly with emerging data categorizing posttranslational modifications on a large scale. A variety of computational tools are available for this purpose. In this chapter, we review the general methodologies for these types of computational predictions and present a detailed user-focused tutorial of one such method and computational tool, Scansite, which is freely available to the entire scientific community over the Internet. PMID:25859943
Computational biology for ageing
Wieser, Daniela; Papatheodorou, Irene; Ziehm, Matthias; Thornton, Janet M.
2011-01-01
High-throughput genomic and proteomic technologies have generated a wealth of publicly available data on ageing. Easy access to these data, and their computational analysis, is of great importance in order to pinpoint the causes and effects of ageing. Here, we provide a description of the existing databases and computational tools on ageing that are available for researchers. We also describe the computational approaches to data interpretation in the field of ageing including gene expression, comparative and pathway analyses, and highlight the challenges for future developments. We review recent biological insights gained from applying bioinformatics methods to analyse and interpret ageing data in different organisms, tissues and conditions. PMID:21115530
Top-down Proteomics: Technology Advancements and Applications to Heart Diseases
Cai, Wenxuan; Tucholski, Trisha M.; Gregorich, Zachery R.; Ge, Ying
2016-01-01
Introduction Diseases of the heart are a leading cause of morbidity and mortality for both men and women worldwide, and impose significant economic burdens on the healthcare systems. Despite substantial effort over the last several decades, the molecular mechanisms underlying diseases of the heart remain poorly understood. Areas covered Altered protein post-translational modifications (PTMs) and protein isoform switching are increasingly recognized as important disease mechanisms. Top-down high-resolution mass spectrometry (MS)-based proteomics has emerged as the most powerful method for the comprehensive analysis of PTMs and protein isoforms. Here, we will review recent technology developments in the field of top-down proteomics, as well as highlight recent studies utilizing top-down proteomics to decipher the cardiac proteome for the understanding of the molecular mechanisms underlying diseases of the heart. Expert commentary Top-down proteomics is a premier method for the global and comprehensive study of protein isoforms and their PTMs, enabling the identification of novel protein isoforms and PTMs, characterization of sequence variations, and quantification of disease-associated alterations. Despite significant challenges, continuous development of top-down proteomics technology will greatly aid the dissection of the molecular mechanisms underlying diseases of the hearts for the identification of novel biomarkers and therapeutic targets. PMID:27448560
Petricoin, Emanuel F; Rajapaske, Vinodh; Herman, Eugene H; Arekani, Ali M; Ross, Sally; Johann, Donald; Knapton, Alan; Zhang, J; Hitt, Ben A; Conrads, Thomas P; Veenstra, Timothy D; Liotta, Lance A; Sistare, Frank D
2004-01-01
Proteomics is more than just generating lists of proteins that increase or decrease in expression as a cause or consequence of pathology. The goal should be to characterize the information flow through the intercellular protein circuitry which communicates with the extracellular microenvironment and then ultimately to the serum/plasma macroenvironment. The nature of this information can be a cause, or a consequence, of disease and toxicity based processes as cascades of reinforcing information percolate through the system and become reflected in changing proteomic information content of the circulation. Serum Proteomic Pattern Diagnostics is a new type of proteomic platform in which patterns of proteomic signatures from high dimensional mass spectrometry data are used as a diagnostic classifier. While this approach has shown tremendous promise in early detection of cancers, detection of drug-induced toxicity may also be possible with this same technology. Analysis of serum from rat models of anthracycline and anthracenedione induced cardiotoxicity indicate the potential clinical utility of diagnostic proteomic patterns where low molecular weight peptides and protein fragments may have higher accuracy than traditional biomarkers of cardiotoxicity such as troponins. These fragments may one day be harvested by circulating nanoparticles designed to absorb, enrich and amplify the diagnostic biomarker repertoire generated even at the critical initial stages of toxicity.
MitoMiner: a data warehouse for mitochondrial proteomics data
Smith, Anthony C.; Blackshaw, James A.; Robinson, Alan J.
2012-01-01
MitoMiner (http://mitominer.mrc-mbu.cam.ac.uk/) is a data warehouse for the storage and analysis of mitochondrial proteomics data gathered from publications of mass spectrometry and green fluorescent protein tagging studies. In MitoMiner, these data are integrated with data from UniProt, Gene Ontology, Online Mendelian Inheritance in Man, HomoloGene, Kyoto Encyclopaedia of Genes and Genomes and PubMed. The latest release of MitoMiner stores proteomics data sets from 46 studies covering 11 different species from eumetazoa, viridiplantae, fungi and protista. MitoMiner is implemented by using the open source InterMine data warehouse system, which provides a user interface allowing users to upload data for analysis, personal accounts to store queries and results and enables queries of any data in the data model. MitoMiner also provides lists of proteins for use in analyses, including the new MitoMiner mitochondrial proteome reference sets that specify proteins with substantial experimental evidence for mitochondrial localization. As further mitochondrial proteomics data sets from normal and diseased tissue are published, MitoMiner can be used to characterize the variability of the mitochondrial proteome between tissues and investigate how changes in the proteome may contribute to mitochondrial dysfunction and mitochondrial-associated diseases such as cancer, neurodegenerative diseases, obesity, diabetes, heart failure and the ageing process. PMID:22121219
Proteomics boosts translational and clinical microbiology.
Del Chierico, F; Petrucca, A; Vernocchi, P; Bracaglia, G; Fiscarelli, E; Bernaschi, P; Muraca, M; Urbani, A; Putignani, L
2014-01-31
The application of proteomics to translational and clinical microbiology is one of the most advanced frontiers in the management and control of infectious diseases and in the understanding of complex microbial systems within human fluids and districts. This new approach aims at providing, by dedicated bioinformatic pipelines, a thorough description of pathogen proteomes and their interactions within the context of human host ecosystems, revolutionizing the vision of infectious diseases in biomedicine and approaching new viewpoints in both diagnostic and clinical management of the patient. Indeed, in the last few years, many laboratories have matured a series of advanced proteomic applications, aiming at providing individual proteome charts of pathogens, with respect to their morph and/or cell life stages, antimicrobial or antimycotic resistance profiling, epidemiological dispersion. Herein, we aim at reviewing the current state-of-the-art on proteomic protocols designed and set-up for translational and diagnostic microbiological purposes, from axenic pathogens' characterization to microbiota ecosystems' full description. The final goal is to describe applications of the most common MALDI-TOF MS platforms to advanced diagnostic issues related to emerging infections, increasing of fastidious bacteria, and generation of patient-tailored phylotypes. This article is part of a Special Issue entitled: Trends in Microbial Proteomics. © 2013. Published by Elsevier B.V. All rights reserved.
The “Dark Side” of Food Stuff Proteomics: The CPLL-Marshals Investigate
Righetti, Pier Giorgio; Fasoli, Elisa; D’Amato, Alfonsina; Boschetti, Egisto
2014-01-01
The present review deals with analysis of the proteome of animal and plant-derived food stuff, as well as of non-alcoholic and alcoholic beverages. The survey is limited to those systems investigated with the help of combinatorial peptide ligand libraries, a most powerful technique allowing access to low- to very-low-abundance proteins, i.e., to those proteins that might characterize univocally a given biological system and, in the case of commercial food preparations, attest their genuineness or adulteration. Among animal foods the analysis of cow’s and donkey’s milk is reported, together with the proteomic composition of egg white and yolk, as well as of honey, considered as a hybrid between floral and animal origin. In terms of plant and fruits, a survey is offered of spinach, artichoke, banana, avocado, mango and lemon proteomics, considered as recalcitrant tissues in that small amounts of proteins are dispersed into a large body of plant polymers and metabolites. As examples of non-alcoholic beverages, ginger ale, coconut milk, a cola drink, almond milk and orgeat syrup are analyzed. Finally, the trace proteome of white and red wines, beer and aperitifs is reported, with the aim of tracing the industrial manipulations and herbal usage prior to their commercialization. PMID:28234315
Wegrzyn, Jill L.; Bark, Steven J.; Funkelstein, Lydiane; Mosier, Charles; Yap, Angel; Kazemi-Esfarjani, Parasa; La Spada, Albert; Sigurdson, Christina; O’Connor, Daniel T.; Hook, Vivian
2010-01-01
Regulated secretion of neurotransmitters and neurohumoural factors from dense core secretory vesicles provides essential neuroeffectors for cell-cell communication in the nervous and endocrine systems. This study provides comprehensive proteomic characterization of the categories of proteins in chromaffin dense core secretory vesicles that participate in cell-cell communication from the adrenal medulla. Proteomic studies were conducted by nano-HPLC Chip MS/MS tandem mass spectrometry. Results demonstrate that these secretory vesicles contain proteins of distinct functional categories consisting of neuropeptides and neurohumoural factors, protease systems, neurotransmitter enzymes and transporters, receptors, enzymes for biochemical processes, reduction/oxidation regulation, ATPases, protein folding, lipid biochemistry, signal transduction, exocytosis, calcium regulation, as well as structural and cell adhesion proteins. The secretory vesicle proteomic data identified 371 distinct proteins in the soluble fraction and 384 distinct membrane proteins, for a total of 686 distinct secretory vesicle proteins. Notably, these proteomic analyses illustrate the presence of several neurological disease-related proteins in these secretory vesicles, including huntingtin interacting protein, cystatin C, ataxin 7, and prion protein. Overall, these findings demonstrate that multiple protein categories participate in dense core secretory vesicles for production, storage, and secretion of bioactive neuroeffectors for cell-cell communication in health and disease. PMID:20695487
NASA Astrophysics Data System (ADS)
Raphael, Itay; Mahesula, Swetha; Purkar, Anjali; Black, David; Catala, Alexis; Gelfond, Jonathon A. L.; Forsthuber, Thomas G.; Haskins, William E.
2014-09-01
Central nervous system-specific proteins (CSPs), transported across the damaged blood-brain-barrier (BBB) to cerebrospinal fluid (CSF) and blood (serum), might be promising diagnostic, prognostic and predictive protein biomarkers of disease in individual multiple sclerosis (MS) patients because they are not expected to be present at appreciable levels in the circulation of healthy subjects. We hypothesized that microwave & magnetic (M2) proteomics of CSPs in brain tissue might be an effective means to prioritize putative CSP biomarkers for future immunoassays in serum. To test this hypothesis, we used M2 proteomics to longitudinally assess CSP expression in brain tissue from mice during experimental autoimmune encephalomyelitis (EAE), a mouse model of MS. Confirmation of central nervous system (CNS)-infiltrating inflammatory cell response and CSP expression in serum was achieved with cytokine ELISPOT and ELISA immunoassays, respectively, for selected CSPs. M2 proteomics (and ELISA) revealed characteristic CSP expression waves, including synapsin-1 and α-II-spectrin, which peaked at day 7 in brain tissue (and serum) and preceded clinical EAE symptoms that began at day 10 and peaked at day 20. Moreover, M2 proteomics supports the concept that relatively few CNS-infiltrating inflammatory cells can have a disproportionally large impact on CSP expression prior to clinical manifestation of EAE.
USDA-ARS?s Scientific Manuscript database
The study of desiccation tolerance of lichens, and of their photobionts in particular, has frequently focused on the antioxidant system that protects the cell against photo-oxidative stress during dehydration/rehydration cycles. Thus, in this work we carried out proteomic and transcript analyses of ...
Activity-based protein profiling: from enzyme chemistry to proteomic chemistry.
Cravatt, Benjamin F; Wright, Aaron T; Kozarich, John W
2008-01-01
Genome sequencing projects have provided researchers with a complete inventory of the predicted proteins produced by eukaryotic and prokaryotic organisms. Assignment of functions to these proteins represents one of the principal challenges for the field of proteomics. Activity-based protein profiling (ABPP) has emerged as a powerful chemical proteomic strategy to characterize enzyme function directly in native biological systems on a global scale. Here, we review the basic technology of ABPP, the enzyme classes addressable by this method, and the biological discoveries attributable to its application.
Yan, Xianghe; Peng, Yun; Meng, Jianghong; Ruzante, Juliana; Fratamico, Pina M; Huang, Lihan; Juneja, Vijay; Needleman, David S
2011-01-01
Several factors have hindered effective use of information and resources related to food safety due to inconsistency among semantically heterogeneous data resources, lack of knowledge on profiling of food-borne pathogens, and knowledge gaps among research communities, government risk assessors/managers, and end-users of the information. This paper discusses technical aspects in the establishment of a comprehensive food safety information system consisting of the following steps: (a) computational collection and compiling publicly available information, including published pathogen genomic, proteomic, and metabolomic data; (b) development of ontology libraries on food-borne pathogens and design automatic algorithms with formal inference and fuzzy and probabilistic reasoning to address the consistency and accuracy of distributed information resources (e.g., PulseNet, FoodNet, OutbreakNet, PubMed, NCBI, EMBL, and other online genetic databases and information); (c) integration of collected pathogen profiling data, Foodrisk.org ( http://www.foodrisk.org ), PMP, Combase, and other relevant information into a user-friendly, searchable, "homogeneous" information system available to scientists in academia, the food industry, and government agencies; and (d) development of a computational model in semantic web for greater adaptability and robustness.
Quantifying Ubiquitin Signaling
Ordureau, Alban; Münch, Christian; Harper, J. Wade
2015-01-01
Ubiquitin (UB)-driven signaling systems permeate biology, and are often integrated with other types of post-translational modifications (PTMs), most notably phosphorylation. Flux through such pathways is typically dictated by the fractional stoichiometry of distinct regulatory modifications and protein assemblies as well as the spatial organization of pathway components. Yet, we rarely understand the dynamics and stoichiometry of rate-limiting intermediates along a reaction trajectory. Here, we review how quantitative proteomic tools and enrichment strategies are being used to quantify UB-dependent signaling systems, and to integrate UB signaling with regulatory phosphorylation events. A key regulatory feature of ubiquitylation is that the identity of UB chain linkage types can control downstream processes. We also describe how proteomic and enzymological tools can be used to identify and quantify UB chain synthesis and linkage preferences. The emergence of sophisticated quantitative proteomic approaches will set a new standard for elucidating biochemical mechanisms of UB-driven signaling systems. PMID:26000850
The Need for Integrated Approaches in Metabolic Engineering.
Lechner, Anna; Brunk, Elizabeth; Keasling, Jay D
2016-11-01
This review highlights state-of-the-art procedures for heterologous small-molecule biosynthesis, the associated bottlenecks, and new strategies that have the potential to accelerate future accomplishments in metabolic engineering. We emphasize that a combination of different approaches over multiple time and size scales must be considered for successful pathway engineering in a heterologous host. We have classified these optimization procedures based on the "system" that is being manipulated: transcriptome, translatome, proteome, or reactome. By bridging multiple disciplines, including molecular biology, biochemistry, biophysics, and computational sciences, we can create an integral framework for the discovery and implementation of novel biosynthetic production routes. Copyright © 2016 Cold Spring Harbor Laboratory Press; all rights reserved.
MacGilvray, Matthew E; Shishkova, Evgenia; Chasman, Deborah; Place, Michael; Gitter, Anthony; Coon, Joshua J; Gasch, Audrey P
2018-05-01
Cells respond to stressful conditions by coordinating a complex, multi-faceted response that spans many levels of physiology. Much of the response is coordinated by changes in protein phosphorylation. Although the regulators of transcriptome changes during stress are well characterized in Saccharomyces cerevisiae, the upstream regulatory network controlling protein phosphorylation is less well dissected. Here, we developed a computational approach to infer the signaling network that regulates phosphorylation changes in response to salt stress. We developed an approach to link predicted regulators to groups of likely co-regulated phospho-peptides responding to stress, thereby creating new edges in a background protein interaction network. We then use integer linear programming (ILP) to integrate wild type and mutant phospho-proteomic data and predict the network controlling stress-activated phospho-proteomic changes. The network we inferred predicted new regulatory connections between stress-activated and growth-regulating pathways and suggested mechanisms coordinating metabolism, cell-cycle progression, and growth during stress. We confirmed several network predictions with co-immunoprecipitations coupled with mass-spectrometry protein identification and mutant phospho-proteomic analysis. Results show that the cAMP-phosphodiesterase Pde2 physically interacts with many stress-regulated transcription factors targeted by PKA, and that reduced phosphorylation of those factors during stress requires the Rck2 kinase that we show physically interacts with Pde2. Together, our work shows how a high-quality computational network model can facilitate discovery of new pathway interactions during osmotic stress.
Min, Li; Cheng, Jianbo; Zhao, Shengguo; Tian, He; Zhang, Yangdong; Li, Songli; Yang, Hongjian; Zheng, Nan; Wang, Jiaqi
2016-09-02
Heat stress (HS) has an enormous economic impact on the dairy industry. In recent years, many researchers have investigated changes in the gene expression and metabolomics profiles in dairy cows caused by HS. However, the proteomics profiles of heat-stressed dairy cows have not yet been completely elucidated. We compared plasma proteomics from HS-free and heat-stressed dairy cows using an iTRAQ labeling approach. After the depletion of high abundant proteins in the plasma, 1472 proteins were identified. Of these, 85 proteins were differentially abundant in cows exposed to HS relative to HS-free. Database searches combined with GO and KEGG pathway enrichment analyses revealed that many components of the complement and coagulation cascades were altered in heat-stressed cows compared with HS-free cows. Of these, many factors in the complement system (including complement components C1, C3, C5, C6, C7, C8, and C9, complement factor B, and factor H) were down-regulated by HS, while components of the coagulation system (including coagulation factors, vitamin K-dependent proteins, and fibrinogens) were up-regulated by HS. In conclusion, our results indicate that HS decreases plasma levels of complement system proteins, suggesting that immune function is impaired in dairy cows exposed to HS. Though many aspects of heat stress (HS) have been extensively researched, relatively little is known about the proteomics profile changes that occur during heat exposure. In this work, we employed a proteomics approach to investigate differential abundance of plasma proteins in HS-free and heat-stressed dairy cows. Database searches combined with GO and KEGG pathway enrichment analyses revealed that HS resulted in a decrease in complement components, suggesting that heat-stressed dairy cows have impaired immune function. In addition, through integrative analyses of proteomics and previous metabolomics, we showed enhanced glycolysis, lipid metabolic pathway shifts, and nitrogen repartitioning in dairy cows exposed to HS. Our findings expand our current knowledge on the effects of HS on plasma proteomics in dairy cows and offer a new perspective for future research. Copyright © 2016 Elsevier B.V. All rights reserved.
solveME: fast and reliable solution of nonlinear ME models.
Yang, Laurence; Ma, Ding; Ebrahim, Ali; Lloyd, Colton J; Saunders, Michael A; Palsson, Bernhard O
2016-09-22
Genome-scale models of metabolism and macromolecular expression (ME) significantly expand the scope and predictive capabilities of constraint-based modeling. ME models present considerable computational challenges: they are much (>30 times) larger than corresponding metabolic reconstructions (M models), are multiscale, and growth maximization is a nonlinear programming (NLP) problem, mainly due to macromolecule dilution constraints. Here, we address these computational challenges. We develop a fast and numerically reliable solution method for growth maximization in ME models using a quad-precision NLP solver (Quad MINOS). Our method was up to 45 % faster than binary search for six significant digits in growth rate. We also develop a fast, quad-precision flux variability analysis that is accelerated (up to 60× speedup) via solver warm-starts. Finally, we employ the tools developed to investigate growth-coupled succinate overproduction, accounting for proteome constraints. Just as genome-scale metabolic reconstructions have become an invaluable tool for computational and systems biologists, we anticipate that these fast and numerically reliable ME solution methods will accelerate the wide-spread adoption of ME models for researchers in these fields.
Proteomics in Traditional Chinese Medicine with an Emphasis on Alzheimer's Disease
Sulistio, Yanuar Alan
2015-01-01
In recent years, there has been an increasing worldwide interest in traditional Chinese medicine (TCM). This increasing demand for TCM needs to be accompanied by a deeper understanding of the mechanisms of action of TCM-based therapy. However, TCM is often described as a concept of Chinese philosophy, which is incomprehensible for Western medical society, thereby creating a gap between TCM and Western medicine (WM). In order to meet this challenge, TCM research has applied proteomics technologies for exploring the mechanisms of action of TCM treatment. Proteomics enables TCM researchers to oversee various pathways that are affected by treatment, as well as the dynamics of their interactions with one another. This review discusses the utility of comparative proteomics to better understand how TCM treatment may be used as a complementary therapy for Alzheimer's disease (AD). Additionally, we review the data from comparative AD-related TCM proteomics studies and establish the relevance of the data with available AD hypotheses, most notably regarding the ubiquitin proteasome system (UPS). PMID:26557146
Microchip-Based Single-Cell Functional Proteomics for Biomedical Applications
Lu, Yao; Yang, Liu; Wei, Wei; Shi, Qihui
2017-01-01
Cellular heterogeneity has been widely recognized but only recently have single cell tools become available that allow characterizing heterogeneity at the genomic and proteomic levels. We review the technological advances in microchip-based toolkits for single-cell functional proteomics. Each of these tools has distinct advantages and limitations, and a few have advanced toward being applied to address biological or clinical problems that fail to be addressed by traditional population-based methods. High-throughput single-cell proteomic assays generate high-dimensional data sets that contain new information and thus require developing new analytical framework to extract new biology. In this review article, we highlight a few biological and clinical applications in which the microchip-based single-cell proteomic tools provide unique advantages. The examples include resolving functional heterogeneity and dynamics of immune cells, dissecting cell-cell interaction by creating well-contolled on-chip microenvironment, capturing high-resolution snapshots of immune system functions in patients for better immunotherapy and elucidating phosphoprotein signaling networks in cancer cells for guiding effective molecularly targeted therapies. PMID:28280819
Proteomics and syndrome of Chinese medicine
Lu, Chuan-Li; Qv, Xiao-Ying; Jiang, Jian-Guo
2010-01-01
Abstract Syndrome of Chinese medicine is an understanding of the regularity of disease occurrence and development and its performance of symptoms. Syndrome is the key to recognize diseases and the foundation to treat them. However, because of the complexity of the concept and the limitation of present investigations, the research of syndrome is hard to go further. Proteomics has been received extensive attention in the area of medical diagnosis and drug development. In the holistic and systemic context, proteomics have a convergence with traditional Chinese medicine (TCM) syndrome, which could overcome the one-sidedness and singleness of TCM and avoid the complication and tedious processes. Chinese medicine has a wealth of experience and proteomics has a substantial research potential, the integration of the two aspects will bring a great enhancement of our knowledge of disease. PMID:20874721
Using HPLC-Mass Spectrometry to Teach Proteomics Concepts with Problem-Based Techniques
ERIC Educational Resources Information Center
Short, Michael; Short, Anne; Vankempen, Rachel; Seymour, Michael; Burnatowska-Hledin, Maria
2010-01-01
Practical instruction of proteomics concepts was provided using high-performance liquid chromatography coupled with a mass selective detection system (HPLC-MS) for the analysis of simulated protein digests. The samples were prepared from selected dipeptides in order to facilitate the mass spectral identification. As part of the prelaboratory…
Plant fluid proteomics: Delving into the xylem sap, phloem sap and apoplastic fluid proteomes
USDA-ARS?s Scientific Manuscript database
The phloem sap, xylem sap and apoplastic fluid play key roles in long and short distance transport of signals and nutrients, and act as a barrier against local and systemic pathogen infection. Among other components, these plant fluids contain proteins which are likely to be important players in the...
MALDI-TOF MS of Trichoderma: A model system for the identification of microfungi
USDA-ARS?s Scientific Manuscript database
This investigation aimed to assess whether MALDI-TOF MS analysis of proteomics could be applied to the study of Trichoderma, a fungal genus selected because it includes many species and is phylogenetically well defined. We also investigated whether MALDI-TOF MS analysis of proteomics would reveal ap...
Association of proteomics changes with Al-sensitive root zones in switchgrass
USDA-ARS?s Scientific Manuscript database
In this paper, we report on aluminum (Al)-induced root proteomic changes in switchgrass. After growth in a hydroponic culture system supplemented with 400 uM of Al, plants began to show signs of physiological stress such as a reduction in photosynthetic rate. At this time, the basal 2-cmlong root ti...
Plant fluid proteomics: Delving into the xylem sap, phloem sap and apoplastic fluid proteomes
USDA-ARS?s Scientific Manuscript database
The phloem sap, xylem sap and apoplastic fluid play key roles in long and short distance transport of signals and nutrients, and act as a barrier against local and systemic pathogen infection. Among other components, these plant fluids contain proteins, which are likely to be important players in th...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Duan, Jicheng; Gaffrey, Matthew J.; Qian, Wei-Jun
Protein cysteine thiols play a crucial role in redox signaling, regulation of enzymatic activity and protein function, and maintaining redox homeostasis in living systems. The unique chemical reactivity of thiol groups makes cysteine susceptible to oxidative modifications by reactive oxygen and nitrogen species to form a broad array of reversible and irreversible protein post-translational modifications (PTMs). The reversible modifications in particular are one of the major components of redox signaling and are involved in regulation of various cellular processes under physiological and pathological conditions. The biological significance of these redox PTMs in health and diseases has been increasingly recognized. Herein,more » we review the recent advances of quantitative proteomic approaches for investigating redox PTMs in complex biological systems, including the general considerations of sample processing, various chemical or affinity enrichment strategies, and quantitative approaches. We also highlight a number of redox proteomic approaches that enable effective profiling of redox PTMs for addressing specific biological questions. Although some technological limitations remain, redox proteomics is paving the way towards a better understanding of redox signaling and regulation in human health and diseases.« less
Masand, Ruchi; Paulo, Esther; Wu, Dongmei; Wang, Yangmeng; Swaney, Danielle L; Jimenez-Morales, David; Krogan, Nevan J; Wang, Biao
2018-03-06
Brown adipose tissue (BAT) thermogenesis is critical for thermoregulation and contributes to total energy expenditure. However, whether BAT has non-thermogenic functions is largely unknown. Here, we describe that BAT-specific liver kinase b1 knockout (Lkb1 BKO ) mice exhibited impaired BAT mitochondrial respiration and thermogenesis but reduced adiposity and liver triglyceride accumulation under high-fat-diet feeding at room temperature. Importantly, these metabolic benefits were also present in Lkb1 BKO mice at thermoneutrality, where BAT thermogenesis was not required. Mechanistically, decreased mRNA levels of mtDNA-encoded electron transport chain (ETC) subunits and ETC proteome imbalance led to defective BAT mitochondrial respiration in Lkb1 BKO mice. Furthermore, reducing mtDNA gene expression directly in BAT by removing mitochondrial transcription factor A (Tfam) in BAT also showed ETC proteome imbalance and the trade-off between BAT thermogenesis and systemic metabolism at room temperature and thermoneutrality. Collectively, our data demonstrate that ETC proteome imbalance in BAT regulates systemic metabolism independently of thermogenesis. Copyright © 2018 Elsevier Inc. All rights reserved.
Redox proteomics and drug development.
D'Alessandro, Angelo; Rinalducci, Sara; Zolla, Lello
2011-11-18
As alterations of the redox homeostasis lie at the root of many pathophysiological processes in human health, redox proteomics holds the promise to shed further light on fundamental biological processes. In this review, the mechanisms of reactive oxygen species (ROS) and reactive nitrogen species (RNS) production are reviewed, mainly addressing those chemical phenomena which have already been associated with pathological conditions (of the central nervous system, cardiovascular system, or simply related to aging and altered-cell cycle regulation). From Alzheimer's to Parkinson's and Hungtinton's disease, from ageing to cancer, oxidative stress (OS) appears to represent a common trait in so many relevant biological aspects of human health, that further investments in the field of redox proteomics ought to be mandatory. For the foreseeable future, redox proteomics will likely play a pivotal role in the quest for new therapeutical targets and their validation, in the process of determining OS-triggered cellular alteration upon drug treatments and thus in the very heart of the design and testing of new drugs and their metabolites against those pathologies relying on altered redox homeostasis. Copyright © 2011 Elsevier B.V. All rights reserved.
Ephritikhine, Geneviève; Ferro, Myriam; Rolland, Norbert
2004-12-01
Plant membrane proteins are involved in many different functions according to their location in the cell. For instance, the chloroplast has two membrane systems, thylakoids and envelope, with specialized membrane proteins for photosynthesis and metabolite and ion transporters, respectively. Although recent advances in sample preparation and analytical techniques have been achieved for the study of membrane proteins, the characterization of these proteins, especially the hydrophobic ones, is still challenging. The present review highlights recent advances in methodologies for identification of plant membrane proteins from purified subcellular structures. The interest of combining several complementary extraction procedures to take into account specific features of membrane proteins is discussed in the light of recent proteomics data, notably for chloroplast envelope, mitochondrial membranes and plasma membrane from Arabidopsis. These examples also illustrate how, on one hand, proteomics can feed bioinformatics for a better definition of prediction tools and, on the other hand, although prediction tools are not 100% reliable, they can give valuable information for biological investigations. In particular, membrane proteomics brings new insights over plant membrane systems, on both the membrane compartment where proteins are working and their putative cellular function.
Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.
Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi
2007-10-04
In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.
Sprenger, Adrian; Weber, Sebastian; Zarai, Mostafa; Engelke, Rudolf; Nascimento, Juliana M.; Gretzmeier, Christine; Hilpert, Martin; Boerries, Melanie; Has, Cristina; Busch, Hauke; Bruckner-Tuderman, Leena; Dengjel, Jörn
2013-01-01
Keratinocytes account for 95% of all cells of the epidermis, the stratified squamous epithelium forming the outer layer of the skin, in which a significant number of skin diseases takes root. Immortalized keratinocyte cell lines are often used as research model systems providing standardized, reproducible, and homogenous biological material. Apart from that, primary human keratinocytes are frequently used for medical studies because the skin provides an important route for drug administration and is readily accessible for biopsies. However, comparability of these cell systems is not known. Cell lines may undergo phenotypic shifts and may differ from the in vivo situation in important aspects. Primary cells, on the other hand, may vary in biological functions depending on gender and age of the donor and localization of the biopsy specimen. Here we employed metabolic labeling in combination with quantitative mass spectrometry-based proteomics to assess A431 and HaCaT cell lines for their suitability as model systems. Compared with cell lines, comprehensive profiling of the primary human keratinocyte proteome with respect to gender, age, and skin localization identified an unexpected high proteomic consistency. The data were analyzed by an improved ontology enrichment analysis workflow designed for the study of global proteomics experiments. It enables a quick, comprehensive and unbiased overview of altered biological phenomena and links experimental data to literature. We guide through our workflow, point out its advantages compared with other methods and apply it to visualize differences of cell lines compared with primary human keratinocytes. PMID:23722187
Polyphemus, Odysseus and the ovine milk proteome.
Cunsolo, Vincenzo; Fasoli, Elisa; Di Francesco, Antonella; Saletti, Rosaria; Muccilli, Vera; Gallina, Serafina; Righetti, Pier Giorgio; Foti, Salvatore
2017-01-30
In the last years the amount of ovine milk production, mainly used to formulate a wide range of different and exclusive dairy products often categorized as gourmet food, has been progressively increasing. Taking also into account that sheep milk (SM) also appears to be potentially less allergenic than cow's one, an in-depth information about its protein composition is essential to improve the comprehension of its potential benefits for human consumption. The present work reports the results of an in-depth characterization of SM whey proteome, carried out by coupling the CPLL technology with SDS-PAGE and high resolution UPLC-nESI MS/MS analysis. This approach allowed the identification of 718 different protein components, 644 of which are from unique genes. Particularly, this identification has expanded literature data about sheep whey proteome by 193 novel proteins previously undetected, many of which are involved in the defence/immunity mechanisms or in the nutrient delivery system. A comparative analysis of SM proteome known to date with cow's milk proteome, evidenced that while about 29% of SM proteins are also present in CM, 71% of the identified components appear to be unique of SM proteome and include a heterogeneous group of components which seem to have health-promoting benefits. The data have been deposited to the ProteomeXchange with identifier
Proteomics of survival structures of fungal pathogens.
Loginov, Dmitry; Šebela, Marek
2016-09-25
Fungal pathogens are causal agents of numerous human, animal, and plant diseases. They employ various infection modes to overcome host defense systems. Infection mechanisms of different fungi have been subjected to many comprehensive studies. These investigations have been facilitated by the development of various '-omics' techniques, and proteomics has one of the leading roles in this regard. Fungal conidia and sclerotia could be considered the most important structures for pathogenesis as their germination is one of the first steps towards a host infection. They represent interesting objects for proteomic studies because of the presence of unique proteins with unexplored biotechnological potential required for pathogen viability, development and the subsequent host infection. Proteomic peculiarities of survival structures of different fungi, including those of biotechnological significance (e.g., Asperillus fumigatus, A. nidulans, Metarhizium anisopliae), in a dormant state, as well as changes in the protein production during early stages of fungal development are the subjects of the present review. We focused on biological aspects of proteomic studies of fungal survival structures rather than on an evaluation of proteomic approaches. For that reason, proteins that have been identified in this context are discussed from the point of view of their involvement in different biological processes and possible functions assigned to them. This is the first review paper summarizing recent advances in proteomics of fungal survival structures. Copyright © 2016 Elsevier B.V. All rights reserved.
Brambila-Tapia, Aniel Jessica Leticia; Perez-Rueda, Ernesto; Barrios, Humberto; Dávalos-Rodríguez, Nory Omayra; Dávalos-Rodríguez, Ingrid Patricia; Cardona-Muñoz, Ernesto Germán; Salazar-Páramo, Mario
2017-08-01
A systematic analysis of beta-lactamases based on comparative proteomics has not been performed thus far. In this report, we searched for the presence of beta-lactam-related proteins in 591 bacterial proteomes belonging to 52 species that are pathogenic to humans. The amino acid sequences for 19 different types of beta-lactamases (ACT, CARB, CifA, CMY, CTX, FOX, GES, GOB, IMP, IND, KPC, LEN, OKP, OXA, OXY, SHV, TEM, NDM, and VIM) were obtained from the ARG-ANNOT database and were used to construct 19 HMM profiles, which were used to identify potential beta-lactamases in the completely sequenced bacterial proteomes. A total of 2877 matches that included the word "beta-lactamase" and/or "penicillin" in the functional annotation and/or in any of its regions were obtained. These enzymes were mainly described as "penicillin-binding proteins," "beta-lactamases," and "metallo-beta-lactamases" and were observed in 47 of the 52 species studied. In addition, proteins classified as "beta-lactamases" were observed in 39 of the species included. A positive correlation between the number of beta-lactam-related proteins per species and the proteome size was observed (R 0.78, P < 0.00001). This correlation partially explains the high presence of beta-lactam-related proteins in large proteomes, such as Nocardia brasiliensis, Bacillus anthracis, and Mycobacterium tuberculosis, along with their absence in small proteomes, such as Chlamydia spp. and Mycoplasma spp. We detected only five types of beta-lactamases (TEM, SHV, CTX, IMP, and OXA) and other related proteins in particular species that corresponded with those reported in the literature. We additionally detected other potential species-specific beta-lactamases that have not yet been reported. In the future, better results will be achieved due to more accurate sequence annotations and a greater number of sequenced genomes.
Crysalis: an integrated server for computational analysis and design of protein crystallization.
Wang, Huilin; Feng, Liubin; Zhang, Ziding; Webb, Geoffrey I; Lin, Donghai; Song, Jiangning
2016-02-24
The failure of multi-step experimental procedures to yield diffraction-quality crystals is a major bottleneck in protein structure determination. Accordingly, several bioinformatics methods have been successfully developed and employed to select crystallizable proteins. Unfortunately, the majority of existing in silico methods only allow the prediction of crystallization propensity, seldom enabling computational design of protein mutants that can be targeted for enhancing protein crystallizability. Here, we present Crysalis, an integrated crystallization analysis tool that builds on support-vector regression (SVR) models to facilitate computational protein crystallization prediction, analysis, and design. More specifically, the functionality of this new tool includes: (1) rapid selection of target crystallizable proteins at the proteome level, (2) identification of site non-optimality for protein crystallization and systematic analysis of all potential single-point mutations that might enhance protein crystallization propensity, and (3) annotation of target protein based on predicted structural properties. We applied the design mode of Crysalis to identify site non-optimality for protein crystallization on a proteome-scale, focusing on proteins currently classified as non-crystallizable. Our results revealed that site non-optimality is based on biases related to residues, predicted structures, physicochemical properties, and sequence loci, which provides in-depth understanding of the features influencing protein crystallization. Crysalis is freely available at http://nmrcen.xmu.edu.cn/crysalis/.
Crysalis: an integrated server for computational analysis and design of protein crystallization
Wang, Huilin; Feng, Liubin; Zhang, Ziding; Webb, Geoffrey I.; Lin, Donghai; Song, Jiangning
2016-01-01
The failure of multi-step experimental procedures to yield diffraction-quality crystals is a major bottleneck in protein structure determination. Accordingly, several bioinformatics methods have been successfully developed and employed to select crystallizable proteins. Unfortunately, the majority of existing in silico methods only allow the prediction of crystallization propensity, seldom enabling computational design of protein mutants that can be targeted for enhancing protein crystallizability. Here, we present Crysalis, an integrated crystallization analysis tool that builds on support-vector regression (SVR) models to facilitate computational protein crystallization prediction, analysis, and design. More specifically, the functionality of this new tool includes: (1) rapid selection of target crystallizable proteins at the proteome level, (2) identification of site non-optimality for protein crystallization and systematic analysis of all potential single-point mutations that might enhance protein crystallization propensity, and (3) annotation of target protein based on predicted structural properties. We applied the design mode of Crysalis to identify site non-optimality for protein crystallization on a proteome-scale, focusing on proteins currently classified as non-crystallizable. Our results revealed that site non-optimality is based on biases related to residues, predicted structures, physicochemical properties, and sequence loci, which provides in-depth understanding of the features influencing protein crystallization. Crysalis is freely available at http://nmrcen.xmu.edu.cn/crysalis/. PMID:26906024
Awan, Muaaz Gul; Saeed, Fahad
2016-05-15
Modern proteomics studies utilize high-throughput mass spectrometers which can produce data at an astonishing rate. These big mass spectrometry (MS) datasets can easily reach peta-scale level creating storage and analytic problems for large-scale systems biology studies. Each spectrum consists of thousands of peaks which have to be processed to deduce the peptide. However, only a small percentage of peaks in a spectrum are useful for peptide deduction as most of the peaks are either noise or not useful for a given spectrum. This redundant processing of non-useful peaks is a bottleneck for streaming high-throughput processing of big MS data. One way to reduce the amount of computation required in a high-throughput environment is to eliminate non-useful peaks. Existing noise removing algorithms are limited in their data-reduction capability and are compute intensive making them unsuitable for big data and high-throughput environments. In this paper we introduce a novel low-complexity technique based on classification, quantization and sampling of MS peaks. We present a novel data-reductive strategy for analysis of Big MS data. Our algorithm, called MS-REDUCE, is capable of eliminating noisy peaks as well as peaks that do not contribute to peptide deduction before any peptide deduction is attempted. Our experiments have shown up to 100× speed up over existing state of the art noise elimination algorithms while maintaining comparable high quality matches. Using our approach we were able to process a million spectra in just under an hour on a moderate server. The developed tool and strategy has been made available to wider proteomics and parallel computing community and the code can be found at https://github.com/pcdslab/MSREDUCE CONTACT: : fahad.saeed@wmich.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Mühlhaus, Timo; Weiss, Julia; Hemme, Dorothea; Sommer, Frederik; Schroda, Michael
2011-01-01
Crop-plant-yield safety is jeopardized by temperature stress caused by the global climate change. To take countermeasures by breeding and/or transgenic approaches it is essential to understand the mechanisms underlying plant acclimation to heat stress. To this end proteomics approaches are most promising, as acclimation is largely mediated by proteins. Accordingly, several proteomics studies, mainly based on two-dimensional gel-tandem MS approaches, were conducted in the past. However, results often were inconsistent, presumably attributable to artifacts inherent to the display of complex proteomes via two-dimensional-gels. We describe here a new approach to monitor proteome dynamics in time course experiments. This approach involves full 15N metabolic labeling and mass spectrometry based quantitative shotgun proteomics using a uniform 15N standard over all time points. It comprises a software framework, IOMIQS, that features batch job mediated automated peptide identification by four parallelized search engines, peptide quantification and data assembly for the processing of large numbers of samples. We have applied this approach to monitor proteome dynamics in a heat stress time course using the unicellular green alga Chlamydomonas reinhardtii as model system. We were able to identify 3433 Chlamydomonas proteins, of which 1116 were quantified in at least three of five time points of the time course. Statistical analyses revealed that levels of 38 proteins significantly increased, whereas levels of 206 proteins significantly decreased during heat stress. The increasing proteins comprise 25 (co-)chaperones and 13 proteins involved in chromatin remodeling, signal transduction, apoptosis, photosynthetic light reactions, and yet unknown functions. Proteins decreasing during heat stress were significantly enriched in functional categories that mediate carbon flux from CO2 and external acetate into protein biosynthesis, which also correlated with a rapid, but fully reversible cell cycle arrest after onset of stress. Our approach opens up new perspectives for plant systems biology and provides novel insights into plant stress acclimation. PMID:21610104
Burillo, Elena; Jorge, Inmaculada; Martínez-López, Diego; Camafeita, Emilio; Blanco-Colio, Luis Miguel; Trevisan-Herraz, Marco; Ezkurdia, Iakes; Egido, Jesús; Michel, Jean-Baptiste; Meilhac, Olivier; Vázquez, Jesús; Martin-Ventura, Jose Luis
2016-01-01
High-density lipoproteins (HDLs) are complex protein and lipid assemblies whose composition is known to change in diverse pathological situations. Analysis of the HDL proteome can thus provide insight into the main mechanisms underlying abdominal aortic aneurysm (AAA) and potentially detect novel systemic biomarkers. We performed a multiplexed quantitative proteomics analysis of HDLs isolated from plasma of AAA patients (N = 14) and control study participants (N = 7). Validation was performed by western-blot (HDL), immunohistochemistry (tissue), and ELISA (plasma). HDL from AAA patients showed elevated expression of peroxiredoxin-6 (PRDX6), HLA class I histocompatibility antigen (HLA-I), retinol-binding protein 4, and paraoxonase/arylesterase 1 (PON1), whereas α-2 macroglobulin and C4b-binding protein were decreased. The main pathways associated with HDL alterations in AAA were oxidative stress and immune-inflammatory responses. In AAA tissue, PRDX6 colocalized with neutrophils, vascular smooth muscle cells, and lipid oxidation. Moreover, plasma PRDX6 was higher in AAA (N = 47) than in controls (N = 27), reflecting increased systemic oxidative stress. Finally, a positive correlation was recorded between PRDX6 and AAA diameter. The analysis of the HDL proteome demonstrates that redox imbalance is a major mechanism in AAA, identifying the antioxidant PRDX6 as a novel systemic biomarker of AAA. PMID:27934969
Computational prediction of protein-protein interactions in Leishmania predicted proteomes.
Rezende, Antonio M; Folador, Edson L; Resende, Daniela de M; Ruiz, Jeronimo C
2012-01-01
The Trypanosomatids parasites Leishmania braziliensis, Leishmania major and Leishmania infantum are important human pathogens. Despite of years of study and genome availability, effective vaccine has not been developed yet, and the chemotherapy is highly toxic. Therefore, it is clear just interdisciplinary integrated studies will have success in trying to search new targets for developing of vaccines and drugs. An essential part of this rationale is related to protein-protein interaction network (PPI) study which can provide a better understanding of complex protein interactions in biological system. Thus, we modeled PPIs for Trypanosomatids through computational methods using sequence comparison against public database of protein or domain interaction for interaction prediction (Interolog Mapping) and developed a dedicated combined system score to address the predictions robustness. The confidence evaluation of network prediction approach was addressed using gold standard positive and negative datasets and the AUC value obtained was 0.94. As result, 39,420, 43,531 and 45,235 interactions were predicted for L. braziliensis, L. major and L. infantum respectively. For each predicted network the top 20 proteins were ranked by MCC topological index. In addition, information related with immunological potential, degree of protein sequence conservation among orthologs and degree of identity compared to proteins of potential parasite hosts was integrated. This information integration provides a better understanding and usefulness of the predicted networks that can be valuable to select new potential biological targets for drug and vaccine development. Network modularity which is a key when one is interested in destabilizing the PPIs for drug or vaccine purposes along with multiple alignments of the predicted PPIs were performed revealing patterns associated with protein turnover. In addition, around 50% of hypothetical protein present in the networks received some degree of functional annotation which represents an important contribution since approximately 60% of Leishmania predicted proteomes has no predicted function.
Fasani, Rick A; Savageau, Michael A
2014-11-01
Overcoming the stress of starvation is one of an organism's most challenging phenotypic responses. Those organisms that frequently survive the challenge, by virtue of their fitness, will have evolved genomes that are shaped by their specific environments. Understanding this genotype-environment-phenotype relationship at a deep level will require quantitative predictive models of the complex molecular systems that link these aspects of an organism's existence. Here, we treat one of the most fundamental molecular systems, protein synthesis, and the amino acid biosynthetic pathways involved in the stringent response to starvation. These systems face an inherent logical dilemma: Building an amino acid biosynthetic pathway to synthesize its product-the cognate amino acid of the pathway-may require that very amino acid when it is no longer available. To study this potential "catch-22," we have created a generic model of amino acid biosynthesis in response to sudden starvation. Our mathematical analysis and computational results indicate that there are two distinctly different outcomes: Partial recovery to a new steady state, or full system failure. Moreover, the cell's fate is dictated by the cognate bias, the number of cognate amino acids in the corresponding biosynthetic pathway relative to the average number of that amino acid in the proteome. We test these implications by analyzing the proteomes of over 1,800 sequenced microbes, which reveals statistically significant evidence of low cognate bias, a genetic trait that would avoid the biosynthetic quandary. Furthermore, these results suggest that the pattern of cognate bias, which is readily derived by genome sequencing, may provide evolutionary clues to an organism's natural environment. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Fasani, Rick A.; Savageau, Michael A.
2014-01-01
Overcoming the stress of starvation is one of an organism’s most challenging phenotypic responses. Those organisms that frequently survive the challenge, by virtue of their fitness, will have evolved genomes that are shaped by their specific environments. Understanding this genotype–environment–phenotype relationship at a deep level will require quantitative predictive models of the complex molecular systems that link these aspects of an organism’s existence. Here, we treat one of the most fundamental molecular systems, protein synthesis, and the amino acid biosynthetic pathways involved in the stringent response to starvation. These systems face an inherent logical dilemma: Building an amino acid biosynthetic pathway to synthesize its product—the cognate amino acid of the pathway—may require that very amino acid when it is no longer available. To study this potential “catch-22,” we have created a generic model of amino acid biosynthesis in response to sudden starvation. Our mathematical analysis and computational results indicate that there are two distinctly different outcomes: Partial recovery to a new steady state, or full system failure. Moreover, the cell’s fate is dictated by the cognate bias, the number of cognate amino acids in the corresponding biosynthetic pathway relative to the average number of that amino acid in the proteome. We test these implications by analyzing the proteomes of over 1,800 sequenced microbes, which reveals statistically significant evidence of low cognate bias, a genetic trait that would avoid the biosynthetic quandary. Furthermore, these results suggest that the pattern of cognate bias, which is readily derived by genome sequencing, may provide evolutionary clues to an organism’s natural environment. PMID:25118252
Comparative Proteome Analysis of Brucella melitensis Vaccine Strain Rev 1 and a Virulent Strain, 16M
Eschenbrenner, Michel; Wagner, Mary Ann; Horn, Troy A.; Kraycer, Jo Ann; Mujer, Cesar V.; Hagius, Sue; Elzer, Philip; DelVecchio, Vito G.
2002-01-01
The genus Brucella consists of bacterial pathogens that cause brucellosis, a major zoonotic disease characterized by undulant fever and neurological disorders in humans. Among the different Brucella species, Brucella melitensis is considered the most virulent. Despite successful use in animals, the vaccine strains remain infectious for humans. To understand the mechanism of virulence in B. melitensis, the proteome of vaccine strain Rev 1 was analyzed by two-dimensional gel electrophoresis and compared to that of virulent strain 16M. The two strains were grown under identical laboratory conditions. Computer-assisted analysis of the two B. melitensis proteomes revealed proteins expressed in either 16M or Rev 1, as well as up- or down-regulation of proteins specific for each of these strains. These proteins were identified by peptide mass fingerprinting. It was found that certain metabolic pathways may be deregulated in Rev 1. Expression of an immunogenic 31-kDa outer membrane protein, proteins utilized for iron acquisition, and those that play a role in sugar binding, lipid degradation, and amino acid binding was altered in Rev 1. PMID:12193611
Eschenbrenner, Michel; Wagner, Mary Ann; Horn, Troy A; Kraycer, Jo Ann; Mujer, Cesar V; Hagius, Sue; Elzer, Philip; DelVecchio, Vito G
2002-09-01
The genus Brucella consists of bacterial pathogens that cause brucellosis, a major zoonotic disease characterized by undulant fever and neurological disorders in humans. Among the different Brucella species, Brucella melitensis is considered the most virulent. Despite successful use in animals, the vaccine strains remain infectious for humans. To understand the mechanism of virulence in B. melitensis, the proteome of vaccine strain Rev 1 was analyzed by two-dimensional gel electrophoresis and compared to that of virulent strain 16M. The two strains were grown under identical laboratory conditions. Computer-assisted analysis of the two B. melitensis proteomes revealed proteins expressed in either 16M or Rev 1, as well as up- or down-regulation of proteins specific for each of these strains. These proteins were identified by peptide mass fingerprinting. It was found that certain metabolic pathways may be deregulated in Rev 1. Expression of an immunogenic 31-kDa outer membrane protein, proteins utilized for iron acquisition, and those that play a role in sugar binding, lipid degradation, and amino acid binding was altered in Rev 1.
NASA Astrophysics Data System (ADS)
The, Matthew; MacCoss, Michael J.; Noble, William S.; Käll, Lukas
2016-11-01
Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator's processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method—grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein—in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license.
The, Matthew; MacCoss, Michael J; Noble, William S; Käll, Lukas
2016-11-01
Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator's processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method-grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein-in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license. Graphical Abstract ᅟ.
Proteomic and Biochemical Analyses of the Cotyledon and Root of Flooding-Stressed Soybean Plants
Komatsu, Setsuko; Makino, Takahiro; Yasue, Hiroshi
2013-01-01
Background Flooding significantly reduces the growth and grain yield of soybean plants. Proteomic and biochemical techniques were used to determine whether the function of cotyledon and root is altered in soybean under flooding stress. Results Two-day-old soybean plants were flooded for 2 days, after which the proteins from root and cotyledon were extracted for proteomic analysis. In response to flooding stress, the abundance of 73 and 28 proteins was significantly altered in the root and cotyledon, respectively. The accumulation of only one protein, 70 kDa heat shock protein (HSP70) (Glyma17g08020.1), increased in both organs following flooding. The ratio of protein abundance of HSP70 and biophoton emission in the cotyledon was higher than those detected in the root under flooding stress. Computed tomography and elemental analyses revealed that flooding stress decreases the number of calcium oxalate crystal the cotyledon, indicating calcium ion was elevated in the cotyledon under flooding stress. Conclusion These results suggest that calcium might play one role through HSP70 in the cotyledon under flooding stress. PMID:23799004
The Hemolymph Proteome of Fed and Starved Drosophila Larvae
Goetze, Sandra; Ahrens, Christian H.; Omasits, Ulrich; Marty, Florian; Simigdala, Nikiana; Meyer, Imke; Wollscheid, Bernd; Brunner, Erich; Hafen, Ernst; Lehner, Christian F.
2013-01-01
The co-operation of specialized organ systems in complex multicellular organisms depends on effective chemical communication. Thus, body fluids (like blood, lymph or intraspinal fluid) contain myriads of signaling mediators apart from metabolites. Moreover, these fluids are also of crucial importance for immune and wound responses. Compositional analyses of human body fluids are therefore of paramount diagnostic importance. Further improving their comprehensiveness should increase our understanding of inter-organ communication. In arthropods, which have trachea for gas exchange and an open circulatory system, the single dominating interstitial fluid is the hemolymph. Accordingly, a detailed analysis of hemolymph composition should provide an especially comprehensive picture of chemical communication and defense in animals. Therefore we used an extensive protein fractionation workflow in combination with a discovery-driven proteomic approach to map out the detectable protein composition of hemolymph isolated from Drosophila larvae. Combined mass spectrometric analysis revealed more than 700 proteins extending far beyond the previously known Drosophila hemolymph proteome. Moreover, by comparing hemolymph isolated from either fed or starved larvae, we provide initial provisional insights concerning compositional changes in response to nutritional state. Storage proteins in particular were observed to be strongly reduced by starvation. Our hemolymph proteome catalog provides a rich basis for data mining, as exemplified by our identification of potential novel cytokines, as well as for future quantitative analyses by targeted proteomics. PMID:23840627
Learning from Heterogeneous Data Sources: An Application in Spatial Proteomics
Breckels, Lisa M.; Holden, Sean B.; Wojnar, David; Mulvey, Claire M.; Christoforou, Andy; Groen, Arnoud; Trotter, Matthew W. B.; Kohlbacher, Oliver; Lilley, Kathryn S.; Gatto, Laurent
2016-01-01
Sub-cellular localisation of proteins is an essential post-translational regulatory mechanism that can be assayed using high-throughput mass spectrometry (MS). These MS-based spatial proteomics experiments enable us to pinpoint the sub-cellular distribution of thousands of proteins in a specific system under controlled conditions. Recent advances in high-throughput MS methods have yielded a plethora of experimental spatial proteomics data for the cell biology community. Yet, there are many third-party data sources, such as immunofluorescence microscopy or protein annotations and sequences, which represent a rich and vast source of complementary information. We present a unique transfer learning classification framework that utilises a nearest-neighbour or support vector machine system, to integrate heterogeneous data sources to considerably improve on the quantity and quality of sub-cellular protein assignment. We demonstrate the utility of our algorithms through evaluation of five experimental datasets, from four different species in conjunction with four different auxiliary data sources to classify proteins to tens of sub-cellular compartments with high generalisation accuracy. We further apply the method to an experiment on pluripotent mouse embryonic stem cells to classify a set of previously unknown proteins, and validate our findings against a recent high resolution map of the mouse stem cell proteome. The methodology is distributed as part of the open-source Bioconductor pRoloc suite for spatial proteomics data analysis. PMID:27175778
The hemolymph proteome of fed and starved Drosophila larvae.
Handke, Björn; Poernbacher, Ingrid; Goetze, Sandra; Ahrens, Christian H; Omasits, Ulrich; Marty, Florian; Simigdala, Nikiana; Meyer, Imke; Wollscheid, Bernd; Brunner, Erich; Hafen, Ernst; Lehner, Christian F
2013-01-01
The co-operation of specialized organ systems in complex multicellular organisms depends on effective chemical communication. Thus, body fluids (like blood, lymph or intraspinal fluid) contain myriads of signaling mediators apart from metabolites. Moreover, these fluids are also of crucial importance for immune and wound responses. Compositional analyses of human body fluids are therefore of paramount diagnostic importance. Further improving their comprehensiveness should increase our understanding of inter-organ communication. In arthropods, which have trachea for gas exchange and an open circulatory system, the single dominating interstitial fluid is the hemolymph. Accordingly, a detailed analysis of hemolymph composition should provide an especially comprehensive picture of chemical communication and defense in animals. Therefore we used an extensive protein fractionation workflow in combination with a discovery-driven proteomic approach to map out the detectable protein composition of hemolymph isolated from Drosophila larvae. Combined mass spectrometric analysis revealed more than 700 proteins extending far beyond the previously known Drosophila hemolymph proteome. Moreover, by comparing hemolymph isolated from either fed or starved larvae, we provide initial provisional insights concerning compositional changes in response to nutritional state. Storage proteins in particular were observed to be strongly reduced by starvation. Our hemolymph proteome catalog provides a rich basis for data mining, as exemplified by our identification of potential novel cytokines, as well as for future quantitative analyses by targeted proteomics.
An extensive library of surrogate peptides for all human proteins.
Mohammed, Yassene; Borchers, Christoph H
2015-11-03
Selecting the most appropriate surrogate peptides to represent a target protein is a major component of experimental design in Multiple Reaction Monitoring (MRM). Our software PeptidePicker with its v-score remains distinctive in its approach of integrating information about the proteins, their tryptic peptides, and the suitability of these peptides for MRM that is available online in UniProtKB, NCBI's dbSNP, ExPASy, PeptideAtlas, PRIDE, and GPMDB. The scoring algorithm reflects our "best knowledge" for selecting candidate peptides for MRM, based on the uniqueness of the peptide in the targeted proteome, its physiochemical properties, and whether it has previously been observed. Here we present an updated approach where we have already compiled a list of all possible surrogate peptides of the human proteome. Using our stringent selection criteria, the list includes 165k suitable MRM peptides covering 17k proteins of the human reviewed proteins in UniProtKB. Compared to average of 2-4min per protein for retrieving and integrating the information, the precompiled list includes all peptides available instantly. This allows a more cohesive and faster design of a multiplexed MRM experiment and provides insights into evidence for a protein's existence. We will keep this list up-to-date as proteomics data repositories continue to grow. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015 Elsevier B.V. All rights reserved.
HiQuant: Rapid Postquantification Analysis of Large-Scale MS-Generated Proteomics Data.
Bryan, Kenneth; Jarboui, Mohamed-Ali; Raso, Cinzia; Bernal-Llinares, Manuel; McCann, Brendan; Rauch, Jens; Boldt, Karsten; Lynn, David J
2016-06-03
Recent advances in mass-spectrometry-based proteomics are now facilitating ambitious large-scale investigations of the spatial and temporal dynamics of the proteome; however, the increasing size and complexity of these data sets is overwhelming current downstream computational methods, specifically those that support the postquantification analysis pipeline. Here we present HiQuant, a novel application that enables the design and execution of a postquantification workflow, including common data-processing steps, such as assay normalization and grouping, and experimental replicate quality control and statistical analysis. HiQuant also enables the interpretation of results generated from large-scale data sets by supporting interactive heatmap analysis and also the direct export to Cytoscape and Gephi, two leading network analysis platforms. HiQuant may be run via a user-friendly graphical interface and also supports complete one-touch automation via a command-line mode. We evaluate HiQuant's performance by analyzing a large-scale, complex interactome mapping data set and demonstrate a 200-fold improvement in the execution time over current methods. We also demonstrate HiQuant's general utility by analyzing proteome-wide quantification data generated from both a large-scale public tyrosine kinase siRNA knock-down study and an in-house investigation into the temporal dynamics of the KSR1 and KSR2 interactomes. Download HiQuant, sample data sets, and supporting documentation at http://hiquant.primesdb.eu .
Image analysis tools and emerging algorithms for expression proteomics
English, Jane A.; Lisacek, Frederique; Morris, Jeffrey S.; Yang, Guang-Zhong; Dunn, Michael J.
2012-01-01
Since their origins in academic endeavours in the 1970s, computational analysis tools have matured into a number of established commercial packages that underpin research in expression proteomics. In this paper we describe the image analysis pipeline for the established 2-D Gel Electrophoresis (2-DE) technique of protein separation, and by first covering signal analysis for Mass Spectrometry (MS), we also explain the current image analysis workflow for the emerging high-throughput ‘shotgun’ proteomics platform of Liquid Chromatography coupled to MS (LC/MS). The bioinformatics challenges for both methods are illustrated and compared, whilst existing commercial and academic packages and their workflows are described from both a user’s and a technical perspective. Attention is given to the importance of sound statistical treatment of the resultant quantifications in the search for differential expression. Despite wide availability of proteomics software, a number of challenges have yet to be overcome regarding algorithm accuracy, objectivity and automation, generally due to deterministic spot-centric approaches that discard information early in the pipeline, propagating errors. We review recent advances in signal and image analysis algorithms in 2-DE, MS, LC/MS and Imaging MS. Particular attention is given to wavelet techniques, automated image-based alignment and differential analysis in 2-DE, Bayesian peak mixture models and functional mixed modelling in MS, and group-wise consensus alignment methods for LC/MS. PMID:21046614
Systems biology approaches and tools for analysis of interactomes and multi-target drugs.
Schrattenholz, André; Groebe, Karlfried; Soskic, Vukic
2010-01-01
Systems biology is essentially a proteomic and epigenetic exercise because the relatively condensed information of genomes unfolds on the level of proteins. The flexibility of cellular architectures is not only mediated by a dazzling number of proteinaceous species but moreover by the kinetics of their molecular changes: The time scales of posttranslational modifications range from milliseconds to years. The genetic framework of an organism only provides the blue print of protein embodiments which are constantly shaped by external input. Indeed, posttranslational modifications of proteins represent the scope and velocity of these inputs and fulfil the requirements of integration of external spatiotemporal signal transduction inside an organism. The optimization of biochemical networks for this type of information processing and storage results in chemically extremely fine tuned molecular entities. The huge dynamic range of concentrations, the chemical diversity and the necessity of synchronisation of complex protein expression patterns pose the major challenge of systemic analysis of biological models. One further message is that many of the key reactions in living systems are essentially based on interactions of moderate affinities and moderate selectivities. This principle is responsible for the enormous flexibility and redundancy of cellular circuitries. In complex disorders such as cancer or neurodegenerative diseases, which initially appear to be rooted in relatively subtle dysfunctions of multimodal physiologic pathways, drug discovery programs based on the concept of high affinity/high specificity compounds ("one-target, one-disease"), which has been dominating the pharmaceutical industry for a long time, increasingly turn out to be unsuccessful. Despite improvements in rational drug design and high throughput screening methods, the number of novel, single-target drugs fell much behind expectations during the past decade, and the treatment of "complex diseases" remains a most pressing medical need. Currently, a change of paradigm can be observed with regard to a new interest in agents that modulate multiple targets simultaneously, essentially "dirty drugs." Targeting cellular function as a system rather than on the level of the single target, significantly increases the size of the drugable proteome and is expected to introduce novel classes of multi-target drugs with fewer adverse effects and toxicity. Multiple target approaches have recently been used to design medications against atherosclerosis, cancer, depression, psychosis and neurodegenerative diseases. A focussed approach towards "systemic" drugs will certainly require the development of novel computational and mathematical concepts for appropriate modelling of complex data. But the key is the extraction of relevant molecular information from biological systems by implementing rigid statistical procedures to differential proteomic analytics.
Proteomics and Metabolomics: Two Emerging Areas for Legume Improvement
Ramalingam, Abirami; Kudapa, Himabindu; Pazhamala, Lekha T.; Weckwerth, Wolfram; Varshney, Rajeev K.
2015-01-01
The crop legumes such as chickpea, common bean, cowpea, peanut, pigeonpea, soybean, etc. are important sources of nutrition and contribute to a significant amount of biological nitrogen fixation (>20 million tons of fixed nitrogen) in agriculture. However, the production of legumes is constrained due to abiotic and biotic stresses. It is therefore imperative to understand the molecular mechanisms of plant response to different stresses and identify key candidate genes regulating tolerance which can be deployed in breeding programs. The information obtained from transcriptomics has facilitated the identification of candidate genes for the given trait of interest and utilizing them in crop breeding programs to improve stress tolerance. However, the mechanisms of stress tolerance are complex due to the influence of multi-genes and post-transcriptional regulations. Furthermore, stress conditions greatly affect gene expression which in turn causes modifications in the composition of plant proteomes and metabolomes. Therefore, functional genomics involving various proteomics and metabolomics approaches have been obligatory for understanding plant stress tolerance. These approaches have also been found useful to unravel different pathways related to plant and seed development as well as symbiosis. Proteome and metabolome profiling using high-throughput based systems have been extensively applied in the model legume species, Medicago truncatula and Lotus japonicus, as well as in the model crop legume, soybean, to examine stress signaling pathways, cellular and developmental processes and nodule symbiosis. Moreover, the availability of protein reference maps as well as proteomics and metabolomics databases greatly support research and understanding of various biological processes in legumes. Protein-protein interaction techniques, particularly the yeast two-hybrid system have been advantageous for studying symbiosis and stress signaling in legumes. In this review, several studies on proteomics and metabolomics in model and crop legumes have been discussed. Additionally, applications of advanced proteomics and metabolomics approaches have also been included in this review for future applications in legume research. The integration of these “omics” approaches will greatly support the identification of accurate biomarkers in legume smart breeding programs. PMID:26734026
Advancing Cell Biology Through Proteomics in Space and Time (PROSPECTS)*
Lamond, Angus I.; Uhlen, Mathias; Horning, Stevan; Makarov, Alexander; Robinson, Carol V.; Serrano, Luis; Hartl, F. Ulrich; Baumeister, Wolfgang; Werenskiold, Anne Katrin; Andersen, Jens S.; Vorm, Ole; Linial, Michal; Aebersold, Ruedi; Mann, Matthias
2012-01-01
The term “proteomics” encompasses the large-scale detection and analysis of proteins and their post-translational modifications. Driven by major improvements in mass spectrometric instrumentation, methodology, and data analysis, the proteomics field has burgeoned in recent years. It now provides a range of sensitive and quantitative approaches for measuring protein structures and dynamics that promise to revolutionize our understanding of cell biology and molecular mechanisms in both human cells and model organisms. The Proteomics Specification in Time and Space (PROSPECTS) Network is a unique EU-funded project that brings together leading European research groups, spanning from instrumentation to biomedicine, in a collaborative five year initiative to develop new methods and applications for the functional analysis of cellular proteins. This special issue of Molecular and Cellular Proteomics presents 16 research papers reporting major recent progress by the PROSPECTS groups, including improvements to the resolution and sensitivity of the Orbitrap family of mass spectrometers, systematic detection of proteins using highly characterized antibody collections, and new methods for absolute as well as relative quantification of protein levels. Manuscripts in this issue exemplify approaches for performing quantitative measurements of cell proteomes and for studying their dynamic responses to perturbation, both during normal cellular responses and in disease mechanisms. Here we present a perspective on how the proteomics field is moving beyond simply identifying proteins with high sensitivity toward providing a powerful and versatile set of assay systems for characterizing proteome dynamics and thereby creating a new “third generation” proteomics strategy that offers an indispensible tool for cell biology and molecular medicine. PMID:22311636
ERIC Educational Resources Information Center
Magana, Alejandra J.; Taleyarkhan, Manaz; Alvarado, Daniela Rivera; Kane, Michael; Springer, John; Clase, Kari
2014-01-01
Bioinformatics education can be broadly defined as the teaching and learning of the use of computer and information technology, along with mathematical and statistical analysis for gathering, storing, analyzing, interpreting, and integrating data to solve biological problems. The recent surge of genomics, proteomics, and structural biology in the…
Joyce, Brendan; Lee, Danny; Rubio, Alex; Ogurtsov, Aleksey; Alves, Gelio; Yu, Yi-Kuo
2018-03-15
RAId is a software package that has been actively developed for the past 10 years for computationally and visually analyzing MS/MS data. Founded on rigorous statistical methods, RAId's core program computes accurate E-values for peptides and proteins identified during database searches. Making this robust tool readily accessible for the proteomics community by developing a graphical user interface (GUI) is our main goal here. We have constructed a graphical user interface to facilitate the use of RAId on users' local machines. Written in Java, RAId_GUI not only makes easy executions of RAId but also provides tools for data/spectra visualization, MS-product analysis, molecular isotopic distribution analysis, and graphing the retrieval versus the proportion of false discoveries. The results viewer displays and allows the users to download the analyses results. Both the knowledge-integrated organismal databases and the code package (containing source code, the graphical user interface, and a user manual) are available for download at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/raid.html .
Microfluidic-Mass Spectrometry Interfaces for Translational Proteomics.
Pedde, R Daniel; Li, Huiyan; Borchers, Christoph H; Akbari, Mohsen
2017-10-01
Interfacing mass spectrometry (MS) with microfluidic chips (μchip-MS) holds considerable potential to transform a clinician's toolbox, providing translatable methods for the early detection, diagnosis, monitoring, and treatment of noncommunicable diseases by streamlining and integrating laborious sample preparation workflows on high-throughput, user-friendly platforms. Overcoming the limitations of competitive immunoassays - currently the gold standard in clinical proteomics - μchip-MS can provide unprecedented access to complex proteomic assays having high sensitivity and specificity, but without the labor, costs, and complexities associated with conventional MS sample processing. This review surveys recent μchip-MS systems for clinical applications and examines their emerging role in streamlining the development and translation of MS-based proteomic assays by alleviating many of the challenges that currently inhibit widespread clinical adoption. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
Vascular Sap Proteomics: Providing Insight into Long-Distance Signaling during Stress
Carella, Philip; Wilson, Daniel C.; Kempthorne, Christine J.; Cameron, Robin K.
2016-01-01
The plant vascular system, composed of the xylem and phloem, is important for the transport of water, mineral nutrients, and photosynthate throughout the plant body. The vasculature is also the primary means by which developmental and stress signals move from one organ to another. Due to practical and technological limitations, proteomics analysis of xylem and phloem sap has been understudied in comparison to accessible sample types such as leaves and roots. However, recent advances in sample collection techniques and mass spectrometry technology are making it possible to comprehensively analyze vascular sap proteomes. In this mini-review, we discuss the emerging field of vascular sap proteomics, with a focus on recent comparative studies to identify vascular proteins that may play roles in long-distance signaling and other processes during stress responses in plants. PMID:27242852
Protein-Protein Interaction Network and Gene Ontology
NASA Astrophysics Data System (ADS)
Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah
Evolution of computer technologies makes it possible to access a large amount and various kinds of biological data via internet such as DNA sequences, proteomics data and information discovered about them. It is expected that the combination of various data could help researchers find further knowledge about them. Roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations can be combination of protein-protein interaction (PPI) data and Gene Ontology (GO) which could help enhance the analysis of PPI network. We introduce a simple but comprehensive visualization system that integrates GO and PPI data where GO and PPI graphs are visualized side-by-side and supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and GO directedacyclic- graph such as context-based browsing and common ancestors finding.
Droit, Arnaud; Hunter, Joanna M; Rouleau, Michèle; Ethier, Chantal; Picard-Cloutier, Aude; Bourgais, David; Poirier, Guy G
2007-01-01
Background In the "post-genome" era, mass spectrometry (MS) has become an important method for the analysis of proteins and the rapid advancement of this technique, in combination with other proteomics methods, results in an increasing amount of proteome data. This data must be archived and analysed using specialized bioinformatics tools. Description We herein describe "PARPs database," a data analysis and management pipeline for liquid chromatography tandem mass spectrometry (LC-MS/MS) proteomics. PARPs database is a web-based tool whose features include experiment annotation, protein database searching, protein sequence management, as well as data-mining of the peptides and proteins identified. Conclusion Using this pipeline, we have successfully identified several interactions of biological significance between PARP-1 and other proteins, namely RFC-1, 2, 3, 4 and 5. PMID:18093328
Standardization approaches in absolute quantitative proteomics with mass spectrometry.
Calderón-Celis, Francisco; Encinar, Jorge Ruiz; Sanz-Medel, Alfredo
2017-07-31
Mass spectrometry-based approaches have enabled important breakthroughs in quantitative proteomics in the last decades. This development is reflected in the better quantitative assessment of protein levels as well as to understand post-translational modifications and protein complexes and networks. Nowadays, the focus of quantitative proteomics shifted from the relative determination of proteins (ie, differential expression between two or more cellular states) to absolute quantity determination, required for a more-thorough characterization of biological models and comprehension of the proteome dynamism, as well as for the search and validation of novel protein biomarkers. However, the physico-chemical environment of the analyte species affects strongly the ionization efficiency in most mass spectrometry (MS) types, which thereby require the use of specially designed standardization approaches to provide absolute quantifications. Most common of such approaches nowadays include (i) the use of stable isotope-labeled peptide standards, isotopologues to the target proteotypic peptides expected after tryptic digestion of the target protein; (ii) use of stable isotope-labeled protein standards to compensate for sample preparation, sample loss, and proteolysis steps; (iii) isobaric reagents, which after fragmentation in the MS/MS analysis provide a final detectable mass shift, can be used to tag both analyte and standard samples; (iv) label-free approaches in which the absolute quantitative data are not obtained through the use of any kind of labeling, but from computational normalization of the raw data and adequate standards; (v) elemental mass spectrometry-based workflows able to provide directly absolute quantification of peptides/proteins that contain an ICP-detectable element. A critical insight from the Analytical Chemistry perspective of the different standardization approaches and their combinations used so far for absolute quantitative MS-based (molecular and elemental) proteomics is provided in this review. © 2017 Wiley Periodicals, Inc.
Be different--the diversity of peroxisomes in the animal kingdom.
Islinger, M; Cardoso, M J R; Schrader, M
2010-08-01
Peroxisomes represent so-called "multipurpose organelles" as they contribute to various anabolic as well as catabolic pathways. Thus, with respect to the physiological specialization of an individual organ or animal species, peroxisomes exhibit a functional diversity, which is documented by significant variations in their proteome. These differences are usually regarded as an adaptational response to the nutritional and environmental life conditions of a specific organism. Thus, human peroxisomes can be regarded as an in part physiologically unique organellar entity fulfilling metabolic functions that differ from our animal model systems. In line with this, a profound understanding on how peroxisomes acquired functional heterogeneity in terms of an evolutionary and mechanistic background is required. This review summarizes our current knowledge on the heterogeneity of peroxisomal physiology, providing insights into the genetic and cell biological mechanisms, which lead to the differential localization or expression of peroxisomal proteins and further gives an overview on peroxisomal biochemical pathways, which are specialized in different animal species and organs. Moreover, it addresses the impact of proteome studies on our understanding of differential peroxisome function describing the utility of mass spectrometry and computer-assisted algorithms to identify peroxisomal target sequences for the detection of new organ- or species-specific peroxisomal proteins. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Quantifying ubiquitin signaling.
Ordureau, Alban; Münch, Christian; Harper, J Wade
2015-05-21
Ubiquitin (UB)-driven signaling systems permeate biology, and are often integrated with other types of post-translational modifications (PTMs), including phosphorylation. Flux through such pathways is dictated by the fractional stoichiometry of distinct modifications and protein assemblies as well as the spatial organization of pathway components. Yet, we rarely understand the dynamics and stoichiometry of rate-limiting intermediates along a reaction trajectory. Here, we review how quantitative proteomic tools and enrichment strategies are being used to quantify UB-dependent signaling systems, and to integrate UB signaling with regulatory phosphorylation events, illustrated with the PINK1/PARKIN pathway. A key feature of ubiquitylation is that the identity of UB chain linkage types can control downstream processes. We also describe how proteomic and enzymological tools can be used to identify and quantify UB chain synthesis and linkage preferences. The emergence of sophisticated quantitative proteomic approaches will set a new standard for elucidating biochemical mechanisms of UB-driven signaling systems. Copyright © 2015 Elsevier Inc. All rights reserved.
Gadher, Suresh Jivan; Kovarova, Hana
2017-02-05
The Central and Eastern European Proteomic Conference (CEEPC), has reached a special milestone as it celebrates its 10th anniversary. Today, an expansive network of proteomics in Central and Eastern Europe stands established to facilitate scientific interactions and collaborations in and around Central and Eastern Europe, as well as with international research institutions worldwide. Currently, when many conferences are struggling to attract participants, CEEPC is thriving in its status and stature as well as expanding by attracting newer member countries. CEEPC's success is driven by mutual respect between scientists sharing interest in proteomics and its applications in multidisciplinary research areas related to biological systems. This effort when interwoven with exciting ambience steeped with culture, and tradition is also a reason why participants enjoy it. CEEPC's careful balance between excellence and cohesion holds the key to its success. It is evident that CEEPC is ready for the next decade of excitement and expectations of multifaceted proteomics in Central and Eastern Europe. Additionally, in the era of emerging personalized medicine where treatment selection for each patient is becoming individualized, CEEPC and proteomics is expected to play a significant role moving forward for the benefit of mankind. Copyright © 2016 Elsevier B.V. All rights reserved.
Proteomics wants cRacker: automated standardized data analysis of LC-MS derived proteomic data.
Zauber, Henrik; Schulze, Waltraud X
2012-11-02
The large-scale analysis of thousands of proteins under various experimental conditions or in mutant lines has gained more and more importance in hypothesis-driven scientific research and systems biology in the past years. Quantitative analysis by large scale proteomics using modern mass spectrometry usually results in long lists of peptide ion intensities. The main interest for most researchers, however, is to draw conclusions on the protein level. Postprocessing and combining peptide intensities of a proteomic data set requires expert knowledge, and the often repetitive and standardized manual calculations can be time-consuming. The analysis of complex samples can result in very large data sets (lists with several 1000s to 100,000 entries of different peptides) that cannot easily be analyzed using standard spreadsheet programs. To improve speed and consistency of the data analysis of LC-MS derived proteomic data, we developed cRacker. cRacker is an R-based program for automated downstream proteomic data analysis including data normalization strategies for metabolic labeling and label free quantitation. In addition, cRacker includes basic statistical analysis, such as clustering of data, or ANOVA and t tests for comparison between treatments. Results are presented in editable graphic formats and in list files.
Clinical proteomic analysis of scrub typhus infection.
Park, Edmond Changkyun; Lee, Sang-Yeop; Yun, Sung Ho; Choi, Chi-Won; Lee, Hayoung; Song, Hyun Seok; Jun, Sangmi; Kim, Gun-Hwa; Lee, Chang-Seop; Kim, Seung Il
2018-01-01
Scrub typhus is an acute and febrile infectious disease caused by the Gram-negative α-proteobacterium Orientia tsutsugamushi from the family Rickettsiaceae that is widely distributed in Northern, Southern and Eastern Asia. In the present study, we analysed the serum proteome of scrub typhus patients to investigate specific clinical protein patterns in an attempt to explain pathophysiology and discover potential biomarkers of infection. Serum samples were collected from three patients (before and after treatment with antibiotics) and three healthy subjects. One-dimensional sodium dodecyl sulphate-polyacrylamide gel electrophoresis followed by liquid chromatography-tandem mass spectrometry was performed to identify differentially abundant proteins using quantitative proteomic approaches. Bioinformatic analysis was then performed using Ingenuity Pathway Analysis. Proteomic analysis identified 236 serum proteins, of which 32 were differentially expressed in normal subjects, naive scrub typhus patients and patients treated with antibiotics. Comparative bioinformatic analysis of the identified proteins revealed up-regulation of proteins involved in immune responses, especially complement system, following infection with O. tsutsugamushi , and normal expression was largely rescued by antibiotic treatment. This is the first proteomic study of clinical serum samples from scrub typhus patients. Proteomic analysis identified changes in protein expression upon infection with O. tsutsugamushi and following antibiotic treatment. Our results provide valuable information for further investigation of scrub typhus therapy and diagnosis.
Zlatic, Stephanie A; Vrailas-Mortimer, Alysia; Gokhale, Avanti; Carey, Lucas J; Scott, Elizabeth; Burch, Reid; McCall, Morgan M; Rudin-Rush, Samantha; Davis, John Bowen; Hartwig, Cortnie; Werner, Erica; Li, Lian; Petris, Michael; Faundez, Victor
2018-03-28
Rare neurological diseases shed light onto universal neurobiological processes. However, molecular mechanisms connecting genetic defects to their disease phenotypes are elusive. Here, we obtain mechanistic information by comparing proteomes of cells from individuals with rare disorders with proteomes from their disease-free consanguineous relatives. We use triple-SILAC mass spectrometry to quantify proteomes from human pedigrees affected by mutations in ATP7A, which cause Menkes disease, a rare neurodegenerative and neurodevelopmental disorder stemming from systemic copper depletion. We identified 214 proteins whose expression was altered in ATP7A -/y fibroblasts. Bioinformatic analysis of ATP7A-mutant proteomes identified known phenotypes and processes affected in rare genetic diseases causing copper dyshomeostasis, including altered mitochondrial function. We found connections between copper dyshomeostasis and the UCHL1/PARK5 pathway of Parkinson disease, which we validated with mitochondrial respiration and Drosophila genetics assays. We propose that our genealogical "omics" strategy can be broadly applied to identify mechanisms linking a genomic locus to its phenotypes. Copyright © 2018 Elsevier Inc. All rights reserved.
The speciation of the proteome
Jungblut, Peter R; Holzhütter, Hermann G; Apweiler, Rolf; Schlüter, Hartmut
2008-01-01
Introduction In proteomics a paradox situation developed in the last years. At one side it is basic knowledge that proteins are post-translationally modified and occur in different isoforms. At the other side the protein expression concept disclaims post-translational modifications by connecting protein names directly with function. Discussion Optimal proteome coverage is today reached by bottom-up liquid chromatography/mass spectrometry. But quantification at the peptide level in shotgun or bottom-up approaches by liquid chromatography and mass spectrometry is completely ignoring that a special peptide may exist in an unmodified form and in several-fold modified forms. The acceptance of the protein species concept is a basic prerequisite for meaningful quantitative analyses in functional proteomics. In discovery approaches only top-down analyses, separating the protein species before digestion, identification and quantification by two-dimensional gel electrophoresis or protein liquid chromatography, allow the correlation between changes of a biological situation and function. Conclusion To obtain biological relevant information kinetics and systems biology have to be performed at the protein species level, which is the major challenge in proteomics today. PMID:18638390
Hong, Haifa; Ye, Lincai; Chen, Huiwen; Xia, Yu; Liu, Yue; Liu, Jinfen; Lu, Yanan; Zhang, Haibo
2015-08-01
We aimed to evaluate global changes in protein expression associated with patency by undertaking proteomic analysis of human constricted and patent ductus arteriosus (DA). Ten constricted and 10 patent human DAs were excised from infants with ductal-dependent heart disease during surgery. Using isobaric tags for relative and absolute quantitation-based quantitative proteomics, 132 differentially expressed proteins were identified. Of 132 proteins, voltage-gated sodium channel 1.3 (SCN3A), myosin 1d (Myo1d), Rho GTPase activating protein 26 (ARHGAP26), and retinitis pigmentosa 1 (RP1) were selected for validation by Western blot and quantitative real-time polymerase chain reaction analyses. Significant upregulation of SCN3A, Myo1d, and RP1 messenger RNA, and protein levels was observed in the patent DA group (all P ≤ 0.048). ARHGAP26 messenger RNA and protein levels were decreased in patent DA tissue (both P ≤ 0.018). Immunohistochemistry analysis revealed that Myo1d, ARHGAP26, and RP1 were specifically expressed in the subendothelial region of constricted DAs; however, diffuse expression of these proteins was noted in the patent group. Proteomic analysis revealed global changes in the expression of proteins that regulate oxygen sensing, ion channels, smooth muscle cell migration, nervous system, immune system, and metabolism, suggesting a basis for the systemic regulation of DA patency by diverse signaling pathways, which will be confirmed in further studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Matzke, Melissa M.; Datta, Susmita
As the capability of mass spectrometry-based proteomics has matured, tens of thousands of peptides can be measured simultaneously, which has the benefit of offering a systems view of protein expression. However, a major challenge is that with an increase in throughput, protein quantification estimation from the native measured peptides has become a computational task. A limitation to existing computationally-driven protein quantification methods is that most ignore protein variation, such as alternate splicing of the RNA transcript and post-translational modifications or other possible proteoforms, which will affect a significant fraction of the proteome. The consequence of this assumption is that statisticalmore » inference at the protein level, and consequently downstream analyses, such as network and pathway modeling, have only limited power for biomarker discovery. Here, we describe a Bayesian model (BP-Quant) that uses statistically derived peptides signatures to identify peptides that are outside the dominant pattern, or the existence of multiple over-expressed patterns to improve relative protein abundance estimates. It is a research-driven approach that utilizes the objectives of the experiment, defined in the context of a standard statistical hypothesis, to identify a set of peptides exhibiting similar statistical behavior relating to a protein. This approach infers that changes in relative protein abundance can be used as a surrogate for changes in function, without necessarily taking into account the effect of differential post-translational modifications, processing, or splicing in altering protein function. We verify the approach using a dilution study from mouse plasma samples and demonstrate that BP-Quant achieves similar accuracy as the current state-of-the-art methods at proteoform identification with significantly better specificity. BP-Quant is available as a MatLab ® and R packages at https://github.com/PNNL-Comp-Mass-Spec/BP-Quant.« less
Standardized protocols for quality control of MRM-based plasma proteomic workflows.
Percy, Andrew J; Chambers, Andrew G; Smith, Derek S; Borchers, Christoph H
2013-01-04
Mass spectrometry (MS)-based proteomics is rapidly emerging as a viable technology for the identification and quantitation of biological samples, such as human plasma--the most complex yet commonly employed biofluid in clinical analyses. The transition from a qualitative to quantitative science is required if proteomics is going to successfully make the transition to a clinically useful technique. MS, however, has been criticized for a lack of reproducibility and interlaboratory transferability. Currently, the MS and plasma proteomics communities lack standardized protocols and reagents to ensure that high-quality quantitative data can be accurately and precisely reproduced by laboratories across the world using different MS technologies. Toward addressing this issue, we have developed standard protocols for multiple reaction monitoring (MRM)-based assays with customized isotopically labeled internal standards for quality control of the sample preparation workflow and the MS platform in quantitative plasma proteomic analyses. The development of reference standards and their application to a single MS platform is discussed herein, along with the results from intralaboratory tests. The tests highlighted the importance of the reference standards in assessing the efficiency and reproducibility of the entire bottom-up proteomic workflow and revealed errors related to the sample preparation and performance quality and deficits of the MS and LC systems. Such evaluations are necessary if MRM-based quantitative plasma proteomics is to be used in verifying and validating putative disease biomarkers across different research laboratories and eventually in clinical laboratories.
Krüger, Thomas; Luo, Ting; Schmidt, Hella; Shopova, Iordana; Kniemeyer, Olaf
2015-12-14
Opportunistic human pathogenic fungi including the saprotrophic mold Aspergillus fumigatus and the human commensal Candida albicans can cause severe fungal infections in immunocompromised or critically ill patients. The first line of defense against opportunistic fungal pathogens is the innate immune system. Phagocytes such as macrophages, neutrophils and dendritic cells are an important pillar of the innate immune response and have evolved versatile defense strategies against microbial pathogens. On the other hand, human-pathogenic fungi have sophisticated virulence strategies to counteract the innate immune defense. In this context, proteomic approaches can provide deeper insights into the molecular mechanisms of the interaction of host immune cells with fungal pathogens. This is crucial for the identification of both diagnostic biomarkers for fungal infections and therapeutic targets. Studying host-fungal interactions at the protein level is a challenging endeavor, yet there are few studies that have been undertaken. This review draws attention to proteomic techniques and their application to fungal pathogens and to challenges, difficulties, and limitations that may arise in the course of simultaneous dual proteome analysis of host immune cells interacting with diverse morphotypes of fungal pathogens. On this basis, we discuss strategies to overcome these multifaceted experimental and analytical challenges including the viability of immune cells during co-cultivation, the increased and heterogeneous protein complexity of the host proteome dynamically interacting with the fungal proteome, and the demands on normalization strategies in terms of relative quantitative proteome analysis.
Trauma-associated Human Neutrophil Alterations Revealed by Comparative Proteomics Profiling
Zhou, Jian-Ying; Krovvidi, Ravi K.; Gao, Yuqian; Gao, Hong; Petritis, Brianne O.; De, Asit; Miller-Graziano, Carol; Bankey, Paul E.; Petyuk, Vladislav A.; Nicora, Carrie D.; Clauss, Therese R; Moore, Ronald J.; Shi, Tujin; Brown, Joseph N.; Kaushal, Amit; Xiao, Wenzhong; Davis, Ronald W.; Maier, Ronald V.; Tompkins, Ronald G.; Qian, Wei-Jun; Camp, David G.; Smith, Richard D.
2013-01-01
PURPOSE Polymorphonuclear neutrophils (PMNs) play an important role in mediating the innate immune response after severe traumatic injury; however, the cellular proteome response to traumatic condition is still largely unknown. EXPERIMENTAL DESIGN We applied 2D-LC-MS/MS based shotgun proteomics to perform comparative proteome profiling of human PMNs from severe trauma patients and healthy controls. RESULTS A total of 197 out of ~2500 proteins (being identified with at least two peptides) were observed with significant abundance changes following the injury. The proteomics data were further compared with transcriptomics data for the same genes obtained from an independent patient cohort. The comparison showed that the protein abundance changes for the majority of proteins were consistent with the mRNA abundance changes in terms of directions of changes. Moreover, increased protein secretion was suggested as one of the mechanisms contributing to the observed discrepancy between protein and mRNA abundance changes. Functional analyses of the altered proteins showed that many of these proteins were involved in immune response, protein biosynthesis, protein transport, NRF2-mediated oxidative stress response, the ubiquitin-proteasome system, and apoptosis pathways. CONCLUSIONS AND CLINICAL RELEVANCE Our data suggest increased neutrophil activation and inhibited neutrophil apoptosis in response to trauma. The study not only reveals an overall picture of functional neutrophil response to trauma at the proteome level, but also provides a rich proteomics data resource of trauma-associated changes in the neutrophil that will be valuable for further studies of the functions of individual proteins in PMNs. PMID:23589343
Gilany, Kambiz; Minai-Tehrani, Arash; Savadi-Shiraz, Elham; Rezadoost, Hassan; Lakpour, Niknam
2015-01-01
The human seminal fluid is a complex body fluid. It is not known how many proteins are expressed in the seminal plasma; however in analog with the blood it is possible up to 10,000 proteins are expressed in the seminal plasma. The human seminal fluid is a rich source of potential biomarkers for male infertility and reproduction disorder. In this review, the ongoing list of proteins identified from the human seminal fluid was collected. To date, 4188 redundant proteins of the seminal fluid are identified using different proteomics technology, including 2-DE, SDS-PAGE-LC-MS/MS, MudPIT. However, this was reduced to a database of 2168 non-redundant protein using UniProtKB/Swiss-Prot reviewed database. The core concept of proteome were analyzed including pI, MW, Amino Acids, Chromosome and PTM distribution in the human seminal plasma proteome. Additionally, the biological process, molecular function and KEGG pathway were investigated using DAVID software. Finally, the biomarker identified in different male reproductive system disorder was investigated using proteomics platforms so far. In this study, an attempt was made to update the human seminal plasma proteome database. Our finding showed that human seminal plasma studies used to date seem to have converged on a set of proteins that are repeatedly identified in many studies and that represent only a small fraction of the entire human seminal plasma proteome.
Use of proteomic methods in the analysis of human body fluids in Alzheimer research.
Zürbig, Petra; Jahn, Holger
2012-12-01
Proteomics is the study of the entire population of proteins and peptides in an organism or a part of it, such as a cell, tissue, or fluids like cerebrospinal fluid, plasma, serum, urine, or saliva. It is widely assumed that changes in the composition of the proteome may reflect disease states and provide clues to its origin, eventually leading to targets for new treatments. The ability to perform large-scale proteomic studies now is based jointly on recent advances in our analytical methods. Separation techniques like CE and 2DE have developed and matured. Detection methods like MS have also improved greatly in the last 5 years. These developments have also driven the fields of bioinformatics, needed to deal with the increased data production and systems biology. All these developing methods offer specific advantages but also come with certain limitations. This review describes the different proteomic methods used in the field, their limitations, and their possible pitfalls. Based on a literature search in PubMed, we identified 112 studies that applied proteomic techniques to identify biomarkers for Alzheimer disease. This review describes the results of these studies on proteome changes in human body fluids of Alzheimer patients reviewing the most important studies. We extracted a list of 366 proteins and peptides that were identified by these studies as potential targets in Alzheimer research. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Plant fluid proteomics: Delving into the xylem sap, phloem sap and apoplastic fluid proteomes.
Rodríguez-Celma, Jorge; Ceballos-Laita, Laura; Grusak, Michael A; Abadía, Javier; López-Millán, Ana-Flor
2016-08-01
The phloem sap, xylem sap and apoplastic fluid play key roles in long and short distance transport of signals and nutrients, and act as a barrier against local and systemic pathogen infection. Among other components, these plant fluids contain proteins which are likely to be important players in their functionalities. However, detailed information about their proteomes is only starting to arise due to the difficulties inherent to the collection methods. This review compiles the proteomic information available to date in these three plant fluids, and compares the proteomes obtained in different plant species in order to shed light into conserved functions in each plant fluid. Inter-species comparisons indicate that all these fluids contain the protein machinery for self-maintenance and defense, including proteins related to cell wall metabolism, pathogen defense, proteolysis, and redox response. These analyses also revealed that proteins may play more relevant roles in signaling in the phloem sap and apoplastic fluid than in the xylem sap. A comparison of the proteomes of the three fluids indicates that although functional categories are somewhat similar, proteins involved are likely to be fluid-specific, except for a small group of proteins present in the three fluids, which may have a universal role, especially in cell wall maintenance and defense. This article is part of a Special Issue entitled: Plant Proteomics--a bridge between fundamental processes and crop production, edited by Dr. Hans-Peter Mock. Copyright © 2016 Elsevier B.V. All rights reserved.
Oulas, Anastasis; Minadakis, George; Zachariou, Margarita; Sokratous, Kleitos; Bourdakou, Marilena M; Spyrou, George M
2017-11-27
Systems Bioinformatics is a relatively new approach, which lies in the intersection of systems biology and classical bioinformatics. It focuses on integrating information across different levels using a bottom-up approach as in systems biology with a data-driven top-down approach as in bioinformatics. The advent of omics technologies has provided the stepping-stone for the emergence of Systems Bioinformatics. These technologies provide a spectrum of information ranging from genomics, transcriptomics and proteomics to epigenomics, pharmacogenomics, metagenomics and metabolomics. Systems Bioinformatics is the framework in which systems approaches are applied to such data, setting the level of resolution as well as the boundary of the system of interest and studying the emerging properties of the system as a whole rather than the sum of the properties derived from the system's individual components. A key approach in Systems Bioinformatics is the construction of multiple networks representing each level of the omics spectrum and their integration in a layered network that exchanges information within and between layers. Here, we provide evidence on how Systems Bioinformatics enhances computational therapeutics and diagnostics, hence paving the way to precision medicine. The aim of this review is to familiarize the reader with the emerging field of Systems Bioinformatics and to provide a comprehensive overview of its current state-of-the-art methods and technologies. Moreover, we provide examples of success stories and case studies that utilize such methods and tools to significantly advance research in the fields of systems biology and systems medicine. © The Author 2017. Published by Oxford University Press.
Proteomics with Mass Spectrometry Imaging: Beyond Amyloid Typing.
Lavatelli, Francesca; Merlini, Giampaolo
2018-04-01
Detection and typing of amyloid deposits in tissues are two crucial steps in the management of systemic amyloidoses. The presence of amyloid deposits is routinely evaluated through Congo red staining, whereas proteomics is now a mainstay in the identification of the deposited proteins. In article number 1700236, Winter et al. [Proteomics 2017, 17, Issue 22] describe a novel method based on MALDI-MS imaging coupled to ion mobility separation and peptide filtering, to detect the presence of amyloid in histology samples and to identify its composition, while preserving the spatial distribution of proteins in tissues. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Advances in microscale separations towards nanoproteomics applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yi, Lian; Piehowski, Paul D.; Shi, Tujin
Microscale separations (e.g., liquid chromatography or capillary electrophoresis) coupled with mass spectrometry (MS) has become the primary tool for advanced proteomics, an indispensable technology for gaining understanding of complex biological processes. While significant advances have been achieved in MS-based proteomics, the current platforms still face a significant challenge in overall sensitivity towards nanoproteomics (i.e., with less than 1 g total amount of proteins available) applications such as cellular heterogeneity in tissue pathologies. Herein, we review recent advances in microscale separation techniques and integrated sample processing systems that improve the overall sensitivity and coverage of the proteomics workflow, and their contributionsmore » towards nanoproteomics applications.« less
A Unique Model Platform for C4 Plant Systems and Synthetic Biology
2015-12-10
International Conference in Bioinformatics , Sydney, Australia, July 31 - August 2, 2014. Nielsen LK (2015) Genome scale metabolic and regulatory...the comparison of transcriptome proteome and central metabolome in mature and immature tissue. Preliminary data were obtained suggesting successful...guide the comparison of transcriptome, proteome and central metabolome in mature and immature tissue. Preliminary data were obtained suggesting
Large Scale Proteomic Data and Network-Based Systems Biology Approaches to Explore the Plant World.
Di Silvestre, Dario; Bergamaschi, Andrea; Bellini, Edoardo; Mauri, PierLuigi
2018-06-03
The investigation of plant organisms by means of data-derived systems biology approaches based on network modeling is mainly characterized by genomic data, while the potential of proteomics is largely unexplored. This delay is mainly caused by the paucity of plant genomic/proteomic sequences and annotations which are fundamental to perform mass-spectrometry (MS) data interpretation. However, Next Generation Sequencing (NGS) techniques are contributing to filling this gap and an increasing number of studies are focusing on plant proteome profiling and protein-protein interactions (PPIs) identification. Interesting results were obtained by evaluating the topology of PPI networks in the context of organ-associated biological processes as well as plant-pathogen relationships. These examples foreshadow well the benefits that these approaches may provide to plant research. Thus, in addition to providing an overview of the main-omic technologies recently used on plant organisms, we will focus on studies that rely on concepts of module, hub and shortest path, and how they can contribute to the plant discovery processes. In this scenario, we will also consider gene co-expression networks, and some examples of integration with metabolomic data and genome-wide association studies (GWAS) to select candidate genes will be mentioned.
Proteomic Analyses of the Effects of Drugs of Abuse on Monocyte-Derived Mature Dendritic Cells
Reynolds, Jessica L.; Mahajan, Supriya D.; Aalinkeel, Ravikunar; Nair, B.; Sykes, Donald E.; Schwartz, Stanley A.
2010-01-01
Drug abuse has become a global health concern. Understanding how drug abuse modulates the immune system and how the immune system responds to pathogens associated with drug abuse, such hepatitis C virus (HCV) and human immunodeficiency virus (HIV-1), can be assessed by an integrated approach comparing proteomic analyses and quantitation of gene expression. Two-dimensional (2D) difference gel electrophoresis was used to determine the molecular mechanisms underlying the proteomic changes that alter normal biological processes when monocyte-derived mature dendritic cells were treated with cocaine or methamphetamine. Both drugs differentially regulated the expression of several functional classes of proteins including those that modulate apoptosis, protein folding, protein kinase activity, and metabolism and proteins that function as intracellular signal transduction molecules. Proteomic data were validated using a combination of quantitative, real-time PCR and Western blot analyses. These studies will help to identify the molecular mechanisms, including the expression of several functionally important classes of proteins that have emerged as potential mediators of pathogenesis. These proteins may predispose immunocompetent cells, including dendritic cells, to infection with viruses such as HCV and HIV-1, which are associated with drug abuse. PMID:19811410
García-Santamarina, Sarela; Boronat, Susanna; Calvo, Isabel A.; Rodríguez-Gabriel, Miguel; Ayté, José; Molina, Henrik
2013-01-01
Abstract Cysteine oxidation mediates oxidative stress toxicity and signaling. It has been long proposed that the thioredoxin (Trx) system, which consists of Trx and thioredoxin reductase (Trr), is not only involved in recycling classical Trx substrates, such as ribonucleotide reductase, but it also regulates general cytoplasmic thiol homeostasis. To investigate such a role, we have performed a proteome-wide analysis of cells expressing or not the two components of the Trx system. We have compared the reversibly oxidized thiol proteomes of wild-type Schizosaccharomyces pombe cells with mutants lacking Trx or Trr. Specific Trx substrates are reversibly-oxidized in both strain backgrounds; however, in the absence of Trr, Trx can weakly recycle its substrates at the expense of an alternative electron donor. A massive thiol oxidation occurs only in cells lacking Trr, with 30% of all cysteine-containing peptides being reversibly oxidized; this oxidized cysteine proteome depends on the presence of Trxs. Our observations lead to the hypothesis that, in the absence of its reductase, the natural electron donor Trx becomes a powerful oxidant and triggers general thiol oxidation. Antioxid. Redox Signal. 18, 1549–1556. PMID:23121505
Defining the extracellular matrix using proteomics
Byron, Adam; Humphries, Jonathan D; Humphries, Martin J
2013-01-01
The cell microenvironment has a profound influence on the behaviour, growth and survival of cells. The extracellular matrix (ECM) provides not only mechanical and structural support to cells and tissues but also binds soluble ligands and transmembrane receptors to provide spatial coordination of signalling processes. The ability of cells to sense the chemical, mechanical and topographical features of the ECM enables them to integrate complex, multiparametric information into a coherent response to the surrounding microenvironment. Consequently, dysregulation or mutation of ECM components results in a broad range of pathological conditions. Characterization of the composition of ECM derived from various cells has begun to reveal insights into ECM structure and function, and mechanisms of disease. Proteomic methodologies permit the global analysis of subcellular systems, but extracellular and transmembrane proteins present analytical difficulties to proteomic strategies owing to the particular biochemical properties of these molecules. Here, we review advances in proteomic approaches that have been applied to furthering our understanding of the ECM microenvironment. We survey recent studies that have addressed challenges in the analysis of ECM and discuss major outcomes in the context of health and disease. In addition, we summarize efforts to progress towards a systems-level understanding of ECM biology. PMID:23419153
Global profiling of lysine reactivity and ligandability in the human proteome
NASA Astrophysics Data System (ADS)
Hacker, Stephan M.; Backus, Keriann M.; Lazear, Michael R.; Forli, Stefano; Correia, Bruno E.; Cravatt, Benjamin F.
2017-12-01
Nucleophilic amino acids make important contributions to protein function, including performing key roles in catalysis and serving as sites for post-translational modification. Electrophilic groups that target amino-acid nucleophiles have been used to create covalent ligands and drugs, but have, so far, been mainly limited to cysteine and serine. Here, we report a chemical proteomic platform for the global and quantitative analysis of lysine residues in native biological systems. We have quantified, in total, more than 9,000 lysines in human cell proteomes and have identified several hundred residues with heightened reactivity that are enriched at protein functional sites and can frequently be targeted by electrophilic small molecules. We have also discovered lysine-reactive fragment electrophiles that inhibit enzymes by active site and allosteric mechanisms, as well as disrupt protein-protein interactions in transcriptional regulatory complexes, emphasizing the broad potential and diverse functional consequences of liganding lysine residues throughout the human proteome.
From the genome sequence to the protein inventory of Bacillus subtilis.
Becher, Dörte; Büttner, Knut; Moche, Martin; Hessling, Bernd; Hecker, Michael
2011-08-01
Owing to the low number of proteins necessary to render a bacterial cell viable, bacteria are extremely attractive model systems to understand how the genome sequence is translated into actual life processes. One of the most intensively investigated model organisms is Bacillus subtilis. It has attracted world-wide research interest, addressing cell differentiation and adaptation on a molecular scale as well as biotechnological production processes. Meanwhile, we are looking back on more than 25 years of B. subtilis proteomics. A wide range of methods have been developed during this period for the large-scale qualitative and quantitative proteome analysis. Currently, it is possible to identify and quantify more than 50% of the predicted proteome in different cellular subfractions. In this review, we summarize the development of B. subtilis proteomics during the past 25 years. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Global profiling of lysine reactivity and ligandability in the human proteome.
Hacker, Stephan M; Backus, Keriann M; Lazear, Michael R; Forli, Stefano; Correia, Bruno E; Cravatt, Benjamin F
2017-12-01
Nucleophilic amino acids make important contributions to protein function, including performing key roles in catalysis and serving as sites for post-translational modification. Electrophilic groups that target amino-acid nucleophiles have been used to create covalent ligands and drugs, but have, so far, been mainly limited to cysteine and serine. Here, we report a chemical proteomic platform for the global and quantitative analysis of lysine residues in native biological systems. We have quantified, in total, more than 9,000 lysines in human cell proteomes and have identified several hundred residues with heightened reactivity that are enriched at protein functional sites and can frequently be targeted by electrophilic small molecules. We have also discovered lysine-reactive fragment electrophiles that inhibit enzymes by active site and allosteric mechanisms, as well as disrupt protein-protein interactions in transcriptional regulatory complexes, emphasizing the broad potential and diverse functional consequences of liganding lysine residues throughout the human proteome.
Genome and proteome annotation: organization, interpretation and integration
Reeves, Gabrielle A.; Talavera, David; Thornton, Janet M.
2008-01-01
Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations. PMID:19019817
Kim, Eun Young; Lee, Min Young; Kim, Se Hyun; Ha, Kyooseob; Kim, Kwang Pyo; Ahn, Yong Min
2017-06-02
Major depressive disorder (MDD) is a systemic and multifactorial disorder that involves abnormalities in multiple biochemical pathways and the autonomic nervous system. This study applied a machine-learning method to classify MDD and control groups by incorporating data from serum proteomic analysis and heart rate variability (HRV) analysis for the identification of novel peripheral biomarkers. The study subjects consisted of 25 drug-free female MDD patients and 25 age- and sex-matched healthy controls. First, quantitative serum proteome profiles were analyzed by liquid chromatography-tandem mass spectrometry using pooled serum samples from 10 patients and 10 controls. Next, candidate proteins were quantified with multiple reaction monitoring (MRM) in 50 subjects. We also analyzed 22 linear and nonlinear HRV parameters in 50 subjects. Finally, we identified a combined biomarker panel consisting of proteins and HRV indexes using a support vector machine with recursive feature elimination. A separation between MDD and control groups was achieved using five parameters (apolipoprotein B, group-specific component, ceruloplasmin, RMSSD, and SampEn) at 80.1% classification accuracy. A combination of HRV and proteomic data achieved better classification accuracy. A high classification accuracy can be achieved by combining multimodal information from heart rate dynamics and serum proteomics in MDD. Our approach can be helpful for accurate clinical diagnosis of MDD. Further studies using larger, independent cohorts are needed to verify the role of these candidate biomarkers for MDD diagnosis. Copyright © 2017 Elsevier Inc. All rights reserved.
Stickel, Stefanie; Gomes, Nathan; Su, Tin Tin
2014-01-01
In this review, we will summarize the data from different model systems that illustrate the need for proteome-wide analyses of the biological consequences of ionizing radiation (IR). IR remains one of three main therapy choices for oncology, the others being surgery and chemotherapy. Understanding how cells and tissues respond to IR is essential for improving therapeutic regimes against cancer. Numerous studies demonstrating the changes in the transcriptome following exposure to IR, in diverse systems, can be found in the scientific literature. However, the limitation of our knowledge is illustrated by the fact that the number of transcripts that change after IR exposure is approximately an order of magnitude lower than the number of transcripts that re-localize to or from ribosomes under similar conditions. Furthermore, changes in the post-translational modifications of proteins (phosphorylation, acetylation as well as degradation) are profoundly important for the cellular response to IR. These considerations make proteomics a highly suitable tool for mechanistic studies of the effect of IR. Strikingly such studies remain outnumbered by those utilizing proteomics for diagnostic purposes such as the identification of biomarkers for the outcome of radiation therapy. Here we will discuss the role of the ribosome and translational regulation in the survival and preservation of cells and tissues after exposure to ionizing radiation. In doing so we hope to provide a strong incentive for the study of proteome-wide changes following IR exposure. PMID:26269784
Yang, Ming-Yu; Chiang, Yuan-Cheng; Huang, Yu-Ting; Chen, Chien-Chang; Wang, Feng-Sheng; Wang, Ching-Jen; Kuo, Yur-Ren
2014-01-01
Previous studies have demonstrated that extracorporeal shock wave therapy has a significant positive effect on accelerating diabetic wound healing. However, the systemic effect after therapy is still unclear. This study investigated the plasma protein expression in the extracorporeal shock wave therapy group and diabetic controls using proteomic study. A dorsal skin defect (6 × 5 cm) in a streptozotocin-induced diabetic Wistar rat model was used. Diabetic rats receiving either no therapy or extracorporeal shock wave therapy after wounding were analyzed. The spots of interest were subjected to in-gel trypsin digestion and matrix-assisted laser desorption ionization time-of-flight mass spectrometry to elucidate the peptide mass fingerprints. The mass spectrometric characteristics of the identified proteins, including their theoretical isoelectric points, molecular weights, sequence coverage, and Mascot score, were analyzed. Protein expression was validated using immunohistochemical analysis of topical periwounding tissues. The proteomic study revealed that at days 3 and 10 after therapy rats had significantly higher abundance of haptoglobin and significantly lower levels of the vitamin D-binding protein precursor as compared with the diabetic controls. Immunohistochemical staining of topical periwounding tissue also revealed significant upregulation of haptoglobin and downregulation of vitamin D-binding protein expression in the extracorporeal shock wave therapy group, which was consistent with the systemic proteome study. Proteome analyses demonstrated an upregulation of haptoglobin and a downregulation of vitamin D-binding protein in extracorporeal shock wave therapy-enhanced diabetic wound healing.
Proteomics and bioinformatics strategies to design countermeasures against infectious threat agents.
Khan, Akbar S; Mujer, Cesar V; Alefantis, Timothy G; Connolly, Joseph P; Mayr, Ulrike Beate; Walcher, Petra; Lubitz, Werner; Delvecchio, Vito G
2006-01-01
The potential devastation resulting from an intentional outbreak caused by biological warfare agents such as Brucella abortus and Bacillus anthracis underscores the need for next generation vaccines. Proteomics, genomics, and systems biology approaches coupled with the bacterial ghost (BG) vaccine delivery strategy offer an ideal approach for developing safer, cost-effective, and efficacious vaccines for human use in a relatively rapid time frame. Critical to any subunit vaccine development strategy is the identification of a pathogen's proteins with the greatest potential of eliciting a protective immune response. These proteins are collectively referred to as the pathogen's immunome. Proteomics provides high-resolution identification of these immunogenic proteins using standard proteomic technologies, Western blots probed with antisera from infected patients, and the pathogen's sequenced and annotated genome. Selected immunoreactive proteins can be then cloned and expressed in nonpathogenic Gram-negative bacteria. Subsequently, a temperature shift or chemical induction process is initiated to induce expression of the PhiX174 E-lysis gene, whose protein product forms an E tunnel between the inner and outer membrane of the bacteria, expelling all intracellular contents. The BG vaccine system is a proven strategy developed for many different pathogens and tested in a complete array of animal models. The BG vaccine system also has great potential for producing multiagent vaccines for protection to multiple species in a single formulation.
Proteomic effects of wet cupping (Al-hijamah).
Almaiman, Amer A
2018-01-01
Wet cupping (Al-hijamah) is a therapeutic technique practiced worldwide as a part of the Unani system of medicine. It involves bloodletting from acupoints on a patient's skin to produce a therapeutic outcome. A thorough review of research articles on wet cupping with relevance to proteomics field that are indexed by Google Scholar, PubMed, and/or Science Direct databases was performed. Eight original research articles were summarized in this paper. Overall, wet cupping did not have a significant effect on C-reactive protein, Hsp-27, sister chromatid exchanges, and cell replication index. In contrast, wet cupping was found to produce higher oxygen saturation, eliminate lactate from subcutaneous tissues, remove blood containing higher levels of malondialdehyde and nitric oxide, and produce higher activity of myeloperoxidase. The proteomic effects of wet cupping therapy have not been adequately investigated. Thus, future studies on wet cupping that use systemic and sound protocols to avoid bias should be conducted.
Proteomic effects of wet cupping (Al-hijamah)
Almaiman, Amer A.
2018-01-01
Wet cupping (Al-hijamah) is a therapeutic technique practiced worldwide as a part of the Unani system of medicine. It involves bloodletting from acupoints on a patient’s skin to produce a therapeutic outcome. A thorough review of research articles on wet cupping with relevance to proteomics field that are indexed by Google Scholar, PubMed, and/or Science Direct databases was performed. Eight original research articles were summarized in this paper. Overall, wet cupping did not have a significant effect on C-reactive protein, Hsp-27, sister chromatid exchanges, and cell replication index. In contrast, wet cupping was found to produce higher oxygen saturation, eliminate lactate from subcutaneous tissues, remove blood containing higher levels of malondialdehyde and nitric oxide, and produce higher activity of myeloperoxidase. The proteomic effects of wet cupping therapy have not been adequately investigated. Thus, future studies on wet cupping that use systemic and sound protocols to avoid bias should be conducted. PMID:29332103
Year 2 Report: Protein Function Prediction Platform
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, C E
2012-04-27
Upon completion of our second year of development in a 3-year development cycle, we have completed a prototype protein structure-function annotation and function prediction system: Protein Function Prediction (PFP) platform (v.0.5). We have met our milestones for Years 1 and 2 and are positioned to continue development in completion of our original statement of work, or a reasonable modification thereof, in service to DTRA Programs involved in diagnostics and medical countermeasures research and development. The PFP platform is a multi-scale computational modeling system for protein structure-function annotation and function prediction. As of this writing, PFP is the only existing fullymore » automated, high-throughput, multi-scale modeling, whole-proteome annotation platform, and represents a significant advance in the field of genome annotation (Fig. 1). PFP modules perform protein functional annotations at the sequence, systems biology, protein structure, and atomistic levels of biological complexity (Fig. 2). Because these approaches provide orthogonal means of characterizing proteins and suggesting protein function, PFP processing maximizes the protein functional information that can currently be gained by computational means. Comprehensive annotation of pathogen genomes is essential for bio-defense applications in pathogen characterization, threat assessment, and medical countermeasure design and development in that it can short-cut the time and effort required to select and characterize protein biomarkers.« less
The impact of network medicine in gastroenterology and hepatology.
Baffy, György
2013-10-01
In the footsteps of groundbreaking achievements made by biomedical research, another scientific revolution is unfolding. Systems biology draws from the chaos and complexity theory and applies computational models to predict emerging behavior of the interactions between genes, gene products, and environmental factors. Adaptation of systems biology to translational and clinical sciences has been termed network medicine, and is likely to change the way we think about preventing, predicting, diagnosing, and treating complex human diseases. Network medicine finds gene-disease associations by analyzing the unparalleled digital information discovered and created by high-throughput technologies (dubbed as "omics" science) and links genetic variance to clinical disease phenotypes through intermediate organizational levels of life such as the epigenome, transcriptome, proteome, and metabolome. Supported by large reference databases, unprecedented data storage capacity, and innovative computational analysis, network medicine is poised to find links between conditions that were thought to be distinct, uncover shared disease mechanisms and key drivers of the pathogenesis, predict individual disease outcomes and trajectories, identify novel therapeutic applications, and help avoid off-target and undesirable drug effects. Recent advances indicate that these perspectives are increasingly within our reach for understanding and managing complex diseases of the digestive system. Copyright © 2013 AGA Institute. Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Costa, Pedro M.; Fadeel, Bengt, E-mail: Bengt.Fade
Engineered nanomaterials are being developed for a variety of technological applications. However, the increasing use of nanomaterials in society has led to concerns about their potential adverse effects on human health and the environment. During the first decade of nanotoxicological research, the realization has emerged that effective risk assessment of the multitudes of new nanomaterials would benefit from a comprehensive understanding of their toxicological mechanisms, which is difficult to achieve with traditional, low-throughput, single end-point oriented approaches. Therefore, systems biology approaches are being progressively applied within the nano(eco)toxicological sciences. This novel paradigm implies that the study of biological systems shouldmore » be integrative resulting in quantitative and predictive models of nanomaterial behaviour in a biological system. To this end, global ‘omics’ approaches with which to assess changes in genes, proteins, metabolites, etc. are deployed allowing for computational modelling of the biological effects of nanomaterials. Here, we highlight omics and systems biology studies in nanotoxicology, aiming towards the implementation of a systems nanotoxicology and mechanism-based risk assessment of nanomaterials. - Highlights: • Systems nanotoxicology is a multi-disciplinary approach to quantitative modelling. • Transcriptomics, proteomics and metabolomics remain the most common methods. • Global “omics” techniques should be coupled to computational modelling approaches. • The discovery of nano-specific toxicity pathways and biomarkers is a prioritized goal. • Overall, experimental nanosafety research must endeavour reproducibility and relevance.« less
Shen, Xiaomeng; Hu, Qiang; Li, Jun; Wang, Jianmin; Qu, Jun
2015-10-02
Comprehensive and accurate evaluation of data quality and false-positive biomarker discovery is critical to direct the method development/optimization for quantitative proteomics, which nonetheless remains challenging largely due to the high complexity and unique features of proteomic data. Here we describe an experimental null (EN) method to address this need. Because the method experimentally measures the null distribution (either technical or biological replicates) using the same proteomic samples, the same procedures and the same batch as the case-vs-contol experiment, it correctly reflects the collective effects of technical variability (e.g., variation/bias in sample preparation, LC-MS analysis, and data processing) and project-specific features (e.g., characteristics of the proteome and biological variation) on the performances of quantitative analysis. To show a proof of concept, we employed the EN method to assess the quantitative accuracy and precision and the ability to quantify subtle ratio changes between groups using different experimental and data-processing approaches and in various cellular and tissue proteomes. It was found that choices of quantitative features, sample size, experimental design, data-processing strategies, and quality of chromatographic separation can profoundly affect quantitative precision and accuracy of label-free quantification. The EN method was also demonstrated as a practical tool to determine the optimal experimental parameters and rational ratio cutoff for reliable protein quantification in specific proteomic experiments, for example, to identify the necessary number of technical/biological replicates per group that affords sufficient power for discovery. Furthermore, we assessed the ability of EN method to estimate levels of false-positives in the discovery of altered proteins, using two concocted sample sets mimicking proteomic profiling using technical and biological replicates, respectively, where the true-positives/negatives are known and span a wide concentration range. It was observed that the EN method correctly reflects the null distribution in a proteomic system and accurately measures false altered proteins discovery rate (FADR). In summary, the EN method provides a straightforward, practical, and accurate alternative to statistics-based approaches for the development and evaluation of proteomic experiments and can be universally adapted to various types of quantitative techniques.
Efficient Multicriteria Protein Structure Comparison on Modern Processor Architectures
Manolakos, Elias S.
2015-01-01
Fast increasing computational demand for all-to-all protein structures comparison (PSC) is a result of three confounding factors: rapidly expanding structural proteomics databases, high computational complexity of pairwise protein comparison algorithms, and the trend in the domain towards using multiple criteria for protein structures comparison (MCPSC) and combining results. We have developed a software framework that exploits many-core and multicore CPUs to implement efficient parallel MCPSC in modern processors based on three popular PSC methods, namely, TMalign, CE, and USM. We evaluate and compare the performance and efficiency of the two parallel MCPSC implementations using Intel's experimental many-core Single-Chip Cloud Computer (SCC) as well as Intel's Core i7 multicore processor. We show that the 48-core SCC is more efficient than the latest generation Core i7, achieving a speedup factor of 42 (efficiency of 0.9), making many-core processors an exciting emerging technology for large-scale structural proteomics. We compare and contrast the performance of the two processors on several datasets and also show that MCPSC outperforms its component methods in grouping related domains, achieving a high F-measure of 0.91 on the benchmark CK34 dataset. The software implementation for protein structure comparison using the three methods and combined MCPSC, along with the developed underlying rckskel algorithmic skeletons library, is available via GitHub. PMID:26605332
Meher, Prabina K.; Sahu, Tanmaya K.; Gahoi, Shachi; Rao, Atmakuri R.
2018-01-01
Heat shock proteins (HSPs) play a pivotal role in cell growth and variability. Since conventional approaches are expensive and voluminous protein sequence information is available in the post-genomic era, development of an automated and accurate computational tool is highly desirable for prediction of HSPs, their families and sub-types. Thus, we propose a computational approach for reliable prediction of all these components in a single framework and with higher accuracy as well. The proposed approach achieved an overall accuracy of ~84% in predicting HSPs, ~97% in predicting six different families of HSPs, and ~94% in predicting four types of DnaJ proteins, with bench mark datasets. The developed approach also achieved higher accuracy as compared to most of the existing approaches. For easy prediction of HSPs by experimental scientists, a user friendly web server ir-HSP is made freely accessible at http://cabgrid.res.in:8080/ir-hsp. The ir-HSP was further evaluated for proteome-wide identification of HSPs by using proteome datasets of eight different species, and ~50% of the predicted HSPs in each species were found to be annotated with InterPro HSP families/domains. Thus, the developed computational method is expected to supplement the currently available approaches for prediction of HSPs, to the extent of their families and sub-types. PMID:29379521
Efficient Multicriteria Protein Structure Comparison on Modern Processor Architectures.
Sharma, Anuj; Manolakos, Elias S
2015-01-01
Fast increasing computational demand for all-to-all protein structures comparison (PSC) is a result of three confounding factors: rapidly expanding structural proteomics databases, high computational complexity of pairwise protein comparison algorithms, and the trend in the domain towards using multiple criteria for protein structures comparison (MCPSC) and combining results. We have developed a software framework that exploits many-core and multicore CPUs to implement efficient parallel MCPSC in modern processors based on three popular PSC methods, namely, TMalign, CE, and USM. We evaluate and compare the performance and efficiency of the two parallel MCPSC implementations using Intel's experimental many-core Single-Chip Cloud Computer (SCC) as well as Intel's Core i7 multicore processor. We show that the 48-core SCC is more efficient than the latest generation Core i7, achieving a speedup factor of 42 (efficiency of 0.9), making many-core processors an exciting emerging technology for large-scale structural proteomics. We compare and contrast the performance of the two processors on several datasets and also show that MCPSC outperforms its component methods in grouping related domains, achieving a high F-measure of 0.91 on the benchmark CK34 dataset. The software implementation for protein structure comparison using the three methods and combined MCPSC, along with the developed underlying rckskel algorithmic skeletons library, is available via GitHub.
Cell-free protein synthesis: applications in proteomics and biotechnology.
He, Mingyue
2008-01-01
Protein production is one of the key steps in biotechnology and functional proteomics. Expression of proteins in heterologous hosts (such as in E. coli) is generally lengthy and costly. Cell-free protein synthesis is thus emerging as an attractive alternative. In addition to the simplicity and speed for protein production, cell-free expression allows generation of functional proteins that are difficult to produce by in vivo systems. Recent exploitation of cell-free systems enables novel development of technologies for rapid discovery of proteins with desirable properties from very large libraries. This article reviews the recent development in cell-free systems and their application in the large scale protein analysis.
McGorum, Bruce C; Pirie, R Scott; Eaton, Samantha L; Keen, John A; Cumyn, Elizabeth M; Arnott, Danielle M; Chen, Wenzhang; Lamont, Douglas J; Graham, Laura C; Llavero Hurtado, Maica; Pemberton, Alan; Wishart, Thomas M
2015-11-01
Equine grass sickness (EGS) is an acute, predominantly fatal, multiple system neuropathy of grazing horses with reported incidence rates of ∼2%. An apparently identical disease occurs in multiple species, including but not limited to cats, dogs, and rabbits. Although the precise etiology remains unclear, ultrastructural findings have suggested that the primary lesion lies in the glycoprotein biosynthetic pathway of specific neuronal populations. The goal of this study was therefore to identify the molecular processes underpinning neurodegeneration in EGS. Here, we use a bottom-up approach beginning with the application of modern proteomic tools to the analysis of cranial (superior) cervical ganglion (CCG, a consistently affected tissue) from EGS-affected patients and appropriate control cases postmortem. In what appears to be the proteomic application of modern proteomic tools to equine neuronal tissues and/or to an inherent neurodegenerative disease of large animals (not a model of human disease), we identified 2,311 proteins in CCG extracts, with 320 proteins increased and 186 decreased by greater than 20% relative to controls. Further examination of selected proteomic candidates by quantitative fluorescent Western blotting (QFWB) and subcellular expression profiling by immunohistochemistry highlighted a previously unreported dysregulation in proteins commonly associated with protein misfolding/aggregation responses seen in a myriad of human neurodegenerative conditions, including but not limited to amyloid precursor protein (APP), microtubule associated protein (Tau), and multiple components of the ubiquitin proteasome system (UPS). Differentially expressed proteins eligible for in silico pathway analysis clustered predominantly into the following biofunctions: (1) diseases and disorders, including; neurological disease and skeletal and muscular disorders and (2) molecular and cellular functions, including cellular assembly and organization, cell-to-cell signaling and interaction (including epinephrine, dopamine, and adrenergic signaling and receptor function), and small molecule biochemistry. Interestingly, while the biofunctions identified in this study may represent pathways underpinning EGS-induced neurodegeneration, this is also the first demonstration of potential molecular conservation (including previously unreported dysregulation of the UPS and APP) spanning the degenerative cascades from an apparently unrelated condition of large animals, to small animal models with altered neuronal vulnerability, and human neurological conditions. Importantly, this study highlights the feasibility and benefits of applying modern proteomic techniques to veterinary investigations of neurodegenerative processes in diseases of large animals. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
2015-09-01
glioblastoma . We have successfully established several patient-derived cell lines from glioblastoma tumors and further established a number of...and single-cell technologies. Although the focus of this research is glioblastoma , the proposed tools are generally applicable to all cancer-based...studies. 15. SUBJECT TERMS Human cohorts, Glioblastoma , Genomic, Proteomic, Single-cell technologies, Hypothesis-driven, integrative systems approach
ERIC Educational Resources Information Center
Kim, Thomas D.; Craig, Paul A.
2010-01-01
Two-dimensional gel electrophoresis (2DGE) remains an important tool in the study of biological systems by proteomics. While the use of 2DGE is commonplace in research publications, there are few instructional laboratories that address the use of 2DGE for analyzing complex protein samples. One reason for this lack is the fact that the preparation…
Proteome changes in platelets after pathogen inactivation--an interlaboratory consensus.
Prudent, Michel; D'Alessandro, Angelo; Cazenave, Jean-Pierre; Devine, Dana V; Gachet, Christian; Greinacher, Andreas; Lion, Niels; Schubert, Peter; Steil, Leif; Thiele, Thomas; Tissot, Jean-Daniel; Völker, Uwe; Zolla, Lello
2014-04-01
Pathogen inactivation (PI) of platelet concentrates (PCs) reduces the proliferation/replication of a large range of bacteria, viruses, and parasites as well as residual leucocytes. Pathogen-inactivated PCs were evaluated in various clinical trials showing their efficacy and safety. Today, there is some debate over the hemostatic activity of treated PCs as the overall survival of PI platelets seems to be somewhat reduced, and in vitro measurements have identified some alterations in platelet function. Although the specific lesions resulting from PI of PCs are still not fully understood, proteomic studies have revealed potential damages at the protein level. This review merges the key findings of the proteomic analyses of PCs treated by the Mirasol Pathogen Reduction Technology, the Intercept Blood System, and the Theraflex UV-C system, respectively, and discusses the potential impact on the biological functions of platelets. The complementarities of the applied proteomic approaches allow the coverage of a wide range of proteins and provide a comprehensive overview of PI-mediated protein damage. It emerges that there is a relatively weak impact of PI on the overall proteome of platelets. However, some data show that the different PI treatments lead to an acceleration of platelet storage lesions, which is in agreement with the current model of platelet storage lesion in pathogen-inactivated PCs. Overall, the impact of the PI treatment on the proteome appears to be different among the PI systems. Mirasol impacts adhesion and platelet shape change, whereas Intercept seems to impact proteins of intracellular platelet activation pathways. Theraflex influences platelet shape change and aggregation, but the data reported to date are limited. This information provides the basis to understand the impact of different PI on the molecular mechanisms of platelet function. Moreover, these data may serve as basis for future developments of PI technologies for PCs. Further studies should address the impact of both the PI and the storage duration on platelets in PCs because PI may enable the extension of the shelf life of PCs by reducing the bacterial contamination risk. Copyright © 2014 Elsevier Inc. All rights reserved.
Network-Based Approaches in Drug Discovery and Early Development
Harrold, JM; Ramanathan, M; Mager, DE
2015-01-01
Identification of novel targets is a critical first step in the drug discovery and development process. Most diseases such as cancer, metabolic disorders, and neurological disorders are complex, and their pathogenesis involves multiple genetic and environmental factors. Finding a viable drug target–drug combination with high potential for yielding clinical success within the efficacy–toxicity spectrum is extremely challenging. Many examples are now available in which network-based approaches show potential for the identification of novel targets and for the repositioning of established targets. The objective of this article is to highlight network approaches for identifying novel targets with greater chances of gaining approved drugs with maximal efficacy and minimal side effects. Further enhancement of these approaches may emerge from effectively integrating computational systems biology with pharmacodynamic systems analysis. Coupling genomics, proteomics, and metabolomics databases with systems pharmacology modeling may aid in the development of disease-specific networks that can be further used to build confidence in target identification. PMID:24025802
The Human Skeletal Muscle Proteome Project: a reappraisal of the current literature
Gonzalez‐Freire, Marta; Semba, Richard D.; Ubaida‐Mohien, Ceereena; Fabbri, Elisa; Scalzo, Paul; Højlund, Kurt; Dufresne, Craig; Lyashkov, Alexey
2016-01-01
Abstract Skeletal muscle is a large organ that accounts for up to half the total mass of the human body. A progressive decline in muscle mass and strength occurs with ageing and in some individuals configures the syndrome of ‘sarcopenia’, a condition that impairs mobility, challenges autonomy, and is a risk factor for mortality. The mechanisms leading to sarcopenia as well as myopathies are still little understood. The Human Skeletal Muscle Proteome Project was initiated with the aim to characterize muscle proteins and how they change with ageing and disease. We conducted an extensive review of the literature and analysed publically available protein databases. A systematic search of peer‐reviewed studies was performed using PubMed. Search terms included ‘human’, ‘skeletal muscle’, ‘proteome’, ‘proteomic(s)’, and ‘mass spectrometry’, ‘liquid chromatography‐mass spectrometry (LC‐MS/MS)’. A catalogue of 5431 non‐redundant muscle proteins identified by mass spectrometry‐based proteomics from 38 peer‐reviewed scientific publications from 2002 to November 2015 was created. We also developed a nosology system for the classification of muscle proteins based on localization and function. Such inventory of proteins should serve as a useful background reference for future research on changes in muscle proteome assessed by quantitative mass spectrometry‐based proteomic approaches that occur with ageing and diseases. This classification and compilation of the human skeletal muscle proteome can be used for the identification and quantification of proteins in skeletal muscle to discover new mechanisms for sarcopenia and specific muscle diseases that can be targeted for the prevention and treatment. PMID:27897395
Bio-TDS: bioscience query tool discovery system.
Gnimpieba, Etienne Z; VanDiermen, Menno S; Gustafson, Shayla M; Conn, Bill; Lushbough, Carol M
2017-01-04
Bioinformatics and computational biology play a critical role in bioscience and biomedical research. As researchers design their experimental projects, one major challenge is to find the most relevant bioinformatics toolkits that will lead to new knowledge discovery from their data. The Bio-TDS (Bioscience Query Tool Discovery Systems, http://biotds.org/) has been developed to assist researchers in retrieving the most applicable analytic tools by allowing them to formulate their questions as free text. The Bio-TDS is a flexible retrieval system that affords users from multiple bioscience domains (e.g. genomic, proteomic, bio-imaging) the ability to query over 12 000 analytic tool descriptions integrated from well-established, community repositories. One of the primary components of the Bio-TDS is the ontology and natural language processing workflow for annotation, curation, query processing, and evaluation. The Bio-TDS's scientific impact was evaluated using sample questions posed by researchers retrieved from Biostars, a site focusing on BIOLOGICAL DATA ANALYSIS: The Bio-TDS was compared to five similar bioscience analytic tool retrieval systems with the Bio-TDS outperforming the others in terms of relevance and completeness. The Bio-TDS offers researchers the capacity to associate their bioscience question with the most relevant computational toolsets required for the data analysis in their knowledge discovery process. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Jing, Li; Amster, I Jonathan
2009-10-15
Offline high performance liquid chromatography combined with matrix assisted laser desorption and Fourier transform ion cyclotron resonance mass spectrometry (HPLC-MALDI-FTICR/MS) provides the means to rapidly analyze complex mixtures of peptides, such as those produced by proteolytic digestion of a proteome. This method is particularly useful for making quantitative measurements of changes in protein expression by using (15)N-metabolic labeling. Proteolytic digestion of combined labeled and unlabeled proteomes produces complex mixtures that with many mass overlaps when analyzed by HPLC-MALDI-FTICR/MS. A significant challenge to data analysis is the matching of pairs of peaks which represent an unlabeled peptide and its labeled counterpart. We have developed an algorithm and incorporated it into a compute program which significantly accelerates the interpretation of (15)N metabolic labeling data by automating the process of identifying unlabeled/labeled peak pairs. The algorithm takes advantage of the high resolution and mass accuracy of FTICR mass spectrometry. The algorithm is shown to be able to successfully identify the (15)N/(14)N peptide pairs and calculate peptide relative abundance ratios in highly complex mixtures from the proteolytic digest of a whole organism protein extract.
The Impact of the Glomerular Filtration Rate on the Human Plasma Proteome.
Christensson, Anders; Ash, Jessica A; DeLisle, Robert K; Gaspar, Fraser W; Ostroff, Rachel; Grubb, Anders; Lindström, Veronica; Bruun, Laila; Williams, Steve A
2018-05-01
The application of proteomics in chronic kidney disease (CKD) can potentially uncover biomarkers and pathways that are predictive of disease. Within this context, this study examines the relationship between the human plasma proteome and glomerular filtration rate (GFR) as measured by iohexol clearance in a cohort from Sweden (n = 389; GFR range: 8-100 mL min -1 /1.73 m 2 ). A total of 2893 proteins are quantified using a modified aptamer assay. A large proportion of the proteome is associated with GFR, reinforcing the concept that CKD affects multiple physiological systems (individual protein-GFR correlations listed here). Of these, cystatin C shows the most significant correlation with GFR (rho = -0.85, p = 1.2 × 10 -97 ), establishing strong validation for the use of this biomarker in CKD diagnostics. Among the other highly significant protein markers are insulin-like growth factor-binding protein 6, neuroblastoma suppressor of tumorigenicity 1, follistatin-related protein 3, trefoil factor 3, and beta-2 microglobulin. These proteins may indicate an imbalance in homeostasis across a variety of cellular processes, which may be underlying renal dysfunction. Overall, this study represents the most extensive characterization of the plasma proteome and its relation to GFR to date, and suggests the diagnostic and prognostic value of proteomics for CKD across all stages. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Advances in targeted proteomics and applications to biomedical research
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shi, Tujin; Song, Ehwang; Nie, Song
Targeted proteomics technique has emerged as a powerful protein quantification tool in systems biology, biomedical research, and increasing for clinical applications. The most widely used targeted proteomics approach, selected reaction monitoring (SRM), also known as multiple reaction monitoring (MRM), can be used for quantification of cellular signaling networks and preclinical verification of candidate protein biomarkers. As an extension to our previous review on advances in SRM sensitivity (Shi et al., Proteomics, 12, 1074–1092, 2012) herein we review recent advances in the method and technology for further enhancing SRM sensitivity (from 2012 to present), and highlighting its broad biomedical applications inmore » human bodily fluids, tissue and cell lines. Furthermore, we also review two recently introduced targeted proteomics approaches, parallel reaction monitoring (PRM) and data-independent acquisition (DIA) with targeted data extraction on fast scanning high-resolution accurate-mass (HR/AM) instruments. Such HR/AM targeted quantification with monitoring all target product ions addresses SRM limitations effectively in specificity and multiplexing; whereas when compared to SRM, PRM and DIA are still in the infancy with a limited number of applications. Thus, for HR/AM targeted quantification we focus our discussion on method development, data processing and analysis, and its advantages and limitations in targeted proteomics. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale quantification of hundreds of target proteins are discussed.« less
Lereim, Ragnhild Reehorst; Oveland, Eystein; Xiao, Yichuan; Torkildsen, Øivind; Wergeland, Stig; Myhr, Kjell-Morten; Sun, Shao-Cong; Berven, Frode S
2016-09-01
The ubiquitin ligase Peli1 has previously been suggested as a potential treatment target in multiple sclerosis. In the multiple sclerosis disease model, experimental autoimmune encephalomyelitis, Peli1 knock-out led to less activated microglia and less inflammation in the central nervous system. Despite being important in microglia, Peli1 expression has also been detected in glial and neuronal cells. In the present study the overall brain proteomes of Peli1 knock-out mice and wild-type mice were compared prior to experimental autoimmune encephalomyelitis induction, at onset of the disease and at disease peak. Brain samples from the frontal hemisphere, peripheral from the extensive inflammatory foci, were analyzed using TMT-labeling of sample pools, and the discovered proteins were verified in individual mice using label-free proteomics. The greatest proteomic differences between Peli1 knock-out and wild-type mice were observed at the disease peak. In Peli1 knock-out a higher degree of antigen presentation, increased activity of adaptive and innate immune cells and alterations to proteins involved in iron metabolism were observed during experimental autoimmune encephalomyelitis. These results unravel global effects to the brain proteome when abrogating Peli1 expression, underlining the importance of Peli1 as a regulator of the immune response also peripheral to inflammatory foci during experimental autoimmune encephalomyelitis. The proteomics data is available in PRIDE with accession PXD003710.
Clement, Cristina C.; Aphkhazava, David; Nieves, Edward; Callaway, Myrasol; Olszewski, Waldemar; Rotzschke, Olaf; Santambrogio, Laura
2013-01-01
In this study a proteomic approach was used to define the protein content of matched samples of afferent prenodal lymph and plasma derived from healthy volunteers. The analysis was performed using two analytical methodologies coupled with nanoliquid chromatography-tandem mass spectrometry: one-dimensional gel electrophoresis (1DEF nanoLC Orbitrap–ESI–MS/MS), and two-dimensional fluorescence difference-in-gel electrophoresis (2D-DIGE nanoLC–ESI–MS/MS). The 253 significantly identified proteins (p<0.05), obtained from the tandem mass spectrometry data, were further analyzed with pathway analysis (IPA) to define the functional signature of prenodal lymph and matched plasma. The 1DEF coupled with nanoLC–MS–MS revealed that the common proteome between the two biological fluids (144 out of 253 proteins) was dominated by complement activation and blood coagulation components, transporters and protease inhibitors. The enriched proteome of human lymph (72 proteins) consisted of products derived from the extracellular matrix, apoptosis and cellular catabolism. In contrast, the enriched proteome of human plasma (37 proteins) consisted of soluble molecules of the coagulation system and cell–cell signaling factors. The functional networks associated with both common and source-distinctive proteomes highlight the principal biological activity of these immunologically relevant body fluids. PMID:23202415
Genomics, transcriptomics and proteomics to elucidate the pathogenesis of rheumatoid arthritis.
Song, Xinqiang; Lin, Qingsong
2017-08-01
Rheumatoid arthritis is an autoimmune disease that affects several organs and tissues, predominantly the synovial joints. The pathogenesis of this disease is not completely understood, which maybe involved in the genomic variations, gene expression, protein translation and post-translational modifications. These system variations in genomics, transcriptomics and proteomics are dynamic in nature and their crosstalk is overwhelmingly complex, thus analyzing them separately may not be very informative. However, various '-omics' techniques developed in recent years have opened up new possibilities for clarifying disease pathways and thereby facilitating early diagnosis and specific therapies. This review examines how recent advances in the fields of genomics, transcriptomics and proteomics have contributed to our understanding of rheumatoid arthritis.
Chen, Jin-Qiu; Wakefield, Lalage M; Goldstein, David J
2015-06-06
There is an emerging demand for the use of molecular profiling to facilitate biomarker identification and development, and to stratify patients for more efficient treatment decisions with reduced adverse effects. In the past decade, great strides have been made to advance genomic, transcriptomic and proteomic approaches to address these demands. While there has been much progress with these large scale approaches, profiling at the protein level still faces challenges due to limitations in clinical sample size, poor reproducibility, unreliable quantitation, and lack of assay robustness. A novel automated capillary nano-immunoassay (CNIA) technology has been developed. This technology offers precise and accurate measurement of proteins and their post-translational modifications using either charge-based or size-based separation formats. The system not only uses ultralow nanogram levels of protein but also allows multi-analyte analysis using a parallel single-analyte format for increased sensitivity and specificity. The high sensitivity and excellent reproducibility of this technology make it particularly powerful for analysis of clinical samples. Furthermore, the system can distinguish and detect specific protein post-translational modifications that conventional Western blot and other immunoassays cannot easily capture. This review will summarize and evaluate the latest progress to optimize the CNIA system for comprehensive, quantitative protein and signaling event characterization. It will also discuss how the technology has been successfully applied in both discovery research and clinical studies, for signaling pathway dissection, proteomic biomarker assessment, targeted treatment evaluation and quantitative proteomic analysis. Lastly, a comparison of this novel system with other conventional immuno-assay platforms is performed.
Proteomics research in India: an update.
Reddy, Panga Jaipal; Atak, Apurva; Ghantasala, Saicharan; Kumar, Saurabh; Gupta, Shabarni; Prasad, T S Keshava; Zingde, Surekha M; Srivastava, Sanjeeva
2015-09-08
After a successful completion of the Human Genome Project, deciphering the mystery surrounding the human proteome posed a major challenge. Despite not being largely involved in the Human Genome Project, the Indian scientific community contributed towards proteomic research along with the global community. Currently, more than 76 research/academic institutes and nearly 145 research labs are involved in core proteomic research across India. The Indian researchers have been major contributors in drafting the "human proteome map" along with international efforts. In addition to this, virtual proteomics labs, proteomics courses and remote triggered proteomics labs have helped to overcome the limitations of proteomics education posed due to expensive lab infrastructure. The establishment of Proteomics Society, India (PSI) has created a platform for the Indian proteomic researchers to share ideas, research collaborations and conduct annual conferences and workshops. Indian proteomic research is really moving forward with the global proteomics community in a quest to solve the mysteries of proteomics. A draft map of the human proteome enhances the enthusiasm among intellectuals to promote proteomic research in India to the world.This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.
Proteomic technology for biomarker profiling in cancer: an update*
Alaoui-Jamali, Moulay A.; Xu, Ying-jie
2006-01-01
The progress in the understanding of cancer progression and early detection has been slow and frustrating due to the complex multifactorial nature and heterogeneity of the cancer syndrome. To date, no effective treatment is available for advanced cancers, which remain a major cause of morbidity and mortality. Clearly, there is urgent need to unravel novel biomarkers for early detection. Most of the functional information of the cancer-associated genes resides in the proteome. The later is an exceptionally complex biological system involving several proteins that function through posttranslational modifications and dynamic intermolecular collisions with partners. These protein complexes can be regulated by signals emanating from cancer cells, their surrounding tissue microenvironment, and/or from the host. Some proteins are secreted and/or cleaved into the extracellular milieu and may represent valuable serum biomarkers for diagnosis purpose. It is estimated that the cancer proteome may include over 1.5 million proteins as a result of posttranslational processing and modifications. Such complexity clearly highlights the need for ultra-high resolution proteomic technology for robust quantitative protein measurements and data acquisition. This review is to update the current research efforts in high-resolution proteomic technology for discovery and monitoring cancer biomarkers. PMID:16625706
Facile preparation of salivary extracellular vesicles for cancer proteomics
NASA Astrophysics Data System (ADS)
Sun, Yan; Xia, Zhijun; Shang, Zhi; Sun, Kaibo; Niu, Xiaomin; Qian, Liqiang; Fan, Liu-Yin; Cao, Cheng-Xi; Xiao, Hua
2016-04-01
Extracellular vesicles (EVs) are membrane surrounded structures released by cells, which have been increasingly recognized as mediators of intercellular communication. Recent reports indicate that EVs participate in important biological processes and could serve as potential source for cancer biomarkers. As an attractive EVs source with merit of non-invasiveness, human saliva is a unique medium for clinical diagnostics. Thus, we proposed a facile approach to prepare salivary extracellular vesicles (SEVs). Affinity chromatography column combined with filter system (ACCF) was developed to efficiently remove the high abundant proteins and viscous interferences of saliva. Protein profiling in the SEVs obtained by this strategy was compared with conventional centrifugation method, which demonstrated that about 70% more SEVs proteins could be revealed. To explore its utility for cancer proteomics, we analyzed the proteome of SEVs in lung cancer patients and normal controls. Shotgun proteomic analysis illustrated that 113 and 95 proteins have been identified in cancer group and control group, respectively. Among those 63 proteins that have been consistently discovered only in cancer group, 12 proteins are lung cancer related. Our results demonstrated that SEVs prepared through the developed strategy are valuable samples for proteomics and could serve as a promising liquid biopsy for cancer.
Ultrahigh pressure fast size exclusion chromatography for top-down proteomics.
Chen, Xin; Ge, Ying
2013-09-01
Top-down MS-based proteomics has gained a solid growth over the past few years but still faces significant challenges in the LC separation of intact proteins. In top-down proteomics, it is essential to separate the high mass proteins from the low mass species due to the exponential decay in S/N as a function of increasing molecular mass. SEC is a favored LC method for size-based separation of proteins but suffers from notoriously low resolution and detrimental dilution. Herein, we reported the use of ultrahigh pressure (UHP) SEC for rapid and high-resolution separation of intact proteins for top-down proteomics. Fast separation of intact proteins (6-669 kDa) was achieved in < 7 min with high resolution and high efficiency. More importantly, we have shown that this UHP-SEC provides high-resolution separation of intact proteins using a MS-friendly volatile solvent system, allowing the direct top-down MS analysis of SEC-eluted proteins without an additional desalting step. Taken together, we have demonstrated that UHP-SEC is an attractive LC strategy for the size separation of proteins with great potential for top-down proteomics. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Grigoryan, Marine; Shamshurin, Dmitry; Spicer, Victor; Krokhin, Oleg V
2013-11-19
As an initial step in our efforts to unify the expression of peptide retention times in proteomic liquid chromatography-mass spectrometry (LC-MS) experiments, we aligned the chromatographic properties of a number of peptide retention standards against a collection of peptides commonly observed in proteomic experiments. The standard peptide mixtures and tryptic digests of samples of different origins were separated under the identical chromatographic condition most commonly employed in proteomics: 100 Å C18 sorbent with 0.1% formic acid as an ion-pairing modifier. Following our original approach (Krokhin, O. V.; Spicer, V. Anal. Chem. 2009, 81, 9522-9530) the retention characteristics of these standards and collection of tryptic peptides were mapped into hydrophobicity index (HI) or acetonitrile percentage units. This scale allows for direct visualization of the chromatographic outcome of LC-MS acquisitions, monitors the performance of the gradient LC system, and simplifies method development and interlaboratory data alignment. Wide adoption of this approach would significantly aid understanding the basic principles of gradient peptide RP-HPLC and solidify our collective efforts in acquiring confident peptide retention libraries, a key component in the development of targeted proteomic approaches.
Functional proteomics within the genus Lactobacillus.
De Angelis, Maria; Calasso, Maria; Cavallo, Noemi; Di Cagno, Raffaella; Gobbetti, Marco
2016-03-01
Lactobacillus are mainly used for the manufacture of fermented dairy, sourdough, meat, and vegetable foods or used as probiotics. Under optimal processing conditions, Lactobacillus strains contribute to food functionality through their enzyme portfolio and the release of metabolites. An extensive genomic diversity analysis was conducted to elucidate the core features of the genus Lactobacillus, and to provide a better comprehension of niche adaptation of the strains. However, proteomics is an indispensable "omics" science to elucidate the proteome diversity, and the mechanisms of regulation and adaptation of Lactobacillus strains. This review focuses on the novel and comprehensive knowledge of functional proteomics and metaproteomics of Lactobacillus species. A large list of proteomic case studies of different Lactobacillus species is provided to illustrate the adaptability of the main metabolic pathways (e.g., carbohydrate transport and metabolism, pyruvate metabolism, proteolytic system, amino acid metabolism, and protein synthesis) to various life conditions. These investigations have highlighted that lactobacilli modulate the level of a complex panel of proteins to growth/survive in different ecological niches. In addition to the general regulation and stress response, specific metabolic pathways can be switched on and off, modifying the behavior of the strains. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Next-Generation Proteomics and Its Application to Clinical Breast Cancer Research.
Mardamshina, Mariya; Geiger, Tamar
2017-10-01
Proteomics technology aims to map the protein landscapes of biological samples, and it can be applied to a variety of samples, including cells, tissues, and body fluids. Because the proteins are the main functional molecules in the cells, their levels reflect much more accurately the cellular phenotype and the regulatory processes within them than gene levels, mutations, and even mRNA levels. With the advancement in the technology, it is possible now to obtain comprehensive views of the biological systems and to study large patient cohorts in a streamlined manner. In this review we discuss the technological advancements in mass spectrometry-based proteomics, which allow analysis of breast cancer tissue samples, leading to the first large-scale breast cancer proteomics studies. Furthermore, we discuss the technological developments in blood-based biomarker discovery, which provide the basis for future development of assays for routine clinical use. Although these are only the first steps in implementation of proteomics into the clinic, extensive collaborative work between these worlds will undoubtedly lead to major discoveries and advances in clinical practice. Copyright © 2017 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.
The Air Force In Silico -- Computational Biology in 2025
2007-11-01
and chromosome) these new fields are commonly referred to as “~omics.” Proteomics , transcriptomics, metabolomics , epigenomics, physiomics... Bioinformatics , 2006, http://journal.imbio.de/ http://www-bm.ipk-gatersleben.de/stable/php/ journal /articles/pdf/jib-22.pdf (accessed 30 September...Chirino, G. Tansley and I. Dryden, “The implications for Bioinformatics of integration across physical scales,” Journal of Integrative Bioinformatics
A Computational Tool to Detect and Avoid Redundancy in Selected Reaction Monitoring
Röst, Hannes; Malmström, Lars; Aebersold, Ruedi
2012-01-01
Selected reaction monitoring (SRM), also called multiple reaction monitoring, has become an invaluable tool for targeted quantitative proteomic analyses, but its application can be compromised by nonoptimal selection of transitions. In particular, complex backgrounds may cause ambiguities in SRM measurement results because peptides with interfering transitions similar to those of the target peptide may be present in the sample. Here, we developed a computer program, the SRMCollider, that calculates nonredundant theoretical SRM assays, also known as unique ion signatures (UIS), for a given proteomic background. We show theoretically that UIS of three transitions suffice to conclusively identify 90% of all yeast peptides and 85% of all human peptides. Using predicted retention times, the SRMCollider also simulates time-scheduled SRM acquisition, which reduces the number of interferences to consider and leads to fewer transitions necessary to construct an assay. By integrating experimental fragment ion intensities from large scale proteome synthesis efforts (SRMAtlas) with the information content-based UIS, we combine two orthogonal approaches to create high quality SRM assays ready to be deployed. We provide a user friendly, open source implementation of an algorithm to calculate UIS of any order that can be accessed online at http://www.srmcollider.org to find interfering transitions. Finally, our tool can also simulate the specificity of novel data-independent MS acquisition methods in Q1–Q3 space. This allows us to predict parameters for these methods that deliver a specificity comparable with that of SRM. Using SRM interference information in addition to other sources of information can increase the confidence in an SRM measurement. We expect that the consideration of information content will become a standard step in SRM assay design and analysis, facilitated by the SRMCollider. PMID:22535207
Graumann, Johannes; Scheltema, Richard A; Zhang, Yong; Cox, Jürgen; Mann, Matthias
2012-03-01
In the analysis of complex peptide mixtures by MS-based proteomics, many more peptides elute at any given time than can be identified and quantified by the mass spectrometer. This makes it desirable to optimally allocate peptide sequencing and narrow mass range quantification events. In computer science, intelligent agents are frequently used to make autonomous decisions in complex environments. Here we develop and describe a framework for intelligent data acquisition and real-time database searching and showcase selected examples. The intelligent agent is implemented in the MaxQuant computational proteomics environment, termed MaxQuant Real-Time. It analyzes data as it is acquired on the mass spectrometer, constructs isotope patterns and SILAC pair information as well as controls MS and tandem MS events based on real-time and prior MS data or external knowledge. Re-implementing a top10 method in the intelligent agent yields similar performance to the data dependent methods running on the mass spectrometer itself. We demonstrate the capabilities of MaxQuant Real-Time by creating a real-time search engine capable of identifying peptides "on-the-fly" within 30 ms, well within the time constraints of a shotgun fragmentation "topN" method. The agent can focus sequencing events onto peptides of specific interest, such as those originating from a specific gene ontology (GO) term, or peptides that are likely modified versions of already identified peptides. Finally, we demonstrate enhanced quantification of SILAC pairs whose ratios were poorly defined in survey spectra. MaxQuant Real-Time is flexible and can be applied to a large number of scenarios that would benefit from intelligent, directed data acquisition. Our framework should be especially useful for new instrument types, such as the quadrupole-Orbitrap, that are currently becoming available.
Graumann, Johannes; Scheltema, Richard A.; Zhang, Yong; Cox, Jürgen; Mann, Matthias
2012-01-01
In the analysis of complex peptide mixtures by MS-based proteomics, many more peptides elute at any given time than can be identified and quantified by the mass spectrometer. This makes it desirable to optimally allocate peptide sequencing and narrow mass range quantification events. In computer science, intelligent agents are frequently used to make autonomous decisions in complex environments. Here we develop and describe a framework for intelligent data acquisition and real-time database searching and showcase selected examples. The intelligent agent is implemented in the MaxQuant computational proteomics environment, termed MaxQuant Real-Time. It analyzes data as it is acquired on the mass spectrometer, constructs isotope patterns and SILAC pair information as well as controls MS and tandem MS events based on real-time and prior MS data or external knowledge. Re-implementing a top10 method in the intelligent agent yields similar performance to the data dependent methods running on the mass spectrometer itself. We demonstrate the capabilities of MaxQuant Real-Time by creating a real-time search engine capable of identifying peptides “on-the-fly” within 30 ms, well within the time constraints of a shotgun fragmentation “topN” method. The agent can focus sequencing events onto peptides of specific interest, such as those originating from a specific gene ontology (GO) term, or peptides that are likely modified versions of already identified peptides. Finally, we demonstrate enhanced quantification of SILAC pairs whose ratios were poorly defined in survey spectra. MaxQuant Real-Time is flexible and can be applied to a large number of scenarios that would benefit from intelligent, directed data acquisition. Our framework should be especially useful for new instrument types, such as the quadrupole-Orbitrap, that are currently becoming available. PMID:22171319
NASA Astrophysics Data System (ADS)
Ng, I.-Son; Yu, You-Jin; Yi, Ying-Chen; Tan, Shih-I.; Huang, Bo-Chuan; Han, Yin-Lung
2017-12-01
The proteomics strategy was utilized to analyze and identify the gold adsorption proteins from Tepidimonas fonticaldi AT-A2, due to its outstanding performance in gold-binding and recovery. The results showed that three small proteins, including histidine biosynthesis protein (HisIE), iron donor protein (CyaY) and hypothetical protein_65aa, have a higher ability to adsorb gold ions because of the negatively charged domains or metal binding sites. On the other hand, the Salmonella PmrA/PmrB two-component system first replaces the iron (III)-binding motif using the peptide sequence from hypothetical protein_65aa, and this is then used to reveal the sensing and responsiveness to gold metal ions, which is totally different from the performance of traditional gold binding peptide (GBP) on the crystals on the surface of gold (111). We have successfully demonstrated an integrative proteomics and bacterial two-component system to explore the novel gold binding peptide. Finally, the heterologous over-expression of gold binding peptide by E. coli and the equilibrium of binding capacity for Au(III) have been conducted.
LXtoo: an integrated live Linux distribution for the bioinformatics community
2012-01-01
Background Recent advances in high-throughput technologies dramatically increase biological data generation. However, many research groups lack computing facilities and specialists. This is an obstacle that remains to be addressed. Here, we present a Linux distribution, LXtoo, to provide a flexible computing platform for bioinformatics analysis. Findings Unlike most of the existing live Linux distributions for bioinformatics limiting their usage to sequence analysis and protein structure prediction, LXtoo incorporates a comprehensive collection of bioinformatics software, including data mining tools for microarray and proteomics, protein-protein interaction analysis, and computationally complex tasks like molecular dynamics. Moreover, most of the programs have been configured and optimized for high performance computing. Conclusions LXtoo aims to provide well-supported computing environment tailored for bioinformatics research, reducing duplication of efforts in building computing infrastructure. LXtoo is distributed as a Live DVD and freely available at http://bioinformatics.jnu.edu.cn/LXtoo. PMID:22813356
LXtoo: an integrated live Linux distribution for the bioinformatics community.
Yu, Guangchuang; Wang, Li-Gen; Meng, Xiao-Hua; He, Qing-Yu
2012-07-19
Recent advances in high-throughput technologies dramatically increase biological data generation. However, many research groups lack computing facilities and specialists. This is an obstacle that remains to be addressed. Here, we present a Linux distribution, LXtoo, to provide a flexible computing platform for bioinformatics analysis. Unlike most of the existing live Linux distributions for bioinformatics limiting their usage to sequence analysis and protein structure prediction, LXtoo incorporates a comprehensive collection of bioinformatics software, including data mining tools for microarray and proteomics, protein-protein interaction analysis, and computationally complex tasks like molecular dynamics. Moreover, most of the programs have been configured and optimized for high performance computing. LXtoo aims to provide well-supported computing environment tailored for bioinformatics research, reducing duplication of efforts in building computing infrastructure. LXtoo is distributed as a Live DVD and freely available at http://bioinformatics.jnu.edu.cn/LXtoo.
He, Yongqun
2011-01-01
Brucella is a Gram-negative, facultative intracellular bacterium that causes zoonotic brucellosis in humans and various animals. Out of 10 classified Brucella species, B. melitensis, B. abortus, B. suis, and B. canis are pathogenic to humans. In the past decade, the mechanisms of Brucella pathogenesis and host immunity have been extensively investigated using the cutting edge systems biology and bioinformatics approaches. This article provides a comprehensive review of the applications of Omics (including genomics, transcriptomics, and proteomics) and bioinformatics technologies for the analysis of Brucella pathogenesis, host immune responses, and vaccine targets. Based on more than 30 sequenced Brucella genomes, comparative genomics is able to identify gene variations among Brucella strains that help to explain host specificity and virulence differences among Brucella species. Diverse transcriptomics and proteomics gene expression studies have been conducted to analyze gene expression profiles of wild type Brucella strains and mutants under different laboratory conditions. High throughput Omics analyses of host responses to infections with virulent or attenuated Brucella strains have been focused on responses by mouse and cattle macrophages, bovine trophoblastic cells, mouse and boar splenocytes, and ram buffy coat. Differential serum responses in humans and rams to Brucella infections have been analyzed using high throughput serum antibody screening technology. The Vaxign reverse vaccinology has been used to predict many Brucella vaccine targets. More than 180 Brucella virulence factors and their gene interaction networks have been identified using advanced literature mining methods. The recent development of community-based Vaccine Ontology and Brucellosis Ontology provides an efficient way for Brucella data integration, exchange, and computer-assisted automated reasoning. PMID:22919594
He, Yongqun
2012-01-01
Brucella is a Gram-negative, facultative intracellular bacterium that causes zoonotic brucellosis in humans and various animals. Out of 10 classified Brucella species, B. melitensis, B. abortus, B. suis, and B. canis are pathogenic to humans. In the past decade, the mechanisms of Brucella pathogenesis and host immunity have been extensively investigated using the cutting edge systems biology and bioinformatics approaches. This article provides a comprehensive review of the applications of Omics (including genomics, transcriptomics, and proteomics) and bioinformatics technologies for the analysis of Brucella pathogenesis, host immune responses, and vaccine targets. Based on more than 30 sequenced Brucella genomes, comparative genomics is able to identify gene variations among Brucella strains that help to explain host specificity and virulence differences among Brucella species. Diverse transcriptomics and proteomics gene expression studies have been conducted to analyze gene expression profiles of wild type Brucella strains and mutants under different laboratory conditions. High throughput Omics analyses of host responses to infections with virulent or attenuated Brucella strains have been focused on responses by mouse and cattle macrophages, bovine trophoblastic cells, mouse and boar splenocytes, and ram buffy coat. Differential serum responses in humans and rams to Brucella infections have been analyzed using high throughput serum antibody screening technology. The Vaxign reverse vaccinology has been used to predict many Brucella vaccine targets. More than 180 Brucella virulence factors and their gene interaction networks have been identified using advanced literature mining methods. The recent development of community-based Vaccine Ontology and Brucellosis Ontology provides an efficient way for Brucella data integration, exchange, and computer-assisted automated reasoning.
Pan, Sheng; Rush, John; Peskind, Elaine R; Galasko, Douglas; Chung, Kathryn; Quinn, Joseph; Jankovic, Joseph; Leverenz, James B; Zabetian, Cyrus; Pan, Catherine; Wang, Yan; Oh, Jung Hun; Gao, Jean; Zhang, Jianpeng; Montine, Thomas; Zhang, Jing
2008-02-01
Targeted quantitative proteomics by mass spectrometry aims to selectively detect one or a panel of peptides/proteins in a complex sample and is particularly appealing for novel biomarker verification/validation because it does not require specific antibodies. Here, we demonstrated the application of targeted quantitative proteomics in searching, identifying, and quantifying selected peptides in human cerebrospinal spinal fluid (CSF) using a matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometer (MALDI TOF/TOF)-based platform. The approach involved two major components: the use of isotopic-labeled synthetic peptides as references for targeted identification and quantification and a highly selective mass spectrometric analysis based on the unique characteristics of the MALDI instrument. The platform provides high confidence for targeted peptide detection in a complex system and can potentially be developed into a high-throughput system. Using the liquid chromatography (LC) MALDI TOF/TOF platform and the complementary identification strategy, we were able to selectively identify and quantify a panel of targeted peptides in the whole proteome of CSF without prior depletion of abundant proteins. The effectiveness and robustness of the approach associated with different sample complexity, sample preparation strategies, as well as mass spectrometric quantification were evaluated. Other issues related to chromatography separation and the feasibility for high-throughput analysis were also discussed. Finally, we applied targeted quantitative proteomics to analyze a subset of previously identified candidate markers in CSF samples of patients with Parkinson's disease (PD) at different stages and Alzheimer's disease (AD) along with normal controls.
Assessing the impact of transcriptomics, proteomics and metabolomics on fungal phytopathology.
Tan, Kar-Chun; Ipcho, Simon V S; Trengove, Robert D; Oliver, Richard P; Solomon, Peter S
2009-09-01
SUMMARY Peer-reviewed literature is today littered with exciting new tools and techniques that are being used in all areas of biology and medicine. Transcriptomics, proteomics and, more recently, metabolomics are three of these techniques that have impacted on fungal plant pathology. Used individually, each of these techniques can generate a plethora of data that could occupy a laboratory for years. When used in combination, they have the potential to comprehensively dissect a system at the transcriptional and translational level. Transcriptomics, or quantitative gene expression profiling, is arguably the most familiar to researchers in the field of fungal plant pathology. Microarrays have been the primary technique for the last decade, but others are now emerging. Proteomics has also been exploited by the fungal phytopathogen community, but perhaps not to its potential. A lack of genome sequence information has frustrated proteomics researchers and has largely contributed to this technique not fulfilling its potential. The coming of the genome sequencing era has partially alleviated this problem. Metabolomics is the most recent of these techniques to emerge and is concerned with the non-targeted profiling of all metabolites in a given system. Metabolomics studies on fungal plant pathogens are only just beginning to appear, although its potential to dissect many facets of the pathogen and disease will see its popularity increase quickly. This review assesses the impact of transcriptomics, proteomics and metabolomics on fungal plant pathology over the last decade and discusses their futures. Each of the techniques is described briefly with further reading recommended. Key examples highlighting the application of these technologies to fungal plant pathogens are also reviewed.
Integrating cell biology and proteomic approaches in plants.
Takáč, Tomáš; Šamajová, Olga; Šamaj, Jozef
2017-10-03
Significant improvements of protein extraction, separation, mass spectrometry and bioinformatics nurtured advancements of proteomics during the past years. The usefulness of proteomics in the investigation of biological problems can be enhanced by integration with other experimental methods from cell biology, genetics, biochemistry, pharmacology, molecular biology and other omics approaches including transcriptomics and metabolomics. This review aims to summarize current trends integrating cell biology and proteomics in plant science. Cell biology approaches are most frequently used in proteomic studies investigating subcellular and developmental proteomes, however, they were also employed in proteomic studies exploring abiotic and biotic stress responses, vesicular transport, cytoskeleton and protein posttranslational modifications. They are used either for detailed cellular or ultrastructural characterization of the object subjected to proteomic study, validation of proteomic results or to expand proteomic data. In this respect, a broad spectrum of methods is employed to support proteomic studies including ultrastructural electron microscopy studies, histochemical staining, immunochemical localization, in vivo imaging of fluorescently tagged proteins and visualization of protein-protein interactions. Thus, cell biological observations on fixed or living cell compartments, cells, tissues and organs are feasible, and in some cases fundamental for the validation and complementation of proteomic data. Validation of proteomic data by independent experimental methods requires development of new complementary approaches. Benefits of cell biology methods and techniques are not sufficiently highlighted in current proteomic studies. This encouraged us to review most popular cell biology methods used in proteomic studies and to evaluate their relevance and potential for proteomic data validation and enrichment of purely proteomic analyses. We also provide examples of representative studies combining proteomic and cell biology methods for various purposes. Integrating cell biology approaches with proteomic ones allow validation and better interpretation of proteomic data. Moreover, cell biology methods remarkably extend the knowledge provided by proteomic studies and might be fundamental for the functional complementation of proteomic data. This review article summarizes current literature linking proteomics with cell biology. Copyright © 2017 Elsevier B.V. All rights reserved.
Arai, Kazuya; Sakamoto, Ruriko; Kubota, Daisuke; Kondo, Tadashi
2013-08-01
Chemoresistance is one of the most critical prognostic factors in osteosarcoma, and elucidation of the molecular backgrounds of chemoresistance may lead to better clinical outcomes. Spheroid cells resemble in vivo cells and are considered an in vitro model for the drug discovery. We found that spheroid cells displayed more chemoresistance than conventional monolayer cells across 11 osteosarcoma cell lines. To investigate the molecular mechanisms underlying the resistance to chemotherapy, we examined the proteomic differences between the monolayer and spheroid cells by 2D-DIGE. Of the 4762 protein species observed, we further investigated 435 species with annotated mass spectra in the public proteome database, Genome Medicine Database of Japan Proteomics. Among the 435 protein species, we found that 17 species exhibited expression level differences when the cells formed spheroids in more than five cell lines and four species out of these 17 were associated with spheroid-formation associated resistance to doxorubicin. We confirmed the upregulation of cathepsin D in spheroid cells by western blotting. Cathepsin D has been implicated in chemoresistance of various malignancies but has not previously been implemented in osteosarcoma. Our study suggested that the spheroid system may be a useful tool to reveal the molecular backgrounds of chemoresistance in osteosarcoma. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic Analysis of the Human Olfactory Bulb.
Dammalli, Manjunath; Dey, Gourav; Madugundu, Anil K; Kumar, Manish; Rodrigues, Benvil; Gowda, Harsha; Siddaiah, Bychapur Gowrishankar; Mahadevan, Anita; Shankar, Susarla Krishna; Prasad, Thottethodi Subrahmanya Keshava
2017-08-01
The importance of olfaction to human health and disease is often underappreciated. Olfactory dysfunction has been reported in association with a host of common complex diseases, including neurological diseases such as Alzheimer's disease and Parkinson's disease. For health, olfaction or the sense of smell is also important for most mammals, for optimal engagement with their environment. Indeed, animals have developed sophisticated olfactory systems to detect and interpret the rich information presented to them to assist in day-to-day activities such as locating food sources, differentiating food from poisons, identifying mates, promoting reproduction, avoiding predators, and averting death. In this context, the olfactory bulb is a vital component of the olfactory system receiving sensory information from the axons of the olfactory receptor neurons located in the nasal cavity and the first place that processes the olfactory information. We report in this study original observations on the human olfactory bulb proteome in healthy subjects, using a high-resolution mass spectrometry-based proteomic approach. We identified 7750 nonredundant proteins from human olfactory bulbs. Bioinformatics analysis of these proteins showed their involvement in biological processes associated with signal transduction, metabolism, transport, and olfaction. These new observations provide a crucial baseline molecular profile of the human olfactory bulb proteome, and should assist the future discovery of biomarker proteins and novel diagnostics associated with diseases characterized by olfactory dysfunction.
Xia, Kai; Zang, Ning; Zhang, Junmei; Zhang, Hong; Li, Yudong; Liu, Ye; Feng, Wei; Liang, Xinle
2016-12-05
Acetobacter pasteurianus is the main starter in rice vinegar manufacturing due to its remarkable abilities to resist and produce acetic acid. Although several mechanisms of acetic acid resistance have been proposed and only a few effector proteins have been identified, a comprehensive depiction of the biological processes involved in acetic acid resistance is needed. In this study, iTRAQ-based quantitative proteomic analysis was adopted to investigate the whole proteome of different acidic titers (3.6, 7.1 and 9.3%, w/v) of Acetobacter pasteurianus Ab3 during the vinegar fermentation process. Consequently, 1386 proteins, including 318 differentially expressed proteins (p<0.05), were identified. Compared to that in the low titer circumstance, cells conducted distinct biological processes under high acetic acid stress, where >150 proteins were differentially expressed. Specifically, proteins involved in amino acid metabolic processes and fatty acid biosynthesis were differentially expressed, which may contribute to the acetic acid resistance of Acetobacter. Transcription factors, two component systems and toxin-antitoxin systems were implicated in the modulatory network at multiple levels. In addition, the identification of proteins involved in redox homeostasis, protein metabolism, and the cell envelope suggested that the whole cellular system is mobilized in response to acid stress. These findings provide a differential proteomic profile of acetic acid resistance in Acetobacter pasteurianus and have potential application to highly acidic rice vinegar manufacturing. Copyright © 2016 Elsevier B.V. All rights reserved.
Using Big Data to Discover Diagnostics and Therapeutics for Gastrointestinal and Liver Diseases
Wooden, Benjamin; Goossens, Nicolas; Hoshida, Yujin; Friedman, Scott L.
2016-01-01
Technologies such as genome sequencing, gene expression profiling, proteomic and metabolomic analyses, electronic medical records, and patient-reported health information have produced large amounts of data, from various populations, cell types, and disorders (big data). However, these data must be integrated and analyzed if they are to produce models or concepts about physiologic function or mechanisms of pathogenesis. Many of these data are available to the public, allowing researchers anywhere to search for markers of specific biologic processes or therapeutic targets for specific diseases or patient types. We review recent advances in the fields of computational and systems biology, and highlight opportunities for researchers to use big data sets in the fields of gastroenterology and hepatology, to complement traditional means of diagnostic and therapeutic discovery. PMID:27773806
Marlow, Jeffery; Skennerton, Connor T.; Li, Zhou; ...
2016-04-29
Marine methane seep habitats represent an important control on the global flux of methane between the subsurface and water column reservoirs. Meta-omics studies have begun to outline community-wide metabolic potential, but expression patterns of proteins that enact sulfate-mediated anaerobic methane oxidation in seeps are poorly characterized. Proteomic stable isotope probing (proteomic SIP) offers an additional layer of information for characterizing phylogenetically specific, functionally relevant activity in mixed microbial communities. Here we applied proteomic SIP to 15NH4+ and CH4 amended seep sediment microcosms in an attempt to track the protein synthesis of slow-growing, low-energy microbial systems. Across all samples, 3495 proteinsmore » were identified, 21% of which were 15N-labeled. We observed active synthesis (15N enrichment) of all proteins believed to be involved in sulfate reduction and reverse methanogenesis including methylenetetrahydromethanopterin reductase (Mer). The abundance and phylogenetic range of methyl-coenzyme M reductase (Mcr) orthologs produced during incubation experiments suggests that seeps provide sufficient niches for multiple organisms performing analogous metabolisms. Twenty-eight previously unreported post-translational modifications of McrA were measured, indicating dynamic enzymatic machinery and offering a dimension of functional diversity beyond gene-dictated sequence. RNA polymerase associated with putative sulfur-oxidizing Epsilonproteobacteria and aerobic Gammaproteobacteria were more abundant among pre-incubation proteins, suggesting diminished metabolic activity in long-term anoxic, sulfidic experimental incubations. Twenty-six proteins of unknown function were detected in all proteomic experiments and actively expressed in labeled experiments, suggesting that they play important roles in methane seep ecosystems. The addition of stable isotope probing to environmental proteomics experiments provides a mechanism to begin to assess the degree to which diagnostic meatbolic proteins are long-lived or acively synthesized in complex, slow-growing microbial communities. Our work here demonstrates that sediment-hosted microbial assemblages in marine methane seeps are dynamic, heterogeneous systems with broad functional diversity.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Marlow, Jeffery; Skennerton, Connor T.; Li, Zhou
Marine methane seep habitats represent an important control on the global flux of methane between the subsurface and water column reservoirs. Meta-omics studies have begun to outline community-wide metabolic potential, but expression patterns of proteins that enact sulfate-mediated anaerobic methane oxidation in seeps are poorly characterized. Proteomic stable isotope probing (proteomic SIP) offers an additional layer of information for characterizing phylogenetically specific, functionally relevant activity in mixed microbial communities. Here we applied proteomic SIP to 15NH4+ and CH4 amended seep sediment microcosms in an attempt to track the protein synthesis of slow-growing, low-energy microbial systems. Across all samples, 3495 proteinsmore » were identified, 21% of which were 15N-labeled. We observed active synthesis (15N enrichment) of all proteins believed to be involved in sulfate reduction and reverse methanogenesis including methylenetetrahydromethanopterin reductase (Mer). The abundance and phylogenetic range of methyl-coenzyme M reductase (Mcr) orthologs produced during incubation experiments suggests that seeps provide sufficient niches for multiple organisms performing analogous metabolisms. Twenty-eight previously unreported post-translational modifications of McrA were measured, indicating dynamic enzymatic machinery and offering a dimension of functional diversity beyond gene-dictated sequence. RNA polymerase associated with putative sulfur-oxidizing Epsilonproteobacteria and aerobic Gammaproteobacteria were more abundant among pre-incubation proteins, suggesting diminished metabolic activity in long-term anoxic, sulfidic experimental incubations. Twenty-six proteins of unknown function were detected in all proteomic experiments and actively expressed in labeled experiments, suggesting that they play important roles in methane seep ecosystems. The addition of stable isotope probing to environmental proteomics experiments provides a mechanism to begin to assess the degree to which diagnostic meatbolic proteins are long-lived or acively synthesized in complex, slow-growing microbial communities. Our work here demonstrates that sediment-hosted microbial assemblages in marine methane seeps are dynamic, heterogeneous systems with broad functional diversity.« less
Ferro, Myriam; Brugière, Sabine; Salvi, Daniel; Seigneurin-Berny, Daphné; Court, Magali; Moyet, Lucas; Ramus, Claire; Miras, Stéphane; Mellal, Mourad; Le Gall, Sophie; Kieffer-Jaquinod, Sylvie; Bruley, Christophe; Garin, Jérôme; Joyard, Jacques; Masselon, Christophe; Rolland, Norbert
2010-06-01
Recent advances in the proteomics field have allowed a series of high throughput experiments to be conducted on chloroplast samples, and the data are available in several public databases. However, the accurate localization of many chloroplast proteins often remains hypothetical. This is especially true for envelope proteins. We went a step further into the knowledge of the chloroplast proteome by focusing, in the same set of experiments, on the localization of proteins in the stroma, the thylakoids, and envelope membranes. LC-MS/MS-based analyses first allowed building the AT_CHLORO database (http://www.grenoble.prabi.fr/protehome/grenoble-plant-proteomics/), a comprehensive repertoire of the 1323 proteins, identified by 10,654 unique peptide sequences, present in highly purified chloroplasts and their subfractions prepared from Arabidopsis thaliana leaves. This database also provides extensive proteomics information (peptide sequences and molecular weight, chromatographic retention times, MS/MS spectra, and spectral count) for a unique chloroplast protein accurate mass and time tag database gathering identified peptides with their respective and precise analytical coordinates, molecular weight, and retention time. We assessed the partitioning of each protein in the three chloroplast compartments by using a semiquantitative proteomics approach (spectral count). These data together with an in-depth investigation of the literature were compiled to provide accurate subplastidial localization of previously known and newly identified proteins. A unique knowledge base containing extensive information on the proteins identified in envelope fractions was thus obtained, allowing new insights into this membrane system to be revealed. Altogether, the data we obtained provide unexpected information about plastidial or subplastidial localization of some proteins that were not suspected to be associated to this membrane system. The spectral counting-based strategy was further validated as the compartmentation of well known pathways (for instance, photosynthesis and amino acid, fatty acid, or glycerolipid biosynthesis) within chloroplasts could be dissected. It also allowed revisiting the compartmentation of the chloroplast metabolism and functions.
State-of-the-art nanoplatform-integrated MALDI-MS impacting resolutions in urinary proteomics.
Gopal, Judy; Muthu, Manikandan; Chun, Se-Chul; Wu, Hui-Fen
2015-06-01
Urine proteomics has become a subject of interest, since it has led to a number of breakthroughs in disease diagnostics. Urine contains information not only from the kidney and the urinary tract but also from other organs, thus urinary proteome analysis allows for identification of biomarkers for both urogenital and systemic diseases. The following review gives a brief overview of the analytical techniques that have been in practice for urinary proteomics. MALDI-MS technique and its current application status in this area of clinical research have been discussed. The review comments on the challenges facing the conventional MALDI-MS technique and the upgradation of this technique with the introduction of nanotechnology. This review projects nano-based techniques such as nano-MALDI-MS, surface-assisted laser desorption/ionization, and nanostructure-initiator MS as the platforms that have the potential in trafficking MALDI-MS from the lab to the bedside. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Global Proteomics Analysis of the Response to Starvation in C. elegans*
Larance, Mark; Pourkarimi, Ehsan; Wang, Bin; Brenes Murillo, Alejandro; Kent, Robert; Lamond, Angus I.; Gartner, Anton
2015-01-01
Periodic starvation of animals induces large shifts in metabolism but may also influence many other cellular systems and can lead to adaption to prolonged starvation conditions. To date, there is limited understanding of how starvation affects gene expression, particularly at the protein level. Here, we have used mass-spectrometry-based quantitative proteomics to identify global changes in the Caenorhabditis elegans proteome due to acute starvation of young adult animals. Measuring changes in the abundance of over 5,000 proteins, we show that acute starvation rapidly alters the levels of hundreds of proteins, many involved in central metabolic pathways, highlighting key regulatory responses. Surprisingly, we also detect changes in the abundance of chromatin-associated proteins, including specific linker histones, histone variants, and histone posttranslational modifications associated with the epigenetic control of gene expression. To maximize community access to these data, they are presented in an online searchable database, the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd/). PMID:25963834
A Systematic Analysis of a Deep Mouse Epididymal Sperm Proteome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chauvin, Theodore; Xie, Fang; Liu, Tao
Spermatozoa are highly specialized cells that, when mature, are capable of navigating the female reproductive tract and fertilizing an oocyte. The sperm cell is thought to be largely quiescent in terms of transcriptional and translational activity. As a result, once it has left the male reproductive tract, the sperm cell is essentially operating with a static population of proteins. It is therefore theoretically possible to understand the protein networks contained in a sperm cell and to deduce its cellular function capabilities. To this end we have performed a proteomic analysis of mouse sperm isolated from the cauda epididymis and havemore » confidently identified 2,850 proteins, which is the most comprehensive sperm proteome for any species reported to date. These proteins comprise many complete cellular pathways, including those for energy production via glycolysis, β-oxidation and oxidative phosphorylation, protein folding and transport, and cell signaling systems. This proteome should prove a useful tool for assembly and testing of protein networks important for sperm function.« less
Nakamura, Tatsuji; Kuromitsu, Junro; Oda, Yoshiya
2008-03-01
Two-dimensional liquid-chromatographic (LC) separation followed by mass spectrometric (MS) analysis was examined for the identification of peptides in complex mixtures as an alternative to widely used two-dimensional gel electrophoresis followed by MS analysis for use in proteomics. The present method involves the off-line coupling of a narrow-bore, polymer-based, reversed-phase column using an acetonitrile gradient in an alkaline mobile phase in the first dimension with octadecylsilanized silica (ODS)-based nano-LC/MS in the second dimension. After the first separation, successive fractions were acidified and dried off-line, then loaded on the second dimension column. Both columns separate peptides according to hydrophobicity under different pH conditions, but more peptides were identified than with the conventional technique for shotgun proteomics, that is, the combination of a strong cation exchange column with an ODS column, and the system was robust because no salts were included in the mobile phases. The suitability of the method for proteomics measurements was evaluated.
Comparative proteomics in alkaptonuria provides insights into inflammation and oxidative stress.
Braconi, Daniela; Bernardini, Giulia; Paffetti, Alessandro; Millucci, Lia; Geminiani, Michela; Laschi, Marcella; Frediani, Bruno; Marzocchi, Barbara; Santucci, Annalisa
2016-12-01
Alkaptonuria (AKU) is an ultra-rare inborn error of metabolism associated with a defective catabolism of phenylalanine and tyrosine leading to increased systemic levels of homogentisic acid (HGA). Excess HGA is partly excreted in the urine, partly accumulated within the body and deposited onto connective tissues under the form of an ochronotic pigment, leading to a range of clinical manifestations. No clear genotype/phenotype correlation was found in AKU, and today there is the urgent need to identify biomarkers able to monitor AKU progression and evaluate response to treatment. With this aim, we provided the first proteomic study on serum and plasma samples from alkaptonuric individuals showing pathological SAA, CRP and Advanced Oxidation Protein Products (AOPP) levels. Interesting similarities with proteomic studies on other rheumatic diseases were highlighted together with proteome alterations supporting the existence of oxidative stress and inflammation in AKU. Potential candidate biomarkers to assess disease severity, monitor disease progression and evaluate response to treatment were identified as well. Copyright © 2016 Elsevier Ltd. All rights reserved.
Proteome-wide survey of the autoimmune target repertoire in autoimmune polyendocrine syndrome type 1
Landegren, Nils; Sharon, Donald; Freyhult, Eva; Hallgren, Åsa; Eriksson, Daniel; Edqvist, Per-Henrik; Bensing, Sophie; Wahlberg, Jeanette; Nelson, Lawrence M.; Gustafsson, Jan; Husebye, Eystein S.; Anderson, Mark S.; Snyder, Michael; Kämpe, Olle
2016-01-01
Autoimmune polyendocrine syndrome type 1 (APS1) is a monogenic disorder that features multiple autoimmune disease manifestations. It is caused by mutations in the Autoimmune regulator (AIRE) gene, which promote thymic display of thousands of peripheral tissue antigens in a process critical for establishing central immune tolerance. We here used proteome arrays to perform a comprehensive study of autoimmune targets in APS1. Interrogation of established autoantigens revealed highly reliable detection of autoantibodies, and by exploring the full panel of more than 9000 proteins we further identified MAGEB2 and PDILT as novel major autoantigens in APS1. Our proteome-wide assessment revealed a marked enrichment for tissue-specific immune targets, mirroring AIRE’s selectiveness for this category of genes. Our findings also suggest that only a very limited portion of the proteome becomes targeted by the immune system in APS1, which contrasts the broad defect of thymic presentation associated with AIRE-deficiency and raises novel questions what other factors are needed for break of tolerance. PMID:26830021
Rea, Giuseppina; Cristofaro, Francesco; Pani, Giuseppe; Pascucci, Barbara; Ghuge, Sandip A; Corsetto, Paola Antonia; Imbriani, Marcello; Visai, Livia; Rizzo, Angela M
2016-03-30
Space is a hostile environment characterized by high vacuum, extreme temperatures, meteoroids, space debris, ionospheric plasma, microgravity and space radiation, which all represent risks for human health. A deep understanding of the biological consequences of exposure to the space environment is required to design efficient countermeasures to minimize their negative impact on human health. Recently, proteomic approaches have received a significant amount of attention in the effort to further study microgravity-induced physiological changes. In this review, we summarize the current knowledge about the effects of microgravity on microorganisms (in particular Cupriavidus metallidurans CH34, Bacillus cereus and Rhodospirillum rubrum S1H), plants (whole plants, organs, and cell cultures), mammalian cells (endothelial cells, bone cells, chondrocytes, muscle cells, thyroid cancer cells, immune system cells) and animals (invertebrates, vertebrates and mammals). Herein, we describe their proteome's response to microgravity, focusing on proteomic discoveries and their future potential applications in space research. Space experiments and operational flight experience have identified detrimental effects on human health and performance because of exposure to weightlessness, even when currently available countermeasures are implemented. Many experimental tools and methods have been developed to study microgravity induced physiological changes. Recently, genomic and proteomic approaches have received a significant amount of attention. This review summarizes the recent research studies of the proteome response to microgravity inmicroorganisms, plants, mammalians cells and animals. Current proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of all proteomes. Understanding gene and/or protein expression is the key to unlocking the mechanisms behind microgravity-induced problems and to finding effective countermeasures to spaceflight-induced alterations but also for the study of diseases on earth. Future perspectives are also highlighted. Copyright © 2015 Elsevier B.V. All rights reserved.
Bordbar, Aarash; Jamshidi, Neema; Palsson, Bernhard O
2011-07-12
The development of high-throughput technologies capable of whole cell measurements of genes, proteins, and metabolites has led to the emergence of systems biology. Integrated analysis of the resulting omic data sets has proved to be hard to achieve. Metabolic network reconstructions enable complex relationships amongst molecular components to be represented formally in a biologically relevant manner while respecting physical constraints. In silico models derived from such reconstructions can then be queried or interrogated through mathematical simulations. Proteomic profiling studies of the mature human erythrocyte have shown more proteins present related to metabolic function than previously thought; however the significance and the causal consequences of these findings have not been explored. Erythrocyte proteomic data was used to reconstruct the most expansive description of erythrocyte metabolism to date, following extensive manual curation, assessment of the literature, and functional testing. The reconstruction contains 281 enzymes representing functions from glycolysis to cofactor and amino acid metabolism. Such a comprehensive view of erythrocyte metabolism implicates the erythrocyte as a potential biomarker for different diseases as well as a 'cell-based' drug-screening tool. The analysis shows that 94 erythrocyte enzymes are implicated in morbid single nucleotide polymorphisms, representing 142 pathologies. In addition, over 230 FDA-approved and experimental pharmaceuticals have enzymatic targets in the erythrocyte. The advancement of proteomic technologies and increased generation of high-throughput proteomic data have created the need for a means to analyze these data in a coherent manner. Network reconstructions provide a systematic means to integrate and analyze proteomic data in a biologically meaning manner. Analysis of the red cell proteome has revealed an unexpected level of complexity in the functional capabilities of human erythrocyte metabolism.
Agrawal, Ganesh Kumar; Sarkar, Abhijit; Righetti, Pier Giorgio; Pedreschi, Romina; Carpentier, Sebastien; Wang, Tai; Barkla, Bronwyn J; Kohli, Ajay; Ndimba, Bongani Kaiser; Bykova, Natalia V; Rampitsch, Christof; Zolla, Lello; Rafudeen, Mohamed Suhail; Cramer, Rainer; Bindschedler, Laurence Veronique; Tsakirpaloglou, Nikolaos; Ndimba, Roya Janeen; Farrant, Jill M; Renaut, Jenny; Job, Dominique; Kikuchi, Shoshi; Rakwal, Randeep
2013-01-01
Tremendous progress in plant proteomics driven by mass spectrometry (MS) techniques has been made since 2000 when few proteomics reports were published and plant proteomics was in its infancy. These achievements include the refinement of existing techniques and the search for new techniques to address food security, safety, and health issues. It is projected that in 2050, the world's population will reach 9-12 billion people demanding a food production increase of 34-70% (FAO, 2009) from today's food production. Provision of food in a sustainable and environmentally committed manner for such a demand without threatening natural resources, requires that agricultural production increases significantly and that postharvest handling and food manufacturing systems become more efficient requiring lower energy expenditure, a decrease in postharvest losses, less waste generation and food with longer shelf life. There is also a need to look for alternative protein sources to animal based (i.e., plant based) to be able to fulfill the increase in protein demands by 2050. Thus, plant biology has a critical role to play as a science capable of addressing such challenges. In this review, we discuss proteomics especially MS, as a platform, being utilized in plant biology research for the past 10 years having the potential to expedite the process of understanding plant biology for human benefits. The increasing application of proteomics technologies in food security, analysis, and safety is emphasized in this review. But, we are aware that no unique approach/technology is capable to address the global food issues. Proteomics-generated information/resources must be integrated and correlated with other omics-based approaches, information, and conventional programs to ensure sufficient food and resources for human development now and in the future. © 2013 Wiley Periodicals, Inc.
Marlow, Jeffrey J.; Skennerton, Connor T.; Li, Zhou; Chourey, Karuna; Hettich, Robert L.; Pan, Chongle; Orphan, Victoria J.
2016-01-01
Marine methane seep habitats represent an important control on the global flux of methane. Nucleotide-based meta-omics studies outline community-wide metabolic potential, but expression patterns of environmentally relevant proteins are poorly characterized. Proteomic stable isotope probing (proteomic SIP) provides additional information by characterizing phylogenetically specific, functionally relevant activity in mixed microbial communities, offering enhanced detection through system-wide product integration. Here we applied proteomic SIP to 15NH4+ and CH4 amended seep sediment microcosms in an attempt to track protein synthesis of slow-growing, low-energy microbial systems. Across all samples, 3495 unique proteins were identified, 11% of which were 15N-labeled. Consistent with the dominant anaerobic oxidation of methane (AOM) activity commonly observed in anoxic seep sediments, proteins associated with sulfate reduction and reverse methanogenesis—including the ANME-2 associated methylenetetrahydromethanopterin reductase (Mer)—were all observed to be actively synthesized (15N-enriched). Conversely, proteins affiliated with putative aerobic sulfur-oxidizing epsilon- and gammaproteobacteria showed a marked decrease over time in our anoxic sediment incubations. The abundance and phylogenetic range of 15N-enriched methyl-coenzyme M reductase (Mcr) orthologs, many of which exhibited novel post-translational modifications, suggests that seep sediments provide niches for multiple organisms performing analogous metabolisms. In addition, 26 proteins of unknown function were consistently detected and actively expressed under conditions supporting AOM, suggesting that they play important roles in methane seep ecosystems. Stable isotope probing in environmental proteomics experiments provides a mechanism to determine protein durability and evaluate lineage-specific responses in complex microbial communities placed under environmentally relevant conditions. Our work here demonstrates the active synthesis of a metabolically specific minority of enzymes, revealing the surprising longevity of most proteins over the course of an extended incubation experiment in an established, slow-growing, methane-impacted environmental system. PMID:27199908
Analysis of the pumpkin phloem proteome provides insights into angiosperm sieve tube function.
Lin, Ming-Kuem; Lee, Young-Jin; Lough, Tony J; Phinney, Brett S; Lucas, William J
2009-02-01
Increasing evidence suggests that proteins present in the angiosperm sieve tube system play an important role in the long distance signaling system of plants. To identify the nature of these putatively non-cell-autonomous proteins, we adopted a large scale proteomics approach to analyze pumpkin phloem exudates. Phloem proteins were fractionated by fast protein liquid chromatography using both anion and cation exchange columns and then either in-solution or in-gel digested following further separation by SDS-PAGE. A total of 345 LC-MS/MS data sets were analyzed using a combination of Mascot and X!Tandem against the NCBI non-redundant green plant database and an extensive Cucurbit maxima expressed sequence tag database. In this analysis, 1,209 different consensi were obtained of which 1,121 could be annotated from GenBank and BLAST search analyses against three plant species, Arabidopsis thaliana, rice (Oryza sativa), and poplar (Populus trichocarpa). Gene ontology (GO) enrichment analyses identified sets of phloem proteins that function in RNA binding, mRNA translation, ubiquitin-mediated proteolysis, and macromolecular and vesicle trafficking. Our findings indicate that protein synthesis and turnover, processes that were thought to be absent in enucleate sieve elements, likely occur within the angiosperm phloem translocation stream. In addition, our GO analysis identified a set of phloem proteins that are associated with the GO term "embryonic development ending in seed dormancy"; this finding raises the intriguing question as to whether the phloem may exert some level of control over seed development. The universal significance of the phloem proteome was highlighted by conservation of the phloem proteome in species as diverse as monocots (rice), eudicots (Arabidopsis and pumpkin), and trees (poplar). These results are discussed from the perspective of the role played by the phloem proteome as an integral component of the whole plant communication system.
Swomley, Aaron M; Triplett, Judy C; Keeney, Jeriel T; Warrier, Govind; Pearson, Kevin J; Mattison, Julie A; de Cabo, Rafael; Cai, Jian; Klein, Jon B; Butterfield, D Allan
2017-01-01
A diet consisting of a high intake of saturated fat and refined sugars is characteristic of a Western-diet and has been shown to have a substantial negative effect on human health. Expression proteomics were used to investigate changes to the parietal lobe proteome of rhesus monkeys consuming either a high fat and sugar (HFS) diet, a HFS diet supplemented with resveratrol (HFS+RSV), or a healthy control diet for 2 years. Here we discuss the modifications in the levels of 12 specific proteins involved in various cellular systems including metabolism, neurotransmission, structural integrity, and general cellular signaling following a nutritional intervention. Our results contribute to a better understanding of the mechanisms by which resveratrol functions through the up- or down-regulation of proteins in different cellular sub-systems to affect the overall health of the brain. Copyright © 2016 Elsevier Inc. All rights reserved.
Guarnieri, Michael T.; Nag, Ambarish; Smolinski, Sharon L.; Darzins, Al; Seibert, Michael; Pienkos, Philip T.
2011-01-01
Biofuels derived from algal lipids represent an opportunity to dramatically impact the global energy demand for transportation fuels. Systems biology analyses of oleaginous algae could greatly accelerate the commercialization of algal-derived biofuels by elucidating the key components involved in lipid productivity and leading to the initiation of hypothesis-driven strain-improvement strategies. However, higher-level systems biology analyses, such as transcriptomics and proteomics, are highly dependent upon available genomic sequence data, and the lack of these data has hindered the pursuit of such analyses for many oleaginous microalgae. In order to examine the triacylglycerol biosynthetic pathway in the unsequenced oleaginous microalga, Chlorella vulgaris, we have established a strategy with which to bypass the necessity for genomic sequence information by using the transcriptome as a guide. Our results indicate an upregulation of both fatty acid and triacylglycerol biosynthetic machinery under oil-accumulating conditions, and demonstrate the utility of a de novo assembled transcriptome as a search model for proteomic analysis of an unsequenced microalga. PMID:22043295
A TRPV2 interactome-based signature for prognosis in glioblastoma patients.
Doñate-Macián, Pau; Gómez, Antonio; Dégano, Irene R; Perálvarez-Marín, Alex
2018-04-06
Proteomics aids to the discovery and expansion of protein-protein interaction networks, which are key to understand molecular mechanisms in physiology and physiopathology, but also to infer protein function in a guilt-by-association fashion. In this study we use a systematic protein-protein interaction membrane yeast two-hybrid method to expand the interactome of TRPV2, a cation channel related to nervous system development. After validation of the interactome in silico , we define a TRPV2-interactome signature combining proteomics with the available physio-pathological data in Disgenet to find interactome-disease associations, highlighting nervous system disorders and neoplasms. The TRPV2-interactome signature against available experimental data is capable of discriminating overall risk in glioblastoma multiforme prognosis, progression, recurrence, and chemotherapy resistance. Beyond the impact on glioblastoma physiopathology, this study shows that combining systematic proteomics with in silico methods and available experimental data is key to open new perspectives to define novel biomarkers for diagnosis, prognosis and therapeutics in disease.
A TRPV2 interactome-based signature for prognosis in glioblastoma patients
Dégano, Irene R.; Perálvarez-Marín, Alex
2018-01-01
Proteomics aids to the discovery and expansion of protein-protein interaction networks, which are key to understand molecular mechanisms in physiology and physiopathology, but also to infer protein function in a guilt-by-association fashion. In this study we use a systematic protein-protein interaction membrane yeast two-hybrid method to expand the interactome of TRPV2, a cation channel related to nervous system development. After validation of the interactome in silico, we define a TRPV2-interactome signature combining proteomics with the available physio-pathological data in Disgenet to find interactome-disease associations, highlighting nervous system disorders and neoplasms. The TRPV2-interactome signature against available experimental data is capable of discriminating overall risk in glioblastoma multiforme prognosis, progression, recurrence, and chemotherapy resistance. Beyond the impact on glioblastoma physiopathology, this study shows that combining systematic proteomics with in silico methods and available experimental data is key to open new perspectives to define novel biomarkers for diagnosis, prognosis and therapeutics in disease. PMID:29719613
PNAC: a protein nucleolar association classifier
2011-01-01
Background Although primarily known as the site of ribosome subunit production, the nucleolus is involved in numerous and diverse cellular processes. Recent large-scale proteomics projects have identified thousands of human proteins that associate with the nucleolus. However, in most cases, we know neither the fraction of each protein pool that is nucleolus-associated nor whether their association is permanent or conditional. Results To describe the dynamic localisation of proteins in the nucleolus, we investigated the extent of nucleolar association of proteins by first collating an extensively curated literature-derived dataset. This dataset then served to train a probabilistic predictor which integrates gene and protein characteristics. Unlike most previous experimental and computational studies of the nucleolar proteome that produce large static lists of nucleolar proteins regardless of their extent of nucleolar association, our predictor models the fluidity of the nucleolus by considering different classes of nucleolar-associated proteins. The new method predicts all human proteins as either nucleolar-enriched, nucleolar-nucleoplasmic, nucleolar-cytoplasmic or non-nucleolar. Leave-one-out cross validation tests reveal sensitivity values for these four classes ranging from 0.72 to 0.90 and positive predictive values ranging from 0.63 to 0.94. The overall accuracy of the classifier was measured to be 0.85 on an independent literature-based test set and 0.74 using a large independent quantitative proteomics dataset. While the three nucleolar-association groups display vastly different Gene Ontology biological process signatures and evolutionary characteristics, they collectively represent the most well characterised nucleolar functions. Conclusions Our proteome-wide classification of nucleolar association provides a novel representation of the dynamic content of the nucleolus. This model of nucleolar localisation thus increases the coverage while providing accurate and specific annotations of the nucleolar proteome. It will be instrumental in better understanding the central role of the nucleolus in the cell and its interaction with other subcellular compartments. PMID:21272300
Mapping Proteome-Wide Interactions of Reactive Chemicals Using Chemoproteomic Platforms
Counihan, Jessica L.; Ford, Breanna; Nomura, Daniel K.
2015-01-01
A large number of pharmaceuticals, endogenous metabolites, and environmental chemicals act through covalent mechanisms with protein targets. Yet, their specific interactions with the proteome still remain poorly defined for most of these reactive chemicals. Deciphering direct protein targets of reactive small-molecules is critical in understanding their biological action, off-target effects, potential toxicological liabilities, and development of safer and more selective agents. Chemoproteomic technologies have arisen as a powerful strategy that enable the assessment of proteome-wide interactions of these irreversible agents directly in complex biological systems. We review here several chemoproteomic strategies that have facilitated our understanding of specific protein interactions of irreversibly-acting pharmaceuticals, endogenous metabolites, and environmental electrophiles to reveal novel pharmacological, biological, and toxicological mechanisms. PMID:26647369
Development of a proteomic approach to monitor protein synthesis in mycotoxin producing moulds.
Milles, J; Krämer, J; Prange, A
2007-12-01
In general, proteome studies compare different states of metabolism to investigate external or internal influences on protein expression. In the context of mycotoxin production the method could open another view on this complex and could be helpful to gain knowledge about proteins which are involved in metabolism (enzymes, transporters). In this short technical report, we describe a new protocol suitable for protein preparation for whole proteome analysis ofFusarium graminearum. Cell lysis was performed by grinding the mycelium with liquid nitrogen. Proteins were extracted with TCA/acetone and then cleaned; the isolated proteins were separated in a 2D-gel electrophoresis system (BioRad) using different pH gradients. The protocol established seems also generally applicable for other mycotoxin producing fungi.
Kim, Bong-Woo; Lee, Chang Seok; Yi, Jae-Sung; Lee, Joo-Hyung; Lee, Joong-Won; Choo, Hyo-Jung; Jung, Soon-Young; Kim, Min-Sik; Lee, Sang-Won; Lee, Myung-Shik; Yoon, Gyesoon; Ko, Young-Gyu
2010-12-01
Although accumulating proteomic analyses have supported the fact that mitochondrial oxidative phosphorylation (OXPHOS) complexes are localized in lipid rafts, which mediate cell signaling, immune response and host-pathogen interactions, there has been no in-depth study of the physiological functions of lipid-raft OXPHOS complexes. Here, we show that many subunits of OXPHOS complexes were identified from the lipid rafts of human adipocytes, C2C12 myotubes, Jurkat cells and surface biotin-labeled Jurkat cells via shotgun proteomic analysis. We discuss the findings of OXPHOS complexes in lipid rafts, the role of the surface ATP synthase complex as a receptor for various ligands and extracellular superoxide generation by plasma membrane oxidative phosphorylation complexes.
Completed | Office of Cancer Clinical Proteomics Research
Prior to the current Clinical Proteomic Tumor Analysis Consortium (CPTAC), previously funded initiatives associated with clinical proteomics research included: Clinical Proteomic Tumor Analysis Consortium (CPTAC 2.0) Clinical Proteomic Technologies for Cancer Initiative (CPTC) Mouse Proteomic Technologies Initiative
Harden, Charlotte J; Perez-Carrion, Kristine; Babakordi, Zara; Plummer, Sue F; Hepburn, Natalie; Barker, Margo E; Wright, Phillip C; Evans, Caroline A; Corfe, Bernard M
2012-06-06
Current measurement of appetite depends upon tools that are either subjective (visual analogue scales), or invasive (blood). Saliva is increasingly recognised as a valuable resource for biomarker analysis. Proteomics workflows may provide alternative means for the assessment of appetitive response. The study aimed to assess the potential value of the salivary proteome to detect novel biomarkers of appetite using an iTRAQ-based workflow. Diurnal variation of salivary protein concentrations was assessed. A randomised, controlled, crossover study examined the effects on the salivary proteome of isocaloric doses of various long chain fatty acid (LCFA) oil emulsions compared to no treatment (NT). Fasted males provided saliva samples before and following NT or dosing with LCFA emulsions. The oil component of the DHA emulsion contained predominantly docosahexaenoic acid and the oil component of OA contained predominantly oleic acid. Several proteins were present in significantly (p<0.05) different quantities in saliva samples taken following treatments compared to fasting samples. DHA caused alterations in thioredoxin and serpin B4 relative to OA and NT. A further study evaluated energy intake (EI) in response to LCFA in conjunction with subjective appetite scoring. DHA was associated with significantly lower EI relative to NT and OA (p=0.039). The collective data suggest investigation of salivary proteome may be of value in appetitive response. This article is part of a Special Issue entitled: Proteomics: The clinical link. Copyright © 2011 Elsevier B.V. All rights reserved.
Chemical proteomics approaches for identifying the cellular targets of natural products
Sieber, S. A.
2016-01-01
Covering: 2010 up to 2016 Deconvoluting the mode of action of natural products and drugs remains one of the biggest challenges in chemistry and biology today. Chemical proteomics is a growing area of chemical biology that seeks to design small molecule probes to understand protein function. In the context of natural products, chemical proteomics can be used to identify the protein binding partners or targets of small molecules in live cells. Here, we highlight recent examples of chemical probes based on natural products and their application for target identification. The review focuses on probes that can be covalently linked to their target proteins (either via intrinsic chemical reactivity or via the introduction of photocrosslinkers), and can be applied “in situ” – in living systems rather than cell lysates. We also focus here on strategies that employ a click reaction, the copper-catalysed azide–alkyne cycloaddition reaction (CuAAC), to allow minimal functionalisation of natural product scaffolds with an alkyne or azide tag. We also discuss ‘competitive mode’ approaches that screen for natural products that compete with a well-characterised chemical probe for binding to a particular set of protein targets. Fuelled by advances in mass spectrometry instrumentation and bioinformatics, many modern strategies are now embracing quantitative proteomics to help define the true interacting partners of probes, and we highlight the opportunities this rapidly evolving technology provides in chemical proteomics. Finally, some of the limitations and challenges of chemical proteomics approaches are discussed. PMID:27098809
DOE Office of Scientific and Technical Information (OSTI.GOV)
Omasits, U.; Quebatte, Maxime; Stekhoven, Daniel J.
2013-11-01
Prokaryotes, due to their moderate complexity, are particularly amenable to the comprehensive identification of the protein repertoire expressed under different conditions. We applied a generic strategy to identify a complete expressed prokaryotic proteome, which is based on the analysis of RNA and proteins extracted from matched samples. Saturated transcriptome profiling by RNA-seq provided an endpoint estimate of the protein-coding genes expressed under two conditions which mimic the interaction of Bartonella henselae with its mammalian host. Directed shotgun proteomics experiments were carried out on four subcellular fractions. By specifically targeting proteins which are short, basic, low abundant, and membrane localized, wemore » could eliminate their initial underrepresentation compared to the estimated endpoint. A total of 1250 proteins were identified with an estimated false discovery rate below 1%. This represents 85% of all distinct annotated proteins and ~90% of the expressed protein-coding genes. Genes that were detected at the transcript but not protein level, were found to be highly enriched in several genomic islands. Furthermore, genes that lacked an ortholog and a functional annotation were not detected at the protein level; these may represent examples of overprediction in genome annotations. A dramatic membrane proteome reorganization was observed, including differential regulation of autotransporters, adhesins, and hemin binding proteins. Particularly noteworthy was the complete membrane proteome coverage, which included expression of all members of the VirB/D4 type IV secretion system, a key virulence factor.« less
Chemical proteomics approaches for identifying the cellular targets of natural products.
Wright, M H; Sieber, S A
2016-05-04
Covering: 2010 up to 2016Deconvoluting the mode of action of natural products and drugs remains one of the biggest challenges in chemistry and biology today. Chemical proteomics is a growing area of chemical biology that seeks to design small molecule probes to understand protein function. In the context of natural products, chemical proteomics can be used to identify the protein binding partners or targets of small molecules in live cells. Here, we highlight recent examples of chemical probes based on natural products and their application for target identification. The review focuses on probes that can be covalently linked to their target proteins (either via intrinsic chemical reactivity or via the introduction of photocrosslinkers), and can be applied "in situ" - in living systems rather than cell lysates. We also focus here on strategies that employ a click reaction, the copper-catalysed azide-alkyne cycloaddition reaction (CuAAC), to allow minimal functionalisation of natural product scaffolds with an alkyne or azide tag. We also discuss 'competitive mode' approaches that screen for natural products that compete with a well-characterised chemical probe for binding to a particular set of protein targets. Fuelled by advances in mass spectrometry instrumentation and bioinformatics, many modern strategies are now embracing quantitative proteomics to help define the true interacting partners of probes, and we highlight the opportunities this rapidly evolving technology provides in chemical proteomics. Finally, some of the limitations and challenges of chemical proteomics approaches are discussed.
Omasits, Ulrich; Quebatte, Maxime; Stekhoven, Daniel J.; Fortes, Claudia; Roschitzki, Bernd; Robinson, Mark D.; Dehio, Christoph; Ahrens, Christian H.
2013-01-01
Prokaryotes, due to their moderate complexity, are particularly amenable to the comprehensive identification of the protein repertoire expressed under different conditions. We applied a generic strategy to identify a complete expressed prokaryotic proteome, which is based on the analysis of RNA and proteins extracted from matched samples. Saturated transcriptome profiling by RNA-seq provided an endpoint estimate of the protein-coding genes expressed under two conditions which mimic the interaction of Bartonella henselae with its mammalian host. Directed shotgun proteomics experiments were carried out on four subcellular fractions. By specifically targeting proteins which are short, basic, low abundant, and membrane localized, we could eliminate their initial underrepresentation compared to the estimated endpoint. A total of 1250 proteins were identified with an estimated false discovery rate below 1%. This represents 85% of all distinct annotated proteins and ∼90% of the expressed protein-coding genes. Genes that were detected at the transcript but not protein level, were found to be highly enriched in several genomic islands. Furthermore, genes that lacked an ortholog and a functional annotation were not detected at the protein level; these may represent examples of overprediction in genome annotations. A dramatic membrane proteome reorganization was observed, including differential regulation of autotransporters, adhesins, and hemin binding proteins. Particularly noteworthy was the complete membrane proteome coverage, which included expression of all members of the VirB/D4 type IV secretion system, a key virulence factor. PMID:23878158
Identification of new intrinsic proteins in Arabidopsis plasma membrane proteome.
Marmagne, Anne; Rouet, Marie-Aude; Ferro, Myriam; Rolland, Norbert; Alcon, Carine; Joyard, Jacques; Garin, Jérome; Barbier-Brygoo, Hélène; Ephritikhine, Geneviève
2004-07-01
Identification and characterization of anion channel genes in plants represent a goal for a better understanding of their central role in cell signaling, osmoregulation, nutrition, and metabolism. Though channel activities have been well characterized in plasma membrane by electrophysiology, the corresponding molecular entities are little documented. Indeed, the hydrophobic protein equipment of plant plasma membrane still remains largely unknown, though several proteomic approaches have been reported. To identify new putative transport systems, we developed a new proteomic strategy based on mass spectrometry analyses of a plasma membrane fraction enriched in hydrophobic proteins. We produced from Arabidopsis cell suspensions a highly purified plasma membrane fraction and characterized it in detail by immunological and enzymatic tests. Using complementary methods for the extraction of hydrophobic proteins and mass spectrometry analyses on mono-dimensional gels, about 100 proteins have been identified, 95% of which had never been found in previous proteomic studies. The inventory of the plasma membrane proteome generated by this approach contains numerous plasma membrane integral proteins, one-third displaying at least four transmembrane segments. The plasma membrane localization was confirmed for several proteins, therefore validating such proteomic strategy. An in silico analysis shows a correlation between the putative functions of the identified proteins and the expected roles for plasma membrane in transport, signaling, cellular traffic, and metabolism. This analysis also reveals 10 proteins that display structural properties compatible with transport functions and will constitute interesting targets for further functional studies.
Li, Ginny X H; Vogel, Christine; Choi, Hyungwon
2018-06-07
While tandem mass spectrometry can detect post-translational modifications (PTM) at the proteome scale, reported PTM sites are often incomplete and include false positives. Computational approaches can complement these datasets by additional predictions, but most available tools use prediction models pre-trained for single PTM type by the developers and it remains a difficult task to perform large-scale batch prediction for multiple PTMs with flexible user control, including the choice of training data. We developed an R package called PTMscape which predicts PTM sites across the proteome based on a unified and comprehensive set of descriptors of the physico-chemical microenvironment of modified sites, with additional downstream analysis modules to test enrichment of individual or pairs of PTMs in protein domains. PTMscape is flexible in the ability to process any major modifications, such as phosphorylation and ubiquitination, while achieving the sensitivity and specificity comparable to single-PTM methods and outperforming other multi-PTM tools. Applying this framework, we expanded proteome-wide coverage of five major PTMs affecting different residues by prediction, especially for lysine and arginine modifications. Using a combination of experimentally acquired sites (PSP) and newly predicted sites, we discovered that the crosstalk among multiple PTMs occur more frequently than by random chance in key protein domains such as histone, protein kinase, and RNA recognition motifs, spanning various biological processes such as RNA processing, DNA damage response, signal transduction, and regulation of cell cycle. These results provide a proteome-scale analysis of crosstalk among major PTMs and can be easily extended to other types of PTM.
Differentially delayed root proteome responses to salt stress in sugar cane varieties.
Pacheco, Cinthya Mirella; Pestana-Calsa, Maria Clara; Gozzo, Fabio Cesar; Mansur Custodio Nogueira, Rejane Jurema; Menossi, Marcelo; Calsa, Tercilio
2013-12-06
Soil salinity is a limiting factor to sugar cane crop development, although in general plants present variable mechanisms of tolerance to salinity stress. The molecular basis underlying these mechanisms can be inferred by using proteomic analysis. Thus, the objective of this work was to identify differentially expressed proteins in sugar cane plants submitted to salinity stress. For that, a greenhouse experiment was established with four sugar cane varieties and two salt conditions, 0 mM (control) and 200 mM NaCl. Physiological and proteomics analyses were performed after 2 and 72 h of stress induction by salt. Distinct physiological responses to salinity stress were observed in the varieties and linked to tolerance mechanisms. In proteomic analysis, the roots soluble protein fraction was extracted, quantified, and analyzed through bidimensional electrophoresis. Gel images analyses were done computationally, where in each contrast only one variable was considered (salinity condition or variety). Differential spots were excised, digested by trypsin, and identified via mass spectrometry. The tolerant variety RB867515 showed the highest accumulation of proteins involved in growth, development, carbohydrate and energy metabolism, reactive oxygen species metabolization, protein protection, and membrane stabilization after 2 h of stress. On the other hand, the presence of these proteins in the sensitive variety was verified only in stress treatment after 72 h. These data indicate that these stress responses pathways play a role in the tolerance to salinity in sugar cane, and their effectiveness for phenotypical tolerance depends on early stress detection and activation of the coding genes expression.