The November 1, 2017 issue of Cancer Research is dedicated to a collection of computational resource papers in genomics, proteomics, animal models, imaging, and clinical subjects for non-bioinformaticists looking to incorporate computing tools into their work. Scientists at Pacific Northwest National Laboratory have developed P-MartCancer, an open, web-based interactive software tool that enables statistical analyses of peptide or protein data generated from mass-spectrometry (MS)-based global proteomics experiments.
Brusniak, Mi-Youn; Bodenmiller, Bernd; Campbell, David; Cooke, Kelly; Eddes, James; Garbutt, Andrew; Lau, Hollis; Letarte, Simon; Mueller, Lukas N; Sharma, Vagisha; Vitek, Olga; Zhang, Ning; Aebersold, Ruedi; Watts, Julian D
2008-01-01
Background Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. Conclusion The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field. PMID:19087345
2005-01-01
proteomic gel analyses. The research group has explored the use of chemodescriptors calculated using high-level ab initio quantum chemical basis sets...descriptors that characterize the entire proteomics map, local descriptors that characterize a subset of the proteins present in the gel, and spectrum...techniques for analyzing the full set of proteins present in a proteomics map. 14. SUBJECT TERMS 1S. NUMBER OF PAGES Topological indices
Computer aided manual validation of mass spectrometry-based proteomic data.
Curran, Timothy G; Bryson, Bryan D; Reigelhaupt, Michael; Johnson, Hannah; White, Forest M
2013-06-15
Advances in mass spectrometry-based proteomic technologies have increased the speed of analysis and the depth provided by a single analysis. Computational tools to evaluate the accuracy of peptide identifications from these high-throughput analyses have not kept pace with technological advances; currently the most common quality evaluation methods are based on statistical analysis of the likelihood of false positive identifications in large-scale data sets. While helpful, these calculations do not consider the accuracy of each identification, thus creating a precarious situation for biologists relying on the data to inform experimental design. Manual validation is the gold standard approach to confirm accuracy of database identifications, but is extremely time-intensive. To palliate the increasing time required to manually validate large proteomic datasets, we provide computer aided manual validation software (CAMV) to expedite the process. Relevant spectra are collected, catalogued, and pre-labeled, allowing users to efficiently judge the quality of each identification and summarize applicable quantitative information. CAMV significantly reduces the burden associated with manual validation and will hopefully encourage broader adoption of manual validation in mass spectrometry-based proteomics. Copyright © 2013 Elsevier Inc. All rights reserved.
Han, Mee-Jung; Yun, Hongseok; Lee, Jeong Wook; Lee, Yu Hyun; Lee, Sang Yup; Yoo, Jong-Shin; Kim, Jin Young; Kim, Jihyun F; Hur, Cheol-Goo
2011-04-01
Escherichia coli K-12 and B strains have most widely been employed for scientific studies as well as industrial applications. Recently, the complete genome sequences of two representative descendants of E. coli B strains, REL606 and BL21(DE3), have been determined. Here, we report the subproteome reference maps of E. coli B REL606 by analyzing cytoplasmic, periplasmic, inner and outer membrane, and extracellular proteomes based on the genome information using experimental and computational approaches. Among the total of 3487 spots, 651 proteins including 410 non-redundant proteins were identified and characterized by 2-DE and LC-MS/MS; they include 440 cytoplasmic, 45 periplasmic, 50 inner membrane, 61 outer membrane, and 55 extracellular proteins. In addition, subcellular localizations of all 4205 ORFs of E. coli B were predicted by combined computational prediction methods. The subcellular localizations of 1812 (43.09%) proteins of currently unknown function were newly assigned. The results of computational prediction were also compared with the experimental results, showing that overall precision and recall were 92.16 and 92.16%, respectively. This work represents the most comprehensive analyses of the subproteomes of E. coli B, and will be useful as a reference for proteome profiling studies under various conditions. The complete proteome data are available online (http://ecolib.kaist.ac.kr). Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline*
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W.; Moritz, Robert L.
2015-01-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. PMID:25418363
Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W; Moritz, Robert L
2015-02-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Computational biology for ageing
Wieser, Daniela; Papatheodorou, Irene; Ziehm, Matthias; Thornton, Janet M.
2011-01-01
High-throughput genomic and proteomic technologies have generated a wealth of publicly available data on ageing. Easy access to these data, and their computational analysis, is of great importance in order to pinpoint the causes and effects of ageing. Here, we provide a description of the existing databases and computational tools on ageing that are available for researchers. We also describe the computational approaches to data interpretation in the field of ageing including gene expression, comparative and pathway analyses, and highlight the challenges for future developments. We review recent biological insights gained from applying bioinformatics methods to analyse and interpret ageing data in different organisms, tissues and conditions. PMID:21115530
Efficient visualization of high-throughput targeted proteomics experiments: TAPIR.
Röst, Hannes L; Rosenberger, George; Aebersold, Ruedi; Malmström, Lars
2015-07-15
Targeted mass spectrometry comprises a set of powerful methods to obtain accurate and consistent protein quantification in complex samples. To fully exploit these techniques, a cross-platform and open-source software stack based on standardized data exchange formats is required. We present TAPIR, a fast and efficient Python visualization software for chromatograms and peaks identified in targeted proteomics experiments. The input formats are open, community-driven standardized data formats (mzML for raw data storage and TraML encoding the hierarchical relationships between transitions, peptides and proteins). TAPIR is scalable to proteome-wide targeted proteomics studies (as enabled by SWATH-MS), allowing researchers to visualize high-throughput datasets. The framework integrates well with existing automated analysis pipelines and can be extended beyond targeted proteomics to other types of analyses. TAPIR is available for all computing platforms under the 3-clause BSD license at https://github.com/msproteomicstools/msproteomicstools. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Lavallée-Adam, Mathieu; Rauniyar, Navin; McClatchy, Daniel B; Yates, John R
2014-12-05
The majority of large-scale proteomics quantification methods yield long lists of quantified proteins that are often difficult to interpret and poorly reproduced. Computational approaches are required to analyze such intricate quantitative proteomics data sets. We propose a statistical approach to computationally identify protein sets (e.g., Gene Ontology (GO) terms) that are significantly enriched with abundant proteins with reproducible quantification measurements across a set of replicates. To this end, we developed PSEA-Quant, a protein set enrichment analysis algorithm for label-free and label-based protein quantification data sets. It offers an alternative approach to classic GO analyses, models protein annotation biases, and allows the analysis of samples originating from a single condition, unlike analogous approaches such as GSEA and PSEA. We demonstrate that PSEA-Quant produces results complementary to GO analyses. We also show that PSEA-Quant provides valuable information about the biological processes involved in cystic fibrosis using label-free protein quantification of a cell line expressing a CFTR mutant. Finally, PSEA-Quant highlights the differences in the mechanisms taking place in the human, rat, and mouse brain frontal cortices based on tandem mass tag quantification. Our approach, which is available online, will thus improve the analysis of proteomics quantification data sets by providing meaningful biological insights.
2015-01-01
The majority of large-scale proteomics quantification methods yield long lists of quantified proteins that are often difficult to interpret and poorly reproduced. Computational approaches are required to analyze such intricate quantitative proteomics data sets. We propose a statistical approach to computationally identify protein sets (e.g., Gene Ontology (GO) terms) that are significantly enriched with abundant proteins with reproducible quantification measurements across a set of replicates. To this end, we developed PSEA-Quant, a protein set enrichment analysis algorithm for label-free and label-based protein quantification data sets. It offers an alternative approach to classic GO analyses, models protein annotation biases, and allows the analysis of samples originating from a single condition, unlike analogous approaches such as GSEA and PSEA. We demonstrate that PSEA-Quant produces results complementary to GO analyses. We also show that PSEA-Quant provides valuable information about the biological processes involved in cystic fibrosis using label-free protein quantification of a cell line expressing a CFTR mutant. Finally, PSEA-Quant highlights the differences in the mechanisms taking place in the human, rat, and mouse brain frontal cortices based on tandem mass tag quantification. Our approach, which is available online, will thus improve the analysis of proteomics quantification data sets by providing meaningful biological insights. PMID:25177766
Halligan, Brian D.; Geiger, Joey F.; Vallejos, Andrew K.; Greene, Andrew S.; Twigger, Simon N.
2009-01-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step by step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center website (http://proteomics.mcw.edu/vipdac). PMID:19358578
Halligan, Brian D; Geiger, Joey F; Vallejos, Andrew K; Greene, Andrew S; Twigger, Simon N
2009-06-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step-by-step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center Web site ( http://proteomics.mcw.edu/vipdac ).
Hoehenwarter, Wolfgang; Larhlimi, Abdelhalim; Hummel, Jan; Egelhofer, Volker; Selbig, Joachim; van Dongen, Joost T; Wienkoop, Stefanie; Weckwerth, Wolfram
2011-07-01
Mass Accuracy Precursor Alignment is a fast and flexible method for comparative proteome analysis that allows the comparison of unprecedented numbers of shotgun proteomics analyses on a personal computer in a matter of hours. We compared 183 LC-MS analyses and more than 2 million MS/MS spectra and could define and separate the proteomic phenotypes of field grown tubers of 12 tetraploid cultivars of the crop plant Solanum tuberosum. Protein isoforms of patatin as well as other major gene families such as lipoxygenase and cysteine protease inhibitor that regulate tuber development were found to be the primary source of variability between the cultivars. This suggests that differentially expressed protein isoforms modulate genotype specific tuber development and the plant phenotype. We properly assigned the measured abundance of tryptic peptides to different protein isoforms that share extensive stretches of primary structure and thus inferred their abundance. Peptides unique to different protein isoforms were used to classify the remaining peptides assigned to the entire subset of isoforms based on a common abundance profile using multivariate statistical procedures. We identified nearly 4000 proteins which we used for quantitative functional annotation making this the most extensive study of the tuber proteome to date.
Proteomic analysis of albumin and globulin fractions of pea (Pisum sativum L.) seeds.
Dziuba, Jerzy; Szerszunowicz, Iwona; Nałęcz, Dorota; Dziuba, Marta
2014-01-01
Proteomic analysis is emerging as a highly useful tool in food research, including studies of food allergies. Two-dimensional gel electrophoresis involving isoelectric focusing and sodium dodecyl sulfate polyacrylamide gel electrophoresis is the most effective method of separating hundreds or even thousands of proteins. In this study, albumin and globulin tractions of pea seeds cv. Ramrod were subjected to proteomic analysis. Selected potentially alergenic proteins were identified based on their molecular weights and isoelectric points. Pea seeds (Pisum sativum L.) cv. Ramrod harvested over a period of two years (Plant Breeding Station in Piaski-Szelejewo) were used in the experiment. The isolated albumins, globulins and legumin and vicilin fractions of globulins were separated by two-dimensional gel electrophoresis. Proteomic images were analysed in the ImageMaster 2D Platinum program with the use of algorithms from the Melanie application. The relative content, isoelectric points and molecular weights were computed for all identified proteins. Electrophoregrams were analysed by matching spot positions from three independent replications. The proteomes of albumins, globulins and legumin and vicilin fractions of globulins produced up to several hundred spots (proteins). Spots most characteristic of a given fraction were identified by computer analysis and spot matching. The albumin proteome accumulated spots of relatively high intensity over a broad range of pi values of ~4.2-8.1 in 3 molecular weight (MW) ranges: I - high molecular-weight albumins with MW of ~50-110 kDa, II - average molecular-weight albumins with MW of ~20-35 kDa, and III - low molecular-weight albumins with MW of ~13-17 kDa. 2D gel electrophoregrams revealed the presence of 81 characteristic spots, including 24 characteristic of legumin and 14 - of vicilin. Two-dimensional gel electrophoresis proved to be a useful tool for identifying pea proteins. Patterns of spots with similar isoelectric points and different molecular weights or spots with different isoelectric points and similar molecular weights play an important role in proteome analysis. The regions characteristic of albumin, globulin and legumin and vicilin fractions of globulin with typical MW and pi values were identified as the results of performed 2D electrophoretic separations of pea proteins. 2D gel electrophoresis of albumins and the vicilin fraction of globulins revealed the presence of 4 and 2 spots, respectively, representing potentially allergenic proteins. They probably corresponded to vicilin fragments synthesized during post-translational modification of the analysed protein.
ProteoSign: an end-user online differential proteomics statistical analysis platform.
Efstathiou, Georgios; Antonakis, Andreas N; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Divanach, Peter; Trudgian, David C; Thomas, Benjamin; Papanikolaou, Nikolas; Aivaliotis, Michalis; Acuto, Oreste; Iliopoulos, Ioannis
2017-07-03
Profiling of proteome dynamics is crucial for understanding cellular behavior in response to intrinsic and extrinsic stimuli and maintenance of homeostasis. Over the last 20 years, mass spectrometry (MS) has emerged as the most powerful tool for large-scale identification and characterization of proteins. Bottom-up proteomics, the most common MS-based proteomics approach, has always been challenging in terms of data management, processing, analysis and visualization, with modern instruments capable of producing several gigabytes of data out of a single experiment. Here, we present ProteoSign, a freely available web application, dedicated in allowing users to perform proteomics differential expression/abundance analysis in a user-friendly and self-explanatory way. Although several non-commercial standalone tools have been developed for post-quantification statistical analysis of proteomics data, most of them are not end-user appealing as they often require very stringent installation of programming environments, third-party software packages and sometimes further scripting or computer programming. To avoid this bottleneck, we have developed a user-friendly software platform accessible via a web interface in order to enable proteomics laboratories and core facilities to statistically analyse quantitative proteomics data sets in a resource-efficient manner. ProteoSign is available at http://bioinformatics.med.uoc.gr/ProteoSign and the source code at https://github.com/yorgodillo/ProteoSign. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Proteomic and Biochemical Analyses of the Cotyledon and Root of Flooding-Stressed Soybean Plants
Komatsu, Setsuko; Makino, Takahiro; Yasue, Hiroshi
2013-01-01
Background Flooding significantly reduces the growth and grain yield of soybean plants. Proteomic and biochemical techniques were used to determine whether the function of cotyledon and root is altered in soybean under flooding stress. Results Two-day-old soybean plants were flooded for 2 days, after which the proteins from root and cotyledon were extracted for proteomic analysis. In response to flooding stress, the abundance of 73 and 28 proteins was significantly altered in the root and cotyledon, respectively. The accumulation of only one protein, 70 kDa heat shock protein (HSP70) (Glyma17g08020.1), increased in both organs following flooding. The ratio of protein abundance of HSP70 and biophoton emission in the cotyledon was higher than those detected in the root under flooding stress. Computed tomography and elemental analyses revealed that flooding stress decreases the number of calcium oxalate crystal the cotyledon, indicating calcium ion was elevated in the cotyledon under flooding stress. Conclusion These results suggest that calcium might play one role through HSP70 in the cotyledon under flooding stress. PMID:23799004
Computational clustering for viral reference proteomes
Chen, Chuming; Huang, Hongzhan; Mazumder, Raja; Natale, Darren A.; McGarvey, Peter B.; Zhang, Jian; Polson, Shawn W.; Wang, Yuqi; Wu, Cathy H.
2016-01-01
Motivation: The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies. Results: We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt’s curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs. Availability and implementation: http://proteininformationresource.org/rps/viruses/ Contact: chenc@udel.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153712
A community proposal to integrate proteomics activities in ELIXIR.
Vizcaíno, Juan Antonio; Walzer, Mathias; Jiménez, Rafael C; Bittremieux, Wout; Bouyssié, David; Carapito, Christine; Corrales, Fernando; Ferro, Myriam; Heck, Albert J R; Horvatovich, Peter; Hubalek, Martin; Lane, Lydie; Laukens, Kris; Levander, Fredrik; Lisacek, Frederique; Novak, Petr; Palmblad, Magnus; Piovesan, Damiano; Pühler, Alfred; Schwämmle, Veit; Valkenborg, Dirk; van Rijswijk, Merlijn; Vondrasek, Jiri; Eisenacher, Martin; Martens, Lennart; Kohlbacher, Oliver
2017-01-01
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on 'The Future of Proteomics in ELIXIR' that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR's existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper.
A community proposal to integrate proteomics activities in ELIXIR
Vizcaíno, Juan Antonio; Walzer, Mathias; Jiménez, Rafael C.; Bittremieux, Wout; Bouyssié, David; Carapito, Christine; Corrales, Fernando; Ferro, Myriam; Heck, Albert J.R.; Horvatovich, Peter; Hubalek, Martin; Lane, Lydie; Laukens, Kris; Levander, Fredrik; Lisacek, Frederique; Novak, Petr; Palmblad, Magnus; Piovesan, Damiano; Pühler, Alfred; Schwämmle, Veit; Valkenborg, Dirk; van Rijswijk, Merlijn; Vondrasek, Jiri; Eisenacher, Martin; Martens, Lennart; Kohlbacher, Oliver
2017-01-01
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on ‘The Future of Proteomics in ELIXIR’ that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR’s existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper. PMID:28713550
Intra- and Extra-cellular Proteome Analyses of Steroid-Producer Mycobacteria.
Barreiro, Carlos; Morales, Alejandro; Vázquez-Iglesias, Inés; Sola-Landa, Alberto
2017-01-01
The importance of the pathogenic mycobacteria has mainly focused the omic analyses on different aspects of their clinical significance. In contrast, those industrially relevant mycobacteria have received less attention, even though the steroids market sales in 2011, in example, were estimated in $8 billion.The extra-cellular proteome, due to its relevance in the sterols processing and uptake; as well as the intra-cellular proteome, because of its role in steroids bioconversion, are the core of the present chapter. As a proof of concept, the obtaining methods for both sub-proteomes of Mycobacterium neoaurum NRRL B-3805, a relevant industrial strain involved in steroids production, have been developed. Thus, procedures and relevant key points of these proteomes analyses are fully described.
MannDB: A microbial annotation database for protein characterization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, C; Lam, M; Smith, J
2006-05-19
MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-sourcemore » tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports.« less
This week, we are excited to announce the launch of the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) Proteogenomics Computational DREAM Challenge. The aim of this Challenge is to encourage the generation of computational methods for extracting information from the cancer proteome and for linking those data to genomic and transcriptomic information. The specific goals are to predict proteomic and phosphoproteomic data from other multiple data types including transcriptomics and genetics.
Vijay, Sonam
2014-01-01
Salivary gland proteins of Anopheles mosquitoes offer attractive targets to understand interactions with sporozoites, blood feeding behavior, homeostasis, and immunological evaluation of malaria vectors and parasite interactions. To date limited studies have been carried out to elucidate salivary proteins of An. stephensi salivary glands. The aim of the present study was to provide detailed analytical attributives of functional salivary gland proteins of urban malaria vector An. stephensi. A proteomic approach combining one-dimensional electrophoresis (1DE), ion trap liquid chromatography mass spectrometry (LC/MS/MS), and computational bioinformatic analysis was adopted to provide the first direct insight into identification and functional characterization of known salivary proteins and novel salivary proteins of An. stephensi. Computational studies by online servers, namely, MASCOT and OMSSA algorithms, identified a total of 36 known salivary proteins and 123 novel proteins analysed by LC/MS/MS. This first report describes a baseline proteomic catalogue of 159 salivary proteins belonging to various categories of signal transduction, regulation of blood coagulation cascade, and various immune and energy pathways of An. stephensi sialotranscriptome by mass spectrometry. Our results may serve as basis to provide a putative functional role of proteins in concept of blood feeding, biting behavior, and other aspects of vector-parasite host interactions for parasite development in anopheline mosquitoes. PMID:25126571
Vijay, Sonam; Rawat, Manmeet; Sharma, Arun
2014-01-01
Salivary gland proteins of Anopheles mosquitoes offer attractive targets to understand interactions with sporozoites, blood feeding behavior, homeostasis, and immunological evaluation of malaria vectors and parasite interactions. To date limited studies have been carried out to elucidate salivary proteins of An. stephensi salivary glands. The aim of the present study was to provide detailed analytical attributives of functional salivary gland proteins of urban malaria vector An. stephensi. A proteomic approach combining one-dimensional electrophoresis (1DE), ion trap liquid chromatography mass spectrometry (LC/MS/MS), and computational bioinformatic analysis was adopted to provide the first direct insight into identification and functional characterization of known salivary proteins and novel salivary proteins of An. stephensi. Computational studies by online servers, namely, MASCOT and OMSSA algorithms, identified a total of 36 known salivary proteins and 123 novel proteins analysed by LC/MS/MS. This first report describes a baseline proteomic catalogue of 159 salivary proteins belonging to various categories of signal transduction, regulation of blood coagulation cascade, and various immune and energy pathways of An. stephensi sialotranscriptome by mass spectrometry. Our results may serve as basis to provide a putative functional role of proteins in concept of blood feeding, biting behavior, and other aspects of vector-parasite host interactions for parasite development in anopheline mosquitoes.
A novel spectral library workflow to enhance protein identifications.
Li, Haomin; Zong, Nobel C; Liang, Xiangbo; Kim, Allen K; Choi, Jeong Ho; Deng, Ning; Zelaya, Ivette; Lam, Maggie; Duan, Huilong; Ping, Peipei
2013-04-09
The innovations in mass spectrometry-based investigations in proteome biology enable systematic characterization of molecular details in pathophysiological phenotypes. However, the process of delineating large-scale raw proteomic datasets into a biological context requires high-throughput data acquisition and processing. A spectral library search engine makes use of previously annotated experimental spectra as references for subsequent spectral analyses. This workflow delivers many advantages, including elevated analytical efficiency and specificity as well as reduced demands in computational capacity. In this study, we created a spectral matching engine to address challenges commonly associated with a library search workflow. Particularly, an improved sliding dot product algorithm, that is robust to systematic drifts of mass measurement in spectra, is introduced. Furthermore, a noise management protocol distinguishes spectra correlation attributed from noise and peptide fragments. It enables elevated separation between target spectral matches and false matches, thereby suppressing the possibility of propagating inaccurate peptide annotations from library spectra to query spectra. Moreover, preservation of original spectra also accommodates user contributions to further enhance the quality of the library. Collectively, this search engine supports reproducible data analyses using curated references, thereby broadening the accessibility of proteomics resources to biomedical investigators. This article is part of a Special Issue entitled: From protein structures to clinical applications. Copyright © 2013 Elsevier B.V. All rights reserved.
Comparative proteomics lends insight into genotype-specific pathogenicity.
Guarnieri, Michael T
2013-09-01
Comparative proteomic analyses have emerged as a powerful tool for the identification of unique biomarkers and mechanisms of pathogenesis. In this issue of Proteomics, Murugaiyan et al. utilize difference gel electrophoresis (DIGE) to examine differential protein expression between nonpathogenic and pathogenic genotypes of Prototheca zopfii, a causative agent in bovine enteritis and mastitis. Their findings provide insights into molecular mechanisms of infection and evolutionary adaptation of pathogenic genotypes, demonstrating the power of comparative proteomic analyses. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi
2017-06-23
The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from genome sequences, though there are over lapped proteins. Based on the demonstrated application of data stored in the database for functional analyses, it is suggested that these data will be useful for analyses of biological mechanisms in soybean. Furthermore, coupled with recent advances in information and communication technology, the usefulness of this database would increase in the analyses of biological mechanisms. Copyright © 2017 Elsevier B.V. All rights reserved.
Computer applications making rapid advances in high throughput microbial proteomics (HTMP).
Anandkumar, Balakrishna; Haga, Steve W; Wu, Hui-Fen
2014-02-01
The last few decades have seen the rise of widely-available proteomics tools. From new data acquisition devices, such as MALDI-MS and 2DE to new database searching softwares, these new products have paved the way for high throughput microbial proteomics (HTMP). These tools are enabling researchers to gain new insights into microbial metabolism, and are opening up new areas of study, such as protein-protein interactions (interactomics) discovery. Computer software is a key part of these emerging fields. This current review considers: 1) software tools for identifying the proteome, such as MASCOT or PDQuest, 2) online databases of proteomes, such as SWISS-PROT, Proteome Web, or the Proteomics Facility of the Pathogen Functional Genomics Resource Center, and 3) software tools for applying proteomic data, such as PSI-BLAST or VESPA. These tools allow for research in network biology, protein identification, functional annotation, target identification/validation, protein expression, protein structural analysis, metabolic pathway engineering and drug discovery.
Proteomics and Systems Biology: Current and Future Applications in the Nutritional Sciences1
Moore, J. Bernadette; Weeks, Mark E.
2011-01-01
In the last decade, advances in genomics, proteomics, and metabolomics have yielded large-scale datasets that have driven an interest in global analyses, with the objective of understanding biological systems as a whole. Systems biology integrates computational modeling and experimental biology to predict and characterize the dynamic properties of biological systems, which are viewed as complex signaling networks. Whereas the systems analysis of disease-perturbed networks holds promise for identification of drug targets for therapy, equally the identified critical network nodes may be targeted through nutritional intervention in either a preventative or therapeutic fashion. As such, in the context of the nutritional sciences, it is envisioned that systems analysis of normal and nutrient-perturbed signaling networks in combination with knowledge of underlying genetic polymorphisms will lead to a future in which the health of individuals will be improved through predictive and preventative nutrition. Although high-throughput transcriptomic microarray data were initially most readily available and amenable to systems analysis, recent technological and methodological advances in MS have contributed to a linear increase in proteomic investigations. It is now commonplace for combined proteomic technologies to generate complex, multi-faceted datasets, and these will be the keystone of future systems biology research. This review will define systems biology, outline current proteomic methodologies, highlight successful applications of proteomics in nutrition research, and discuss the challenges for future applications of systems biology approaches in the nutritional sciences. PMID:22332076
Proteomic analyses of host and pathogen responses during bovine mastitis.
Boehmer, Jamie L
2011-12-01
The pursuit of biomarkers for use as clinical screening tools, measures for early detection, disease monitoring, and as a means for assessing therapeutic responses has steadily evolved in human and veterinary medicine over the past two decades. Concurrently, advances in mass spectrometry have markedly expanded proteomic capabilities for biomarker discovery. While initial mass spectrometric biomarker discovery endeavors focused primarily on the detection of modulated proteins in human tissues and fluids, recent efforts have shifted to include proteomic analyses of biological samples from food animal species. Mastitis continues to garner attention in veterinary research due mainly to affiliated financial losses and food safety concerns over antimicrobial use, but also because there are only a limited number of efficacious mastitis treatment options. Accordingly, comparative proteomic analyses of bovine milk have emerged in recent years. Efforts to prevent agricultural-related food-borne illness have likewise fueled an interest in the proteomic evaluation of several prominent strains of bacteria, including common mastitis pathogens. The interest in establishing biomarkers of the host and pathogen responses during bovine mastitis stems largely from the need to better characterize mechanisms of the disease, to identify reliable biomarkers for use as measures of early detection and drug efficacy, and to uncover potentially novel targets for the development of alternative therapeutics. The following review focuses primarily on comparative proteomic analyses conducted on healthy versus mastitic bovine milk. However, a comparison of the host defense proteome of human and bovine milk and the proteomic analysis of common veterinary pathogens are likewise introduced.
Differentially delayed root proteome responses to salt stress in sugar cane varieties.
Pacheco, Cinthya Mirella; Pestana-Calsa, Maria Clara; Gozzo, Fabio Cesar; Mansur Custodio Nogueira, Rejane Jurema; Menossi, Marcelo; Calsa, Tercilio
2013-12-06
Soil salinity is a limiting factor to sugar cane crop development, although in general plants present variable mechanisms of tolerance to salinity stress. The molecular basis underlying these mechanisms can be inferred by using proteomic analysis. Thus, the objective of this work was to identify differentially expressed proteins in sugar cane plants submitted to salinity stress. For that, a greenhouse experiment was established with four sugar cane varieties and two salt conditions, 0 mM (control) and 200 mM NaCl. Physiological and proteomics analyses were performed after 2 and 72 h of stress induction by salt. Distinct physiological responses to salinity stress were observed in the varieties and linked to tolerance mechanisms. In proteomic analysis, the roots soluble protein fraction was extracted, quantified, and analyzed through bidimensional electrophoresis. Gel images analyses were done computationally, where in each contrast only one variable was considered (salinity condition or variety). Differential spots were excised, digested by trypsin, and identified via mass spectrometry. The tolerant variety RB867515 showed the highest accumulation of proteins involved in growth, development, carbohydrate and energy metabolism, reactive oxygen species metabolization, protein protection, and membrane stabilization after 2 h of stress. On the other hand, the presence of these proteins in the sensitive variety was verified only in stress treatment after 72 h. These data indicate that these stress responses pathways play a role in the tolerance to salinity in sugar cane, and their effectiveness for phenotypical tolerance depends on early stress detection and activation of the coding genes expression.
Genome and proteome annotation: organization, interpretation and integration
Reeves, Gabrielle A.; Talavera, David; Thornton, Janet M.
2008-01-01
Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations. PMID:19019817
An insight into cyanobacterial genomics--a perspective.
Lakshmi, Palaniswamy Thanga Velan
2007-05-20
At the turn of the millennium, cyanobacteria deserve attention to be reviewed to understand the past, present and future. The advent of post genomic research, which encompasses functional genomics, structural genomics, transcriptomics, pharmacogenomics, proteomics and metabolomics that allows a systematic wide approach for biological system studies. Thus by exploiting genomic and associated protein information through computational analyses, the fledging information that are generated by biotechnological analyses, could be well extrapolated to fill in the lacuna of scarce information on cyanobacteria and as an effort this paper attempts to highlights the perspectives available and awakens researcher to concentrate in the field of cyanobacterial informatics.
Targeted proteomics identifies liquid-biopsy signatures for extracapsular prostate cancer
Kim, Yunee; Jeon, Jouhyun; Mejia, Salvador; Yao, Cindy Q; Ignatchenko, Vladimir; Nyalwidhe, Julius O; Gramolini, Anthony O; Lance, Raymond S; Troyer, Dean A; Drake, Richard R; Boutros, Paul C; Semmes, O. John; Kislinger, Thomas
2016-01-01
Biomarkers are rapidly gaining importance in personalized medicine. Although numerous molecular signatures have been developed over the past decade, there is a lack of overlap and many biomarkers fail to validate in independent patient cohorts and hence are not useful for clinical application. For these reasons, identification of novel and robust biomarkers remains a formidable challenge. We combine targeted proteomics with computational biology to discover robust proteomic signatures for prostate cancer. Quantitative proteomics conducted in expressed prostatic secretions from men with extraprostatic and organ-confined prostate cancers identified 133 differentially expressed proteins. Using synthetic peptides, we evaluate them by targeted proteomics in a 74-patient cohort of expressed prostatic secretions in urine. We quantify a panel of 34 candidates in an independent 207-patient cohort. We apply machine-learning approaches to develop clinical predictive models for prostate cancer diagnosis and prognosis. Our results demonstrate that computationally guided proteomics can discover highly accurate non-invasive biomarkers. PMID:27350604
Anticoagulants and the propagation phase of thrombin generation.
Orfeo, Thomas; Gissel, Matthew; Butenas, Saulius; Undas, Anetta; Brummel-Ziedins, Kathleen E; Mann, Kenneth G
2011-01-01
The view that clot time-based assays do not provide a sufficient assessment of an individual's hemostatic competence, especially in the context of anticoagulant therapy, has provoked a search for new metrics, with significant focus directed at techniques that define the propagation phase of thrombin generation. Here we use our deterministic mathematical model of tissue-factor initiated thrombin generation in combination with reconstructions using purified protein components to characterize how the interplay between anticoagulant mechanisms and variable composition of the coagulation proteome result in differential regulation of the propagation phase of thrombin generation. Thrombin parameters were extracted from computationally derived thrombin generation profiles generated using coagulation proteome factor data from warfarin-treated individuals (N = 54) and matching groups of control individuals (N = 37). A computational clot time prolongation value (cINR) was devised that correlated with their actual International Normalized Ratio (INR) values, with differences between individual INR and cINR values shown to derive from the insensitivity of the INR to tissue factor pathway inhibitor (TFPI). The analysis suggests that normal range variation in TFPI levels could be an important contributor to the failure of the INR to adequately reflect the anticoagulated state in some individuals. Warfarin-induced changes in thrombin propagation phase parameters were then compared to those induced by unfractionated heparin, fondaparinux, rivaroxaban, and a reversible thrombin inhibitor. Anticoagulants were assessed at concentrations yielding equivalent cINR values, with each anticoagulant evaluated using 32 unique coagulation proteome compositions. The analyses showed that no anticoagulant recapitulated all features of warfarin propagation phase dynamics; differences in propagation phase effects suggest that anticoagulants that selectively target fXa or thrombin may provoke fewer bleeding episodes. More generally, the study shows that computational modeling of the response of core elements of the coagulation proteome to a physiologically relevant tissue factor stimulus may improve the monitoring of a broad range of anticoagulants.
Computational Omics Pre-Awardees | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce the pre-awardees of the Computational Omics solicitation. Working with NVIDIA Foundation's Compute the Cure initiative and Leidos Biomedical Research Inc., the NCI, through this solicitation, seeks to leverage computational efforts to provide tools for the mining and interpretation of large-scale publicly available ‘omics’ datasets.
Pressurized Pepsin Digestion in Proteomics
López-Ferrer, Daniel; Petritis, Konstantinos; Robinson, Errol W.; Hixson, Kim K.; Tian, Zhixin; Lee, Jung Hwa; Lee, Sang-Won; Tolić, Nikola; Weitz, Karl K.; Belov, Mikhail E.; Smith, Richard D.; Paša-Tolić, Ljiljana
2011-01-01
Integrated top-down bottom-up proteomics combined with on-line digestion has great potential to improve the characterization of protein isoforms in biological systems and is amendable to high throughput proteomics experiments. Bottom-up proteomics ultimately provides the peptide sequences derived from the tandem MS analyses of peptides after the proteome has been digested. Top-down proteomics conversely entails the MS analyses of intact proteins for more effective characterization of genetic variations and/or post-translational modifications. Herein, we describe recent efforts toward efficient integration of bottom-up and top-down LC-MS-based proteomics strategies. Since most proteomics separations utilize acidic conditions, we exploited the compatibility of pepsin (where the optimal digestion conditions are at low pH) for integration into bottom-up and top-down proteomics work flows. Pressure-enhanced pepsin digestions were successfully performed and characterized with several standard proteins in either an off-line mode using a Barocycler or an on-line mode using a modified high pressure LC system referred to as a fast on-line digestion system (FOLDS). FOLDS was tested using pepsin and a whole microbial proteome, and the results were compared against traditional trypsin digestions on the same platform. Additionally, FOLDS was integrated with a RePlay configuration to demonstrate an ultrarapid integrated bottom-up top-down proteomics strategy using a standard mixture of proteins and a monkey pox virus proteome. PMID:20627868
ProteoWizard: open source software for rapid proteomics tools development.
Kessner, Darren; Chambers, Matt; Burke, Robert; Agus, David; Mallick, Parag
2008-11-01
The ProteoWizard software project provides a modular and extensible set of open-source, cross-platform tools and libraries. The tools perform proteomics data analyses; the libraries enable rapid tool creation by providing a robust, pluggable development framework that simplifies and unifies data file access, and performs standard proteomics and LCMS dataset computations. The library contains readers and writers of the mzML data format, which has been written using modern C++ techniques and design principles and supports a variety of platforms with native compilers. The software has been specifically released under the Apache v2 license to ensure it can be used in both academic and commercial projects. In addition to the library, we also introduce a rapidly growing set of companion tools whose implementation helps to illustrate the simplicity of developing applications on top of the ProteoWizard library. Cross-platform software that compiles using native compilers (i.e. GCC on Linux, MSVC on Windows and XCode on OSX) is available for download free of charge, at http://proteowizard.sourceforge.net. This website also provides code examples, and documentation. It is our hope the ProteoWizard project will become a standard platform for proteomics development; consequently, code use, contribution and further development are strongly encouraged.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lopez-Ferrer, Daniel; Petritis, Konstantinos; Robinson, Errol W.
2011-02-01
Integrated top-down bottom-up proteomics combined with online digestion has great potential to improve the characterization of protein isoforms in biological systems and is amendable to highthroughput proteomics experiments. Bottom-up proteomics ultimately provides the peptide sequences derived from the tandem MS analyses of peptides after the proteome has been digested. Top-down proteomics conversely entails the MS analyses of intact proteins for more effective characterization of genetic variations and/or post-translational modifications (PTMs). Herein, we describe recent efforts towards efficient integration of bottom-up and top-down LCMS based proteomic strategies. Since most proteomic platforms (i.e. LC systems) operate in acidic environments, we exploited themore » compatibility of the pepsin (i.e. the enzyme’s natural acidic activity) for the integration of bottom-up and top-down proteomics. Pressure enhanced pepsin digestions were successfully performed and characterized with several standard proteins in either an offline mode using a Barocycler or an online mode using a modified high pressure LC system referred to as a fast online digestion system (FOLDS). FOLDS was tested using pepsin and a whole microbial proteome, and the results compared against traditional trypsin digestions on the same platform. Additionally, FOLDS was integrated with a RePlay configuration to demonstrate an ultra-rapid integrated bottom-up top-down proteomic strategy employing a standard mixture of proteins and a monkey pox virus proteome.« less
Joyce, Brendan; Lee, Danny; Rubio, Alex; Ogurtsov, Aleksey; Alves, Gelio; Yu, Yi-Kuo
2018-03-15
RAId is a software package that has been actively developed for the past 10 years for computationally and visually analyzing MS/MS data. Founded on rigorous statistical methods, RAId's core program computes accurate E-values for peptides and proteins identified during database searches. Making this robust tool readily accessible for the proteomics community by developing a graphical user interface (GUI) is our main goal here. We have constructed a graphical user interface to facilitate the use of RAId on users' local machines. Written in Java, RAId_GUI not only makes easy executions of RAId but also provides tools for data/spectra visualization, MS-product analysis, molecular isotopic distribution analysis, and graphing the retrieval versus the proportion of false discoveries. The results viewer displays and allows the users to download the analyses results. Both the knowledge-integrated organismal databases and the code package (containing source code, the graphical user interface, and a user manual) are available for download at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/raid.html .
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; Ebrahim, Ali; Saunders, Michael A.; Palsson, Bernhard O.
2016-01-01
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models. PMID:27857205
Principles of proteome allocation are revealed using proteomic data and genome-scale models
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; ...
2016-11-18
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
Simulation of two dimensional electrophoresis and tandem mass spectrometry for teaching proteomics.
Fisher, Amanda; Sekera, Emily; Payne, Jill; Craig, Paul
2012-01-01
In proteomics, complex mixtures of proteins are separated (usually by chromatography or electrophoresis) and identified by mass spectrometry. We have created 2DE Tandem MS, a computer program designed for use in the biochemistry, proteomics, or bioinformatics classroom. It contains two simulations-2D electrophoresis and tandem mass spectrometry. The two simulations are integrated together and are designed to teach the concept of proteome analysis of prokaryotic and eukaryotic organisms. 2DE-Tandem MS can be used as a freestanding simulation, or in conjunction with a wet lab, to introduce proteomics in the undergraduate classroom. 2DE Tandem MS is a free program available on Sourceforge at https://sourceforge.net/projects/jbf/. It was developed using Java Swing and functions in Mac OSX, Windows, and Linux, ensuring that every student sees a consistent and informative graphical user interface no matter the computer platform they choose. Java must be installed on the host computer to run 2DE Tandem MS. Example classroom exercises are provided in the Supporting Information. Copyright © 2012 Wiley Periodicals, Inc.
Lo, Andy; Weiner, Joel H; Li, Liang
2013-09-17
Due to limited sample amounts, instrument time considerations, and reagent costs, only a small number of replicate experiments are typically performed for quantitative proteome analyses. Generation of reproducible data that can be readily assessed for consistency within a small number of datasets is critical for accurate quantification. We report our investigation of a strategy using reciprocal isotope labeling of two comparative samples as a tool for determining proteome changes. Reciprocal labeling was evaluated to determine the internal consistency of quantified proteome changes from Escherichia coli grown under aerobic and anaerobic conditions. Qualitatively, the peptide overlap between replicate analyses of the same sample and reverse labeled samples were found to be within 8%. Quantitatively, reciprocal analyses showed only a slight increase in average overall inconsistency when compared with replicate analyses (1.29 vs. 1.24-fold difference). Most importantly, reverse labeling was successfully used to identify spurious values resulting from incorrect peptide identifications and poor peak fitting. After removal of 5% of the peptide data with low reproducibility, a total of 275 differentially expressed proteins (>1.50-fold difference) were consistently identified and were then subjected to bioinformatics analysis. General considerations and guidelines for reciprocal labeling experimental design and biological significance of obtained results are discussed. Copyright © 2013 Elsevier B.V. All rights reserved.
Proteomic Analyses of Ethanol Tolerance in Lactobacillus buchneri NRRL B-30929
USDA-ARS?s Scientific Manuscript database
The Lactobacillus buchneri NRRL B-30929 strain, isolated from a fuel ethanol production facility, exhibits high tolerance to environmental ethanol concentrations. In this study, the ethanol tolerance trait was elucidated at the molecular level by using proteomics comparison and analyses. Cellular p...
Application of proteomics to ecology and population biology.
Karr, T L
2008-02-01
Proteomics is a relatively new scientific discipline that merges protein biochemistry, genome biology and bioinformatics to determine the spatial and temporal expression of proteins in cells, tissues and whole organisms. There has been very little application of proteomics to the fields of behavioral genetics, evolution, ecology and population dynamics, and has only recently been effectively applied to the closely allied fields of molecular evolution and genetics. However, there exists considerable potential for proteomics to impact in areas related to functional ecology; this review will introduce the general concepts and methodologies that define the field of proteomics and compare and contrast the advantages and disadvantages with other methods. Examples of how proteomics can aid, complement and indeed extend the study of functional ecology will be discussed including the main tool of ecological studies, population genetics with an emphasis on metapopulation structure analysis. Because proteomic analyses provide a direct measure of gene expression, it obviates some of the limitations associated with other genomic approaches, such as microarray and EST analyses. Likewise, in conjunction with associated bioinformatics and molecular evolutionary tools, proteomics can provide the foundation of a systems-level integration approach that can enhance ecological studies. It can be envisioned that proteomics will provide important new information on issues specific to metapopulation biology and adaptive processes in nature. A specific example of the application of proteomics to sperm ageing is provided to illustrate the potential utility of the approach.
Beyond the proteome: Mass Spectrometry Special Interest Group (MS-SIG) at ISMB/ECCB 2013
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ryu, Soyoung; Payne, Samuel H.; Schaab, Christoph
2014-07-02
Mass spectrometry special interest group (MS-SIG) aims to bring together experts from the global research community to discuss highlights and challenges in the field of mass spectrometry (MS)-based proteomics and computational biology. The rapid echnological developments in MS-based proteomics have enabled the generation of a large amount of meaningful information on hundreds to thousands of proteins simultaneously from a biological sample; however, the complexity of the MS data require sophisticated computational algorithms and software for data analysis and interpretation. This year’s MS-SIG meeting theme was ‘Beyond the Proteome’ with major focuses on improving protein identification/quantification and using proteomics data tomore » solve interesting problems in systems biology and clinical research.« less
Bioinformatics for spermatogenesis: annotation of male reproduction based on proteomics
Zhou, Tao; Zhou, Zuo-Min; Guo, Xue-Jiang
2013-01-01
Proteomics strategies have been widely used in the field of male reproduction, both in basic and clinical research. Bioinformatics methods are indispensable in proteomics-based studies and are used for data presentation, database construction and functional annotation. In the present review, we focus on the functional annotation of gene lists obtained through qualitative or quantitative methods, summarizing the common and male reproduction specialized proteomics databases. We introduce several integrated tools used to find the hidden biological significance from the data obtained. We further describe in detail the information on male reproduction derived from Gene Ontology analyses, pathway analyses and biomedical analyses. We provide an overview of bioinformatics annotations in spermatogenesis, from gene function to biological function and from biological function to clinical application. On the basis of recently published proteomics studies and associated data, we show that bioinformatics methods help us to discover drug targets for sperm motility and to scan for cancer-testis genes. In addition, we summarize the online resources relevant to male reproduction research for the exploration of the regulation of spermatogenesis. PMID:23852026
Purification and fractionation of membranes for proteomic analyses.
Marmagne, Anne; Salvi, Daniel; Rolland, Norbert; Ephritikhine, Geneviève; Joyard, Jacques; Barbier-Brygoo, Hélène
2006-01-01
Proteomics is a very powerful approach to link the information contained in sequenced genomes, such as Arabidopsis, to the functional knowledge provided by studies of plant cell compartments. However, membrane proteomics remains a challenge. One way to bring into view the complex mixture of proteins present in a membrane is to develop proteomic analyses based on (1) the use of highly purified membrane fractions and (2) fractionation of membrane proteins to retrieve as many proteins as possible (from the most to the less hydrophobic ones). To illustrate such strategies, we choose two types of membranes, the plasma membrane and the chloroplast envelope membranes. Both types of membranes can be prepared in a reasonable degree of purity from different types of tissues: the plasma membrane from cultured cells and the chloroplast envelope membrane from whole plants. This article is restricted to the description of methods for the preparation of highly purified and characterized plant membrane fractions and the subsequent fractionation of these membrane proteins according to simple physicochemical criteria (i.e., chloroform/methanol extraction, alkaline or saline treatments) for further analyses using modern proteomic methodologies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.
Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less
Proteomic Analyses of the Unexplored Sea Anemone Bunodactis verrucosa
Campos, Alexandre; Turkina, Maria V.; Ribeiro, Tiago; Osorio, Hugo; Vasconcelos, Vítor; Antunes, Agostinho
2018-01-01
Cnidarian toxic products, particularly peptide toxins, constitute a promising target for biomedicine research. Indeed, cnidarians are considered as the largest phylum of generally toxic animals. However, research on peptides and toxins of sea anemones is still limited. Moreover, most of the toxins from sea anemones have been discovered by classical purification approaches. Recently, high-throughput methodologies have been used for this purpose but in other Phyla. Hence, the present work was focused on the proteomic analyses of whole-body extract from the unexplored sea anemone Bunodactis verrucosa. The proteomic analyses applied were based on two methods: two-dimensional gel electrophoresis combined with MALDI-TOF/TOF and shotgun proteomic approach. In total, 413 proteins were identified, but only eight proteins were identified from gel-based analyses. Such proteins are mainly involved in basal metabolism and biosynthesis of antibiotics as the most relevant pathways. In addition, some putative toxins including metalloproteinases and neurotoxins were also identified. These findings reinforce the significance of the production of antimicrobial compounds and toxins by sea anemones, which play a significant role in defense and feeding. In general, the present study provides the first proteome map of the sea anemone B. verrucosa stablishing a reference for future studies in the discovery of new compounds. PMID:29364843
Proteomic Analyses of the Unexplored Sea Anemone Bunodactis verrucosa.
Domínguez-Pérez, Dany; Campos, Alexandre; Alexei Rodríguez, Armando; Turkina, Maria V; Ribeiro, Tiago; Osorio, Hugo; Vasconcelos, Vítor; Antunes, Agostinho
2018-01-24
Cnidarian toxic products, particularly peptide toxins, constitute a promising target for biomedicine research. Indeed, cnidarians are considered as the largest phylum of generally toxic animals. However, research on peptides and toxins of sea anemones is still limited. Moreover, most of the toxins from sea anemones have been discovered by classical purification approaches. Recently, high-throughput methodologies have been used for this purpose but in other Phyla. Hence, the present work was focused on the proteomic analyses of whole-body extract from the unexplored sea anemone Bunodactis verrucosa . The proteomic analyses applied were based on two methods: two-dimensional gel electrophoresis combined with MALDI-TOF/TOF and shotgun proteomic approach. In total, 413 proteins were identified, but only eight proteins were identified from gel-based analyses. Such proteins are mainly involved in basal metabolism and biosynthesis of antibiotics as the most relevant pathways. In addition, some putative toxins including metalloproteinases and neurotoxins were also identified. These findings reinforce the significance of the production of antimicrobial compounds and toxins by sea anemones, which play a significant role in defense and feeding. In general, the present study provides the first proteome map of the sea anemone B. verrucosa stablishing a reference for future studies in the discovery of new compounds.
The unique peptidome: Taxon-specific tryptic peptides as biomarkers for targeted metaproteomics.
Mesuere, Bart; Van der Jeugt, Felix; Devreese, Bart; Vandamme, Peter; Dawyndt, Peter
2016-09-01
The Unique Peptide Finder (http://unipept.ugent.be/peptidefinder) is an interactive web application to quickly hunt for tryptic peptides that are unique to a particular species, genus, or any other taxon. Biodiversity within the target taxon is represented by a set of proteomes selected from a monthly updated list of complete and nonredundant UniProt proteomes, supplemented with proprietary proteomes loaded into persistent local browser storage. The software computes and visualizes pan and core peptidomes as unions and intersections of tryptic peptides occurring in the selected proteomes. In addition, it also computes and displays unique peptidomes as the set of all tryptic peptides that occur in all selected proteomes but not in any UniProt record not assigned to the target taxon. As a result, the unique peptides can serve as robust biomarkers for the target taxon, for example, in targeted metaproteomics studies. Computations are extremely fast since they are underpinned by the Unipept database, the lowest common ancestor algorithm implemented in Unipept and modern web technologies that facilitate in-browser data storage and parallel processing. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A comprehensive and scalable database search system for metaproteomics.
Chatterjee, Sandip; Stupp, Gregory S; Park, Sung Kyu Robin; Ducom, Jean-Christophe; Yates, John R; Su, Andrew I; Wolan, Dennis W
2016-08-16
Mass spectrometry-based shotgun proteomics experiments rely on accurate matching of experimental spectra against a database of protein sequences. Existing computational analysis methods are limited in the size of their sequence databases, which severely restricts the proteomic sequencing depth and functional analysis of highly complex samples. The growing amount of public high-throughput sequencing data will only exacerbate this problem. We designed a broadly applicable metaproteomic analysis method (ComPIL) that addresses protein database size limitations. Our approach to overcome this significant limitation in metaproteomics was to design a scalable set of sequence databases assembled for optimal library querying speeds. ComPIL was integrated with a modified version of the search engine ProLuCID (termed "Blazmass") to permit rapid matching of experimental spectra. Proof-of-principle analysis of human HEK293 lysate with a ComPIL database derived from high-quality genomic libraries was able to detect nearly all of the same peptides as a search with a human database (~500x fewer peptides in the database), with a small reduction in sensitivity. We were also able to detect proteins from the adenovirus used to immortalize these cells. We applied our method to a set of healthy human gut microbiome proteomic samples and showed a substantial increase in the number of identified peptides and proteins compared to previous metaproteomic analyses, while retaining a high degree of protein identification accuracy and allowing for a more in-depth characterization of the functional landscape of the samples. The combination of ComPIL with Blazmass allows proteomic searches to be performed with database sizes much larger than previously possible. These large database searches can be applied to complex meta-samples with unknown composition or proteomic samples where unexpected proteins may be identified. The protein database, proteomic search engine, and the proteomic data files for the 5 microbiome samples characterized and discussed herein are open source and available for use and additional analysis.
The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce the opening of the leaderboard to its Proteogenomics Computational DREAM Challenge. The leadership board remains open for submissions during September 25, 2017 through October 8, 2017, with the Challenge expected to run until November 17, 2017.
Murugaiyan, Jayaseelan; Eravci, Murat; Weise, Christoph; Roesler, Uwe
2017-06-01
Here, we provide the dataset associated with our research article 'label-free quantitative proteomic analysis of harmless and pathogenic strains of infectious microalgae, Prototheca spp.' (Murugaiyan et al., 2017) [1]. This dataset describes liquid chromatography-mass spectrometry (LC-MS)-based protein identification and quantification of a non-infectious strain, Prototheca zopfii genotype 1 and two strains associated with severe and mild infections, respectively, P. zopfii genotype 2 and Prototheca blaschkeae . Protein identification and label-free quantification was carried out by analysing MS raw data using the MaxQuant-Andromeda software suit. The expressional level differences of the identified proteins among the strains were computed using Perseus software and the results were presented in [1]. This DiB provides the MaxQuant output file and raw data deposited in the PRIDE repository with the dataset identifier PXD005305.
Choi, Hyungwon; Kim, Sinae; Fermin, Damian; Tsou, Chih-Chiang; Nesvizhskii, Alexey I
2015-11-03
We introduce QPROT, a statistical framework and computational tool for differential protein expression analysis using protein intensity data. QPROT is an extension of the QSPEC suite, originally developed for spectral count data, adapted for the analysis using continuously measured protein-level intensity data. QPROT offers a new intensity normalization procedure and model-based differential expression analysis, both of which account for missing data. Determination of differential expression of each protein is based on the standardized Z-statistic based on the posterior distribution of the log fold change parameter, guided by the false discovery rate estimated by a well-known Empirical Bayes method. We evaluated the classification performance of QPROT using the quantification calibration data from the clinical proteomic technology assessment for cancer (CPTAC) study and a recently published Escherichia coli benchmark dataset, with evaluation of FDR accuracy in the latter. QPROT is a statistical framework with computational software tool for comparative quantitative proteomics analysis. It features various extensions of QSPEC method originally built for spectral count data analysis, including probabilistic treatment of missing values in protein intensity data. With the increasing popularity of label-free quantitative proteomics data, the proposed method and accompanying software suite will be immediately useful for many proteomics laboratories. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015 Elsevier B.V. All rights reserved.
Simulation of Two Dimensional Electrophoresis and Tandem Mass Spectrometry for Teaching Proteomics
ERIC Educational Resources Information Center
Fisher, Amanda; Sekera, Emily; Payne, Jill; Craig, Paul
2012-01-01
In proteomics, complex mixtures of proteins are separated (usually by chromatography or electrophoresis) and identified by mass spectrometry. We have created 2DE Tandem MS, a computer program designed for use in the biochemistry, proteomics, or bioinformatics classroom. It contains two simulations--2D electrophoresis and tandem mass spectrometry.…
Castagnola, M.; Scarano, E.; Messana, I.; Cabras, T.; Iavarone, F.; Di Cintio, G.; Fiorita, A.; De Corso, E.; Paludetti, G.
2017-01-01
SUMMARY Saliva testing is a non-invasive and inexpensive test that can serve as a source of information useful for diagnosis of disease. As we enter the era of genomic technologies and –omic research, collection of saliva has increased. Recent proteomic platforms have analysed the human salivary proteome and characterised about 3000 differentially expressed proteins and peptides: in saliva, more than 90% of proteins in weight are derived from the secretion of three couples of "major" glands; all the other components are derived from minor glands, gingival crevicular fluid, mucosal exudates and oral microflora. The most common aim of proteomic analysis is to discriminate between physiological and pathological conditions. A proteomic protocol to analyze the whole saliva proteome is not currently available. It is possible distinguish two type of proteomic platforms: top-down proteomics investigates intact naturally-occurring structure of a protein under examination; bottom-up proteomics analyses peptide fragments after pre-digestion (typically with trypsin). Because of this heterogeneity, many different biomarkers may be proposed for the same pathology. The salivary proteome has been characterised in several diseases: oral squamous cell carcinoma and oral leukoplakia, chronic graft-versus-host disease Sjögren's syndrome and other autoimmune disorders such as SAPHO, schizophrenia and bipolar disorder, and genetic diseases like Down's Syndrome and Wilson disease. The results of research reported herein suggest that in the near future human saliva will be a relevant diagnostic fluid for clinical diagnosis and prognosis. PMID:28516971
Arntzen, Magnus Ø; Thiede, Bernd
2012-02-01
Apoptosis is the most commonly described form of programmed cell death, and dysfunction is implicated in a large number of human diseases. Many quantitative proteome analyses of apoptosis have been performed to gain insight in proteins involved in the process. This resulted in large and complex data sets that are difficult to evaluate. Therefore, we developed the ApoptoProteomics database for storage, browsing, and analysis of the outcome of large scale proteome analyses of apoptosis derived from human, mouse, and rat. The proteomics data of 52 publications were integrated and unified with protein annotations from UniProt-KB, the caspase substrate database homepage (CASBAH), and gene ontology. Currently, more than 2300 records of more than 1500 unique proteins were included, covering a large proportion of the core signaling pathways of apoptosis. Analysis of the data set revealed a high level of agreement between the reported changes in directionality reported in proteomics studies and expected apoptosis-related function and may disclose proteins without a current recognized involvement in apoptosis based on gene ontology. Comparison between induction of apoptosis by the intrinsic and the extrinsic apoptotic signaling pathway revealed slight differences. Furthermore, proteomics has significantly contributed to the field of apoptosis in identifying hundreds of caspase substrates. The database is available at http://apoptoproteomics.uio.no.
Arntzen, Magnus Ø.; Thiede, Bernd
2012-01-01
Apoptosis is the most commonly described form of programmed cell death, and dysfunction is implicated in a large number of human diseases. Many quantitative proteome analyses of apoptosis have been performed to gain insight in proteins involved in the process. This resulted in large and complex data sets that are difficult to evaluate. Therefore, we developed the ApoptoProteomics database for storage, browsing, and analysis of the outcome of large scale proteome analyses of apoptosis derived from human, mouse, and rat. The proteomics data of 52 publications were integrated and unified with protein annotations from UniProt-KB, the caspase substrate database homepage (CASBAH), and gene ontology. Currently, more than 2300 records of more than 1500 unique proteins were included, covering a large proportion of the core signaling pathways of apoptosis. Analysis of the data set revealed a high level of agreement between the reported changes in directionality reported in proteomics studies and expected apoptosis-related function and may disclose proteins without a current recognized involvement in apoptosis based on gene ontology. Comparison between induction of apoptosis by the intrinsic and the extrinsic apoptotic signaling pathway revealed slight differences. Furthermore, proteomics has significantly contributed to the field of apoptosis in identifying hundreds of caspase substrates. The database is available at http://apoptoproteomics.uio.no. PMID:22067098
compomics-utilities: an open-source Java library for computational proteomics.
Barsnes, Harald; Vaudel, Marc; Colaert, Niklaas; Helsens, Kenny; Sickmann, Albert; Berven, Frode S; Martens, Lennart
2011-03-08
The growing interest in the field of proteomics has increased the demand for software tools and applications that process and analyze the resulting data. And even though the purpose of these tools can vary significantly, they usually share a basic set of features, including the handling of protein and peptide sequences, the visualization of (and interaction with) spectra and chromatograms, and the parsing of results from various proteomics search engines. Developers typically spend considerable time and effort implementing these support structures, which detracts from working on the novel aspects of their tool. In order to simplify the development of proteomics tools, we have implemented an open-source support library for computational proteomics, called compomics-utilities. The library contains a broad set of features required for reading, parsing, and analyzing proteomics data. compomics-utilities is already used by a long list of existing software, ensuring library stability and continued support and development. As a user-friendly, well-documented and open-source library, compomics-utilities greatly simplifies the implementation of the basic features needed in most proteomics tools. Implemented in 100% Java, compomics-utilities is fully portable across platforms and architectures. Our library thus allows the developers to focus on the novel aspects of their tools, rather than on the basic functions, which can contribute substantially to faster development, and better tools for proteomics.
Bergerat, Agnes; Decano, Julius; Wu, Chang-Jiun; Choi, Hyungwon; Nesvizhskii, Alexey I; Moran, Ann Marie; Ruiz-Opazo, Nelson; Steffen, Martin; Herrera, Victoria LM
2011-01-01
Stroke is the third leading cause of death in the United States with high rates of morbidity among survivors. The search to fill the unequivocal need for new therapeutic approaches would benefit from unbiased proteomic analyses of animal models of spontaneous stroke in the prestroke stage. Since brain microvessels play key roles in neurovascular coupling, we investigated prestroke microvascular proteome changes. Proteomic analysis of cerebral cortical microvessels (cMVs) was done by tandem mass spectrometry comparing two prestroke time points. Metaprotein-pathway analyses of proteomic spectral count data were done to identify risk factor–induced changes, followed by QSPEC-analyses of individual protein changes associated with increased stroke susceptibility. We report 26 cMV proteome profiles from male and female stroke-prone and non–stroke-prone rats at 2 months and 4.5 months of age prior to overt stroke events. We identified 1,934 proteins by two or more peptides. Metaprotein pathway analysis detected age-associated changes in energy metabolism and cell-to-microenvironment interactions, as well as sex-specific changes in energy metabolism and endothelial leukocyte transmigration pathways. Stroke susceptibility was associated independently with multiple protein changes associated with ischemia, angiogenesis or involved in blood brain barrier (BBB) integrity. Immunohistochemical analysis confirmed aquaporin-4 and laminin-α1 induction in cMVs, representative of proteomic changes with >65 Bayes factor (BF), associated with stroke susceptibility. Altogether, proteomic analysis demonstrates significant molecular changes in ischemic cerebral microvasculature in the prestroke stage, which could contribute to the observed model phenotype of microhemorrhages and postischemic hemorrhagic transformation. These pathways comprise putative targets for translational research of much needed novel diagnostic and therapeutic approaches for stroke. PMID:21519634
Wang, Guanghui; Wu, Wells W; Zeng, Weihua; Chou, Chung-Lin; Shen, Rong-Fong
2006-05-01
A critical step in protein biomarker discovery is the ability to contrast proteomes, a process referred generally as quantitative proteomics. While stable-isotope labeling (e.g., ICAT, 18O- or 15N-labeling, or AQUA) remains the core technology used in mass spectrometry-based proteomic quantification, increasing efforts have been directed to the label-free approach that relies on direct comparison of peptide peak areas between LC-MS runs. This latter approach is attractive to investigators for its simplicity as well as cost effectiveness. In the present study, the reproducibility and linearity of using a label-free approach to highly complex proteomes were evaluated. Various amounts of proteins from different proteomes were subjected to repeated LC-MS analyses using an ion trap or Fourier transform mass spectrometer. Highly reproducible data were obtained between replicated runs, as evidenced by nearly ideal Pearson's correlation coefficients (for ion's peak areas or retention time) and average peak area ratios. In general, more than 50% and nearly 90% of the peptide ion ratios deviated less than 10% and 20%, respectively, from the average in duplicate runs. In addition, the multiplicity ratios of the amounts of proteins used correlated nicely with the observed averaged ratios of peak areas calculated from detected peptides. Furthermore, the removal of abundant proteins from the samples led to an improvement in reproducibility and linearity. A computer program has been written to automate the processing of data sets from experiments with groups of multiple samples for statistical analysis. Algorithms for outlier-resistant mean estimation and for adjusting statistical significance threshold in multiplicity of testing were incorporated to minimize the rate of false positives. The program was applied to quantify changes in proteomes of parental and p53-deficient HCT-116 human cells and found to yield reproducible results. Overall, this study demonstrates an alternative approach that allows global quantification of differentially expressed proteins in complex proteomes. The utility of this method to biomarker discovery is likely to synergize with future improvements in the detecting sensitivity of mass spectrometers.
Trentmann, Oliver; Haferkamp, Ilka
2013-01-01
Vacuoles of plants fulfill various biologically important functions, like turgor generation and maintenance, detoxification, solute sequestration, or protein storage. Different types of plant vacuoles (lytic versus protein storage) are characterized by different functional properties apparently caused by a different composition/abundance and regulation of transport proteins in the surrounding membrane, the tonoplast. Proteome analyses allow the identification of vacuolar proteins and provide an informative basis for assigning observed transport processes to specific carriers or channels. This review summarizes techniques required for vacuolar proteome analyses, like e.g., isolation of the large central vacuole or tonoplast membrane purification. Moreover, an overview about diverse published vacuolar proteome studies is provided. It becomes evident that qualitative proteomes from different plant species represent just the tip of the iceberg. During the past few years, mass spectrometry achieved immense improvement concerning its accuracy, sensitivity, and application. As a consequence, modern tonoplast proteome approaches are suited for detecting alterations in membrane protein abundance in response to changing environmental/physiological conditions and help to clarify the regulation of tonoplast transport processes. PMID:23459586
The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce that teams led by Jaewoo Kang (Korea University), and Yuanfang Guan with Hongyang Li (University of Michigan) as the best performers of the NCI-CPTAC DREAM Proteogenomics Computational Challenge. Over 500 participants from 20 countries registered for the Challenge, which offered $25,000 in cash awards contributed by the NVIDIA Foundation through its Compute the Cure initiative.
Controlled vocabularies and ontologies in proteomics: Overview, principles and practice☆
Mayer, Gerhard; Jones, Andrew R.; Binz, Pierre-Alain; Deutsch, Eric W.; Orchard, Sandra; Montecchi-Palazzi, Luisa; Vizcaíno, Juan Antonio; Hermjakob, Henning; Oveillero, David; Julian, Randall; Stephan, Christian; Meyer, Helmut E.; Eisenacher, Martin
2014-01-01
This paper focuses on the use of controlled vocabularies (CVs) and ontologies especially in the area of proteomics, primarily related to the work of the Proteomics Standards Initiative (PSI). It describes the relevant proteomics standard formats and the ontologies used within them. Software and tools for working with these ontology files are also discussed. The article also examines the “mapping files” used to ensure correct controlled vocabulary terms that are placed within PSI standards and the fulfillment of the MIAPE (Minimum Information about a Proteomics Experiment) requirements. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23429179
Kiraga, Joanna; Mackiewicz, Pawel; Mackiewicz, Dorota; Kowalczuk, Maria; Biecek, Przemysław; Polak, Natalia; Smolarczyk, Kamila; Dudek, Miroslaw R; Cebrat, Stanislaw
2007-01-01
Background The distribution of isoelectric point (pI) of proteins in a proteome is universal for all organisms. It is bimodal dividing the proteome into two sets of acidic and basic proteins. Different species however have different abundance of acidic and basic proteins that may be correlated with taxonomy, subcellular localization, ecological niche of organisms and proteome size. Results We have analysed 1784 proteomes encoded by chromosomes of Archaea, Bacteria, Eukaryota, and also mitochondria, plastids, prokaryotic plasmids, phages and viruses. We have found significant correlation in more than 95% of proteomes between the protein length and pI in proteomes – positive for acidic proteins and negative for the basic ones. Plastids, viruses and plasmids encode more basic proteomes while chromosomes of Archaea, Bacteria, Eukaryota, mitochondria and phages more acidic ones. Mitochondrial proteomes of Viridiplantae, Protista and Fungi are more basic than Metazoa. It results from the presence of basic proteins in the former proteomes and their absence from the latter ones and is related with reduction of metazoan genomes. Significant correlation was found between the pI bias of proteomes encoded by prokaryotic chromosomes and proteomes encoded by plasmids but there is no correlation between eukaryotic nuclear-coded proteomes and proteomes encoded by organelles. Detailed analyses of prokaryotic proteomes showed significant relationships between pI distribution and habitat, relation to the host cell and salinity of the environment, but no significant correlation with oxygen and temperature requirements. The salinity is positively correlated with acidicity of proteomes. Host-associated organisms and especially intracellular species have more basic proteomes than free-living ones. The higher rate of mutations accumulation in the intracellular parasites and endosymbionts is responsible for the basicity of their tiny proteomes that explains the observed positive correlation between the decrease of genome size and the increase of basicity of proteomes. The results indicate that even conserved proteins subjected to strong selectional constraints follow the global trend in the pI distribution. Conclusion The distribution of pI of proteins in proteomes shows clear relationships with length of proteins, subcellular localization, taxonomy and ecology of organisms. The distribution is also strongly affected by mutational pressure especially in intracellular organisms. PMID:17565672
Rudnick, Paul A.; Clauser, Karl R.; Kilpatrick, Lisa E.; Tchekhovskoi, Dmitrii V.; Neta, Pedatsur; Blonder, Nikša; Billheimer, Dean D.; Blackman, Ronald K.; Bunk, David M.; Cardasis, Helene L.; Ham, Amy-Joan L.; Jaffe, Jacob D.; Kinsinger, Christopher R.; Mesri, Mehdi; Neubert, Thomas A.; Schilling, Birgit; Tabb, David L.; Tegeler, Tony J.; Vega-Montoto, Lorenzo; Variyath, Asokan Mulayath; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Carr, Steven A.; Fisher, Susan J.; Gibson, Bradford W.; Paulovich, Amanda G.; Regnier, Fred E.; Rodriguez, Henry; Spiegelman, Cliff; Tempst, Paul; Liebler, Daniel C.; Stein, Stephen E.
2010-01-01
A major unmet need in LC-MS/MS-based proteomics analyses is a set of tools for quantitative assessment of system performance and evaluation of technical variability. Here we describe 46 system performance metrics for monitoring chromatographic performance, electrospray source stability, MS1 and MS2 signals, dynamic sampling of ions for MS/MS, and peptide identification. Applied to data sets from replicate LC-MS/MS analyses, these metrics displayed consistent, reasonable responses to controlled perturbations. The metrics typically displayed variations less than 10% and thus can reveal even subtle differences in performance of system components. Analyses of data from interlaboratory studies conducted under a common standard operating procedure identified outlier data and provided clues to specific causes. Moreover, interlaboratory variation reflected by the metrics indicates which system components vary the most between laboratories. Application of these metrics enables rational, quantitative quality assessment for proteomics and other LC-MS/MS analytical applications. PMID:19837981
Cell death proteomics database: consolidating proteomics data on cell death.
Arntzen, Magnus Ø; Bull, Vibeke H; Thiede, Bernd
2013-05-03
Programmed cell death is a ubiquitous process of utmost importance for the development and maintenance of multicellular organisms. More than 10 different types of programmed cell death forms have been discovered. Several proteomics analyses have been performed to gain insight in proteins involved in the different forms of programmed cell death. To consolidate these studies, we have developed the cell death proteomics (CDP) database, which comprehends data from apoptosis, autophagy, cytotoxic granule-mediated cell death, excitotoxicity, mitotic catastrophe, paraptosis, pyroptosis, and Wallerian degeneration. The CDP database is available as a web-based database to compare protein identifications and quantitative information across different experimental setups. The proteomics data of 73 publications were integrated and unified with protein annotations from UniProt-KB and gene ontology (GO). Currently, more than 6,500 records of more than 3,700 proteins are included in the CDP. Comparing apoptosis and autophagy using overrepresentation analysis of GO terms, the majority of enriched processes were found in both, but also some clear differences were perceived. Furthermore, the analysis revealed differences and similarities of the proteome between autophagosomal and overall autophagy. The CDP database represents a useful tool to consolidate data from proteome analyses of programmed cell death and is available at http://celldeathproteomics.uio.no.
Differential Proteomic Analysis of Noncardia Gastric Cancer from Individuals of Northern Brazil
Leal, Mariana Ferreira; Chung, Janete; Calcagno, Danielle Queiroz; Assumpção, Paulo Pimentel; Demachki, Samia; da Silva, Ismael Dale Cotrim Guerreiro; Chammas, Roger; Burbano, Rommel Rodríguez; de Arruda Cardoso Smith, Marília
2012-01-01
Gastric cancer is the second leading cause of cancer-related death worldwide. The identification of new cancer biomarkers is necessary to reduce the mortality rates through the development of new screening assays and early diagnosis, as well as new target therapies. In this study, we performed a proteomic analysis of noncardia gastric neoplasias of individuals from Northern Brazil. The proteins were analyzed by two-dimensional electrophoresis and mass spectrometry. For the identification of differentially expressed proteins, we used statistical tests with bootstrapping resampling to control the type I error in the multiple comparison analyses. We identified 111 proteins involved in gastric carcinogenesis. The computational analysis revealed several proteins involved in the energy production processes and reinforced the Warburg effect in gastric cancer. ENO1 and HSPB1 expression were further evaluated. ENO1 was selected due to its role in aerobic glycolysis that may contribute to the Warburg effect. Although we observed two up-regulated spots of ENO1 in the proteomic analysis, the mean expression of ENO1 was reduced in gastric tumors by western blot. However, mean ENO1 expression seems to increase in more invasive tumors. This lack of correlation between proteomic and western blot analyses may be due to the presence of other ENO1 spots that present a slightly reduced expression, but with a high impact in the mean protein expression. In neoplasias, HSPB1 is induced by cellular stress to protect cells against apoptosis. In the present study, HSPB1 presented an elevated protein and mRNA expression in a subset of gastric cancer samples. However, no association was observed between HSPB1 expression and clinicopathological characteristics. Here, we identified several possible biomarkers of gastric cancer in individuals from Northern Brazil. These biomarkers may be useful for the assessment of prognosis and stratification for therapy if validated in larger clinical study sets. PMID:22860099
Current algorithmic solutions for peptide-based proteomics data generation and identification.
Hoopmann, Michael R; Moritz, Robert L
2013-02-01
Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics. Copyright © 2012 Elsevier Ltd. All rights reserved.
hEIDI: An Intuitive Application Tool To Organize and Treat Large-Scale Proteomics Data.
Hesse, Anne-Marie; Dupierris, Véronique; Adam, Claire; Court, Magali; Barthe, Damien; Emadali, Anouk; Masselon, Christophe; Ferro, Myriam; Bruley, Christophe
2016-10-07
Advances in high-throughput proteomics have led to a rapid increase in the number, size, and complexity of the associated data sets. Managing and extracting reliable information from such large series of data sets require the use of dedicated software organized in a consistent pipeline to reduce, validate, exploit, and ultimately export data. The compilation of multiple mass-spectrometry-based identification and quantification results obtained in the context of a large-scale project represents a real challenge for developers of bioinformatics solutions. In response to this challenge, we developed a dedicated software suite called hEIDI to manage and combine both identifications and semiquantitative data related to multiple LC-MS/MS analyses. This paper describes how, through a user-friendly interface, hEIDI can be used to compile analyses and retrieve lists of nonredundant protein groups. Moreover, hEIDI allows direct comparison of series of analyses, on the basis of protein groups, while ensuring consistent protein inference and also computing spectral counts. hEIDI ensures that validated results are compliant with MIAPE guidelines as all information related to samples and results is stored in appropriate databases. Thanks to the database structure, validated results generated within hEIDI can be easily exported in the PRIDE XML format for subsequent publication. hEIDI can be downloaded from http://biodev.extra.cea.fr/docs/heidi .
Integration of gel-based proteome data with pProRep.
Laukens, Kris; Matthiesen, Rune; Lemière, Filip; Esmans, Eddy; Onckelen, Harry Van; Jensen, Ole Nørregaard; Witters, Erwin
2006-11-15
pProRep is a web application integrating electrophoretic and mass spectral data from proteome analyses into a relational database. The graphical web-interface allows users to upload, analyse and share experimental proteome data. It offers researchers the possibility to query all previously analysed datasets and can visualize selected features, such as the presence of a certain set of ions in a peptide mass spectrum, on the level of the two-dimensional gel. The pProRep package and instructions for its use can be downloaded from http://www.ptools.ua.ac.be/pProRep. The application requires a web server that runs PHP 5 (http://www.php.net) and MySQL. Some (non-essential) extensions need additional freely available libraries: details are described in the installation instructions.
Cloud CPFP: a shotgun proteomics data analysis pipeline using cloud and high performance computing.
Trudgian, David C; Mirzaei, Hamid
2012-12-07
We have extended the functionality of the Central Proteomics Facilities Pipeline (CPFP) to allow use of remote cloud and high performance computing (HPC) resources for shotgun proteomics data processing. CPFP has been modified to include modular local and remote scheduling for data processing jobs. The pipeline can now be run on a single PC or server, a local cluster, a remote HPC cluster, and/or the Amazon Web Services (AWS) cloud. We provide public images that allow easy deployment of CPFP in its entirety in the AWS cloud. This significantly reduces the effort necessary to use the software, and allows proteomics laboratories to pay for compute time ad hoc, rather than obtaining and maintaining expensive local server clusters. Alternatively the Amazon cloud can be used to increase the throughput of a local installation of CPFP as necessary. We demonstrate that cloud CPFP allows users to process data at higher speed than local installations but with similar cost and lower staff requirements. In addition to the computational improvements, the web interface to CPFP is simplified, and other functionalities are enhanced. The software is under active development at two leading institutions and continues to be released under an open-source license at http://cpfp.sourceforge.net.
Hamzeiy, Hamid; Cox, Jürgen
2017-02-01
Computational workflows for mass spectrometry-based shotgun proteomics and untargeted metabolomics share many steps. Despite the similarities, untargeted metabolomics is lagging behind in terms of reliable fully automated quantitative data analysis. We argue that metabolomics will strongly benefit from the adaptation of successful automated proteomics workflows to metabolomics. MaxQuant is a popular platform for proteomics data analysis and is widely considered to be superior in achieving high precursor mass accuracies through advanced nonlinear recalibration, usually leading to five to ten-fold better accuracy in complex LC-MS/MS runs. This translates to a sharp decrease in the number of peptide candidates per measured feature, thereby strongly improving the coverage of identified peptides. We argue that similar strategies can be applied to untargeted metabolomics, leading to equivalent improvements in metabolite identification. Copyright © 2016 The Author(s). Published by Elsevier Ltd.. All rights reserved.
2013-01-01
Background Guanine-cytosine (GC) composition is an important feature of genomes. Likewise, amino acid composition is a distinct, but less valued, feature of proteomes. A major concern is that it is not clear what valuable information can be acquired from amino acid composition data. To address this concern, in-depth analyses of the amino acid composition of the complete proteomes from 63 archaea, 270 bacteria, and 128 eukaryotes were performed. Results Principal component analysis of the amino acid matrices showed that the main contributors to proteomic architecture were genomic GC variation, phylogeny, and environmental influences. GC pressure drove positive selection on Ala, Arg, Gly, Pro, Trp, and Val, and adverse selection on Asn, Lys, Ile, Phe, and Tyr. The physico-chemical framework of the complete proteomes withstood GC pressure by frequency complementation of GC-dependent amino acid pairs with similar physico-chemical properties. Gln, His, Ser, and Val were responsible for phylogeny and their constituted components could differentiate archaea, bacteria, and eukaryotes. Environmental niche was also a significant factor in determining proteomic architecture, especially for archaea for which the main amino acids were Cys, Leu, and Thr. In archaea, hyperthermophiles, acidophiles, mesophiles, psychrophiles, and halophiles gathered successively along the environment-based principal component. Concordance between proteomic architecture and the genetic code was also related closely to genomic GC content, phylogeny, and lifestyles. Conclusions Large-scale analyses of the complete proteomes of a wide range of organisms suggested that amino acid composition retained the trace of GC variation, phylogeny, and environmental influences during evolution. The findings from this study will help in the development of a global understanding of proteome evolution, and even biological evolution. PMID:24088322
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Jianying; Dann, Geoffrey P.; Shi, Tujin
2012-03-10
Sodium dodecyl sulfate (SDS) is one of the most popular laboratory reagents used for highly efficient biological sample extraction; however, SDS presents a significant challenge to LC-MS-based proteomic analyses due to its severe interference with reversed-phase LC separations and electrospray ionization interfaces. This study reports a simple SDS-assisted proteomic sample preparation method facilitated by a novel peptide-level SDS removal protocol. After SDS-assisted protein extraction and digestion, SDS was effectively (>99.9%) removed from peptides through ion substitution-mediated DS- precipitation with potassium chloride (KCl) followed by {approx}10 min centrifugation. Excellent peptide recovery (>95%) was observed for less than 20 {mu}g of peptides.more » Further experiments demonstrated the compatibility of this protocol with LC-MS/MS analyses. The resulting proteome coverage from this SDS-assisted protocol was comparable to or better than those obtained from other standard proteomic preparation methods in both mammalian tissues and bacterial samples. These results suggest that this SDS-assisted protocol is a practical, simple, and broadly applicable proteomic sample processing method, which can be particularly useful when dealing with samples difficult to solubilize by other methods.« less
Proteomic Analyses of NF1-Interacting Proteins in Keratinocytes
2015-04-01
and knockout mice further confirmed the interactions suggested by the proteomic analyses. In relation to the development of psoriasis -like symptoms...in the NF1 null epidermis, we analyzed NF1 expression in a mouse model of psoriasis (imiquimod-induced psoriasis -like skin inflammation) and...knockout of epidermal NF1 to elucidate the molecular underpinnings of psoriasis . 15. SUBJECT TERMS neurofibromin-1 (NF1), psoriasis , inflammation
Design and Initial Characterization of the SC-200 Proteomics Standard Mixture
Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald
2011-01-01
Abstract High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels. PMID:21250827
Design and initial characterization of the SC-200 proteomics standard mixture.
Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald; Kolker, Eugene
2011-01-01
High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels.
Proteomic analyses of the environmental toxicity of carcinogenic chemicals
Protein expression and posttranslational modifications consistently change in response to the exposure to environmental chemicals. Recent technological advances in proteomics provide new tools for more efficient characterization of protein expression and posttranslational modific...
Lee, Hangyeore; Mun, Dong-Gi; So, Jeong Eun; Bae, Jingi; Kim, Hokeun; Masselon, Christophe; Lee, Sang-Won
2016-12-06
Proteomics aims to achieve complete profiling of the protein content and protein modifications in cells, tissues, and biofluids and to quantitatively determine changes in their abundances. This information serves to elucidate cellular processes and signaling pathways and to identify candidate protein biomarkers and/or therapeutic targets. Analyses must therefore be both comprehensive and efficient. Here, we present a novel online two-dimensional reverse-phase/reverse-phase liquid chromatography separation platform, which is based on a newly developed online noncontiguous fractionating and concatenating device (NCFC fractionator). In bottom-up proteomics analyses of a complex proteome, this system provided significantly improved exploitation of the separation space of the two RPs, considerably increasing the numbers of peptides identified compared to a contiguous 2D-RP/RPLC method. The fully automated online 2D-NCFC-RP/RPLC system bypassed a number of labor-intensive manual processes required with the previously described offline 2D-NCFC RP/RPLC method, and thus, it offers minimal sample loss in a context of highly reproducible 2D-RP/RPLC experiments.
Proteomics in the investigation of HIV-1 interactions with host proteins.
Li, Ming
2015-02-01
Productive HIV-1 infection depends on host machinery, including a broad array of cellular proteins. Proteomics has played a significant role in the discovery of HIV-1 host proteins. In this review, after a brief survey of the HIV-1 host proteins that were discovered by proteomic analyses, I focus on analyzing the interactions between the virion and host proteins, as well as the technologies and strategies used in those proteomic studies. With the help of proteomics, the identification and characterization of HIV-1 host proteins can be translated into novel antiretroviral therapeutics. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Employee Spotlight: Clarence Chang | Argonne National Laboratory
batteries --Electricity transmission --Smart Grid Environment -Biology --Computational biology --Environmental biology ---Metagenomics ---Terrestrial ecology --Molecular biology ---Clinical proteomics and biomarker discovery ---Interventional biology ---Proteomics --Structural biology -Environmental science &
Dynamic Adaptive Binning: An Improved Quantification Technique for NMR Spectroscopic Data
2010-01-01
Reo 2002). Unlike proteomics and genomics that assess inter- mediate products, metabolomics assesses the end product of cellular function, metabolites...other proteomic , genomic , and metabolomic analyses, NMR spectroscopy is Electronic supplementary material The online version of this article (doi...Changes occurring at the level of genes and proteins (assessed by genomics and proteomics ) may or may not influence a variety of cellular functions
Integrated proteomic and genomic analysis of colorectal cancer
Investigators who analyzed 95 human colorectal tumor samples have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, pro
PatternLab for proteomics 4.0: A one-stop shop for analyzing shotgun proteomic data
Carvalho, Paulo C; Lima, Diogo B; Leprevost, Felipe V; Santos, Marlon D M; Fischer, Juliana S G; Aquino, Priscila F; Moresco, James J; Yates, John R; Barbosa, Valmir C
2017-01-01
PatternLab for proteomics is an integrated computational environment that unifies several previously published modules for analyzing shotgun proteomic data. PatternLab contains modules for formatting sequence databases, performing peptide spectrum matching, statistically filtering and organizing shotgun proteomic data, extracting quantitative information from label-free and chemically labeled data, performing statistics for differential proteomics, displaying results in a variety of graphical formats, performing similarity-driven studies with de novo sequencing data, analyzing time-course experiments, and helping with the understanding of the biological significance of data in the light of the Gene Ontology. Here we describe PatternLab for proteomics 4.0, which closely knits together all of these modules in a self-contained environment, covering the principal aspects of proteomic data analysis as a freely available and easily installable software package. All updates to PatternLab, as well as all new features added to it, have been tested over the years on millions of mass spectra. PMID:26658470
P-MartCancer-Interactive Online Software to Enable Analysis of Shotgun Cancer Proteomic Datasets.
Webb-Robertson, Bobbie-Jo M; Bramer, Lisa M; Jensen, Jeffrey L; Kobold, Markus A; Stratton, Kelly G; White, Amanda M; Rodland, Karin D
2017-11-01
P-MartCancer is an interactive web-based software environment that enables statistical analyses of peptide or protein data, quantitated from mass spectrometry-based global proteomics experiments, without requiring in-depth knowledge of statistical programming. P-MartCancer offers a series of statistical modules associated with quality assessment, peptide and protein statistics, protein quantification, and exploratory data analyses driven by the user via customized workflows and interactive visualization. Currently, P-MartCancer offers access and the capability to analyze multiple cancer proteomic datasets generated through the Clinical Proteomics Tumor Analysis Consortium at the peptide, gene, and protein levels. P-MartCancer is deployed as a web service (https://pmart.labworks.org/cptac.html), alternatively available via Docker Hub (https://hub.docker.com/r/pnnl/pmart-web/). Cancer Res; 77(21); e47-50. ©2017 AACR . ©2017 American Association for Cancer Research.
An Overview of Advanced SILAC-Labeling Strategies for Quantitative Proteomics.
Terzi, F; Cambridge, S
2017-01-01
Comparative, quantitative mass spectrometry of proteins provides great insight to protein abundance and function, but some molecular characteristics related to protein dynamics are not so easily obtained. Because the metabolic incorporation of stable amino acid isotopes allows the extraction of distinct temporal and spatial aspects of protein dynamics, the SILAC methodology is uniquely suited to be adapted for advanced labeling strategies. New SILAC strategies have emerged that allow deeper foraging into the complexity of cellular proteomes. Here, we review a few advanced SILAC-labeling strategies that have been published during last the years. Among them, different subsaturating-labeling as well as dual-labeling schemes are most prominent for a range of analyses including those of neuronal proteomes, secretion, or cell-cell-induced stimulations. These recent developments suggest that much more information can be gained from proteomic analyses if the labeling strategies are specifically tailored toward the experimental design. © 2017 Elsevier Inc. All rights reserved.
Cehofski, Lasse Jørgensen; Honoré, Bent; Vorum, Henrik
2017-04-28
Retinal artery occlusion (RAO), retinal vein occlusion (RVO), diabetic retinopathy (DR) and age-related macular degeneration (AMD) are frequent ocular diseases with potentially sight-threatening outcomes. In the present review we discuss major findings of proteomic studies of RAO, RVO, DR and AMD, including an overview of ocular proteome changes associated with anti-vascular endothelial growth factor (VEGF) treatments. Despite the severe outcomes of RAO, the proteome of the disease remains largely unstudied. There is also limited knowledge about the proteome of RVO, but proteomic studies suggest that RVO is associated with remodeling of the extracellular matrix and adhesion processes. Proteomic studies of DR have resulted in the identification of potential therapeutic targets such as carbonic anhydrase-I. Proliferative diabetic retinopathy is the most intensively studied stage of DR. Proteomic studies have established VEGF, pigment epithelium-derived factor (PEDF) and complement components as key factors associated with AMD. The aim of this review is to highlight the major milestones in proteomics in RAO, RVO, DR and AMD. Through large-scale protein analyses, proteomics is bringing new important insights into these complex pathological conditions.
Investigators from the National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) who comprehensively analyzed 95 human colorectal tumor samples, have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, provides a more comprehensive view of the biological features that drive cancer than genomic analysis alone and may help identify the most important targets for cancer detection and intervention.
The speciation of the proteome
Jungblut, Peter R; Holzhütter, Hermann G; Apweiler, Rolf; Schlüter, Hartmut
2008-01-01
Introduction In proteomics a paradox situation developed in the last years. At one side it is basic knowledge that proteins are post-translationally modified and occur in different isoforms. At the other side the protein expression concept disclaims post-translational modifications by connecting protein names directly with function. Discussion Optimal proteome coverage is today reached by bottom-up liquid chromatography/mass spectrometry. But quantification at the peptide level in shotgun or bottom-up approaches by liquid chromatography and mass spectrometry is completely ignoring that a special peptide may exist in an unmodified form and in several-fold modified forms. The acceptance of the protein species concept is a basic prerequisite for meaningful quantitative analyses in functional proteomics. In discovery approaches only top-down analyses, separating the protein species before digestion, identification and quantification by two-dimensional gel electrophoresis or protein liquid chromatography, allow the correlation between changes of a biological situation and function. Conclusion To obtain biological relevant information kinetics and systems biology have to be performed at the protein species level, which is the major challenge in proteomics today. PMID:18638390
77 FR 67381 - Government-Owned Inventions; Availability for Licensing
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-09
.... ``Computational and Experimental RNA Nanoparticle Design,'' in Automation in Genomics and Proteomics: An... and Experimental RNA Nanoparticle Design,'' in Automation in Genomics and Proteomics: An Engineering... Development Stage: Prototype Pre-clinical In vitro data available Inventors: Robert J. Crouch and Yutaka...
Shteynberg, David; Deutsch, Eric W.; Lam, Henry; Eng, Jimmy K.; Sun, Zhi; Tasman, Natalie; Mendoza, Luis; Moritz, Robert L.; Aebersold, Ruedi; Nesvizhskii, Alexey I.
2011-01-01
The combination of tandem mass spectrometry and sequence database searching is the method of choice for the identification of peptides and the mapping of proteomes. Over the last several years, the volume of data generated in proteomic studies has increased dramatically, which challenges the computational approaches previously developed for these data. Furthermore, a multitude of search engines have been developed that identify different, overlapping subsets of the sample peptides from a particular set of tandem mass spectrometry spectra. We present iProphet, the new addition to the widely used open-source suite of proteomic data analysis tools Trans-Proteomics Pipeline. Applied in tandem with PeptideProphet, it provides more accurate representation of the multilevel nature of shotgun proteomic data. iProphet combines the evidence from multiple identifications of the same peptide sequences across different spectra, experiments, precursor ion charge states, and modified states. It also allows accurate and effective integration of the results from multiple database search engines applied to the same data. The use of iProphet in the Trans-Proteomics Pipeline increases the number of correctly identified peptides at a constant false discovery rate as compared with both PeptideProphet and another state-of-the-art tool Percolator. As the main outcome, iProphet permits the calculation of accurate posterior probabilities and false discovery rate estimates at the level of sequence identical peptide identifications, which in turn leads to more accurate probability estimates at the protein level. Fully integrated with the Trans-Proteomics Pipeline, it supports all commonly used MS instruments, search engines, and computer platforms. The performance of iProphet is demonstrated on two publicly available data sets: data from a human whole cell lysate proteome profiling experiment representative of typical proteomic data sets, and from a set of Streptococcus pyogenes experiments more representative of organism-specific composite data sets. PMID:21876204
Guarnieri, Michael T.; Nag, Ambarish; Smolinski, Sharon L.; Darzins, Al; Seibert, Michael; Pienkos, Philip T.
2011-01-01
Biofuels derived from algal lipids represent an opportunity to dramatically impact the global energy demand for transportation fuels. Systems biology analyses of oleaginous algae could greatly accelerate the commercialization of algal-derived biofuels by elucidating the key components involved in lipid productivity and leading to the initiation of hypothesis-driven strain-improvement strategies. However, higher-level systems biology analyses, such as transcriptomics and proteomics, are highly dependent upon available genomic sequence data, and the lack of these data has hindered the pursuit of such analyses for many oleaginous microalgae. In order to examine the triacylglycerol biosynthetic pathway in the unsequenced oleaginous microalga, Chlorella vulgaris, we have established a strategy with which to bypass the necessity for genomic sequence information by using the transcriptome as a guide. Our results indicate an upregulation of both fatty acid and triacylglycerol biosynthetic machinery under oil-accumulating conditions, and demonstrate the utility of a de novo assembled transcriptome as a search model for proteomic analysis of an unsequenced microalga. PMID:22043295
A Computational Tool to Detect and Avoid Redundancy in Selected Reaction Monitoring
Röst, Hannes; Malmström, Lars; Aebersold, Ruedi
2012-01-01
Selected reaction monitoring (SRM), also called multiple reaction monitoring, has become an invaluable tool for targeted quantitative proteomic analyses, but its application can be compromised by nonoptimal selection of transitions. In particular, complex backgrounds may cause ambiguities in SRM measurement results because peptides with interfering transitions similar to those of the target peptide may be present in the sample. Here, we developed a computer program, the SRMCollider, that calculates nonredundant theoretical SRM assays, also known as unique ion signatures (UIS), for a given proteomic background. We show theoretically that UIS of three transitions suffice to conclusively identify 90% of all yeast peptides and 85% of all human peptides. Using predicted retention times, the SRMCollider also simulates time-scheduled SRM acquisition, which reduces the number of interferences to consider and leads to fewer transitions necessary to construct an assay. By integrating experimental fragment ion intensities from large scale proteome synthesis efforts (SRMAtlas) with the information content-based UIS, we combine two orthogonal approaches to create high quality SRM assays ready to be deployed. We provide a user friendly, open source implementation of an algorithm to calculate UIS of any order that can be accessed online at http://www.srmcollider.org to find interfering transitions. Finally, our tool can also simulate the specificity of novel data-independent MS acquisition methods in Q1–Q3 space. This allows us to predict parameters for these methods that deliver a specificity comparable with that of SRM. Using SRM interference information in addition to other sources of information can increase the confidence in an SRM measurement. We expect that the consideration of information content will become a standard step in SRM assay design and analysis, facilitated by the SRMCollider. PMID:22535207
Evolution of complexity in the zebrafish synapse proteome
Bayés, Àlex; Collins, Mark O.; Reig-Viader, Rita; Gou, Gemma; Goulding, David; Izquierdo, Abril; Choudhary, Jyoti S.; Emes, Richard D.; Grant, Seth G. N.
2017-01-01
The proteome of human brain synapses is highly complex and is mutated in over 130 diseases. This complexity arose from two whole-genome duplications early in the vertebrate lineage. Zebrafish are used in modelling human diseases; however, its synapse proteome is uncharacterized, and whether the teleost-specific genome duplication (TSGD) influenced complexity is unknown. We report the characterization of the proteomes and ultrastructure of central synapses in zebrafish and analyse the importance of the TSGD. While the TSGD increases overall synapse proteome complexity, the postsynaptic density (PSD) proteome of zebrafish has lower complexity than mammals. A highly conserved set of ∼1,000 proteins is shared across vertebrates. PSD ultrastructural features are also conserved. Lineage-specific proteome differences indicate that vertebrate species evolved distinct synapse types and functions. The data sets are a resource for a wide range of studies and have important implications for the use of zebrafish in modelling human synaptic diseases. PMID:28252024
Ahmad, Yasmin; Sharma, Narendra K.; Ahmad, Mohammad Faiz; Sharma, Manish; Garg, Iti; Srivastava, Mousami; Bhargava, Kalpana
2015-01-01
Exposure to high altitude induces physiological responses due to hypoxia. Lungs being at the first level to face the alterations in oxygen levels are critical to counter and balance these changes. Studies have been done analysing pulmonary proteome alterations in response to exposure to hypobaric hypoxia. However, such studies have reported the alterations at specific time points and do not reflect the gradual proteomic changes. These studies also identify the various biochemical pathways and responses induced after immediate exposure and the resolution of these effects in challenge to hypobaric hypoxia. In the present study, using 2-DE/MS approach, we attempt to resolve these shortcomings by analysing the proteome alterations in lungs in response to different durations of exposure to hypobaric hypoxia. Our study thus highlights the gradual and dynamic changes in pulmonary proteome following hypobaric hypoxia. For the first time, we also report the possible consideration of SULT1A1, as a biomarker for the diagnosis of high altitude pulmonary edema (HAPE). Higher SULT1A1 levels were observed in rats as well as in humans exposed to high altitude, when compared to sea-level controls. This study can thus form the basis for identifying biomarkers for diagnostic and prognostic purposes in responses to hypobaric hypoxia. PMID:26022216
Proteomic Analyses of the Effects of Drugs of Abuse on Monocyte-Derived Mature Dendritic Cells
Reynolds, Jessica L.; Mahajan, Supriya D.; Aalinkeel, Ravikunar; Nair, B.; Sykes, Donald E.; Schwartz, Stanley A.
2010-01-01
Drug abuse has become a global health concern. Understanding how drug abuse modulates the immune system and how the immune system responds to pathogens associated with drug abuse, such hepatitis C virus (HCV) and human immunodeficiency virus (HIV-1), can be assessed by an integrated approach comparing proteomic analyses and quantitation of gene expression. Two-dimensional (2D) difference gel electrophoresis was used to determine the molecular mechanisms underlying the proteomic changes that alter normal biological processes when monocyte-derived mature dendritic cells were treated with cocaine or methamphetamine. Both drugs differentially regulated the expression of several functional classes of proteins including those that modulate apoptosis, protein folding, protein kinase activity, and metabolism and proteins that function as intracellular signal transduction molecules. Proteomic data were validated using a combination of quantitative, real-time PCR and Western blot analyses. These studies will help to identify the molecular mechanisms, including the expression of several functionally important classes of proteins that have emerged as potential mediators of pathogenesis. These proteins may predispose immunocompetent cells, including dendritic cells, to infection with viruses such as HCV and HIV-1, which are associated with drug abuse. PMID:19811410
Standardized protocols for quality control of MRM-based plasma proteomic workflows.
Percy, Andrew J; Chambers, Andrew G; Smith, Derek S; Borchers, Christoph H
2013-01-04
Mass spectrometry (MS)-based proteomics is rapidly emerging as a viable technology for the identification and quantitation of biological samples, such as human plasma--the most complex yet commonly employed biofluid in clinical analyses. The transition from a qualitative to quantitative science is required if proteomics is going to successfully make the transition to a clinically useful technique. MS, however, has been criticized for a lack of reproducibility and interlaboratory transferability. Currently, the MS and plasma proteomics communities lack standardized protocols and reagents to ensure that high-quality quantitative data can be accurately and precisely reproduced by laboratories across the world using different MS technologies. Toward addressing this issue, we have developed standard protocols for multiple reaction monitoring (MRM)-based assays with customized isotopically labeled internal standards for quality control of the sample preparation workflow and the MS platform in quantitative plasma proteomic analyses. The development of reference standards and their application to a single MS platform is discussed herein, along with the results from intralaboratory tests. The tests highlighted the importance of the reference standards in assessing the efficiency and reproducibility of the entire bottom-up proteomic workflow and revealed errors related to the sample preparation and performance quality and deficits of the MS and LC systems. Such evaluations are necessary if MRM-based quantitative plasma proteomics is to be used in verifying and validating putative disease biomarkers across different research laboratories and eventually in clinical laboratories.
Quantitative trait loci mapping of the mouse plasma proteome (pQTL).
Holdt, Lesca M; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel
2013-02-01
A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F(2) intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins.
Quantitative Trait Loci Mapping of the Mouse Plasma Proteome (pQTL)
Holdt, Lesca M.; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel
2013-01-01
A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F2 intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins. PMID:23172855
This research project combines the use of whole organism endpoints, genomic, proteomic and metabolomic approaches, and computational modeling in a systems biology approach to 1) identify molecular indicators of exposure and biomarkers of effect to EDCs representing several modes/...
Cehofski, Lasse Jørgensen; Honoré, Bent; Vorum, Henrik
2017-01-01
Retinal artery occlusion (RAO), retinal vein occlusion (RVO), diabetic retinopathy (DR) and age-related macular degeneration (AMD) are frequent ocular diseases with potentially sight-threatening outcomes. In the present review we discuss major findings of proteomic studies of RAO, RVO, DR and AMD, including an overview of ocular proteome changes associated with anti-vascular endothelial growth factor (VEGF) treatments. Despite the severe outcomes of RAO, the proteome of the disease remains largely unstudied. There is also limited knowledge about the proteome of RVO, but proteomic studies suggest that RVO is associated with remodeling of the extracellular matrix and adhesion processes. Proteomic studies of DR have resulted in the identification of potential therapeutic targets such as carbonic anhydrase-I. Proliferative diabetic retinopathy is the most intensively studied stage of DR. Proteomic studies have established VEGF, pigment epithelium-derived factor (PEDF) and complement components as key factors associated with AMD. The aim of this review is to highlight the major milestones in proteomics in RAO, RVO, DR and AMD. Through large-scale protein analyses, proteomics is bringing new important insights into these complex pathological conditions. PMID:28452939
TrSDB: a proteome database of transcription factors
Hermoso, Antoni; Aguilar, Daniel; Aviles, Francesc X.; Querol, Enrique
2004-01-01
TrSDB—TranScout Database—(http://ibb.uab.es/trsdb) is a proteome database of eukaryotic transcription factors based upon predicted motifs by TranScout and data sources such as InterPro and Gene Ontology Annotation. Nine eukaryotic proteomes are included in the current version. Extensive and diverse information for each database entry, different analyses considering TranScout classification and similarity relationships are offered for research on transcription factors or gene expression. PMID:14681387
Background | Office of Cancer Clinical Proteomics Research
The term "proteomics" refers to a large-scale comprehensive study of a specific proteome resulting from its genome, including abundances of proteins, their variations and modifications, and interacting partners and networks in order to understand cellular processes involved. Similarly, “Cancer proteomics” refers to comprehensive analyses of proteins and their derivatives translated from a specific cancer genome using a human biospecimen or a preclinical model (e.g., cultured cell or animal model).
Deutsch, Eric W.; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L.
2015-01-01
Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include mass spectrometry to define protein sequence, protein:protein interactions, and protein post-translational modifications. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative mass spectrometry proteomics. It supports all major operating systems and instrument vendors via open data formats. Here we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of tandem mass spectrometry datasets, as well as some major upcoming features. PMID:25631240
Deutsch, Eric W; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L
2015-08-01
Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include MS to define protein sequence, protein:protein interactions, and protein PTMs. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative MS proteomics. It supports all major operating systems and instrument vendors via open data formats. Here, we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of MS/MS datasets, as well as some major upcoming features. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Spermatogenesis in mammals: proteomic insights.
Chocu, Sophie; Calvel, Pierre; Rolland, Antoine D; Pineau, Charles
2012-08-01
Spermatogenesis is a highly sophisticated process involved in the transmission of genetic heritage. It includes halving ploidy, repackaging of the chromatin for transport, and the equipment of developing spermatids and eventually spermatozoa with the advanced apparatus (e.g., tightly packed mitochondrial sheat in the mid piece, elongating of the tail, reduction of cytoplasmic volume) to elicit motility once they reach the epididymis. Mammalian spermatogenesis is divided into three phases. In the first the primitive germ cells or spermatogonia undergo a series of mitotic divisions. In the second the spermatocytes undergo two consecutive divisions in meiosis to produce haploid spermatids. In the third the spermatids differentiate into spermatozoa in a process called spermiogenesis. Paracrine, autocrine, juxtacrine, and endocrine pathways all contribute to the regulation of the process. The array of structural elements and chemical factors modulating somatic and germ cell activity is such that the network linking the various cellular activities during spermatogenesis is unimaginably complex. Over the past two decades, advances in genomics have greatly improved our knowledge of spermatogenesis, by identifying numerous genes essential for the development of functional male gametes. Large-scale analyses of testicular function have deepened our insight into normal and pathological spermatogenesis. Progress in genome sequencing and microarray technology have been exploited for genome-wide expression studies, leading to the identification of hundreds of genes differentially expressed within the testis. However, although proteomics has now come of age, the proteomics-based investigation of spermatogenesis remains in its infancy. Here, we review the state-of-the-art of large-scale proteomic analyses of spermatogenesis, from germ cell development during sex determination to spermatogenesis in the adult. Indeed, a few laboratories have undertaken differential protein profiling expression studies and/or systematic analyses of testicular proteomes in entire organs or isolated cells from various species. We consider the pros and cons of proteomics for studying the testicular germ cell gene expression program. Finally, we address the use of protein datasets, through integrative genomics (i.e., combining genomics, transcriptomics, and proteomics), bioinformatics, and modelling.
Single-molecule protein sequencing through fingerprinting: computational assessment
NASA Astrophysics Data System (ADS)
Yao, Yao; Docter, Margreet; van Ginkel, Jetty; de Ridder, Dick; Joo, Chirlmin
2015-10-01
Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences.
Recent developments in structural proteomics for protein structure determination.
Liu, Hsuan-Liang; Hsu, Jyh-Ping
2005-05-01
The major challenges in structural proteomics include identifying all the proteins on the genome-wide scale, determining their structure-function relationships, and outlining the precise three-dimensional structures of the proteins. Protein structures are typically determined by experimental approaches such as X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. However, the knowledge of three-dimensional space by these techniques is still limited. Thus, computational methods such as comparative and de novo approaches and molecular dynamic simulations are intensively used as alternative tools to predict the three-dimensional structures and dynamic behavior of proteins. This review summarizes recent developments in structural proteomics for protein structure determination; including instrumental methods such as X-ray crystallography and NMR spectroscopy, and computational methods such as comparative and de novo structure prediction and molecular dynamics simulations.
Integrating cell biology and proteomic approaches in plants.
Takáč, Tomáš; Šamajová, Olga; Šamaj, Jozef
2017-10-03
Significant improvements of protein extraction, separation, mass spectrometry and bioinformatics nurtured advancements of proteomics during the past years. The usefulness of proteomics in the investigation of biological problems can be enhanced by integration with other experimental methods from cell biology, genetics, biochemistry, pharmacology, molecular biology and other omics approaches including transcriptomics and metabolomics. This review aims to summarize current trends integrating cell biology and proteomics in plant science. Cell biology approaches are most frequently used in proteomic studies investigating subcellular and developmental proteomes, however, they were also employed in proteomic studies exploring abiotic and biotic stress responses, vesicular transport, cytoskeleton and protein posttranslational modifications. They are used either for detailed cellular or ultrastructural characterization of the object subjected to proteomic study, validation of proteomic results or to expand proteomic data. In this respect, a broad spectrum of methods is employed to support proteomic studies including ultrastructural electron microscopy studies, histochemical staining, immunochemical localization, in vivo imaging of fluorescently tagged proteins and visualization of protein-protein interactions. Thus, cell biological observations on fixed or living cell compartments, cells, tissues and organs are feasible, and in some cases fundamental for the validation and complementation of proteomic data. Validation of proteomic data by independent experimental methods requires development of new complementary approaches. Benefits of cell biology methods and techniques are not sufficiently highlighted in current proteomic studies. This encouraged us to review most popular cell biology methods used in proteomic studies and to evaluate their relevance and potential for proteomic data validation and enrichment of purely proteomic analyses. We also provide examples of representative studies combining proteomic and cell biology methods for various purposes. Integrating cell biology approaches with proteomic ones allow validation and better interpretation of proteomic data. Moreover, cell biology methods remarkably extend the knowledge provided by proteomic studies and might be fundamental for the functional complementation of proteomic data. This review article summarizes current literature linking proteomics with cell biology. Copyright © 2017 Elsevier B.V. All rights reserved.
Polati, Rita; Menini, Michele; Robotti, Elisa; Millioni, Renato; Marengo, Emilio; Novelli, Enrico; Balzan, Stefania; Cecconi, Daniela
2012-12-01
To study proteomic changes involved in tenderization of bovine Longissimus dorsi four Charolaise heifers and four Charolaise bull's muscles were sampled at slaughter after early and long ageing (2-4°C for 12 and 26days respectively). Descriptive sensory evaluation of samples were performed and their tenderness evaluated by Warner-Bratzler shear force test. Protein composition of fresh muscle and of meat aged was analysed by cartesian and polar 2-D electrophoresis. Student's t-test and Ranking-PCA analyses were performed to detect proteomic modulation, and the selected protein spots were identified by nano-HPLC-Chip MS/MS. This research has demonstrated that there are no differences between proteomic patterns of male and females Longissimus dorsi muscle, and that the extension of ageing beyond 12days, did not brings any concrete advantage in terms of sensory quality. Furthermore, the data presented here demonstrated that meat maturation caused changes of the abundance of proteins involved in metabolic, structural, and stress related processes. Copyright © 2012 Elsevier Ltd. All rights reserved.
P-MartCancer–Interactive Online Software to Enable Analysis of Shotgun Cancer Proteomic Datasets
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Bramer, Lisa M.; Jensen, Jeffrey L.
P-MartCancer is a new interactive web-based software environment that enables biomedical and biological scientists to perform in-depth analyses of global proteomics data without requiring direct interaction with the data or with statistical software. P-MartCancer offers a series of statistical modules associated with quality assessment, peptide and protein statistics, protein quantification and exploratory data analyses driven by the user via customized workflows and interactive visualization. Currently, P-MartCancer offers access to multiple cancer proteomic datasets generated through the Clinical Proteomics Tumor Analysis Consortium (CPTAC) at the peptide, gene and protein levels. P-MartCancer is deployed using Azure technologies (http://pmart.labworks.org/cptac.html), the web-service is alternativelymore » available via Docker Hub (https://hub.docker.com/r/pnnl/pmart-web/) and many statistical functions can be utilized directly from an R package available on GitHub (https://github.com/pmartR).« less
Kling, Peter; Förlin, Lars
2009-10-01
Proteomic effect screening in zebrafish liver cells was performed to generate hypotheses regarding single and mixed exposure to the BFRs HBCD and TBBPA. Responses at sublethal exposure were analysed by two-dimensional gel electrophoresis followed by MALDI-TOF and FT-ICR protein identification. Mixing of HBCD and TBBPA at sublethal doses of individual substances seemed to increase toxicity. Proteomic analyses revealed distinct exposure-specific and overlapping responses suggesting novel mechanisms with regard to HBCD and TBBPA exposure. While distinct HBCD responses were related to decreased protein metabolism, TBBPA revealed effects related to protein folding and NADPH production. Overlapping responses suggest increased gluconeogenesis (GAPDH and aldolase) while distinct mixture effects suggest a pronounced NADPH production and changes in proteins related to cell cycle control (prohibitin and crk-like oncogene). We conclude that mixtures containing HBCD and TBBPA may result in unexpected effects highlighting proteomics as a sensitive tool for detecting and hypothesis generation of mixture effects.
A Routine 'Top-Down' Approach to Analysis of the Human Serum Proteome.
D'Silva, Arlene M; Hyett, Jon A; Coorssen, Jens R
2017-06-06
Serum provides a rich source of potential biomarker proteoforms. One of the major obstacles in analysing serum proteomes is detecting lower abundance proteins owing to the presence of hyper-abundant species (e.g., serum albumin and immunoglobulins). Although depletion methods have been used to address this, these can lead to the concomitant removal of non-targeted protein species, and thus raise issues of specificity, reproducibility, and the capacity for meaningful quantitative analyses. Altering the native stoichiometry of the proteome components may thus yield a more complex series of issues than dealing directly with the inherent complexity of the sample. Hence, here we targeted method refinements so as to ensure optimum resolution of serum proteomes via a top down two-dimensional gel electrophoresis (2DE) approach that enables the routine assessment of proteoforms and is fully compatible with subsequent mass spectrometric analyses. Testing included various fractionation and non-fractionation approaches. The data show that resolving 500 µg protein on 17 cm 3-10 non-linear immobilised pH gradient strips in the first dimension followed by second dimension resolution on 7-20% gradient gels with a combination of lithium dodecyl sulfate (LDS) and sodium dodecyl sulfate (SDS) detergents markedly improves the resolution and detection of proteoforms in serum. In addition, well established third dimension electrophoretic separations in combination with deep imaging further contributed to the best available resolution, detection, and thus quantitative top-down analysis of serum proteomes.
Combining proteomics and metabolite analyses to unravel cadmium stress-response in poplar leaves.
Kieffer, Pol; Planchon, Sébastien; Oufir, Mouhssin; Ziebel, Johanna; Dommes, Jacques; Hoffmann, Lucien; Hausman, Jean-François; Renaut, Jenny
2009-01-01
A proteomic analysis of poplar leaves exposed to cadmium, combined with biochemical analysis of pigments and carbohydrates revealed changes in primary carbon metabolism. Proteomic results suggested that photosynthesis was slightly affected. Together with a growth inhibition, photoassimilates were less needed for developmental processes and could be stored in the form of hexoses or complex sugars, acting also as osmoprotectants. Simultaneously, mitochondrial respiration was upregulated, providing energy needs of cadmium-exposed plants.
Recent advances and opportunities in proteomic analyses of tumour heterogeneity.
Bateman, Nicholas W; Conrads, Thomas P
2018-04-01
Solid tumour malignancies comprise a highly variable admixture of tumour and non-tumour cellular populations, forming a complex cellular ecosystem and tumour microenvironment. This tumour heterogeneity is not incidental, and is known to correlate with poor patient prognosis for many cancer types. Indeed, non-malignant cell populations, such as vascular endothelial and immune cells, are known to play key roles supporting and, in some cases, driving aggressive tumour biology, and represent targets of emerging therapeutics, such as antiangiogenesis and immune checkpoint inhibitors. The biochemical interplay between these cellular populations and how they contribute to molecular tumour heterogeneity remains enigmatic, particularly from the perspective of the tumour proteome. This review focuses on recent advances in proteomic methods, namely imaging mass spectrometry, single-cell proteomic techniques, and preanalytical sample processing, that are uniquely positioned to enable detailed analysis of discrete cellular populations within tumours to improve our understanding of tumour proteomic heterogeneity. This review further emphasizes the opportunity afforded by the application of these techniques to the analysis of tumour heterogeneity in formalin-fixed paraffin-embedded archival tumour tissues, as these represent an invaluable resource for retrospective analyses that is now routinely accessible, owing to recent technological and methodological advances in tumour tissue proteomics. Copyright © 2018 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd. Copyright © 2018 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.
A Comprehensive Guide for Performing Sample Preparation and Top-Down Protein Analysis
Padula, Matthew P.; Berry, Iain J.; O′Rourke, Matthew B.; Raymond, Benjamin B.A.; Santos, Jerran; Djordjevic, Steven P.
2017-01-01
Methodologies for the global analysis of proteins in a sample, or proteome analysis, have been available since 1975 when Patrick O′Farrell published the first paper describing two-dimensional gel electrophoresis (2D-PAGE). This technique allowed the resolution of single protein isoforms, or proteoforms, into single ‘spots’ in a polyacrylamide gel, allowing the quantitation of changes in a proteoform′s abundance to ascertain changes in an organism′s phenotype when conditions change. In pursuit of the comprehensive profiling of the proteome, significant advances in technology have made the identification and quantitation of intact proteoforms from complex mixtures of proteins more routine, allowing analysis of the proteome from the ‘Top-Down’. However, the number of proteoforms detected by Top-Down methodologies such as 2D-PAGE or mass spectrometry has not significantly increased since O’Farrell’s paper when compared to Bottom-Up, peptide-centric techniques. This article explores and explains the numerous methodologies and technologies available to analyse the proteome from the Top-Down with a strong emphasis on the necessity to analyse intact proteoforms as a better indicator of changes in biology and phenotype. We arrive at the conclusion that the complete and comprehensive profiling of an organism′s proteome is still, at present, beyond our reach but the continuing evolution of protein fractionation techniques and mass spectrometry brings comprehensive Top-Down proteome profiling closer. PMID:28387712
A Comprehensive Guide for Performing Sample Preparation and Top-Down Protein Analysis.
Padula, Matthew P; Berry, Iain J; O Rourke, Matthew B; Raymond, Benjamin B A; Santos, Jerran; Djordjevic, Steven P
2017-04-07
Methodologies for the global analysis of proteins in a sample, or proteome analysis, have been available since 1975 when Patrick O'Farrell published the first paper describing two-dimensional gel electrophoresis (2D-PAGE). This technique allowed the resolution of single protein isoforms, or proteoforms, into single 'spots' in a polyacrylamide gel, allowing the quantitation of changes in a proteoform's abundance to ascertain changes in an organism's phenotype when conditions change. In pursuit of the comprehensive profiling of the proteome, significant advances in technology have made the identification and quantitation of intact proteoforms from complex mixtures of proteins more routine, allowing analysis of the proteome from the 'Top-Down'. However, the number of proteoforms detected by Top-Down methodologies such as 2D-PAGE or mass spectrometry has not significantly increased since O'Farrell's paper when compared to Bottom-Up, peptide-centric techniques. This article explores and explains the numerous methodologies and technologies available to analyse the proteome from the Top-Down with a strong emphasis on the necessity to analyse intact proteoforms as a better indicator of changes in biology and phenotype. We arrive at the conclusion that the complete and comprehensive profiling of an organism's proteome is still, at present, beyond our reach but the continuing evolution of protein fractionation techniques and mass spectrometry brings comprehensive Top-Down proteome profiling closer.
Koczorowska, Maria Magdalena; Friedemann, Charlotte; Geiger, Klaus; Follo, Marie; Biniossek, Martin Lothar; Schilling, Oliver
2017-12-01
Solid tumors contain various components that together form the tumor microenvironment. Cancer associated fibroblasts (CAFs) are capable of secreting and responding to signaling molecules and growth factors. Due to their role in tumor development, CAFs are considered as potential therapeutic targets. A prominent tumor-associated signaling molecule is transforming growth factor β (TGFβ), an inducer of epithelial-to-mesenchymal transition (EMT). The differential action of TGFβ on CAFs and ETCs (epithelial tumor cells) has recently gained interest. Here, we aimed to investigate the effects of TGFβ on CAFs and ETCs at the proteomic level. We established a 2D co-culture system of differentially fluorescently labeled CAFs and ETCs and stimulated this co-culture system with TGFβ. The respective cell types were separated using FACS and subjected to quantitative analyses of individual proteomes using mass spectrometry. We found that TGFβ treatment had a strong impact on the proteome composition of CAFs, whereas ETCs responded only marginally to TGFβ. Quantitative proteomic analyses of the different cell types revealed up-regulation of extracellular matrix (ECM) proteins in TGFβ treated CAFs. In addition, we found that the TGFβ treated CAFs exhibited increased N-cadherin levels. From our data we conclude that CAFs respond to TGFβ treatment by changing their proteome composition, while ETCs appear to be rather resilient.
Proteomic approaches in brain research and neuropharmacology.
Vercauteren, Freya G G; Bergeron, John J M; Vandesande, Frans; Arckens, Lut; Quirion, Rémi
2004-10-01
Numerous applications of genomic technologies have enabled the assembly of unprecedented inventories of genes, expressed in cells under specific physiological and pathophysiological conditions. Complementing the valuable information generated through functional genomics with the integrative knowledge of protein expression and function should enable the development of more efficient diagnostic tools and therapeutic agents. Proteomic analyses are particularly suitable to elucidate posttranslational modifications, expression levels and protein-protein interactions of thousands of proteins at a time. In this review, two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) investigations of brain tissues in neurodegenerative diseases such as Alzheimer's disease, Down syndrome and schizophrenia, and the construction of 2D-PAGE proteome maps of the brain are discussed. The role of the Human Proteome Organization (HUPO) as an international coordinating organization for proteomic efforts, as well as challenges for proteomic technologies and data analysis are also addressed. It is expected that the use of proteomic strategies will have significant impact in neuropharmacology over the coming decade.
The HUPO proteomics standards initiative--overcoming the fragmentation of proteomics data.
Hermjakob, Henning
2006-09-01
Proteomics is a key field of modern biomolecular research, with many small and large scale efforts producing a wealth of proteomics data. However, the vast majority of this data is never exploited to its full potential. Even in publicly funded projects, often the raw data generated in a specific context is analysed, conclusions are drawn and published, but little attention is paid to systematic documentation, archiving, and public access to the data supporting the scientific results. It is often difficult to validate the results stated in a particular publication, and even simple global questions like "In which cellular contexts has my protein of interest been observed?" can currently not be answered with realistic effort, due to a lack of standardised reporting and collection of proteomics data. The Proteomics Standards Initiative (PSI), a work group of the Human Proteome Organisation (HUPO), defines community standards for data representation in proteomics to facilitate systematic data capture, comparison, exchange and verification. In this article we provide an overview of PSI organisational structure, activities, and current results, as well as ways to get involved in the broad-based, open PSI process.
Proteomics in medical microbiology.
Cash, P
2000-04-01
The techniques of proteomics (high resolution two-dimensional electrophoresis and protein characterisation) are widely used for microbiological research to analyse global protein synthesis as an indicator of gene expression. The rapid progress in microbial proteomics has been achieved through the wide availability of whole genome sequences for a number of bacterial groups. Beyond providing a basic understanding of microbial gene expression, proteomics has also played a role in medical areas of microbiology. Progress has been made in the use of the techniques for investigating the epidemiology and taxonomy of human microbial pathogens, the identification of novel pathogenic mechanisms and the analysis of drug resistance. In each of these areas, proteomics has provided new insights that complement genomic-based investigations. This review describes the current progress in these research fields and highlights some of the technical challenges existing for the application of proteomics in medical microbiology. The latter concern the analysis of genetically heterogeneous bacterial populations and the integration of the proteomic and genomic data for these bacteria. The characterisation of the proteomes of bacterial pathogens growing in their natural hosts remains a future challenge.
Proteome Studies of Filamentous Fungi
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baker, Scott E.; Panisko, Ellen A.
2011-04-20
The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide breadth of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, non-gel basedmore » proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of different variations on the general method and technologies for identifying peptides in a given sample. We present a method that can serve as a “baseline” for proteomic studies of fungi.« less
Proteome studies of filamentous fungi.
Baker, Scott E; Panisko, Ellen A
2011-01-01
The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide variety of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, nongel-based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of variations on the general methods and technologies for identifying peptides in a given sample. We present a method that can serve as a "baseline" for proteomic studies of fungi.
Quantitative proteomics in biological research.
Wilm, Matthias
2009-10-01
Proteomics has enabled the direct investigation of biological material, at first through the analysis of individual proteins, then of lysates from cell cultures, and finally of extracts from tissues and biopsies from entire organisms. Its latest manifestation - quantitative proteomics - allows deeper insight into biological systems. This article reviews the different methods used to extract quantitative information from mass spectra. It follows the technical developments aimed toward global proteomics, the attempt to characterize every expressed protein in a cell by at least one peptide. When applications of the technology are discussed, the focus is placed on yeast biology. In particular, differential quantitative proteomics, the comparison between an experiment and its control, is very discriminating for proteins involved in the process being studied. When trying to understand biological processes on a molecular level, differential quantitative proteomics tends to give a clearer picture than global transcription analyses. As a result, MS has become an even more indispensable tool for biochemically motivated biological research.
Single-cell-type Proteomics: Toward a Holistic Understanding of Plant Function*
Dai, Shaojun; Chen, Sixue
2012-01-01
Multicellular organisms such as plants contain different types of cells with specialized functions. Analyzing the protein characteristics of each type of cell will not only reveal specific cell functions, but also enhance understanding of how an organism works. Most plant proteomics studies have focused on using tissues and organs containing a mixture of different cells. Recent single-cell-type proteomics efforts on pollen grains, guard cells, mesophyll cells, root hairs, and trichomes have shown utility. We expect that high resolution proteomic analyses will reveal novel functions in single cells. This review provides an overview of recent developments in plant single-cell-type proteomics. We discuss application of the approach for understanding important cell functions, and we consider the technical challenges of extending the approach to all plant cell types. Finally, we consider the integration of single-cell-type proteomics with transcriptomics and metabolomics with the goal of providing a holistic understanding of plant function. PMID:22982375
Exploratory research into pathogen surface interactions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sinclair, Michael B.; Lane, Todd W.; Jones, Howland D. T.
2006-02-01
In this short-duration project the research team was able to achieve growth of both drinking water biofilms and monospecific biofilms of Legionella pneurnophila. Preliminary comparative proteomic analyses were carried out on planktonic and biofilm-associated Legionella. After delay for completion of permitting and review by the director of the National Institutes for Allergy and Infectious Disease, the Utah 112 strain of Francisella novicida was obtained and preliminary culture and comparative proteomic analyses were carried out. Comprehensive literature searches and data mining were carried out on all research topics.
Trends in mass spectrometry instrumentation for proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, Richard D.
2002-12-01
Mass spectrometry has become a primary tool for proteomics due to its capabilities for rapid and sensitive protein identification and quantitation. It is now possible to identify thousands of proteins from microgram sample quantities in a single day and to quantify relative protein abundances. However, the needs for increased capabilities for proteome measurements are immense and are now driving both new strategies and instrument advances. These developments include those based on integration with multi-dimensional liquid separations and high accuracy mass measurements, and promise more than order of magnitude improvements in sensitivity, dynamic range, and throughput for proteomic analyses in themore » near future.« less
Advances in Proteomics Data Analysis and Display Using an Accurate Mass and Time Tag Approach
Zimmer, Jennifer S.D.; Monroe, Matthew E.; Qian, Wei-Jun; Smith, Richard D.
2007-01-01
Proteomics has recently demonstrated utility in understanding cellular processes on the molecular level as a component of systems biology approaches and for identifying potential biomarkers of various disease states. The large amount of data generated by utilizing high efficiency (e.g., chromatographic) separations coupled to high mass accuracy mass spectrometry for high-throughput proteomics analyses presents challenges related to data processing, analysis, and display. This review focuses on recent advances in nanoLC-FTICR-MS-based proteomics approaches and the accompanying data processing tools that have been developed to display and interpret the large volumes of data being produced. PMID:16429408
Computational approaches to protein inference in shotgun proteomics
2012-01-01
Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in a sample has been recognized as an important computational task. This task generally consists of (1) assigning experimental tandem mass spectra to peptides derived from a protein database, and (2) mapping assigned peptides to proteins and quantifying the confidence of identified proteins. Protein identification is fundamentally a statistical inference problem with a number of methods proposed to address its challenges. In this review we categorize current approaches into rule-based, combinatorial optimization and probabilistic inference techniques, and present them using integer programing and Bayesian inference frameworks. We also discuss the main challenges of protein identification and propose potential solutions with the goal of spurring innovative research in this area. PMID:23176300
FunRich proteomics software analysis, let the fun begin!
Benito-Martin, Alberto; Peinado, Héctor
2015-08-01
Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Why proteomics is not the new genomics and the future of mass spectrometry in cell biology.
Sidoli, Simone; Kulej, Katarzyna; Garcia, Benjamin A
2017-01-02
Mass spectrometry (MS) is an essential part of the cell biologist's proteomics toolkit, allowing analyses at molecular and system-wide scales. However, proteomics still lag behind genomics in popularity and ease of use. We discuss key differences between MS-based -omics and other booming -omics technologies and highlight what we view as the future of MS and its role in our increasingly deep understanding of cell biology. © 2017 Sidoli et al.
Helsens, Kenny; Colaert, Niklaas; Barsnes, Harald; Muth, Thilo; Flikka, Kristian; Staes, An; Timmerman, Evy; Wortelkamp, Steffi; Sickmann, Albert; Vandekerckhove, Joël; Gevaert, Kris; Martens, Lennart
2010-03-01
MS-based proteomics produces large amounts of mass spectra that require processing, identification and possibly quantification before interpretation can be undertaken. High-throughput studies require automation of these various steps, and management of the data in association with the results obtained. We here present ms_lims (http://genesis.UGent.be/ms_lims), a freely available, open-source system based on a central database to automate data management and processing in MS-driven proteomics analyses.
USDA-ARS?s Scientific Manuscript database
The study of desiccation tolerance of lichens, and of their photobionts in particular, has frequently focused on the antioxidant system that protects the cell against photo-oxidative stress during dehydration/rehydration cycles. Thus, in this work we carried out proteomic and transcript analyses of ...
Cuadrat, Rafael R C; da Serra Cruz, Sérgio Manuel; Tschoeke, Diogo Antônio; Silva, Edno; Tosta, Frederico; Jucá, Henrique; Jardim, Rodrigo; Campos, Maria Luiza M; Mattoso, Marta; Dávila, Alberto M R
2014-08-01
A key focus in 21(st) century integrative biology and drug discovery for neglected tropical and other diseases has been the use of BLAST-based computational methods for identification of orthologous groups in pathogenic organisms to discern orthologs, with a view to evaluate similarities and differences among species, and thus allow the transfer of annotation from known/curated proteins to new/non-annotated ones. We used here a profile-based sensitive methodology to identify distant homologs, coupled to the NCBI's COG (Unicellular orthologs) and KOG (Eukaryote orthologs), permitting us to perform comparative genomics analyses on five protozoan genomes. OrthoSearch was used in five protozoan proteomes showing that 3901 and 7473 orthologs can be identified by comparison with COG and KOG proteomes, respectively. The core protozoa proteome inferred was 418 Protozoa-COG orthologous groups and 704 Protozoa-KOG orthologous groups: (i) 31.58% (132/418) belongs to the category J (translation, ribosomal structure, and biogenesis), and 9.81% (41/418) to the category O (post-translational modification, protein turnover, chaperones) using COG; (ii) 21.45% (151/704) belongs to the categories J, and 13.92% (98/704) to the O using KOG. The phylogenomic analysis showed four well-supported clades for Eukarya, discriminating Multicellular [(i) human, fly, plant and worm] and Unicellular [(ii) yeast, (iii) fungi, and (iv) protozoa] species. These encouraging results attest to the usefulness of the profile-based methodology for comparative genomics to accelerate semi-automatic re-annotation, especially of the protozoan proteomes. This approach may also lend itself for applications in global health, for example, in the case of novel drug target discovery against pathogenic organisms previously considered difficult to research with traditional drug discovery tools.
Cuadrat, Rafael R. C.; da Serra Cruz, Sérgio Manuel; Tschoeke, Diogo Antônio; Silva, Edno; Tosta, Frederico; Jucá, Henrique; Jardim, Rodrigo; Campos, Maria Luiza M.; Mattoso, Marta
2014-01-01
Abstract A key focus in 21st century integrative biology and drug discovery for neglected tropical and other diseases has been the use of BLAST-based computational methods for identification of orthologous groups in pathogenic organisms to discern orthologs, with a view to evaluate similarities and differences among species, and thus allow the transfer of annotation from known/curated proteins to new/non-annotated ones. We used here a profile-based sensitive methodology to identify distant homologs, coupled to the NCBI's COG (Unicellular orthologs) and KOG (Eukaryote orthologs), permitting us to perform comparative genomics analyses on five protozoan genomes. OrthoSearch was used in five protozoan proteomes showing that 3901 and 7473 orthologs can be identified by comparison with COG and KOG proteomes, respectively. The core protozoa proteome inferred was 418 Protozoa-COG orthologous groups and 704 Protozoa-KOG orthologous groups: (i) 31.58% (132/418) belongs to the category J (translation, ribosomal structure, and biogenesis), and 9.81% (41/418) to the category O (post-translational modification, protein turnover, chaperones) using COG; (ii) 21.45% (151/704) belongs to the categories J, and 13.92% (98/704) to the O using KOG. The phylogenomic analysis showed four well-supported clades for Eukarya, discriminating Multicellular [(i) human, fly, plant and worm] and Unicellular [(ii) yeast, (iii) fungi, and (iv) protozoa] species. These encouraging results attest to the usefulness of the profile-based methodology for comparative genomics to accelerate semi-automatic re-annotation, especially of the protozoan proteomes. This approach may also lend itself for applications in global health, for example, in the case of novel drug target discovery against pathogenic organisms previously considered difficult to research with traditional drug discovery tools. PMID:24960463
Anopheles salivary gland proteomes from major malaria vectors
2012-01-01
Background Antibody responses against Anopheles salivary proteins can indicate individual exposure to bites of malaria vectors. The extent to which these salivary proteins are species-specific is not entirely resolved. Thus, a better knowledge of the diversity among salivary protein repertoires from various malaria vector species is necessary to select relevant genus-, subgenus- and/or species-specific salivary antigens. Such antigens could be used for quantitative (mosquito density) and qualitative (mosquito species) immunological evaluation of malaria vectors/host contact. In this study, salivary gland protein repertoires (sialomes) from several Anopheles species were compared using in silico analysis and proteomics. The antigenic diversity of salivary gland proteins among different Anopheles species was also examined. Results In silico analysis of secreted salivary gland protein sequences retrieved from an NCBInr database of six Anopheles species belonging to the Cellia subgenus (An. gambiae, An. arabiensis, An. stephensi and An. funestus) and Nyssorhynchus subgenus (An. albimanus and An. darlingi) displayed a higher degree of similarity compared to salivary proteins from closely related Anopheles species. Additionally, computational hierarchical clustering allowed identification of genus-, subgenus- and species-specific salivary proteins. Proteomic and immunoblot analyses performed on salivary gland extracts from four Anopheles species (An. gambiae, An. arabiensis, An. stephensi and An. albimanus) indicated that heterogeneity of the salivary proteome and antigenic proteins was lower among closely related anopheline species and increased with phylogenetic distance. Conclusion This is the first report on the diversity of the salivary protein repertoire among species from the Anopheles genus at the protein level. This work demonstrates that a molecular diversity is exhibited among salivary proteins from closely related species despite their common pharmacological activities. The involvement of these proteins as antigenic candidates for genus-, subgenus- or species-specific immunological evaluation of individual exposure to Anopheles bites is discussed. PMID:23148599
Integrated network analysis and effective tools in plant systems biology
Fukushima, Atsushi; Kanaya, Shigehiko; Nishida, Kozo
2014-01-01
One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms. PMID:25408696
Hashimoto, Masakazu; Bogdanovic, Nenad; Nakagawa, Hiroyuki; Volkmann, Inga; Aoki, Mikio; Winblad, Bengt; Sakai, Jun; Tjernberg, Lars O
2012-01-01
Abstract It is evident that the symptoms of Alzheimer's disease (AD) are derived from severe neuronal damage, and especially pyramidal neurons in the hippocampus are affected pathologically. Here, we analysed the proteome of hippocampal neurons, isolated from post-mortem brains by laser capture microdissection. By using 18O labelling and mass spectrometry, the relative expression levels of 150 proteins in AD and controls were estimated. Many of the identified proteins are involved in transcription and nucleotide binding, glycolysis, heat-shock response, microtubule stabilization, axonal transport or inflammation. The proteins showing the most altered expression in AD were selected for immunohistochemical analysis. These analyses confirmed the altered expression levels, and showed in many AD cases a pathological pattern. For comparison, we also analysed hippocampal sections by Western blot. The expression levels found by this method showed poor correlation with the neuron-specific analysis. Hence, we conclude that cell-specific proteome analysis reveals differences in the proteome that cannot be detected by bulk analysis. PMID:21883897
Ceballos-Laita, Laura; Gutierrez-Carbonell, Elain; Takahashi, Daisuke; Abadía, Anunciación; Uemura, Matsuo; Abadía, Javier; López-Millán, Ana Flor
2018-04-01
This article contains consolidated proteomic data obtained from xylem sap collected from tomato plants grown in Fe- and Mn-sufficient control, as well as Fe-deficient and Mn-deficient conditions. Data presented here cover proteins identified and quantified by shotgun proteomics and Progenesis LC-MS analyses: proteins identified with at least two peptides and showing changes statistically significant (ANOVA; p ≤ 0.05) and above a biologically relevant selected threshold (fold ≥ 2) between treatments are listed. The comparison between Fe-deficient, Mn-deficient and control xylem sap samples using a multivariate statistical data analysis (Principal Component Analysis, PCA) is also included. Data included in this article are discussed in depth in the research article entitled "Effects of Fe and Mn deficiencies on the protein profiles of tomato ( Solanum lycopersicum) xylem sap as revealed by shotgun analyses" [1]. This dataset is made available to support the cited study as well to extend analyses at a later stage.
Cohen Freue, Gabriela V.; Meredith, Anna; Smith, Derek; Bergman, Axel; Sasaki, Mayu; Lam, Karen K. Y.; Hollander, Zsuzsanna; Opushneva, Nina; Takhar, Mandeep; Lin, David; Wilson-McManus, Janet; Balshaw, Robert; Keown, Paul A.; Borchers, Christoph H.; McManus, Bruce; Ng, Raymond T.; McMaster, W. Robert
2013-01-01
Recent technical advances in the field of quantitative proteomics have stimulated a large number of biomarker discovery studies of various diseases, providing avenues for new treatments and diagnostics. However, inherent challenges have limited the successful translation of candidate biomarkers into clinical use, thus highlighting the need for a robust analytical methodology to transition from biomarker discovery to clinical implementation. We have developed an end-to-end computational proteomic pipeline for biomarkers studies. At the discovery stage, the pipeline emphasizes different aspects of experimental design, appropriate statistical methodologies, and quality assessment of results. At the validation stage, the pipeline focuses on the migration of the results to a platform appropriate for external validation, and the development of a classifier score based on corroborated protein biomarkers. At the last stage towards clinical implementation, the main aims are to develop and validate an assay suitable for clinical deployment, and to calibrate the biomarker classifier using the developed assay. The proposed pipeline was applied to a biomarker study in cardiac transplantation aimed at developing a minimally invasive clinical test to monitor acute rejection. Starting with an untargeted screening of the human plasma proteome, five candidate biomarker proteins were identified. Rejection-regulated proteins reflect cellular and humoral immune responses, acute phase inflammatory pathways, and lipid metabolism biological processes. A multiplex multiple reaction monitoring mass-spectrometry (MRM-MS) assay was developed for the five candidate biomarkers and validated by enzyme-linked immune-sorbent (ELISA) and immunonephelometric assays (INA). A classifier score based on corroborated proteins demonstrated that the developed MRM-MS assay provides an appropriate methodology for an external validation, which is still in progress. Plasma proteomic biomarkers of acute cardiac rejection may offer a relevant post-transplant monitoring tool to effectively guide clinical care. The proposed computational pipeline is highly applicable to a wide range of biomarker proteomic studies. PMID:23592955
Chabi, Malika; Goulas, Estelle; Leclercq, Celine C; de Waele, Isabelle; Rihouey, Christophe; Cenci, Ugo; Day, Arnaud; Blervacq, Anne-Sophie; Neutelings, Godfrey; Duponchel, Ludovic; Lerouge, Patrice; Hausman, Jean-François; Renaut, Jenny; Hawkins, Simon
2017-09-01
Experimentally-generated (nanoLC-MS/MS) proteomic analyses of four different flax organs/tissues (inner-stem, outer-stem, leaves and roots) enriched in proteins from 3 different sub-compartments (soluble-, membrane-, and cell wall-proteins) was combined with publically available data on flax seed and whole-stem proteins to generate a flax protein database containing 2996 nonredundant total proteins. Subsequent multiple analyses (MapMan, CAZy, WallProtDB and expert curation) of this database were then used to identify a flax cell wall proteome consisting of 456 nonredundant proteins localized in the cell wall and/or associated with cell wall biosynthesis, remodeling and other cell wall related processes. Examination of the proteins present in different flax organs/tissues provided a detailed overview of cell wall metabolism and highlighted the importance of hemicellulose and pectin remodeling in stem tissues. Phylogenetic analyses of proteins in the cell wall proteome revealed an important paralogy in the class IIIA xyloglucan endo-transglycosylase/hydrolase (XTH) family associated with xyloglucan endo-hydrolase activity.Immunolocalisation, FT-IR microspectroscopy, and enzymatic fingerprinting indicated that flax fiber primary/S1 cell walls contained xyloglucans with typical substituted side chains as well as glucuronoxylans in much lower quantities. These results suggest a likely central role of xyloglucans and endotransglucosylase/hydrolase activity in flax fiber formation and cell wall remodeling processes. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
HTAPP: High-Throughput Autonomous Proteomic Pipeline
Yu, Kebing; Salomon, Arthur R.
2011-01-01
Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic datasets is critically important. The high-throughput autonomous proteomic pipeline (HTAPP) described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is comprised of software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of HTAPP focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples. PMID:20336676
ProCon - PROteomics CONversion tool.
Mayer, Gerhard; Stephan, Christian; Meyer, Helmut E; Kohl, Michael; Marcus, Katrin; Eisenacher, Martin
2015-11-03
With the growing amount of experimental data produced in proteomics experiments and the requirements/recommendations of journals in the proteomics field to publicly make available data described in papers, a need for long-term storage of proteomics data in public repositories arises. For such an upload one needs proteomics data in a standardized format. Therefore, it is desirable, that the proprietary vendor's software will integrate in the future such an export functionality using the standard formats for proteomics results defined by the HUPO-PSI group. Currently not all search engines and analysis tools support these standard formats. In the meantime there is a need to provide user-friendly free-to-use conversion tools that can convert the data into such standard formats in order to support wet-lab scientists in creating proteomics data files ready for upload into the public repositories. ProCon is such a conversion tool written in Java for conversion of proteomics identification data into standard formats mzIdentML and Pride XML. It allows the conversion of Sequest™/Comet .out files, of search results from the popular and often used ProteomeDiscoverer® 1.x (x=versions 1.1 to1.4) software and search results stored in the LIMS systems ProteinScape® 1.3 and 2.1 into mzIdentML and PRIDE XML. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
Wimmer, Helge; Gundacker, Nina C; Griss, Johannes; Haudek, Verena J; Stättner, Stefan; Mohr, Thomas; Zwickl, Hannes; Paulitschke, Verena; Baron, David M; Trittner, Wolfgang; Kubicek, Markus; Bayer, Editha; Slany, Astrid; Gerner, Christopher
2009-06-01
Interpretation of proteome data with a focus on biomarker discovery largely relies on comparative proteome analyses. Here, we introduce a database-assisted interpretation strategy based on proteome profiles of primary cells. Both 2-D-PAGE and shotgun proteomics are applied. We obtain high data concordance with these two different techniques. When applying mass analysis of tryptic spot digests from 2-D gels of cytoplasmic fractions, we typically identify several hundred proteins. Using the same protein fractions, we usually identify more than thousand proteins by shotgun proteomics. The data consistency obtained when comparing these independent data sets exceeds 99% of the proteins identified in the 2-D gels. Many characteristic differences in protein expression of different cells can thus be independently confirmed. Our self-designed SQL database (CPL/MUW - database of the Clinical Proteomics Laboratories at the Medical University of Vienna accessible via www.meduniwien.ac.at/proteomics/database) facilitates (i) quality management of protein identification data, which are based on MS, (ii) the detection of cell type-specific proteins and (iii) of molecular signatures of specific functional cell states. Here, we demonstrate, how the interpretation of proteome profiles obtained from human liver tissue and hepatocellular carcinoma tissue is assisted by the Clinical Proteomics Laboratories at the Medical University of Vienna-database. Therefore, we suggest that the use of reference experiments supported by a tailored database may substantially facilitate data interpretation of proteome profiling experiments.
The MaxQuant computational platform for mass spectrometry-based shotgun proteomics.
Tyanova, Stefka; Temu, Tikira; Cox, Juergen
2016-12-01
MaxQuant is one of the most frequently used platforms for mass-spectrometry (MS)-based proteomics data analysis. Since its first release in 2008, it has grown substantially in functionality and can be used in conjunction with more MS platforms. Here we present an updated protocol covering the most important basic computational workflows, including those designed for quantitative label-free proteomics, MS1-level labeling and isobaric labeling techniques. This protocol presents a complete description of the parameters used in MaxQuant, as well as of the configuration options of its integrated search engine, Andromeda. This protocol update describes an adaptation of an existing protocol that substantially modifies the technique. Important concepts of shotgun proteomics and their implementation in MaxQuant are briefly reviewed, including different quantification strategies and the control of false-discovery rates (FDRs), as well as the analysis of post-translational modifications (PTMs). The MaxQuant output tables, which contain information about quantification of proteins and PTMs, are explained in detail. Furthermore, we provide a short version of the workflow that is applicable to data sets with simple and standard experimental designs. The MaxQuant algorithms are efficiently parallelized on multiple processors and scale well from desktop computers to servers with many cores. The software is written in C# and is freely available at http://www.maxquant.org.
Parsing and Quantification of Raw Orbitrap Mass Spectrometer Data Using RawQuant.
Kovalchik, Kevin A; Moggridge, Sophie; Chen, David D Y; Morin, Gregg B; Hughes, Christopher S
2018-06-01
Effective analysis of protein samples by mass spectrometry (MS) requires careful selection and optimization of a range of experimental parameters. As the output from the primary detection device, the "raw" MS data file can be used to gauge the success of a given sample analysis. However, the closed-source nature of the standard raw MS file can complicate effective parsing of the data contained within. To ease and increase the range of analyses possible, the RawQuant tool was developed to enable parsing of raw MS files derived from Thermo Orbitrap instruments to yield meta and scan data in an openly readable text format. RawQuant can be commanded to export user-friendly files containing MS 1 , MS 2 , and MS 3 metadata as well as matrices of quantification values based on isobaric tagging approaches. In this study, the utility of RawQuant is demonstrated in several scenarios: (1) reanalysis of shotgun proteomics data for the identification of the human proteome, (2) reanalysis of experiments utilizing isobaric tagging for whole-proteome quantification, and (3) analysis of a novel bacterial proteome and synthetic peptide mixture for assessing quantification accuracy when using isobaric tags. Together, these analyses successfully demonstrate RawQuant for the efficient parsing and quantification of data from raw Thermo Orbitrap MS files acquired in a range of common proteomics experiments. In addition, the individual analyses using RawQuant highlights parametric considerations in the different experimental sets and suggests targetable areas to improve depth of coverage in identification-focused studies and quantification accuracy when using isobaric tags.
Henze, Andrea; Aumer, Franziska; Grabner, Arthur; Raila, Jens; Schweigert, Florian J
2011-10-01
Although horses and donkeys belong to the same genus, their genetic characteristics probably result in specific proteomes and post-translational modifications (PTM) of proteins. Since PTM can alter protein properties, specific PTM may contribute to species-specific characteristics. Therefore, the aim of the present study was to analyse differences in serum protein profiles of horses and donkeys as well as mules, which combine the genetic backgrounds of both species. Additionally, changes in PTM of the protein transthyretin (TTR) were analysed. Serum protein profiles of each species (five animals per species) were determined using strong anion exchanger ProteinChips® (Bio-Rad, Munich, Germany) in combination with surface-enhanced laser desorption ionisation-time of flight MS. The PTM of TTR were analysed subsequently by immunoprecipitation in combination with matrix-assisted laser desorption ionisation-time of flight MS. Protein profiling revealed species-specific differences in the proteome, with some protein peaks present in all three species as well as protein peaks that were unique for donkeys and mules, horses and mules or for horses alone. The molecular weight of TTR of horses and donkeys differed by 30 Da, and both species revealed several modified forms of TTR besides the native form. The mass spectra of mules represented a merging of TTR spectra of horses and donkeys. In summary, the present study indicated that there are substantial differences in the proteome of horses and donkeys. Additionally, the results probably indicate that the proteome of mules reveal a higher similarity to donkeys than to horses.
Wesseling, Hendrik; Guest, Paul C; Lago, Santiago G; Bahn, Sabine
2014-08-01
Proteomic studies have increased our understanding of the molecular pathways affected in psychiatric disorders. Mass spectrometry and two-dimensional gel electrophoresis analyses of post-mortem brain samples from psychiatric patients have revealed effects on synaptic, cytoskeletal, antioxidant and mitochondrial protein networks. Multiplex immunoassay profiling studies have found alterations in hormones, growth factors, transport and inflammation-related proteins in serum and plasma from living first-onset patients. Despite these advances, there are still difficulties in translating these findings into platforms for improved treatment of patients and for discovery of new drugs with better efficacy and side effect profiles. This review describes how the next phase of proteomic investigations in psychiatry should include stringent replication studies for validation of biomarker candidates and functional follow-up studies which can be used to test the impact on physiological function. All biomarker candidates should now be tested in series with traditional and emerging cell biological approaches. This should include investigations of the effects of post-translational modifications, protein dynamics and network analyses using targeted proteomic approaches. Most importantly, there is still an urgent need for development of disease-relevant cellular models for improved translation of proteomic findings into a means of developing novel drug treatments for patients with these life-altering disorders.
Global Analysis of Salmonella Alternative Sigma Factor E on Protein Translation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Jie; Nakayasu, Ernesto S.; Overall, Christopher C.
The alternative sigma factor E (σ E) is critical for response to extracytoplasmic stress in Salmonella. Extensive studies have been conducted on σ E-regulated gene expression, particularly at the transcriptional level. Increasing evidence suggests however that σ E may indirectly participate in post-transcriptional regulation. Here in this study, we conducted sample-matched global proteomic and transcriptomic analyses to determine the level of regulation mediated by σ E in Salmonella. We analysed samples from wild type and isogenic rpoE mutant Salmonella cultivated in three different conditions; nutrient-rich and conditions that mimic early and late intracellular infection. We found that 30% of themore » observed proteome was regulated by σ E combining all three conditions. In different growth conditions, σ E affected the expression of a broad spectrum of Salmonella proteins required for miscellaneous functions. Those involved in transport and binding, protein synthesis, and stress response were particularly highlighted. By comparing transcriptomic and proteomic data, we identified genes post-transcriptionally regulated by σ E and found that post-transcriptional regulation was responsible for a majority of changes observed in the σ E-regulated proteome. Further, comparison of transcriptomic and proteomic data from hfq mutant of Salmonella demonstrated that σ E–mediated post-transcriptional regulation was partially dependent on the RNA-binding protein Hfq.« less
Global Analysis of Salmonella Alternative Sigma Factor E on Protein Translation
Li, Jie; Nakayasu, Ernesto S.; Overall, Christopher C.; ...
2015-02-16
The alternative sigma factor E (σ E) is critical for response to extracytoplasmic stress in Salmonella. Extensive studies have been conducted on σ E-regulated gene expression, particularly at the transcriptional level. Increasing evidence suggests however that σ E may indirectly participate in post-transcriptional regulation. Here in this study, we conducted sample-matched global proteomic and transcriptomic analyses to determine the level of regulation mediated by σ E in Salmonella. We analysed samples from wild type and isogenic rpoE mutant Salmonella cultivated in three different conditions; nutrient-rich and conditions that mimic early and late intracellular infection. We found that 30% of themore » observed proteome was regulated by σ E combining all three conditions. In different growth conditions, σ E affected the expression of a broad spectrum of Salmonella proteins required for miscellaneous functions. Those involved in transport and binding, protein synthesis, and stress response were particularly highlighted. By comparing transcriptomic and proteomic data, we identified genes post-transcriptionally regulated by σ E and found that post-transcriptional regulation was responsible for a majority of changes observed in the σ E-regulated proteome. Further, comparison of transcriptomic and proteomic data from hfq mutant of Salmonella demonstrated that σ E–mediated post-transcriptional regulation was partially dependent on the RNA-binding protein Hfq.« less
Albalat, Amaya; Husi, Holger; Siwy, Justyna; Nally, Jarlath E; McLauglin, Mark; Eckersall, Peter D; Mullen, William
2014-02-01
Proteomics is a growing field that has the potential to be applied to many biology-related disciplines. However, the study of the proteome has proven to be very challenging due to its high level of complexity when compared to genome and transcriptome data. In order to analyse this level of complexity, high resolution separation of peptides/proteins are needed together with high resolution analysers. Currently, liquid chromatography and capillary electrophoresis (CE) are the two most widely used separation techniques that can be coupled on-line with a mass spectrometer (MS). In CE, proteins/ peptides are separated according to their size, charge and shape leading to high resolving power. Although further progress in the area of sensitivity, throughput and proteome coverage are expected, MS-based proteomics have developed to a level at which they are habitually applied to study a wide range of biological questions. The aim of this review is to present CE-MS as a proteomic analytical platform for biomarker research that could be used in farm animal and veterinary studies. This is a MS-analytical platform that has been widely used for biomarker research in the biomedical field but its application in animal proteomic studies is relatively novel. The review will focus on introducing the CE-MS platform and the primary considerations for its application to biomarker research. Furthermore, current applications but more importantly potential application in the field of farm animals and veterinary science will be presented and discussed.
Identification of new intrinsic proteins in Arabidopsis plasma membrane proteome.
Marmagne, Anne; Rouet, Marie-Aude; Ferro, Myriam; Rolland, Norbert; Alcon, Carine; Joyard, Jacques; Garin, Jérome; Barbier-Brygoo, Hélène; Ephritikhine, Geneviève
2004-07-01
Identification and characterization of anion channel genes in plants represent a goal for a better understanding of their central role in cell signaling, osmoregulation, nutrition, and metabolism. Though channel activities have been well characterized in plasma membrane by electrophysiology, the corresponding molecular entities are little documented. Indeed, the hydrophobic protein equipment of plant plasma membrane still remains largely unknown, though several proteomic approaches have been reported. To identify new putative transport systems, we developed a new proteomic strategy based on mass spectrometry analyses of a plasma membrane fraction enriched in hydrophobic proteins. We produced from Arabidopsis cell suspensions a highly purified plasma membrane fraction and characterized it in detail by immunological and enzymatic tests. Using complementary methods for the extraction of hydrophobic proteins and mass spectrometry analyses on mono-dimensional gels, about 100 proteins have been identified, 95% of which had never been found in previous proteomic studies. The inventory of the plasma membrane proteome generated by this approach contains numerous plasma membrane integral proteins, one-third displaying at least four transmembrane segments. The plasma membrane localization was confirmed for several proteins, therefore validating such proteomic strategy. An in silico analysis shows a correlation between the putative functions of the identified proteins and the expected roles for plasma membrane in transport, signaling, cellular traffic, and metabolism. This analysis also reveals 10 proteins that display structural properties compatible with transport functions and will constitute interesting targets for further functional studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Matzke, Melissa M.; Brown, Joseph N.; Gritsenko, Marina A.
2013-02-01
Liquid chromatography coupled with mass spectrometry (LC-MS) is widely used to identify and quantify peptides in complex biological samples. In particular, label-free shotgun proteomics is highly effective for the identification of peptides and subsequently obtaining a global protein profile of a sample. As a result, this approach is widely used for discovery studies. Typically, the objective of these discovery studies is to identify proteins that are affected by some condition of interest (e.g. disease, exposure). However, for complex biological samples, label-free LC-MS proteomics experiments measure peptides and do not directly yield protein quantities. Thus, protein quantification must be inferred frommore » one or more measured peptides. In recent years, many computational approaches to relative protein quantification of label-free LC-MS data have been published. In this review, we examine the most commonly employed quantification approaches to relative protein abundance from peak intensity values, evaluate their individual merits, and discuss challenges in the use of the various computational approaches.« less
Establishing Substantial Equivalence: Proteomics
NASA Astrophysics Data System (ADS)
Lovegrove, Alison; Salt, Louise; Shewry, Peter R.
Wheat is a major crop in world agriculture and is consumed after processing into a range of food products. It is therefore of great importance to determine the consequences (intended and unintended) of transgenesis in wheat and whether genetically modified lines are substantially equivalent to those produced by conventional plant breeding. Proteomic analysis is one of several approaches which can be used to address these questions. Two-dimensional PAGE (2D PAGE) remains the most widely available method for proteomic analysis, but is notoriously difficult to reproduce between laboratories. We therefore describe methods which have been developed as standard operating procedures in our laboratory to ensure the reproducibility of proteomic analyses of wheat using 2D PAGE analysis of grain proteins.
Comparative bioinformatics analyses and profiling of lysosome-related organelle proteomes
NASA Astrophysics Data System (ADS)
Hu, Zhang-Zhi; Valencia, Julio C.; Huang, Hongzhan; Chi, An; Shabanowitz, Jeffrey; Hearing, Vincent J.; Appella, Ettore; Wu, Cathy
2007-01-01
Complete and accurate profiling of cellular organelle proteomes, while challenging, is important for the understanding of detailed cellular processes at the organelle level. Mass spectrometry technologies coupled with bioinformatics analysis provide an effective approach for protein identification and functional interpretation of organelle proteomes. In this study, we have compiled human organelle reference datasets from large-scale proteomic studies and protein databases for seven lysosome-related organelles (LROs), as well as the endoplasmic reticulum and mitochondria, for comparative organelle proteome analysis. Heterogeneous sources of human organelle proteins and rodent homologs are mapped to human UniProtKB protein entries based on ID and/or peptide mappings, followed by functional annotation and categorization using the iProXpress proteomic expression analysis system. Cataloging organelle proteomes allows close examination of both shared and unique proteins among various LROs and reveals their functional relevance. The proteomic comparisons show that LROs are a closely related family of organelles. The shared proteins indicate the dynamic and hybrid nature of LROs, while the unique transmembrane proteins may represent additional candidate marker proteins for LROs. This comparative analysis, therefore, provides a basis for hypothesis formulation and experimental validation of organelle proteins and their functional roles.
Top-down Proteomics in Health and Disease: Challenges and Opportunities
Gregorich, Zachery R.; Ge, Ying
2014-01-01
Proteomics is essential for deciphering how molecules interact as a system and for understanding the functions of cellular systems in human disease; however, the unique characteristics of the human proteome, which include a high dynamic range of protein expression and extreme complexity due to a plethora of post-translational modifications (PTMs) and sequence variations, make such analyses challenging. An emerging “top-down” mass spectrometry (MS)-based proteomics approach, which provides a “bird’s eye” view of all proteoforms, has unique advantages for the assessment of PTMs and sequence variations. Recently, a number of studies have showcased the potential of top-down proteomics for unraveling of disease mechanisms and discovery of new biomarkers. Nevertheless, the top-down approach still faces significant challenges in terms of protein solubility, separation, and the detection of large intact proteins, as well as the under-developed data analysis tools. Consequently, new technological developments are urgently needed to advance the field of top-down proteomics. Herein, we intend to provide an overview of the recent applications of top-down proteomics in biomedical research. Moreover, we will outline the challenges and opportunities facing top-down proteomics strategies aimed at understanding and diagnosing human diseases. PMID:24723472
HUPO BPP Workshop on Mouse Models for Neurodegeneration--Choosing the right models.
Hamacher, Michael; Marcus, Katrin; Stephan, Christian; van Hall, Andre; Meyer, Helmut E
2005-09-01
The HUPO Brain Proteome Project met during the 4th Dutch Endo-Neuro-Psycho Meeting in Doorwerth, The Netherlands, on June 1, 2005, in order to discuss appropriate (mouse) models for neurodegenerative diseases as well as to conceptualise sophisticated proteomics analyses strategies. Here, the topics of the meeting are summarised.
A Targeted MRM Approach for Tempo-Spatial Proteomics Analyses.
Moradian, Annie; Porras-Yakushi, Tanya R; Sweredoski, Michael J; Hess, Sonja
2016-01-01
When deciding to perform a quantitative proteomics analysis, selectivity, sensitivity, and reproducibility are important criteria to consider. The use of multiple reaction monitoring (MRM) has emerged as a powerful proteomics technique in that regard since it avoids many of the problems typically observed in discovery-based analyses. A prerequisite for such a targeted approach is that the protein targets are known, either as a result of previous global proteomics experiments or because a specific hypothesis is to be tested. When guidelines that have been established in the pharmaceutical industry many decades ago are taken into account, setting up an MRM assay is relatively straightforward. Typically, proteotypic peptides with favorable mass spectrometric properties are synthesized with a heavy isotope for each protein that is to be monitored. Retention times and calibration curves are determined using triple-quadrupole mass spectrometers. The use of iRT peptide standards is both recommended and fully integrated into the bioinformatics pipeline. Digested biological samples are mixed with the heavy and iRT standards and quantified. Here we present a generic protocol for the development of an MRM assay.
CMG-biotools, a free workbench for basic comparative microbial genomics.
Vesth, Tammi; Lagesen, Karin; Acar, Öncel; Ussery, David
2013-01-01
Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training.
Min, Li; Cheng, Jianbo; Zhao, Shengguo; Tian, He; Zhang, Yangdong; Li, Songli; Yang, Hongjian; Zheng, Nan; Wang, Jiaqi
2016-09-02
Heat stress (HS) has an enormous economic impact on the dairy industry. In recent years, many researchers have investigated changes in the gene expression and metabolomics profiles in dairy cows caused by HS. However, the proteomics profiles of heat-stressed dairy cows have not yet been completely elucidated. We compared plasma proteomics from HS-free and heat-stressed dairy cows using an iTRAQ labeling approach. After the depletion of high abundant proteins in the plasma, 1472 proteins were identified. Of these, 85 proteins were differentially abundant in cows exposed to HS relative to HS-free. Database searches combined with GO and KEGG pathway enrichment analyses revealed that many components of the complement and coagulation cascades were altered in heat-stressed cows compared with HS-free cows. Of these, many factors in the complement system (including complement components C1, C3, C5, C6, C7, C8, and C9, complement factor B, and factor H) were down-regulated by HS, while components of the coagulation system (including coagulation factors, vitamin K-dependent proteins, and fibrinogens) were up-regulated by HS. In conclusion, our results indicate that HS decreases plasma levels of complement system proteins, suggesting that immune function is impaired in dairy cows exposed to HS. Though many aspects of heat stress (HS) have been extensively researched, relatively little is known about the proteomics profile changes that occur during heat exposure. In this work, we employed a proteomics approach to investigate differential abundance of plasma proteins in HS-free and heat-stressed dairy cows. Database searches combined with GO and KEGG pathway enrichment analyses revealed that HS resulted in a decrease in complement components, suggesting that heat-stressed dairy cows have impaired immune function. In addition, through integrative analyses of proteomics and previous metabolomics, we showed enhanced glycolysis, lipid metabolic pathway shifts, and nitrogen repartitioning in dairy cows exposed to HS. Our findings expand our current knowledge on the effects of HS on plasma proteomics in dairy cows and offer a new perspective for future research. Copyright © 2016 Elsevier B.V. All rights reserved.
Birth of plant proteomics in India: a new horizon.
Narula, Kanika; Pandey, Aarti; Gayali, Saurabh; Chakraborty, Niranjan; Chakraborty, Subhra
2015-09-08
In the post-genomic era, proteomics is acknowledged as the next frontier for biological research. Although India has a long and distinguished tradition in protein research, the initiation of proteomics studies was a new horizon. Protein research witnessed enormous progress in protein separation, high-resolution refinements, biochemical identification of the proteins, protein-protein interaction, and structure-function analysis. Plant proteomics research, in India, began its journey on investigation of the proteome profiling, complexity analysis, protein trafficking, and biochemical modeling. The research article by Bhushan et al. in 2006 marked the birth of the plant proteomics research in India. Since then plant proteomics studies expanded progressively and are now being carried out in various institutions spread across the country. The compilation presented here seeks to trace the history of development in the area during the past decade based on publications till date. In this review, we emphasize on outcomes of the field providing prospects on proteomic pathway analyses. Finally, we discuss the connotation of strategies and the potential that would provide the framework of plant proteome research. The past decades have seen rapidly growing number of sequenced plant genomes and associated genomic resources. To keep pace with this increasing body of data, India is in the provisional phase of proteomics research to develop a comparative hub for plant proteomes and protein families, but it requires a strong impetus from intellectuals, entrepreneurs, and government agencies. Here, we aim to provide an overview of past, present and future of Indian plant proteomics, which would serve as an evaluation platform for those seeking to incorporate proteomics into their research programs. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.
Proteomics of the Human Placenta: Promises and Realities
Robinson, J.M.; Ackerman, W.E.; Kniss, D.A.; Takizawa, T.; Vandré, D.D.
2015-01-01
Proteomics is an area of study that sets as its ultimate goal the global analysis of all of the proteins expressed in a biological system of interest. However, technical limitations currently hamper proteome-wide analyses of complex systems. In a more practical sense, a desired outcome of proteomics research is the translation of large protein data sets into formats that provide meaningful information regarding clinical conditions (e.g., biomarkers to serve as diagnostic and/or prognostic indicators of disease). Herein, we discuss placental proteomics by describing existing studies, pointing out their strengths and weaknesses. In so doing, we strive to inform investigators interested in this area of research about the current gap between hyperbolic promises and realities. Additionally, we discuss the utility of proteomics in discovery-based research, particularly as regards the capacity to unearth novel insights into placental biology. Importantly, when considering under studied systems such as the human placenta and diseases associated with abnormalities in placental function, proteomics can serve as a robust ‘shortcut’ to obtaining information unlikely to be garnered using traditional approaches. PMID:18222537
[Techniques for rapid production of monoclonal antibodies for use with antibody technology].
Kamada, Haruhiko
2012-01-01
A monoclonal antibody (Mab), due to its specific binding ability to a target protein, can potentially be one of the most useful tools for the functional analysis of proteins in recent proteomics-based research. However, the production of Mab is a very time-consuming and laborious process (i.e., preparation of recombinant antigens, immunization of animals, preparation of hybridomas), making it the rate-limiting step in using Mabs in high-throughput proteomics research, which heavily relies on comprehensive and rapid methods. Therefore, there is a great demand for new methods to efficiently generate Mabs against a group of proteins identified by proteome analysis. Here, we describe a useful method called "Antibody proteomic technique" for the rapid generations of Mabs to pharmaceutical target, which were identified by proteomic analyses of disease samples (ex. tumor tissue, etc.). We also introduce another method to find profitable targets on vasculature, which is called "Vascular proteomic technique". Our results suggest that this method for the rapid generation of Mabs to proteins may be very useful in proteomics-based research as well as in clinical applications.
Coorssen, Jens R; Yergey, Alfred L
2015-12-03
Molecular mechanisms underlying health and disease function at least in part based on the flexibility and fine-tuning afforded by protein isoforms and post-translational modifications. The ability to effectively and consistently resolve these protein species or proteoforms, as well as assess quantitative changes is therefore central to proteomic analyses. Here we discuss the pros and cons of currently available and developing analytical techniques from the perspective of the full spectrum of available tools and their current applications, emphasizing the concept of fitness-for-purpose in experimental design based on consideration of sample size and complexity; this necessarily also addresses analytical reproducibility and its variance. Data quality is considered the primary criterion, and we thus emphasize that the standards of Analytical Chemistry must apply throughout any proteomic analysis.
Unexpected features of the dark proteome.
Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S; Buckley, Michael J; Tabor, Bruce; Signal, Beth; Gloss, Brian S; Hammang, Christopher J; Rost, Burkhard; Schafferhans, Andrea; O'Donoghue, Seán I
2015-12-29
We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology.
Accurate proteome-wide protein quantification from high-resolution 15N mass spectra
2011-01-01
In quantitative mass spectrometry-based proteomics, the metabolic incorporation of a single source of 15N-labeled nitrogen has many advantages over using stable isotope-labeled amino acids. However, the lack of a robust computational framework for analyzing the resulting spectra has impeded wide use of this approach. We have addressed this challenge by introducing a new computational methodology for analyzing 15N spectra in which quantification is integrated with identification. Application of this method to an Escherichia coli growth transition reveals significant improvement in quantification accuracy over previous methods. PMID:22182234
Application of proteomics in research on traditional Chinese medicine.
Suo, Tongchuan; Wang, Haixia; Li, Zheng
2016-09-01
Traditional Chinese medicine (TCM) is a widely used complementary alternative medicine approach. Although many aspects of its effectiveness have been approved clinically, rigorous scientific techniques are highly required to translate the promises from TCM into powerful modern therapies. In this respect, proteomics is useful because of its ability to unveil the underlying target proteins and/or protein biomarkers. In this review, we summarize the recent interplay between proteomics and research on TCM, ranging from exploration of the medicinal materials to the biological basis of TCM concepts, and from pathological studies to pharmacological investigations. We show that proteomic analyses provide preliminary biological evidence of the promises in TCM, and the integration of proteomics with other omics and bioinformatics offers a comprehensive methodology to address the complications of TCM. Expert commentary: Currently, only limited information can be obtained regarding TCM issues and thus more work is required to resolve the ambiguity. As such, more collaborations between proteomics and other techniques (other omics, network pharmacology, etc.) are essential for deciphering the underlying biological basis in TCM topics.
Senachak, Jittisak; Cheevadhanarak, Supapon; Hongsthong, Apiradee
2015-07-29
Spirulina (Arthrospira) platensis is the only cyanobacterium that in addition to being studied at the molecular level and subjected to gene manipulation, can also be mass cultivated in outdoor ponds for commercial use as a food supplement. Thus, encountering environmental changes, including temperature stresses, is common during the mass production of Spirulina. The use of cyanobacteria as an experimental platform, especially for photosynthetic gene manipulation in plants and bacteria, is becoming increasingly important. Understanding the mechanisms and protein-protein interaction networks that underlie low- and high-temperature responses is relevant to Spirulina mass production. To accomplish this goal, high-throughput techniques such as OMICs analyses are used. Thus, large datasets must be collected, managed and subjected to information extraction. Therefore, databases including (i) proteomic analysis and protein-protein interaction (PPI) data and (ii) domain/motif visualization tools are required for potential use in temperature response models for plant chloroplasts and photosynthetic bacteria. A web-based repository was developed including an embedded database, SpirPro, and tools for network visualization. Proteome data were analyzed integrated with protein-protein interactions and/or metabolic pathways from KEGG. The repository provides various information, ranging from raw data (2D-gel images) to associated results, such as data from interaction and/or pathway analyses. This integration allows in silico analyses of protein-protein interactions affected at the metabolic level and, particularly, analyses of interactions between and within the affected metabolic pathways under temperature stresses for comparative proteomic analysis. The developed tool, which is coded in HTML with CSS/JavaScript and depicted in Scalable Vector Graphics (SVG), is designed for interactive analysis and exploration of the constructed network. SpirPro is publicly available on the web at http://spirpro.sbi.kmutt.ac.th . SpirPro is an analysis platform containing an integrated proteome and PPI database that provides the most comprehensive data on this cyanobacterium at the systematic level. As an integrated database, SpirPro can be applied in various analyses, such as temperature stress response networking analysis in cyanobacterial models and interacting domain-domain analysis between proteins of interest.
Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O
2015-08-25
Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
Computational Omics Funding Opportunity | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) and the NVIDIA Foundation are pleased to announce funding opportunities in the fight against cancer. Each organization has launched a request for proposals (RFP) that will collectively fund up to $2 million to help to develop a new generation of data-intensive scientific tools to find new ways to treat cancer.
Systematic Proteomic Approach to Characterize the Impacts of ...
Chemical interactions have posed a big challenge in toxicity characterization and human health risk assessment of environmental mixtures. To characterize the impacts of chemical interactions on protein and cytotoxicity responses to environmental mixtures, we established a systems biology approach integrating proteomics, bioinformatics, statistics, and computational toxicology to measure expression or phosphorylation levels of 21 critical toxicity pathway regulators and 445 downstream proteins in human BEAS-28 cells treated with 4 concentrations of nickel, 2 concentrations each of cadmium and chromium, as well as 12 defined binary and 8 defined ternary mixtures of these metals in vitro. Multivariate statistical analysis and mathematical modeling of the metal-mediated proteomic response patterns showed a high correlation between changes in protein expression or phosphorylation and cellular toxic responses to both individual metals and metal mixtures. Of the identified correlated proteins, only a small set of proteins including HIF-1a is likely to be responsible for selective cytotoxic responses to different metals and metals mixtures. Furthermore, support vector machine learning was utilized to computationally predict protein responses to uncharacterized metal mixtures using experimentally generated protein response profiles corresponding to known metal mixtures. This study provides a novel proteomic approach for characterization and prediction of toxicities of
Mapping the Small Molecule Interactome by Mass Spectrometry.
Flaxman, Hope A; Woo, Christina M
2018-01-16
Mapping small molecule interactions throughout the proteome provides the critical structural basis for functional analysis of their impact on biochemistry. However, translation of mass spectrometry-based proteomics methods to directly profile the interaction between a small molecule and the whole proteome is challenging because of the substoichiometric nature of many interactions, the diversity of covalent and noncovalent interactions involved, and the subsequent computational complexity associated with their spectral assignment. Recent advances in chemical proteomics have begun fill this gap to provide a structural basis for the breadth of small molecule-protein interactions in the whole proteome. Innovations enabling direct characterization of the small molecule interactome include faster, more sensitive instrumentation coupled to chemical conjugation, enrichment, and labeling methods that facilitate detection and assignment. These methods have started to measure molecular interaction hotspots due to inherent differences in local amino acid reactivity and binding affinity throughout the proteome. Measurement of the small molecule interactome is producing structural insights and methods for probing and engineering protein biochemistry. Direct structural characterization of the small molecule interactome is a rapidly emerging area pushing new frontiers in biochemistry at the interface of small molecules and the proteome.
MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data
Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko
2007-01-01
Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation, 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents significant advance and could be of great interest to the proteomics community. PMID:17567892
USDA-ARS?s Scientific Manuscript database
Proteomic analyses were done on 2 chemosensory appendages of the lone star tick, Amblyomma americanum. Proteins in the fore tarsi, which contain the olfactory Haller's organ, and in the palps, that include gustatory sensilla, were compared with proteins in the third tarsi. Also, male and female tick...
Van Oudenhove, Laurence; Devreese, Bart
2013-06-01
Proteomics has evolved substantially since its early days, some 20 years ago. In this mini-review, we aim to provide an overview of general methodologies and more recent developments in mass spectrometric approaches used for relative and absolute quantitation of proteins. Enhancement of sensitivity of the mass spectrometers as well as improved sample preparation and protein fractionation methods are resulting in a more comprehensive analysis of proteomes. We also document some upcoming trends for quantitative proteomics such as the use of label-free quantification methods. Hopefully, microbiologists will continue to explore proteomics as a tool in their research to understand the adaptation of microorganisms to their ever changing environment. We encourage them to incorporate some of the described new developments in mass spectrometry to facilitate their analyses and improve the general knowledge of the fascinating world of microorganisms.
Holmes, Christina; Carlson, Siobhan M.; McDonald, Fiona; Jones, Mavis; Graham, Janice
2016-01-01
Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbates this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics. PMID:27134568
Design and analysis issues in quantitative proteomics studies.
Karp, Natasha A; Lilley, Kathryn S
2007-09-01
Quantitative proteomics is the comparison of distinct proteomes which enables the identification of protein species which exhibit changes in expression or post-translational state in response to a given stimulus. Many different quantitative techniques are being utilized and generate large datasets. Independent of the technique used, these large datasets need robust data analysis to ensure valid conclusions are drawn from such studies. Approaches to address the problems that arise with large datasets are discussed to give insight into the types of statistical analyses of data appropriate for the various experimental strategies that can be employed by quantitative proteomic studies. This review also highlights the importance of employing a robust experimental design and highlights various issues surrounding the design of experiments. The concepts and examples discussed within will show how robust design and analysis will lead to confident results that will ensure quantitative proteomics delivers.
Drought-Responsive Mechanisms in Plant Leaves Revealed by Proteomics.
Wang, Xiaoli; Cai, Xiaofeng; Xu, Chenxi; Wang, Quanhua; Dai, Shaojun
2016-10-18
Plant drought tolerance is a complex trait that requires a global view to understand its underlying mechanism. The proteomic aspects of plant drought response have been extensively investigated in model plants, crops and wood plants. In this review, we summarize recent proteomic studies on drought response in leaves to reveal the common and specialized drought-responsive mechanisms in different plants. Although drought-responsive proteins exhibit various patterns depending on plant species, genotypes and stress intensity, proteomic analyses show that dominant changes occurred in sensing and signal transduction, reactive oxygen species scavenging, osmotic regulation, gene expression, protein synthesis/turnover, cell structure modulation, as well as carbohydrate and energy metabolism. In combination with physiological and molecular results, proteomic studies in leaves have helped to discover some potential proteins and/or metabolic pathways for drought tolerance. These findings provide new clues for understanding the molecular basis of plant drought tolerance.
Drought-Responsive Mechanisms in Plant Leaves Revealed by Proteomics
Wang, Xiaoli; Cai, Xiaofeng; Xu, Chenxi; Wang, Quanhua; Dai, Shaojun
2016-01-01
Plant drought tolerance is a complex trait that requires a global view to understand its underlying mechanism. The proteomic aspects of plant drought response have been extensively investigated in model plants, crops and wood plants. In this review, we summarize recent proteomic studies on drought response in leaves to reveal the common and specialized drought-responsive mechanisms in different plants. Although drought-responsive proteins exhibit various patterns depending on plant species, genotypes and stress intensity, proteomic analyses show that dominant changes occurred in sensing and signal transduction, reactive oxygen species scavenging, osmotic regulation, gene expression, protein synthesis/turnover, cell structure modulation, as well as carbohydrate and energy metabolism. In combination with physiological and molecular results, proteomic studies in leaves have helped to discover some potential proteins and/or metabolic pathways for drought tolerance. These findings provide new clues for understanding the molecular basis of plant drought tolerance. PMID:27763546
Holmes, Christina; Carlson, Siobhan M; McDonald, Fiona; Jones, Mavis; Graham, Janice
2016-01-02
Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbates this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics.
Zhu, Ying; Clair, Geremy; Chrisler, William; Shen, Yufeng; Zhao, Rui; Shukla, Anil; Moore, Ronald; Misra, Ravi; Pryhuber, Gloria; Smith, Richard; Ansong, Charles; Kelly, Ryan T
2018-05-24
We report on the quantitative proteomic analysis of single mammalian cells. Fluorescence-activated cell sorting was employed to deposit cells into a newly developed nanodroplet sample processing chip, after which samples were analysed by ultrasensitive nanoLC-MS. An average of ~670 protein groups were confidently identified from single HeLa cells, which is a far greater level of proteome coverage for single cells than has been previously reported. We demonstrate that the single cell proteomics platform can be used to differentiate cell types from enzyme-dissociated human lung primary cells and identify specific protein markers for epithelial and mesenchymal cells. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Yu, Yanbao; Sikorski, Patricia; Smith, Madeline; Bowman-Gholston, Cynthia; Cacciabeve, Nicolas; Nelson, Karen E.; Pieper, Rembert
2017-01-01
Inflammation in the urinary tract results in a urinary proteome characterized by a high dynamic range of protein concentrations and high variability in protein content. This proteome encompasses plasma proteins not resorbed by renal tubular uptake, renal secretion products, proteins of immune cells and erythrocytes derived from trans-urothelial migration and vascular leakage, respectively, and exfoliating urothelial and squamous epithelial cells. We examined how such proteins partition into soluble urine (SU) and urinary pellet (UP) fractions by analyzing 33 urine specimens 12 of which were associated with a urinary tract infection (UTI). Using mass spectrometry-based metaproteomic approaches, we identified 5,327 non-redundant human proteins, 2,638 and 4,379 of which were associated with SU and UP fractions, respectively, and 1,206 non-redundant protein orthology groups derived from pathogenic and commensal organisms of the urogenital tract. Differences between the SU and UP proteomes were influenced by local inflammation, supported by respective comparisons with 12 healthy control urine proteomes. Clustering analyses showed that SU and UP fractions had proteomic signatures discerning UTIs, vascular injury, and epithelial cell exfoliation from the control group to varying degrees. Cases of UTI revealed clusters of proteins produced by activated neutrophils. Network analysis supported the central role of neutrophil effector proteins in the defense against invading pathogens associated with subsequent coagulation and wound repair processes. Our study expands the existing knowledge of the urinary proteome under perturbed conditions, and should be useful as reference dataset in the search of biomarkers. PMID:28042331
Proteogenomic characterization of human colon and rectal cancer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Bing; Wang, Jing; Wang, Xiaojing
2014-09-18
We analyzed proteomes of colon and rectal tumors previously characterized by the Cancer Genome Atlas (TCGA) and performed integrated proteogenomic analyses. Protein sequence variants encoded by somatic genomic variations displayed reduced expression compared to protein variants encoded by germline variations. mRNA transcript abundance did not reliably predict protein expression differences between tumors. Proteomics identified five protein expression subtypes, two of which were associated with the TCGA "MSI/CIMP" transcriptional subtype, but had distinct mutation and methylation patterns and associated with different clinical outcomes. Although CNAs showed strong cis- and trans-effects on mRNA expression, relatively few of these extend to the proteinmore » level. Thus, proteomics data enabled prioritization of candidate driver genes. Our analyses identified HNF4A, a novel candidate driver gene in tumors with chromosome 20q amplifications. Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords novel insights into cancer biology.« less
Liquid Chromatography Mass Spectrometry-Based Proteomics: Biological and Technological Aspects
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karpievitch, Yuliya V.; Polpitiya, Ashoka D.; Anderson, Gordon A.
2010-12-01
Mass spectrometry-based proteomics has become the tool of choice for identifying and quantifying the proteome of an organism. Though recent years have seen a tremendous improvement in instrument performance and the computational tools used, significant challenges remain, and there are many opportunities for statisticians to make important contributions. In the most widely used "bottom-up" approach to proteomics, complex mixtures of proteins are first subjected to enzymatic cleavage, the resulting peptide products are separated based on chemical or physical properties and analyzed using a mass spectrometer. The two fundamental challenges in the analysis of bottom-up MS-based proteomics are: (1) Identifying themore » proteins that are present in a sample, and (2) Quantifying the abundance levels of the identified proteins. Both of these challenges require knowledge of the biological and technological context that gives rise to observed data, as well as the application of sound statistical principles for estimation and inference. We present an overview of bottom-up proteomics and outline the key statistical issues that arise in protein identification and quantification.« less
A tutorial for software development in quantitative proteomics using PSI standard formats☆
Gonzalez-Galarza, Faviel F.; Qi, Da; Fan, Jun; Bessant, Conrad; Jones, Andrew R.
2014-01-01
The Human Proteome Organisation — Proteomics Standards Initiative (HUPO-PSI) has been working for ten years on the development of standardised formats that facilitate data sharing and public database deposition. In this article, we review three HUPO-PSI data standards — mzML, mzIdentML and mzQuantML, which can be used to design a complete quantitative analysis pipeline in mass spectrometry (MS)-based proteomics. In this tutorial, we briefly describe the content of each data model, sufficient for bioinformaticians to devise proteomics software. We also provide guidance on the use of recently released application programming interfaces (APIs) developed in Java for each of these standards, which makes it straightforward to read and write files of any size. We have produced a set of example Java classes and a basic graphical user interface to demonstrate how to use the most important parts of the PSI standards, available from http://code.google.com/p/psi-standard-formats-tutorial. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23584085
Seliger, Barbara; Dressler, Sven P.; Wang, Ena; Kellner, Roland; Recktenwald, Christian V.; Lottspeich, Friedrich; Marincola, Francesco M.; Baumgärtner, Maja; Atkins, Derek; Lichtenfels, Rudolf
2012-01-01
Results obtained from expression profilings of renal cell carcinoma using different “ome”-based approaches and comprehensive data analysis demonstrated that proteome-based technologies and cDNA microarray analyses complement each other during the discovery phase for disease-related candidate biomarkers. The integration of the respective data revealed the uniqueness and complementarities of the different technologies. While comparative cDNA microarray analyses though restricted to upregulated targets largely revealed genes involved in controlling gene/protein expression (19%) and signal transduction processes (13%), proteomics/PROTEOMEX-defined candidate biomarkers include enzymes of the cellular metabolism (36%), transport proteins (12%) and cell motility/structural molecules (10%). Candidate biomarkers defined by proteomics and PROTEOMEX are frequently shared, whereas the sharing rate between cDNA microarray and proteome-based profilings is limited. Putative candidate biomarkers provide insights into their cellular (dys)function and their diagnostic/prognostic value but still warrant further validation in larger patient numbers. Based on the fact that merely 3 candidate biomarkers were shared by all applied technologies, namely annexin A4, tubulin alpha-1A chain and ubiquitin carboxyl-terminal hydrolase L1 the analysis at a single hierarchical level of biological regulation seems to provide only limited results thus emphasizing the importance and benefit of performing rather combinatorial screenings which can complement the standard clinical predictors. PMID:19235166
The Hemolymph Proteome of Fed and Starved Drosophila Larvae
Goetze, Sandra; Ahrens, Christian H.; Omasits, Ulrich; Marty, Florian; Simigdala, Nikiana; Meyer, Imke; Wollscheid, Bernd; Brunner, Erich; Hafen, Ernst; Lehner, Christian F.
2013-01-01
The co-operation of specialized organ systems in complex multicellular organisms depends on effective chemical communication. Thus, body fluids (like blood, lymph or intraspinal fluid) contain myriads of signaling mediators apart from metabolites. Moreover, these fluids are also of crucial importance for immune and wound responses. Compositional analyses of human body fluids are therefore of paramount diagnostic importance. Further improving their comprehensiveness should increase our understanding of inter-organ communication. In arthropods, which have trachea for gas exchange and an open circulatory system, the single dominating interstitial fluid is the hemolymph. Accordingly, a detailed analysis of hemolymph composition should provide an especially comprehensive picture of chemical communication and defense in animals. Therefore we used an extensive protein fractionation workflow in combination with a discovery-driven proteomic approach to map out the detectable protein composition of hemolymph isolated from Drosophila larvae. Combined mass spectrometric analysis revealed more than 700 proteins extending far beyond the previously known Drosophila hemolymph proteome. Moreover, by comparing hemolymph isolated from either fed or starved larvae, we provide initial provisional insights concerning compositional changes in response to nutritional state. Storage proteins in particular were observed to be strongly reduced by starvation. Our hemolymph proteome catalog provides a rich basis for data mining, as exemplified by our identification of potential novel cytokines, as well as for future quantitative analyses by targeted proteomics. PMID:23840627
The hemolymph proteome of fed and starved Drosophila larvae.
Handke, Björn; Poernbacher, Ingrid; Goetze, Sandra; Ahrens, Christian H; Omasits, Ulrich; Marty, Florian; Simigdala, Nikiana; Meyer, Imke; Wollscheid, Bernd; Brunner, Erich; Hafen, Ernst; Lehner, Christian F
2013-01-01
The co-operation of specialized organ systems in complex multicellular organisms depends on effective chemical communication. Thus, body fluids (like blood, lymph or intraspinal fluid) contain myriads of signaling mediators apart from metabolites. Moreover, these fluids are also of crucial importance for immune and wound responses. Compositional analyses of human body fluids are therefore of paramount diagnostic importance. Further improving their comprehensiveness should increase our understanding of inter-organ communication. In arthropods, which have trachea for gas exchange and an open circulatory system, the single dominating interstitial fluid is the hemolymph. Accordingly, a detailed analysis of hemolymph composition should provide an especially comprehensive picture of chemical communication and defense in animals. Therefore we used an extensive protein fractionation workflow in combination with a discovery-driven proteomic approach to map out the detectable protein composition of hemolymph isolated from Drosophila larvae. Combined mass spectrometric analysis revealed more than 700 proteins extending far beyond the previously known Drosophila hemolymph proteome. Moreover, by comparing hemolymph isolated from either fed or starved larvae, we provide initial provisional insights concerning compositional changes in response to nutritional state. Storage proteins in particular were observed to be strongly reduced by starvation. Our hemolymph proteome catalog provides a rich basis for data mining, as exemplified by our identification of potential novel cytokines, as well as for future quantitative analyses by targeted proteomics.
Using Public Data for Comparative Proteome Analysis in Precision Medicine Programs.
Hughes, Christopher S; Morin, Gregg B
2018-03-01
Maximizing the clinical utility of information obtained in longitudinal precision medicine programs would benefit from robust comparative analyses to known information to assess biological features of patient material toward identifying the underlying features driving their disease phenotype. Herein, the potential for utilizing publically deposited mass-spectrometry-based proteomics data to perform inter-study comparisons of cell-line or tumor-tissue materials is investigated. To investigate the robustness of comparison between MS-based proteomics studies carried out with different methodologies, deposited data representative of label-free (MS1) and isobaric tagging (MS2 and MS3 quantification) are utilized. In-depth quantitative proteomics data acquired from analysis of ovarian cancer cell lines revealed the robust recapitulation of observable gene expression dynamics between individual studies carried out using significantly different methodologies. The observed signatures enable robust inter-study clustering of cell line samples. In addition, the ability to classify and cluster tumor samples based on observed gene expression trends when using a single patient sample is established. With this analysis, relevant gene expression dynamics are obtained from a single patient tumor, in the context of a precision medicine analysis, by leveraging a large cohort of repository data as a comparator. Together, these data establish the potential for state-of-the-art MS-based proteomics data to serve as resources for robust comparative analyses in precision medicine applications. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Characterization, design, and function of the mitochondrial proteome: from organs to organisms.
Lotz, Christopher; Lin, Amanda J; Black, Caitlin M; Zhang, Jun; Lau, Edward; Deng, Ning; Wang, Yueju; Zong, Nobel C; Choi, Jeong H; Xu, Tao; Liem, David A; Korge, Paavo; Weiss, James N; Hermjakob, Henning; Yates, John R; Apweiler, Rolf; Ping, Peipei
2014-02-07
Mitochondria are a common energy source for organs and organisms; their diverse functions are specialized according to the unique phenotypes of their hosting environment. Perturbation of mitochondrial homeostasis accompanies significant pathological phenotypes. However, the connections between mitochondrial proteome properties and function remain to be experimentally established on a systematic level. This uncertainty impedes the contextualization and translation of proteomic data to the molecular derivations of mitochondrial diseases. We present a collection of mitochondrial features and functions from four model systems, including two cardiac mitochondrial proteomes from distinct genomes (human and mouse), two unique organ mitochondrial proteomes from identical genetic codons (mouse heart and mouse liver), as well as a relevant metazoan out-group (drosophila). The data, composed of mitochondrial protein abundance and their biochemical activities, capture the core functionalities of these mitochondria. This investigation allowed us to redefine the core mitochondrial proteome from organs and organisms, as well as the relevant contributions from genetic information and hosting milieu. Our study has identified significant enrichment of disease-associated genes and their products. Furthermore, correlational analyses suggest that mitochondrial proteome design is primarily driven by cellular environment. Taken together, these results connect proteome feature with mitochondrial function, providing a prospective resource for mitochondrial pathophysiology and developing novel therapeutic targets in medicine.
Benevenuto, Rafael Fonseca; Agapito-Tenfen, Sarah Zanon; Vilperte, Vinicius; Wikmark, Odd-Gunnar; van Rensburg, Peet Jansen; Nodari, Rubens Onofre
2017-01-01
Some genetically modified (GM) plants have transgenes that confer tolerance to abiotic stressors. Meanwhile, other transgenes may interact with abiotic stressors, causing pleiotropic effects that will affect the plant physiology. Thus, physiological alteration might have an impact on the product safety. However, routine risk assessment (RA) analyses do not evaluate the response of GM plants exposed to different environmental conditions. Therefore, we here present a proteome profile of herbicide-tolerant maize, including the levels of phytohormones and related compounds, compared to its near-isogenic non-GM variety under drought and herbicide stresses. Twenty differentially abundant proteins were detected between GM and non-GM hybrids under different water deficiency conditions and herbicide sprays. Pathway enrichment analysis showed that most of these proteins are assigned to energetic/carbohydrate metabolic processes. Among phytohormones and related compounds, different levels of ABA, CA, JA, MeJA and SA were detected in the maize varieties and stress conditions analysed. In pathway and proteome analyses, environment was found to be the major source of variation followed by the genetic transformation factor. Nonetheless, differences were detected in the levels of JA, MeJA and CA and in the abundance of 11 proteins when comparing the GM plant and its non-GM near-isogenic variety under the same environmental conditions. Thus, these findings do support molecular studies in GM plants Risk Assessment analyses. PMID:28245233
Data Portal | Office of Cancer Clinical Proteomics Research
The CPTAC Data Portal is a centralized repository for the public dissemination of proteomic sequence datasets collected by CPTAC, along with corresponding genomic sequence datasets. In addition, available are analyses of CPTAC's raw mass spectrometry-based data files (mapping of spectra to peptide sequences and protein identification) by individual investigators from CPTAC and by a Common Data Analysis Pipeline.
USDA-ARS?s Scientific Manuscript database
The aim of this work was to study the effects of Fe and Mn deficiencies on the xylem sap proteome of tomato using a shotgun proteomic approach, with the final goal of elucidating plant response mechanisms to these stresses. This approach yielded 643 proteins reliably identified and quantified with 7...
Combining Capillary Electrophoresis with Mass Spectrometry for Applications in Proteomics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Simpson, David C.; Smith, Richard D.
2005-04-01
Throughout the field of global proteomics, ranging from simple organism studies to human medical applications, the high sample complexity creates demands for improved separations and analysis techniques. Furthermore, with increased organism complexity, the correlation between proteome and genome becomes less certain due to extensive mRNA processing prior to translation. In this way, the same DNA sequence can potentially code for regions in a number of distinct proteins; quantitative differences in expression (or abundance) between these often-related species are of significant interest. Well-established proteomics techniques, which use genomic information to identify peptides that originate from protease digestion, often cannot easily distinguishmore » between such gene products; intact protein-level analyses are required to complete the picture, particularly for identifying post-translational modifications. While chromatographic techniques are currently better suited to peptide analysis, capillary electrophoresis (CE) in combination with mass spectrometry (MS) may become important for intact protein analysis. This review focuses on CE/MS instrumentation and techniques showing promise for such applications, highlighting those with greatest potential. Reference will also be made to developments relevant to peptide-level analyses for use in time- or sample-limited situations.« less
NASA Astrophysics Data System (ADS)
Santra, Tapesh; Delatola, Eleni Ioanna
2016-07-01
Presence of considerable noise and missing data points make analysis of mass-spectrometry (MS) based proteomic data a challenging task. The missing values in MS data are caused by the inability of MS machines to reliably detect proteins whose abundances fall below the detection limit. We developed a Bayesian algorithm that exploits this knowledge and uses missing data points as a complementary source of information to the observed protein intensities in order to find differentially expressed proteins by analysing MS based proteomic data. We compared its accuracy with many other methods using several simulated datasets. It consistently outperformed other methods. We then used it to analyse proteomic screens of a breast cancer (BC) patient cohort. It revealed large differences between the proteomic landscapes of triple negative and Luminal A, which are the most and least aggressive types of BC. Unexpectedly, majority of these differences could be attributed to the direct transcriptional activity of only seven transcription factors some of which are known to be inactive in triple negative BC. We also identified two new proteins which significantly correlated with the survival of BC patients, and therefore may have potential diagnostic/prognostic values.
Salivary biomarker development using genomic, proteomic and metabolomic approaches
2012-01-01
The use of saliva as a diagnostic sample provides a non-invasive, cost-efficient method of sample collection for disease screening without the need for highly trained professionals. Saliva collection is far more practical and safe compared with invasive methods of sample collection, because of the infection risk from contaminated needles during, for example, blood sampling. Furthermore, the use of saliva could increase the availability of accurate diagnostics for remote and impoverished regions. However, the development of salivary diagnostics has required technical innovation to allow stabilization and detection of analytes in the complex molecular mixture that is saliva. The recent development of cost-effective room temperature analyte stabilization methods, nucleic acid pre-amplification techniques and direct saliva transcriptomic analysis have allowed accurate detection and quantification of transcripts found in saliva. Novel protein stabilization methods have also facilitated improved proteomic analyses. Although candidate biomarkers have been discovered using epigenetic, transcriptomic, proteomic and metabolomic approaches, transcriptomic analyses have so far achieved the most progress in terms of sensitivity and specificity, and progress towards clinical implementation. Here, we review recent developments in salivary diagnostics that have been accomplished using genomic, transcriptomic, proteomic and metabolomic approaches. PMID:23114182
Burnum-Johnson, Kristin E.; Nie, Song; Casey, Cameron P.; Monroe, Matthew E.; Orton, Daniel J.; Ibrahim, Yehia M.; Gritsenko, Marina A.; Clauss, Therese R. W.; Shukla, Anil K.; Moore, Ronald J.; Purvine, Samuel O.; Shi, Tujin; Qian, Weijun; Liu, Tao; Baker, Erin S.; Smith, Richard D.
2016-01-01
Current proteomic approaches include both broad discovery measurements and quantitative targeted analyses. In many cases, discovery measurements are initially used to identify potentially important proteins (e.g. candidate biomarkers) and then targeted studies are employed to quantify a limited number of selected proteins. Both approaches, however, suffer from limitations. Discovery measurements aim to sample the whole proteome but have lower sensitivity, accuracy, and quantitation precision than targeted approaches, whereas targeted measurements are significantly more sensitive but only sample a limited portion of the proteome. Herein, we describe a new approach that performs both discovery and targeted monitoring (DTM) in a single analysis by combining liquid chromatography, ion mobility spectrometry and mass spectrometry (LC-IMS-MS). In DTM, heavy labeled target peptides are spiked into tryptic digests and both the labeled and unlabeled peptides are detected using LC-IMS-MS instrumentation. Compared with the broad LC-MS discovery measurements, DTM yields greater peptide/protein coverage and detects lower abundance species. DTM also achieved detection limits similar to selected reaction monitoring (SRM) indicating its potential for combined high quality discovery and targeted analyses, which is a significant step toward the convergence of discovery and targeted approaches. PMID:27670688
Welle, Kevin A.; Zhang, Tian; Hryhorenko, Jennifer R.; Shen, Shichen; Qu, Jun; Ghaemmaghami, Sina
2016-01-01
Recent advances in mass spectrometry have enabled system-wide analyses of protein turnover. By globally quantifying the kinetics of protein clearance and synthesis, these methodologies can provide important insights into the regulation of the proteome under varying cellular and environmental conditions. To facilitate such analyses, we have employed a methodology that combines metabolic isotopic labeling (Stable Isotope Labeling in Cell Culture - SILAC) with isobaric tagging (Tandem Mass Tags - TMT) for analysis of multiplexed samples. The fractional labeling of multiple time-points can be measured in a single mass spectrometry run, providing temporally resolved measurements of protein turnover kinetics. To demonstrate the feasibility of the approach, we simultaneously measured the kinetics of protein clearance and accumulation for more than 3000 proteins in dividing and quiescent human fibroblasts and verified the accuracy of the measurements by comparison to established non-multiplexed approaches. The results indicate that upon reaching quiescence, fibroblasts compensate for lack of cellular growth by globally downregulating protein synthesis and upregulating protein degradation. The described methodology significantly reduces the cost and complexity of temporally-resolved dynamic proteomic experiments and improves the precision of proteome-wide turnover data. PMID:27765818
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis.
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V Lila; Karagouni, Amalia D; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality.
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V. Lila; Karagouni, Amalia D.; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality. PMID:22267904
Cloud Computing for Protein-Ligand Binding Site Comparison
2013-01-01
The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery. PMID:23762824
Cloud computing for protein-ligand binding site comparison.
Hung, Che-Lun; Hua, Guan-Jie
2013-01-01
The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.
Maryáš, Josef; Faktor, Jakub; Dvořáková, Monika; Struhárová, Iva; Grell, Peter; Bouchal, Pavel
2014-03-01
Metastases are responsible for most of the cases of death in patients with solid tumors. There is thus an urgent clinical need of better understanding the exact molecular mechanisms and finding novel therapeutics targets and biomarkers of metastatic disease of various tumors. Metastases are formed in a complicated biological process called metastatic cascade. Up to now, proteomics has enabled the identification of number of metastasis-associated proteins and potential biomarkers in cancer tissues, microdissected cells, model systems, and secretomes. Expression profiles and biological role of key proteins were confirmed in verification and functional experiments. This communication reviews these observations and analyses the methodological aspects of the proteomics approaches used. Moreover, it reviews contribution of current proteomics in the field of functional characterization and interactome analysis of proteins involved in various events in metastatic cascade. It is evident that ongoing technical progress will further increase proteome coverage and sample capacity of proteomics technologies, giving complex answers to clinical and functional questions asked. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Nitric oxide-based protein modification: formation and site-specificity of protein S-nitrosylation
Kovacs, Izabella; Lindermayr, Christian
2013-01-01
Nitric oxide (NO) is a reactive free radical with pleiotropic functions that participates in diverse biological processes in plants, such as germination, root development, stomatal closing, abiotic stress, and defense responses. It acts mainly through redox-based modification of cysteine residue(s) of target proteins, called protein S-nitrosylation.In this way NO regulates numerous cellular functions and signaling events in plants. Identification of S-nitrosylated substrates and their exact target cysteine residue(s) is very important to reveal the molecular mechanisms and regulatory roles of S-nitrosylation. In addition to the necessity of protein–protein interaction for trans-nitrosylation and denitrosylation reactions, the cellular redox environment and cysteine thiol micro-environment have been proposed important factors for the specificity of protein S-nitrosylation. Several methods have recently been developed for the proteomic identification of target proteins. However, the specificity of NO-based cysteine modification is still less defined. In this review, we discuss formation and specificity of S-nitrosylation. Special focus will be on potential S-nitrosylation motifs, site-specific proteomic analyses, computational predictions using different algorithms, and on structural analysis of cysteine S-nitrosylation. PMID:23717319
Enhancing Bottom-up and Top-down Proteomic Measurements with Ion Mobility Separations
Baker, Erin Shammel; Burnum-Johnson, Kristin E.; Ibrahim, Yehia M.; ...
2015-07-03
Proteomic measurements with greater throughput, sensitivity and additional structural information enhance the in-depth characterization of complex mixtures and targeted studies with additional information and higher confidence. While liquid chromatography separation coupled with mass spectrometry (LC-MS) measurements have provided information on thousands of proteins in different sample types, the additional of another rapid separation stage providing structural information has many benefits for analyses. Technical advances in ion funnels and multiplexing have enabled ion mobility separations to be easily and effectively coupled with LC-MS proteomics to enhance the information content of measurements. Finally, herein, we report on applications illustrating increased sensitivity, throughput,more » and structural information by utilizing IMS-MS and LC-IMS-MS measurements for both bottom-up and top-down proteomics measurements.« less
Gray, Michael W
2015-08-18
Comparative studies of the mitochondrial proteome have identified a conserved core of proteins descended from the α-proteobacterial endosymbiont that gave rise to the mitochondrion and was the source of the mitochondrial genome in contemporary eukaryotes. A surprising result of phylogenetic analyses is the relatively small proportion (10-20%) of the mitochondrial proteome displaying a clear α-proteobacterial ancestry. A large fraction of mitochondrial proteins typically has detectable homologs only in other eukaryotes and is presumed to represent proteins that emerged specifically within eukaryotes. A further significant fraction of the mitochondrial proteome consists of proteins with homologs in prokaryotes, but without a robust phylogenetic signal affiliating them with specific prokaryotic lineages. The presumptive evolutionary source of these proteins is quite different in contending models of mitochondrial origin.
Marx, Harald; Lemeer, Simone; Schliep, Jan Erik; Matheron, Lucrece; Mohammed, Shabaz; Cox, Jürgen; Mann, Matthias; Heck, Albert J R; Kuster, Bernhard
2013-06-01
We present a peptide library and data resource of >100,000 synthetic, unmodified peptides and their phosphorylated counterparts with known sequences and phosphorylation sites. Analysis of the library by mass spectrometry yielded a data set that we used to evaluate the merits of different search engines (Mascot and Andromeda) and fragmentation methods (beam-type collision-induced dissociation (HCD) and electron transfer dissociation (ETD)) for peptide identification. We also compared the sensitivities and accuracies of phosphorylation-site localization tools (Mascot Delta Score, PTM score and phosphoRS), and we characterized the chromatographic behavior of peptides in the library. We found that HCD identified more peptides and phosphopeptides than did ETD, that phosphopeptides generally eluted later from reversed-phase columns and were easier to identify than unmodified peptides and that current computational tools for proteomics can still be substantially improved. These peptides and spectra will facilitate the development, evaluation and improvement of experimental and computational proteomic strategies, such as separation techniques and the prediction of retention times and fragmentation patterns.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakayasu, Ernesto S.; Nicora, Carrie D.; Sims, Amy C.
2016-05-03
ABSTRACT Integrative multi-omics analyses can empower more effective investigation and complete understanding of complex biological systems. Despite recent advances in a range of omics analyses, multi-omic measurements of the same sample are still challenging and current methods have not been well evaluated in terms of reproducibility and broad applicability. Here we adapted a solvent-based method, widely applied for extracting lipids and metabolites, to add proteomics to mass spectrometry-based multi-omics measurements. Themetabolite,protein, andlipidextraction (MPLEx) protocol proved to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues. To illustrate the utility of thismore » protocol, an integrative multi-omics analysis was performed using a lung epithelial cell line infected with Middle East respiratory syndrome coronavirus, which showed the impact of this virus on the host glycolytic pathway and also suggested a role for lipids during infection. The MPLEx method is a simple, fast, and robust protocol that can be applied for integrative multi-omic measurements from diverse sample types (e.g., environmental,in vitro, and clinical). IMPORTANCEIn systems biology studies, the integration of multiple omics measurements (i.e., genomics, transcriptomics, proteomics, metabolomics, and lipidomics) has been shown to provide a more complete and informative view of biological pathways. Thus, the prospect of extracting different types of molecules (e.g., DNAs, RNAs, proteins, and metabolites) and performing multiple omics measurements on single samples is very attractive, but such studies are challenging due to the fact that the extraction conditions differ according to the molecule type. Here, we adapted an organic solvent-based extraction method that demonstrated broad applicability and robustness, which enabled comprehensive proteomics, metabolomics, and lipidomics analyses from the same sample.« less
Proteomics of Skeletal Muscle: Focus on Insulin Resistance and Exercise Biology
Deshmukh, Atul S.
2016-01-01
Skeletal muscle is the largest tissue in the human body and plays an important role in locomotion and whole body metabolism. It accounts for ~80% of insulin stimulated glucose disposal. Skeletal muscle insulin resistance, a primary feature of Type 2 diabetes, is caused by a decreased ability of muscle to respond to circulating insulin. Physical exercise improves insulin sensitivity and whole body metabolism and remains one of the most promising interventions for the prevention of Type 2 diabetes. Insulin resistance and exercise adaptations in skeletal muscle might be a cause, or consequence, of altered protein expressions profiles and/or their posttranslational modifications (PTMs). Mass spectrometry (MS)-based proteomics offer enormous promise for investigating the molecular mechanisms underlying skeletal muscle insulin resistance and exercise-induced adaptation; however, skeletal muscle proteomics are challenging. This review describes the technical limitations of skeletal muscle proteomics as well as emerging developments in proteomics workflow with respect to samples preparation, liquid chromatography (LC), MS and computational analysis. These technologies have not yet been fully exploited in the field of skeletal muscle proteomics. Future studies that involve state-of-the-art proteomics technology will broaden our understanding of exercise-induced adaptations as well as molecular pathogenesis of insulin resistance. This could lead to the identification of new therapeutic targets. PMID:28248217
A Comprehensive, Open-source Platform for Mass Spectrometry-based Glycoproteomics Data Analysis.
Liu, Gang; Cheng, Kai; Lo, Chi Y; Li, Jun; Qu, Jun; Neelamegham, Sriram
2017-11-01
Glycosylation is among the most abundant and diverse protein post-translational modifications (PTMs) identified to date. The structural analysis of this PTM is challenging because of the diverse monosaccharides which are not conserved among organisms, the branched nature of glycans, their isomeric structures, and heterogeneity in the glycan distribution at a given site. Glycoproteomics experiments have adopted the traditional high-throughput LC-MS n proteomics workflow to analyze site-specific glycosylation. However, comprehensive computational platforms for data analyses are scarce. To address this limitation, we present a comprehensive, open-source, modular software for glycoproteomics data analysis called GlycoPAT (GlycoProteomics Analysis Toolbox; freely available from www.VirtualGlycome.org/glycopat). The program includes three major advances: (1) "SmallGlyPep," a minimal linear representation of glycopeptides for MS n data analysis. This format allows facile serial fragmentation of both the peptide backbone and PTM at one or more locations. (2) A novel scoring scheme based on calculation of the "Ensemble Score (ES)," a measure that scores and rank-orders MS/MS spectrum for N- and O-linked glycopeptides using cross-correlation and probability based analyses. (3) A false discovery rate (FDR) calculation scheme where decoy glycopeptides are created by simultaneously scrambling the amino acid sequence and by introducing artificial monosaccharides by perturbing the original sugar mass. Parallel computing facilities and user-friendly GUIs (Graphical User Interfaces) are also provided. GlycoPAT is used to catalogue site-specific glycosylation on simple glycoproteins, standard protein mixtures and human plasma cryoprecipitate samples in three common MS/MS fragmentation modes: CID, HCD and ETD. It is also used to identify 960 unique glycopeptides in cell lysates from prostate cancer cells. The results show that the simultaneous consideration of peptide and glycan fragmentation is necessary for high quality MS n spectrum annotation in CID and HCD fragmentation modes. Additionally, they confirm the suitability of GlycoPAT to analyze shotgun glycoproteomics data. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
The Pacific Northwest National Laboratory library of bacterial and archaeal proteomic biodiversity
Payne, Samuel H.; Monroe, Matthew E.; Overall, Christopher C.; ...
2015-08-18
This dataset deposition announces the submission to public repositories of the PNNL Biodiversity Library, a large collection of global proteomics data for 112 bacterial and archaeal organisms. The data comprises 35,162 tandem mass spectrometry (MS/MS) datasets from ~10 years of research. All data has been searched, annotated and organized in a consistent manner to promote reuse by the community. Protein identifications were cross-referenced with KEGG functional annotations which allows for pathway oriented investigation. We present the data as a freely available community resource. A variety of data re-use options are described for computational modeling, proteomics assay design and bioengineering. Instrumentmore » data and analysis files are available at ProteomeXchange via the MassIVE partner repository under the identifiers PXD001860 and MSV000079053.« less
The Pacific Northwest National Laboratory library of bacterial and archaeal proteomic biodiversity
DOE Office of Scientific and Technical Information (OSTI.GOV)
Payne, Samuel H.; Monroe, Matthew E.; Overall, Christopher C.
This dataset deposition announces the submission to public repositories of the PNNL Biodiversity Library, a large collection of global proteomics data for 112 bacterial and archaeal organisms. The data comprises 35,162 tandem mass spectrometry (MS/MS) datasets from ~10 years of research. All data has been searched, annotated and organized in a consistent manner to promote reuse by the community. Protein identifications were cross-referenced with KEGG functional annotations which allows for pathway oriented investigation. We present the data as a freely available community resource. A variety of data re-use options are described for computational modeling, proteomics assay design and bioengineering. Instrumentmore » data and analysis files are available at ProteomeXchange via the MassIVE partner repository under the identifiers PXD001860 and MSV000079053.« less
Stadlmann, Johannes; Hoi, David M; Taubenschmid, Jasmin; Mechtler, Karl; Penninger, Josef M
2018-05-18
SugarQb (www.imba.oeaw.ac.at/sugarqb) is a freely available collection of computational tools for the automated identification of intact glycopeptides from high-resolution HCD MS/MS data-sets in the Proteome Discoverer environment. We report the migration of SugarQb to the latest and free version of Proteome Discoverer 2.1, and apply it to the analysis of PNGase F-resistant N-glycopeptides from mouse embryonic stem cells. The analysis of intact glycopeptides highlights unexpected technical limitations to PNGase F-dependent glycoproteomic workflows at the proteome level, and warrants a critical re-interpretation of seminal data-sets in the context of N-glycosylation-site prediction. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Lichti, Cheryl F.; Fan, Xiuzhen; English, Robert D.; Zhang, Yafang; Li, Dingge; Kong, Fanping; Sinha, Mala; Andersen, Clark R.; Spratt, Heidi; Luxon, Bruce A.; Green, Thomas A.
2014-01-01
Prior research demonstrated that environmental enrichment creates individual differences in behavior leading to a protective addiction phenotype in rats. Understanding the mechanisms underlying this phenotype will guide selection of targets for much-needed novel pharmacotherapeutics. The current study investigates differences in proteome expression in the nucleus accumbens of enriched and isolated rats and the proteomic response to cocaine self-administration using a liquid chromatography mass spectrometry (LCMS) technique to quantify 1917 proteins. Results of complementary Ingenuity Pathways Analyses (IPA) and gene set enrichment analyses (GSEA), both performed using protein quantitative data, demonstrate that cocaine increases vesicular transporters for dopamine and glutamate as well as increasing proteins in the RhoA pathway. Further, cocaine regulates proteins related to ERK, CREB and AKT signaling. Environmental enrichment altered expression of a large number of proteins implicated in a diverse number of neuronal functions (e.g., energy production, mRNA splicing, and ubiquitination), molecular cascades (e.g., protein kinases), psychiatric disorders (e.g., mood disorders), and neurodegenerative diseases (e.g., Huntington's and Alzheimer's diseases). Upregulation of energy metabolism components in EC rats was verified using RNA sequencing. Most of the biological functions and pathways listed above were also identified in the Cocaine X Enrichment interaction analysis, providing clear evidence that enriched and isolated rats respond quite differently to cocaine exposure. The overall impression of the current results is that enriched saline-administering rats have a unique proteomic complement compared to enriched cocaine-administering rats as well as saline and cocaine-taking isolated rats. These results identify possible mechanisms of the protective phenotype and provide fertile soil for developing novel pharmacotherapeutics. Proteomics data are available via ProteomeXchange with identifier PXD000990. PMID:25100957
Forensic proteomics for the evaluation of the post-mortem decay in bones.
Procopio, Noemi; Williams, Anna; Chamberlain, Andrew T; Buckley, Michael
2018-04-15
Current methods for evaluation the of post-mortem interval (PMI) of skeletal remains suffer from poor accuracy due to the great number of variables that affect the diagenetic process and to the lack of specific guidelines to address this issue. During decomposition, proteins can undergo cumulative decay over the time, resulting in a decrease in the range and abundance of proteins present (i.e., the proteome) in different tissues as well as in an increase of post-translational modifications occurring in these proteins. In this study, we investigate the applicability of bone proteomic analyses to simulated forensic contexts, looking for specific biomarkers that may help the estimation of PMI, as well as evaluate a previously discovered marker for the estimation of biological age. We noticed a reduction of particular plasma and muscle proteins with increasing PMIs, as well as an increased deamidation of biglycan, a protein with a role in modulating bone growth and mineralization. We also corroborated our previous results regarding the use of fetuin-A as a potential biomarker for the estimation of age-at-death, demonstrating the applicability and the great potential that proteomics may have towards forensic sciences. The estimation of the post-mortem interval has a key role in forensic investigations, however nowadays it still suffers from poor reliability, especially when body tissues are heavily decomposed. Here we propose for the first time the application of bone proteomics to the estimation of the time elapsed since death and found several new potential biomarkers to address this, demonstrating the applicability of proteomic analyses to forensic sciences. Copyright © 2018. Published by Elsevier B.V.
Albar, Juan Pablo; Binz, Pierre-Alain; Eisenacher, Martin; Jones, Andrew R; Mayer, Gerhard; Omenn, Gilbert S; Orchard, Sandra; Vizcaíno, Juan Antonio; Hermjakob, Henning
2015-01-01
Objective To describe the goals of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization, the methods that the PSI has employed to create data standards, the resulting output of the PSI, lessons learned from the PSI’s evolution, and future directions and synergies for the group. Materials and Methods The PSI has 5 categories of deliverables that have guided the group. These are minimum information guidelines, data formats, controlled vocabularies, resources and software tools, and dissemination activities. These deliverables are produced via the leadership and working group organization of the initiative, driven by frequent workshops and ongoing communication within the working groups. Official standards are subjected to a rigorous document process that includes several levels of peer review prior to release. Results We have produced and published minimum information guidelines describing what information should be provided when making data public, either via public repositories or other means. The PSI has produced a series of standard formats covering mass spectrometer input, mass spectrometer output, results of informatics analysis (both qualitative and quantitative analyses), reports of molecular interaction data, and gel electrophoresis analyses. We have produced controlled vocabularies that ensure that concepts are uniformly annotated in the formats and engaged in extensive software development and dissemination efforts so that the standards can efficiently be used by the community. Conclusion In its first dozen years of operation, the PSI has produced many standards that have accelerated the field of proteomics by facilitating data exchange and deposition to data repositories. We look to the future to continue developing standards for new proteomics technologies and workflows and mechanisms for integration with other omics data types. Our products facilitate the translation of genomics and proteomics findings to clinical and biological phenotypes. The PSI website can be accessed at http://www.psidev.info. PMID:25726569
Maringer, Kevin; Yousuf, Amjad; Heesom, Kate J; Fan, Jun; Lee, David; Fernandez-Sesma, Ana; Bessant, Conrad; Matthews, David A; Davidson, Andrew D
2017-01-19
Aedes aegypti is a vector for the (re-)emerging human pathogens dengue, chikungunya, yellow fever and Zika viruses. Almost half of the Ae. aegypti genome is comprised of transposable elements (TEs). Transposons have been linked to diverse cellular processes, including the establishment of viral persistence in insects, an essential step in the transmission of vector-borne viruses. However, up until now it has not been possible to study the overall proteome derived from an organism's mobile genetic elements, partly due to the highly divergent nature of TEs. Furthermore, as for many non-model organisms, incomplete genome annotation has hampered proteomic studies on Ae. aegypti. We analysed the Ae. aegypti proteome using our new proteomics informed by transcriptomics (PIT) technique, which bypasses the need for genome annotation by identifying proteins through matched transcriptomic (rather than genomic) data. Our data vastly increase the number of experimentally confirmed Ae. aegypti proteins. The PIT analysis also identified hotspots of incomplete genome annotation, and showed that poor sequence and assembly quality do not explain all annotation gaps. Finally, in a proof-of-principle study, we developed criteria for the characterisation of proteomically active TEs. Protein expression did not correlate with a TE's genomic abundance at different levels of classification. Most notably, long terminal repeat (LTR) retrotransposons were markedly enriched compared to other elements. PIT was superior to 'conventional' proteomic approaches in both our transposon and genome annotation analyses. We present the first proteomic characterisation of an organism's repertoire of mobile genetic elements, which will open new avenues of research into the function of transposon proteins in health and disease. Furthermore, our study provides a proof-of-concept that PIT can be used to evaluate a genome's annotation to guide annotation efforts which has the potential to improve the efficiency of annotation projects in non-model organisms. PIT therefore represents a valuable new tool to study the biology of the important vector species Ae. aegypti, including its role in transmitting emerging viruses of global public health concern.
Molina, Laurence; Salvetat, Nicolas; Ameur, Randa Ben; Peres, Sabine; Sommerer, Nicolas; Jarraya, Fayçal; Ayadi, Hammadi; Molina, Franck; Granier, Claude
2011-12-10
The characterization of the normal urinary proteome is steadily progressing and represents a major interest in the assessment of clinical urinary biomarkers. To estimate quantitatively the variability of the normal urinary proteome, urines of 20 healthy people were collected. We first evaluated the impact of the sample conservation temperature on urine proteome integrity. Keeping the urine sample at RT or at +4°C until storage at -80°C seems the best way for long-term storage of samples for 2D-GE analysis. The quantitative variability of the normal urinary proteome was estimated on the 20 urines mapped by 2D-GE. The occurrence of the 910 identified spots was analysed throughout the gels and represented in a virtual 2D gel. Sixteen percent of the spots were found to occur in all samples and 23% occurred in at least 90% of urines. About 13% of the protein spots were present only in 10% or less of the samples, thus representing the most variable part of the normal urinary proteome. Twenty proteins corresponding to a fraction of the fully conserved spots were identified by mass spectrometry. In conclusion, a "public" urinary proteome, common to healthy individuals, seems to coexist with a "private" urinary proteome, which is more specific to each individual. Copyright © 2011 Elsevier B.V. All rights reserved.
A unique proteomic profile on surface IgM ligation in unmutated chronic lymphocytic leukemia
Perrot, Aurore; Pionneau, Cédric; Nadaud, Sophie; Davi, Frédéric; Leblond, Véronique; Jacob, Frédéric; Merle-Béral, Hélène; Herbrecht, Raoul; Béné, Marie-Christine; Gribben, John G.; Vallat, Laurent
2011-01-01
Chronic lymphocytic leukemia (CLL) is characterized by a highly variable clinical course with 2 extreme subsets: indolent, ZAP70− and mutated immunoglobulin heavy chain gene (M-CLL); and aggressive, ZAP70+ and unmutated immunoglobulin heavy chain (UM-CLL). Given the long-term suspicion of antigenic stimulation as a primum movens in the disease, the role of the B-cell receptor has been extensively studied in various experimental settings; albeit scarcely in a comparative dynamic proteomic approach. Here we use a quantitative 2-dimensional fluorescence difference gel electrophoresis technology to compare 48 proteomic profiles of the 2 CLL subsets before and after anti-IgM ligation. Differentially expressed proteins were subsequently identified by mass spectrometry. We show that unstimulated M- and UM-CLL cells display distinct proteomic profiles. Furthermore, anti-IgM stimulation induces a specific proteomic response, more pronounced in the more aggressive CLL. Statistical analyses demonstrate several significant protein variations according to stimulation conditions. Finally, we identify an intermediate form of M-CLL cells, with an indolent profile (ZAP70−) but sharing aggressive proteomic profiles alike UM-CLL cells. Collectively, this first quantitative and dynamic proteome analysis of CLL further dissects the complex molecular pathway after B-cell receptor stimulation and depicts distinct proteomic profiles, which could lead to novel molecular stratification of the disease. PMID:21602524
Lindsey, Merry L; Mayr, Manuel; Gomes, Aldrin V; Delles, Christian; Arrell, D Kent; Murphy, Anne M; Lange, Richard A; Costello, Catherine E; Jin, Yu-Fang; Laskowitz, Daniel T; Sam, Flora; Terzic, Andre; Van Eyk, Jennifer; Srinivas, Pothur R
2015-09-01
The year 2014 marked the 20th anniversary of the coining of the term proteomics. The purpose of this scientific statement is to summarize advances over this period that have catalyzed our capacity to address the experimental, translational, and clinical implications of proteomics as applied to cardiovascular health and disease and to evaluate the current status of the field. Key successes that have energized the field are delineated; opportunities for proteomics to drive basic science research, facilitate clinical translation, and establish diagnostic and therapeutic healthcare algorithms are discussed; and challenges that remain to be solved before proteomic technologies can be readily translated from scientific discoveries to meaningful advances in cardiovascular care are addressed. Proteomics is the result of disruptive technologies, namely, mass spectrometry and database searching, which drove protein analysis from 1 protein at a time to protein mixture analyses that enable large-scale analysis of proteins and facilitate paradigm shifts in biological concepts that address important clinical questions. Over the past 20 years, the field of proteomics has matured, yet it is still developing rapidly. The scope of this statement will extend beyond the reaches of a typical review article and offer guidance on the use of next-generation proteomics for future scientific discovery in the basic research laboratory and clinical settings. © 2015 American Heart Association, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pfiffner, Susan M.; Löffler, Frank; Ritalahti, Kirsti
The overall goal for this funded project was to develop and exploit environmental metaproteomics tools to identify biomarkers for monitoring microbial activity affecting U speciation at U-contaminated sites, correlate metaproteomics profiles with geochemical parameters and U(VI) reduction activity (or lack thereof), elucidate mechanisms contributing to U(VI) reduction, and provide remediation project managers with additional information to make science-based site management decisions for achieving cleanup goals more efficiently. Although significant progress has been made in elucidating the microbiology contribution to metal and radionuclide reduction, the cellular components, pathway(s), and mechanisms involved in U trans-formation remain poorly understood. Recent advances in (meta)proteomicsmore » technology enable detailed studies of complex samples, including environmental samples, which differ between sites and even show considerable variability within the same site (e.g., the Oak Ridge IFRC site). Additionally, site-specific geochemical conditions affect microbial activity and function, suggesting generalized assessment and interpretations may not suffice. This research effort integrated current understanding of the microbiology and biochemistry of U(VI) reduction and capitalize on advances in proteomics technology made over the past few years. Field-related analyses used Oak Ridge IFRC field ground water samples from locations where slow-release substrate biostimulation has been implemented to accelerate in situ U(VI) reduction rates. Our overarching hypothesis was that the metabolic signature in environmental samples, as deciphered by the metaproteome measurements, would show a relationship with U(VI) reduction activity. Since metaproteomic and metagenomic characterizations were computationally challenging and time-consuming, we used a tiered approach that combines database mining, controlled laboratory studies, U(VI) reduction activity measurements, phylogenetic analyses, and gene expression studies to support the metaproteomics characterizations. Growth experiments of target microorganisms (Anaeromyxobacter, Shewanella, Geobacter) revealed tremendous respiratory versatility, as evidenced by the ability to utilize a range of electron donors (e.g. acetate, hydrogen, pyruvate, lactate, succinate, formate) and electron acceptors (e.g. nitrate, fumarate, halogenated phenols, ferric iron, nitrous oxide, etc.). In particular, the dissimilatory metabolic reduction of metals, including radionuclides, by target microorganisms spurred interest for in situ bioremediation of contaminated soils and sediments. Distinct c-type cytochrome expression patterns were observed in target microorganisms grown with the different electron acceptors. For each target microorganism, the core proteome covered almost all metabolic pathways represented by their corresponding pan-proteomes. Unique proteins were detected for each target microorganism, and their expression and possible functionalities were linked to specific growth conditions through proteomics measurements. Optimization of the proteomic tools included in-depth comprehensive metagenomic and metaproteomic analyses on a limited number of samples. The optimized metaproteomic analyses were then applied to Oak Ridge IFRC field samples from the slow-release substrate biostimulation. Metaproteomic analysis and pathway mapping results demonstrated the distinct effects of metal and non-metal growth conditions on the proteome expression. With these metaproteomic tools, we identified which previously hypothetical metabolic pathways were active during the analyzed time points of the slow release substrate biostimulation. Thus, we demonstrated the utility of these tools for site assessment, efficient implementation of bioremediation and long-term monitoring. This research of detailed protein analysis linked with metal reduction activity did (1) show that c-type cytochrome isoforms, previously associated with radionuclide reduction activity, are suitable biomarkers, (2) identify new biomarker targets for site assessment and bioremediation monitoring, and (3) provide new information about specific proteins and mechanisms involved in U(VI) reduction and immobilization. This expanded metagenomic and metaproteomic toolbox contributed to implementing science-driven site management with broad benefits to the DOE mission.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Mingqun ..; Kikuchi, Takane; Brewer, Heather M.
2011-02-17
Anaplasma phagocytophilum and Ehrlichia chaffeensis are obligatory intracellular {alpha}-proteobacteria that infect human leukocytes and cause potentially fatal emerging zoonoses. In the present study, we determined global protein expression profiles of these bacteria cultured in the human promyelocytic leukemia cell line, HL-60. Mass spectrometric (MS) analyses identified a total of 1,212 A. phagocytophilum and 1,021 E. chaffeensis proteins, representing 89.3 and 92.3% of the predicted bacterial proteomes, respectively. Nearly all bacterial proteins ({approx}99%) with known functions were expressed, whereas only approximately 80% of hypothetical proteins were detected in infected human cells. Quantitative MS/MS analyses indicated that highly expressed proteins in bothmore » bacteria included chaperones, enzymes involved in biosynthesis and metabolism, and outer membrane proteins, such as A. phagocytophilum P44 and E. chaffeensis P28/OMP-1. Among 113 A. phagocytophilum p44 paralogous genes, 110 of them were expressed and 88 of them were encoded by pseudogenes. In addition, bacterial infection of HL-60 cells up-regulated the expression of human proteins involved mostly in cytoskeleton components, vesicular trafficking, cell signaling, and energy metabolism, but down regulated some pattern recognition receptors involved in innate immunity. Our proteomics data represent a comprehensive analysis of A. phagocytophilum and E. chaffeensis proteomes, and provide a quantitative view of human host protein expression profiles regulated by bacterial infection. The availability of these proteomic data will provide new insights into biology and pathogenesis of these obligatory intracellular pathogens.« less
Proteogenomic characterization of human colon and rectal cancer
Zhang, Bing; Wang, Jing; Wang, Xiaojing; Zhu, Jing; Liu, Qi; Shi, Zhiao; Chambers, Matthew C.; Zimmerman, Lisa J.; Shaddox, Kent F.; Kim, Sangtae; Davies, Sherri R.; Wang, Sean; Wang, Pei; Kinsinger, Christopher R.; Rivers, Robert C.; Rodriguez, Henry; Townsend, R. Reid; Ellis, Matthew J.C.; Carr, Steven A.; Tabb, David L.; Coffey, Robert J.; Slebos, Robbert J.C.; Liebler, Daniel C.
2014-01-01
Summary We analyzed proteomes of colon and rectal tumors previously characterized by the Cancer Genome Atlas (TCGA) and performed integrated proteogenomic analyses. Somatic variants displayed reduced protein abundance compared to germline variants. mRNA transcript abundance did not reliably predict protein abundance differences between tumors. Proteomics identified five proteomic subtypes in the TCGA cohort, two of which overlapped with the TCGA “MSI/CIMP” transcriptomic subtype, but had distinct mutation, methylation, and protein expression patterns associated with different clinical outcomes. Although copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q candidates including HNF4A, TOMM34 and SRC. Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology. PMID:25043054
Comparing Simplification Strategies for the Skeletal Muscle Proteome
Geary, Bethany; Young, Iain S.; Cash, Phillip; Whitfield, Phillip D.; Doherty, Mary K.
2016-01-01
Skeletal muscle is a complex tissue that is dominated by the presence of a few abundant proteins. This wide dynamic range can mask the presence of lower abundance proteins, which can be a confounding factor in large-scale proteomic experiments. In this study, we have investigated a number of pre-fractionation methods, at both the protein and peptide level, for the characterization of the skeletal muscle proteome. The analyses revealed that the use of OFFGEL isoelectric focusing yielded the largest number of protein identifications (>750) compared to alternative gel-based and protein equalization strategies. Further, OFFGEL led to a substantial enrichment of a different sub-population of the proteome. Filter-aided sample preparation (FASP), coupled to peptide-level OFFGEL provided more confidence in the results due to a substantial increase in the number of peptides assigned to each protein. The findings presented here support the use of a multiplexed approach to proteome characterization of skeletal muscle, which has a recognized imbalance in the dynamic range of its protein complement. PMID:28248220
Noninvasive diagnosis of intraamniotic infection: proteomic biomarkers in vaginal fluid.
Hitti, Jane; Lapidus, Jodi A; Lu, Xinfang; Reddy, Ashok P; Jacob, Thomas; Dasari, Surendra; Eschenbach, David A; Gravett, Michael G; Nagalla, Srinivasa R
2010-07-01
We analyzed the vaginal fluid proteome to identify biomarkers of intraamniotic infection among women in preterm labor. Proteome analysis was performed on vaginal fluid specimens from women with preterm labor, using multidimensional liquid chromatography, tandem mass spectrometry, and label-free quantification. Enzyme immunoassays were used to quantify candidate proteins. Classification accuracy for intraamniotic infection (positive amniotic fluid bacterial culture and/or interleukin-6 >2 ng/mL) was evaluated using receiver-operator characteristic curves obtained by logistic regression. Of 170 subjects, 30 (18%) had intraamniotic infection. Vaginal fluid proteome analysis revealed 338 unique proteins. Label-free quantification identified 15 proteins differentially expressed in intraamniotic infection, including acute-phase reactants, immune modulators, high-abundance amniotic fluid proteins and extracellular matrix-signaling factors; these findings were confirmed by enzyme immunoassay. A multi-analyte algorithm showed accurate classification of intraamniotic infection. Vaginal fluid proteome analyses identified proteins capable of discriminating between patients with and without intraamniotic infection. Copyright (c) 2010 Mosby, Inc. All rights reserved.
de Bernonville, Thomas Dugé; Albenne, Cécile; Arlat, Matthieu; Hoffmann, Laurent; Lauber, Emmanuelle; Jamet, Elisabeth
2014-01-01
Proteomic analysis of xylem sap has recently become a major field of interest to understand several biological questions related to plant development and responses to environmental clues. The xylem sap appears as a dynamic fluid undergoing changes in its proteome upon abiotic and biotic stresses. Unlike cell compartments which are amenable to purification in sufficient amount prior to proteomic analysis, the xylem sap has to be collected in particular conditions to avoid contamination by intracellular proteins and to obtain enough material. A model plant like Arabidopsis thaliana is not suitable for such an analysis because efficient harvesting of xylem sap is difficult. The analysis of the xylem sap proteome also requires specific procedures to concentrate proteins and to focus on proteins predicted to be secreted. Indeed, xylem sap proteins appear to be synthesized and secreted in the root stele or to originate from dying differentiated xylem cells. This chapter describes protocols to collect xylem sap from Brassica species and to prepare total and N-glycoprotein extracts for identification of proteins by mass spectrometry analyses and bioinformatics.
Combining Search Engines for Comparative Proteomics
Tabb, David
2012-01-01
Many proteomics laboratories have found spectral counting to be an ideal way to recognize biomarkers that differentiate cohorts of samples. This approach assumes that proteins that differ in quantity between samples will generate different numbers of identifiable tandem mass spectra. Increasingly, researchers are employing multiple search engines to maximize the identifications generated from data collections. This talk evaluates four strategies to combine information from multiple search engines in comparative proteomics. The “Count Sum” model pools the spectra across search engines. The “Vote Counting” model combines the judgments from each search engine by protein. Two other models employ parametric and non-parametric analyses of protein-specific p-values from different search engines. We evaluated the four strategies in two different data sets. The ABRF iPRG 2009 study generated five LC-MS/MS analyses of “red” E. coli and five analyses of “yellow” E. coli. NCI CPTAC Study 6 generated five concentrations of Sigma UPS1 spiked into a yeast background. All data were identified with X!Tandem, Sequest, MyriMatch, and TagRecon. For both sample types, “Vote Counting” appeared to manage the diverse identification sets most effectively, yielding heightened discrimination as more search engines were added.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chin, Mark H.; Qian, Weijun; Wang, Haixing
2008-02-10
The molecular mechanisms underlying the changes in the nigrostriatal pathway in Parkinson disease (PD) are not completely understood. Here we use mass spectrometry and microarrays to study the proteomic and transcriptomic changes in the striatum of two mouse models of PD, induced by the distinct neurotoxins 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) and methamphetamine (METH). Proteomic analyses resulted in the identification and relative quantification of 912 proteins with two or more unique peptides and 85 proteins with significant abundance changes following neurotoxin treatment. Similarly, microarray analyses revealed 181 genes with significant changes in mRNA following neurotoxin treatment. The combined protein and gene list providesmore » a clearer picture of the potential mechanisms underlying neurodegeneration observed in PD. Functional analysis of this combined list revealed a number of significant categories, including mitochondrial dysfunction, oxidative stress response and apoptosis. Additionally, codon usage and miRNAs may play an important role in translational control in the striatum. These results constitute one of the largest datasets integrating protein and transcript changes for these neurotoxin models with many similar endpoint phenotypes but distinct mechanisms.« less
Khan, Gulafshana Hafeez; Galazis, Nicolas; Docheva, Nikolina; Layfield, Robert; Atiomo, William
2015-01-01
STUDY QUESTION Do any proteomic biomarkers previously identified for pre-eclampsia (PE) overlap with those identified in women with polycystic ovary syndrome (PCOS). SUMMARY ANSWER Five previously identified proteomic biomarkers were found to be common in women with PE and PCOS when compared with controls. WHAT IS KNOWN ALREADY Various studies have indicated an association between PCOS and PE; however, the pathophysiological mechanisms supporting this association are not known. STUDY DESIGN, SIZE, DURATION A systematic review and update of our PCOS proteomic biomarker database was performed, along with a parallel review of PE biomarkers. The study included papers from 1980 to December 2013. PARTICIPANTS/MATERIALS, SETTING, METHODS In all the studies analysed, there were a total of 1423 patients and controls. The number of proteomic biomarkers that were catalogued for PE was 192. MAIN RESULTS AND THE ROLE OF CHANCE Five proteomic biomarkers were shown to be differentially expressed in women with PE and PCOS when compared with controls: transferrin, fibrinogen α, β and γ chain variants, kininogen-1, annexin 2 and peroxiredoxin 2. In PE, the biomarkers were identified in serum, plasma and placenta and in PCOS, the biomarkers were identified in serum, follicular fluid, and ovarian and omental biopsies. LIMITATIONS, REASONS FOR CAUTION The techniques employed to detect proteomics have limited ability in identifying proteins that are of low abundance, some of which may have a diagnostic potential. The sample sizes and number of biomarkers identified from these studies do not exclude the risk of false positives, a limitation of all biomarker studies. The biomarkers common to PE and PCOS were identified from proteomic analyses of different tissues. WIDER IMPLICATIONS OF THE FINDINGS This data amalgamation of the proteomic studies in PE and in PCOS, for the first time, discovered a panel of five biomarkers for PE which are common to women with PCOS, including transferrin, fibrinogen α, β and γ chain variants, kininogen-1, annexin 2 and peroxiredoxin 2. If validated, these biomarkers could provide a useful framework for the knowledge infrastructure in this area. To accomplish this goal, a well co-ordinated multidisciplinary collaboration of clinicians, basic scientists and mathematicians is vital. STUDY FUNDING/COMPETING INTEREST(S) No financial support was obtained for this project. There are no conflicts of interest. PMID:25351721
Pasculescu, Adrian; Schoof, Erwin M; Creixell, Pau; Zheng, Yong; Olhovsky, Marina; Tian, Ruijun; So, Jonathan; Vanderlaan, Rachel D; Pawson, Tony; Linding, Rune; Colwill, Karen
2014-04-04
A major challenge in mass spectrometry and other large-scale applications is how to handle, integrate, and model the data that is produced. Given the speed at which technology advances and the need to keep pace with biological experiments, we designed a computational platform, CoreFlow, which provides programmers with a framework to manage data in real-time. It allows users to upload data into a relational database (MySQL), and to create custom scripts in high-level languages such as R, Python, or Perl for processing, correcting and modeling this data. CoreFlow organizes these scripts into project-specific pipelines, tracks interdependencies between related tasks, and enables the generation of summary reports as well as publication-quality images. As a result, the gap between experimental and computational components of a typical large-scale biology project is reduced, decreasing the time between data generation, analysis and manuscript writing. CoreFlow is being released to the scientific community as an open-sourced software package complete with proteomics-specific examples, which include corrections for incomplete isotopic labeling of peptides (SILAC) or arginine-to-proline conversion, and modeling of multiple/selected reaction monitoring (MRM/SRM) results. CoreFlow was purposely designed as an environment for programmers to rapidly perform data analysis. These analyses are assembled into project-specific workflows that are readily shared with biologists to guide the next stages of experimentation. Its simple yet powerful interface provides a structure where scripts can be written and tested virtually simultaneously to shorten the life cycle of code development for a particular task. The scripts are exposed at every step so that a user can quickly see the relationships between the data, the assumptions that have been made, and the manipulations that have been performed. Since the scripts use commonly available programming languages, they can easily be transferred to and from other computational environments for debugging or faster processing. This focus on 'on the fly' analysis sets CoreFlow apart from other workflow applications that require wrapping of scripts into particular formats and development of specific user interfaces. Importantly, current and future releases of data analysis scripts in CoreFlow format will be of widespread benefit to the proteomics community, not only for uptake and use in individual labs, but to enable full scrutiny of all analysis steps, thus increasing experimental reproducibility and decreasing errors. This article is part of a Special Issue entitled: Can Proteomics Fill the Gap Between Genomics and Phenotypes? Copyright © 2014 Elsevier B.V. All rights reserved.
Bacterial membrane proteomics.
Poetsch, Ansgar; Wolters, Dirk
2008-10-01
About one quarter to one third of all bacterial genes encode proteins of the inner or outer bacterial membrane. These proteins perform essential physiological functions, such as the import or export of metabolites, the homeostasis of metal ions, the extrusion of toxic substances or antibiotics, and the generation or conversion of energy. The last years have witnessed completion of a plethora of whole-genome sequences of bacteria important for biotechnology or medicine, which is the foundation for proteome and other functional genome analyses. In this review, we discuss the challenges in membrane proteome analysis, starting from sample preparation and leading to MS-data analysis and quantification. The current state of available proteomics technologies as well as their advantages and disadvantages will be described with a focus on shotgun proteomics. Then, we will briefly introduce the most abundant proteins and protein families present in bacterial membranes before bacterial membrane proteomics studies of the last years will be presented. It will be shown how these works enlarged our knowledge about the physiological adaptations that take place in bacteria during fine chemical production, bioremediation, protein overexpression, and during infections. Furthermore, several examples from literature demonstrate the suitability of membrane proteomics for the identification of antigens and different pathogenic strains, as well as the elucidation of membrane protein structure and function.
Schubert, Peter; Devine, Dana V
2010-01-03
Proteomics has brought new perspectives to the fields of hematology and transfusion medicine in the last decade. The steady improvement of proteomic technology is propelling novel discoveries of molecular mechanisms by studying protein expression, post-translational modifications and protein interactions. This review article focuses on the application of proteomics to the identification of molecular mechanisms leading to the deterioration of blood platelets during storage - a critical aspect in the provision of platelet transfusion products. Several proteomic approaches have been employed to analyse changes in the platelet protein profile during storage and the obtained data now need to be translated into platelet biochemistry in order to connect the results to platelet function. Targeted biochemical applications then allow the identification of points for intervention in signal transduction pathways. Once validated and placed in a transfusion context, these data will provide further understanding of the underlying molecular mechanisms leading to platelet storage lesion. Future aspects of proteomics in blood banking will aim to make use of protein markers identified for platelet storage lesion development to monitor proteome changes when alterations such as the use of additive solutions or pathogen reduction strategies are put in place in order to improve platelet quality for patients. (c) 2009 Elsevier B.V. All rights reserved.
Analyzing large-scale proteomics projects with latent semantic indexing.
Klie, Sebastian; Martens, Lennart; Vizcaíno, Juan Antonio; Côté, Richard; Jones, Phil; Apweiler, Rolf; Hinneburg, Alexander; Hermjakob, Henning
2008-01-01
Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.
Couto, Narciso; Schooling, Sarah R; Dutcher, John R; Barber, Jill
2015-10-02
In the present work, two different proteomic platforms, gel-based and gel-free, were used to map the matrix and outer membrane vesicle exoproteomes of Pseudomonas aeruginosa PAO1 biofilms. These two proteomic strategies allowed us a confident identification of 207 and 327 proteins from enriched outer membrane vesicles and whole matrix isolated from biofilms. Because of the physicochemical characteristics of these subproteomes, the two strategies showed complementarity, and thus, the most comprehensive analysis of P. aeruginosa exoproteome to date was achieved. Under our conditions, outer membrane vesicles contribute approximately 20% of the whole matrix proteome, demonstrating that membrane vesicles are an important component of the matrix. The proteomic profiles were analyzed in terms of their biological context, namely, a biofilm. Accordingly relevant metabolic processes involved in cellular adaptation to the biofilm lifestyle as well as those related to P. aeruginosa virulence capabilities were a key feature of the analyses. The diversity of the matrix proteome corroborates the idea of high heterogeneity within the biofilm; cells can display different levels of metabolism and can adapt to local microenvironments making this proteomic analysis challenging. In addition to analyzing our own primary data, we extend the analysis to published data by other groups in order to deepen our understanding of the complexity inherent within biofilm populations.
MitoMiner: a data warehouse for mitochondrial proteomics data
Smith, Anthony C.; Blackshaw, James A.; Robinson, Alan J.
2012-01-01
MitoMiner (http://mitominer.mrc-mbu.cam.ac.uk/) is a data warehouse for the storage and analysis of mitochondrial proteomics data gathered from publications of mass spectrometry and green fluorescent protein tagging studies. In MitoMiner, these data are integrated with data from UniProt, Gene Ontology, Online Mendelian Inheritance in Man, HomoloGene, Kyoto Encyclopaedia of Genes and Genomes and PubMed. The latest release of MitoMiner stores proteomics data sets from 46 studies covering 11 different species from eumetazoa, viridiplantae, fungi and protista. MitoMiner is implemented by using the open source InterMine data warehouse system, which provides a user interface allowing users to upload data for analysis, personal accounts to store queries and results and enables queries of any data in the data model. MitoMiner also provides lists of proteins for use in analyses, including the new MitoMiner mitochondrial proteome reference sets that specify proteins with substantial experimental evidence for mitochondrial localization. As further mitochondrial proteomics data sets from normal and diseased tissue are published, MitoMiner can be used to characterize the variability of the mitochondrial proteome between tissues and investigate how changes in the proteome may contribute to mitochondrial dysfunction and mitochondrial-associated diseases such as cancer, neurodegenerative diseases, obesity, diabetes, heart failure and the ageing process. PMID:22121219
Proteogenomics Dashboard for the Human Proteome Project.
Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto
2015-09-04
dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analog genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell lines experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modifications information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.
SELDI PROTEINCHIP-BASED LIVER BIOMARKERS IN FUNGICIDE EXPOSED ZEBRAFISH
The research presented here is part of a three-phased small fish computational toxicology project using a combination of 1) whole organism endpoints, 2) genomic, proteomic, and metabolomic approaches, and 3) computational modeling to (a) identify new molecular biomarkers of expos...
Shaping biological knowledge: applications in proteomics.
Lisacek, F; Chichester, C; Gonnet, P; Jaillet, O; Kappus, S; Nikitin, F; Roland, P; Rossier, G; Truong, L; Appel, R
2004-01-01
The central dogma of molecular biology has provided a meaningful principle for data integration in the field of genomics. In this context, integration reflects the known transitions from a chromosome to a protein sequence: transcription, intron splicing, exon assembly and translation. There is no such clear principle for integrating proteomics data, since the laws governing protein folding and interactivity are not quite understood. In our effort to bring together independent pieces of information relative to proteins in a biologically meaningful way, we assess the bias of bioinformatics resources and consequent approximations in the framework of small-scale studies. We analyse proteomics data while following both a data-driven (focus on proteins smaller than 10 kDa) and a hypothesis-driven (focus on whole bacterial proteomes) approach. These applications are potentially the source of specialized complements to classical biological ontologies.
The Use of Proteomics in Assisted Reproduction.
Kosteria, Ioanna; Anagnostopoulos, Athanasios K; Kanaka-Gantenbein, Christina; Chrousos, George P; Tsangaris, George T
2017-01-01
Despite the explosive increase in the use of Assisted Reproductive Technologies (ART) over the last 30 years, their success rates remain suboptimal. Proteomics is a rapidly-evolving technology-driven science that has already been widely applied in the exploration of human reproduction and fertility, providing useful insights into its physiology and leading to the identification of numerous proteins that may be potential biomarkers and/or treatment targets of a successful ART pregnancy. Here we present a brief overview of the techniques used in proteomic analyses and attempt a comprehensive presentation of recent data from mass spectrometry-based proteomic studies in humans, regarding all components of ARTs, including the male and female gamete, the derived zygote and embryo, the endometrium and, finally, the ART offspring both pre- and postnatally. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Seto, Jason; Walsh, Michael P.; Mahadevan, Padmanabhan; Zhang, Qiwei; Seto, Donald
2010-01-01
Technological advances and increasingly cost-effect methodologies in DNA sequencing and computational analysis are providing genome and proteome data for human adenovirus research. Applying these tools, data and derived knowledge to the development of vaccines against these pathogens will provide effective prophylactics. The same data and approaches can be applied to vector development for gene delivery in gene therapy and vaccine delivery protocols. Examination of several field strain genomes and their analyses provide examples of data that are available using these approaches. An example of the development of HAdV-B3 both as a vaccine and also as a vector is presented. PMID:21994597
The Proteome Folding Project: Proteome-scale prediction of structure and function
Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard
2011-01-01
The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995
Unexpected features of the dark proteome
Perdigão, Nelson; Heinrich, Julian; Stolte, Christian; Sabir, Kenneth S.; Buckley, Michael J.; Tabor, Bruce; Signal, Beth; Gloss, Brian S.; Hammang, Christopher J.; Rost, Burkhard; Schafferhans, Andrea
2015-01-01
We surveyed the “dark” proteome–that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44–54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology. PMID:26578815
Winkler, Robert
2015-01-01
In biological mass spectrometry, crude instrumental data need to be converted into meaningful theoretical models. Several data processing and data evaluation steps are required to come to the final results. These operations are often difficult to reproduce, because of too specific computing platforms. This effect, known as 'workflow decay', can be diminished by using a standardized informatic infrastructure. Thus, we compiled an integrated platform, which contains ready-to-use tools and workflows for mass spectrometry data analysis. Apart from general unit operations, such as peak picking and identification of proteins and metabolites, we put a strong emphasis on the statistical validation of results and Data Mining. MASSyPup64 includes e.g., the OpenMS/TOPPAS framework, the Trans-Proteomic-Pipeline programs, the ProteoWizard tools, X!Tandem, Comet and SpiderMass. The statistical computing language R is installed with packages for MS data analyses, such as XCMS/metaXCMS and MetabR. The R package Rattle provides a user-friendly access to multiple Data Mining methods. Further, we added the non-conventional spreadsheet program teapot for editing large data sets and a command line tool for transposing large matrices. Individual programs, console commands and modules can be integrated using the Workflow Management System (WMS) taverna. We explain the useful combination of the tools by practical examples: (1) A workflow for protein identification and validation, with subsequent Association Analysis of peptides, (2) Cluster analysis and Data Mining in targeted Metabolomics, and (3) Raw data processing, Data Mining and identification of metabolites in untargeted Metabolomics. Association Analyses reveal relationships between variables across different sample sets. We present its application for finding co-occurring peptides, which can be used for target proteomics, the discovery of alternative biomarkers and protein-protein interactions. Data Mining derived models displayed a higher robustness and accuracy for classifying sample groups in targeted Metabolomics than cluster analyses. Random Forest models do not only provide predictive models, which can be deployed for new data sets, but also the variable importance. We demonstrate that the later is especially useful for tracking down significant signals and affected pathways in untargeted Metabolomics. Thus, Random Forest modeling supports the unbiased search for relevant biological features in Metabolomics. Our results clearly manifest the importance of Data Mining methods to disclose non-obvious information in biological mass spectrometry . The application of a Workflow Management System and the integration of all required programs and data in a consistent platform makes the presented data analyses strategies reproducible for non-expert users. The simple remastering process and the Open Source licenses of MASSyPup64 (http://www.bioprocess.org/massypup/) enable the continuous improvement of the system.
17th Chromosome-Centric Human Proteome Project Symposium in Tehran.
Meyfour, Anna; Pahlavan, Sara; Sobhanian, Hamid; Salekdeh, Ghasem Hosseini
2018-04-01
This report describes the 17th Chromosome-Centric Human Proteome Project which was held in Tehran, Iran, April 27 and 28, 2017. A brief summary of the symposium's talks including new technical and computational approaches for the identification of novel proteins from non-coding genomic regions, physicochemical and biological causes of missing proteins, and the close interactions between Chromosome- and Biology/Disease-driven Human Proteome Project are presented. A synopsis of decisions made on the prospective programs to maintain collaborative works, share resources and information, and establishment of a newly organized working group, the task force for missing protein analysis are discussed. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Managing expectations when publishing tools and methods for computational proteomics.
Martens, Lennart; Kohlbacher, Oliver; Weintraub, Susan T
2015-05-01
Computational tools are pivotal in proteomics because they are crucial for identification, quantification, and statistical assessment of data. The gateway to finding the best choice of a tool or approach for a particular problem is frequently journal articles, yet there is often an overwhelming variety of options that makes it hard to decide on the best solution. This is particularly difficult for nonexperts in bioinformatics. The maturity, reliability, and performance of tools can vary widely because publications may appear at different stages of development. A novel idea might merit early publication despite only offering proof-of-principle, while it may take years before a tool can be considered mature, and by that time it might be difficult for a new publication to be accepted because of a perceived lack of novelty. After discussions with members of the computational mass spectrometry community, we describe here proposed recommendations for organization of informatics manuscripts as a way to set the expectations of readers (and reviewers) through three different manuscript types that are based on existing journal designations. Brief Communications are short reports describing novel computational approaches where the implementation is not necessarily production-ready. Research Articles present both a novel idea and mature implementation that has been suitably benchmarked. Application Notes focus on a mature and tested tool or concept and need not be novel but should offer advancement from improved quality, ease of use, and/or implementation. Organizing computational proteomics contributions into these three manuscript types will facilitate the review process and will also enable readers to identify the maturity and applicability of the tool for their own workflows.
Neural Stem Cells (NSCs) and Proteomics*
Shoemaker, Lorelei D.; Kornblum, Harley I.
2016-01-01
Neural stem cells (NSCs) can self-renew and give rise to the major cell types of the CNS. Studies of NSCs include the investigation of primary, CNS-derived cells as well as animal and human embryonic stem cell (ESC)-derived and induced pluripotent stem cell (iPSC)-derived sources. NSCs provide a means with which to study normal neural development, neurodegeneration, and neurological disease and are clinically relevant sources for cellular repair to the damaged and diseased CNS. Proteomics studies of NSCs have the potential to delineate molecules and pathways critical for NSC biology and the means by which NSCs can participate in neural repair. In this review, we provide a background to NSC biology, including the means to obtain them and the caveats to these processes. We then focus on advances in the proteomic interrogation of NSCs. This includes the analysis of posttranslational modifications (PTMs); approaches to analyzing different proteomic compartments, such the secretome; as well as approaches to analyzing temporal differences in the proteome to elucidate mechanisms of differentiation. We also discuss some of the methods that will undoubtedly be useful in the investigation of NSCs but which have not yet been applied to the field. While many proteomics studies of NSCs have largely catalogued the proteome or posttranslational modifications of specific cellular states, without delving into specific functions, some have led to understandings of functional processes or identified markers that could not have been identified via other means. Many challenges remain in the field, including the precise identification and standardization of NSCs used for proteomic analyses, as well as how to translate fundamental proteomics studies to functional biology. The next level of investigation will require interdisciplinary approaches, combining the skills of those interested in the biochemistry of proteomics with those interested in modulating NSC function. PMID:26494823
Proteomic profiling of developing cotton fibers from wild and domesticated Gossypium barbadense.
Hu, Guanjing; Koh, Jin; Yoo, Mi-Jeong; Grupp, Kara; Chen, Sixue; Wendel, Jonathan F
2013-10-01
Pima cotton (Gossypium barbadense) is widely cultivated because of its long, strong seed trichomes ('fibers') used for premium textiles. These agronomically advanced fibers were derived following domestication and thousands of years of human-mediated crop improvement. To gain an insight into fiber development and evolution, we conducted comparative proteomic and transcriptomic profiling of developing fiber from an elite cultivar and a wild accession. Analyses using isobaric tag for relative and absolute quantification (iTRAQ) LC-MS/MS technology identified 1317 proteins in fiber. Of these, 205 were differentially expressed across developmental stages, and 190 showed differential expression between wild and cultivated forms, 14.4% of the proteome sampled. Human selection may have shifted the timing of developmental modules, such that some occur earlier in domesticated than in wild cotton. A novel approach was used to detect possible biased expression of homoeologous copies of proteins. Results indicate a significant partitioning of duplicate gene expression at the protein level, but an approximately equal degree of bias for each of the two constituent genomes of allopolyploid cotton. Our results demonstrate the power of complementary transcriptomic and proteomic approaches for the study of the domestication process. They also provide a rich database for mining for functional analyses of cotton improvement or evolution. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Burnum-Johnson, Kristin E; Nie, Song; Casey, Cameron P; Monroe, Matthew E; Orton, Daniel J; Ibrahim, Yehia M; Gritsenko, Marina A; Clauss, Therese R W; Shukla, Anil K; Moore, Ronald J; Purvine, Samuel O; Shi, Tujin; Qian, Weijun; Liu, Tao; Baker, Erin S; Smith, Richard D
2016-12-01
Current proteomic approaches include both broad discovery measurements and quantitative targeted analyses. In many cases, discovery measurements are initially used to identify potentially important proteins (e.g. candidate biomarkers) and then targeted studies are employed to quantify a limited number of selected proteins. Both approaches, however, suffer from limitations. Discovery measurements aim to sample the whole proteome but have lower sensitivity, accuracy, and quantitation precision than targeted approaches, whereas targeted measurements are significantly more sensitive but only sample a limited portion of the proteome. Herein, we describe a new approach that performs both discovery and targeted monitoring (DTM) in a single analysis by combining liquid chromatography, ion mobility spectrometry and mass spectrometry (LC-IMS-MS). In DTM, heavy labeled target peptides are spiked into tryptic digests and both the labeled and unlabeled peptides are detected using LC-IMS-MS instrumentation. Compared with the broad LC-MS discovery measurements, DTM yields greater peptide/protein coverage and detects lower abundance species. DTM also achieved detection limits similar to selected reaction monitoring (SRM) indicating its potential for combined high quality discovery and targeted analyses, which is a significant step toward the convergence of discovery and targeted approaches. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Sma3s: A universal tool for easy functional annotation of proteomes and transcriptomes.
Casimiro-Soriguer, Carlos S; Muñoz-Mérida, Antonio; Pérez-Pulido, Antonio J
2017-06-01
The current cheapening of next-generation sequencing has led to an enormous growth in the number of sequenced genomes and transcriptomes, allowing wet labs to get the sequences from their organisms of study. To make the most of these data, one of the first things that should be done is the functional annotation of the protein-coding genes. But it used to be a slow and tedious step that can involve the characterization of thousands of sequences. Sma3s is an accurate computational tool for annotating proteins in an unattended way. Now, we have developed a completely new version, which includes functionalities that will be of utility for fundamental and applied science. Currently, the results provide functional categories such as biological processes, which become useful for both characterizing particular sequence datasets and comparing results from different projects. But one of the most important implemented innovations is that it has now low computational requirements, and the complete annotation of a simple proteome or transcriptome usually takes around 24 hours in a personal computer. Sma3s has been tested with a large amount of complete proteomes and transcriptomes, and it has demonstrated its potential in health science and other specific projects. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Yu, Yanbao; Leng, Taohua; Yun, Dong; Liu, Na; Yao, Jun; Dai, Ying; Yang, Pengyuan; Chen, Xian
2013-01-01
Emerging evidences indicate that blood platelets function in multiple biological processes including immune response, bone metastasis and liver regeneration in addition to their known roles in hemostasis and thrombosis. Global elucidation of platelet proteome will provide the molecular base of these platelet functions. Here, we set up a high throughput platform for maximum exploration of the rat/human platelet proteome using integrated proteomics technologies, and then applied to identify the largest number of the proteins expressed in both rat and human platelets. After stringent statistical filtration, a total of 837 unique proteins matched with at least two unique peptides were precisely identified, making it the first comprehensive protein database so far for rat platelets. Meanwhile, quantitative analyses of the thrombin-stimulated platelets offered great insights into the biological functions of platelet proteins and therefore confirmed our global profiling data. A comparative proteomic analysis between rat and human platelets was also conducted, which revealed not only a significant similarity, but also an across-species evolutionary link that the orthologous proteins representing ‘core proteome’, and the ‘evolutionary proteome’ is actually a relatively static proteome. PMID:20443191
Hildebrandt, Petra; Surmann, Kristin; Salazar, Manuela Gesell; Normann, Nicole; Völker, Uwe; Schmidt, Frank
2016-10-01
Staphylococcus aureus is a Gram-positive opportunistic pathogen that is able to cause a broad range of infectious diseases in humans. Furthermore, S. aureus is able to survive inside nonprofessional phagocytic host cell which serve as a niche for the pathogen to hide from the immune system and antibiotics therapies. Modern OMICs technologies provide valuable tools to investigate host-pathogen interactions upon internalization. However, these experiments are often hampered by limited capabilities to retrieve bacteria from such an experimental setting. Thus, the aim of this study was to develop a labeling strategy allowing fast detection and quantitation of S. aureus in cell lysates or infected cell lines by flow cytometry for subsequent proteome analyses. Therefore, S. aureus cells were labeled with the DNA stain SYTO ® 9, or Vancomycin BODIPY ® FL (VMB), a glycopeptide antibiotic binding to most Gram-positive bacteria which was conjugated to a fluorescent dye. Staining of S. aureus HG001 with SYTO 9 allowed counting of bacteria from pure cultures but not in cell lysates from infection experiments. In contrast, with VMB it was feasible to stain bacteria from pure cultures as well as from samples of infection experiments. VMB can also be applied for histocytochemistry analysis of formaldehyde fixed cell layers grown on coverslips. Proteome analyses of S. aureus labeled with VMB revealed that the labeling procedure provoked only minor changes on proteome level and allowed cell sorting and analysis of S. aureus from infection settings with sensitivity similar to continuous gfp expression. Furthermore, VMB labeling allowed precise counting of internalized bacteria and can be employed for downstream analyses, e.g., proteomics, of strains not easily amendable to genetic manipulation such as clinical isolates. © 2016 International Society for Advancement of Cytometry. © 2016 International Society for Advancement of Cytometry.
Wan, Huafang; Cui, Yixin; Ding, Yijuan; Mei, Jiaqin; Dong, Hongli; Zhang, Wenxin; Wu, Shiqi; Liang, Ying; Zhang, Chunyu; Li, Jiana; Xiong, Qing; Qian, Wei
2016-01-01
Understanding the regulation of lipid metabolism is vital for genetic engineering of canola ( Brassica napus L.) to increase oil yield or modify oil composition. We conducted time-series analyses of transcriptomes and proteomes to uncover the molecular networks associated with oil accumulation and dynamic changes in these networks in canola. The expression levels of genes and proteins were measured at 2, 4, 6, and 8 weeks after pollination (WAP). Our results show that the biosynthesis of fatty acids is a dominant cellular process from 2 to 6 WAP, while the degradation mainly happens after 6 WAP. We found that genes in almost every node of fatty acid synthesis pathway were significantly up-regulated during oil accumulation. Moreover, significant expression changes of two genes, acetyl-CoA carboxylase and acyl-ACP desaturase, were detected on both transcriptomic and proteomic levels. We confirmed the temporal expression patterns revealed by the transcriptomic analyses using quantitative real-time PCR experiments. The gene set association analysis show that the biosynthesis of fatty acids and unsaturated fatty acids are the most significant biological processes from 2-4 WAP and 4-6 WAP, respectively, which is consistent with the results of time-series analyses. These results not only provide insight into the mechanisms underlying lipid metabolism, but also reveal novel candidate genes that are worth further investigation for their values in the genetic engineering of canola.
Ye, Dongping; Liang, Weiguo; Dai, Libing; Zhou, Longqiang; Yao, Yicun; Zhong, Xin; Chen, Honghui; Xu, Jiake
2015-05-01
Degeneration of the intervertebral disc (IVD) is a major chronic medical condition associated with back pain. To better understand the pathogenesis of IVD degeneration, we performed comparative and quantitative proteomic analyses of normal and degenerated human annulus fibrosus (AF) cells and identified proteins that are differentially expressed between them. Annulus fibrosus cells were isolated and cultured from patients with lumbar disc herniation (the experimental group, degenerated AF cells) and scoliosis patients who underwent orthopaedic surgery (the control group, normal AF cells). Comparative proteomic analyses of normal and degenerated cultured AF cells were carried out using 2-D electrophoresis, mass spectrometric analyses, and database searching. Quantitative analyses of silver-stained 2-D electrophoresis gels of normal and degenerated cultured AF cells identified 10 protein spots that showed the most altered differential expression levels between the two groups. Among these, three proteins were decreased, including heat shock cognate 71-kDa protein, glucose-6-phosphate 1-dehydrogenase, and protocadherin-23, whereas seven proteins were increased, including guanine nucleotide-binding protein G(i) subunit α-2, superoxide dismutase, transmembrane protein 51, adenosine receptor A3, 26S protease regulatory subunit 8, lipid phosphate phosphatase-related protein, and fatty acyl-crotonic acid reductase 1. These differentially expressed proteins might be involved in the pathophysiological process of IVD degeneration and have potential values as biomarkers of the degeneration of IVD. © 2015 Wiley Publishing Asia Pty Ltd.
Medina-Aunon, J. Alberto; Martínez-Bartolomé, Salvador; López-García, Miguel A.; Salazar, Emilio; Navajas, Rosana; Jones, Andrew R.; Paradela, Alberto; Albar, Juan P.
2011-01-01
The development of the HUPO-PSI's (Proteomics Standards Initiative) standard data formats and MIAPE (Minimum Information About a Proteomics Experiment) guidelines should improve proteomics data sharing within the scientific community. Proteomics journals have encouraged the use of these standards and guidelines to improve the quality of experimental reporting and ease the evaluation and publication of manuscripts. However, there is an evident lack of bioinformatics tools specifically designed to create and edit standard file formats and reports, or embed them within proteomics workflows. In this article, we describe a new web-based software suite (The ProteoRed MIAPE web toolkit) that performs several complementary roles related to proteomic data standards. First, it can verify that the reports fulfill the minimum information requirements of the corresponding MIAPE modules, highlighting inconsistencies or missing information. Second, the toolkit can convert several XML-based data standards directly into human readable MIAPE reports stored within the ProteoRed MIAPE repository. Finally, it can also perform the reverse operation, allowing users to export from MIAPE reports into XML files for computational processing, data sharing, or public database submission. The toolkit is thus the first application capable of automatically linking the PSI's MIAPE modules with the corresponding XML data exchange standards, enabling bidirectional conversions. This toolkit is freely available at http://www.proteored.org/MIAPE/. PMID:21983993
Quantitative proteomic analysis of bacterial enzymes released in cheese during ripening.
Jardin, Julien; Mollé, Daniel; Piot, Michel; Lortal, Sylvie; Gagnaire, Valérie
2012-04-02
Due to increasingly available bacterial genomes in databases, proteomic tools have recently been used to screen proteins expressed by micro-organisms in food in order to better understand their metabolism in situ. While the main objective is the systematic identification of proteins, the next step will be to bridge the gap between identification and quantification of these proteins. For that purpose, a new mass spectrometry-based approach was applied, using isobaric tagging reagent for quantitative proteomic analysis (iTRAQ), which are amine specific and yield labelled peptides identical in mass. Experimental Swiss-type cheeses were manufactured from microfiltered milk using Streptococcus thermophilus ITG ST20 and Lactobacillus helveticus ITG LH1 as lactic acid starters. At three ripening times (7, 20 and 69 days), cheese aqueous phases were extracted and enriched in bacterial proteins by fractionation. Each sample, standardised in protein amount prior to proteomic analyses, was: i) analysed by 2D-electrophoresis for qualitative analysis and ii) submitted to trypsinolysis, and labelled with specific iTRAQ tag, one per ripening time. The three labelled samples were mixed together and analysed by nano-LC coupled on-line with ESI-QTOF mass spectrometer. Thirty proteins, both from bacterial or bovine origin, were identified and efficiently quantified. The free bacterial proteins detected were enzymes from the central carbon metabolism as well as stress proteins. Depending on the protein considered, the quantity of these proteins in the cheese aqueous extract increased from 2.5 to 20 fold in concentration from day 7 to day 69 of ripening. Copyright © 2012 Elsevier B.V. All rights reserved.
Kundu, Subrata; Chakraborty, Dipjyoti; Kundu, Anirban; Pal, Amita
2013-01-01
Vigna mungo, a tropical leguminous plant, highly susceptible to yellow mosaic disease caused by Mungbean Yellow Mosaic India Virus (MYMIV) resulting in high yield penalty. The molecular events occurring during compatible and incompatible interactions between V. mungo and MYMIV pathosystem are yet to be explored. In this study biochemical analyses in conjunction with proteomics of MYMIV-susceptible and -resistant V. mungo genotypes were executed to get an insight in the molecular events during compatible and incompatible plant-virus interactions. Biochemical analysis revealed an increase in phenolics, hydrogen peroxide and carbohydrate contents in both compatible and incompatible interactions; but the magnitudes were higher during incompatible interaction. In the resistant genotype the activities of superoxide dismutase and ascorbate peroxidase increased significantly, while catalase activity decreased. Comparative proteome analyses using two-dimensional gel electrophoresis coupled with mass spectrometry identified 109 differentially abundant proteins at 3, 7 and 14 days post MYMIV-inoculation. Proteins of several functional categories were differentially changed in abundance during both compatible and incompatible interactions. Among these, photosynthesis related proteins were mostly affected in the susceptible genotype resulting in reduced photosynthesis rate under MYMIV-stress. Differential intensities of chlorophyll fluorescence and chlorophyll contents are in congruence with proteomics data. It was revealed that Photosystem II electron transports are the primary targets of MYMIV during pathogenesis. Quantitative real time PCR analyses of selected genes corroborates with respective protein abundance during incompatible interaction. The network of various cellular pathways that are involved in inducing defense response contains several conglomerated cores of nodal proteins, of which ascorbate peroxidase, rubisco activase and serine/glycine hydroxymethyl transferase are the three major hubs with high connectivity. These nodal proteins play the crucial role of key regulators in bringing about a coordinated defense response in highly orchestrated manner. Biochemical and proteomic analyses revealed early accumulation of the defense/stress related proteins involved in ROS metabolism during incompatible interaction. The robustness in induction of defense/stress and signal transduction related proteins is the key factor in inducing resistance. The mechanism of MYMIV-resistance in V. mungo involves redirection of carbohydrate flux towards pentose phosphate pathway. Some of these identified, differentially regulated proteins are also conferring abiotic stress responses illustrating harmony amongst different stress responses. To the best of our knowledge, this is the lone study deciphering differential regulations of V. mungo leaf proteome upon MYMIV infection elucidating the mode of resistance response at the biochemical level.
CMG-Biotools, a Free Workbench for Basic Comparative Microbial Genomics
Vesth, Tammi; Lagesen, Karin; Acar, Öncel; Ussery, David
2013-01-01
Background Today, there are more than a hundred times as many sequenced prokaryotic genomes than were present in the year 2000. The economical sequencing of genomic DNA has facilitated a whole new approach to microbial genomics. The real power of genomics is manifested through comparative genomics that can reveal strain specific characteristics, diversity within species and many other aspects. However, comparative genomics is a field not easily entered into by scientists with few computational skills. The CMG-biotools package is designed for microbiologists with limited knowledge of computational analysis and can be used to perform a number of analyses and comparisons of genomic data. Results The CMG-biotools system presents a stand-alone interface for comparative microbial genomics. The package is a customized operating system, based on Xubuntu 10.10, available through the open source Ubuntu project. The system can be installed on a virtual computer, allowing the user to run the system alongside any other operating system. Source codes for all programs are provided under GNU license, which makes it possible to transfer the programs to other systems if so desired. We here demonstrate the package by comparing and analyzing the diversity within the class Negativicutes, represented by 31 genomes including 10 genera. The analyses include 16S rRNA phylogeny, basic DNA and codon statistics, proteome comparisons using BLAST and graphical analyses of DNA structures. Conclusion This paper shows the strength and diverse use of the CMG-biotools system. The system can be installed on a vide range of host operating systems and utilizes as much of the host computer as desired. It allows the user to compare multiple genomes, from various sources using standardized data formats and intuitive visualizations of results. The examples presented here clearly shows that users with limited computational experience can perform complicated analysis without much training. PMID:23577086
Screening the Molecular Framework Underlying Local Dendritic mRNA Translation
Namjoshi, Sanjeev V.; Raab-Graham, Kimberly F.
2017-01-01
In the last decade, bioinformatic analyses of high-throughput proteomics and transcriptomics data have enabled researchers to gain insight into the molecular networks that may underlie lasting changes in synaptic efficacy. Development and utilization of these techniques have advanced the field of learning and memory significantly. It is now possible to move from the study of activity-dependent changes of a single protein to modeling entire network changes that require local protein synthesis. This data revolution has necessitated the development of alternative computational and statistical techniques to analyze and understand the patterns contained within. Thus, the focus of this review is to provide a synopsis of the journey and evolution toward big data techniques to address still unanswered questions regarding how synapses are modified to strengthen neuronal circuits. We first review the seminal studies that demonstrated the pivotal role played by local mRNA translation as the mechanism underlying the enhancement of enduring synaptic activity. In the interest of those who are new to the field, we provide a brief overview of molecular biology and biochemical techniques utilized for sample preparation to identify locally translated proteins using RNA sequencing and proteomics, as well as the computational approaches used to analyze these data. While many mRNAs have been identified, few have been shown to be locally synthesized. To this end, we review techniques currently being utilized to visualize new protein synthesis, a task that has proven to be the most difficult aspect of the field. Finally, we provide examples of future applications to test the physiological relevance of locally synthesized proteins identified by big data approaches. PMID:28286470
He, Yongqun
2011-01-01
Brucella is a Gram-negative, facultative intracellular bacterium that causes zoonotic brucellosis in humans and various animals. Out of 10 classified Brucella species, B. melitensis, B. abortus, B. suis, and B. canis are pathogenic to humans. In the past decade, the mechanisms of Brucella pathogenesis and host immunity have been extensively investigated using the cutting edge systems biology and bioinformatics approaches. This article provides a comprehensive review of the applications of Omics (including genomics, transcriptomics, and proteomics) and bioinformatics technologies for the analysis of Brucella pathogenesis, host immune responses, and vaccine targets. Based on more than 30 sequenced Brucella genomes, comparative genomics is able to identify gene variations among Brucella strains that help to explain host specificity and virulence differences among Brucella species. Diverse transcriptomics and proteomics gene expression studies have been conducted to analyze gene expression profiles of wild type Brucella strains and mutants under different laboratory conditions. High throughput Omics analyses of host responses to infections with virulent or attenuated Brucella strains have been focused on responses by mouse and cattle macrophages, bovine trophoblastic cells, mouse and boar splenocytes, and ram buffy coat. Differential serum responses in humans and rams to Brucella infections have been analyzed using high throughput serum antibody screening technology. The Vaxign reverse vaccinology has been used to predict many Brucella vaccine targets. More than 180 Brucella virulence factors and their gene interaction networks have been identified using advanced literature mining methods. The recent development of community-based Vaccine Ontology and Brucellosis Ontology provides an efficient way for Brucella data integration, exchange, and computer-assisted automated reasoning. PMID:22919594
Evolving Relevance of Neuroproteomics in Alzheimer's Disease.
Lista, Simone; Zetterberg, Henrik; O'Bryant, Sid E; Blennow, Kaj; Hampel, Harald
2017-01-01
Substantial progress in the understanding of the biology of Alzheimer's disease (AD) has been achieved over the past decades. The early detection and diagnosis of AD and other age-related neurodegenerative diseases, however, remain a challenging scientific frontier. Therefore, the comprehensive discovery (relating to all individual, converging or diverging biochemical disease mechanisms), development, validation, and qualification of standardized biological markers with diagnostic and prognostic functions with a precise performance profile regarding specificity, sensitivity, and positive and negative predictive value are warranted.Methodological innovations in the area of exploratory high-throughput technologies, such as sequencing, microarrays, and mass spectrometry-based analyses of proteins/peptides, have led to the generation of large global molecular datasets from a multiplicity of biological systems, such as biological fluids, cells, tissues, and organs. Such methodological progress has shifted the attention to the execution of hypothesis-independent comprehensive exploratory analyses (opposed to the classical hypothesis-driven candidate approach), with the aim of fully understanding the biological systems in physiology and disease as a whole. The systems biology paradigm integrates experimental biology with accurate and rigorous computational modelling to describe and foresee the dynamic features of biological systems. The use of dynamically evolving technological platforms, including mass spectrometry, in the area of proteomics has enabled to rush the process of biomarker discovery and validation for refining significantly the diagnosis of AD. Currently, proteomics-which is part of the systems biology paradigm-is designated as one of the dominant matured sciences needed for the effective exploratory discovery of prospective biomarker candidates expected to play an effective role in aiding the early detection, diagnosis, prognosis, and therapy development in AD.
He, Yongqun
2012-01-01
Brucella is a Gram-negative, facultative intracellular bacterium that causes zoonotic brucellosis in humans and various animals. Out of 10 classified Brucella species, B. melitensis, B. abortus, B. suis, and B. canis are pathogenic to humans. In the past decade, the mechanisms of Brucella pathogenesis and host immunity have been extensively investigated using the cutting edge systems biology and bioinformatics approaches. This article provides a comprehensive review of the applications of Omics (including genomics, transcriptomics, and proteomics) and bioinformatics technologies for the analysis of Brucella pathogenesis, host immune responses, and vaccine targets. Based on more than 30 sequenced Brucella genomes, comparative genomics is able to identify gene variations among Brucella strains that help to explain host specificity and virulence differences among Brucella species. Diverse transcriptomics and proteomics gene expression studies have been conducted to analyze gene expression profiles of wild type Brucella strains and mutants under different laboratory conditions. High throughput Omics analyses of host responses to infections with virulent or attenuated Brucella strains have been focused on responses by mouse and cattle macrophages, bovine trophoblastic cells, mouse and boar splenocytes, and ram buffy coat. Differential serum responses in humans and rams to Brucella infections have been analyzed using high throughput serum antibody screening technology. The Vaxign reverse vaccinology has been used to predict many Brucella vaccine targets. More than 180 Brucella virulence factors and their gene interaction networks have been identified using advanced literature mining methods. The recent development of community-based Vaccine Ontology and Brucellosis Ontology provides an efficient way for Brucella data integration, exchange, and computer-assisted automated reasoning.
Scrambled eggs: Proteomic portraits and novel biomarkers of egg quality in zebrafish (Danio rerio)
Yilmaz, Ozlem; Patinote, Amélie; Nguyen, Thao Vi; Com, Emmanuelle; Lavigne, Regis; Pineau, Charles; Sullivan, Craig V.; Bobe, Julien
2017-01-01
Egg quality is a complex biological trait and a major determinant of reproductive fitness in all animals. This study delivered the first proteomic portraits of egg quality in zebrafish, a leading biomedical model for early development. Egg batches of good and poor quality, evidenced by embryo survival for 24 h, were sampled immediately after spawning and used to create pooled or replicated sample sets whose protein extracts were subjected to different levels of fractionation before liquid chromatography and tandem mass spectrometry. Obtained spectra were searched against a zebrafish proteome database and detected proteins were annotated, categorized and quantified based on normalized spectral counts. Manually curated and automated enrichment analyses revealed poor quality eggs to be deficient of proteins involved in protein synthesis and energy and lipid metabolism, and of some vitellogenin products and lectins, and to have a surfeit of proteins involved in endo-lysosomal activities, autophagy, and apoptosis, and of some oncogene products, lectins and egg envelope proteins. Results of pathway and network analyses suggest that this aberrant proteomic profile results from failure of oocytes giving rise to poor quality eggs to properly transit through final maturation, and implicated Wnt signaling in the etiology of this defect. Quantitative comparisons of abundant proteins in good versus poor quality eggs revealed 17 candidate egg quality markers. Thus, the zebrafish egg proteome is clearly linked to embryo developmental potential, a phenomenon that begs further investigation to elucidate the root causes of poor egg quality, presently a serious and intractable problem in livestock and human reproductive medicine. PMID:29145436
Scrambled eggs: Proteomic portraits and novel biomarkers of egg quality in zebrafish (Danio rerio).
Yilmaz, Ozlem; Patinote, Amélie; Nguyen, Thao Vi; Com, Emmanuelle; Lavigne, Regis; Pineau, Charles; Sullivan, Craig V; Bobe, Julien
2017-01-01
Egg quality is a complex biological trait and a major determinant of reproductive fitness in all animals. This study delivered the first proteomic portraits of egg quality in zebrafish, a leading biomedical model for early development. Egg batches of good and poor quality, evidenced by embryo survival for 24 h, were sampled immediately after spawning and used to create pooled or replicated sample sets whose protein extracts were subjected to different levels of fractionation before liquid chromatography and tandem mass spectrometry. Obtained spectra were searched against a zebrafish proteome database and detected proteins were annotated, categorized and quantified based on normalized spectral counts. Manually curated and automated enrichment analyses revealed poor quality eggs to be deficient of proteins involved in protein synthesis and energy and lipid metabolism, and of some vitellogenin products and lectins, and to have a surfeit of proteins involved in endo-lysosomal activities, autophagy, and apoptosis, and of some oncogene products, lectins and egg envelope proteins. Results of pathway and network analyses suggest that this aberrant proteomic profile results from failure of oocytes giving rise to poor quality eggs to properly transit through final maturation, and implicated Wnt signaling in the etiology of this defect. Quantitative comparisons of abundant proteins in good versus poor quality eggs revealed 17 candidate egg quality markers. Thus, the zebrafish egg proteome is clearly linked to embryo developmental potential, a phenomenon that begs further investigation to elucidate the root causes of poor egg quality, presently a serious and intractable problem in livestock and human reproductive medicine.
Inserra, Ilaria; Martelli, Claudia; Cipollina, Mara; Cicione, Claudia; Iavarone, Federica; Taranto, Giuseppe Di; Barba, Marta; Castagnola, Massimo; Desiderio, Claudia; Lattanzi, Wanda
2016-04-01
The lipoaspirate fluid (LAF) is emerging as a potentially valuable source in regenerative medicine. In particular, our group recently demonstrated that it is able to exert osteoinductive properties in vitro. This original observation stimulated the investigation of the proteomic component of LAF, by means of LC-ESI-LTQ-Orbitrap-MS top-down/bottom-up integrated approach, which represents the object of the present study. Top-down analyses required the optimization of sample pretreatment procedures to enable the correct investigation of the intact proteome. Bottom-up analyses have been directly applied to untreated samples after monodimensional SDS-PAGE separation. The analysis of the acid-soluble fraction of LAF by top-down approach allowed demonstrating the presence of albumin and hemoglobin fragments (i.e. VV- and LVV-hemorphin-7), thymosins β4 and β10 peptides, ubiquitin and acyl-CoA binding protein; adipogenesis regulatory factor, perilipin-1 fragments, and S100A6, along with their PTMs. Part of the bottom-up proteomic profile was reproducibly found in both tested samples. The bottom-up approach allowed demonstrating the presence of proteins, listed among the components of adipose tissue and/or comprised within the ASCs intracellular content and secreted proteome. Our data provide a first glance on the LAF molecular profile, which is consistent with its tissue environment. LAF appeared to contain bioactive proteins, peptides and paracrine factors, suggesting its potential translational exploitation. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Assessment of Sample Preparation Bias in Mass Spectrometry-Based Proteomics.
Klont, Frank; Bras, Linda; Wolters, Justina C; Ongay, Sara; Bischoff, Rainer; Halmos, Gyorgy B; Horvatovich, Péter
2018-04-17
For mass spectrometry-based proteomics, the selected sample preparation strategy is a key determinant for information that will be obtained. However, the corresponding selection is often not based on a fit-for-purpose evaluation. Here we report a comparison of in-gel (IGD), in-solution (ISD), on-filter (OFD), and on-pellet digestion (OPD) workflows on the basis of targeted (QconCAT-multiple reaction monitoring (MRM) method for mitochondrial proteins) and discovery proteomics (data-dependent acquisition, DDA) analyses using three different human head and neck tissues (i.e., nasal polyps, parotid gland, and palatine tonsils). Our study reveals differences between the sample preparation methods, for example, with respect to protein and peptide losses, quantification variability, protocol-induced methionine oxidation, and asparagine/glutamine deamidation as well as identification of cysteine-containing peptides. However, none of the methods performed best for all types of tissues, which argues against the existence of a universal sample preparation method for proteome analysis.
Journet, Agnès; Klein, Gérard; Brugière, Sabine; Vandenbrouck, Yves; Chapel, Agnès; Kieffer, Sylvie; Bruley, Christophe; Masselon, Christophe; Aubry, Laurence
2012-01-01
The cellular slime mold Dictyostelium discoideum is a soil-living eukaryote, which feeds on microorganisms engulfed by phagocytosis. Axenic laboratory strains have been produced that are able to use liquid growth medium internalized by macropinocytosis as the source of food. To better define the macropinocytosis process, we established the inventory of proteins associated with this pathway using mass spectrometry-based proteomics. Using a magnetic purification procedure and high-performance LC-MS/MS proteome analysis, a list of 2108 non-redundant proteins was established, of which 24% featured membrane-spanning domains. Bioinformatics analyses indicated that the most abundant proteins were linked to signaling, vesicular trafficking and the cytoskeleton. The present repertoire validates our purification method and paves the way for a future proteomics approach to study the dynamics of macropinocytosis. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomics Analysis of Alfalfa Response to Heat Stress
Li, Weimin; Wei, Zhenwu; Qiao, Zhihong; Wu, Zinian; Cheng, Lixiang; Wang, Yuyang
2013-01-01
The proteome responses to heat stress have not been well understood. In this study, alfalfa (Medicago sativa L. cv. Huaiyin) seedlings were exposed to 25°C (control) and 40°C (heat stress) in growth chambers, and leaves were collected at 24, 48 and 72 h after treatment, respectively. The morphological, physiological and proteomic processes were negatively affected under heat stress. Proteins were extracted and separated by two-dimensional polyacrylamide gel electrophoresis (2-DE), and differentially expressed protein spots were identified by mass spectrometry (MS). Totally, 81 differentially expressed proteins were identified successfully by MALDI-TOF/TOF. These proteins were categorized into nine classes: including metabolism, energy, protein synthesis, protein destination/storage, transporters, intracellular traffic, cell structure, signal transduction and disease/defence. Five proteins were further analyzed for mRNA levels. The results of the proteomics analyses provide a better understanding of the molecular basis of heat-stress responses in alfalfa. PMID:24324825
Maize-Pathogen Interactions: An Ongoing Combat from a Proteomics Perspective.
Pechanova, Olga; Pechan, Tibor
2015-11-30
Maize (Zea mays L.) is a host to numerous pathogenic species that impose serious diseases to its ear and foliage, negatively affecting the yield and the quality of the maize crop. A considerable amount of research has been carried out to elucidate mechanisms of maize-pathogen interactions with a major goal to identify defense-associated proteins. In this review, we summarize interactions of maize with its agriculturally important pathogens that were assessed at the proteome level. Employing differential analyses, such as the comparison of pathogen-resistant and susceptible maize varieties, as well as changes in maize proteomes after pathogen challenge, numerous proteins were identified as possible candidates in maize resistance. We describe findings of various research groups that used mainly mass spectrometry-based, high through-put proteomic tools to investigate maize interactions with fungal pathogens Aspergillus flavus, Fusarium spp., and Curvularia lunata, and viral agents Rice Black-streaked Dwarf Virus and Sugarcane Mosaic Virus.
Maize-Pathogen Interactions: An Ongoing Combat from a Proteomics Perspective
Pechanova, Olga; Pechan, Tibor
2015-01-01
Maize (Zea mays L.) is a host to numerous pathogenic species that impose serious diseases to its ear and foliage, negatively affecting the yield and the quality of the maize crop. A considerable amount of research has been carried out to elucidate mechanisms of maize-pathogen interactions with a major goal to identify defense-associated proteins. In this review, we summarize interactions of maize with its agriculturally important pathogens that were assessed at the proteome level. Employing differential analyses, such as the comparison of pathogen-resistant and susceptible maize varieties, as well as changes in maize proteomes after pathogen challenge, numerous proteins were identified as possible candidates in maize resistance. We describe findings of various research groups that used mainly mass spectrometry-based, high through-put proteomic tools to investigate maize interactions with fungal pathogens Aspergillus flavus, Fusarium spp., and Curvularia lunata, and viral agents Rice Black-streaked Dwarf Virus and Sugarcane Mosaic Virus. PMID:26633370
A Computational Algorithm for Functional Clustering of Proteome Dynamics During Development
Wang, Yaqun; Wang, Ningtao; Hao, Han; Guo, Yunqian; Zhen, Yan; Shi, Jisen; Wu, Rongling
2014-01-01
Phenotypic traits, such as seed development, are a consequence of complex biochemical interactions among genes, proteins and metabolites, but the underlying mechanisms that operate in a coordinated and sequential manner remain elusive. Here, we address this issue by developing a computational algorithm to monitor proteome changes during the course of trait development. The algorithm is built within the mixture-model framework in which each mixture component is modeled by a specific group of proteins that display a similar temporal pattern of expression in trait development. A nonparametric approach based on Legendre orthogonal polynomials was used to fit dynamic changes of protein expression, increasing the power and flexibility of protein clustering. By analyzing a dataset of proteomic dynamics during early embryogenesis of the Chinese fir, the algorithm has successfully identified several distinct types of proteins that coordinate with each other to determine seed development in this forest tree commercially and environmentally important to China. The algorithm will find its immediate applications for the characterization of mechanistic underpinnings for any other biological processes in which protein abundance plays a key role. PMID:24955031
Andromeda: a peptide search engine integrated into the MaxQuant environment.
Cox, Jürgen; Neuhauser, Nadin; Michalski, Annette; Scheltema, Richard A; Olsen, Jesper V; Mann, Matthias
2011-04-01
A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
Micro computed tomography (CT) scanned anatomical gateway to insect pest bioinformatics
USDA-ARS?s Scientific Manuscript database
An international collaboration to establish an interactive Digital Video Library for a Systems Biology Approach to study the Asian citrus Psyllid and psyllid genomics/proteomics interactions is demonstrated. Advances in micro-CT, digital computed tomography (CT) scan uses X-rays to make detailed pic...
Integration of cardiac proteome biology and medicine by a specialized knowledgebase.
Zong, Nobel C; Li, Haomin; Li, Hua; Lam, Maggie P Y; Jimenez, Rafael C; Kim, Christina S; Deng, Ning; Kim, Allen K; Choi, Jeong Ho; Zelaya, Ivette; Liem, David; Meyer, David; Odeberg, Jacob; Fang, Caiyun; Lu, Hao-Jie; Xu, Tao; Weiss, James; Duan, Huilong; Uhlen, Mathias; Yates, John R; Apweiler, Rolf; Ge, Junbo; Hermjakob, Henning; Ping, Peipei
2013-10-12
Omics sciences enable a systems-level perspective in characterizing cardiovascular biology. Integration of diverse proteomics data via a computational strategy will catalyze the assembly of contextualized knowledge, foster discoveries through multidisciplinary investigations, and minimize unnecessary redundancy in research efforts. The goal of this project is to develop a consolidated cardiac proteome knowledgebase with novel bioinformatics pipeline and Web portals, thereby serving as a new resource to advance cardiovascular biology and medicine. We created Cardiac Organellar Protein Atlas Knowledgebase (COPaKB; www.HeartProteome.org), a centralized platform of high-quality cardiac proteomic data, bioinformatics tools, and relevant cardiovascular phenotypes. Currently, COPaKB features 8 organellar modules, comprising 4203 LC-MS/MS experiments from human, mouse, drosophila, and Caenorhabditis elegans, as well as expression images of 10,924 proteins in human myocardium. In addition, the Java-coded bioinformatics tools provided by COPaKB enable cardiovascular investigators in all disciplines to retrieve and analyze pertinent organellar protein properties of interest. COPaKB provides an innovative and interactive resource that connects research interests with the new biological discoveries in protein sciences. With an array of intuitive tools in this unified Web server, nonproteomics investigators can conveniently collaborate with proteomics specialists to dissect the molecular signatures of cardiovascular phenotypes.
Chimeric plastid proteome in the Florida "red tide" dinoflagellate Karenia brevis.
Nosenko, Tetyana; Lidie, Kristy L; Van Dolah, Frances M; Lindquist, Erika; Cheng, Jan-Fang; Bhattacharya, Debashish
2006-11-01
Current understanding of the plastid proteome comes almost exclusively from studies of plants and red algae. The proteome in these taxa has a relatively simple origin via integration of proteins from a single cyanobacterial primary endosymbiont and the host. However, the most successful algae in marine environments are the chlorophyll c-containing chromalveolates such as diatoms and dinoflagellates that contain a plastid of red algal origin derived via secondary or tertiary endosymbiosis. Virtually nothing is known about the plastid proteome in these taxa. We analyzed expressed sequence tag data from the toxic "Florida red tide" dinoflagellate Karenia brevis that has undergone a tertiary plastid endosymbiosis. Comparative analyses identified 30 nuclear-encoded plastid-targeted proteins in this chromalveolate that originated via endosymbiotic or horizontal gene transfer (HGT) from multiple different sources. We identify a fundamental divide between plant/red algal and chromalveolate plastid proteomes that reflects a history of mixotrophy in the latter group resulting in a highly chimeric proteome. Loss of phagocytosis in the "red" and "green" clades effectively froze their proteomes, whereas chromalveolate lineages retain the ability to engulf prey allowing them to continually recruit new, potentially adaptive genes through subsequent endosymbioses and HGT. One of these genes is an electron transfer protein (plastocyanin) of green algal origin in K. brevis that likely allows this species to thrive under conditions of iron depletion.
Allmer, Jens; Kuhlgert, Sebastian; Hippler, Michael
2008-07-07
The amount of information stemming from proteomics experiments involving (multi dimensional) separation techniques, mass spectrometric analysis, and computational analysis is ever-increasing. Data from such an experimental workflow needs to be captured, related and analyzed. Biological experiments within this scope produce heterogenic data ranging from pictures of one or two-dimensional protein maps and spectra recorded by tandem mass spectrometry to text-based identifications made by algorithms which analyze these spectra. Additionally, peptide and corresponding protein information needs to be displayed. In order to handle the large amount of data from computational processing of mass spectrometric experiments, automatic import scripts are available and the necessity for manual input to the database has been minimized. Information is in a generic format which abstracts from specific software tools typically used in such an experimental workflow. The software is therefore capable of storing and cross analysing results from many algorithms. A novel feature and a focus of this database is to facilitate protein identification by using peptides identified from mass spectrometry and link this information directly to respective protein maps. Additionally, our application employs spectral counting for quantitative presentation of the data. All information can be linked to hot spots on images to place the results into an experimental context. A summary of identified proteins, containing all relevant information per hot spot, is automatically generated, usually upon either a change in the underlying protein models or due to newly imported identifications. The supporting information for this report can be accessed in multiple ways using the user interface provided by the application. We present a proteomics database which aims to greatly reduce evaluation time of results from mass spectrometric experiments and enhance result quality by allowing consistent data handling. Import functionality, automatic protein detection, and summary creation act together to facilitate data analysis. In addition, supporting information for these findings is readily accessible via the graphical user interface provided. The database schema and the implementation, which can easily be installed on virtually any server, can be downloaded in the form of a compressed file from our project webpage.
Topic model-based mass spectrometric data analysis in cancer biomarker discovery studies.
Wang, Minkun; Tsai, Tsung-Heng; Di Poto, Cristina; Ferrarini, Alessia; Yu, Guoqiang; Ressom, Habtom W
2016-08-18
A fundamental challenge in quantitation of biomolecules for cancer biomarker discovery is owing to the heterogeneous nature of human biospecimens. Although this issue has been a subject of discussion in cancer genomic studies, it has not yet been rigorously investigated in mass spectrometry based proteomic and metabolomic studies. Purification of mass spectometric data is highly desired prior to subsequent analysis, e.g., quantitative comparison of the abundance of biomolecules in biological samples. We investigated topic models to computationally analyze mass spectrometric data considering both integrated peak intensities and scan-level features, i.e., extracted ion chromatograms (EICs). Probabilistic generative models enable flexible representation in data structure and infer sample-specific pure resources. Scan-level modeling helps alleviate information loss during data preprocessing. We evaluated the capability of the proposed models in capturing mixture proportions of contaminants and cancer profiles on LC-MS based serum proteomic and GC-MS based tissue metabolomic datasets acquired from patients with hepatocellular carcinoma (HCC) and liver cirrhosis as well as synthetic data we generated based on the serum proteomic data. The results we obtained by analysis of the synthetic data demonstrated that both intensity-level and scan-level purification models can accurately infer the mixture proportions and the underlying true cancerous sources with small average error ratios (<7 %) between estimation and ground truth. By applying the topic model-based purification to mass spectrometric data, we found more proteins and metabolites with significant changes between HCC cases and cirrhotic controls. Candidate biomarkers selected after purification yielded biologically meaningful pathway analysis results and improved disease discrimination power in terms of the area under ROC curve compared to the results found prior to purification. We investigated topic model-based inference methods to computationally address the heterogeneity issue in samples analyzed by LC/GC-MS. We observed that incorporation of scan-level features have the potential to lead to more accurate purification results by alleviating the loss in information as a result of integrating peaks. We believe cancer biomarker discovery studies that use mass spectrometric analysis of human biospecimens can greatly benefit from topic model-based purification of the data prior to statistical and pathway analyses.
Coughlan, Christina; Walker, Douglas I.; Lohr, Kelly M.; Richardson, Jason R.; Saba, Laura M.; Caudle, W. Michael; Fritz, Kristofer S.; Roede, James R.
2015-01-01
Epidemiological studies indicate exposures to the herbicide paraquat (PQ) and fungicide maneb (MB) are associated with increased risk of Parkinson's disease (PD). Oxidative stress appears to be a premier mechanism that underlies damage to the nigrostriatal dopamine system in PD and pesticide exposure. Enhanced oxidative stress leads to lipid peroxidation and production of reactive aldehydes; therefore, we conducted proteomic analyses to identify carbonylated proteins in the striatum and cortex of pesticide-treated mice in order to elucidate possible mechanisms of toxicity. Male C57BL/6J mice were treated biweekly for 6 weeks with saline, PQ (10 mg/kg), MB (30 mg/kg), or the combination of PQ and MB (PQMB). Treatments resulted in significant behavioral alterations in all treated mice and depleted striatal dopamine in PQMB mice. Distinct differences in 4-hydroxynonenal-modified proteins were observed in the striatum and cortex. Proteomic analyses identified carbonylated proteins and peptides from the cortex and striatum, and pathway analyses revealed significant enrichment in a variety of KEGG pathways. Further analysis showed enrichment in proteins of the actin cytoskeleton in treated samples, but not in saline controls. These data indicate that treatment-related effects on cytoskeletal proteins could alter proper synaptic function, thereby resulting in impaired neuronal function and even neurodegeneration. PMID:26345149
Substrate-Mediated Laser Ablation under Ambient Conditions for Spatially-Resolved Tissue Proteomics
Fatou, Benoit; Wisztorski, Maxence; Focsa, Cristian; Salzet, Michel; Ziskind, Michael; Fournier, Isabelle
2015-01-01
Numerous applications of ambient Mass Spectrometry (MS) have been demonstrated over the past decade. They promoted the emergence of various micro-sampling techniques such as Laser Ablation/Droplet Capture (LADC). LADC consists in the ablation of analytes from a surface and their subsequent capture in a solvent droplet which can then be analyzed by MS. LADC is thus generally performed in the UV or IR range, using a wavelength at which analytes or the matrix absorb. In this work, we explore the potential of visible range LADC (532 nm) as a micro-sampling technology for large-scale proteomics analyses. We demonstrate that biomolecule analyses using 532 nm LADC are possible, despite the low absorbance of biomolecules at this wavelength. This is due to the preponderance of an indirect substrate-mediated ablation mechanism at low laser energy which contrasts with the conventional direct ablation driven by sample absorption. Using our custom LADC system and taking advantage of this substrate-mediated ablation mechanism, we were able to perform large-scale proteomic analyses of micro-sampled tissue sections and demonstrated the possible identification of proteins with relevant biological functions. Consequently, the 532 nm LADC technique offers a new tool for biological and clinical applications. PMID:26674367
Proteomic and transcriptomic analyses to explain the pleiotropic effects of Ankaferd blood stopper
Simsek, Cem; Selek, Sebnem; Koca, Meltem; Haznedaroglu, Ibrahim Celal
2017-01-01
Ankaferd blood stopper is a standardized mixture of the plants Thymus vulgaris, Glycyrrhiza glabra, Vitis vinifera, Alpinia officinarum, and Urtica dioica and has been used as a topical hemostatic agent and with its clinical application established in randomized controlled trials and case reports. Ankaferd has been successfully used in gastrointestinal endobronchial mucosal and cutaneous bleedings and also in abdominal, thoracic, dental and oropharyngeal, and pelvic surgeries. Ankaferd’s hemostatic action is thought to form a protein complex with coagulation factors that facilitate adhesion of blood components. Besides its hemostatic action, Ankaferd has demonstrated pleiotropic effects, including anti-neoplastic and anti-microbial activities and tissue-healing properties; the underlying mechanisms for these have not been well studied. Ankaferd’s individual components were determined by proteomic and chemical analyses. Ankaferd also augments transcription of some transcription factors which is shown with transcriptomic analysis. The independent effects of these ingredients and augmented transcription factors are not known precisely. Here, we review what is known of Ankaferd blood stopper components from chemical, proteomic, and transcriptomic analyses and propose that individual components can explain some pleiotropic effects of Ankaferd. Certainly more research is needed focusing on individual ingredients of Ankaferd to elucidate their precise and effects. PMID:28839937
Santibáñez-López, Carlos E; Cid-Uribe, Jimena I; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D
2016-12-09
Venom gland transcriptomic and proteomic analyses have improved our knowledge on the diversity of the heterogeneous components present in scorpion venoms. However, most of these studies have focused on species from the family Buthidae. To gain insights into the molecular diversity of the venom components of scorpions belonging to the family Superstitioniidae, one of the neglected scorpion families, we performed a transcriptomic and proteomic analyses for the species Superstitionia donensis . The total mRNA extracted from the venom glands of two specimens was subjected to massive sequencing by the Illumina protocol, and a total of 219,073 transcripts were generated. We annotated 135 transcripts putatively coding for peptides with identity to known venom components available from different protein databases. Fresh venom collected by electrostimulation was analyzed by LC-MS/MS allowing the identification of 26 distinct components with sequences matching counterparts from the transcriptomic analysis. In addition, the phylogenetic affinities of the found putative calcins, scorpines, La1-like peptides and potassium channel κ toxins were analyzed. The first three components are often reported as ubiquitous in the venom of different families of scorpions. Our results suggest that, at least calcins and scorpines, could be used as molecular markers in phylogenetic studies of scorpion venoms.
Santibáñez-López, Carlos E.; Cid-Uribe, Jimena I.; Batista, Cesar V. F.; Ortiz, Ernesto; Possani, Lourival D.
2016-01-01
Venom gland transcriptomic and proteomic analyses have improved our knowledge on the diversity of the heterogeneous components present in scorpion venoms. However, most of these studies have focused on species from the family Buthidae. To gain insights into the molecular diversity of the venom components of scorpions belonging to the family Superstitioniidae, one of the neglected scorpion families, we performed a transcriptomic and proteomic analyses for the species Superstitionia donensis. The total mRNA extracted from the venom glands of two specimens was subjected to massive sequencing by the Illumina protocol, and a total of 219,073 transcripts were generated. We annotated 135 transcripts putatively coding for peptides with identity to known venom components available from different protein databases. Fresh venom collected by electrostimulation was analyzed by LC-MS/MS allowing the identification of 26 distinct components with sequences matching counterparts from the transcriptomic analysis. In addition, the phylogenetic affinities of the found putative calcins, scorpines, La1-like peptides and potassium channel κ toxins were analyzed. The first three components are often reported as ubiquitous in the venom of different families of scorpions. Our results suggest that, at least calcins and scorpines, could be used as molecular markers in phylogenetic studies of scorpion venoms. PMID:27941686
Ma, Chengying; Cao, Junxi; Li, Jianke; Zhou, Bo; Tang, Jinchi; Miao, Aiqing
2016-01-01
Leaf colour variation is observed in several plants. We obtained two types of branches with yellow and variegated leaves from Camellia sinensis. To reveal the mechanisms that underlie the leaf colour variations, combined morphological, histological, ionomic and proteomic analyses were performed using leaves from abnormal branches (variants) and normal branches (CKs). The measurement of the CIE-Lab coordinates showed that the brightness and yellowness of the variants were more intense than the CKs. When chloroplast profiles were analysed, HY1 (branch with yellow leaves) and HY2 (branch with variegated leaves) displayed abnormal chloroplast structures and a reduced number and size compared with the CKs, indicating that the abnormal chloroplast development might be tightly linked to the leaf colour variations. Moreover, the concentration of elemental minerals was different between the variants and the CKs. Furthermore, DEPs (differentially expressed proteins) were identified in the variants and the CKs by a quantitative proteomics analysis using the label-free approach. The DEPs were significantly involved in photosynthesis and included PSI, PSII, cytochrome b6/f complex, photosynthetic electron transport, LHC and F-type ATPase. Our results suggested that a decrease in the abundance of photosynthetic proteins might be associated with the changes of leaf colours in tea plants. PMID:27633059
Goeminne, Ludger J E; Gevaert, Kris; Clement, Lieven
2018-01-16
Label-free shotgun proteomics is routinely used to assess proteomes. However, extracting relevant information from the massive amounts of generated data remains difficult. This tutorial provides a strong foundation on analysis of quantitative proteomics data. We provide key statistical concepts that help researchers to design proteomics experiments and we showcase how to analyze quantitative proteomics data using our recent free and open-source R package MSqRob, which was developed to implement the peptide-level robust ridge regression method for relative protein quantification described by Goeminne et al. MSqRob can handle virtually any experimental proteomics design and outputs proteins ordered by statistical significance. Moreover, its graphical user interface and interactive diagnostic plots provide easy inspection and also detection of anomalies in the data and flaws in the data analysis, allowing deeper assessment of the validity of results and a critical review of the experimental design. Our tutorial discusses interactive preprocessing, data analysis and visualization of label-free MS-based quantitative proteomics experiments with simple and more complex designs. We provide well-documented scripts to run analyses in bash mode on GitHub, enabling the integration of MSqRob in automated pipelines on cluster environments (https://github.com/statOmics/MSqRob). The concepts outlined in this tutorial aid in designing better experiments and analyzing the resulting data more appropriately. The two case studies using the MSqRob graphical user interface will contribute to a wider adaptation of advanced peptide-based models, resulting in higher quality data analysis workflows and more reproducible results in the proteomics community. We also provide well-documented scripts for experienced users that aim at automating MSqRob on cluster environments. Copyright © 2017 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clair, Geremy; Piehowski, Paul D.; Nicola, Teodora
Global proteomics approaches allow characterization of whole tissue lysates to an impressive depth. However, it is now increasingly recognized that to better understand the complexity of multicellular organisms, global protein profiling of specific spatially defined regions/substructures of tissues (i.e. spatially-resolved proteomics) is essential. Laser capture microdissection (LCM) enables microscopic isolation of defined regions of tissues preserving crucial spatial information. However, current proteomics workflows entail several manual sample preparation steps and are challenged by the microscopic mass-limited samples generated by LCM, and that impact measurement robustness, quantification, and throughput. Here, we coupled LCM with a fully automated sample preparation workflow thatmore » with a single manual step allows: protein extraction, tryptic digestion, peptide cleanup and LC-MS/MS analysis of proteomes from microdissected tissues. Benchmarking against the current state of the art in ultrasensitive global proteomic analysis, our approach demonstrated significant improvements in quantification and throughput. Using our LCM-SNaPP proteomics approach, we characterized to a depth of more than 3,400 proteins, the ontogeny of protein changes during normal lung development in laser capture microdissected alveolar tissue containing ~4,000 cells per sample. Importantly, the data revealed quantitative changes for 350 low abundance transcription factors and signaling molecules, confirming earlier transcript-level observations and defining seven modules of coordinated transcription factor/signaling molecule expression patterns, suggesting that a complex network of temporal regulatory control directs normal lung development with epigenetic regulation fine-tuning pre-natal developmental processes. Our LCM-proteomics approach facilitates efficient, spatially-resolved, ultrasensitive global proteomics analyses in high-throughput that will be enabling for several clinical and biological applications.« less
Nakayasu, Ernesto S.; Nicora, Carrie D.; Sims, Amy C.; Burnum-Johnson, Kristin E.; Kim, Young-Mo; Kyle, Jennifer E.; Matzke, Melissa M.; Shukla, Anil K.; Chu, Rosalie K.; Schepmoes, Athena A.; Jacobs, Jon M.; Baric, Ralph S.; Webb-Robertson, Bobbie-Jo; Smith, Richard D.
2016-01-01
ABSTRACT Integrative multi-omics analyses can empower more effective investigation and complete understanding of complex biological systems. Despite recent advances in a range of omics analyses, multi-omic measurements of the same sample are still challenging and current methods have not been well evaluated in terms of reproducibility and broad applicability. Here we adapted a solvent-based method, widely applied for extracting lipids and metabolites, to add proteomics to mass spectrometry-based multi-omics measurements. The metabolite, protein, and lipid extraction (MPLEx) protocol proved to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues. To illustrate the utility of this protocol, an integrative multi-omics analysis was performed using a lung epithelial cell line infected with Middle East respiratory syndrome coronavirus, which showed the impact of this virus on the host glycolytic pathway and also suggested a role for lipids during infection. The MPLEx method is a simple, fast, and robust protocol that can be applied for integrative multi-omic measurements from diverse sample types (e.g., environmental, in vitro, and clinical). IMPORTANCE In systems biology studies, the integration of multiple omics measurements (i.e., genomics, transcriptomics, proteomics, metabolomics, and lipidomics) has been shown to provide a more complete and informative view of biological pathways. Thus, the prospect of extracting different types of molecules (e.g., DNAs, RNAs, proteins, and metabolites) and performing multiple omics measurements on single samples is very attractive, but such studies are challenging due to the fact that the extraction conditions differ according to the molecule type. Here, we adapted an organic solvent-based extraction method that demonstrated broad applicability and robustness, which enabled comprehensive proteomics, metabolomics, and lipidomics analyses from the same sample. Author Video: An author video summary of this article is available. PMID:27822525
Morris, Jeffrey S
2012-01-01
In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational aspects of comparative proteomic studies, and summarizes contributions I along with numerous collaborators have made. First, there is an overview of comparative proteomics technologies, followed by a discussion of important experimental design and preprocessing issues that must be considered before statistical analysis can be done. Next, the two key approaches to analyzing proteomics data, feature extraction and functional modeling, are described. Feature extraction involves detection and quantification of discrete features like peaks or spots that theoretically correspond to different proteins in the sample. After an overview of the feature extraction approach, specific methods for mass spectrometry ( Cromwell ) and 2D gel electrophoresis ( Pinnacle ) are described. The functional modeling approach involves modeling the proteomic data in their entirety as functions or images. A general discussion of the approach is followed by the presentation of a specific method that can be applied, wavelet-based functional mixed models, and its extensions. All methods are illustrated by application to two example proteomic data sets, one from mass spectrometry and one from 2D gel electrophoresis. While the specific methods presented are applied to two specific proteomic technologies, MALDI-TOF and 2D gel electrophoresis, these methods and the other principles discussed in the paper apply much more broadly to other expression proteomics technologies.
The Proteome of Native Adult Müller Glial Cells From Murine Retina*
Hauser, Alexandra; Lepper, Marlen Franziska; Mayo, Rebecca
2016-01-01
To date, the proteomic profiling of Müller cells, the dominant macroglia of the retina, has been hampered because of the absence of suitable enrichment methods. We established a novel protocol to isolate native, intact Müller cells from adult murine retinae at excellent purity which retain in situ morphology and are well suited for proteomic analyses. Two different strategies of sample preparation - an in StageTips (iST) and a subcellular fractionation approach including cell surface protein profiling were used for quantitative liquid chromatography-mass spectrometry (LC-MSMS) comparing Müller cell-enriched to depleted neuronal fractions. Pathway enrichment analyses on both data sets enabled us to identify Müller cell-specific functions which included focal adhesion kinase signaling, signal transduction mediated by calcium as second messenger, transmembrane neurotransmitter transport and antioxidant activity. Pathways associated with RNA processing, cellular respiration and phototransduction were enriched in the neuronal subpopulation. Proteomic results were validated for selected Müller cell genes by quantitative real time PCR, confirming the high expression levels of numerous members of the angiogenic and anti-inflammatory annexins and antioxidant enzymes (e.g. paraoxonase 2, peroxiredoxin 1, 4 and 6). Finally, the significant enrichment of antioxidant proteins in Müller cells was confirmed by measurements on vital retinal cells using the oxidative stress indicator CM-H2DCFDA. In contrast to photoreceptors or bipolar cells, Müller cells were most efficiently protected against H2O2-induced reactive oxygen species formation, which is in line with the protein repertoire identified in the proteomic profiling. Our novel approach to isolate intact glial cells from adult retina in combination with proteomic profiling enabled the identification of novel Müller glia specific proteins, which were validated as markers and for their functional impact in glial physiology. This provides the basis to allow the discovery of novel glial specializations and will enable us to elucidate the role of Müller cells in retinal pathologies — a topic still controversially discussed. PMID:26324419
QC-ART: A tool for real-time quality control assessment of mass spectrometry-based proteomics data.
Stanfill, Bryan A; Nakayasu, Ernesto S; Bramer, Lisa M; Thompson, Allison M; Ansong, Charles K; Clauss, Therese; Gritsenko, Marina A; Monroe, Matthew E; Moore, Ronald J; Orton, Daniel J; Piehowski, Paul D; Schepmoes, Athena A; Smith, Richard D; Webb-Robertson, Bobbie-Jo; Metz, Thomas O
2018-04-17
Liquid chromatography-mass spectrometry (LC-MS)-based proteomics studies of large sample cohorts can easily require from months to years to complete. Acquiring consistent, high-quality data in such large-scale studies is challenging because of normal variations in instrumentation performance over time, as well as artifacts introduced by the samples themselves, such as those due to collection, storage and processing. Existing quality control methods for proteomics data primarily focus on post-hoc analysis to remove low-quality data that would degrade downstream statistics; they are not designed to evaluate the data in near real-time, which would allow for interventions as soon as deviations in data quality are detected. In addition to flagging analyses that demonstrate outlier behavior, evaluating how the data structure changes over time can aide in understanding typical instrument performance or identify issues such as a degradation in data quality due to the need for instrument cleaning and/or re-calibration. To address this gap for proteomics, we developed Quality Control Analysis in Real-Time (QC-ART), a tool for evaluating data as they are acquired in order to dynamically flag potential issues with instrument performance or sample quality. QC-ART has similar accuracy as standard post-hoc analysis methods with the additional benefit of real-time analysis. We demonstrate the utility and performance of QC-ART in identifying deviations in data quality due to both instrument and sample issues in near real-time for LC-MS-based plasma proteomics analyses of a sample subset of The Environmental Determinants of Diabetes in the Young cohort. We also present a case where QC-ART facilitated the identification of oxidative modifications, which are often underappreciated in proteomic experiments. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.
Proteome and phosphoproteome analysis of commensally induced dendritic cell maturation states.
Korkmaz, Ali Giray; Popov, Todor; Peisl, Loulou; Codrea, Marius Cosmin; Nahnsen, Sven; Steimle, Alexander; Velic, Ana; Macek, Boris; von Bergen, Martin; Bernhardt, Joerg; Frick, Julia-Stefanie
2018-05-30
Dendritic cells (DCs) can shape the immune system towards an inflammatory or tolerant state depending on the bacterial antigens and the environment they encounter. In this study we provide a proteomic catalogue of differentially expressed proteins between distinct DC maturation states, brought about by bacteria that differ in their endotoxicity. To achieve this, we have performed proteomics and phosphoproteomics on murine DC cultures. Symbiont and pathobiont bacteria were used to direct dendritic cells into a semi-mature and fully-mature state, respectively. The comparison of semi-mature and fully-mature DCs revealed differential expression in 103 proteins and differential phosphorylation in 118 phosphosites, including major regulatory factors of central immune processes. Our analyses predict that these differences are mediated by upstream elements such as SOCS1, IRF3, ABCA1, TLR4, and PTGER4. Our analyses indicate that the symbiont bacterial strain affects DC proteome in a distinct way, by downregulating inflammatory proteins and activating anti-inflammatory upstream regulators. Biological significance In this study we have investigated the responses of immune cells to distinct bacterial stimuli. We have used the symbiont bacterial strain B. vulgatus and the pathobiont E. coli strain to stimulate cultured primary dendritic cells and performed a shotgun proteome analysis to investigate the protein expression and phosphorylation level differences on a genome level. We have observed expression and phosphorylation level differences in key immune regulators, transcription factors and signal transducers. Moreover, our subsequent bioinformatics analysis indicated regulation at several signaling pathways such as PPAR signaling, LXR/RXR activation and glucocorticoid signaling pathways, which are not studied in detail in an inflammation and DC maturation context. Our phosphoproteome analysis showed differential phosphorylation in 118 phosphosites including those belonging to epigenetic regulators, transcription factors and major cell cycle regulators. We anticipate that our study will facilitate further investigation of immune cell proteomes under different inflammatory and non-inflammatory conditions. Copyright © 2017. Published by Elsevier B.V.
Thermosensitivity of growth is determined by chaperone-mediated proteome reallocation
Chen, Ke; Gao, Ye; Mih, Nathan; O’Brien, Edward J.; Yang, Laurence; Palsson, Bernhard O.
2017-01-01
Maintenance of a properly folded proteome is critical for bacterial survival at notably different growth temperatures. Understanding the molecular basis of thermoadaptation has progressed in two main directions, the sequence and structural basis of protein thermostability and the mechanistic principles of protein quality control assisted by chaperones. Yet we do not fully understand how structural integrity of the entire proteome is maintained under stress and how it affects cellular fitness. To address this challenge, we reconstruct a genome-scale protein-folding network for Escherichia coli and formulate a computational model, FoldME, that provides statistical descriptions of multiscale cellular response consistent with many datasets. FoldME simulations show (i) that the chaperones act as a system when they respond to unfolding stress rather than achieving efficient folding of any single component of the proteome, (ii) how the proteome is globally balanced between chaperones for folding and the complex machinery synthesizing the proteins in response to perturbation, (iii) how this balancing determines growth rate dependence on temperature and is achieved through nonspecific regulation, and (iv) how thermal instability of the individual protein affects the overall functional state of the proteome. Overall, these results expand our view of cellular regulation, from targeted specific control mechanisms to global regulation through a web of nonspecific competing interactions that modulate the optimal reallocation of cellular resources. The methodology developed in this study enables genome-scale integration of environment-dependent protein properties and a proteome-wide study of cellular stress responses. PMID:29073085
2012-01-01
Background Biomarker panels derived separately from genomic and proteomic data and with a variety of computational methods have demonstrated promising classification performance in various diseases. An open question is how to create effective proteo-genomic panels. The framework of ensemble classifiers has been applied successfully in various analytical domains to combine classifiers so that the performance of the ensemble exceeds the performance of individual classifiers. Using blood-based diagnosis of acute renal allograft rejection as a case study, we address the following question in this paper: Can acute rejection classification performance be improved by combining individual genomic and proteomic classifiers in an ensemble? Results The first part of the paper presents a computational biomarker development pipeline for genomic and proteomic data. The pipeline begins with data acquisition (e.g., from bio-samples to microarray data), quality control, statistical analysis and mining of the data, and finally various forms of validation. The pipeline ensures that the various classifiers to be combined later in an ensemble are diverse and adequate for clinical use. Five mRNA genomic and five proteomic classifiers were developed independently using single time-point blood samples from 11 acute-rejection and 22 non-rejection renal transplant patients. The second part of the paper examines five ensembles ranging in size from two to 10 individual classifiers. Performance of ensembles is characterized by area under the curve (AUC), sensitivity, and specificity, as derived from the probability of acute rejection for individual classifiers in the ensemble in combination with one of two aggregation methods: (1) Average Probability or (2) Vote Threshold. One ensemble demonstrated superior performance and was able to improve sensitivity and AUC beyond the best values observed for any of the individual classifiers in the ensemble, while staying within the range of observed specificity. The Vote Threshold aggregation method achieved improved sensitivity for all 5 ensembles, but typically at the cost of decreased specificity. Conclusion Proteo-genomic biomarker ensemble classifiers show promise in the diagnosis of acute renal allograft rejection and can improve classification performance beyond that of individual genomic or proteomic classifiers alone. Validation of our results in an international multicenter study is currently underway. PMID:23216969
Günther, Oliver P; Chen, Virginia; Freue, Gabriela Cohen; Balshaw, Robert F; Tebbutt, Scott J; Hollander, Zsuzsanna; Takhar, Mandeep; McMaster, W Robert; McManus, Bruce M; Keown, Paul A; Ng, Raymond T
2012-12-08
Biomarker panels derived separately from genomic and proteomic data and with a variety of computational methods have demonstrated promising classification performance in various diseases. An open question is how to create effective proteo-genomic panels. The framework of ensemble classifiers has been applied successfully in various analytical domains to combine classifiers so that the performance of the ensemble exceeds the performance of individual classifiers. Using blood-based diagnosis of acute renal allograft rejection as a case study, we address the following question in this paper: Can acute rejection classification performance be improved by combining individual genomic and proteomic classifiers in an ensemble? The first part of the paper presents a computational biomarker development pipeline for genomic and proteomic data. The pipeline begins with data acquisition (e.g., from bio-samples to microarray data), quality control, statistical analysis and mining of the data, and finally various forms of validation. The pipeline ensures that the various classifiers to be combined later in an ensemble are diverse and adequate for clinical use. Five mRNA genomic and five proteomic classifiers were developed independently using single time-point blood samples from 11 acute-rejection and 22 non-rejection renal transplant patients. The second part of the paper examines five ensembles ranging in size from two to 10 individual classifiers. Performance of ensembles is characterized by area under the curve (AUC), sensitivity, and specificity, as derived from the probability of acute rejection for individual classifiers in the ensemble in combination with one of two aggregation methods: (1) Average Probability or (2) Vote Threshold. One ensemble demonstrated superior performance and was able to improve sensitivity and AUC beyond the best values observed for any of the individual classifiers in the ensemble, while staying within the range of observed specificity. The Vote Threshold aggregation method achieved improved sensitivity for all 5 ensembles, but typically at the cost of decreased specificity. Proteo-genomic biomarker ensemble classifiers show promise in the diagnosis of acute renal allograft rejection and can improve classification performance beyond that of individual genomic or proteomic classifiers alone. Validation of our results in an international multicenter study is currently underway.
Aasebø, Elise; Forthun, Rakel B.; Berven, Frode; Selheim, Frode; Hernandez-Valladares, Maria
2016-01-01
The identification of protein biomarkers for acute myeloid leukemia (AML) that could find applications in AML diagnosis and prognosis, treatment and the selection for bone marrow transplant requires substantial comparative analyses of the proteomes from AML patients. In the past years, several studies have suggested some biomarkers for AML diagnosis or AML classification using methods for sample preparation with low proteome coverage and low resolution mass spectrometers. However, most of the studies did not follow up, confirm or validate their candidates with more patient samples. Current proteomics methods, new high resolution and fast mass spectrometers allow the identification and quantification of several thousands of proteins obtained from few tens of μg of AML cell lysate. Enrichment methods for posttranslational modifications (PTM), such as phosphorylation, can isolate several thousands of site-specific phosphorylated peptides from AML patient samples, which subsequently can be quantified with high confidence in new mass spectrometers. While recent reports aiming to propose proteomic or phosphoproteomic biomarkers on the studied AML patient samples have taken advantage of the technological progress, the access to large cohorts of AML patients to sample from and the availability of appropriate control samples still remain challenging. PMID:26306748
Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L
2010-07-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.
Ma, Danjun; Wang, Jiarui; Zhao, Yingchun; Lee, Wai-Nang Paul; Xiao, Jing; Go, Vay Liang W.; Wang, Qi; Recker, Robert; Xiao, Gary Guishan
2011-01-01
Objectives Novel quantitative proteomic approaches were used to study the effects of inhibition of glycogen phosphorylase on proteome and signaling pathways in MIA PaCa-2 pancreatic cancer cells. Methods We performed quantitative proteomic analysis in MIA PaCa-2 cancer cells treated with a stratified dose of CP-320626 (25 μM, 50 μM and 100 μM). The effect of metabolic inhibition on cellular protein turnover dynamics was also studied using the modified SILAC method (mSILAC). Results A total of twenty-two protein spots and four phosphoprotein spots were quantitatively analyzed. We found that dynamic expression of total proteins and phosphoproteins was significantly changed in MIA PaCa-2 cells treated with an incremental dose of CP-320626. Functional analyses suggested that most of the proteins differentially expressed were in the pathways of MAPK/ERK and TNF-α/NF-κB. Conclusions Signaling pathways and metabolic pathways share many common cofactors and substrates forming an extended metabolic network. The restriction of substrate through one pathway such as inhibition of glycogen phosphorylation induces pervasive metabolomic and proteomic changes manifested in protein synthesis, breakdown and post-translational modification of signaling molecules. Our results suggest that quantitative proteomic is an important approach to understand the interaction between metabolism and signaling pathways. PMID:22158071
Clinical proteomic analysis of scrub typhus infection.
Park, Edmond Changkyun; Lee, Sang-Yeop; Yun, Sung Ho; Choi, Chi-Won; Lee, Hayoung; Song, Hyun Seok; Jun, Sangmi; Kim, Gun-Hwa; Lee, Chang-Seop; Kim, Seung Il
2018-01-01
Scrub typhus is an acute and febrile infectious disease caused by the Gram-negative α-proteobacterium Orientia tsutsugamushi from the family Rickettsiaceae that is widely distributed in Northern, Southern and Eastern Asia. In the present study, we analysed the serum proteome of scrub typhus patients to investigate specific clinical protein patterns in an attempt to explain pathophysiology and discover potential biomarkers of infection. Serum samples were collected from three patients (before and after treatment with antibiotics) and three healthy subjects. One-dimensional sodium dodecyl sulphate-polyacrylamide gel electrophoresis followed by liquid chromatography-tandem mass spectrometry was performed to identify differentially abundant proteins using quantitative proteomic approaches. Bioinformatic analysis was then performed using Ingenuity Pathway Analysis. Proteomic analysis identified 236 serum proteins, of which 32 were differentially expressed in normal subjects, naive scrub typhus patients and patients treated with antibiotics. Comparative bioinformatic analysis of the identified proteins revealed up-regulation of proteins involved in immune responses, especially complement system, following infection with O. tsutsugamushi , and normal expression was largely rescued by antibiotic treatment. This is the first proteomic study of clinical serum samples from scrub typhus patients. Proteomic analysis identified changes in protein expression upon infection with O. tsutsugamushi and following antibiotic treatment. Our results provide valuable information for further investigation of scrub typhus therapy and diagnosis.
Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.
2010-01-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087
Quantitative body fluid proteomics in medicine - A focus on minimal invasiveness.
Csősz, Éva; Kalló, Gergő; Márkus, Bernadett; Deák, Eszter; Csutak, Adrienne; Tőzsér, József
2017-02-05
Identification of new biomarkers specific for various pathological conditions is an important field in medical sciences. Body fluids have emerging potential in biomarker studies especially those which are continuously available and can be collected by non-invasive means. Changes in the protein composition of body fluids such as tears, saliva, sweat, etc. may provide information on both local and systemic conditions of medical relevance. In this review, our aim is to discuss the quantitative proteomics techniques used in biomarker studies, and to present advances in quantitative body fluid proteomics of non-invasively collectable body fluids with relevance to biomarker identification. The advantages and limitations of the widely used quantitative proteomics techniques are also presented. Based on the reviewed literature, we suggest an ideal pipeline for body fluid analyses aiming at biomarkers discoveries: starting from identification of biomarker candidates by shotgun quantitative proteomics or protein arrays, through verification of potential biomarkers by targeted mass spectrometry, to the antibody-based validation of biomarkers. The importance of body fluids as a rich source of biomarkers is discussed. Quantitative proteomics is a challenging part of proteomics applications. The body fluids collected by non-invasive means have high relevance in medicine; they are good sources for biomarkers used in establishing the diagnosis, follow up of disease progression and predicting high risk groups. The review presents the most widely used quantitative proteomics techniques in body fluid analysis and lists the potential biomarkers identified in tears, saliva, sweat, nasal mucus and urine for local and systemic diseases. Copyright © 2016 Elsevier B.V. All rights reserved.
Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I
2018-05-08
Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value<10-10) and -0.46 (p-value<10-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. shakhnovich@chemistry.harvard.edu.
Highly Reproducible Label Free Quantitative Proteomic Analysis of RNA Polymerase Complexes*
Mosley, Amber L.; Sardiu, Mihaela E.; Pattenden, Samantha G.; Workman, Jerry L.; Florens, Laurence; Washburn, Michael P.
2011-01-01
The use of quantitative proteomics methods to study protein complexes has the potential to provide in-depth information on the abundance of different protein components as well as their modification state in various cellular conditions. To interrogate protein complex quantitation using shotgun proteomic methods, we have focused on the analysis of protein complexes using label-free multidimensional protein identification technology and studied the reproducibility of biological replicates. For these studies, we focused on three highly related and essential multi-protein enzymes, RNA polymerase I, II, and III from Saccharomyces cerevisiae. We found that label-free quantitation using spectral counting is highly reproducible at the protein and peptide level when analyzing RNA polymerase I, II, and III. In addition, we show that peptide sampling does not follow a random sampling model, and we show the need for advanced computational models to predict peptide detection probabilities. In order to address these issues, we used the APEX protocol to model the expected peptide detectability based on whole cell lysate acquired using the same multidimensional protein identification technology analysis used for the protein complexes. Neither method was able to predict the peptide sampling levels that we observed using replicate multidimensional protein identification technology analyses. In addition to the analysis of the RNA polymerase complexes, our analysis provides quantitative information about several RNAP associated proteins including the RNAPII elongation factor complexes DSIF and TFIIF. Our data shows that DSIF and TFIIF are the most highly enriched RNAP accessory factors in Rpb3-TAP purifications and demonstrate our ability to measure low level associated protein abundance across biological replicates. In addition, our quantitative data supports a model in which DSIF and TFIIF interact with RNAPII in a dynamic fashion in agreement with previously published reports. PMID:21048197
Krystkowiak, Izabella; Manguy, Jean; Davey, Norman E
2018-06-05
There is a pressing need for in silico tools that can aid in the identification of the complete repertoire of protein binding (SLiMs, MoRFs, miniMotifs) and modification (moiety attachment/removal, isomerization, cleavage) motifs. We have created PSSMSearch, an interactive web-based tool for rapid statistical modeling, visualization, discovery and annotation of protein motif specificity determinants to discover novel motifs in a proteome-wide manner. PSSMSearch analyses proteomes for regions with significant similarity to a motif specificity determinant model built from a set of aligned motif-containing peptides. Multiple scoring methods are available to build a position-specific scoring matrix (PSSM) describing the motif specificity determinant model. This model can then be modified by a user to add prior knowledge of specificity determinants through an interactive PSSM heatmap. PSSMSearch includes a statistical framework to calculate the significance of specificity determinant model matches against a proteome of interest. PSSMSearch also includes the SLiMSearch framework's annotation, motif functional analysis and filtering tools to highlight relevant discriminatory information. Additional tools to annotate statistically significant shared keywords and GO terms, or experimental evidence of interaction with a motif-recognizing protein have been added. Finally, PSSM-based conservation metrics have been created for taxonomic range analyses. The PSSMSearch web server is available at http://slim.ucd.ie/pssmsearch/.
Raiszadeh, Michelle M.; Ross, Mark M.; Russo, Paul S.; Schaepper, Mary Ann H.; Zhou, Weidong; Deng, Jianghong; Ng, Daniel; Dickson, April; Dickson, Cindy; Strom, Monica; Osorio, Carolina; Soeprono, Thomas; Wulfkuhle, Julia D.; Kabbani, Nadine; Petricoin, Emanuel F.; Liotta, Lance A.; Kirsch, Wolff M.
2012-01-01
Liquid chromatography tandem mass spectrometry (LC-MS/MS) and multiple reaction monitoring mass spectrometry (MRM-MS) proteomics analyses were performed on eccrine sweat of healthy controls, and the results were compared with those from individuals diagnosed with schizophrenia (SZ). This is the first large scale study of the sweat proteome. First, we performed LC-MS/MS on pooled SZ samples and pooled control samples for global proteomics analysis. Results revealed a high abundance of diverse proteins and peptides in eccrine sweat. Most of the proteins identified from sweat samples were found to be different than the most abundant proteins from serum, which indicates that eccrine sweat is not simply a plasma transudate, and may thereby be a source of unique disease-associated biomolecules. A second independent set of patient and control sweat samples were analyzed by LC-MS/MS and spectral counting to determine qualitative protein differential abundances between the control and disease groups. Differential abundances of selected proteins, initially determined by spectral counting, were verified by MRM-MS analyses. Seventeen proteins showed a differential abundance of approximately two-fold or greater between the SZ pooled sample and the control pooled sample. This study demonstrates the utility of LC-MS/MS and MRM-MS as a viable strategy for the discovery and verification of potential sweat protein disease biomarkers. PMID:22256890
Zheng, Wen; Li, Ximin; Zhang, Lin; Zhang, Yanzhen; Lu, Xiaoping; Tian, Jingkui
2015-06-01
Ginkgo biloba is an attractive and traditional medicinal plant, and has been widely used as a phytomedicine in the prevention and treatment of cardiovascular and cerebrovascular diseases. Flavonoids and terpene lactones are the major bioactive components of Ginkgo, whereas the ginkgolic acids (GAs) with strong allergenic properties are strictly controlled. In this study, we tested the content of flavonoids and GAs under ultraviolet-B (UV-B) treatment and performed comparative proteomic analyses to determine the differential proteins that occur upon UV-B radiation. That might play a crucial role in producing flavonoids and GAs. Our phytochemical analyses demonstrated that UV-B irradiation significantly increased the content of active flavonoids, and decreased the content of toxic GAs. We conducted comparative proteomic analysis of both whole leaf and chloroplasts proteins. In total, 27 differential proteins in the whole leaf and 43 differential proteins in the chloroplast were positively identified and functionally annotated. The proteomic data suggested that enhanced UV-B radiation exposure activated antioxidants and stress-responsive proteins as well as reduced the rate of photosynthesis. We demonstrate that UV-B irradiation pharmaceutically improved the metabolic ingredients of Ginkgo, particularly in terms of reducing GAs. With high UV absorption properties, and antioxidant activities, the flavonoids were likely highly induced as protective molecules following UV-B irradiation. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Paulo, Joao A.; Lee, Linda S.; Banks, Peter A.; Steen, Hanno; Conwell, Darwin L.
2012-01-01
Alterations in the pancreatic fluid proteome of individuals with chronic pancreatitis may offer insights into the development and progression of the disease. The endoscopic pancreas function test (ePFT) can safely collect large volumes of pancreatic fluid that are potentially amenable to proteomic analyses using difference gel electrophoresis (DiGE) coupled with liquid chromatography-tandem mass spectrometry (LC-MS/MS). Pancreatic fluid was collected endoscopically using the ePFT method following secretin stimulation from three individuals with severe chronic pancreatitis and three chronic abdominal pain controls. The fluid was processed to minimize protein degradation and the protein profiles of each cohort, as determined by DiGE and LC-MS/MS, were compared. This DiGE-LC-MS/MS analysis reveals proteins that are differentially expressed in chronic pancreatitis compared to chronic abdominal pain controls. Proteins with higher abundance in pancreatic fluid from chronic pancreatitis individuals include: actin, desmoplankin, alpha-1-antitrypsin, SNC73, and serotransferrin. Those of relatively lower abundance include carboxypeptidase B, lipase, alpha-1-antichymotrypsin, alpha-2-macroglobulin, Arp2/3 subunit 4, glyceraldehyde-3-phosphate dehydrogenase, and protein disulfide isomerase. Endoscopic collection (ePFT) in tandem with DiGE-LC-MS/MS is a suitable approach for pancreatic fluid proteome analysis, however, further optimization of our protocol, as outlined herein, may improve proteome coverage in future analyses. PMID:21792986
Using Big Data to Discover Diagnostics and Therapeutics for Gastrointestinal and Liver Diseases
Wooden, Benjamin; Goossens, Nicolas; Hoshida, Yujin; Friedman, Scott L.
2016-01-01
Technologies such as genome sequencing, gene expression profiling, proteomic and metabolomic analyses, electronic medical records, and patient-reported health information have produced large amounts of data, from various populations, cell types, and disorders (big data). However, these data must be integrated and analyzed if they are to produce models or concepts about physiologic function or mechanisms of pathogenesis. Many of these data are available to the public, allowing researchers anywhere to search for markers of specific biologic processes or therapeutic targets for specific diseases or patient types. We review recent advances in the fields of computational and systems biology, and highlight opportunities for researchers to use big data sets in the fields of gastroenterology and hepatology, to complement traditional means of diagnostic and therapeutic discovery. PMID:27773806
Mass spectrometry-based proteomics: basic principles and emerging technologies and directions.
Van Riper, Susan K; de Jong, Ebbing P; Carlis, John V; Griffin, Timothy J
2013-01-01
As the main catalytic and structural molecules within living systems, proteins are the most likely biomolecules to be affected by radiation exposure. Proteomics, the comprehensive characterization of proteins within complex biological samples, is therefore a research approach ideally suited to assess the effects of radiation exposure on cells and tissues. For comprehensive characterization of proteomes, an analytical platform capable of quantifying protein abundance, identifying post-translation modifications and revealing members of protein complexes on a system-wide level is necessary. Mass spectrometry (MS), coupled with technologies for sample fractionation and automated data analysis, provides such a versatile and powerful platform. In this chapter we offer a view on the current state of MS-proteomics, and focus on emerging technologies within three areas: (1) New instrumental methods; (2) New computational methods for peptide identification; and (3) Label-free quantification. These emerging technologies should be valuable for researchers seeking to better understand biological effects of radiation on living systems.
Hiller, Karsten; Grote, Andreas; Maneck, Matthias; Münch, Richard; Jahn, Dieter
2006-10-01
After the publication of JVirGel 1.0 in 2003 we got many requests and suggestions from the proteomics community to further improve the performance of the software and to add additional useful new features. The integration of the PrediSi algorithm for the prediction of signal peptides for the Sec-dependent protein export into JVirGel 2.0 allows the exclusion of most exported preproteins from calculated proteomic maps and provides the basis for the calculation of Sec-based secretomes. A tool for the identification of transmembrane helices carrying proteins (JCaMelix) and the prediction of the corresponding membrane proteome was added. Finally, in order to directly compare experimental and calculated proteome data, a function to overlay and evaluate predicted and experimental two-dimensional gels was included. JVirGel 2.0 is freely available as precompiled package for the installation on Windows or Linux operating systems. Furthermore, there is a completely platform-independent Java version available for download. Additionally, we provide a Java Server Pages based version of JVirGel 2.0 which can be operated in nearly all web browsers. All versions are accessible at http://www.jvirgel.de
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23467006
Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio
2014-01-01
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.
Webb-Robertson, Bobbie-Jo M.; Wiberg, Holli K.; Matzke, Melissa M.; ...
2015-04-09
In this review, we apply selected imputation strategies to label-free liquid chromatography–mass spectrometry (LC–MS) proteomics datasets to evaluate the accuracy with respect to metrics of variance and classification. We evaluate several commonly used imputation approaches for individual merits and discuss the caveats of each approach with respect to the example LC–MS proteomics data. In general, local similarity-based approaches, such as the regularized expectation maximization and least-squares adaptive algorithms, yield the best overall performances with respect to metrics of accuracy and robustness. However, no single algorithm consistently outperforms the remaining approaches, and in some cases, performing classification without imputation sometimes yieldedmore » the most accurate classification. Thus, because of the complex mechanisms of missing data in proteomics, which also vary from peptide to protein, no individual method is a single solution for imputation. In summary, on the basis of the observations in this review, the goal for imputation in the field of computational proteomics should be to develop new approaches that work generically for this data type and new strategies to guide users in the selection of the best imputation for their dataset and analysis objectives.« less
Neural Stem Cells (NSCs) and Proteomics.
Shoemaker, Lorelei D; Kornblum, Harley I
2016-02-01
Neural stem cells (NSCs) can self-renew and give rise to the major cell types of the CNS. Studies of NSCs include the investigation of primary, CNS-derived cells as well as animal and human embryonic stem cell (ESC)-derived and induced pluripotent stem cell (iPSC)-derived sources. NSCs provide a means with which to study normal neural development, neurodegeneration, and neurological disease and are clinically relevant sources for cellular repair to the damaged and diseased CNS. Proteomics studies of NSCs have the potential to delineate molecules and pathways critical for NSC biology and the means by which NSCs can participate in neural repair. In this review, we provide a background to NSC biology, including the means to obtain them and the caveats to these processes. We then focus on advances in the proteomic interrogation of NSCs. This includes the analysis of posttranslational modifications (PTMs); approaches to analyzing different proteomic compartments, such the secretome; as well as approaches to analyzing temporal differences in the proteome to elucidate mechanisms of differentiation. We also discuss some of the methods that will undoubtedly be useful in the investigation of NSCs but which have not yet been applied to the field. While many proteomics studies of NSCs have largely catalogued the proteome or posttranslational modifications of specific cellular states, without delving into specific functions, some have led to understandings of functional processes or identified markers that could not have been identified via other means. Many challenges remain in the field, including the precise identification and standardization of NSCs used for proteomic analyses, as well as how to translate fundamental proteomics studies to functional biology. The next level of investigation will require interdisciplinary approaches, combining the skills of those interested in the biochemistry of proteomics with those interested in modulating NSC function. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Biomechanical and proteomic analysis of INF- β-treated astrocytes
NASA Astrophysics Data System (ADS)
Vergara, Daniele; Martignago, Roberta; Leporatti, Stefano; Bonsegna, Stefania; Maruccio, Giuseppe; De Nuccio, Franco; Santino, Angelo; Cingolani, Roberto; Nicolardi, Giuseppe; Maffia, Michele; Rinaldi, Ross
2009-11-01
Astrocytes have a key role in the pathogenesis of several diseases including multiple sclerosis and were proposed as the designed target for immunotherapy. In this study we used atomic force microscopy (AFM) and proteomics methods to analyse and correlate the modifications induced in the viscoleastic properties of astrocytes to the changes induced in protein expression after interferon- β (IFN-β) treatment. Our results indicated that IFN-β treatment resulted in a significant decrease in the Young's modulus, a measure of cell elasticity, in comparison with control cells. The molecular mechanisms that trigger these changes were investigated by 2DE (two-dimensional electrophoresis) and confocal analyses and confirmed by western blotting. Altered proteins were found to be involved in cytoskeleton organization and other important physiological processes.
Isolation of Chromoplasts and Suborganellar Compartments from Tomato and Bell Pepper Fruit.
Barsan, Cristina; Kuntz, Marcel; Pech, Jean-Claude
2017-01-01
Tomato is a model for fruit development and ripening. The isolation of intact plastids from this organism is therefore important for metabolic and proteomic analyses. Pepper, a species from the same family, is also of interest since it allows isolation of intact chromoplasts in large amounts. Here, we provide a detailed protocol for the isolation of tomato plastids at three fruit developmental stages, namely, nascent chromoplasts from the mature green stage, chromoplasts from an intermediate stage, and fully differentiated red chromoplasts. The method relies on sucrose density gradient centrifugations. It yields high purity organelles suitable for proteome analyses. Enzymatic and microscopy assays are summarized to assess purity and intactness. A method is also described for subfractionation of pepper chromoplast lipoprotein structures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bao, Guanhui; University of Chinese Academy of Sciences, Beijing; Dong, Hongjun
Highlights: • Genomes of a butanol tolerant strain and its parent strain were deciphered. • Comparative genomic and proteomic was applied to understand butanol tolerance. • None differentially expressed proteins have mutations in its corresponding genes. • Mutations in ribosome might be responsible for the global difference of proteomics. - Abstract: Clostridium acetobutylicum strain Rh8 is a butanol-tolerant mutant which can tolerate up to 19 g/L butanol, 46% higher than that of its parent strain DSM 1731. We previously performed comparative cytoplasm- and membrane-proteomic analyses to understand the mechanism underlying the improved butanol tolerance of strain Rh8. In this work,more » we further extended this comparison to the genomic level. Compared with the genome of the parent strain DSM 1731, two insertion sites, four deletion sites, and 67 single nucleotide variations (SNVs) are distributed throughout the genome of strain Rh8. Among the 67 SNVs, 16 SNVs are located in the predicted promoters and intergenic regions; while 29 SNVs are located in the coding sequence, affecting a total of 21 proteins involved in transport, cell structure, DNA replication, and protein translation. The remaining 22 SNVs are located in the ribosomal genes, affecting a total of 12 rRNA genes in different operons. Analysis of previous comparative proteomic data indicated that none of the differentially expressed proteins have mutations in its corresponding genes. Rchange Algorithms analysis indicated that the mutations occurred in the ribosomal genes might change the ribosome RNA thermodynamic characteristics, thus affect the translation strength of these proteins. Take together, the improved butanol tolerance of C. acetobutylicum strain Rh8 might be acquired through regulating the translational process to achieve different expression strength of genes involved in butanol tolerance.« less
Bernal, Dolores; Trelis, Maria; Montaner, Sergio; Cantalapiedra, Fernando; Galiano, Alicia; Hackenberg, Michael; Marcilla, Antonio
2014-06-13
With the aim of characterizing the molecules involved in the interaction of Dicrocoelium dendriticum adults and the host, we have performed proteomic analyses of the external surface of the parasite using the currently available datasets including the transcriptome of the related species Echinostoma caproni. We have identified 182 parasite proteins on the outermost surface of D. dendriticum. The presence of exosome-like vesicles in the ESP of D. dendriticum and their components has also been characterized. Using proteomic approaches, we have characterized 84 proteins in these vesicles. Interestingly, we have detected miRNA in D. dendriticum exosomes, thus representing the first report of miRNA in helminth exosomes. In order to identify potential targets for intervention against parasitic helminths, we have analyzed the surface of the parasitic helminth Dicrocoelium dendriticum. Along with the proteomic analyses of the outermost layer of the parasite, our work describes the molecular characterization of the exosomes of D. dendriticum. Our proteomic data confirm the improvement of protein identification from "non-model organisms" like helminths, when using different search engines against a combination of available databases. In addition, this work represents the first report of miRNAs in parasitic helminth exosomes. These vesicles can pack specific proteins and RNAs providing stability and resistance to RNAse digestion in body fluids, and provide a way to regulate host-parasite interplay. The present data should provide a solid foundation for the development of novel methods to control this non-model organism and related parasites. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Proteomic analysis of Bombyx mori molting fluid: Insights into the molting process.
Liu, Hua-Wei; Wang, Luo-Ling; Tang, Xin; Dong, Zhao-Ming; Guo, Peng-Chao; Zhao, Dong-Chao; Xia, Qing-You; Zhao, Ping
2018-02-20
Molting is an essential biological process occurring multiple times throughout the life cycle of most Ecdysozoa. Molting fluids accumulate and function in the exuvial space during the molting process. In this study, we used liquid chromatography-tandem mass spectrometry to investigate the molting fluids to analyze the molecular mechanisms of molting in the silkworm, Bombyx mori. In total, 375 proteins were identified in molting fluids from the silkworm at 14-16h before pupation and eclosion, including 12 chitin metabolism-related enzymes, 35 serine proteases, 15 peptidases, and 38 protease inhibitors. Gene ontology analysis indicated that "catalytic" constitutes the most enriched function in the molting fluid. Gene expression patterns and bioinformatic analyses suggested that numerous enzymes are involved in the degradation of cuticle proteins and chitin. Protein-protein interaction network and activity analyses showed that protease inhibitors are involved in the regulation of multiple pathways in molting fluid. Additionally, many immune-related proteins may be involved in the immune defense during molting. These results provide a comprehensive proteomic insight into proteolytic enzymes and protease inhibitors in molting fluid, and will likely improve the current understanding of physiological processes in insect molting. Insect molting constitutes a dynamic physiological process. To better understand this process, we used LC-MS/MS to investigate the proteome of silkworm molting fluids and identified key proteins involved in silkworm molting. The biological processes of the old cuticle degradation pathway and immune defense response were analyzed in the proteome of silkworm molting fluid. We report that protease inhibitors serve as key factors in the regulation of the molting process. The proteomic results provide new insight into biological molting processes in insects. Copyright © 2017 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jacobs, Jon M.; Diamond, Deborah L.; Chan, Eric Y.
2005-06-01
The development of a reproducible model system for the study of Hepatitis C virus (HCV) infection has the potential to significantly enhance the study of virus-host interactions and provide future direction for modeling the pathogenesis of HCV. While there are studies describing global gene expression changes associated with HCV infection, changes in the proteome have not been characterized. We report the first large scale proteome analysis of the highly permissive Huh-7.5 cell line containing a full length HCV replicon. We detected > 4,400 proteins in this cell line, including HCV replicon proteins, using multidimensional liquid chromatographic (LC) separations coupled tomore » mass spectrometry (MS). The set of Huh-7.5 proteins confidently identified is, to our knowledge, the most comprehensive yet reported for a human cell line. Consistent with the literature, a comparison of Huh-7.5 cells (+) and (-) the HCV replicon identified expression changes of proteins involved in lipid metabolism. We extended these analyses to liver biopsy material from HCV-infected patients where > 1,500 proteins were detected from 2 {micro}g protein lysate using the Huh-7.5 protein database and the accurate mass and time (AMT) tag strategy. These findings demonstrate the utility of multidimensional proteome analysis of the HCV replicon model system for assisting the determination of proteins/pathways affected by HCV infection. Our ability to extend these analyses to the highly complex proteome of small liver biopsies with limiting protein yields offers the unique opportunity to begin evaluating the clinical significance of protein expression changes associated with HCV infection.« less
Kim, Bong-Woo; Lee, Chang Seok; Yi, Jae-Sung; Lee, Joo-Hyung; Lee, Joong-Won; Choo, Hyo-Jung; Jung, Soon-Young; Kim, Min-Sik; Lee, Sang-Won; Lee, Myung-Shik; Yoon, Gyesoon; Ko, Young-Gyu
2010-12-01
Although accumulating proteomic analyses have supported the fact that mitochondrial oxidative phosphorylation (OXPHOS) complexes are localized in lipid rafts, which mediate cell signaling, immune response and host-pathogen interactions, there has been no in-depth study of the physiological functions of lipid-raft OXPHOS complexes. Here, we show that many subunits of OXPHOS complexes were identified from the lipid rafts of human adipocytes, C2C12 myotubes, Jurkat cells and surface biotin-labeled Jurkat cells via shotgun proteomic analysis. We discuss the findings of OXPHOS complexes in lipid rafts, the role of the surface ATP synthase complex as a receptor for various ligands and extracellular superoxide generation by plasma membrane oxidative phosphorylation complexes.
Salt stress induces changes in the proteomic profile of micropropagated sugarcane shoots
Reis, Ricardo S.; Heringer, Angelo S.; Rangel, Patricia L.; Santa-Catarina, Claudete; Grativol, Clícia; Veiga, Carlos F. M.; Souza-Filho, Gonçalo A.
2017-01-01
Salt stress is one of the most common stresses in agricultural regions worldwide. In particular, sugarcane is affected by salt stress conditions, and no sugarcane cultivar presently show high productivity accompanied by a tolerance to salt stress. Proteomic analysis allows elucidation of the important pathways involved in responses to various abiotic stresses at the biochemical and molecular levels. Thus, this study aimed to analyse the proteomic effects of salt stress in micropropagated shoots of two sugarcane cultivars (CB38-22 and RB855536) using a label-free proteomic approach. The mass spectrometry proteomics data are available via ProteomeXchange with identifier PXD006075. The RB855536 cultivar is more tolerant to salt stress than CB38-22. A quantitative label-free shotgun proteomic analysis identified 1172 non-redundant proteins, and 1160 of these were observed in both cultivars in the presence or absence of NaCl. Compared with CB38-22, the RB855536 cultivar showed a greater abundance of proteins involved in non-enzymatic antioxidant mechanisms, ion transport, and photosynthesis. Some proteins, such as calcium-dependent protein kinase, photosystem I, phospholipase D, and glyceraldehyde-3-phosphate dehydrogenase, were more abundant in the RB855536 cultivar under salt stress. Our results provide new insights into the response of sugarcane to salt stress, and the changes in the abundance of these proteins might be important for the acquisition of ionic and osmotic homeostasis during exposure to salt stress. PMID:28419154
Trauma-associated Human Neutrophil Alterations Revealed by Comparative Proteomics Profiling
Zhou, Jian-Ying; Krovvidi, Ravi K.; Gao, Yuqian; Gao, Hong; Petritis, Brianne O.; De, Asit; Miller-Graziano, Carol; Bankey, Paul E.; Petyuk, Vladislav A.; Nicora, Carrie D.; Clauss, Therese R; Moore, Ronald J.; Shi, Tujin; Brown, Joseph N.; Kaushal, Amit; Xiao, Wenzhong; Davis, Ronald W.; Maier, Ronald V.; Tompkins, Ronald G.; Qian, Wei-Jun; Camp, David G.; Smith, Richard D.
2013-01-01
PURPOSE Polymorphonuclear neutrophils (PMNs) play an important role in mediating the innate immune response after severe traumatic injury; however, the cellular proteome response to traumatic condition is still largely unknown. EXPERIMENTAL DESIGN We applied 2D-LC-MS/MS based shotgun proteomics to perform comparative proteome profiling of human PMNs from severe trauma patients and healthy controls. RESULTS A total of 197 out of ~2500 proteins (being identified with at least two peptides) were observed with significant abundance changes following the injury. The proteomics data were further compared with transcriptomics data for the same genes obtained from an independent patient cohort. The comparison showed that the protein abundance changes for the majority of proteins were consistent with the mRNA abundance changes in terms of directions of changes. Moreover, increased protein secretion was suggested as one of the mechanisms contributing to the observed discrepancy between protein and mRNA abundance changes. Functional analyses of the altered proteins showed that many of these proteins were involved in immune response, protein biosynthesis, protein transport, NRF2-mediated oxidative stress response, the ubiquitin-proteasome system, and apoptosis pathways. CONCLUSIONS AND CLINICAL RELEVANCE Our data suggest increased neutrophil activation and inhibited neutrophil apoptosis in response to trauma. The study not only reveals an overall picture of functional neutrophil response to trauma at the proteome level, but also provides a rich proteomics data resource of trauma-associated changes in the neutrophil that will be valuable for further studies of the functions of individual proteins in PMNs. PMID:23589343
Plant fluid proteomics: Delving into the xylem sap, phloem sap and apoplastic fluid proteomes.
Rodríguez-Celma, Jorge; Ceballos-Laita, Laura; Grusak, Michael A; Abadía, Javier; López-Millán, Ana-Flor
2016-08-01
The phloem sap, xylem sap and apoplastic fluid play key roles in long and short distance transport of signals and nutrients, and act as a barrier against local and systemic pathogen infection. Among other components, these plant fluids contain proteins which are likely to be important players in their functionalities. However, detailed information about their proteomes is only starting to arise due to the difficulties inherent to the collection methods. This review compiles the proteomic information available to date in these three plant fluids, and compares the proteomes obtained in different plant species in order to shed light into conserved functions in each plant fluid. Inter-species comparisons indicate that all these fluids contain the protein machinery for self-maintenance and defense, including proteins related to cell wall metabolism, pathogen defense, proteolysis, and redox response. These analyses also revealed that proteins may play more relevant roles in signaling in the phloem sap and apoplastic fluid than in the xylem sap. A comparison of the proteomes of the three fluids indicates that although functional categories are somewhat similar, proteins involved are likely to be fluid-specific, except for a small group of proteins present in the three fluids, which may have a universal role, especially in cell wall maintenance and defense. This article is part of a Special Issue entitled: Plant Proteomics--a bridge between fundamental processes and crop production, edited by Dr. Hans-Peter Mock. Copyright © 2016 Elsevier B.V. All rights reserved.
An automated method for detecting alternatively spliced protein domains.
Coelho, Vitor; Sammeth, Michael
2018-06-01
Alternative splicing (AS) has been demonstrated to play a role in shaping eukaryotic gene diversity at the transcriptional level. However, the impact of AS on the proteome is still controversial. Studies that seek to explore the effect of AS at the proteomic level are hampered by technical difficulties in the cumbersome process of casting forth and back between genome, transcriptome and proteome space coordinates, and the naïve prediction of protein domains in the presence of AS suffers many redundant sequence scans that emerge from constitutively spliced regions that are shared between alternative products of a gene. We developed the AstaFunk pipeline that computes for every generic transcriptome all domains that are altered by AS events in a systematic and efficient manner. In a nutshell, our method employs Viterbi dynamic programming, which guarantees to find all score-optimal hits of the domains under consideration, while complementary optimisations at different levels avoid redundant and other irrelevant computations. We evaluate AstaFunk qualitatively and quantitatively using RNAseq in well-studied genes with AS, and on large-scale employing entire transcriptomes. Our study confirms complementary reports that the effect of most AS events on the proteome seems to be rather limited, but our results also pinpoint several cases where AS could have a major impact on the function of a protein domain. The JAVA implementation of AstaFunk is available as an open source project on http://astafunk.sammeth.net. micha@sammeth.net. Supplementary data are available at Bioinformatics online.
Yu, Qianqian; Wu, Wei; Tian, Xiaojing; Hou, Man; Dai, Ruitong; Li, Xingmin
2017-02-10
Label-free proteomics was applied to characterize the effect of post-mortem storage time (0, 4, and 9days at 4°C±1°C) on the proteome changes of M. semitendinosus (SM) in Holstein cattle, and correlations between differentially abundant proteins and meat color traits were investigated. The redness (a*) value decreased significantly (P<0.05) during post-mortem storage, meanwhile, the relative proportion of metmyoglobin increased significantly (P<0.05) from 16.99% at day 0 to 40.26% at day 9. A total of 118 proteins with significant changes (fold change>1.5, P<0.05) was identified by comparisons of day 4 vs. day 0, day 9 vs. day 0, and day 9 vs. day 4. Principal component and hierarchical cluster analyses of these proteins were performed, and results exhibited clear distinctions among samples from different storage times. Eighteen differentially abundant proteins were correlated closely with the a* value of meat. Bioinformatics analyses revealed that most of these proteins were involved in glycolysis and energy metabolism, electron-transfer processes, and the antioxidation function, which implied an underlying connection between meat discoloration and these biological processes. It is always a challenge for scientists to improve the stability of meat color during post-mortem storage and retail display. However, the mechanism involved in meat discoloration has not been unraveled completely, and the application of label-free proteomics in studying meat discoloration has not been reported. Our work discovers some key proteins in SM muscle of Holstein cattle that were correlated with a* value of meat via label-free proteomics. Bioinformatics analyses revealed that some of these differentially abundant proteins were involved in glycolysis and energy metabolism, electron-transfer processes, and the antioxidation function, which implied an underlying connection between meat discoloration and these biological processes. These results provide the theoretic basis on understanding of complicated biochemical changes and underlying molecular mechanisms responsible for meat discoloration. Copyright © 2016 Elsevier B.V. All rights reserved.
Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes
Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise
2009-01-01
Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high level of transcripts were analyzed in details. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of this transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. Conversely, only a few proteins encoded by genes having moderate or high level of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885
2011-01-01
Background Since its inception, proteomics has essentially operated in a discovery mode with the goal of identifying and quantifying the maximal number of proteins in a sample. Increasingly, proteomic measurements are also supporting hypothesis-driven studies, in which a predetermined set of proteins is consistently detected and quantified in multiple samples. Selected reaction monitoring (SRM) is a targeted mass spectrometric technique that supports the detection and quantification of specific proteins in complex samples at high sensitivity and reproducibility. Here, we describe ATAQS, an integrated software platform that supports all stages of targeted, SRM-based proteomics experiments including target selection, transition optimization and post acquisition data analysis. This software will significantly facilitate the use of targeted proteomic techniques and contribute to the generation of highly sensitive, reproducible and complete datasets that are particularly critical for the discovery and validation of targets in hypothesis-driven studies in systems biology. Result We introduce a new open source software pipeline, ATAQS (Automated and Targeted Analysis with Quantitative SRM), which consists of a number of modules that collectively support the SRM assay development workflow for targeted proteomic experiments (project management and generation of protein, peptide and transitions and the validation of peptide detection by SRM). ATAQS provides a flexible pipeline for end-users by allowing the workflow to start or end at any point of the pipeline, and for computational biologists, by enabling the easy extension of java algorithm classes for their own algorithm plug-in or connection via an external web site. This integrated system supports all steps in a SRM-based experiment and provides a user-friendly GUI that can be run by any operating system that allows the installation of the Mozilla Firefox web browser. Conclusions Targeted proteomics via SRM is a powerful new technique that enables the reproducible and accurate identification and quantification of sets of proteins of interest. ATAQS is the first open-source software that supports all steps of the targeted proteomics workflow. ATAQS also provides software API (Application Program Interface) documentation that enables the addition of new algorithms to each of the workflow steps. The software, installation guide and sample dataset can be found in http://tools.proteomecenter.org/ATAQS/ATAQS.html PMID:21414234
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Matzke, Melissa M.; Datta, Susmita
As the capability of mass spectrometry-based proteomics has matured, tens of thousands of peptides can be measured simultaneously, which has the benefit of offering a systems view of protein expression. However, a major challenge is that with an increase in throughput, protein quantification estimation from the native measured peptides has become a computational task. A limitation to existing computationally-driven protein quantification methods is that most ignore protein variation, such as alternate splicing of the RNA transcript and post-translational modifications or other possible proteoforms, which will affect a significant fraction of the proteome. The consequence of this assumption is that statisticalmore » inference at the protein level, and consequently downstream analyses, such as network and pathway modeling, have only limited power for biomarker discovery. Here, we describe a Bayesian model (BP-Quant) that uses statistically derived peptides signatures to identify peptides that are outside the dominant pattern, or the existence of multiple over-expressed patterns to improve relative protein abundance estimates. It is a research-driven approach that utilizes the objectives of the experiment, defined in the context of a standard statistical hypothesis, to identify a set of peptides exhibiting similar statistical behavior relating to a protein. This approach infers that changes in relative protein abundance can be used as a surrogate for changes in function, without necessarily taking into account the effect of differential post-translational modifications, processing, or splicing in altering protein function. We verify the approach using a dilution study from mouse plasma samples and demonstrate that BP-Quant achieves similar accuracy as the current state-of-the-art methods at proteoform identification with significantly better specificity. BP-Quant is available as a MatLab ® and R packages at https://github.com/PNNL-Comp-Mass-Spec/BP-Quant.« less
Cloud parallel processing of tandem mass spectrometry based proteomics data.
Mohammed, Yassene; Mostovenko, Ekaterina; Henneman, Alex A; Marissen, Rob J; Deelder, André M; Palmblad, Magnus
2012-10-05
Data analysis in mass spectrometry based proteomics struggles to keep pace with the advances in instrumentation and the increasing rate of data acquisition. Analyzing this data involves multiple steps requiring diverse software, using different algorithms and data formats. Speed and performance of the mass spectral search engines are continuously improving, although not necessarily as needed to face the challenges of acquired big data. Improving and parallelizing the search algorithms is one possibility; data decomposition presents another, simpler strategy for introducing parallelism. We describe a general method for parallelizing identification of tandem mass spectra using data decomposition that keeps the search engine intact and wraps the parallelization around it. We introduce two algorithms for decomposing mzXML files and recomposing resulting pepXML files. This makes the approach applicable to different search engines, including those relying on sequence databases and those searching spectral libraries. We use cloud computing to deliver the computational power and scientific workflow engines to interface and automate the different processing steps. We show how to leverage these technologies to achieve faster data analysis in proteomics and present three scientific workflows for parallel database as well as spectral library search using our data decomposition programs, X!Tandem and SpectraST.
Kniemeyer, Olaf
2011-08-01
Fungal species of the genus Aspergillus play significant roles as model organisms in basic research, as "cell factories" for the production of organic acids, pharmaceuticals or industrially important enzymes and as pathogens causing superficial and invasive infections in animals and humans. The release of the genome sequences of several Aspergillus sp. has paved the way for global analyses of protein expression in Aspergilli including the characterisation of proteins, which have not designated any function. With the application of proteomic methods, particularly 2-D gel and LC-MS/MS-based methods, first insights into the composition of the proteome of Aspergilli under different growth and stress conditions could be gained. Putative targets of global regulators led to the improvement of industrially relevant Aspergillus strains and so far not described Aspergillus antigens have already been discovered. Here, I review the recent proteome data generated for the species Aspergillus nidulans, Aspergillus fumigatus, Aspergillus niger, Aspergillus terreus, Aspergillus flavus and Aspergillus oryzae. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Whigham, Arlene; Clarke, Rosemary; Brenes-Murillo, Alejandro J; Estes, Brett; Madhessian, Diana; Lundberg, Emma; Wadsworth, Patricia
2017-01-01
The temporal regulation of protein abundance and post-translational modifications is a key feature of cell division. Recently, we analysed gene expression and protein abundance changes during interphase under minimally perturbed conditions (Ly et al., 2014, 2015). Here, we show that by using specific intracellular immunolabelling protocols, FACS separation of interphase and mitotic cells, including mitotic subphases, can be combined with proteomic analysis by mass spectrometry. Using this PRIMMUS (PRoteomic analysis of Intracellular iMMUnolabelled cell Subsets) approach, we now compare protein abundance and phosphorylation changes in interphase and mitotic fractions from asynchronously growing human cells. We identify a set of 115 phosphorylation sites increased during G2, termed ‘early risers’. This set includes phosphorylation of S738 on TPX2, which we show is important for TPX2 function and mitotic progression. Further, we use PRIMMUS to provide the first a proteome-wide analysis of protein abundance remodeling between prophase, prometaphase and anaphase. PMID:29052541
NASA Astrophysics Data System (ADS)
Li, Huilin; Nguyen, Hong Hanh; Ogorzalek Loo, Rachel R.; Campuzano, Iain D. G.; Loo, Joseph A.
2018-02-01
Mass spectrometry (MS) has become a crucial technique for the analysis of protein complexes. Native MS has traditionally examined protein subunit arrangements, while proteomics MS has focused on sequence identification. These two techniques are usually performed separately without taking advantage of the synergies between them. Here we describe the development of an integrated native MS and top-down proteomics method using Fourier-transform ion cyclotron resonance (FTICR) to analyse macromolecular protein complexes in a single experiment. We address previous concerns of employing FTICR MS to measure large macromolecular complexes by demonstrating the detection of complexes up to 1.8 MDa, and we demonstrate the efficacy of this technique for direct acquirement of sequence to higher-order structural information with several large complexes. We then summarize the unique functionalities of different activation/dissociation techniques. The platform expands the ability of MS to integrate proteomics and structural biology to provide insights into protein structure, function and regulation.
Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L.; Dianes, José A.; del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W.; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio
2016-01-01
Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra. PMID:27493588
Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L; Dianes, José A; Del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio
2016-08-01
Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra.
Rabek, Jeffrey P.; Hafer-Macko, Charlene E.; Amaning, James K.; DeFord, James H.; Dimayuga, Vincent L.; Madsen, Mark A.; Macko, Richard F.
2009-01-01
Stroke disability is attributed to upper motor neuron deficits resulting from ischemic brain injury. We have developed proteome maps of the Vastus lateralis to examine the effects of ischemic brain injury on paretic skeletal muscle myofilament proteins. Proteomics analyses from seven hemiparetic stroke patients have detected a decrease of three troponin T isoforms in the paretic muscle suggesting that myosin–actin interactions may be attenuated. We propose that ischemic brain injury may prevent troponin T participation in complex formation thereby affecting the protein interactions associated with excitation–contraction coupling. We have also detected a novel skeletal troponin T isoform that has a C-terminal variation. Our data suggest that the decreased slow troponin T isoform pools in the paretic limb may contribute to the gait deficit after stroke. The complexity of the neurological deficit on Vastus lateralis is suggested by the multiple changes in proteins detected by our proteomics mapping. PMID:19447848
Transcriptional and Proteomic Profiling of Aspergillus flavipes in Response to Sulfur Starvation.
El-Sayed, Ashraf S A; Yassin, Marwa A; Ali, Gul Shad
2015-01-01
Aspergillus flavipes has received considerable interest due to its potential to produce therapeutic enzymes involved in sulfur amino acid metabolism. In natural habitats, A. flavipes survives under sulfur limitations by mobilizing endogenous and exogenous sulfur to operate diverse cellular processes. Sulfur limitation affects virulence and pathogenicity, and modulates proteome of sulfur assimilating enzymes of several fungi. However, there are no previous reports aimed at exploring effects of sulfur limitation on the regulation of A. flavipes sulfur metabolism enzymes at the transcriptional, post-transcriptional and proteomic levels. In this report, we show that sulfur limitation affects morphological and physiological responses of A. flavipes. Transcription and enzymatic activities of several key sulfur metabolism genes, ATP-sulfurylase, sulfite reductase, methionine permease, cysteine synthase, cystathionine β- and γ-lyase, glutathione reductase and glutathione peroxidase were increased under sulfur starvation conditions. A 50 kDa protein band was strongly induced by sulfur starvation, and the proteomic analyses of this protein band using LC-MS/MS revealed similarity to many proteins involved in the sulfur metabolism pathway.
A DIGE proteomic analysis for high-intensity exercise-trained rat skeletal muscle.
Yamaguchi, Wataru; Fujimoto, Eri; Higuchi, Mitsuru; Tabata, Izumi
2010-09-01
Exercise training induces various adaptations in skeletal muscles. However, the mechanisms remain unclear. In this study, we conducted 2D-DIGE proteomic analysis, which has not yet been used for elucidating adaptations of skeletal muscle after high-intensity exercise training (HIT). For 5 days, rats performed HIT, which consisted of 14 20-s swimming exercise bouts carrying a weight (14% of the body weight), and 10-s pause between bouts. The 2D-DIGE analysis was conducted on epitrochlearis muscles excised 18 h after the final training exercise. Proteomic profiling revealed that out of 800 detected and matched spots, 13 proteins exhibited changed expression by HIT compared with sedentary rats. All proteins were identified by MALDI-TOF/MS. Furthermore, using western immunoblot analyses, significantly changed expressions of NDUFS1 and parvalbumin (PV) were validated in relation to HIT. In conclusion, the proteomic 2D-DIGE analysis following HIT-identified expressions of NDUFS1 and PV, previously unknown to have functions related to exercise-training adaptations.
Li, Chunjian; Li, Xuexian
2012-01-01
Optimal nitrogen (N) supply is critical for achieving high grain yield of maize. It is well established that N deficiency significantly reduces grain yield and N oversupply reduces N use efficiency without significant yield increase. However, the underlying proteomic mechanism remains poorly understood. The present field study showed that N deficiency significantly reduced ear size and dry matter accumulation in the cob and grain, directly resulting in a significant decrease in grain yield. The N content, biomass accumulation, and proteomic variations were further analysed in young ears at the silking stage under different N regimes. N deficiency significantly reduced N content and biomass accumulation in young ears of maize plants. Proteomic analysis identified 47 proteins with significant differential accumulation in young ears under different N treatments. Eighteen proteins also responded to other abiotic and biotic stresses, suggesting that N nutritional imbalance triggered a general stress response. Importantly, 24 proteins are involved in regulation of hormonal metabolism and functions, ear development, and C/N metabolism in young ears, indicating profound impacts of N nutrition on ear growth and grain yield at the proteomic level. PMID:22936831
Proteomics technique opens new frontiers in mobilome research.
Davidson, Andrew D; Matthews, David A; Maringer, Kevin
2017-01-01
A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti ). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.
Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework
2012-01-01
Background For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. Results We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. Conclusion The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources. PMID:23216909
Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework.
Lewis, Steven; Csordas, Attila; Killcoyne, Sarah; Hermjakob, Henning; Hoopmann, Michael R; Moritz, Robert L; Deutsch, Eric W; Boyle, John
2012-12-05
For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources.
Nicolaou, Orthodoxia; Kousios, Andreas; Hadjisavvas, Andreas; Lauwerys, Bernard; Sokratous, Kleitos; Kyriacou, Kyriacos
2017-05-01
Advances in mass spectrometry technologies have created new opportunities for discovering novel protein biomarkers in systemic lupus erythematosus (SLE). We performed a systematic review of published reports on proteomic biomarkers identified in SLE patients using mass spectrometry-based proteomics and highlight their potential disease association and clinical utility. Two electronic databases, MEDLINE and EMBASE, were systematically searched up to July 2015. The methodological quality of studies included in the review was performed according to Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines. Twenty-five studies were included in the review, identifying 241 SLE candidate proteomic biomarkers related to various aspects of the disease including disease diagnosis and activity or pinpointing specific organ involvement. Furthermore, 13 of the 25 studies validated their results for a selected number of biomarkers in an independent cohort, resulting in the validation of 28 candidate biomarkers. It is noteworthy that 11 candidate biomarkers were identified in more than one study. A significant number of potential proteomic biomarkers that are related to a number of aspects of SLE have been identified using mass spectrometry proteomic approaches. However, further studies are required to assess the utility of these biomarkers in routine clinical practice. © 2016 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
Bracht, Thilo; Hagemann, Sascha; Loscha, Marius; Megger, Dominik A; Padden, Juliet; Eisenacher, Martin; Kuhlmann, Katja; Meyer, Helmut E; Baba, Hideo A; Sitek, Barbara
2014-06-06
The Baculoviral IAP repeat-containing protein 5 (BIRC5), also known as inhibitor of apoptosis protein survivin, is a member of the chromosomal passenger complex and a key player in mitosis. To investigate the function of BIRC5 in liver regeneration, we analyzed a hepatocyte-specific BIRC5-knockout mouse model using a quantitative label-free proteomics approach. Here, we present the analyses of the proteome changes in hepatocyte-specific BIRC5-knockout mice compared to wildtype mice, as well as proteome changes during liver regeneration induced by partial hepatectomy in wildtype mice and mice lacking hepatic BIRC5, respectively. The BIRC5-knockout mice showed an extensive overexpression of proteins related to cellular maintenance, organization and protein synthesis. Key regulators of cell growth, transcription and translation MTOR and STAT1/STAT2 were found to be overexpressed. During liver regeneration proteome changes representing a response to the mitotic stimulus were detected in wildtype mice. Mainly proteins corresponding to proliferation, cell cycle and cytokinesis were up-regulated. The hepatocyte-specific BIRC5-knockout mice showed impaired liver regeneration, which had severe consequences on the proteome level. However, several proteins with function in mitosis were found to be up-regulated upon the proliferative stimulus. Our results show that the E3 ubiquitin-protein ligase UHRF1 is strongly up-regulated during liver regeneration independently of BIRC5.
Shin, Kwang-Soo; Park, Hee-Soo; Kim, Young-Hwan; Yu, Jae-Hyuk
2013-07-11
FlbA is a regulator of G-protein signaling protein that plays a central role in attenuating heterotrimeric G-protein mediated vegetative growth signaling in Aspergillus. The deletion of flbA (∆flbA) in the opportunistic human pathogen Aspergillus fumigatus results in accelerated cell death and autolysis in submerged culture. To further investigate the effects of ∆flbA on intracellular protein levels we carried out 2-D proteome analyses of 2-day old submerged cultures of ∆flbA and wild type (WT) strains and observed 160 differentially expressed proteins. Via nano-LC-ESI-MS/MS analyses, we revealed the identity of 10 and 2 proteins exhibiting high and low level accumulation, respectively, in ∆flbA strain. Notably, the GliT protein is accumulated at about 1800-fold higher levels in ∆flbA than WT. Moreover, GliT is secreted at high levels from ∆flbA strain, whereas Sod1 (superoxide dismutase) is secreted at a higher level in WT. Northern blot analyses reveal that ∆flbA results in elevated accumulation of gliT mRNA. Consequently, ∆flbA strain exhibits enhanced tolerance to gliotoxin toxicity. Finally, ∆flbA strain displayed enhanced SOD activity and elevated resistance to menadione and paraquat. In summary, FlbA-mediated signaling control negatively affects cellular responses associated with detoxification of reactive oxygen species and of exogenous gliotoxin in A. fumigatus. Regulator of G protein Signaling (RGS) proteins play crucial roles in fundamental biological processes in filamentous fungi. FlbA is the first studied filamentous fungal RGS protein, yet much remains to be understood about its roles in the opportunistic human pathogen Aspergillus fumigatus. In the present study, we examined the effects of the deletion of flbA using comprehensive analyses of the intra- and extracellular proteomes of A. fumigatus wild type and the flbA deletion mutant. Via MS analyses, we identified 10 proteins exhibiting high level accumulation in the flbA deletion mutant and 8 proteins differentially secreted in wild type and the flbA mutant. Based on proteomic analyses, we further examined the role of FlbA and found that FlbA down-regulates gliT expression and SOD activity. Our results proposed that FlbA-mediated signaling control negatively affects cellular responses associated with detoxification of reactive oxygen species and exogenous gliotoxin in A. fumigatus. Copyright © 2013 Elsevier B.V. All rights reserved.
Du, Zhe; Chen, Yinguang; Li, Xu
2017-10-15
Microbial degradation of estrogenic compounds can be affected by the nitrogen source and background carbon in the environment. However, the underlying mechanisms are not well understood. The objective of this study was to elucidate the molecular mechanisms of estrone (E1) biodegradation at the protein level under various background nitrogen (nitrate or ammonium) and carbon conditions (no background carbon, acetic acid, or humic acid as background carbon) by a newly isolated bacterial strain. The E1 degrading bacterial strain, Hydrogenophaga atypica ZD1, was isolated from river sediments and its proteome was characterized under various experimental conditions using quantitative proteomics. Results show that the E1 degradation rate was faster when ammonium was used as the nitrogen source than with nitrate. The degradation rate was also faster when either acetic acid or humic acid was present in the background. Proteomics analyses suggested that the E1 biodegradation products enter the tyrosine metabolism pathway. Compared to nitrate, ammonium likely promoted E1 degradation by increasing the activities of the branched-chain-amino-acid aminotransferase (IlvE) and enzymes involved in the glutamine synthetase-glutamine oxoglutarate aminotransferase (GS-GOGAT) pathway. The increased E1 degradation rate with acetic acid or humic acid in the background can also be attributed to the up-regulation of IlvE. Results from this study can help predict and explain E1 biodegradation kinetics under various environmental conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Proteomic Adaptations to Starvation Prepare Escherichia coli for Disinfection Tolerance
Du, Zhe; Nandakumar, Renu; Nickerson, Kenneth; Li, Xu
2015-01-01
Despite the low nutrient level and constant presence of secondary disinfectants, bacterial re-growth still occurs in drinking water distribution systems. The molecular mechanisms that starved bacteria use to survive low-level chlorine-based disinfectants are not well understood. The objective of this study is to investigate these molecular mechanisms at the protein level that prepare starved cells for disinfection tolerance. Two commonly used secondary disinfectants chlorine and monochloramine, both at 1 mg/L, were used in this study. The proteomes of normal and starved Escherichia coli (K12 MG1655) cells were studied using quantitative proteomics. Over 60-min disinfection, starved cells showed significantly higher disinfection tolerance than normal cells based on the inactivation curves for both chlorine and monochloramine. Proteomic analyses suggest that starvation may prepare cells for the oxidative stress that chlorine-based disinfection will cause by affecting glutathione metabolism. In addition, proteins involved in stress regulation and stress responses were among the ones up-regulated under both starvation and chlorine/monochloramine disinfection. By comparing the fold changes under different conditions, it is suggested that starvation prepares E. coli for disinfection tolerance by increasing the expression of enzymes that can help cells survive chlorine/monochloramine disinfection. Protein co-expression analyses show that proteins in glycolysis and pentose phosphate pathway that were up-regulated under starvation are also involved in disinfection tolerance. Finally, the production and detoxification of methylglyoxal may be involved in the chlorine-based disinfection and cell defense mechanisms. PMID:25463932
Duan, Xunbao; Zhong, Zhisheng; Potireddy, Santhi; Moncada, Camilo; Merali, Salim; Latham, Keith E.
2015-01-01
Embryos produced by somatic cell nuclear transfer (SCNT) display low term developmental potential. This is associated with deficiencies in spindle composition prior to activation and at early mitotic divisions, including failure to assemble certain proteins on the spindle. The protein-deficient spindles are accompanied by chromosome congression defects prior to activation and during the first mitotic divisions of the embryo. The molecular basis for these deficiencies and how they might be avoided are unknown. Proteomic analyses of spindles isolated from normal metaphase II (MII) stage oocytes and SCNT constructs, along with a systematic immunofluorescent survey of known spindle-associated proteins were undertaken. This was the first proteomics study of mammalian oocyte spindles. The study revealed four proteins as being deficient in spindles of SCNT embryos in addition to those previously identified; these were clathrin heavy chain (CLTC), aurora B kinase, dynactin 4, and casein kinase 1 alpha. Due to substantial reduction in CLTC abundance after spindle removal, we undertook functional studies to explore the importance of CLTC in oocyte spindle function and in chromosome congression defects of cloned embryos. Using siRNA knockdown we demonstrated an essential role for CLTC in chromosome congression during oocyte maturation. We also demonstrated rescue of chromosome congression defects in SCNT embryos at the first mitosis using CLTC mRNA injection. These studies are the first to employ proteomics analyses coupled to functional interventions to rescue a specific molecular defect in cloned embryos. PMID:20883044
Proteomic adaptations to starvation prepare Escherichia coli for disinfection tolerance.
Du, Zhe; Nandakumar, Renu; Nickerson, Kenneth W; Li, Xu
2015-02-01
Despite the low nutrient level and constant presence of secondary disinfectants, bacterial re-growth still occurs in drinking water distribution systems. The molecular mechanisms that starved bacteria use to survive low-level chlorine-based disinfectants are not well understood. The objective of this study is to investigate these molecular mechanisms at the protein level that prepare starved cells for disinfection tolerance. Two commonly used secondary disinfectants chlorine and monochloramine, both at 1 mg/L, were used in this study. The proteomes of normal and starved Escherichia coli (K12 MG1655) cells were studied using quantitative proteomics. Over 60-min disinfection, starved cells showed significantly higher disinfection tolerance than normal cells based on the inactivation curves for both chlorine and monochloramine. Proteomic analyses suggest that starvation may prepare cells for the oxidative stress that chlorine-based disinfection will cause by affecting glutathione metabolism. In addition, proteins involved in stress regulation and stress responses were among the ones up-regulated under both starvation and chlorine/monochloramine disinfection. By comparing the fold changes under different conditions, it is suggested that starvation prepares E. coli for disinfection tolerance by increasing the expression of enzymes that can help cells survive chlorine/monochloramine disinfection. Protein co-expression analyses show that proteins in glycolysis and pentose phosphate pathway that were up-regulated under starvation are also involved in disinfection tolerance. Finally, the production and detoxification of methylglyoxal may be involved in the chlorine-based disinfection and cell defense mechanisms. Copyright © 2014 Elsevier Ltd. All rights reserved.
Proteome Exploration to Provide a Resource for the Investigation of Ganoderma lucidum
Yu, Guo-Jun; Yin, Ya-Lin; Yu, Wen-Hui; Liu, Wei; Jin, Yan-Xia; Shrestha, Alok; Yang, Qing; Ye, Xiang-Dong; Sun, Hui
2015-01-01
Ganoderma lucidum is a basidiomycete white rot fungus that has been used for medicinal purposes worldwide. Although information concerning its genome and transcriptome has recently been reported, relatively little information is available for G. lucidum at the proteomic level. In this study, protein fractions from G. lucidum at three developmental stages (16-day mycelia, and fruiting bodies at 60 and 90 days) were prepared and subjected to LC-MS/MS analysis. A search against the G. lucidum genome database identified 803 proteins. Among these proteins, 61 lignocellulose degrading proteins were detected, most of which (49 proteins) were found in the 90-day fruiting bodies. Fourteen TCA-cycle related proteins, 17 peptidases, two argonaute-like proteins, and two immunomodulatory proteins were also detected. A majority (470) of the 803 proteins had GO annotations and were classified into 36 GO terms, with “binding”, “catalytic activity”, and “hydrolase activity” having high percentages. Additionally, 357 out of the 803 proteins were assigned to at least one COG functional category and grouped into 22 COG classifications. Based on the results from the proteomic and sequence alignment analyses, a potentially new immunomodulatory protein (GL18769) was expressed and shown to have high immunomodulatory activity. In this study, proteomic and biochemical analyses of G. lucidum were performed for the first time, revealing that proteins from this fungus can play significant bioactive roles and providing a new foundation for the further functional investigations that this fungus merits. PMID:25756518
Identification of lactoferricin B intracellular targets using an Escherichia coli proteome chip.
Tu, Yu-Hsuan; Ho, Yu-Hsuan; Chuang, Ying-Chih; Chen, Po-Chung; Chen, Chien-Sheng
2011-01-01
Lactoferricin B (LfcinB) is a well-known antimicrobial peptide. Several studies have indicated that it can inhibit bacteria by affecting intracellular activities, but the intracellular targets of this antimicrobial peptide have not been identified. Therefore, we used E. coli proteome chips to identify the intracellular target proteins of LfcinB in a high-throughput manner. We probed LfcinB with E. coli proteome chips and further conducted normalization and Gene Ontology (GO) analyses. The results of the GO analyses showed that the identified proteins were associated with metabolic processes. Moreover, we validated the interactions between LfcinB and chip assay-identified proteins with fluorescence polarization (FP) assays. Sixteen proteins were identified, and an E. coli interaction database (EcID) analysis revealed that the majority of the proteins that interact with these 16 proteins affected the tricarboxylic acid (TCA) cycle. Knockout assays were conducted to further validate the FP assay results. These results showed that phosphoenolpyruvate carboxylase was a target of LfcinB, indicating that one of its mechanisms of action may be associated with pyruvate metabolism. Thus, we used pyruvate assays to conduct an in vivo validation of the relationship between LfcinB and pyruvate level in E. coli. These results showed that E. coli exposed to LfcinB had abnormal pyruvate amounts, indicating that LfcinB caused an accumulation of pyruvate. In conclusion, this study successfully revealed the intracellular targets of LfcinB using an E. coli proteome chip approach.
Identification of Lactoferricin B Intracellular Targets Using an Escherichia coli Proteome Chip
Chen, Po-Chung; Chen, Chien-Sheng
2011-01-01
Lactoferricin B (LfcinB) is a well-known antimicrobial peptide. Several studies have indicated that it can inhibit bacteria by affecting intracellular activities, but the intracellular targets of this antimicrobial peptide have not been identified. Therefore, we used E. coli proteome chips to identify the intracellular target proteins of LfcinB in a high-throughput manner. We probed LfcinB with E. coli proteome chips and further conducted normalization and Gene Ontology (GO) analyses. The results of the GO analyses showed that the identified proteins were associated with metabolic processes. Moreover, we validated the interactions between LfcinB and chip assay-identified proteins with fluorescence polarization (FP) assays. Sixteen proteins were identified, and an E. coli interaction database (EcID) analysis revealed that the majority of the proteins that interact with these 16 proteins affected the tricarboxylic acid (TCA) cycle. Knockout assays were conducted to further validate the FP assay results. These results showed that phosphoenolpyruvate carboxylase was a target of LfcinB, indicating that one of its mechanisms of action may be associated with pyruvate metabolism. Thus, we used pyruvate assays to conduct an in vivo validation of the relationship between LfcinB and pyruvate level in E. coli. These results showed that E. coli exposed to LfcinB had abnormal pyruvate amounts, indicating that LfcinB caused an accumulation of pyruvate. In conclusion, this study successfully revealed the intracellular targets of LfcinB using an E. coli proteome chip approach. PMID:22164243
Alternative Splicing May Not Be the Key to Proteome Complexity.
Tress, Michael L; Abascal, Federico; Valencia, Alfonso
2017-02-01
Alternative splicing is commonly believed to be a major source of cellular protein diversity. However, although many thousands of alternatively spliced transcripts are routinely detected in RNA-seq studies, reliable large-scale mass spectrometry-based proteomics analyses identify only a small fraction of annotated alternative isoforms. The clearest finding from proteomics experiments is that most human genes have a single main protein isoform, while those alternative isoforms that are identified tend to be the most biologically plausible: those with the most cross-species conservation and those that do not compromise functional domains. Indeed, most alternative exons do not seem to be under selective pressure, suggesting that a large majority of predicted alternative transcripts may not even be translated into proteins. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Ultraweak photon emission and proteomics analyses in soybean under abiotic stress.
Komatsu, Setsuko; Kamal, Abu Hena Mostafa; Makino, Takahiro; Hossain, Zahed
2014-07-01
Biophotons are ultraweak photon emissions that are closely related to various biological activities and processes. In mammals, biophoton emissions originate from oxidative bursts in immunocytes during immunological responses. Biophotons emitted from plant organs provide novel information about the physiological state of plant under in vivo condition. In this review, the principles and recent advances in the measurement of biophoton emissions in plants are described. Furthermore, examples of biophoton emission and proteomics in soybean under abiotic stress are reviewed and discussed. Finally, this review suggests that the application of proteomics should provide a better interpretation of plant response to biophoton emission and allow the identification of genes that will allow the screening of crops able to produce maximal yields, even in stressful environments. Copyright © 2014 Elsevier B.V. All rights reserved.
Plant subcellular proteomics: Application for exploring optimal cell function in soybean.
Wang, Xin; Komatsu, Setsuko
2016-06-30
Plants have evolved complicated responses to developmental changes and stressful environmental conditions. Subcellular proteomics has the potential to elucidate localized cellular responses and investigate communications among subcellular compartments during plant development and in response to biotic and abiotic stresses. Soybean, which is a valuable legume crop rich in protein and vegetable oil, can grow in several climatic zones; however, the growth and yield of soybean are markedly decreased under stresses. To date, numerous proteomic studies have been performed in soybean to examine the specific protein profiles of cell wall, plasma membrane, nucleus, mitochondrion, chloroplast, and endoplasmic reticulum. In this review, methods for the purification and purity assessment of subcellular organelles from soybean are summarized. In addition, the findings from subcellular proteomic analyses of soybean during development and under stresses, particularly flooding stress, are presented and the proteins regulated among subcellular compartments are discussed. Continued advances in subcellular proteomics are expected to greatly contribute to the understanding of the responses and interactions that occur within and among subcellular compartments during development and under stressful environmental conditions. Subcellular proteomics has the potential to investigate the cellular events and interactions among subcellular compartments in response to development and stresses in plants. Soybean could grow in several climatic zones; however, the growth and yield of soybean are markedly decreased under stresses. Numerous proteomics of cell wall, plasma membrane, nucleus, mitochondrion, chloroplast, and endoplasmic reticulum was carried out to investigate the respecting proteins and their functions in soybean during development or under stresses. In this review, methods of subcellular-organelle enrichment and purity assessment are summarized. In addition, previous findings of subcellular proteomics are presented, and functional proteins regulated among different subcellular are discussed. Subcellular proteomics contributes greatly to uncovering responses and interactions among subcellular compartments during development and under stressful environmental conditions in soybean. Copyright © 2016 Elsevier B.V. All rights reserved.
Woo, Jongmin; Han, Dohyun; Wang, Joseph Injae; Park, Joonho; Kim, Hyunsoo; Kim, Youngsoo
2017-09-01
The development of systematic proteomic quantification techniques in systems biology research has enabled one to perform an in-depth analysis of cellular systems. We have developed a systematic proteomic approach that encompasses the spectrum from global to targeted analysis on a single platform. We have applied this technique to an activated microglia cell system to examine changes in the intracellular and extracellular proteomes. Microglia become activated when their homeostatic microenvironment is disrupted. There are varying degrees of microglial activation, and we chose to focus on the proinflammatory reactive state that is induced by exposure to such stimuli as lipopolysaccharide (LPS) and interferon-gamma (IFN-γ). Using an improved shotgun proteomics approach, we identified 5497 proteins in the whole-cell proteome and 4938 proteins in the secretome that were associated with the activation of BV2 mouse microglia by LPS or IFN-γ. Of the differentially expressed proteins in stimulated microglia, we classified pathways that were related to immune-inflammatory responses and metabolism. Our label-free parallel reaction monitoring (PRM) approach made it possible to comprehensively measure the hyper-multiplex quantitative value of each protein by high-resolution mass spectrometry. Over 450 peptides that corresponded to pathway proteins and direct or indirect interactors via the STRING database were quantified by label-free PRM in a single run. Moreover, we performed a longitudinal quantification of secreted proteins during microglial activation, in which neurotoxic molecules that mediate neuronal cell loss in the brain are released. These data suggest that latent pathways that are associated with neurodegenerative diseases can be discovered by constructing and analyzing a pathway network model of proteins. Furthermore, this systematic quantification platform has tremendous potential for applications in large-scale targeted analyses. The proteomics data for discovery and label-free PRM analysis have been deposited to the ProteomeXchange Consortium with identifiers
Wiśniewski, Jacek R; Mann, Matthias
2016-07-01
Proteomics and other protein-based analysis methods such as Western blotting all face the challenge of discriminating changes in the levels of proteins of interest from inadvertent changes in the amount loaded for analysis. Mass-spectrometry-based proteomics can now estimate the relative and absolute amounts of thousands of proteins across diverse biological systems. We reasoned that this new technology could prove useful for selection of very stably expressed proteins that could serve as better loading controls than those traditionally employed. Large-scale proteomic analyses of SDS lysates of cultured cells and tissues revealed deglycase DJ-1 as the protein with the lowest variability in abundance among different cell types in human, mouse, and amphibian cells. The protein constitutes 0.069 ± 0.017% of total cellular protein and occurs at a specific concentration of 34.6 ± 8.7 pmol/mg of total protein. Since DJ-1 is ubiquitous and therefore easily detectable with several peptides, it can be helpful in normalization of proteomic data sets. In addition, DJ-1 appears to be an advantageous loading control for Western blot that is superior to those used commonly used, allowing comparisons between tissues and cells originating from evolutionarily distant vertebrate species. Notably, this is not possible by the detection and quantitation of housekeeping proteins, which are often used in the Western blot technique. The approach introduced here can be applied to select the most appropriate loading controls for MS-based proteomics or Western blotting in any biological system.
Lim, Sanghyun; Borza, Tudor; Peters, Rick D; Coffin, Robert H; Al-Mughrabi, Khalil I; Pinto, Devanand M; Wang-Pruski, Gefu
2013-11-20
Phosphite (salts of phosphorous acid; Phi)-based fungicides are increasingly used in controlling oomycete pathogens, such as the late blight agent Phytophthora infestans. In plants, low amounts of Phi induce pathogen resistance through an indirect mode of action. We used iTRAQ-based quantitative proteomics to investigate the effects of phosphite on potato plants before and after infection with P. infestans. Ninety-three (62 up-regulated and 31 down-regulated) differentially regulated proteins, from a total of 1172 reproducibly identified proteins, were identified in the leaf proteome of Phi-treated potato plants. Four days post-inoculation with P. infestans, 16 of the 31 down-regulated proteins remained down-regulated and 42 of the 62 up-regulated proteins remained up-regulated, including 90% of the defense proteins. This group includes pathogenesis-related, stress-responsive, and detoxification-related proteins. Callose deposition and ultrastructural analyses of leaf tissues after infection were used to complement the proteomics approach. This study represents the first comprehensive proteomics analysis of the indirect mode of action of Phi, demonstrating broad effects on plant defense and plant metabolism. The proteomics data and the microscopy study suggest that Phi triggers a hypersensitive response that is responsible for induced resistance of potato leaves against P. infestans. Phosphie triggers complex functional changes in potato leaves that are responsible for the induced resistance against Phytophthora infestans. This article is part of a Special Issue entitled: Translational Plant Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.
The Human Skeletal Muscle Proteome Project: a reappraisal of the current literature
Gonzalez‐Freire, Marta; Semba, Richard D.; Ubaida‐Mohien, Ceereena; Fabbri, Elisa; Scalzo, Paul; Højlund, Kurt; Dufresne, Craig; Lyashkov, Alexey
2016-01-01
Abstract Skeletal muscle is a large organ that accounts for up to half the total mass of the human body. A progressive decline in muscle mass and strength occurs with ageing and in some individuals configures the syndrome of ‘sarcopenia’, a condition that impairs mobility, challenges autonomy, and is a risk factor for mortality. The mechanisms leading to sarcopenia as well as myopathies are still little understood. The Human Skeletal Muscle Proteome Project was initiated with the aim to characterize muscle proteins and how they change with ageing and disease. We conducted an extensive review of the literature and analysed publically available protein databases. A systematic search of peer‐reviewed studies was performed using PubMed. Search terms included ‘human’, ‘skeletal muscle’, ‘proteome’, ‘proteomic(s)’, and ‘mass spectrometry’, ‘liquid chromatography‐mass spectrometry (LC‐MS/MS)’. A catalogue of 5431 non‐redundant muscle proteins identified by mass spectrometry‐based proteomics from 38 peer‐reviewed scientific publications from 2002 to November 2015 was created. We also developed a nosology system for the classification of muscle proteins based on localization and function. Such inventory of proteins should serve as a useful background reference for future research on changes in muscle proteome assessed by quantitative mass spectrometry‐based proteomic approaches that occur with ageing and diseases. This classification and compilation of the human skeletal muscle proteome can be used for the identification and quantification of proteins in skeletal muscle to discover new mechanisms for sarcopenia and specific muscle diseases that can be targeted for the prevention and treatment. PMID:27897395
PrePhyloPro: phylogenetic profile-based prediction of whole proteome linkages
Niu, Yulong; Liu, Chengcheng; Moghimyfiroozabad, Shayan; Yang, Yi
2017-01-01
Direct and indirect functional links between proteins as well as their interactions as part of larger protein complexes or common signaling pathways may be predicted by analyzing the correlation of their evolutionary patterns. Based on phylogenetic profiling, here we present a highly scalable and time-efficient computational framework for predicting linkages within the whole human proteome. We have validated this method through analysis of 3,697 human pathways and molecular complexes and a comparison of our results with the prediction outcomes of previously published co-occurrency model-based and normalization methods. Here we also introduce PrePhyloPro, a web-based software that uses our method for accurately predicting proteome-wide linkages. We present data on interactions of human mitochondrial proteins, verifying the performance of this software. PrePhyloPro is freely available at http://prephylopro.org/phyloprofile/. PMID:28875072
Cambiaghi, Alice; Díaz, Ramón; Martinez, Julia Bauzá; Odena, Antonia; Brunelli, Laura; Caironi, Pietro; Masson, Serge; Baselli, Giuseppe; Ristagno, Giuseppe; Gattinoni, Luciano; de Oliveira, Eliandre; Pastorelli, Roberta; Ferrario, Manuela
2018-04-27
In this work, we examined plasma metabolome, proteome and clinical features in patients with severe septic shock enrolled in the multicenter ALBIOS study. The objective was to identify changes in the levels of metabolites involved in septic shock progression and to integrate this information with the variation occurring in proteins and clinical data. Mass spectrometry-based targeted metabolomics and untargeted proteomics allowed us to quantify absolute metabolites concentration and relative proteins abundance. We computed the ratio D7/D1 to take into account their variation from day 1 (D1) to day 7 (D7) after shock diagnosis. Patients were divided into two groups according to 28-day mortality. Three different elastic net logistic regression models were built: one on metabolites only, one on metabolites and proteins and one to integrate metabolomics and proteomics data with clinical parameters. Linear discriminant analysis and Partial least squares Discriminant Analysis were also implemented. All the obtained models correctly classified the observations in the testing set. By looking at the variable importance (VIP) and the selected features, the integration of metabolomics with proteomics data showed the importance of circulating lipids and coagulation cascade in septic shock progression, thus capturing a further layer of biological information complementary to metabolomics information.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stekhoven, Daniel J.; Omasits, Ulrich; Quebatte, Maxime
2014-03-01
Proteomics data provide unique insights into biological systems, including the predominant subcellular localization (SCL) of proteins, which can reveal important clues about their functions. Here we analyzed data of a complete prokaryotic proteome expressed under two conditions mimicking interaction of the emerging pathogen Bartonella henselae with its mammalian host. Normalized spectral count data from cytoplasmic, total membrane, inner and outer membrane fractions allowed us to identify the predominant SCL for 82% of the identified proteins. The spectral count proportion of total membrane versus cytoplasmic fractions indicated the propensity of cytoplasmic proteins to co-fractionate with the inner membrane, and enabled usmore » to distinguish cytoplasmic, peripheral innermembrane and bona fide inner membrane proteins. Principal component analysis and k-nearest neighbor classification training on selected marker proteins or predominantly localized proteins, allowed us to determine an extensive catalog of at least 74 expressed outer membrane proteins, and to extend the SCL assignment to 94% of the identified proteins, including 18% where in silico methods gave no prediction. Suitable experimental proteomics data combined with straightforward computational approaches can thus identify the predominant SCL on a proteome-wide scale. Finally, we present a conceptual approach to identify proteins potentially changing their SCL in a condition-dependent fashion.« less
Using Big Data to Discover Diagnostics and Therapeutics for Gastrointestinal and Liver Diseases.
Wooden, Benjamin; Goossens, Nicolas; Hoshida, Yujin; Friedman, Scott L
2017-01-01
Technologies such as genome sequencing, gene expression profiling, proteomic and metabolomic analyses, electronic medical records, and patient-reported health information have produced large amounts of data from various populations, cell types, and disorders (big data). However, these data must be integrated and analyzed if they are to produce models or concepts about physiological function or mechanisms of pathogenesis. Many of these data are available to the public, allowing researchers anywhere to search for markers of specific biological processes or therapeutic targets for specific diseases or patient types. We review recent advances in the fields of computational and systems biology and highlight opportunities for researchers to use big data sets in the fields of gastroenterology and hepatology to complement traditional means of diagnostic and therapeutic discovery. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.
Truong, D-H; Bauwens, J; Delaplace, P; Mazzucchelli, G; Lognay, G; Francis, F
2015-11-01
Herbivorous insects can cause severe cellular changes to plant foliage following infestations, depending on feeding behaviour. Here, a proteomic study was conducted to investigate the influence of green peach aphid (Myzus persicae Sulzer) as a polyphagous pest on the defence response of Arabidopsis thaliana (L.) Heynh after aphid colony establishment on the host plant (3 days). Analysis of about 574 protein spots on 2-DE gels revealed 31 differentially expressed protein spots. Twenty out of these 31 differential proteins were selected for analysis by mass spectrometry. In 12 of the 20 analysed spots, we identified seven and nine proteins using MALDI-TOF-MS and LC-ESI-MS/MS, respectively. Of the analysed spots, 25% contain two proteins. Different metabolic pathways were modulated in Arabidopsis leaves according to aphid feeding: most corresponded to carbohydrate, amino acid and energy metabolism, photosynthesis, defence response and translation. This paper has established a survey of early alterations induced in the proteome of Arabidopsis by M. persicae aphids. It provides valuable insights into the complex responses of plants to biological stress, particularly for herbivorous insects with sucking feeding behaviour. © 2015 German Botanical Society and The Royal Botanical Society of the Netherlands.
2-DE analysis indicates that Acinetobacter baumannii displays a robust and versatile metabolism
Soares, Nelson C; Cabral, Maria P; Parreira, José R; Gayoso, Carmen; Barba, Maria J; Bou, Germán
2009-01-01
Background Acinetobacter baumannii is a nosocomial pathogen that has been associated with outbreak infections in hospitals. Despite increasing awareness about this bacterium, its proteome remains poorly characterised, however recently the complete genome of A. baumannii reference strain ATCC 17978 has been sequenced. Here, we have used 2-DE and MALDI-TOF/TOF approach to characterise the proteome of this strain. Results The membrane and cytoplasmatic protein extracts were analysed separately, these analyses revealed the reproducible presence of 239 and 511 membrane and cytoplamatic protein spots, respectively. MALDI-TOF/TOF characterisation identified a total of 192 protein spots (37 membrane and 155 cytoplasmatic) and revealed that the identified membrane proteins were mainly transport-related proteins, whereas the cytoplasmatic proteins were of diverse nature, although mainly related to metabolic processes. Conclusion This work indicates that A. baumannii has a versatile and robust metabolism and also reveal a number of proteins that may play a key role in the mechanism of drug resistance and virulence. The data obtained complements earlier reports of A. baumannii proteome and provides new tools to increase our knowledge on the protein expression profile of this pathogen. PMID:19785748
Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
Polanski, Andrzej; Marczyk, Michal; Pietrowska, Monika; Widlak, Piotr; Polanska, Joanna
2015-01-01
Mixture - modeling of mass spectra is an approach with many potential applications including peak detection and quantification, smoothing, de-noising, feature extraction and spectral signal compression. However, existing algorithms do not allow for automated analyses of whole spectra. Therefore, despite highlighting potential advantages of mixture modeling of mass spectra of peptide/protein mixtures and some preliminary results presented in several papers, the mixture modeling approach was so far not developed to the stage enabling systematic comparisons with existing software packages for proteomic mass spectra analyses. In this paper we present an efficient algorithm for Gaussian mixture modeling of proteomic mass spectra of different types (e.g., MALDI-ToF profiling, MALDI-IMS). The main idea is automated partitioning of protein mass spectral signal into fragments. The obtained fragments are separately decomposed into Gaussian mixture models. The parameters of the mixture models of fragments are then aggregated to form the mixture model of the whole spectrum. We compare the elaborated algorithm to existing algorithms for peak detection and we demonstrate improvements of peak detection efficiency obtained by using Gaussian mixture modeling. We also show applications of the elaborated algorithm to real proteomic datasets of low and high resolution. PMID:26230717
Proteomic analysis of broccoli (Brassica oleracea) under high temperature and waterlogging stresses.
Lin, Hsin-Hung; Lin, Kuan-Hung; Chen, Su-Ching; Shen, Yu-Hsing; Lo, Hsiao-Feng
2015-12-01
The production of broccoli (Brassica oleracea) is largely reduced by waterlogging and high temperature stresses. Heat-tolerant and heat-susceptible broccoli cultivars TSS-AVRDC-2 and B-75, respectively, were used for physiological and proteomic analyses. The objective of this study was to identify TSS-AVRDC-2 and B-75 proteins differentially regulated at different time periods in response to waterlogging at 40 °C for three days. TSS-AVRDC-2 exhibited significantly higher chlorophyll content, lower stomatal conductance, and better H 2 O 2 scavenging under stress in comparison to B-75. Two-dimensional liquid phase fractionation analyses revealed that Rubisco proteins in both varieties were regulated under stressing treatments, and that TSS-AVRDC-2 had higher levels of both Rubisco large and small subunit transcripts than B-75 when subjected to high temperature and/or waterlogging. This report utilizes physiological and proteomic approaches to discover changes in the protein expression profiles of broccoli in response to heat and waterlogging stresses. Higher levels of Rubisco proteins in TSS-AVRDC-2 could lead to increased carbon fixation efficiency to provide sufficient energy to enable stress tolerance under waterlogging at 40 °C.
GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data.
Carvalho, Paulo C; Fischer, Juliana Sg; Chen, Emily I; Domont, Gilberto B; Carvalho, Maria Gc; Degrave, Wim M; Yates, John R; Barbosa, Valmir C
2009-02-24
Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. Here we present a new algorithm, termed GO Explorer (GOEx), that leverages the gene ontology (GO) to aid in the interpretation of proteomic data. GOEx stands out because it combines data from protein fold changes with GO over-representation statistics to help draw conclusions. Moreover, it is tightly integrated within the PatternLab for Proteomics project and, thus, lies within a complete computational environment that provides parsers and pattern recognition tools designed for spectral counting. GOEx offers three independent methods to query data: an interactive directed acyclic graph, a specialist mode where key words can be searched, and an automatic search. Its usefulness is demonstrated by applying it to help interpret the effects of perillyl alcohol, a natural chemotherapeutic agent, on glioblastoma multiform cell lines (A172). We used a new multi-surfactant shotgun proteomic strategy and identified more than 2600 proteins; GOEx pinpointed key sets of differentially expressed proteins related to cell cycle, alcohol catabolism, the Ras pathway, apoptosis, and stress response, to name a few. GOEx facilitates organism-specific studies by leveraging GO and providing a rich graphical user interface. It is a simple to use tool, specialized for biologists who wish to analyze spectral counting data from shotgun proteomics. GOEx is available at http://pcarvalho.com/patternlab.
Zhong, Qing; Guo, Tiannan; Rechsteiner, Markus; Rüschoff, Jan H.; Rupp, Niels; Fankhauser, Christian; Saba, Karim; Mortezavi, Ashkan; Poyet, Cédric; Hermanns, Thomas; Zhu, Yi; Moch, Holger; Aebersold, Ruedi; Wild, Peter J.
2017-01-01
Microscopy image data of human cancers provide detailed phenotypes of spatially and morphologically intact tissues at single-cell resolution, thus complementing large-scale molecular analyses, e.g., next generation sequencing or proteomic profiling. Here we describe a high-resolution tissue microarray (TMA) image dataset from a cohort of 71 prostate tissue samples, which was hybridized with bright-field dual colour chromogenic and silver in situ hybridization probes for the tumour suppressor gene PTEN. These tissue samples were digitized and supplemented with expert annotations, clinical information, statistical models of PTEN genetic status, and computer source codes. For validation, we constructed an additional TMA dataset for 424 prostate tissues, hybridized with FISH probes for PTEN, and performed survival analysis on a subset of 339 radical prostatectomy specimens with overall, disease-specific and recurrence-free survival (maximum 167 months). For application, we further produced 6,036 image patches derived from two whole slides. Our curated collection of prostate cancer data sets provides reuse potential for both biomedical and computational studies. PMID:28291248
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Wiberg, Holli K.; Matzke, Melissa M.
In this review, we apply selected imputation strategies to label-free liquid chromatography–mass spectrometry (LC–MS) proteomics datasets to evaluate the accuracy with respect to metrics of variance and classification. We evaluate several commonly used imputation approaches for individual merits and discuss the caveats of each approach with respect to the example LC–MS proteomics data. In general, local similarity-based approaches, such as the regularized expectation maximization and least-squares adaptive algorithms, yield the best overall performances with respect to metrics of accuracy and robustness. However, no single algorithm consistently outperforms the remaining approaches, and in some cases, performing classification without imputation sometimes yieldedmore » the most accurate classification. Thus, because of the complex mechanisms of missing data in proteomics, which also vary from peptide to protein, no individual method is a single solution for imputation. In summary, on the basis of the observations in this review, the goal for imputation in the field of computational proteomics should be to develop new approaches that work generically for this data type and new strategies to guide users in the selection of the best imputation for their dataset and analysis objectives.« less
The amino acid's backup bone - storage solutions for proteomics facilities.
Meckel, Hagen; Stephan, Christian; Bunse, Christian; Krafzik, Michael; Reher, Christopher; Kohl, Michael; Meyer, Helmut Erich; Eisenacher, Martin
2014-01-01
Proteomics methods, especially high-throughput mass spectrometry analysis have been continually developed and improved over the years. The analysis of complex biological samples produces large volumes of raw data. Data storage and recovery management pose substantial challenges to biomedical or proteomic facilities regarding backup and archiving concepts as well as hardware requirements. In this article we describe differences between the terms backup and archive with regard to manual and automatic approaches. We also introduce different storage concepts and technologies from transportable media to professional solutions such as redundant array of independent disks (RAID) systems, network attached storages (NAS) and storage area network (SAN). Moreover, we present a software solution, which we developed for the purpose of long-term preservation of large mass spectrometry raw data files on an object storage device (OSD) archiving system. Finally, advantages, disadvantages, and experiences from routine operations of the presented concepts and technologies are evaluated and discussed. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013. Published by Elsevier B.V.
Bromilow, Sophie; Gethings, Lee A; Buckley, Mike; Bromley, Mike; Shewry, Peter R; Langridge, James I; Clare Mills, E N
2017-06-23
The unique physiochemical properties of wheat gluten enable a diverse range of food products to be manufactured. However, gluten triggers coeliac disease, a condition which is treated using a gluten-free diet. Analytical methods are required to confirm if foods are gluten-free, but current immunoassay-based methods can unreliable and proteomic methods offer an alternative but require comprehensive and well annotated sequence databases which are lacking for gluten. A manually a curated database (GluPro V1.0) of gluten proteins, comprising 630 discrete unique full length protein sequences has been compiled. It is representative of the different types of gliadin and glutenin components found in gluten. An in silico comparison of their coeliac toxicity was undertaken by analysing the distribution of coeliac toxic motifs. This demonstrated that whilst the α-gliadin proteins contained more toxic motifs, these were distributed across all gluten protein sub-types. Comparison of annotations observed using a discovery proteomics dataset acquired using ion mobility MS/MS showed that more reliable identifications were obtained using the GluPro V1.0 database compared to the complete reviewed Viridiplantae database. This highlights the value of a curated sequence database specifically designed to support the proteomic workflows and the development of methods to detect and quantify gluten. We have constructed the first manually curated open-source wheat gluten protein sequence database (GluPro V1.0) in a FASTA format to support the application of proteomic methods for gluten protein detection and quantification. We have also analysed the manually verified sequences to give the first comprehensive overview of the distribution of sequences able to elicit a reaction in coeliac disease, the prevalent form of gluten intolerance. Provision of this database will improve the reliability of gluten protein identification by proteomic analysis, and aid the development of targeted mass spectrometry methods in line with Codex Alimentarius Commission requirements for foods designed to meet the needs of gluten intolerant individuals. Copyright © 2017. Published by Elsevier B.V.
Whittington, Emma; Zhao, Qian; Borziak, Kirill; Walters, James R; Dorus, Steve
2015-07-01
The application of mass spectrometry based proteomics to sperm biology has greatly accelerated progress in understanding the molecular composition and function of spermatozoa. To date, these approaches have been largely restricted to model organisms, all of which produce a single sperm morph capable of oocyte fertilisation. Here we apply high-throughput mass spectrometry proteomic analysis to characterise sperm composition in Manduca sexta, the tobacco hornworm moth, which produce heteromorphic sperm, including one fertilisation competent (eupyrene) and one incompetent (apyrene) sperm type. This resulted in the high confidence identification of 896 proteins from a co-mixed sample of both sperm types, of which 167 are encoded by genes with strict one-to-one orthology in Drosophila melanogaster. Importantly, over half (55.1%) of these orthologous proteins have previously been identified in the D. melanogaster sperm proteome and exhibit significant conservation in quantitative protein abundance in sperm between the two species. Despite the complex nature of gene expression across spermatogenic stages, a significant correlation was also observed between sperm protein abundance and testis gene expression. Lepidopteran-specific sperm proteins (e.g., proteins with no homology to proteins in non-Lepidopteran taxa) were present in significantly greater abundance on average than those with homology outside the Lepidoptera. Given the disproportionate production of apyrene sperm (96% of all mature sperm in Manduca) relative to eupyrene sperm, these evolutionarily novel and highly abundant proteins are candidates for possessing apyrene-specific functions. Lastly, comparative genomic analyses of testis-expressed, ovary-expressed and sperm genes identified a concentration of novel sperm proteins shared amongst Lepidoptera of potential relevance to the evolutionary origin of heteromorphic spermatogenesis. As the first published Lepidopteran sperm proteome, this whole-cell proteomic characterisation will facilitate future evolutionary genetic and developmental studies of heteromorphic sperm production and parasperm function. Furthermore, the analyses presented here provide useful annotation information regarding sex-biased gene expression, novel Lepidopteran genes and gene function in the male gamete to complement the newly sequenced and annotated Manduca genome. Copyright © 2015 Elsevier Ltd. All rights reserved.
Proteomic profiling of mitochondria: what does it tell us about the ageing brain?
Ingram, Thomas; Chakrabarti, Lisa
2016-12-13
Mitochondrial dysfunction is evident in numerous neurodegenerative and age-related disorders. It has also been linked to cellular ageing, however our current understanding of the mitochondrial changes that occur are unclear. Functional studies have made some progress reporting reduced respiration, dynamic structural modifications and loss of membrane potential, though there are conflicts within these findings. Proteomic analyses, together with functional studies, are required in order to profile the mitochondrial changes that occur with age and can contribute to unravelling the complexity of the ageing phenotype. The emergence of improved protein separation techniques, combined with mass spectrometry analyses has allowed the identification of age and cell-type specific mitochondrial changes in energy metabolism, antioxidants, fusion and fission machinery, chaperones, membrane proteins and biosynthesis pathways. Here, we identify and review recent data from the analyses of mitochondria from rodent brains. It is expected that knowledge gained from understanding age-related mitochondrial changes of the brain should lead to improved biomarkers of normal ageing and also age-related disease progression.
Atanassov, Ilian; Kuznetsova, Irina; Hinze, Yvonne; Mourier, Arnaud; Filipovska, Aleksandra
2017-01-01
Dysfunction of the oxidative phosphorylation (OXPHOS) system is a major cause of human disease and the cellular consequences are highly complex. Here, we present comparative analyses of mitochondrial proteomes, cellular transcriptomes and targeted metabolomics of five knockout mouse strains deficient in essential factors required for mitochondrial DNA gene expression, leading to OXPHOS dysfunction. Moreover, we describe sequential protein changes during post-natal development and progressive OXPHOS dysfunction in time course analyses in control mice and a middle lifespan knockout, respectively. Very unexpectedly, we identify a new response pathway to OXPHOS dysfunction in which the intra-mitochondrial synthesis of coenzyme Q (ubiquinone, Q) and Q levels are profoundly decreased, pointing towards novel possibilities for therapy. Our extensive omics analyses provide a high-quality resource of altered gene expression patterns under severe OXPHOS deficiency comparing several mouse models, that will deepen our understanding, open avenues for research and provide an important reference for diagnosis and treatment. PMID:29132502
NASA Astrophysics Data System (ADS)
Khatri, Kshitij; Pu, Yi; Klein, Joshua A.; Wei, Juan; Costello, Catherine E.; Lin, Cheng; Zaia, Joseph
2018-04-01
Analysis of singly glycosylated peptides has evolved to a point where large-scale LC-MS analyses can be performed at almost the same scale as proteomics experiments. While collisionally activated dissociation (CAD) remains the mainstay of bottom-up analyses, it performs poorly for the middle-down analysis of multiply glycosylated peptides. With improvements in instrumentation, electron-activated dissociation (ExD) modes are becoming increasingly prevalent for proteomics experiments and for the analysis of fragile modifications such as glycosylation. While these methods have been applied for glycopeptide analysis in isolated studies, an organized effort to compare their efficiencies, particularly for analysis of multiply glycosylated peptides (termed here middle-down glycoproteomics), has not been made. We therefore compared the performance of different ExD modes for middle-down glycopeptide analyses. We identified key features among the different dissociation modes and show that increased electron energy and supplemental activation provide the most useful data for middle-down glycopeptide analysis. [Figure not available: see fulltext.
Proteomic Profiling of Skin Fibroblasts as a Model of Schizophrenia.
Wang, Lan; Rahmoune, Hassan; Guest, Paul C
2017-01-01
Since many aspects of schizophrenia are also manifested at the peripheral level in proliferating cell types, this chapter describes the analysis of skin fibroblasts biopsied from living patients. The method focuses on cell culture and sample preparation for characterization of the model. The resulting cell extracts can be analysed by any number of proteomic techniques for identification of biomarker candidates. This approach could help to elucidate the molecular mechanisms associated with the pathophysiology of schizophrenia and provide a useful model for a new target and drug discovery.
LaCava, John; Molloy, Kelly R.; Taylor, Martin S.; Domanski, Michal; Chait, Brian T.; Rout, Michael P.
2015-01-01
Dissecting and studying cellular systems requires the ability to specifically isolate distinct proteins along with the co-assembled constituents of their associated complexes. Affinity capture techniques leverage high affinity, high specificity reagents to target and capture proteins of interest along with specifically associated proteins from cell extracts. Affinity capture coupled to mass spectrometry (MS)-based proteomic analyses has enabled the isolation and characterization of a wide range of endogenous protein complexes. Here, we outline effective procedures for the affinity capture of protein complexes, highlighting best practices and common pitfalls. PMID:25757543
DOE Office of Scientific and Technical Information (OSTI.GOV)
Merkley, Eric D.; Sego, Landon H.; Lin, Andy
Adaptive processes in bacterial species can occur rapidly in laboratory culture, leading to genetic divergence between naturally occurring and laboratory-adapted strains. Differentiating wild and closely-related laboratory strains is clearly important for biodefense and bioforensics; however, DNA sequence data alone has thus far not provided a clear signature, perhaps due to lack of understanding of how diverse genome changes lead to adapted phenotypes. Protein abundance profiles from mass spectrometry-based proteomics analyses are a molecular measure of phenotype. Proteomics data contains sufficient information that powerful statistical methods can uncover signatures that distinguish wild strains of Yersinia pestis from laboratory-adapted strains.
Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.
Dowsey, Andrew W; Dunn, Michael J; Yang, Guang-Zhong
2008-04-01
The quest for high-throughput proteomics has revealed a number of challenges in recent years. Whilst substantial improvements in automated protein separation with liquid chromatography and mass spectrometry (LC/MS), aka 'shotgun' proteomics, have been achieved, large-scale open initiatives such as the Human Proteome Organization (HUPO) Brain Proteome Project have shown that maximal proteome coverage is only possible when LC/MS is complemented by 2D gel electrophoresis (2-DE) studies. Moreover, both separation methods require automated alignment and differential analysis to relieve the bioinformatics bottleneck and so make high-throughput protein biomarker discovery a reality. The purpose of this article is to describe a fully automatic image alignment framework for the integration of 2-DE into a high-throughput differential expression proteomics pipeline. The proposed method is based on robust automated image normalization (RAIN) to circumvent the drawbacks of traditional approaches. These use symbolic representation at the very early stages of the analysis, which introduces persistent errors due to inaccuracies in modelling and alignment. In RAIN, a third-order volume-invariant B-spline model is incorporated into a multi-resolution schema to correct for geometric and expression inhomogeneity at multiple scales. The normalized images can then be compared directly in the image domain for quantitative differential analysis. Through evaluation against an existing state-of-the-art method on real and synthetically warped 2D gels, the proposed analysis framework demonstrates substantial improvements in matching accuracy and differential sensitivity. High-throughput analysis is established through an accelerated GPGPU (general purpose computation on graphics cards) implementation. Supplementary material, software and images used in the validation are available at http://www.proteomegrid.org/rain/.
MASH Suite Pro: A Comprehensive Software Tool for Top-Down Proteomics*
Cai, Wenxuan; Guner, Huseyin; Gregorich, Zachery R.; Chen, Albert J.; Ayaz-Guner, Serife; Peng, Ying; Valeja, Santosh G.; Liu, Xiaowen; Ge, Ying
2016-01-01
Top-down mass spectrometry (MS)-based proteomics is arguably a disruptive technology for the comprehensive analysis of all proteoforms arising from genetic variation, alternative splicing, and posttranslational modifications (PTMs). However, the complexity of top-down high-resolution mass spectra presents a significant challenge for data analysis. In contrast to the well-developed software packages available for data analysis in bottom-up proteomics, the data analysis tools in top-down proteomics remain underdeveloped. Moreover, despite recent efforts to develop algorithms and tools for the deconvolution of top-down high-resolution mass spectra and the identification of proteins from complex mixtures, a multifunctional software platform, which allows for the identification, quantitation, and characterization of proteoforms with visual validation, is still lacking. Herein, we have developed MASH Suite Pro, a comprehensive software tool for top-down proteomics with multifaceted functionality. MASH Suite Pro is capable of processing high-resolution MS and tandem MS (MS/MS) data using two deconvolution algorithms to optimize protein identification results. In addition, MASH Suite Pro allows for the characterization of PTMs and sequence variations, as well as the relative quantitation of multiple proteoforms in different experimental conditions. The program also provides visualization components for validation and correction of the computational outputs. Furthermore, MASH Suite Pro facilitates data reporting and presentation via direct output of the graphics. Thus, MASH Suite Pro significantly simplifies and speeds up the interpretation of high-resolution top-down proteomics data by integrating tools for protein identification, quantitation, characterization, and visual validation into a customizable and user-friendly interface. We envision that MASH Suite Pro will play an integral role in advancing the burgeoning field of top-down proteomics. PMID:26598644
Uddin, Reaz; Jamil, Faiza
2018-06-01
Pseudomonas aeruginosa is an opportunistic gram-negative bacterium that has the capability to acquire resistance under hostile conditions and become a threat worldwide. It is involved in nosocomial infections. In the current study, potential novel drug targets against P. aeruginosa have been identified using core proteomic analysis and Protein-Protein Interactions (PPIs) studies. The non-redundant reference proteome of 68 strains having complete genome and latest assembly version of P. aeruginosa were downloaded from ftp NCBI RefSeq server in October 2016. The standalone CD-HIT tool was used to cluster ortholog proteins (having >=80% amino acid identity) present in all strains. The pan-proteome was clustered in 12,380 Clusters of Orthologous Proteins (COPs). By using in-house shell scripts, 3252 common COPs were extracted out and designated as clusters of core proteome. The core proteome of PAO1 strain was selected by fetching PAO1's proteome from common COPs. As a result, 1212 proteins were shortlisted that are non-homologous to the human but essential for the survival of the pathogen. Among these 1212 proteins, 321 proteins are conserved hypothetical proteins. Considering their potential as drug target, those 321 hypothetical proteins were selected and their probable functions were characterized. Based on the druggability criteria, 18 proteins were shortlisted. The interacting partners were identified by investigating the PPIs network using STRING v10 database. Subsequently, 8 proteins were shortlisted as 'hub proteins' and proposed as potential novel drug targets against P. aeruginosa. The study is interesting for the scientific community working to identify novel drug targets against MDR pathogens particularly P. aeruginosa. Copyright © 2018 Elsevier Ltd. All rights reserved.
Macdonald, Lucy J.; Adams, Paul D.; Petzold, Christopher J.; Strabala, Timothy J.; Wagner, Armin; Heazlewood, Joshua L.
2013-01-01
Our understanding of the contribution of Golgi proteins to cell wall and wood formation in any woody plant species is limited. Currently, little Golgi proteomics data exists for wood-forming tissues. In this study, we attempted to address this issue by generating and analyzing Golgi-enriched membrane preparations from developing xylem of compression wood from the conifer Pinus radiata. Developing xylem samples from 3-year-old pine trees were harvested for this purpose at a time of active growth and subjected to a combination of density centrifugation followed by free flow electrophoresis, a surface charge separation technique used in the enrichment of Golgi membranes. This combination of techniques was successful in achieving an approximately 200-fold increase in the activity of the Golgi marker galactan synthase and represents a significant improvement for proteomic analyses of the Golgi from conifers. A total of thirty known Golgi proteins were identified by mass spectrometry including glycosyltransferases from gene families involved in glucomannan and glucuronoxylan biosynthesis. The free flow electrophoresis fractions of enriched Golgi were highly abundant in structural proteins (actin and tubulin) indicating a role for the cytoskeleton during compression wood formation. The mass spectrometry proteomics data associated with this study have been deposited to the ProteomeXchange with identifier PXD000557. PMID:24416096
Puente-Marin, Sara; Nombela, Iván; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio; Ortega-Villaizan, María Del Mar
2018-04-09
Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation.
Li, Caixia; Tan, Xing Fei; Lim, Teck Kwang; Lin, Qingsong; Gong, Zhiyuan
2016-04-13
Omic approaches have been increasingly used in the zebrafish model for holistic understanding of molecular events and mechanisms of tissue functions. However, plasma is rarely used for omic profiling because of the technical challenges in collecting sufficient blood. In this study, we employed two mass spectrometric (MS) approaches for a comprehensive characterization of zebrafish plasma proteome, i.e. conventional shotgun liquid chromatography-tandem mass spectrometry (LC-MS/MS) for an overview study and quantitative SWATH (Sequential Window Acquisition of all THeoretical fragment-ion spectra) for comparison between genders. 959 proteins were identified in the shotgun profiling with estimated concentrations spanning almost five orders of magnitudes. Other than the presence of a few highly abundant female egg yolk precursor proteins (vitellogenins), the proteomic profiles of male and female plasmas were very similar in both number and abundance and there were basically no other highly gender-biased proteins. The types of plasma proteins based on IPA (Ingenuity Pathway Analysis) classification and tissue sources of production were also very similar. Furthermore, the zebrafish plasma proteome shares significant similarities with human plasma proteome, in particular in top abundant proteins including apolipoproteins and complements. Thus, the current study provided a valuable dataset for future evaluation of plasma proteins in zebrafish.
Li, Caixia; Tan, Xing Fei; Lim, Teck Kwang; Lin, Qingsong; Gong, Zhiyuan
2016-01-01
Omic approaches have been increasingly used in the zebrafish model for holistic understanding of molecular events and mechanisms of tissue functions. However, plasma is rarely used for omic profiling because of the technical challenges in collecting sufficient blood. In this study, we employed two mass spectrometric (MS) approaches for a comprehensive characterization of zebrafish plasma proteome, i.e. conventional shotgun liquid chromatography-tandem mass spectrometry (LC-MS/MS) for an overview study and quantitative SWATH (Sequential Window Acquisition of all THeoretical fragment-ion spectra) for comparison between genders. 959 proteins were identified in the shotgun profiling with estimated concentrations spanning almost five orders of magnitudes. Other than the presence of a few highly abundant female egg yolk precursor proteins (vitellogenins), the proteomic profiles of male and female plasmas were very similar in both number and abundance and there were basically no other highly gender-biased proteins. The types of plasma proteins based on IPA (Ingenuity Pathway Analysis) classification and tissue sources of production were also very similar. Furthermore, the zebrafish plasma proteome shares significant similarities with human plasma proteome, in particular in top abundant proteins including apolipoproteins and complements. Thus, the current study provided a valuable dataset for future evaluation of plasma proteins in zebrafish. PMID:27071722
Extracellular proteome analysis of Leptospira interrogans serovar Lai.
Zeng, Lingbing; Zhang, Yunyi; Zhu, Yongzhang; Yin, Haidi; Zhuang, Xuran; Zhu, Weinan; Guo, Xiaokui; Qin, Jinhong
2013-10-01
Abstract Leptospirosis is one of the most important zoonoses. Leptospira interrogans serovar Lai is a pathogenic spirochete that is responsible for leptospirosis. Extracellular proteins play an important role in the pathogenicity of this bacterium. In this study, L. interrogans serovar Lai was grown in protein-free medium; the supernatant was collected and subsequently analyzed as the extracellular proteome. A total of 66 proteins with more than two unique peptides were detected by MS/MS, and 33 of these were predicted to be extracellular proteins by a combination of bioinformatics analyses, including Psortb, cello, SoSuiGramN and SignalP. Comparisons of the transcriptional levels of these 33 genes between in vivo and in vitro conditions revealed that 15 genes were upregulated and two genes were downregulated in vivo compared to in vitro. A BLAST search for the components of secretion system at the genomic and proteomic levels revealed the presence of the complete type I secretion system and type II secretion system in this strain. Moreover, this strain also exhibits complete Sec translocase and Tat translocase systems. The extracellular proteome analysis of L. interrogans will supplement the previously generated whole proteome data and provide more information for studying the functions of specific proteins in the infection process and for selecting candidate molecules for vaccines or diagnostic tools for leptospirosis.
Puente-Marin, Sara; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio
2018-01-01
Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation. PMID:29642539
Unraveling endometriosis-associated ovarian carcinomas using integrative proteomics
Leung, Felix; Bernardini, Marcus Q.; Liang, Kun; Batruch, Ihor; Rouzbahman, Marjan; Diamandis, Eleftherios P.; Kulasingam, Vathany
2018-01-01
Background: To elucidate potential markers of endometriosis and endometriosis-associated endometrioid and clear cell ovarian carcinomas using mass spectrometry-based proteomics. Methods: A total of 21 fresh, frozen tissues from patients diagnosed with clear cell carcinoma, endometrioid carcinoma, endometriosis and benign endometrium were subjected to an in-depth liquid chromatography-tandem mass spectrometry analysis on the Q-Exactive Plus. Protein identification and quantification were performed using MaxQuant, while downstream analyses were performed using Perseus and various bioinformatics databases. Results: Approximately 9000 proteins were identified in total, representing the first in-depth proteomic investigation of endometriosis and its associated cancers. This proteomic data was shown to be biologically sound, with minimal variation within patient cohorts and recapitulation of known markers. While moderate concordance with genomic data was observed, it was shown that such data are limited in their abilities to represent tumours on the protein level and to distinguish tumours from their benign precursors. Conclusions: The proteomic data suggests that distinct markers may differentiate endometrioid and clear cell carcinoma from endometriosis. These markers may be indicators of pathobiology but will need to be further investigated. Ultimately, this dataset may serve as a basis to unravel the underlying biology of the endometrioid and clear cell cancers with respect to their endometriotic origins. PMID:29721309
Wegrzyn, Jill L.; Bark, Steven J.; Funkelstein, Lydiane; Mosier, Charles; Yap, Angel; Kazemi-Esfarjani, Parasa; La Spada, Albert; Sigurdson, Christina; O’Connor, Daniel T.; Hook, Vivian
2010-01-01
Regulated secretion of neurotransmitters and neurohumoural factors from dense core secretory vesicles provides essential neuroeffectors for cell-cell communication in the nervous and endocrine systems. This study provides comprehensive proteomic characterization of the categories of proteins in chromaffin dense core secretory vesicles that participate in cell-cell communication from the adrenal medulla. Proteomic studies were conducted by nano-HPLC Chip MS/MS tandem mass spectrometry. Results demonstrate that these secretory vesicles contain proteins of distinct functional categories consisting of neuropeptides and neurohumoural factors, protease systems, neurotransmitter enzymes and transporters, receptors, enzymes for biochemical processes, reduction/oxidation regulation, ATPases, protein folding, lipid biochemistry, signal transduction, exocytosis, calcium regulation, as well as structural and cell adhesion proteins. The secretory vesicle proteomic data identified 371 distinct proteins in the soluble fraction and 384 distinct membrane proteins, for a total of 686 distinct secretory vesicle proteins. Notably, these proteomic analyses illustrate the presence of several neurological disease-related proteins in these secretory vesicles, including huntingtin interacting protein, cystatin C, ataxin 7, and prion protein. Overall, these findings demonstrate that multiple protein categories participate in dense core secretory vesicles for production, storage, and secretion of bioactive neuroeffectors for cell-cell communication in health and disease. PMID:20695487
Catalán, Úrsula; Rubió, Laura; López de Las Hazas, Maria-Carmen; Herrero, Pol; Nadal, Pedro; Canela, Núria; Pedret, Anna; Motilva, Maria-José; Solà, Rosa
2016-10-01
Hydroxytyrosol (HT) is the major phenolic compound in virgin olive oil (VOO) in both free and complex forms (secoiridoids; SEC). Proteomics of cardiovascular tissues such as aorta or heart represents a promising tool to uncover the mechanisms of action of phenolic compounds in healthy animals. Twelve female Wistar rats were separated into three groups: a standard diet and two diets supplemented in phenolic compounds (HT and SEC) adjusted to 5 mg/kg/day during 21 days. Proteomic analyses of aorta and heart tissues were performed by nano-LC and MS. Ingenuity Pathway Analysis was used to generate interaction networks. HT or SEC modulated aorta and heart proteome compared to the standard diet. The top-scored networks were related to Cardiovascular System. HT and SEC downregulated proteins related to proliferation and migration of endothelial cells and occlusion of blood vessels in aorta and proteins related to heart failure in heart tissue. SEC showed higher fold change values compared to HT, attributed to higher concentration of HT detected in heart tissue. Changes at proteomic level in cardiovascular tissues may partially account for the underlying mechanisms of VOO phenols cardiovascular protection being the SEC effects higher than free HT. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zeng, Yunliu; Pan, Zhiyong; Ding, Yuduan; Zhu, Andan; Cao, Hongbo; Xu, Qiang; Deng, Xiuxin
2011-11-01
Here, a comprehensive proteomic analysis of the chromoplasts purified from sweet orange using Nycodenz density gradient centrifugation is reported. A GeLC-MS/MS shotgun approach was used to identify the proteins of pooled chromoplast samples. A total of 493 proteins were identified from purified chromoplasts, of which 418 are putative plastid proteins based on in silico sequence homology and functional analyses. Based on the predicted functions of these identified plastid proteins, a large proportion (∼60%) of the chromoplast proteome of sweet orange is constituted by proteins involved in carbohydrate metabolism, amino acid/protein synthesis, and secondary metabolism. Of note, HDS (hydroxymethylbutenyl 4-diphosphate synthase), PAP (plastid-lipid-associated protein), and psHSPs (plastid small heat shock proteins) involved in the synthesis or storage of carotenoid and stress response are among the most abundant proteins identified. A comparison of chromoplast proteomes between sweet orange and tomato suggested a high level of conservation in a broad range of metabolic pathways. However, the citrus chromoplast was characterized by more extensive carotenoid synthesis, extensive amino acid synthesis without nitrogen assimilation, and evidence for lipid metabolism concerning jasmonic acid synthesis. In conclusion, this study provides an insight into the major metabolic pathways as well as some unique characteristics of the sweet orange chromoplasts at the whole proteome level.
Proteomics technique opens new frontiers in mobilome research
Davidson, Andrew D.; Matthews, David A.
2017-01-01
ABSTRACT A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the “mobilome,” which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the “domestication” of transposon proteins for cellular functions. Although ‘omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called “proteomics informed by transcriptomics” (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease. PMID:28932623
A peptide resource for the analysis of Staphylococcus aureus in host pathogen interaction studies
Depke, Maren; Michalik, Stephan; Rabe, Alexander; Surmann, Kristin; Brinkmann, Lars; Jehmlich, Nico; Bernhardt, Jörg; Hecker, Michael; Wollscheid, Bernd; Sun, Zhi; Moritz, Robert L.; Völker, Uwe; Schmidt, Frank
2016-01-01
Staphylococcus aureus is an opportunistic human pathogen, which can cause life-threatening disease. Proteome analyses of the bacterium can provide new insights into its pathophysiology and important facets of metabolic adaptation and, thus, aid the recognition of targets for intervention. However, the value of such proteome studies increases with their comprehensiveness. We present an MS–driven, proteome-wide characterization of the strain S. aureus HG001. Combining 144 high precision proteomic data sets, we identified 19 109 peptides from 2088 distinct S. aureus HG001 proteins, which account for 72% of the predicted ORFs. Peptides were further characterized concerning pI, GRAVY, and detectability scores in order to understand the low peptide coverage of 8.7% (19 109 out of 220 245 theoretical peptides). The high quality peptide-centric spectra have been organized into a comprehensive peptide fragmentation library (SpectraST) and used for identification of S. aureus-typic peptides in highly complex host–pathogen interaction experiments, which significantly improved the number of identified S. aureus proteins compared to a MASCOT search. This effort now allows the elucidation of crucial pathophysiological questions in S. aureus-specific host–pathogen interaction studies through comprehensive proteome analysis. The S. aureus-specific spectra resource developed here also represents an important spectral repository for SRM or for data-independent acquisition MS approaches. All MS data have been deposited in the ProteomeXchange with identifier PXD000702 (http://proteomecentral.proteomexchange.org/dataset/PXD000702). PMID:26224020
A peptide resource for the analysis of Staphylococcus aureus in host-pathogen interaction studies.
Depke, Maren; Michalik, Stephan; Rabe, Alexander; Surmann, Kristin; Brinkmann, Lars; Jehmlich, Nico; Bernhardt, Jörg; Hecker, Michael; Wollscheid, Bernd; Sun, Zhi; Moritz, Robert L; Völker, Uwe; Schmidt, Frank
2015-11-01
Staphylococcus aureus is an opportunistic human pathogen, which can cause life-threatening disease. Proteome analyses of the bacterium can provide new insights into its pathophysiology and important facets of metabolic adaptation and, thus, aid the recognition of targets for intervention. However, the value of such proteome studies increases with their comprehensiveness. We present an MS-driven, proteome-wide characterization of the strain S. aureus HG001. Combining 144 high precision proteomic data sets, we identified 19 109 peptides from 2088 distinct S. aureus HG001 proteins, which account for 72% of the predicted ORFs. Peptides were further characterized concerning pI, GRAVY, and detectability scores in order to understand the low peptide coverage of 8.7% (19 109 out of 220 245 theoretical peptides). The high quality peptide-centric spectra have been organized into a comprehensive peptide fragmentation library (SpectraST) and used for identification of S. aureus-typic peptides in highly complex host-pathogen interaction experiments, which significantly improved the number of identified S. aureus proteins compared to a MASCOT search. This effort now allows the elucidation of crucial pathophysiological questions in S. aureus-specific host-pathogen interaction studies through comprehensive proteome analysis. The S. aureus-specific spectra resource developed here also represents an important spectral repository for SRM or for data-independent acquisition MS approaches. All MS data have been deposited in the ProteomeXchange with identifier PXD000702 (http://proteomecentral.proteomexchange.org/dataset/PXD000702). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Astrophysics Data System (ADS)
Jabbour, Rabih E.; Wade, Mary; Deshpande, Samir V.; McCubbin, Patrick; Snyder, A. Peter; Bevilacqua, Vicky
2012-06-01
Mass spectrometry based proteomic approaches are showing promising capabilities in addressing various biological and biochemical issues. Outer membrane proteins (OMPs) are often associated with virulence in gram-negative pathogens and could prove to be excellent model biomarkers for strain level differentiation among bacteria. Whole cells and OMP extracts were isolated from pathogenic and non-pathogenic strains of Francisella tularensis, Burkholderia thailandensis, and Burkholderia mallei. OMP extracts were compared for their ability to differentiate and delineate the correct database organism to an experimental sample and for the degree of dissimilarity to the nearest-neighbor database strains. This study addresses the comparative experimental proteome analyses of OMPs vs. whole cell lysates on the strain-level discrimination among gram negative pathogenic and non-pathogenic strains.
Droit, Arnaud; Hunter, Joanna M; Rouleau, Michèle; Ethier, Chantal; Picard-Cloutier, Aude; Bourgais, David; Poirier, Guy G
2007-01-01
Background In the "post-genome" era, mass spectrometry (MS) has become an important method for the analysis of proteins and the rapid advancement of this technique, in combination with other proteomics methods, results in an increasing amount of proteome data. This data must be archived and analysed using specialized bioinformatics tools. Description We herein describe "PARPs database," a data analysis and management pipeline for liquid chromatography tandem mass spectrometry (LC-MS/MS) proteomics. PARPs database is a web-based tool whose features include experiment annotation, protein database searching, protein sequence management, as well as data-mining of the peptides and proteins identified. Conclusion Using this pipeline, we have successfully identified several interactions of biological significance between PARP-1 and other proteins, namely RFC-1, 2, 3, 4 and 5. PMID:18093328
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baulch, Janet
2013-09-11
This is a 'glue grant' that was part of a DOE Low Dose project entitled 'Identification and Characterization of Soluble Factors Involved in Delayed Effects of Low Dose Radiation'. This collaborative program has involved Drs. David L. Springer from Pacific Northwest National Laboratory (PNNL), John H. Miller from Washington State University, Tri-cities (WSU) and William F. Morgan then from the University of Maryland, Baltimore (UMB). In July 2008, Dr. Morgan moved to PNNL and Dr. Janet E. Baulch became PI for this project at University of Maryland. In November of 2008, a one year extension with no new funds wasmore » requested to complete the proteomic analyses. The project stemmed from studies in the Morgan laboratory demonstrating that genomically unstable cells secret a soluble factor or factors into the culture medium, that cause cytogenetic aberrations and apoptosis in normal parental GM10115 cells. The purpose of this project was to identify the death inducing effect (DIE) factor or factors, estimate their relative abundance, identify the cell signaling pathways involved and finally recapitulate DIE in normal cells by exogenous manipulation of putative DIE factors in culture medium. As reported in detail in the previous progress report, analysis of culture medium from the parental cell line, and stable and unstable clones demonstrated inconsistent proteomic profiles as relate to candidate DIE factors. While the proposed proteomic analyses did not provide information that would allow DIE factors to be identified, the analyses provided another important set of observations. Proteomic analysis suggested that proteins associated with the cellular response to oxidative stress and mitochondrial function were elevated in the medium from unstable clones in a manner consistent with mitochondrial dysfunction. These findings correlate with previous studies of these clones that demonstrated functional differences between the mitochondria of stable and unstable clones. These mitochondrial abnormalities in the unstable clones contributes to oxidative stress.« less
Du, Jing; Guo, Shirong; Sun, Jin; Shu, Sheng
2018-05-01
The mechanism of exogenous Spd-induced Ca(NO 3 ) 2 stress tolerance in cucumber was studied by proteomics and physiological analyses. Protein-protein interaction network revealed 13 key proteins involved in Spd-induced Ca(NO 3 ) 2 stress resistance. Ca(NO 3 ) 2 stress is one of the major reasons for secondary salinization that limits cucumber plant development in greenhouse. The conferred protective role of exogenous Spd on cucumber in response to Ca(NO 3 ) 2 stress cues involves changes at the cellular and physiological levels. To investigate the molecular foundation of exogenous Spd in Ca(NO 3 ) 2 stress tolerance, a proteomic approach was performed in our work. After a 9 days period of Ca(NO 3 ) 2 stress and/or exogenous Spd, 71 differential protein spots were confidently identified. The resulting proteins were enriched in seven different categories of biological processes, including protein metabolism, carbohydrate and energy metabolism, ROS homeostasis and stress defense, cell wall related, transcription, others and unknown. Protein metabolism (31.2%), carbohydrate and energy metabolism (15.6%), ROS homeostasis and stress defense (32.5%) were the three largest functional categories in cucumber root and most of them were significantly increased by exogenous Spd. The Spd-responsive protein interaction network revealed 13 key proteins, whose accumulation changes could be critical for Spd-induced resistance; all 13 proteins were upregulated by Spd at transcriptional and protein levels in response to Ca(NO 3 ) 2 stress. Furthermore, accumulation of antioxidant enzymes, non-enzymatic antioxidant and polyamines, along with reduction of H 2 O 2 and MDA, were detected after exogenous Spd application during Ca(NO 3 ) 2 stress. The results of these proteomic and physiological analyses in cucumber root may facilitate a better understanding of the underlying mechanism of Ca(NO 3 ) 2 stress tolerance mediated by exogenous Spd.
Mei, Suyu; Zhu, Hao
2015-01-26
Protein-protein interaction (PPI) prediction is generally treated as a problem of binary classification wherein negative data sampling is still an open problem to be addressed. The commonly used random sampling is prone to yield less representative negative data with considerable false negatives. Meanwhile rational constraints are seldom exerted on model selection to reduce the risk of false positive predictions for most of the existing computational methods. In this work, we propose a novel negative data sampling method based on one-class SVM (support vector machine, SVM) to predict proteome-wide protein interactions between HTLV retrovirus and Homo sapiens, wherein one-class SVM is used to choose reliable and representative negative data, and two-class SVM is used to yield proteome-wide outcomes as predictive feedback for rational model selection. Computational results suggest that one-class SVM is more suited to be used as negative data sampling method than two-class PPI predictor, and the predictive feedback constrained model selection helps to yield a rational predictive model that reduces the risk of false positive predictions. Some predictions have been validated by the recent literature. Lastly, gene ontology based clustering of the predicted PPI networks is conducted to provide valuable cues for the pathogenesis of HTLV retrovirus.
Yang, Shuai; Zhang, Xinlei; Diao, Lihong; Guo, Feifei; Wang, Dan; Liu, Zhongyang; Li, Honglei; Zheng, Junjie; Pan, Jingshan; Nice, Edouard C; Li, Dong; He, Fuchu
2015-09-04
The Chromosome-centric Human Proteome Project (C-HPP) aims to catalog genome-encoded proteins using a chromosome-by-chromosome strategy. As the C-HPP proceeds, the increasing requirement for data-intensive analysis of the MS/MS data poses a challenge to the proteomic community, especially small laboratories lacking computational infrastructure. To address this challenge, we have updated the previous CAPER browser into a higher version, CAPER 3.0, which is a scalable cloud-based system for data-intensive analysis of C-HPP data sets. CAPER 3.0 uses cloud computing technology to facilitate MS/MS-based peptide identification. In particular, it can use both public and private cloud, facilitating the analysis of C-HPP data sets. CAPER 3.0 provides a graphical user interface (GUI) to help users transfer data, configure jobs, track progress, and visualize the results comprehensively. These features enable users without programming expertise to easily conduct data-intensive analysis using CAPER 3.0. Here, we illustrate the usage of CAPER 3.0 with four specific mass spectral data-intensive problems: detecting novel peptides, identifying single amino acid variants (SAVs) derived from known missense mutations, identifying sample-specific SAVs, and identifying exon-skipping events. CAPER 3.0 is available at http://prodigy.bprc.ac.cn/caper3.
Khorsandi, Shirin Elizabeth; Salehi, Siamak; Cortes, Miriam; Vilca-Melendez, Hector; Menon, Krishna; Srinivasan, Parthi; Prachalias, Andreas; Jassem, Wayel; Heaton, Nigel
2018-02-15
Mitochondria have their own genomic, transcriptomic and proteomic machinery but are unable to be autonomous, needing both nuclear and mitochondrial genomes. The aim of this work was to use computational biology to explore the involvement of Mitochondrial microRNAs (MitomiRs) and their interactions with the mitochondrial proteome in a clinical model of primary non function (PNF) of the donor after cardiac death (DCD) liver. Archival array data on the differential expression of miRNA in DCD PNF was re-analyzed using a number of publically available computational algorithms. 10 MitomiRs were identified of importance in DCD PNF, 7 with predicted interaction of their seed sequence with the mitochondrial transcriptome that included both coding, and non coding areas of the hypervariability region 1 (HVR1) and control region. Considering miRNA regulation of the nuclear encoded mitochondrial proteome, 7 hypothetical small proteins were identified with homolog function that ranged from co-factor for formation of ATP Synthase, REDOX balance and an importin/exportin protein. In silico, unconventional seed interactions, both non canonical and alternative seed sites, appear to be of greater importance in MitomiR regulation of the mitochondrial genome. Additionally, a number of novel small proteins of relevance in transplantation have been identified which need further characterization.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fukunaga, Satoki; Environmental Health Science Laboratory, Sumitomo Chemical Co., Ltd., 3-1-98 Kasugade-Naka, Konohana-ku, Osaka 554-8558; Kakehashi, Anna
To determine miRNAs and their predicted target proteins regulatory networks which are potentially involved in onset of pulmonary fibrosis in the bleomycin rat model, we conducted integrative miRNA microarray and iTRAQ-coupled LC-MS/MS proteomic analyses, and evaluated the significance of altered biological functions and pathways. We observed that alterations of miRNAs and proteins are associated with the early phase of bleomycin-induced pulmonary fibrosis, and identified potential target pairs by using ingenuity pathway analysis. Using the data set of these alterations, it was demonstrated that those miRNAs, in association with their predicted target proteins, are potentially involved in canonical pathways reflective ofmore » initial epithelial injury and fibrogenic processes, and biofunctions related to induction of cellular development, movement, growth, and proliferation. Prediction of activated functions suggested that lung cells acquire proliferative, migratory, and invasive capabilities, and resistance to cell death especially in the very early phase of bleomycin-induced pulmonary fibrosis. The present study will provide new insights for understanding the molecular pathogenesis of idiopathic pulmonary fibrosis. - Highlights: • We analyzed bleomycin-induced pulmonary fibrosis in the rat. • Integrative analyses of miRNA microarray and proteomics were conducted. • We determined the alterations of miRNAs and their potential target proteins. • The alterations may control biological functions and pathways in pulmonary fibrosis. • Our result may provide new insights of pulmonary fibrosis.« less
Mahendran, Shalini M; Oikonomopoulou, Katerina; Diamandis, Eleftherios P; Chandran, Vinod
Synovial fluid (SF) is a protein-rich fluid produced into the joint cavity by cells of the synovial membrane. Due to its direct contact with articular cartilage, surfaces of the bone, and the synoviocytes of the inner membrane, it provides a promising reflection of the biochemical state of the joint under varying physiological and pathophysiological conditions. This property of SF has been exploited within numerous studies in search of unique biomarkers of joint pathologies with the ultimate goal of developing minimally invasive clinical assays to detect and/or monitor disease states. Several proteomic methodologies have been employed to mine the SF proteome. From elementary immunoassays to high-throughput analyses using mass spectrometry-based techniques, each has demonstrated distinct advantages and disadvantages in the identification and quantification of SF proteins. This review will explore the role of SF in the elucidation of the arthritis proteome and the extent to which high-throughput techniques have facilitated the discovery and validation of protein biomarkers from osteoarthritis (OA), rheumatoid arthritis (RA), psoriatic arthritis (PsA), and juvenile idiopathic arthritis (JIA) patients.
Srivastava, Amrita; Singh, Anumeha; Singh, Satya S; Mishra, Arun K
2017-04-16
An appreciation of comparative microbial survival is most easily done while evaluating their adaptive strategies during stress. In the present experiment, antioxidative and whole cell proteome variations based on spectrophotometric analysis and SDS-PAGE and 2-dimensional gel electrophoresis have been analysed among salt-tolerant and salt-sensitive Frankia strains. This is the first report of proteomic basis underlying salt tolerance in these newly isolated Frankia strains from Hippophae salicifolia D. Don. Salt-tolerant strain HsIi10 shows higher increment in the contents of superoxide dismutase, catalase and ascorbate peroxidase as compared to salt-sensitive strain HsIi8. Differential 2-DGE profile has revealed differential profiles for salt-tolerant and salt-sensitive strains. Proteomic confirmation of salt tolerance in the strains with inbuilt efficiency of thriving in nitrogen-deficient locales is a definite advantage for these microbes. This would be equally beneficial for improvement of soil nitrogen status. Efficient protein regulation in HsIi10 suggests further exploration for its potential use as biofertilizer in saline soils.
Lawless, Craig; Hubbard, Simon J.; Fan, Jun; Bessant, Conrad; Hermjakob, Henning; Jones, Andrew R.
2012-01-01
Abstract New methods for performing quantitative proteome analyses based on differential labeling protocols or label-free techniques are reported in the literature on an almost monthly basis. In parallel, a correspondingly vast number of software tools for the analysis of quantitative proteomics data has also been described in the literature and produced by private companies. In this article we focus on the review of some of the most popular techniques in the field and present a critical appraisal of several software packages available to process and analyze the data produced. We also describe the importance of community standards to support the wide range of software, which may assist researchers in the analysis of data using different platforms and protocols. It is intended that this review will serve bench scientists both as a useful reference and a guide to the selection and use of different pipelines to perform quantitative proteomics data analysis. We have produced a web-based tool (http://www.proteosuite.org/?q=other_resources) to help researchers find appropriate software for their local instrumentation, available file formats, and quantitative methodology. PMID:22804616
ProGeRF: Proteome and Genome Repeat Finder Utilizing a Fast Parallel Hash Function
Moraes, Walas Jhony Lopes; Rodrigues, Thiago de Souza; Bartholomeu, Daniella Castanheira
2015-01-01
Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing-exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be efficient, fast, and accurate and primarily user-friendly web tool allowing many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity. PMID:25811026
Lübke, Torben; Lobel, Peter; Sleat, David
2009-01-01
Defects in lysosomal function have been associated with numerous monogenic human diseases typically classified as lysosomal storage diseases. However, there is increasing evidence that lysosomal proteins are also involved in more widespread human diseases including cancer and Alzheimer disease. Thus, there is a continuing interest in understanding the cellular functions of the lysosome and an emerging approach to this is the identification of its constituent proteins by proteomic analyses. To date, the mammalian lysosome has been shown to contain ~ 60 soluble luminal proteins and ~25 transmembrane proteins. However, recent proteomic studies based upon affinity purification of soluble components or subcellular fractionation to obtain both soluble and membrane components suggest that there may be many more of both classes of protein resident within this organelle than previously appreciated. Discovery of such proteins has important implications for understanding the function and the dynamics of the lysosome but can also lead the way towards the discovery of the genetic basis for human diseases of hitherto unknown etiology. Here, we describe current approaches to lysosomal proteomics and data interpretation and review the new lysosomal proteins that have recently emerged from such studies. PMID:18977398
Mass Defect Labeling of Cysteine for Improving Peptide Assignment in Shotgun Proteomic Analyses
Hernandez, Hilda; Niehauser, Sarah; Boltz, Stacey A.; Gawandi, Vijay; Phillips, Robert S.; Amster, I. Jonathan
2006-01-01
A method for improving the identification of peptides in a shotgun proteome analysis using accurate mass measurement has been developed. The improvement is based upon the derivatization of cysteine residues with a novel reagent, 2,4-dibromo-(2′-iodo)acetanilide. The derivitization changes the mass defect of cysteine-containing proteolytic peptides in a manner that increases their identification specificity. Peptide masses were measured using matrix-assisted laser desorption/ionization Fourier transform ion cyclotron mass spectrometry. Reactions with protein standards show that the derivatization of cysteine is rapid and quantitative, and the data suggest that the derivatized peptides are more easily ionized or detected than unlabeled cysteine-containing peptides. The reagent was tested on a 15N-metabolically labeled proteome from M. maripaludis. Proteins were identified by their accurate mass values and from their nitrogen stoichiometry. A total of 47% of the labeled peptides are identified versus 27% for the unlabeled peptides. This procedure permits the identification of proteins from the M. maripaludis proteome that are not usually observed by the standard protocol and shows that better protein coverage is obtained with this methodology. PMID:16689545
Transcriptional and Proteomic Profiling of Aspergillus flavipes in Response to Sulfur Starvation
El-Sayed, Ashraf S. A.; Yassin, Marwa A.; Ali, Gul Shad
2015-01-01
Aspergillus flavipes has received considerable interest due to its potential to produce therapeutic enzymes involved in sulfur amino acid metabolism. In natural habitats, A. flavipes survives under sulfur limitations by mobilizing endogenous and exogenous sulfur to operate diverse cellular processes. Sulfur limitation affects virulence and pathogenicity, and modulates proteome of sulfur assimilating enzymes of several fungi. However, there are no previous reports aimed at exploring effects of sulfur limitation on the regulation of A. flavipes sulfur metabolism enzymes at the transcriptional, post-transcriptional and proteomic levels. In this report, we show that sulfur limitation affects morphological and physiological responses of A. flavipes. Transcription and enzymatic activities of several key sulfur metabolism genes, ATP-sulfurylase, sulfite reductase, methionine permease, cysteine synthase, cystathionine β- and γ-lyase, glutathione reductase and glutathione peroxidase were increased under sulfur starvation conditions. A 50 kDa protein band was strongly induced by sulfur starvation, and the proteomic analyses of this protein band using LC-MS/MS revealed similarity to many proteins involved in the sulfur metabolism pathway. PMID:26633307
Bioinformatics in proteomics: application, terminology, and pitfalls.
Wiemer, Jan C; Prokudin, Alexander
2004-01-01
Bioinformatics applies data mining, i.e., modern computer-based statistics, to biomedical data. It leverages on machine learning approaches, such as artificial neural networks, decision trees and clustering algorithms, and is ideally suited for handling huge data amounts. In this article, we review the analysis of mass spectrometry data in proteomics, starting with common pre-processing steps and using single decision trees and decision tree ensembles for classification. Special emphasis is put on the pitfall of overfitting, i.e., of generating too complex single decision trees. Finally, we discuss the pros and cons of the two different decision tree usages.
EPA'S TOXICOGENOMICS PARTNERSHIPS ACROSS GOVERNMENT, ACADEMIA AND INDUSTRY
Genomics, proteomics and metabonomics technologies are transforming the science of toxicology, and concurrent advances in computing and informatics are providing management and analysis solutions for this onslaught of toxicogenomic data. EPA has been actively developing an intra...
Computer Simulation of the Virulome of Bacillus anthracis Using Proteomics
2006-07-31
hypothetical protein gi|47526566 spermidine /putrescine ABC transporter, spermidine /putrescine-binding protein gi|47526625 oligoendopeptidase F, putative gi...glutamyl-trna(gln) amidotransferase, a subunit x gi|50196927 aspartate aminotransferase x gi|50196970 spermidine synthase x
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Jing; Ma, Zihao; Carr, Steven A.
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC).more » Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies. Molecular & Cellular Proteomics 16: 10.1074/mcp.M116.060301, 121–134, 2017.« less
Venkataramanan, Keerthi P; Min, Lie; Hou, Shuyu; Jones, Shawn W; Ralston, Matthew T; Lee, Kelvin H; Papoutsakis, E Terry
2015-01-01
Clostridium acetobutylicum is a model organism for both clostridial biology and solvent production. The organism is exposed to its own toxic metabolites butyrate and butanol, which trigger an adaptive stress response. Integrative analysis of proteomic and RNAseq data may provide novel insights into post-transcriptional regulation. The identified iTRAQ-based quantitative stress proteome is made up of 616 proteins with a 15 % genome coverage. The differentially expressed proteome correlated poorly with the corresponding differential RNAseq transcriptome. Up to 31 % of the differentially expressed proteins under stress displayed patterns opposite to those of the transcriptome, thus suggesting significant post-transcriptional regulation. The differential proteome of the translation machinery suggests that cells employ a different subset of ribosomal proteins under stress. Several highly upregulated proteins but with low mRNA levels possessed mRNAs with long 5'UTRs and strong RBS scores, thus supporting the argument that regulatory elements on the long 5'UTRs control their translation. For example, the oxidative stress response rubrerythrin was upregulated only at the protein level up to 40-fold without significant mRNA changes. We also identified many leaderless transcripts, several displaying different transcriptional start sites, thus suggesting mRNA-trimming mechanisms under stress. Downregulation of Rho and partner proteins pointed to changes in transcriptional elongation and termination under stress. The integrative proteomic-transcriptomic analysis demonstrated complex expression patterns of a large fraction of the proteome. Such patterns could not have been detected with one or the other omic analyses. Our analysis proposes the involvement of specific molecular mechanisms of post-transcriptional regulation to explain the observed complex stress response.
Lenčo, Juraj; Vajrychová, Marie; Pimková, Kristýna; Prokšová, Magdaléna; Benková, Markéta; Klimentová, Jana; Tambor, Vojtěch; Soukup, Ondřej
2018-04-17
Due to its sensitivity and productivity, bottom-up proteomics based on liquid chromatography-mass spectrometry (LC-MS) has become the core approach in the field. The de facto standard LC-MS platform for proteomics operates at sub-μL/min flow rates, and nanospray is required for efficiently introducing peptides into a mass spectrometer. Although this is almost a "dogma", this view is being reconsidered in light of developments in highly efficient chromatographic columns, and especially with the introduction of exceptionally sensitive MS instruments. Although conventional-flow LC-MS platforms have recently penetrated targeted proteomics successfully, their possibilities in discovery-oriented proteomics have not yet been thoroughly explored. Our objective was to determine what are the extra costs and what optimization and adjustments to a conventional-flow LC-MS system must be undertaken to identify a comparable number of proteins as can be identified on a nanoLC-MS system. We demonstrate that the amount of a complex tryptic digest needed for comparable proteome coverage can be roughly 5-fold greater, providing the column dimensions are properly chosen, extra-column peak dispersion is minimized, column temperature and flow rate are set to levels appropriate for peptide separation, and the composition of mobile phases is fine-tuned. Indeed, we identified 2 835 proteins from 2 μg of HeLa cells tryptic digest separated during a 60 min gradient at 68 μL/min on a 1.0 mm × 250 mm column held at 55 °C and using an aqua-acetonitrile mobile phases containing 0.1% formic acid, 0.4% acetic acid, and 3% dimethyl sulfoxide. Our results document that conventional-flow LC-MS is an attractive alternative for bottom-up exploratory proteomics.
Vanderperre, Benoît; Lucier, Jean-François; Bissonnette, Cyntia; Motard, Julie; Tremblay, Guillaume; Vanderperre, Solène; Wisztorski, Maxence; Salzet, Michel; Boisvert, François-Michel; Roucou, Xavier
2013-01-01
A fully mature mRNA is usually associated to a reference open reading frame encoding a single protein. Yet, mature mRNAs contain unconventional alternative open reading frames (AltORFs) located in untranslated regions (UTRs) or overlapping the reference ORFs (RefORFs) in non-canonical +2 and +3 reading frames. Although recent ribosome profiling and footprinting approaches have suggested the significant use of unconventional translation initiation sites in mammals, direct evidence of large-scale alternative protein expression at the proteome level is still lacking. To determine the contribution of alternative proteins to the human proteome, we generated a database of predicted human AltORFs revealing a new proteome mainly composed of small proteins with a median length of 57 amino acids, compared to 344 amino acids for the reference proteome. We experimentally detected a total of 1,259 alternative proteins by mass spectrometry analyses of human cell lines, tissues and fluids. In plasma and serum, alternative proteins represent up to 55% of the proteome and may be a potential unsuspected new source for biomarkers. We observed constitutive co-expression of RefORFs and AltORFs from endogenous genes and from transfected cDNAs, including tumor suppressor p53, and provide evidence that out-of-frame clones representing AltORFs are mistakenly rejected as false positive in cDNAs screening assays. Functional importance of alternative proteins is strongly supported by significant evolutionary conservation in vertebrates, invertebrates, and yeast. Our results imply that coding of multiple proteins in a single gene by the use of AltORFs may be a common feature in eukaryotes, and confirm that translation of unconventional ORFs generates an as yet unexplored proteome. PMID:23950983
Rokyta, Darin R; Ward, Micaiah J
2017-03-15
The order Scorpiones is one of the most ancient and diverse lineages of venomous animals, having originated approximately 430 million years ago and diversified into 14 extant families. Although partial venom characterizations have been described for numerous scorpion species, we provided the first quantitative transcriptome/proteome comparison for a scorpion species using single-animal approaches. We sequenced the venom-gland transcriptomes of a male and female black-back scorpion (Hadrurus spadix) from the family Caraboctonidae using the Illumina sequencing platform and conducted independent quantitative mass-spectrometry analyses of their venoms. We identified 79 proteomically confirmed venom proteins, an additional 69 transcripts with homology to toxins from other species, and 596 nontoxin proteins expressed at high levels in the venom glands. The venom of H. spadix was rich in antimicrobial peptides, K + -channel toxins, and several classes of peptidases. However, the most diverse and one of the most abundant classes of putative toxins could not be assigned even a tentative functional role on the basis of homology, indicating that this venom contained a wealth of previously unexplored animal toxin diversity. We found good agreement between both transcriptomic and proteomic abundances across individuals, but transcriptomic and proteomic abundandances differed substantially within each individual. Small peptide toxins such as K + -channel toxins and antimicrobial peptides proved challenging to detect proteomically, at least in part due to the significant proteolytic processing involved in their maturation. In addition, we found a significant tendency for our proteomic approach to overestimate the abundances of large putative toxins and underestimate the abundances of smaller toxins. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lundquist, Peter K.; Poliakov, Anton; Bhuiyan, Nazmul H.; Zybailov, Boris; Sun, Qi; van Wijk, Klaas J.
2012-01-01
Plastoglobules (PGs) in chloroplasts are thylakoid-associated monolayer lipoprotein particles containing prenyl and neutral lipids and several dozen proteins mostly with unknown functions. An integrated view of the role of the PG is lacking. Here, we better define the PG proteome and provide a conceptual framework for further studies. The PG proteome from Arabidopsis (Arabidopsis thaliana) leaf chloroplasts was determined by mass spectrometry of isolated PGs and quantitative comparison with the proteomes of unfractionated leaves, thylakoids, and stroma. Scanning electron microscopy showed the purity and size distribution of the isolated PGs. Compared with previous PG proteome analyses, we excluded several proteins and identified six new PG proteins, including an M48 metallopeptidase and two Absence of bc1 complex (ABC1) atypical kinases, confirmed by immunoblotting. This refined PG proteome consisted of 30 proteins, including six ABC1 kinases and seven fibrillins together comprising more than 70% of the PG protein mass. Other fibrillins were located predominantly in the stroma or thylakoid and not in PGs; we discovered that this partitioning can be predicted by their isoelectric point and hydrophobicity. A genome-wide coexpression network for the PG genes was then constructed from mRNA expression data. This revealed a modular network with four distinct modules that each contained at least one ABC1K and/or fibrillin gene. Each module showed clear enrichment in specific functions, including chlorophyll degradation/senescence, isoprenoid biosynthesis, plastid proteolysis, and redox regulators and phosphoregulators of electron flow. We propose a new testable model for the PGs, in which sets of genes are associated with specific PG functions. PMID:22274653
Machine learning applications in proteomics research: how the past can boost the future.
Kelchtermans, Pieter; Bittremieux, Wout; De Grave, Kurt; Degroeve, Sven; Ramon, Jan; Laukens, Kris; Valkenborg, Dirk; Barsnes, Harald; Martens, Lennart
2014-03-01
Machine learning is a subdiscipline within artificial intelligence that focuses on algorithms that allow computers to learn solving a (complex) problem from existing data. This ability can be used to generate a solution to a particularly intractable problem, given that enough data are available to train and subsequently evaluate an algorithm on. Since MS-based proteomics has no shortage of complex problems, and since publicly available data are becoming available in ever growing amounts, machine learning is fast becoming a very popular tool in the field. We here therefore present an overview of the different applications of machine learning in proteomics that together cover nearly the entire wet- and dry-lab workflow, and that address key bottlenecks in experiment planning and design, as well as in data processing and analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Time, space, and disorder in the expanding proteome universe.
Minde, David-Paul; Dunker, A Keith; Lilley, Kathryn S
2017-04-01
Proteins are highly dynamic entities. Their myriad functions require specific structures, but proteins' dynamic nature ranges all the way from the local mobility of their amino acid constituents to mobility within and well beyond single cells. A truly comprehensive view of the dynamic structural proteome includes: (i) alternative sequences, (ii) alternative conformations, (iii) alternative interactions with a range of biomolecules, (iv) cellular localizations, (v) alternative behaviors in different cell types. While these aspects have traditionally been explored one protein at a time, we highlight recently emerging global approaches that accelerate comprehensive insights into these facets of the dynamic nature of protein structure. Computational tools that integrate and expand on multiple orthogonal data types promise to enable the transition from a disjointed list of static snapshots to a structurally explicit understanding of the dynamics of cellular mechanisms. © 2017 The Authors. Proteomics Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Schwämmle, Veit; León, Ileana Rodríguez; Jensen, Ole Nørregaard
2013-09-06
Large-scale quantitative analyses of biological systems are often performed with few replicate experiments, leading to multiple nonidentical data sets due to missing values. For example, mass spectrometry driven proteomics experiments are frequently performed with few biological or technical replicates due to sample-scarcity or due to duty-cycle or sensitivity constraints, or limited capacity of the available instrumentation, leading to incomplete results where detection of significant feature changes becomes a challenge. This problem is further exacerbated for the detection of significant changes on the peptide level, for example, in phospho-proteomics experiments. In order to assess the extent of this problem and the implications for large-scale proteome analysis, we investigated and optimized the performance of three statistical approaches by using simulated and experimental data sets with varying numbers of missing values. We applied three tools, including standard t test, moderated t test, also known as limma, and rank products for the detection of significantly changing features in simulated and experimental proteomics data sets with missing values. The rank product method was improved to work with data sets containing missing values. Extensive analysis of simulated and experimental data sets revealed that the performance of the statistical analysis tools depended on simple properties of the data sets. High-confidence results were obtained by using the limma and rank products methods for analyses of triplicate data sets that exhibited more than 1000 features and more than 50% missing values. The maximum number of differentially represented features was identified by using limma and rank products methods in a complementary manner. We therefore recommend combined usage of these methods as a novel and optimal way to detect significantly changing features in these data sets. This approach is suitable for large quantitative data sets from stable isotope labeling and mass spectrometry experiments and should be applicable to large data sets of any type. An R script that implements the improved rank products algorithm and the combined analysis is available.
Hou, Junjie; Zhang, Chengqian; Xue, Peng; Wang, Jifeng; Chen, Xiulan; Guo, Xiaojing; Yang, Fuquan
2017-01-01
Ovarian cancer is one of the most common cancer among women in the world, and chemotherapy remains the principal treatment for patients. However, drug resistance is a major obstacle to the effective treatment of ovarian cancers and the underlying mechanism is not clear. An increased understanding of the mechanisms that underline the pathogenesis of drug resistance is therefore needed to develop novel therapeutics and diagnostic. Herein, we report the comparative analysis of the doxorubicin sensitive OVCAR8 cells and its doxorubicin-resistant variant NCI/ADR-RES cells using integrated global proteomics and N-glycoproteomics. A total of 1525 unique N-glycosite-containing peptides from 740 N-glycoproteins were identified and quantified, of which 253 N-glycosite-containing peptides showed significant change in the NCI/ADR-RES cells. Meanwhile, stable isotope labeling by amino acids in cell culture (SILAC) based comparative proteomic analysis of the two ovarian cancer cells led to the quantification of 5509 proteins. As about 50% of the identified N-glycoproteins are low-abundance membrane proteins, only 44% of quantified unique N-glycosite-containing peptides had corresponding protein expression ratios. The comparison and calibration of the N-glycoproteome versus the proteome classified 14 change patterns of N-glycosite-containing peptides, including 8 up-regulated N-glycosite-containing peptides with the increased glycosylation sites occupancy, 35 up-regulated N-glycosite-containing peptides with the unchanged glycosylation sites occupancy, 2 down-regulated N-glycosite-containing peptides with the decreased glycosylation sites occupancy, 46 down-regulated N-glycosite-containing peptides with the unchanged glycosylation sites occupancy. Integrated proteomic and N-glycoproteomic analyses provide new insights, which can help to unravel the relationship of N-glycosylation and multidrug resistance (MDR), understand the mechanism of MDR, and discover the new diagnostic and therapeutic targets. PMID:28077793
Analysis of the pumpkin phloem proteome provides insights into angiosperm sieve tube function.
Lin, Ming-Kuem; Lee, Young-Jin; Lough, Tony J; Phinney, Brett S; Lucas, William J
2009-02-01
Increasing evidence suggests that proteins present in the angiosperm sieve tube system play an important role in the long distance signaling system of plants. To identify the nature of these putatively non-cell-autonomous proteins, we adopted a large scale proteomics approach to analyze pumpkin phloem exudates. Phloem proteins were fractionated by fast protein liquid chromatography using both anion and cation exchange columns and then either in-solution or in-gel digested following further separation by SDS-PAGE. A total of 345 LC-MS/MS data sets were analyzed using a combination of Mascot and X!Tandem against the NCBI non-redundant green plant database and an extensive Cucurbit maxima expressed sequence tag database. In this analysis, 1,209 different consensi were obtained of which 1,121 could be annotated from GenBank and BLAST search analyses against three plant species, Arabidopsis thaliana, rice (Oryza sativa), and poplar (Populus trichocarpa). Gene ontology (GO) enrichment analyses identified sets of phloem proteins that function in RNA binding, mRNA translation, ubiquitin-mediated proteolysis, and macromolecular and vesicle trafficking. Our findings indicate that protein synthesis and turnover, processes that were thought to be absent in enucleate sieve elements, likely occur within the angiosperm phloem translocation stream. In addition, our GO analysis identified a set of phloem proteins that are associated with the GO term "embryonic development ending in seed dormancy"; this finding raises the intriguing question as to whether the phloem may exert some level of control over seed development. The universal significance of the phloem proteome was highlighted by conservation of the phloem proteome in species as diverse as monocots (rice), eudicots (Arabidopsis and pumpkin), and trees (poplar). These results are discussed from the perspective of the role played by the phloem proteome as an integral component of the whole plant communication system.
Lu, Dihong; Ni, Weimin; Stanley, Bruce A.; ...
2016-03-03
The ARABIDOPSIS SKP1-LIKE1 (ASK1) protein functions as a subunit of SKP1-CUL1-F-box (SCF) E3 ubiquitin ligases. Previous genetic studies showed that ASK1 plays important roles in Arabidopsis flower development and male meiosis. However, the molecular impact of ASK1-containing SCF E3 ubiquitin ligases (ASK1-E3s) on the floral proteome and transcriptome is unknown. Here we identified proteins that are potentially regulated by ASK1-E3s by comparing floral bud proteomes of wild-type and the ask1 mutant plants. More than 200 proteins were detected in the ask1 mutant but not in wild-type and >300 were detected at higher levels in the ask1 mutant than in wild-type,more » but their RNA levels were not significantly different between wild-type and ask1 floral buds as shown by transcriptomics analysis, suggesting that they are likely regulated at the protein level by ASK1-E3s. Integrated analyses of floral proteomics and transcriptomics of ask1 and wild-type uncovered several potential aspects of ASK1-E3 functions, including regulation of transcription regulators, kinases, peptidases, and ribosomal proteins, with implications on possible mechanisms of ASK1-E3 functions in floral development. In conclusion, our results suggested that ASK1-E3s play important roles in Arabidopsis protein degradation during flower development. This study opens up new possibilities for further functional studies of these candidate E3 substrates.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Dihong; Ni, Weimin; Stanley, Bruce A.
The ARABIDOPSIS SKP1-LIKE1 (ASK1) protein functions as a subunit of SKP1-CUL1-F-box (SCF) E3 ubiquitin ligases. Previous genetic studies showed that ASK1 plays important roles in Arabidopsis flower development and male meiosis. However, the molecular impact of ASK1-containing SCF E3 ubiquitin ligases (ASK1-E3s) on the floral proteome and transcriptome is unknown. Here we identified proteins that are potentially regulated by ASK1-E3s by comparing floral bud proteomes of wild-type and the ask1 mutant plants. More than 200 proteins were detected in the ask1 mutant but not in wild-type and >300 were detected at higher levels in the ask1 mutant than in wild-type,more » but their RNA levels were not significantly different between wild-type and ask1 floral buds as shown by transcriptomics analysis, suggesting that they are likely regulated at the protein level by ASK1-E3s. Integrated analyses of floral proteomics and transcriptomics of ask1 and wild-type uncovered several potential aspects of ASK1-E3 functions, including regulation of transcription regulators, kinases, peptidases, and ribosomal proteins, with implications on possible mechanisms of ASK1-E3 functions in floral development. In conclusion, our results suggested that ASK1-E3s play important roles in Arabidopsis protein degradation during flower development. This study opens up new possibilities for further functional studies of these candidate E3 substrates.« less
Haraszti, Reka A.; Didiot, Marie-Cecile; Sapp, Ellen; Leszyk, John; Shaffer, Scott A.; Rockwell, Hannah E.; Gao, Fei; Narain, Niven R.; DiFiglia, Marian; Kiebish, Michael A.; Aronin, Neil; Khvorova, Anastasia
2016-01-01
Extracellular vesicles (EVs), including exosomes and microvesicles (MVs), are explored for use in diagnostics, therapeutics and drug delivery. However, little is known about the relationship of protein and lipid composition of EVs and their source cells. Here, we report high-resolution lipidomic and proteomic analyses of exosomes and MVs derived by differential ultracentrifugation from 3 different cell types: U87 glioblastoma cells, Huh7 hepatocellular carcinoma cells and human bone marrow-derived mesenchymal stem cells (MSCs). We identified 3,532 proteins and 1,961 lipid species in the screen. Exosomes differed from MVs in several different areas: (a) The protein patterns of exosomes were more likely different from their cells of origin than were the protein patterns of MVs; (b) The proteomes of U87 and Huh7 exosomes were similar to each other but different from the proteomes of MSC exosomes, whereas the lipidomes of Huh7 and MSC exosomes were similar to each other but different from the lipidomes of U87 exosomes; (c) exosomes exhibited proteins of extracellular matrix, heparin-binding, receptors, immune response and cell adhesion functions, whereas MVs were enriched in endoplasmic reticulum, proteasome and mitochondrial proteins. Exosomes and MVs also differed in their types of lipid contents. Enrichment in glycolipids and free fatty acids characterized exosomes, whereas enrichment in ceramides and sphingomyelins characterized MVs. Furthermore, Huh7 and MSC exosomes were specifically enriched in cardiolipins; U87 exosomes were enriched in sphingomyelins. This study comprehensively analyses the protein and lipid composition of exosomes, MVs and source cells in 3 different cell types. PMID:27863537
Jesupret, Clémence; Baumann, Kate; Jackson, Timothy N W; Ali, Syed Abid; Yang, Daryl C; Greisman, Laura; Kern, Larissa; Steuten, Jessica; Jouiaei, Mahdokht; Casewell, Nicholas R; Undheim, Eivind A B; Koludarov, Ivan; Debono, Jordan; Low, Dolyce H W; Rossi, Sarah; Panagides, Nadya; Winter, Kelly; Ignjatovic, Vera; Summerhayes, Robyn; Jones, Alun; Nouwens, Amanda; Dunstan, Nathan; Hodgson, Wayne C; Winkel, Kenneth D; Monagle, Paul; Fry, Bryan Grieg
2014-06-13
For over a century, venom samples from wild snakes have been collected and stored around the world. However, the quality of storage conditions for "vintage" venoms has rarely been assessed. The goal of this study was to determine whether such historical venom samples are still biochemically and pharmacologically viable for research purposes, or if new sample efforts are needed. In total, 52 samples spanning 5 genera and 13 species with regional variants of some species (e.g., 14 different populations of Notechis scutatus) were analysed by a combined proteomic and pharmacological approach to determine protein structural stability and bioactivity. When venoms were not exposed to air during storage, the proteomic results were virtually indistinguishable from that of fresh venom and bioactivity was equivalent or only slightly reduced. By contrast, a sample of Acanthophis antarcticus venom that was exposed to air (due to a loss of integrity of the rubber stopper) suffered significant degradation as evidenced by the proteomics profile. Interestingly, the neurotoxicity of this sample was nearly the same as fresh venom, indicating that degradation may have occurred in the free N- or C-terminus chains of the proteins, rather than at the tips of loops where the functional residues are located. These results suggest that these and other vintage venom collections may be of continuing value in toxin research. This is particularly important as many snake species worldwide are declining due to habitat destruction or modification. For some venoms (such as N. scutatus from Babel Island, Flinders Island, King Island and St. Francis Island) these were the first analyses ever conducted and these vintage samples may represent the only venom ever collected from these unique island forms of tiger snakes. Such vintage venoms may therefore represent the last remaining stocks of some local populations and thus are precious resources. These venoms also have significant historical value as the Oxyuranus venoms analysed include samples from the first coastal taipan (Oxyuranus scutellatus) collected for antivenom production (the snake that killed the collector Kevin Budden), as well as samples from the first Oxyuranus microlepidotus specimen collected after the species' rediscovery in 1976. These results demonstrate that with proper storage techniques, venom samples can retain structural and pharmacological stability. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Rea, Giuseppina; Cristofaro, Francesco; Pani, Giuseppe; Pascucci, Barbara; Ghuge, Sandip A; Corsetto, Paola Antonia; Imbriani, Marcello; Visai, Livia; Rizzo, Angela M
2016-03-30
Space is a hostile environment characterized by high vacuum, extreme temperatures, meteoroids, space debris, ionospheric plasma, microgravity and space radiation, which all represent risks for human health. A deep understanding of the biological consequences of exposure to the space environment is required to design efficient countermeasures to minimize their negative impact on human health. Recently, proteomic approaches have received a significant amount of attention in the effort to further study microgravity-induced physiological changes. In this review, we summarize the current knowledge about the effects of microgravity on microorganisms (in particular Cupriavidus metallidurans CH34, Bacillus cereus and Rhodospirillum rubrum S1H), plants (whole plants, organs, and cell cultures), mammalian cells (endothelial cells, bone cells, chondrocytes, muscle cells, thyroid cancer cells, immune system cells) and animals (invertebrates, vertebrates and mammals). Herein, we describe their proteome's response to microgravity, focusing on proteomic discoveries and their future potential applications in space research. Space experiments and operational flight experience have identified detrimental effects on human health and performance because of exposure to weightlessness, even when currently available countermeasures are implemented. Many experimental tools and methods have been developed to study microgravity induced physiological changes. Recently, genomic and proteomic approaches have received a significant amount of attention. This review summarizes the recent research studies of the proteome response to microgravity inmicroorganisms, plants, mammalians cells and animals. Current proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of all proteomes. Understanding gene and/or protein expression is the key to unlocking the mechanisms behind microgravity-induced problems and to finding effective countermeasures to spaceflight-induced alterations but also for the study of diseases on earth. Future perspectives are also highlighted. Copyright © 2015 Elsevier B.V. All rights reserved.
Global iTRAQ-based proteomic profiling of Toxoplasma gondii oocysts during sporulation.
Zhou, Chun-Xue; Zhu, Xing-Quan; Elsheikha, Hany M; He, Shuai; Li, Qian; Zhou, Dong-Hui; Suo, Xun
2016-10-04
Toxoplasma gondii is a medically and economically important protozoan parasite. However, the molecular mechanisms of its sporulation remain largely unknown. Here, we applied iTRAQ coupled with 2D LC-MS/MS proteomic analysis to investigate the proteomic expression profile of T. gondii oocysts during sporulation. Of the 2095 non-redundant proteins identified, 587 were identified as differentially expressed proteins (DEPs). Based on Gene Ontology enrichment and KEGG pathway analyses the majority of these DEPs were found related to the metabolism of amino acids, carbon and energy. Protein interaction network analysis generated by STRING identified ATP-citrate lyase (ACL), GMP synthase, IMP dehydrogenase (IMPDH), poly (ADP-ribose) glycohydrolase (PARG), and bifunctional dihydrofolate reductase-thymidylate synthase (DHFR-TS) as the top five hubs. We also identified 25 parasite virulence factors that were expressed at relatively high levels in sporulated oocysts compared to non-sporulated oocysts, which might contribute to the infectivity of mature oocysts. Considering the importance of oocysts in the dissemination of toxoplasmosis these findings may help in the search of protein targets with a key role in infectiousness and ecological success of oocysts, creating new opportunities for the development of better means for disease prevention. The development of new preventative interventions against T. gondii infection relies on an improved understanding of the proteome and chemical pathways of this parasite. To identify proteins required for the development of environmentally resistant and infective T. gondii oocysts, we compared the proteome of non-sporulated (immature) oocysts with the proteome of sporulated (mature, infective) oocysts. iTRAQ 2D-LC-MS/MS analysis revealed proteomic changes that distinguish non-sporulated from sporulated oocysts. Many of the differentially expressed proteins were involved in metabolic pathways and 25 virulence factors were identified upregulated in the sporulated oocysts. This work provides the first quantitative characterization of the proteomic variations that occur in T. gondii oocyst stage during sporulation. Copyright © 2016. Published by Elsevier B.V.
Megger, Dominik A; Padden, Juliet; Rosowski, Kristin; Uszkoreit, Julian; Bracht, Thilo; Eisenacher, Martin; Gerges, Christian; Neuhaus, Horst; Schumacher, Brigitte; Schlaak, Jörg F; Sitek, Barbara
2017-02-10
The proteome analysis of bile fluid represents a promising strategy to identify biomarker candidates for various diseases of the hepatobiliary system. However, to obtain substantive results in biomarker discovery studies large patient cohorts necessarily need to be analyzed. Consequently, this would lead to an unmanageable number of samples to be analyzed if sample preparation protocols with extensive fractionation methods are applied. Hence, the performance of simple workflows allowing for "one sample, one shot" experiments have been evaluated in this study. In detail, sixteen different protocols implying modifications at the stages of desalting, delipidation, deglycosylation and tryptic digestion have been examined. Each method has been individually evaluated regarding various performance criteria and comparative analyses have been conducted to uncover possible complementarities. Here, the best performance in terms of proteome coverage has been assessed for a combination of acetone precipitation with in-gel digestion. Finally, a mapping of all obtained protein identifications with putative biomarkers for hepatocellular carcinoma (HCC) and cholangiocellular carcinoma (CCC) revealed several proteins easily detectable in bile fluid. These results can build the basis for future studies with large and well-defined patient cohorts in a more disease-related context. Human bile fluid is a proximal body fluid and supposed to be a potential source of disease markers. However, due to its biochemical composition, the proteome analysis of bile fluid still represents a challenging task and is therefore mostly conducted using extensive fractionation procedures. This in turn leads to a high number of mass spectrometric measurements for one biological sample. Considering the fact that in order to overcome the biological variability a high number of biological samples needs to be analyzed in biomarker discovery studies, this leads to the dilemma of an unmanageable number of necessary MS-based analyses. Hence, easy sample preparation protocols are demanded representing a compromise between proteome coverage and simplicity. In the presented study, such protocols have been evaluated regarding various technical criteria (e.g. identification rates, missed cleavages, chromatographic separation) uncovering the strengths and weaknesses of various methods. Furthermore, a cumulative bile proteome list has been generated that extends the current bile proteome catalog by 248 proteins. Finally, a mapping with putative biomarkers for hepatocellular carcinoma (HCC) and cholangiocellular carcinoma (CCC) derived from tissue-based studies, revealed several of these proteins being easily and reproducibly detectable in human bile. Therefore, the presented technical work represents a solid base for future disease-related studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Computing Prediction and Functional Analysis of Prokaryotic Propionylation.
Wang, Li-Na; Shi, Shao-Ping; Wen, Ping-Ping; Zhou, Zhi-You; Qiu, Jian-Ding
2017-11-27
Identification and systematic analysis of candidates for protein propionylation are crucial steps for understanding its molecular mechanisms and biological functions. Although several proteome-scale methods have been performed to delineate potential propionylated proteins, the majority of lysine-propionylated substrates and their role in pathological physiology still remain largely unknown. By gathering various databases and literatures, experimental prokaryotic propionylation data were collated to be trained in a support vector machine with various features via a three-step feature selection method. A novel online tool for seeking potential lysine-propionylated sites (PropSeek) ( http://bioinfo.ncu.edu.cn/PropSeek.aspx ) was built. Independent test results of leave-one-out and n-fold cross-validation were similar to each other, showing that PropSeek is a stable and robust predictor with satisfying performance. Meanwhile, analyses of Gene Ontology, Kyoto Encyclopedia of Genes and Genomes pathways, and protein-protein interactions implied a potential role of prokaryotic propionylation in protein synthesis and metabolism.
Making the Most of Omics for Symbiosis Research
Chaston, J.; Douglas, A.E.
2012-01-01
Omics, including genomics, proteomics and metabolomics, enable us to explain symbioses in terms of the underlying molecules and their interactions. The central task is to transform molecular catalogs of genes, metabolites etc. into a dynamic understanding of symbiosis function. We review four exemplars of omics studies that achieve this goal, through defined biological questions relating to metabolic integration and regulation of animal-microbial symbioses, the genetic autonomy of bacterial symbionts, and symbiotic protection of animal hosts from pathogens. As omic datasets become increasingly complex, computationally-sophisticated downstream analyses are essential to reveal interactions not evident to visual inspection of the data. We discuss two approaches, phylogenomics and transcriptional clustering, that can divide the primary output of omics studies – long lists of factors – into manageable subsets, and we describe how they have been applied to analyze large datasets and generate testable hypotheses. PMID:22983030
Continually emerging mechanistic complexity of the multi-enzyme cellulosome complex.
Smith, Steven P; Bayer, Edward A; Czjzek, Mirjam
2017-06-01
The robust plant cell wall polysaccharide-degrading properties of anaerobic bacteria are harnessed within elegant, marcomolecular assemblages called cellulosomes, in which proteins of complementary activities amass on scaffold protein networks. Research efforts have focused and continue to focus on providing detailed mechanistic insights into cellulosomal complex assembly, topology, and function. The accumulated information is expanding our fundamental understanding of the lignocellulosic biomass decomposition process and enhancing the potential of engineered cellulosomal systems for biotechnological purposes. Ongoing biochemical studies continue to reveal unexpected functional diversity within traditional cellulase families. Genomic, proteomic, and functional analyses have uncovered unanticipated cellulosomal proteins that augment the function of the native and designer cellulosomes. In addition, complementary structural and computational methods are continuing to provide much needed insights on the influence of cellulosomal interdomain linker regions on cellulosomal assembly and activity. Copyright © 2017 Elsevier Ltd. All rights reserved.
Guo, Xiao; Niemi, Natalie M; Coon, Joshua J; Pagliarini, David J
2017-07-14
The pyruvate dehydrogenase complex (PDC) is the primary metabolic checkpoint connecting glycolysis and mitochondrial oxidative phosphorylation and is important for maintaining cellular and organismal glucose homeostasis. Phosphorylation of the PDC E1 subunit was identified as a key inhibitory modification in bovine tissue ∼50 years ago, and this regulatory process is now known to be conserved throughout evolution. Although Saccharomyces cerevisiae is a pervasive model organism for investigating cellular metabolism and its regulation by signaling processes, the phosphatase(s) responsible for activating the PDC in S. cerevisiae has not been conclusively defined. Here, using comparative mitochondrial phosphoproteomics, analyses of protein-protein interactions by affinity enrichment-mass spectrometry, and in vitro biochemistry, we define Ptc6p as the primary PDC phosphatase in S. cerevisiae Our analyses further suggest additional substrates for related S. cerevisiae phosphatases and describe the overall phosphoproteomic changes that accompany mitochondrial respiratory dysfunction. In summary, our quantitative proteomics and biochemical analyses have identified Ptc6p as the primary-and likely sole- S. cerevisiae PDC phosphatase, closing a key knowledge gap about the regulation of yeast mitochondrial metabolism. Our findings highlight the power of integrative omics and biochemical analyses for annotating the functions of poorly characterized signaling proteins. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
NASA Astrophysics Data System (ADS)
Pfammatter, Sibylle; Bonneil, Eric; McManus, Francis P.; Thibault, Pierre
2018-04-01
The small ubiquitin-like modifier (SUMO) is a member of the family of ubiquitin-like modifiers (UBLs) and is involved in important cellular processes, including DNA damage response, meiosis and cellular trafficking. The large-scale identification of SUMO peptides in a site-specific manner is challenging not only because of the low abundance and dynamic nature of this modification, but also due to the branched structure of the corresponding peptides that further complicate their identification using conventional search engines. Here, we exploited the unusual structure of SUMO peptides to facilitate their separation by high-field asymmetric waveform ion mobility spectrometry (FAIMS) and increase the coverage of SUMO proteome analysis. Upon trypsin digestion, branched peptides contain a SUMO remnant side chain and predominantly form triply protonated ions that facilitate their gas-phase separation using FAIMS. We evaluated the mobility characteristics of synthetic SUMO peptides and further demonstrated the application of FAIMS to profile the changes in protein SUMOylation of HEK293 cells following heat shock, a condition known to affect this modification. FAIMS typically provided a 10-fold improvement of detection limit of SUMO peptides, and enabled a 36% increase in SUMO proteome coverage compared to the same LC-MS/MS analyses performed without FAIMS. [Figure not available: see fulltext.
Shemesh-Mayer, Einat; Ben-Michael, Tomer; Rotem, Neta; Rabinowitch, Haim D.; Doron-Faigenboim, Adi; Kosmala, Arkadiusz; Perlikowski, Dawid; Sherman, Amir; Kamenetsky, Rina
2015-01-01
Commercial cultivars of garlic, a popular condiment, are sterile, making genetic studies and breeding of this plant challenging. However, recent fertility restoration has enabled advanced physiological and genetic research and hybridization in this important crop. Morphophysiological studies, combined with transcriptome and proteome analyses and quantitative PCR validation, enabled the identification of genes and specific processes involved in gametogenesis in fertile and male-sterile garlic genotypes. Both genotypes exhibit normal meiosis at early stages of anther development, but in the male-sterile plants, tapetal hypertrophy after microspore release leads to pollen degeneration. Transcriptome analysis and global gene-expression profiling showed that >16,000 genes are differentially expressed in the fertile vs. male-sterile developing flowers. Proteome analysis and quantitative comparison of 2D-gel protein maps revealed 36 significantly different protein spots, 9 of which were present only in the male-sterile genotype. Bioinformatic and quantitative PCR validation of 10 candidate genes exhibited significant expression differences between male-sterile and fertile flowers. A comparison of morphophysiological and molecular traits of fertile and male-sterile garlic flowers suggests that respiratory restrictions and/or non-regulated programmed cell death of the tapetum can lead to energy deficiency and consequent pollen abortion. Potential molecular markers for male fertility and sterility in garlic are proposed. PMID:25972879
Proteogenomics connects somatic mutations to signalling in breast cancer.
Mertins, Philipp; Mani, D R; Ruggles, Kelly V; Gillette, Michael A; Clauser, Karl R; Wang, Pei; Wang, Xianlong; Qiao, Jana W; Cao, Song; Petralia, Francesca; Kawaler, Emily; Mundt, Filip; Krug, Karsten; Tu, Zhidong; Lei, Jonathan T; Gatza, Michael L; Wilkerson, Matthew; Perou, Charles M; Yellapantula, Venkata; Huang, Kuan-lin; Lin, Chenwei; McLellan, Michael D; Yan, Ping; Davies, Sherri R; Townsend, R Reid; Skates, Steven J; Wang, Jing; Zhang, Bing; Kinsinger, Christopher R; Mesri, Mehdi; Rodriguez, Henry; Ding, Li; Paulovich, Amanda G; Fenyö, David; Ellis, Matthew J; Carr, Steven A
2016-06-02
Somatic mutations have been extensively characterized in breast cancer, but the effects of these genetic alterations on the proteomic landscape remain poorly understood. Here we describe quantitative mass-spectrometry-based proteomic and phosphoproteomic analyses of 105 genomically annotated breast cancers, of which 77 provided high-quality data. Integrated analyses provided insights into the somatic cancer genome including the consequences of chromosomal loss, such as the 5q deletion characteristic of basal-like breast cancer. Interrogation of the 5q trans-effects against the Library of Integrated Network-based Cellular Signatures, connected loss of CETN3 and SKP1 to elevated expression of epidermal growth factor receptor (EGFR), and SKP1 loss also to increased SRC tyrosine kinase. Global proteomic data confirmed a stromal-enriched group of proteins in addition to basal and luminal clusters, and pathway analysis of the phosphoproteome identified a G-protein-coupled receptor cluster that was not readily identified at the mRNA level. In addition to ERBB2, other amplicon-associated highly phosphorylated kinases were identified, including CDK12, PAK1, PTK2, RIPK2 and TLK2. We demonstrate that proteogenomic analysis of breast cancer elucidates the functional consequences of somatic mutations, narrows candidate nominations for driver genes within large deletions and amplified regions, and identifies therapeutic targets.
Li, Chien-Feng; Shen, Kun-Hung; Chien, Lan-Hsiang; Huang, Cheng-Hao; Wu, Ting-Feng; He, Hong-Lin
2018-04-19
Among various heterogeneous types of bladder tumors, urothelial carcinoma is the most prevalent lesion. Some of the urinary bladder urothelial carcinomas (UBUCs) develop local recurrence and may cause distal invasion. Galectin-1 de-regulation significantly affects cell transformation, cell proliferation, angiogenesis, and cell invasiveness. In continuation of our previous investigation on the role of galectin-1 in UBUC tumorigenesis, in this study, proteomics strategies were implemented in order to find more galectin-1-associated signaling pathways. The results of this study showed that galectin-1 knockdown could induce 15 down-regulated proteins and two up-regulated proteins in T24 cells. These de-regulated proteins might participate in lipid/amino acid/energy metabolism, cytoskeleton, cell proliferation, cell-cell interaction, cell apoptosis, metastasis, and protein degradation. The aforementioned dys-regulated proteins were confirmed by western immunoblotting. Proteomics results were further translated to prognostic markers by analyses of biopsy samples. Results of cohort studies demonstrated that over-expressions of glutamine synthetase, alcohol dehydrogenase (NADP⁺), fatty acid binding protein 4, and toll interacting protein in clinical specimens were all significantly associated with galectin-1 up-regulation. Univariate analyses showed that de-regulations of glutamine synthetase and fatty acid binding protein 4 in clinical samples were respectively linked to disease-specific survival and metastasis-free survival.
MaxReport: An Enhanced Proteomic Result Reporting Tool for MaxQuant.
Zhou, Tao; Li, Chuyu; Zhao, Wene; Wang, Xinru; Wang, Fuqiang; Sha, Jiahao
2016-01-01
MaxQuant is a proteomic software widely used for large-scale tandem mass spectrometry data. We have designed and developed an enhanced result reporting tool for MaxQuant, named as MaxReport. This tool can optimize the results of MaxQuant and provide additional functions for result interpretation. MaxReport can generate report tables for protein N-terminal modifications. It also supports isobaric labelling based relative quantification at the protein, peptide or site level. To obtain an overview of the results, MaxReport performs general descriptive statistical analyses for both identification and quantification results. The output results of MaxReport are well organized and therefore helpful for proteomic users to better understand and share their data. The script of MaxReport, which is freely available at http://websdoor.net/bioinfo/maxreport/, is developed using Python code and is compatible across multiple systems including Windows and Linux.
Piehowski, Paul D; Petyuk, Vladislav A; Sandoval, John D; Burnum, Kristin E; Kiebel, Gary R; Monroe, Matthew E; Anderson, Gordon A; Camp, David G; Smith, Richard D
2013-03-01
For bottom-up proteomics, there are wide variety of database-searching algorithms in use for matching peptide sequences to tandem MS spectra. Likewise, there are numerous strategies being employed to produce a confident list of peptide identifications from the different search algorithm outputs. Here we introduce a grid-search approach for determining optimal database filtering criteria in shotgun proteomics data analyses that is easily adaptable to any search. Systematic Trial and Error Parameter Selection--referred to as STEPS--utilizes user-defined parameter ranges to test a wide array of parameter combinations to arrive at an optimal "parameter set" for data filtering, thus maximizing confident identifications. The benefits of this approach in terms of numbers of true-positive identifications are demonstrated using datasets derived from immunoaffinity-depleted blood serum and a bacterial cell lysate, two common proteomics sample types. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zhang, Ying; Wang, Xi; Cui, Dan; Zhu, Jun
2016-12-01
Human whole saliva is a vital body fluid for studying the physiology and pathology of the oral cavity. As a powerful technique for biomarker discovery, MS-based proteomic strategies have been introduced for saliva analysis and identified hundreds of proteins and N-glycosylation sites. However, there is still a lack of quantitative analysis, which is necessary for biomarker screening and biological research. In this study, we establish an integrated workflow by the combination of stable isotope dimethyl labeling, HILIC enrichment, and high resolution MS for both quantification of the global proteome and N-glycoproteome of human saliva from oral ulcer patients. With the help of advanced bioinformatics, we comprehensively studied oral ulcers at both protein and glycoprotein scales. Bioinformatics analyses revealed that starch digestion and protein degradation activities are inhibited while the immune response is promoted in oral ulcer saliva. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Casey, Eoghan; Mahony, Jennifer; Neve, Horst; Noben, Jean-Paul; Dal Bello, Fabio; van Sinderen, Douwe
2015-02-01
Ldl1 is a virulent phage infecting the dairy starter Lactobacillus delbrueckii subsp. lactis LdlS. Electron microscopy analysis revealed that this phage exhibits a large head and a long tail and bears little resemblance to other characterized phages infecting Lactobacillus delbrueckii. In vitro propagation of this phage revealed a latent period of 30 to 40 min and a burst size of 59.9 +/- 1.9 phage particles. Comparative genomic and proteomic analyses showed remarkable similarity between the genome of Ldl1 and that of Lactobacillus plantarum phage ATCC 8014-B2. The genomic and proteomic characteristics of Ldl1 demonstrate that this phage does not belong to any of the four previously recognized L. delbrueckii phage groups, necessitating the creation of a new group, called group e, thus adding to the knowledge on the diversity of phages targeting strains of this industrially important lactic acid bacterial species.
MALDI versus ESI: The Impact of the Ion Source on Peptide Identification.
Nadler, Wiebke Maria; Waidelich, Dietmar; Kerner, Alexander; Hanke, Sabrina; Berg, Regina; Trumpp, Andreas; Rösli, Christoph
2017-03-03
For mass spectrometry-based proteomic analyses, electrospray ionization (ESI) and matrix-assisted laser desorption/ionization (MALDI) are the commonly used ionization techniques. To investigate the influence of the ion source on peptide detection in large-scale proteomics, an optimized GeLC/MS workflow was developed and applied either with ESI/MS or with MALDI/MS for the proteomic analysis of different human cell lines of pancreatic origin. Statistical analysis of the resulting data set with more than 72 000 peptides emphasized the complementary character of the two methods, as the percentage of peptides identified with both approaches was as low as 39%. Significant differences between the resulting peptide sets were observed with respect to amino acid composition, charge-related parameters, hydrophobicity, and modifications of the detected peptides and could be linked to factors governing the respective ion yields in ESI and MALDI.
Bossi, Alessandra; Andreoli, Matteo; Bonini, Francesca; Piletsky, Sergey
2007-09-01
Templating is an effective way for the structural modifications of a material and hence for altering its functional properties. Here protein imprinting was exploited to alter polymeric polyacrylamide (PAA) membranes. The sieving properties and selection abilities of the material formed were evaluated by studying the electrically driven transport of various proteins across templated PAA membranes. The sieving properties correlated with the templating process and depended on the quantity of template used during the polymerisation. For 1 mg/mL protein-templated membranes a 'gate effect' was shown, which induced a preferential migration of the template and of similar-size proteins. Such template preferential electrotransport was exploited for the selective removal of certain proteins in biological fluids prior to proteome analysis (depletion of albumin from human serum); the efficiency of the removal was demonstrated by analysing the serum proteome by two-dimensional electrophoresis experiments.
Microscale immobilized enzyme reactors in proteomics: latest developments.
Safdar, Muhammad; Spross, Jens; Jänis, Janne
2014-01-10
Enzymatic digestion of proteins is one of the key steps in proteomic analyses. There has been a steady progress in the applied digestion protocols in the past, starting from conventional time-consuming in-solution or in-gel digestion protocols to rapid and efficient methods utilizing different types of microscale enzyme reactors. Application of such microreactors has been proven beneficial due to lower sample consumption, higher sensitivity and straightforward coupling with LC-MS set-ups. Novel stationary phases, immobilization techniques and device formats are being constantly developed and tested to optimize digestion efficiency of proteolytic enzymes. This review focuses on the latest developments associated with the preparation and application of microscale enzyme reactors for proteomics applications since 2008 onwards. A special attention has been paid to the discussion of different stationary phases applied for immobilization purposes. Copyright © 2013 Elsevier B.V. All rights reserved.
The Expanding Landscape of the Thiol Redox Proteome*
Yang, Jing; Carroll, Kate S.; Liebler, Daniel C.
2016-01-01
Cysteine occupies a unique place in protein chemistry. The nucleophilic thiol group allows cysteine to undergo a broad range of redox modifications beyond classical thiol-disulfide redox equilibria, including S-sulfenylation (-SOH), S-sulfinylation (-SO2H), S-sulfonylation (-SO3H), S-nitrosylation (-SNO), S-sulfhydration (-SSH), S-glutathionylation (-SSG), and others. Emerging evidence suggests that these post-translational modifications (PTM) are important in cellular redox regulation and protection against oxidative damage. Identification of protein targets of thiol redox modifications is crucial to understanding their roles in biology and disease. However, analysis of these highly labile and dynamic modifications poses challenges. Recent advances in the design of probes for thiol redox forms, together with innovative mass spectrometry based chemoproteomics methods make it possible to perform global, site-specific, and quantitative analyses of thiol redox modifications in complex proteomes. Here, we review chemical proteomic strategies used to expand the landscape of thiol redox modifications. PMID:26518762
A glimpse into the proteome of phototrophic bacterium Rhodobacter capsulatus.
Onder, Ozlem; Aygun-Sunar, Semra; Selamoglu, Nur; Daldal, Fevzi
2010-01-01
A first glimpse into the proteome of Rhodobacter capsulatus revealed more than 450 (with over 210 cytoplasmic and 185 extracytoplasmic known as well as 55 unknown) proteins that are identified with high degree of confidence using nLC-MS/MS analyses. The accumulated data provide a solid platform for ongoing efforts to establish the proteome of this species and the cellular locations of its constituents. They also indicate that at least 40 of the identified proteins, which were annotated in genome databases as unknown hypothetical proteins, correspond to predicted translation products that are indeed present in cells under the growth conditions used in this work. In addition, matching the identification labels of the proteins reported between the two available R. capsulatus genome databases (ERGO-light with RRCxxxxx and NT05 with NT05RCxxxx numbers) indicated that 11 such proteins are listed only in the latter database.
A tutorial in displaying mass spectrometry-based proteomic data using heat maps.
Key, Melissa
2012-01-01
Data visualization plays a critical role in interpreting experimental results of proteomic experiments. Heat maps are particularly useful for this task, as they allow us to find quantitative patterns across proteins and biological samples simultaneously. The quality of a heat map can be vastly improved by understanding the options available to display and organize the data in the heat map. This tutorial illustrates how to optimize heat maps for proteomics data by incorporating known characteristics of the data into the image. First, the concepts used to guide the creating of heat maps are demonstrated. Then, these concepts are applied to two types of analysis: visualizing spectral features across biological samples, and presenting the results of tests of statistical significance. For all examples we provide details of computer code in the open-source statistical programming language R, which can be used for biologists and clinicians with little statistical background. Heat maps are a useful tool for presenting quantitative proteomic data organized in a matrix format. Understanding and optimizing the parameters used to create the heat map can vastly improve both the appearance and the interoperation of heat map data.
Rodent Research-1 Validation of Rodent Hardware
NASA Technical Reports Server (NTRS)
Globus, Ruth; Beegle, Janet
2013-01-01
To achieve novel science objectives, validation of a rodent habitat on ISS will enable - In-flight analyses during long duration spaceflight- Use of genetically altered animals- Application of modern analytical techniques (e.g. genomics, proteomics, and metabolomics)
A defined medium for Leishmania culture allows definition of essential amino acids.
Nayak, Archana; Akpunarlieva, Snezhana; Barrett, Michael; Burchmore, Richard
2018-02-01
Axenic culture of Leishmania is generally performed in rich, serum-supplemented media which sustain robust growth over multiple passages. The use of such undefined media, however, obscures proteomic analyses and confounds the study of metabolism. We have established a simple, defined culture medium that supports the sustained growth of promastigotes over multiple passages and which yields parasites that have similar infectivity to macrophages to parasites grown in a conventional semi-defined medium. We have exploited this medium to investigate the amino acid requirements of promastigotes in culture and have found that phenylalanine, tryptophan, arginine, leucine, lysine and valine are essential for viability in culture. Most of the 20 proteogenic amino acids promote growth of Leishmania promastigotes, with the exception of alanine, asparagine, and glycine. This defined medium will be useful for further studies of promastigote substrate requirements, and will facilitate future proteomic and metabolomic analyses. Copyright © 2018 Elsevier Inc. All rights reserved.
Yuan, Shuiqiao; Zheng, Huili; Zheng, Zhihong; Yan, Wei
2013-01-01
A small portion of cytoplasm is generally retained as the cytoplasmic droplet (CD) on the flagellum of spermatozoa after spermiation in mice. CDs are believed to play a role in osmoadaptation by allowing water entrance or exit. However, many lines of evidence suggest that CDs may have roles beyond osmoregulation. To gain more insights, we purified CDs from murine epididymal spermatozoa and conducted proteomic analyses on proteins highly enriched in CDs. Among 105 proteins identified, 71 (68%) were enzymes involved in energy metabolism. We also found that sperm mitochondria underwent a reactivation process and glycolytic enzymes were further distributed and incorporated into different regions of the flagellum during epididymal sperm maturation. Both processes appeared to require CDs. Our data suggest that the CD represents a transient organelle that serves as an energy source essential for epididymal sperm maturation. PMID:24155961
Mol, Praseeda; Kannegundla, Uday; Dey, Gourav; Gopalakrishnan, Lathika; Dammalli, Manjunath; Kumar, Manish; Patil, Arun H; Basavaraju, Marappa; Rao, Akhila; Ramesha, Kerekoppa P; Prasad, Thottethodi Subrahmanya Keshava
2018-03-01
Bovine milk is important for both veterinary medicine and human nutrition. Understanding the bovine milk proteome at different stages of lactation has therefore broad significance for integrative biology and clinical medicine as well. Indeed, different lactation stages have marked influence on the milk yield, milk constituents, and nourishment of the neonates. We performed a comparative proteome analysis of the bovine milk obtained at different stages of lactation from the Indian indigenous cattle Malnad Gidda (Bos indicus), a widely available breed. The milk differential proteome during the lactation stages in B. indicus has not been investigated to date. Using high-resolution mass spectrometry-based quantitative proteomics of the bovine whey proteins at early, mid, and late lactation stages, we identified a total of 564 proteins, out of which 403 proteins were found to be differentially abundant at different lactation stages. As is expected of any body fluid proteome, 51% of the proteins identified in the milk were found to have signal peptides. Gene ontology analyses were carried out to categorize proteins altered across different lactation stages based on biological process and molecular function, which enabled us to correlate their significance in each lactation stage. We also investigated the potential pathways enriched in different lactation stages using bioinformatics pathway analysis tools. To the best of our knowledge, this study represents the first and largest inventory of milk proteins identified to date for an Indian cattle breed. We believe that the current study broadly informs both veterinary omics research and the emerging field of nutriproteomics during lactation stages.
Weissgerber, Thomas; Sylvester, Marc; Kröninger, Lena
2014-01-01
In the present study, we compared the proteome response of Allochromatium vinosum when growing photoautotrophically in the presence of sulfide, thiosulfate, and elemental sulfur with the proteome response when the organism was growing photoheterotrophically on malate. Applying tandem mass tag analysis as well as two-dimensional (2D) PAGE, we detected 1,955 of the 3,302 predicted proteins by identification of at least two peptides (59.2%) and quantified 1,848 of the identified proteins. Altered relative protein amounts (≥1.5-fold) were observed for 385 proteins, corresponding to 20.8% of the quantified A. vinosum proteome. A significant number of the proteins exhibiting strongly enhanced relative protein levels in the presence of reduced sulfur compounds are well documented essential players during oxidative sulfur metabolism, e.g., the dissimilatory sulfite reductase DsrAB. Changes in protein levels generally matched those observed for the respective relative mRNA levels in a previous study and allowed identification of new genes/proteins participating in oxidative sulfur metabolism. One gene cluster (hyd; Alvin_2036-Alvin_2040) and one hypothetical protein (Alvin_2107) exhibiting strong responses on both the transcriptome and proteome levels were chosen for gene inactivation and phenotypic analyses of the respective mutant strains, which verified the importance of the so-called Isp hydrogenase supercomplex for efficient oxidation of sulfide and a crucial role of Alvin_2107 for the oxidation of sulfur stored in sulfur globules to sulfite. In addition, we analyzed the sulfur globule proteome and identified a new sulfur globule protein (SgpD; Alvin_2515). PMID:24487535
Moore, Henna M; Bai, Baoyan; Boisvert, François-Michel; Latonen, Leena; Rantanen, Ville; Simpson, Jeremy C; Pepperkok, Rainer; Lamond, Angus I; Laiho, Marikki
2011-10-01
The nucleolus is a nuclear organelle that coordinates rRNA transcription and ribosome subunit biogenesis. Recent proteomic analyses have shown that the nucleolus contains proteins involved in cell cycle control, DNA processing and DNA damage response and repair, in addition to the many proteins connected with ribosome subunit production. Here we study the dynamics of nucleolar protein responses in cells exposed to stress and DNA damage caused by ionizing and ultraviolet (UV) radiation in diploid human fibroblasts. We show using a combination of imaging and quantitative proteomics methods that nucleolar substructure and the nucleolar proteome undergo selective reorganization in response to UV damage. The proteomic responses to UV include alterations of functional protein complexes such as the SSU processome and exosome, and paraspeckle proteins, involving both decreases and increases in steady state protein ratios, respectively. Several nonhomologous end-joining proteins (NHEJ), such as Ku70/80, display similar fast responses to UV. In contrast, nucleolar proteomic responses to IR are both temporally and spatially distinct from those caused by UV, and more limited in terms of magnitude. With the exception of the NHEJ and paraspeckle proteins, where IR induces rapid and transient changes within 15 min of the damage, IR does not alter the ratios of most other functional nucleolar protein complexes. The rapid transient decrease of NHEJ proteins in the nucleolus indicates that it may reflect a response to DNA damage. Our results underline that the nucleolus is a specific stress response organelle that responds to different damage and stress agents in a unique, damage-specific manner.
Proteomic analysis of human aqueous humor using multidimensional protein identification technology
Richardson, Matthew R.; Price, Marianne O.; Price, Francis W.; Pardo, Jennifer C.; Grandin, Juan C.; You, Jinsam; Wang, Mu
2009-01-01
Aqueous humor (AH) supports avascular tissues in the anterior segment of the eye, maintains intraocular pressure, and potentially influences the pathogenesis of ocular diseases. Nevertheless, the AH proteome is still poorly defined despite several previous efforts, which were hindered by interfering high abundance proteins, inadequate animal models, and limited proteomic technologies. To facilitate future investigations into AH function, the AH proteome was extensively characterized using an advanced proteomic approach. Samples from patients undergoing cataract surgery were pooled and depleted of interfering abundant proteins and thereby divided into two fractions: albumin-bound and albumin-depleted. Multidimensional Protein Identification Technology (MudPIT) was utilized for each fraction; this incorporates strong cation exchange chromatography to reduce sample complexity before reversed-phase liquid chromatography and tandem mass spectrometric analysis. Twelve proteins had multi-peptide, high confidence identifications in the albumin-bound fraction and 50 proteins had multi-peptide, high confidence identifications in the albumin-depleted fraction. Gene ontological analyses were performed to determine which cellular components and functions were enriched. Many proteins were previously identified in the AH and for several their potential role in the AH has been investigated; however, the majority of identified proteins were novel and only speculative roles can be suggested. The AH was abundant in anti-oxidant and immunoregulatory proteins as well as anti-angiogenic proteins, which may be involved in maintaining the avascular tissues. This is the first known report to extensively characterize and describe the human AH proteome and lays the foundation for future work regarding its function in homeostatic and pathologic states. PMID:20019884
Nuclear proteome analysis of undifferentiated mouse embryonic stem and germ cells.
Buhr, Nicolas; Carapito, Christine; Schaeffer, Christine; Kieffer, Emmanuelle; Van Dorsselaer, Alain; Viville, Stéphane
2008-06-01
Embryonic stem cells (ESCs) and embryonic germ cells (EGCs) provide exciting models for understanding the underlying mechanisms that make a cell pluripotent. Indeed, such understanding would enable dedifferentiation and reprogrammation of any cell type from a patient needing a cell therapy treatment. Proteome analysis has emerged as an important technology for deciphering these biological processes and thereby ESC and EGC proteomes are increasingly studied. Nevertheless, their nuclear proteomes have only been poorly investigated up to now. In order to investigate signaling pathways potentially involved in pluripotency, proteomic analyses have been performed on mouse ESC and EGC nuclear proteins. Nuclei from ESCs and EGCs at undifferentiated stage were purified by subcellular fractionation. After 2-D separation, a subtractive strategy (subtracting culture environment contaminating spots) was applied and a comparison of ESC, (8.5 day post coïtum (dpc))-EGC and (11.5 dpc)-EGC specific nuclear proteomes was performed. A total of 33 ESC, 53 (8.5 dpc)-EGC, and 36 (11.5 dpc)-EGC spots were identified by MALDI-TOF-MS and/or nano-LC-MS/MS. This approach led to the identification of two isoforms (with and without N-terminal acetylation) of a known pluripotency marker, namely developmental pluripotency associated 5 (DPPA5), which has never been identified before in 2-D gel-MS studies of ESCs and EGCs. Furthermore, we demonstrated the efficiency of our subtracting strategy, in association with a nuclear subfractionation by the identification of a new protein (protein arginine N-methyltransferase 7; PRMT7) behaving as proteins involved in pluripotency.
Hulme, Charlotte H; Wilson, Emma L; Fuller, Heidi R; Roberts, Sally; Richardson, James B; Gallacher, Pete; Peffers, Mandy J; Shirran, Sally L; Botting, Catherine H; Wright, Karina T
2018-05-02
Autologous chondrocyte implantation (ACI) has a failure rate of approximately 20%, but it is yet to be fully understood why. Biomarkers are needed that can pre-operatively predict in which patients it is likely to fail, so that alternative or individualised therapies can be offered. We previously used label-free quantitation (LF) with a dynamic range compression proteomic approach to assess the synovial fluid (SF) of ACI responders and non-responders. However, we were able to identify only a few differentially abundant proteins at baseline. In the present study, we built upon these previous findings by assessing higher-abundance proteins within this SF, providing a more global proteomic analysis on the basis of which more of the biology underlying ACI success or failure can be understood. Isobaric tagging for relative and absolute quantitation (iTRAQ) proteomic analysis was used to assess SF from ACI responders (mean Lysholm improvement of 33; n = 14) and non-responders (mean Lysholm decrease of 14; n = 13) at the two stages of surgery (cartilage harvest and chondrocyte implantation). Differentially abundant proteins in iTRAQ and combined iTRAQ and LF datasets were investigated using pathway and network analyses. iTRAQ proteomic analysis confirmed our previous finding that there is a marked proteomic shift in response to cartilage harvest (70 and 54 proteins demonstrating ≥ 2.0-fold change and p < 0.05 between stages I and II in responders and non-responders, respectively). Further, it highlighted 28 proteins that were differentially abundant between responders and non-responders to ACI, which were not found in the LF study, 16 of which were altered at baseline. The differential expression of two proteins (complement C1s subcomponent and matrix metalloproteinase 3) was confirmed biochemically. Combination of the iTRAQ and LF proteomic datasets generated in-depth SF proteome information that was used to generate interactome networks representing ACI success or failure. Functional pathways that are dysregulated in ACI non-responders were identified, including acute-phase response signalling. Several candidate biomarkers for baseline prediction of ACI outcome were identified. A holistic overview of the SF proteome in responders and non-responders to ACI has been profiled, providing a better understanding of the biological pathways underlying clinical outcome, particularly the differential response to cartilage harvest in non-responders.
Marc Snir | Argonne National Laboratory
Molecular biology Proteomics Environmental science & technology Air quality Atmospheric & climate , H.S., Jr., Demonstrating the scalability of a molecular dynamics application on a Petaflop computer Transformations IGSBInstitute for Genomics and Systems Biology IMEInstitute for Molecular Engineering JCESRJoint
EPA SCIENCE FORUM - EPA'S TOXICOGENOMICS PARTNERSHIPS ACROSS GOVERNMENT, ACADEMIA AND INDUSTRY
Over the past decade genomics, proteomics and metabonomics technologies have transformed the science of toxicology, and concurrent advances in computing and informatics have provided management and analysis solutions for this onslaught of toxicogenomic data. EPA has been actively...
Rajjou, Loïc; Belghazi, Maya; Huguet, Romain; Robin, Caroline; Moreau, Adrien; Job, Claudette; Job, Dominique
2006-07-01
The influence of salicylic acid (SA) on elicitation of defense mechanisms in Arabidopsis (Arabidopsis thaliana) seeds and seedlings was assessed by physiological measurements combined with global expression profiling (proteomics). Parallel experiments were carried out using the NahG transgenic plants expressing the bacterial gene encoding SA hydroxylase, which cannot accumulate the active form of this plant defense elicitor. SA markedly improved germination under salt stress. Proteomic analyses disclosed a specific accumulation of protein spots regulated by SA as inferred by silver-nitrate staining of two-dimensional gels, detection of carbonylated (oxidized) proteins, and neosynthesized proteins with [35S]-methionine. The combined results revealed several processes potentially affected by SA. This molecule enhanced the reinduction of the late maturation program during early stages of germination, thereby allowing the germinating seeds to reinforce their capacity to mount adaptive responses in environmental water stress. Other processes affected by SA concerned the quality of protein translation, the priming of seed metabolism, the synthesis of antioxidant enzymes, and the mobilization of seed storage proteins. All the observed effects are likely to improve seed vigor. Another aspect revealed by this study concerned the oxidative stress entailed by SA in germinating seeds, as inferred from a characterization of the carbonylated (oxidized) proteome. Finally, the proteomic data revealed a close interplay between abscisic signaling and SA elicitation of seed vigor.
Zeng, Yunliu; Pan, Zhiyong; Ding, Yuduan; Zhu, Andan; Cao, Hongbo; Xu, Qiang; Deng, Xiuxin
2011-01-01
Here, a comprehensive proteomic analysis of the chromoplasts purified from sweet orange using Nycodenz density gradient centrifugation is reported. A GeLC-MS/MS shotgun approach was used to identify the proteins of pooled chromoplast samples. A total of 493 proteins were identified from purified chromoplasts, of which 418 are putative plastid proteins based on in silico sequence homology and functional analyses. Based on the predicted functions of these identified plastid proteins, a large proportion (∼60%) of the chromoplast proteome of sweet orange is constituted by proteins involved in carbohydrate metabolism, amino acid/protein synthesis, and secondary metabolism. Of note, HDS (hydroxymethylbutenyl 4-diphosphate synthase), PAP (plastid-lipid-associated protein), and psHSPs (plastid small heat shock proteins) involved in the synthesis or storage of carotenoid and stress response are among the most abundant proteins identified. A comparison of chromoplast proteomes between sweet orange and tomato suggested a high level of conservation in a broad range of metabolic pathways. However, the citrus chromoplast was characterized by more extensive carotenoid synthesis, extensive amino acid synthesis without nitrogen assimilation, and evidence for lipid metabolism concerning jasmonic acid synthesis. In conclusion, this study provides an insight into the major metabolic pathways as well as some unique characteristics of the sweet orange chromoplasts at the whole proteome level. PMID:21841170
Kim, Jin Yeong; Wu, Jingni; Kwon, Soon Jae; Oh, Haram; Lee, So Eui; Kim, Sang Gon; Wang, Yiming; Agrawal, Ganesh Kumar; Rakwal, Randeep; Kang, Kyu Young; Ahn, Il-Pyung; Kim, Beom-Gi; Kim, Sun Tae
2014-10-01
Necrotrophic fungal pathogen Cochliobolus miyabeanus causes brown spot disease in rice leaves upon infection, resulting in critical rice yield loss. To better understand the rice-C. miyabeanus interaction, we employed proteomic approaches to establish differential proteomes of total and secreted proteins from the inoculated leaves. The 2DE approach after PEG-fractionation of total proteins coupled with MS (MALDI-TOF/TOF and nESI-LC-MS/MS) analyses led to identification of 49 unique proteins out of 63 differential spots. SDS-PAGE in combination with nESI-LC-MS/MS shotgun approach was applied to identify secreted proteins in the leaf apoplast upon infection and resulted in cataloging of 501 unique proteins, of which 470 and 31 proteins were secreted from rice and C. miyabeanus, respectively. Proteins mapped onto metabolic pathways implied their reprogramming upon infection. The enzymes involved in Calvin cycle and glycolysis decreased in their protein abundance, whereas enzymes in the TCA cycle, amino acids, and ethylene biosynthesis increased. Differential proteomes also generated distribution of identified proteins in the intracellular and extracellular spaces, providing a better insight into defense responses of proteins in rice against C. miyabeanus. Established proteome of the rice-C. miyabeanus interaction serves not only as a good resource for the scientific community but also highlights its significance from biological aspects. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Kültz, Dietmar; Li, Johnathon; Zhang, Xuezhen; Villarreal, Fernando; Pham, Tuan; Paguio, Darlene
2015-12-01
Molecular phenotypes that distinguish resident marine (Bodega Harbor) from landlocked freshwater (FW, Lake Solano) three-spined sticklebacks were revealed by label-free quantitative proteomics. Secreted plasma proteins involved in lipid transport, blood coagulation, proteolysis, plasminogen-activating cascades, extracellular stimulus responses, and immunity are most abundant in this species. Globulins and albumins are much less abundant than in mammalian plasma. Unbiased quantitative proteome profiling identified 45 highly population-specific plasma proteins. Population-specific abundance differences were validated by targeted proteomics based on data-independent acquisition. Gene ontology enrichment analyses and known functions of population-specific plasma proteins indicate enrichment of processes controlling cell adhesion, tissue remodeling, proteolytic processing, and defense signaling in marine sticklebacks. Moreover, fetuin B and leukocyte cell derived chemotaxin 2 are much more abundant in marine fish. These proteins promote bone morphogenesis and likely contribute to population-specific body armor differences. Plasma proteins enriched in FW fish promote translation, heme biosynthesis, and lipid transport, suggesting a greater presence of plasma microparticles. Many prominent population-specific plasma proteins (e.g. apoptosis-associated speck-like protein containing a CARD) lack any homolog of known function or adequate functional characterization. Their functional characterization and the identification of population-specific environmental contexts and selective pressures that cause plasma proteome diversification are future directions emerging from this study. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Nesvizhskii, Alexey I.
2010-01-01
This manuscript provides a comprehensive review of the peptide and protein identification process using tandem mass spectrometry (MS/MS) data generated in shotgun proteomic experiments. The commonly used methods for assigning peptide sequences to MS/MS spectra are critically discussed and compared, from basic strategies to advanced multi-stage approaches. A particular attention is paid to the problem of false-positive identifications. Existing statistical approaches for assessing the significance of peptide to spectrum matches are surveyed, ranging from single-spectrum approaches such as expectation values to global error rate estimation procedures such as false discovery rates and posterior probabilities. The importance of using auxiliary discriminant information (mass accuracy, peptide separation coordinates, digestion properties, and etc.) is discussed, and advanced computational approaches for joint modeling of multiple sources of information are presented. This review also includes a detailed analysis of the issues affecting the interpretation of data at the protein level, including the amplification of error rates when going from peptide to protein level, and the ambiguities in inferring the identifies of sample proteins in the presence of shared peptides. Commonly used methods for computing protein-level confidence scores are discussed in detail. The review concludes with a discussion of several outstanding computational issues. PMID:20816881
Zakirova, Zuchra; Reed, Jon; Crynen, Gogce; Horne, Lauren; Hassan, Samira; Mathura, Venkatarajan; Mullan, Michael; Crawford, Fiona; Ait-Ghezala, Ghania
2017-09-01
Long-term consequences of combined pyridostigmine bromide (PB) and permethrin (PER) exposure in C57BL6/J mice using a well-characterized mouse model of exposure to these Gulf War (GW) agents were explored at the protein level. We used orthogonal proteomic approaches to identify pathways that are chronically impacted in the mouse CNS due to semiacute GW agent exposure early in life. These analyses were performed on soluble and membrane-bound protein fractions from brain samples using two orthogonal isotopic labeling LC-MS/MS proteomic approaches-stable isotope dimethyl labeling and iTRAQ. The use of these approaches allowed for greater coverage of proteins than was possible by either one alone and revealed both distinct and overlapping datasets. This combined analysis identified changes in several mitochondrial, as well as immune and inflammatory pathways after GW agent exposure. The work discussed here provides insight into GW agent exposure dependent mechanisms that adversely affect mitochondrial function and immune and inflammatory regulation. Collectively, our work identified key pathways which were chronically impacted in the mouse CNS following acute GW agent exposure, this may lead to the identification of potential targets for therapeutic intervention in the future. Long-term consequences of combined PB and PER exposure in C57BL6/J mice using a well-characterized mouse model of exposure to these GW agents were explored at the protein level. Expanding on earlier work, we used orthogonal proteomic approaches to identify pathways that are chronically impacted in the mouse CNS due to semiacute GW agent exposure early in life. These analyses were performed on soluble and membrane-bound protein fractions from brain samples using two orthogonal isotopic labeling LC-MS/MS proteomic approaches-stable isotope dimethyl labeling and iTRAQ. The use of these approaches allowed for greater coverage of proteins than was possible by either one alone and revealed both distinct and overlapping datasets. This combined analysis identified changes in several mitochondrial, as well as immune and inflammatory pathways after GW agent exposure. The work discussed here provides insight into GW agent exposure dependent mechanisms that adversely affect mitochondrial function and immune and inflammatory regulation at 5 months postexposure to PB + PER. © 2017 The Authors. PROTEOMICS - Clinical Applications published by WILEY-VCH Verlag GmbH & Co. KGaA.
Triboulet, Sarah; Aude-Garcia, Catherine; Armand, Lucie; Collin-Faure, Véronique; Chevallet, Mireille; Diemer, Hélène; Gerdil, Adèle; Proamer, Fabienne; Strub, Jean-Marc; Habert, Aurélie; Herlin, Nathalie; Van Dorsselaer, Alain; Carrière, Marie; Rabilloud, Thierry
2015-01-01
Titanium dioxide and copper oxide nanoparticles are more and more widely used because of their catalytic properties, of their light absorbing properties (titanium dioxide) or of their biocidal properties (copper oxide), increasing the risk of adverse health effects. In this frame, the responses of mouse macrophages were studied. Both proteomic and targeted analyses were performed to investigate several parameters, such as phagocytic capacity, cytokine release, copper release, and response at sub toxic doses. Besides titanium dioxide and copper oxide nanoparticles, copper ions were used as controls. We also showed that the overall copper release in the cell does not explain per se the toxicity observed with copper oxide nanoparticles. In addition, both copper ion and copper oxide nanoparticles, but not titanium oxide, induced DNA strands breaks in macrophages. As to functional responses, the phagocytic capacity was not hampered by any of the treatments at non-toxic doses, while copper ion decreased the lipopolysaccharide-induced cytokine and nitric oxide productions. The proteomic analyses highlighted very few changes induced by titanium dioxide nanoparticles, but an induction of heme oxygenase, an increase of glutathione synthesis and a decrease of tetrahydrobiopterin in response to copper oxide nanoparticles. Subsequent targeted analyses demonstrated that the increase in glutathione biosynthesis and the induction of heme oxygenase (e.g. by lovastatin/monacolin K) are critical for macrophages to survive a copper challenge, and that the intermediates of the catecholamine pathway induce a strong cross toxicity with copper oxide nanoparticles and copper ions. PMID:25902355
Qendro, Veneta; Bugos, Grace A; Lundgren, Debbie H; Glynn, John; Han, May H; Han, David K
2017-03-01
In order to gain mechanistic insights into multiple sclerosis (MS) pathogenesis, we utilized a multi-dimensional approach to test the hypothesis that mutations in myelin proteins lead to immune activation and central nervous system autoimmunity in MS. Mass spectrometry-based proteomic analysis of human MS brain lesions revealed seven unique mutations of PLP1; a key myelin protein that is known to be destroyed in MS. Surprisingly, in-depth genomic analysis of two MS patients at the genomic DNA and mRNA confirmed mutated PLP1 in RNA, but not in the genomic DNA. Quantification of wild type and mutant PLP RNA levels by qPCR further validated the presence of mutant PLP RNA in the MS patients. To seek evidence linking mutations in abundant myelin proteins and immune-mediated destruction of myelin, specific immune response against mutant PLP1 in MS patients was examined. Thus, we have designed paired, wild type and mutant peptide microarrays, and examined antibody response to multiple mutated PLP1 in sera from MS patients. Consistent with the idea of different patients exhibiting unique mutation profiles, we found that 13 out of 20 MS patients showed antibody responses against specific but not against all the mutant-PLP1 peptides. Interestingly, we found mutant PLP-directed antibody response against specific mutant peptides in the sera of pre-MS controls. The results from integrative proteomic, genomic, and immune analyses reveal a possible mechanism of mutation-driven pathogenesis in human MS. The study also highlights the need for integrative genomic and proteomic analyses for uncovering pathogenic mechanisms of human diseases. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rudnick, Paul A.; Markey, Sanford P.; Roth, Jeri; Mirokhin, Yuri; Yan, Xinjian; Tchekhovskoi, Dmitrii V.; Edwards, Nathan J.; Thangudu, Ratna R.; Ketchum, Karen A.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Stephen E.
2016-01-01
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has produced large proteomics datasets from the mass spectrometric interrogation of tumor samples previously analyzed by The Cancer Genome Atlas (TCGA) program. The availability of the genomic and proteomic data is enabling proteogenomic study for both reference (i.e., contained in major sequence databases) and non-reference markers of cancer. The CPTAC labs have focused on colon, breast, and ovarian tissues in the first round of analyses; spectra from these datasets were produced from 2D LC-MS/MS analyses and represent deep coverage. To reduce the variability introduced by disparate data analysis platforms (e.g., software packages, versions, parameters, sequence databases, etc.), the CPTAC Common Data Analysis Platform (CDAP) was created. The CDAP produces both peptide-spectrum-match (PSM) reports and gene-level reports. The pipeline processes raw mass spectrometry data according to the following: (1) Peak-picking and quantitative data extraction, (2) database searching, (3) gene-based protein parsimony, and (4) false discovery rate (FDR)-based filtering. The pipeline also produces localization scores for the phosphopeptide enrichment studies using the PhosphoRS program. Quantitative information for each of the datasets is specific to the sample processing, with PSM and protein reports containing the spectrum-level or gene-level (“rolled-up”) precursor peak areas and spectral counts for label-free or reporter ion log-ratios for 4plex iTRAQ™. The reports are available in simple tab-delimited formats and, for the PSM-reports, in mzIdentML. The goal of the CDAP is to provide standard, uniform reports for all of the CPTAC data, enabling comparisons between different samples and cancer types as well as across the major ‘omics fields. PMID:26860878
Rudnick, Paul A; Markey, Sanford P; Roth, Jeri; Mirokhin, Yuri; Yan, Xinjian; Tchekhovskoi, Dmitrii V; Edwards, Nathan J; Thangudu, Ratna R; Ketchum, Karen A; Kinsinger, Christopher R; Mesri, Mehdi; Rodriguez, Henry; Stein, Stephen E
2016-03-04
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has produced large proteomics data sets from the mass spectrometric interrogation of tumor samples previously analyzed by The Cancer Genome Atlas (TCGA) program. The availability of the genomic and proteomic data is enabling proteogenomic study for both reference (i.e., contained in major sequence databases) and nonreference markers of cancer. The CPTAC laboratories have focused on colon, breast, and ovarian tissues in the first round of analyses; spectra from these data sets were produced from 2D liquid chromatography-tandem mass spectrometry analyses and represent deep coverage. To reduce the variability introduced by disparate data analysis platforms (e.g., software packages, versions, parameters, sequence databases, etc.), the CPTAC Common Data Analysis Platform (CDAP) was created. The CDAP produces both peptide-spectrum-match (PSM) reports and gene-level reports. The pipeline processes raw mass spectrometry data according to the following: (1) peak-picking and quantitative data extraction, (2) database searching, (3) gene-based protein parsimony, and (4) false-discovery rate-based filtering. The pipeline also produces localization scores for the phosphopeptide enrichment studies using the PhosphoRS program. Quantitative information for each of the data sets is specific to the sample processing, with PSM and protein reports containing the spectrum-level or gene-level ("rolled-up") precursor peak areas and spectral counts for label-free or reporter ion log-ratios for 4plex iTRAQ. The reports are available in simple tab-delimited formats and, for the PSM-reports, in mzIdentML. The goal of the CDAP is to provide standard, uniform reports for all of the CPTAC data to enable comparisons between different samples and cancer types as well as across the major omics fields.
Campos, Jonatan Marques; Neves, Leandro Xavier; de Paiva, Nívia Carolina Nogueira; de Oliveira E Castro, Renata Alves; Casé, Ana Helena; Carneiro, Cláudia Martins; Andrade, Milton Hércules Guerra; Castro-Borges, William
2017-01-16
Schistosomiasis is an endemic disease affecting over 207 million people worldwide caused by helminth parasites of the genus Schistosoma. In Brazil the disease is responsible for the loss of up to 800 lives annually, resulting from the desabilitating effects of this chronic condition. In this study, we infected Balb/c mice with Schistosoma mansoni and analysed global changes in the proteomic profile of soluble liver proteins. Our shotgun analyses revealed predominance of up-regulation of proteins at 5weeks of infection, coinciding with the onset of egg laying, and a remarkable down-regulation of liver constituents at 7weeks, when severe tissue damage is installed. Representatives of glycolytic enzymes and stress response (in particular at the endoplasmic reticulum) were among the most differentially expressed molecules found in the infected liver. Collectively, our data contribute over 70 molecules not previously reported to be found at altered levels in murine schistosomiasis to further exploration of their potential as biomarkers of the disease. Moreover, understanding their intricate interaction using bioinformatics approach can potentially bring clarity to unknown mechanisms linked to the establishment of this condition in the vertebrate host. To our knowledge, this study refers to the first shotgun proteomic analysis to provide an inventory of the global changes in the liver soluble proteome caused by Schistosoma mansoni in the Balb/c model. It also innovates by yielding data on quantification of the identified molecules as a manner to clarify and give insights into the underlying mechanisms for establishment of Schistosomiasis, a neglected tropical disease with historical prevalence in Brazil. Copyright © 2016 Elsevier B.V. All rights reserved.
Jiang, Shuai; Qiu, Limei; Wang, Lingling; Jia, Zhihao; Lv, Zhao; Wang, Mengqiang; Liu, Conghui; Xu, Jiachao; Song, Linsheng
2018-01-01
As invertebrates lack an adaptive immune system, they depend to a large extent on their innate immune system to recognize and clear invading pathogens. Although phagocytes play pivotal roles in invertebrate innate immunity, the molecular mechanisms underlying this killing remain unclear. Cells of this type from the Pacific oyster Crassostrea gigas were classified efficiently in this study via fluorescence-activated cell sorting (FACS) based on their phagocytosis of FITC-labeled latex beads. Transcriptomic and quantitative proteomic analyses revealed a series of differentially expressed genes (DEGs) and proteins present in phagocytes; of the 352 significantly high expressed proteins identified here within the phagocyte proteome, 262 corresponding genes were similarly high expressed in the transcriptome, while 140 of 205 significantly low expressed proteins within the proteome were transcriptionally low expressed. A pathway crosstalk network analysis of these significantly high expressed proteins revealed that phagocytes were highly activated in a number of antimicrobial-related biological processes, including oxidation–reduction and lysosomal proteolysis processes. A number of DEGs, including oxidase, lysosomal protease, and immune receptors, were also validated in this study using quantitative PCR, while seven lysosomal cysteine proteases, referred to as cathepsin Ls, were significantly high expressed in phagocytes. Results show that the expression level of cathepsin L protein in phagocytes [mean fluorescence intensity (MFI): 327 ± 51] was significantly higher (p < 0.01) than that in non-phagocytic hemocytes (MFI: 83 ± 26), while the cathepsin L protein was colocalized with the phagocytosed Vibrio splendidus in oyster hemocytes during this process. The results of this study collectively suggest that oyster phagocytes possess both potent oxidative killing and microbial disintegration capacities; these findings provide important insights into hemocyte phagocytic killing as a component of C. gigas innate immunity. PMID:29942306
Zhang, Yaoyang; Xu, Tao; Shan, Bing; Hart, Jonathan; Aslanian, Aaron; Han, Xuemei; Zong, Nobel; Li, Haomin; Choi, Howard; Wang, Dong; Acharya, Lipi; Du, Lisa; Vogt, Peter K; Ping, Peipei; Yates, John R
2015-11-03
Shotgun proteomics generates valuable information from large-scale and target protein characterizations, including protein expression, protein quantification, protein post-translational modifications (PTMs), protein localization, and protein-protein interactions. Typically, peptides derived from proteolytic digestion, rather than intact proteins, are analyzed by mass spectrometers because peptides are more readily separated, ionized and fragmented. The amino acid sequences of peptides can be interpreted by matching the observed tandem mass spectra to theoretical spectra derived from a protein sequence database. Identified peptides serve as surrogates for their proteins and are often used to establish what proteins were present in the original mixture and to quantify protein abundance. Two major issues exist for assigning peptides to their originating protein. The first issue is maintaining a desired false discovery rate (FDR) when comparing or combining multiple large datasets generated by shotgun analysis and the second issue is properly assigning peptides to proteins when homologous proteins are present in the database. Herein we demonstrate a new computational tool, ProteinInferencer, which can be used for protein inference with both small- or large-scale data sets to produce a well-controlled protein FDR. In addition, ProteinInferencer introduces confidence scoring for individual proteins, which makes protein identifications evaluable. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.
Szuba, Agnieszka; Lorenc-Plucińska, Gabriela
2018-01-01
Tannery waste is highly toxic and dangerous to living organisms because of the high heavy metal content, especially chromium [Cr(III)]. This study analysed the proteomic response of the Populus alba L. clone 'Villafranca' grown for 4years on a tannery waste landfill. In this extremely hostile environment, the plants struggled with continuous stress, which inhibited growth by 54%, with a 67% decrease in tree height and diameter at breast height compared to those of the forest reference plot, respectively. The leaves and roots of the tannery landfill-grown plants produced strong proteomic stress signals for protection against reactive oxygen species (ROS) and repair to ROS-damaged proteins and DNA as well as signals for protection of the photosynthetic apparatus. The content of HSP80 was also high. However, primary metabolic pathways were generally unaffected, and signals of increased protein protection, but not turnover, were found, indicating mechanisms of adaptation to long-term stress conditions present at the landfill. A proteomic tool, two-dimensional electrophoresis coupled with tandem mass spectrometry, was successfully applied in this environmental in situ study of distant plots (280km apart). Copyright © 2017 Elsevier B.V. All rights reserved.
Metabolome and proteome profiling of complex I deficiency induced by rotenone.
Gielisch, Ina; Meierhofer, David
2015-01-02
Complex I (CI; NADH dehydrogenase) deficiency causes mitochondrial diseases, including Leigh syndrome. A variety of clinical symptoms of CI deficiency are known, including neurodegeneration. Here, we report an integrative study combining liquid chromatography-mass spectrometry (LC-MS)-based metabolome and proteome profiling in CI deficient HeLa cells. We report a rapid LC-MS-based method for the relative quantification of targeted metabolome profiling with an additional layer of confidence by applying multiple reaction monitoring (MRM) ion ratios for further identity confirmation and robustness. The proteome was analyzed by label-free quantification (LFQ). More than 6000 protein groups were identified. Pathway and network analyses revealed that the respiratory chain was highly deregulated, with metabolites such as FMN, FAD, NAD(+), and ADP, direct players of the OXPHOS system, and metabolites of the TCA cycle decreased up to 100-fold. Synthesis of functional iron-sulfur clusters, which are of central importance for the electron transfer chain, and degradation products like bilirubin were also significantly reduced. Glutathione metabolism on the pathway level, as well as individual metabolite components such as NADPH, glutathione (GSH), and oxidized glutathione (GSSG), was downregulated. Overall, metabolome and proteome profiles in CI deficient cells correlated well, supporting our integrated approach.
Reynolds, Jessica L.; Mahajan, Supriya D.; Aalinkeel, Ravikunar; Nair, Bindukumar; Sykes, Donald E.; Agosto-Mujica, Arnadri; Hsiao, Chiu Bin; Schwartz, Stanley A.
2010-01-01
We used proteomic analyses to assess how drug abuse modulates immunologic responses to infections with the human immunodeficiency virus type 1 (HIV-1). Two dimensional (2D) difference gel electrophoresis was utilized to determine changes in the proteome of peripheral blood mononuclear cells (PBMC) isolated from HIV-1 positive donors that occurred after treatment with cocaine or methamphetamine. Both drugs differentially regulated the expression of several functional classes of proteins. We further isolated specific subpopulations of PBMC to determine which subpopulations were selectively affected by treatment with drugs of abuse. Monocytes, B cells and T cells were positively or negatively selected from PBMC isolated from HIV-1 positive donors. Our results demonstrate that cocaine and methamphetamine modulate gene expression primarily in monocytes and T cells, the primary targets of HIV-1 infection. Proteomic data were validated with quantitative, real-time PCR. These studies elucidate the molecular mechanisms underlying the effects of drugs of abuse on HIV-1 infections. Several functionally relevant classes of proteins were identified as potential mediators of HIV-1 pathogenesis and disease progression associated with drug abuse. PMID:19543960
Influence of impaired lipoprotein biogenesis on surface and exoproteome of Streptococcus pneumoniae.
Pribyl, Thomas; Moche, Martin; Dreisbach, Annette; Bijlsma, Jetta J E; Saleh, Malek; Abdullah, Mohammed R; Hecker, Michael; van Dijl, Jan Maarten; Becher, Dörte; Hammerschmidt, Sven
2014-02-07
Surface proteins are important for the fitness and virulence of the Gram-positive pathogen Streptococcus pneumoniae. They are crucial for interaction of the pathogen with its human host during infection. Therefore, the analysis of the pneumococcal surface proteome is an important task that requires powerful tools. In this study, two different methods, an optimized biotinylation approach and shaving with trypsin beads, were applied to study the pneumococcal surface proteome and to identify surface-exposed protein domains, respectively. The identification of nearly 95% of the predicted lipoproteins and 75% of the predicted sortase substrates reflects the high coverage of the two classical surface protein classes accomplished in this study. Furthermore, the biotinylation approach was applied to study the impact of an impaired lipoprotein maturation pathway on the cell envelope proteome and exoproteome. Loss of the lipoprotein diacylglyceryl transferase Lgt leads to striking changes in the lipoprotein distribution. Many lipoproteins disappear from the surface proteome and accumulate in the exoproteome. Further insights into lipoprotein processing in pneumococci are provided by immunoblot analyses of bacterial lysates and corresponding supernatant fractions. Taken together, the first comprehensive overview of the pneumococcal surface and exoproteome is presented, and a model for lipoprotein processing in S. pneumoniae is proposed.
PIQMIe: a web server for semi-quantitative proteomics data management and analysis
Kuzniar, Arnold; Kanaar, Roland
2014-01-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. PMID:24861615
PIQMIe: a web server for semi-quantitative proteomics data management and analysis.
Kuzniar, Arnold; Kanaar, Roland
2014-07-01
We present the Proteomics Identifications and Quantitations Data Management and Integration Service or PIQMIe that aids in reliable and scalable data management, analysis and visualization of semi-quantitative mass spectrometry based proteomics experiments. PIQMIe readily integrates peptide and (non-redundant) protein identifications and quantitations from multiple experiments with additional biological information on the protein entries, and makes the linked data available in the form of a light-weight relational database, which enables dedicated data analyses (e.g. in R) and user-driven queries. Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. The web server not only provides data access through a web interface but also supports programmatic access through RESTful web service. The web server is available at http://piqmie.semiqprot-emc.cloudlet.sara.nl or http://www.bioinformatics.nl/piqmie. This website is free and open to all users and there is no login requirement. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Pietrowska, M; Marczak, L; Polanska, J; Nowicka, E; Behrent, K; Tarnawski, R; Stobiecki, M; Polanski, A; Widlak, P
2010-01-01
Mass spectrometry-based analysis of the serum proteome allows identifying multi-peptide patterns/signatures specific for blood of cancer patients, thus having high potential value for cancer diagnostics. However, because of problems with optimization and standardization of experimental and computational design, none of identified proteome patterns/signatures was approved for diagnostics in clinical practice as yet. Here we compared two methods of serum sample preparation for mass spectrometry-based proteome pattern analysis aimed to identify biomarkers that could be used in early detection of breast cancer patients. Blood samples were collected in a group of 92 patients diagnosed at early (I and II) stages of the disease before the start of therapy, and in a group of age-matched healthy controls (104 women). Serum specimens were purified and analyzed using MALDI-ToF spectrometry, either directly or after membrane filtration (50 kDa cut-off) to remove albumin and other large serum proteins. Mass spectra of the low-molecular-weight fraction (2-10 kDa) of the serum proteome were resolved using the Gaussian mixture decomposition, and identified spectral components were used to build classifiers that differentiated samples from breast cancer patients and healthy persons. Mass spectra of complete serum and membrane-filtered albumin-depleted samples have apparently different structure and peaks specific for both types of samples could be identified. The optimal classifier built for the complete serum specimens consisted of 8 spectral components, and had 81% specificity and 72% sensitivity, while that built for the membrane-filtered samples consisted of 4 components, and had 80% specificity and 81% sensitivity. We concluded that pre-processing of samples to remove albumin might be recommended before MALDI-ToF mass spectrometric analysis of the low-molecular-weight components of human serum Keywords: albumin removal; breast cancer; clinical proteomics; mass spectrometry; pattern analysis; serum proteome.
High throughput profile-profile based fold recognition for the entire human proteome.
McGuffin, Liam J; Smith, Richard T; Bryson, Kevin; Sørensen, Søren-Aksel; Jones, David T
2006-06-07
In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power. In this paper we describe and benchmark the JYDE (Job Yield Distribution Environment) system, which is a meta-scheduler designed to work above cluster schedulers, such as Sun Grid Engine (SGE) or Condor. We demonstrate the ability of JYDE to distribute the load of genomic-scale fold recognition across multiple independent Grid domains. We use the most recent profile-profile version of our mGenTHREADER software in order to annotate the latest version of the Human proteome against the latest sequence and structure databases in as short a time as possible. We show that our JYDE system is able to scale to large numbers of intensive fold recognition jobs running across several independent computer clusters. Using our JYDE system we have been able to annotate 99.9% of the protein sequences within the Human proteome in less than 24 hours, by harnessing over 500 CPUs from 3 independent Grid domains. This study clearly demonstrates the feasibility of carrying out on demand high quality structural annotations for the proteomes of major eukaryotic organisms. Specifically, we have shown that it is now possible to provide complete regular updates of profile-profile based fold recognition models for entire eukaryotic proteomes, through the use of Grid middleware such as JYDE.
Structural and functional features of lysine acetylation of plant and animal tubulins.
Rayevsky, Alexey V; Sharifi, Mohsen; Samofalova, Dariya A; Karpov, Pavel A; Blume, Yaroslav B
2017-10-10
The study of the genome and the proteome of different species and representatives of distinct kingdoms, especially detection of proteome via wide-scaled analyses has various challenges and pitfalls. Attempts to combine all available information together and isolate some common features for determination of the pathway and their mechanism of action generally have a highly complicated nature. However, microtubule (MT) monomers are highly conserved protein structures, and microtubules are structurally conserved from Homo sapiens to Arabidopsis thaliana. The interaction of MT elements with microtubule-associated proteins and post-translational modifiers is fully dependent on protein interfaces, and almost all MT modifications are well described except acetylation. Crystallography and interactome data using different approaches were combined to identify conserved proteins important in acetylation of microtubules. Application of computational methods and comparative analysis of binding modes generated a robust predictive model of acetylation of the ϵ-amino group of Lys40 in α-tubulins. In turn, the model discarded some probable mechanisms of interaction between elements of interest. Reconstruction of unresolved protein structures was carried out with modeling by homology to the existing crystal structure (PDBID: 1Z2B) from B. taurus using Swiss-model server, followed by a molecular dynamics simulation. Docking of the human tubulin fragment with Lys40 into the active site of α-tubulin acetyltransferase, reproduces the binding mode of peptidomimetic from X-ray structure (PDBID: 4PK3). © 2017 International Federation for Cell Biology.
Decoding 2D-PAGE complex maps: relevance to proteomics.
Pietrogrande, Maria Chiara; Marchetti, Nicola; Dondi, Francesco; Righetti, Pier Giorgio
2006-03-20
This review describes two mathematical approaches useful for decoding the complex signal of 2D-PAGE maps of protein mixtures. These methods are helpful for interpreting the large amount of data of each 2D-PAGE map by extracting all the analytical information hidden therein by spot overlapping. Here the basic theory and application to 2D-PAGE maps are reviewed: the means for extracting information from the experimental data and their relevance to proteomics are discussed. One method is based on the quantitative theory of statistical model of peak overlapping (SMO) using the spot experimental data (intensity and spatial coordinates). The second method is based on the study of the 2D-autocovariance function (2D-ACVF) computed on the experimental digitised map. They are two independent methods that are able to extract equal and complementary information from the 2D-PAGE map. Both methods permit to obtain fundamental information on the sample complexity and the separation performance and to single out ordered patterns present in spot positions: the availability of two independent procedures to compute the same separation parameters is a powerful tool to estimate the reliability of the obtained results. The SMO procedure is an unique tool to quantitatively estimate the degree of spot overlapping present in the map, while the 2D-ACVF method is particularly powerful in simply singling out the presence of order in the spot position from the complexity of the whole 2D map, i.e., spot trains. The procedures were validated by extensive numerical computation on computer-generated maps describing experimental 2D-PAGE gels of protein mixtures. Their applicability to real samples was tested on reference maps obtained from literature sources. The review describes the most relevant information for proteomics: sample complexity, separation performance, overlapping extent, identification of spot trains related to post-translational modifications (PTMs).
Next generation of network medicine: interdisciplinary signaling approaches.
Korcsmaros, Tamas; Schneider, Maria Victoria; Superti-Furga, Giulio
2017-02-20
In the last decade, network approaches have transformed our understanding of biological systems. Network analyses and visualizations have allowed us to identify essential molecules and modules in biological systems, and improved our understanding of how changes in cellular processes can lead to complex diseases, such as cancer, infectious and neurodegenerative diseases. "Network medicine" involves unbiased large-scale network-based analyses of diverse data describing interactions between genes, diseases, phenotypes, drug targets, drug transport, drug side-effects, disease trajectories and more. In terms of drug discovery, network medicine exploits our understanding of the network connectivity and signaling system dynamics to help identify optimal, often novel, drug targets. Contrary to initial expectations, however, network approaches have not yet delivered a revolution in molecular medicine. In this review, we propose that a key reason for the limited impact, so far, of network medicine is a lack of quantitative multi-disciplinary studies involving scientists from different backgrounds. To support this argument, we present existing approaches from structural biology, 'omics' technologies (e.g., genomics, proteomics, lipidomics) and computational modeling that point towards how multi-disciplinary efforts allow for important new insights. We also highlight some breakthrough studies as examples of the potential of these approaches, and suggest ways to make greater use of the power of interdisciplinarity. This review reflects discussions held at an interdisciplinary signaling workshop which facilitated knowledge exchange from experts from several different fields, including in silico modelers, computational biologists, biochemists, geneticists, molecular and cell biologists as well as cancer biologists and pharmacologists.
2013-01-01
Background The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing - the matching of peptide measurements across samples. Results We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Conclusions Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods. PMID:24341404
Benjamin, Ashlee M; Thompson, J Will; Soderblom, Erik J; Geromanos, Scott J; Henao, Ricardo; Kraus, Virginia B; Moseley, M Arthur; Lucas, Joseph E
2013-12-16
The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing--the matching of peptide measurements across samples. We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods.
2017-01-01
The changes of protein expression that are monitored in proteomic experiments are a type of biological transformation that also involves changes in chemical composition. Accompanying the myriad molecular-level interactions that underlie any proteomic transformation, there is an overall thermodynamic potential that is sensitive to microenvironmental conditions, including local oxidation and hydration potential. Here, up- and down-expressed proteins identified in 71 comparative proteomics studies were analyzed using the average oxidation state of carbon (ZC) and water demand per residue (\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{upgreek} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} }{}${\\overline{n}}_{{\\mathrm{H}}_{2}\\mathrm{O}}$\\end{document}n¯H2O), calculated using elemental abundances and stoichiometric reactions to form proteins from basis species. Experimental lowering of oxygen availability (hypoxia) or water activity (hyperosmotic stress) generally results in decreased ZC or \\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{upgreek} \\usepackage{mathrsfs} \\setlength{\\oddsidemargin}{-69pt} \\begin{document} }{}${\\overline{n}}_{{\\mathrm{H}}_{2}\\mathrm{O}}$\\end{document}n¯H2O of up-expressed compared to down-expressed proteins. This correspondence of chemical composition with experimental conditions provides evidence for attraction of the proteomes to a low-energy state. An opposite compositional change, toward higher average oxidation or hydration state, is found for proteomic transformations in colorectal and pancreatic cancer, and in two experiments for adipose-derived stem cells. Calculations of chemical affinity were used to estimate the thermodynamic potentials for proteomic transformations as a function of fugacity of O2 and activity of H2O, which serve as scales of oxidation and hydration potential. Diagrams summarizing the relative potential for formation of up- and down-expressed proteins have predicted equipotential lines that cluster around particular values of oxygen fugacity and water activity for similar datasets. The changes in chemical composition of proteomes are likely linked with reactions among other cellular molecules. A redox balance calculation indicates that an increase in the lipid to protein ratio in cancer cells by 20% over hypoxic cells would generate a large enough electron sink for oxidation of the cancer proteomes. The datasets and computer code used here are made available in a new R package, canprot. PMID:28603672
PANDORA: keyword-based analysis of protein sets by integration of annotation sources.
Kaplan, Noam; Vaaknin, Avishay; Linial, Michal
2003-10-01
Recent advances in high-throughput methods and the application of computational tools for automatic classification of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manually by experts. We have developed PANDORA (Protein ANnotation Diagram ORiented Analysis), a web-based tool that provides an automatic representation of the biological knowledge associated with any set of proteins. PANDORA uses a unique approach of keyword-based graphical analysis that focuses on detecting subsets of proteins that share unique biological properties and the intersections of such sets. PANDORA currently supports SwissProt keywords, NCBI Taxonomy, InterPro entries and the hierarchical classification terms from ENZYME, SCOP and GO databases. The integrated study of several annotation sources simultaneously allows a representation of biological relations of structure, function, cellular location, taxonomy, domains and motifs. PANDORA is also integrated into the ProtoNet system, thus allowing testing thousands of automatically generated clusters. We illustrate how PANDORA enhances the biological understanding of large, non-uniform sets of proteins originating from experimental and computational sources, without the need for prior biological knowledge on individual proteins.
Verdes, Aida; Anand, Prachi; Gorson, Juliette; Jannetti, Stephen; Kelly, Patrick; Leffler, Abba; Simpson, Danny; Ramrattan, Girish; Holford, Mandë
2016-04-19
Animal venoms comprise a diversity of peptide toxins that manipulate molecular targets such as ion channels and receptors, making venom peptides attractive candidates for the development of therapeutics to benefit human health. However, identifying bioactive venom peptides remains a significant challenge. In this review we describe our particular venomics strategy for the discovery, characterization, and optimization of Terebridae venom peptides, teretoxins. Our strategy reflects the scientific path from mollusks to medicine in an integrative sequential approach with the following steps: (1) delimitation of venomous Terebridae lineages through taxonomic and phylogenetic analyses; (2) identification and classification of putative teretoxins through omics methodologies, including genomics, transcriptomics, and proteomics; (3) chemical and recombinant synthesis of promising peptide toxins; (4) structural characterization through experimental and computational methods; (5) determination of teretoxin bioactivity and molecular function through biological assays and computational modeling; (6) optimization of peptide toxin affinity and selectivity to molecular target; and (7) development of strategies for effective delivery of venom peptide therapeutics. While our research focuses on terebrids, the venomics approach outlined here can be applied to the discovery and characterization of peptide toxins from any venomous taxa.
Comparative proteome analysis reveals pathogen specific outer membrane proteins of Leptospira.
Dhandapani, Gunasekaran; Sikha, Thoduvayil; Rana, Aarti; Brahma, Rahul; Akhter, Yusuf; Gopalakrishnan Madanan, Madathiparambil
2018-04-10
Proteomes of pathogenic Leptospira interrogans and L. borgpetersenii and the saprophytic L. biflexa were filtered through computational tools to identify Outer Membrane Proteins (OMPs) that satisfy the required biophysical parameters for their presence on the outer membrane. A total of 133, 130, and 144 OMPs were identified in L. interrogans, L. borgpetersenii, and L. biflexa, respectively, which forms approximately 4% of proteomes. A holistic analysis of transporting and pathogenic characteristics of OMPs together with Clusters of Orthologous Groups (COGs) among the OMPs and their distribution across 3 species was made and put forward a set of 21 candidate OMPs specific to pathogenic leptospires. It is also found that proteins homologous to the candidate OMPs were also present in other pathogenic species of leptospires. Six OMPs from L. interrogans and 2 from L. borgpetersenii observed to have similar COGs while those were not found in any intermediate or saprophytic forms. These OMPs appears to have role in infection and pathogenesis and useful for anti-leptospiral strategies. © 2018 Wiley Periodicals, Inc.
Building high-quality assay libraries for targeted analysis of SWATH MS data.
Schubert, Olga T; Gillet, Ludovic C; Collins, Ben C; Navarro, Pedro; Rosenberger, George; Wolski, Witold E; Lam, Henry; Amodei, Dario; Mallick, Parag; MacLean, Brendan; Aebersold, Ruedi
2015-03-01
Targeted proteomics by selected/multiple reaction monitoring (S/MRM) or, on a larger scale, by SWATH (sequential window acquisition of all theoretical spectra) MS (mass spectrometry) typically relies on spectral reference libraries for peptide identification. Quality and coverage of these libraries are therefore of crucial importance for the performance of the methods. Here we present a detailed protocol that has been successfully used to build high-quality, extensive reference libraries supporting targeted proteomics by SWATH MS. We describe each step of the process, including data acquisition by discovery proteomics, assertion of peptide-spectrum matches (PSMs), generation of consensus spectra and compilation of MS coordinates that uniquely define each targeted peptide. Crucial steps such as false discovery rate (FDR) control, retention time normalization and handling of post-translationally modified peptides are detailed. Finally, we show how to use the library to extract SWATH data with the open-source software Skyline. The protocol takes 2-3 d to complete, depending on the extent of the library and the computational resources available.
Fischer, Martina; Jehmlich, Nico; Rose, Laura; Koch, Sophia; Laue, Michael; Renard, Bernhard Y.; Schmidt, Frank; Heuer, Dagmar
2015-01-01
Chlamydia trachomatis is an important human pathogen that replicates inside the infected host cell in a unique vacuole, the inclusion. The formation of this intracellular bacterial niche is essential for productive Chlamydia infections. Despite its importance for Chlamydia biology, a holistic view on the protein composition of the inclusion, including its membrane, is currently missing. Here we describe the host cell-derived proteome of isolated C. trachomatis inclusions by quantitative proteomics. Computational analysis indicated that the inclusion is a complex intracellular trafficking platform that interacts with host cells’ antero- and retrograde trafficking pathways. Furthermore, the inclusion is highly enriched for sorting nexins of the SNX-BAR retromer, a complex essential for retrograde trafficking. Functional studies showed that in particular, SNX5 controls the C. trachomatis infection and that retrograde trafficking is essential for infectious progeny formation. In summary, these findings suggest that C. trachomatis hijacks retrograde pathways for effective infection. PMID:26042774
A novel strategy for global analysis of the dynamic thiol redox proteome.
Martínez-Acedo, Pablo; Núñez, Estefanía; Gómez, Francisco J Sánchez; Moreno, Margoth; Ramos, Elena; Izquierdo-Álvarez, Alicia; Miró-Casas, Elisabet; Mesa, Raquel; Rodriguez, Patricia; Martínez-Ruiz, Antonio; Dorado, David Garcia; Lamas, Santiago; Vázquez, Jesús
2012-09-01
Nitroxidative stress in cells occurs mainly through the action of reactive nitrogen and oxygen species (RNOS) on protein thiol groups. Reactive nitrogen and oxygen species-mediated protein modifications are associated with pathophysiological states, but can also convey physiological signals. Identification of Cys residues that are modified by oxidative stimuli still poses technical challenges and these changes have never been statistically analyzed from a proteome-wide perspective. Here we show that GELSILOX, a method that combines a robust proteomics protocol with a new computational approach that analyzes variance at the peptide level, allows a simultaneous analysis of dynamic alterations in the redox state of Cys sites and of protein abundance. GELSILOX permits the characterization of the major endothelial redox targets of hydrogen peroxide in endothelial cells and reveals that hypoxia induces a significant increase in the status of oxidized thiols. GELSILOX also detected thiols that are redox-modified by ischemia-reperfusion in heart mitochondria and demonstrated that these alterations are abolished in ischemia-preconditioned animals.
Linkage Of Exposure And Effects Using Genomics, Proteomics, And Metabolomics In Small Fish Models
Poster for the BOSC Computational Toxicology Research Program review. Knowledge of possible toxic mechanisms/modes of action (MOA) of chemicals can provide valuable insights as to appropriate methods for assessing exposure and effects, thereby reducing uncertainties related to e...
EMERGING MOLECULAR COMPUTATIONAL APPROACHES FOR CROSS-SPECIES EXTRAPOLATIONS: A WORKSHOP SUMMARY
Advances in molecular technology have led to the elucidation of full genomic sequences of several multicellular organisms, ranging from nematodes to man. The related molecular field of proteomics and metabolomics are now beginning to advance rapidly as well. In addition, advances...
The Perseus computational platform for comprehensive analysis of (prote)omics data.
Tyanova, Stefka; Temu, Tikira; Sinitcyn, Pavel; Carlson, Arthur; Hein, Marco Y; Geiger, Tamar; Mann, Matthias; Cox, Jürgen
2016-09-01
A main bottleneck in proteomics is the downstream biological analysis of highly multivariate quantitative protein abundance data generated using mass-spectrometry-based analysis. We developed the Perseus software platform (http://www.perseus-framework.org) to support biological and biomedical researchers in interpreting protein quantification, interaction and post-translational modification data. Perseus contains a comprehensive portfolio of statistical tools for high-dimensional omics data analysis covering normalization, pattern recognition, time-series analysis, cross-omics comparisons and multiple-hypothesis testing. A machine learning module supports the classification and validation of patient groups for diagnosis and prognosis, and it also detects predictive protein signatures. Central to Perseus is a user-friendly, interactive workflow environment that provides complete documentation of computational methods used in a publication. All activities in Perseus are realized as plugins, and users can extend the software by programming their own, which can be shared through a plugin store. We anticipate that Perseus's arsenal of algorithms and its intuitive usability will empower interdisciplinary analysis of complex large data sets.
On the importance of mathematical methods for analysis of MALDI-imaging mass spectrometry data.
Trede, Dennis; Kobarg, Jan Hendrik; Oetjen, Janina; Thiele, Herbert; Maass, Peter; Alexandrov, Theodore
2012-03-21
In the last decade, matrix-assisted laser desorption/ionization (MALDI) imaging mass spectrometry (IMS), also called as MALDI-imaging, has proven its potential in proteomics and was successfully applied to various types of biomedical problems, in particular to histopathological label-free analysis of tissue sections. In histopathology, MALDI-imaging is used as a general analytic tool revealing the functional proteomic structure of tissue sections, and as a discovery tool for detecting new biomarkers discriminating a region annotated by an experienced histologist, in particular, for cancer studies. A typical MALDI-imaging data set contains 10⁸ to 10⁹ intensity values occupying more than 1 GB. Analysis and interpretation of such huge amount of data is a mathematically, statistically and computationally challenging problem. In this paper we overview some computational methods for analysis of MALDI-imaging data sets. We discuss the importance of data preprocessing, which typically includes normalization, baseline removal and peak picking, and hightlight the importance of image denoising when visualizing IMS data.
On the Importance of Mathematical Methods for Analysis of MALDI-Imaging Mass Spectrometry Data.
Trede, Dennis; Kobarg, Jan Hendrik; Oetjen, Janina; Thiele, Herbert; Maass, Peter; Alexandrov, Theodore
2012-03-01
In the last decade, matrix-assisted laser desorption/ionization (MALDI) imaging mass spectrometry (IMS), also called as MALDI-imaging, has proven its potential in proteomics and was successfully applied to various types of biomedical problems, in particular to histopathological label-free analysis of tissue sections. In histopathology, MALDI-imaging is used as a general analytic tool revealing the functional proteomic structure of tissue sections, and as a discovery tool for detecting new biomarkers discriminating a region annotated by an experienced histologist, in particular, for cancer studies. A typical MALDI-imaging data set contains 108 to 109 intensity values occupying more than 1 GB. Analysis and interpretation of such huge amount of data is a mathematically, statistically and computationally challenging problem. In this paper we overview some computational methods for analysis of MALDI-imaging data sets. We discuss the importance of data preprocessing, which typically includes normalization, baseline removal and peak picking, and hightlight the importance of image denoising when visualizing IMS data.