Sample records for proteomics high-throughput analysis

  1. Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.

    PubMed

    Dowsey, Andrew W; Dunn, Michael J; Yang, Guang-Zhong

    2008-04-01

    The quest for high-throughput proteomics has revealed a number of challenges in recent years. Whilst substantial improvements in automated protein separation with liquid chromatography and mass spectrometry (LC/MS), aka 'shotgun' proteomics, have been achieved, large-scale open initiatives such as the Human Proteome Organization (HUPO) Brain Proteome Project have shown that maximal proteome coverage is only possible when LC/MS is complemented by 2D gel electrophoresis (2-DE) studies. Moreover, both separation methods require automated alignment and differential analysis to relieve the bioinformatics bottleneck and so make high-throughput protein biomarker discovery a reality. The purpose of this article is to describe a fully automatic image alignment framework for the integration of 2-DE into a high-throughput differential expression proteomics pipeline. The proposed method is based on robust automated image normalization (RAIN) to circumvent the drawbacks of traditional approaches. These use symbolic representation at the very early stages of the analysis, which introduces persistent errors due to inaccuracies in modelling and alignment. In RAIN, a third-order volume-invariant B-spline model is incorporated into a multi-resolution schema to correct for geometric and expression inhomogeneity at multiple scales. The normalized images can then be compared directly in the image domain for quantitative differential analysis. Through evaluation against an existing state-of-the-art method on real and synthetically warped 2D gels, the proposed analysis framework demonstrates substantial improvements in matching accuracy and differential sensitivity. High-throughput analysis is established through an accelerated GPGPU (general purpose computation on graphics cards) implementation. Supplementary material, software and images used in the validation are available at http://www.proteomegrid.org/rain/.

  2. TRIC: an automated alignment strategy for reproducible protein quantification in targeted proteomics.

    PubMed

    Röst, Hannes L; Liu, Yansheng; D'Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi

    2016-09-01

    Next-generation mass spectrometric (MS) techniques such as SWATH-MS have substantially increased the throughput and reproducibility of proteomic analysis, but ensuring consistent quantification of thousands of peptide analytes across multiple liquid chromatography-tandem MS (LC-MS/MS) runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we developed TRIC (http://proteomics.ethz.ch/tric/), a software tool that utilizes fragment-ion data to perform cross-run alignment, consistent peak-picking and quantification for high-throughput targeted proteomics. TRIC reduced the identification error compared to a state-of-the-art SWATH-MS analysis without alignment by more than threefold at constant recall while correcting for highly nonlinear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups. Thus, TRIC fills a gap in the pipeline for automated analysis of massively parallel targeted proteomics data sets.

  3. A simple dual online ultra-high pressure liquid chromatography system (sDO-UHPLC) for high throughput proteome analysis.

    PubMed

    Lee, Hangyeore; Mun, Dong-Gi; Bae, Jingi; Kim, Hokeun; Oh, Se Yeon; Park, Young Soo; Lee, Jae-Hyuk; Lee, Sang-Won

    2015-08-21

    We report a new and simple design of a fully automated dual-online ultra-high pressure liquid chromatography system. The system employs only two nano-volume switching valves (a two-position four port valve and a two-position ten port valve) that direct solvent flows from two binary nano-pumps for parallel operation of two analytical columns and two solid phase extraction (SPE) columns. Despite the simple design, the sDO-UHPLC offers many advantageous features that include high duty cycle, back flushing sample injection for fast and narrow zone sample injection, online desalting, high separation resolution and high intra/inter-column reproducibility. This system was applied to analyze proteome samples not only in high throughput deep proteome profiling experiments but also in high throughput MRM experiments.

  4. High-throughput and targeted in-depth mass spectrometry-based approaches for biofluid profiling and biomarker discovery.

    PubMed

    Jimenez, Connie R; Piersma, Sander; Pham, Thang V

    2007-12-01

    Proteomics aims to create a link between genomic information, biological function and disease through global studies of protein expression, modification and protein-protein interactions. Recent advances in key proteomics tools, such as mass spectrometry (MS) and (bio)informatics, provide tremendous opportunities for biomarker-related clinical applications. In this review, we focus on two complementary MS-based approaches with high potential for the discovery of biomarker patterns and low-abundant candidate biomarkers in biofluids: high-throughput matrix-assisted laser desorption/ionization time-of-flight mass spectroscopy-based methods for peptidome profiling and label-free liquid chromatography-based methods coupled to MS for in-depth profiling of biofluids with a focus on subproteomes, including the low-molecular-weight proteome, carrier-bound proteome and N-linked glycoproteome. The two approaches differ in their aims, throughput and sensitivity. We discuss recent progress and challenges in the analysis of plasma/serum and proximal fluids using these strategies and highlight the potential of liquid chromatography-MS-based proteomics of cancer cell and tumor secretomes for the discovery of candidate blood-based biomarkers. Strategies for candidate validation are also described.

  5. Computer applications making rapid advances in high throughput microbial proteomics (HTMP).

    PubMed

    Anandkumar, Balakrishna; Haga, Steve W; Wu, Hui-Fen

    2014-02-01

    The last few decades have seen the rise of widely-available proteomics tools. From new data acquisition devices, such as MALDI-MS and 2DE to new database searching softwares, these new products have paved the way for high throughput microbial proteomics (HTMP). These tools are enabling researchers to gain new insights into microbial metabolism, and are opening up new areas of study, such as protein-protein interactions (interactomics) discovery. Computer software is a key part of these emerging fields. This current review considers: 1) software tools for identifying the proteome, such as MASCOT or PDQuest, 2) online databases of proteomes, such as SWISS-PROT, Proteome Web, or the Proteomics Facility of the Pathogen Functional Genomics Resource Center, and 3) software tools for applying proteomic data, such as PSI-BLAST or VESPA. These tools allow for research in network biology, protein identification, functional annotation, target identification/validation, protein expression, protein structural analysis, metabolic pathway engineering and drug discovery.

  6. Nano-LC FTICR tandem mass spectrometry for top-down proteomics: routine baseline unit mass resolution of whole cell lysate proteins up to 72 kDa.

    PubMed

    Tipton, Jeremiah D; Tran, John C; Catherman, Adam D; Ahlf, Dorothy R; Durbin, Kenneth R; Lee, Ji Eun; Kellie, John F; Kelleher, Neil L; Hendrickson, Christopher L; Marshall, Alan G

    2012-03-06

    Current high-throughput top-down proteomic platforms provide routine identification of proteins less than 25 kDa with 4-D separations. This short communication reports the application of technological developments over the past few years that improve protein identification and characterization for masses greater than 25 kDa. Advances in separation science have allowed increased numbers of proteins to be identified, especially by nanoliquid chromatography (nLC) prior to mass spectrometry (MS) analysis. Further, a goal of high-throughput top-down proteomics is to extend the mass range for routine nLC MS analysis up to 80 kDa because gene sequence analysis predicts that ~70% of the human proteome is transcribed to be less than 80 kDa. Normally, large proteins greater than 50 kDa are identified and characterized by top-down proteomics through fraction collection and direct infusion at relatively low throughput. Further, other MS-based techniques provide top-down protein characterization, however at low resolution for intact mass measurement. Here, we present analysis of standard (up to 78 kDa) and whole cell lysate proteins by Fourier transform ion cyclotron resonance mass spectrometry (nLC electrospray ionization (ESI) FTICR MS). The separation platform reduced the complexity of the protein matrix so that, at 14.5 T, proteins from whole cell lysate up to 72 kDa are baseline mass resolved on a nano-LC chromatographic time scale. Further, the results document routine identification of proteins at improved throughput based on accurate mass measurement (less than 10 ppm mass error) of precursor and fragment ions for proteins up to 50 kDa.

  7. Advances in Proteomics Data Analysis and Display Using an Accurate Mass and Time Tag Approach

    PubMed Central

    Zimmer, Jennifer S.D.; Monroe, Matthew E.; Qian, Wei-Jun; Smith, Richard D.

    2007-01-01

    Proteomics has recently demonstrated utility in understanding cellular processes on the molecular level as a component of systems biology approaches and for identifying potential biomarkers of various disease states. The large amount of data generated by utilizing high efficiency (e.g., chromatographic) separations coupled to high mass accuracy mass spectrometry for high-throughput proteomics analyses presents challenges related to data processing, analysis, and display. This review focuses on recent advances in nanoLC-FTICR-MS-based proteomics approaches and the accompanying data processing tools that have been developed to display and interpret the large volumes of data being produced. PMID:16429408

  8. HTAPP: High-Throughput Autonomous Proteomic Pipeline

    PubMed Central

    Yu, Kebing; Salomon, Arthur R.

    2011-01-01

    Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic datasets is critically important. The high-throughput autonomous proteomic pipeline (HTAPP) described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is comprised of software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of HTAPP focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples. PMID:20336676

  9. Quantitative, multiplexed workflow for deep analysis of human blood plasma and biomarker discovery by mass spectrometry.

    PubMed

    Keshishian, Hasmik; Burgess, Michael W; Specht, Harrison; Wallace, Luke; Clauser, Karl R; Gillette, Michael A; Carr, Steven A

    2017-08-01

    Proteomic characterization of blood plasma is of central importance to clinical proteomics and particularly to biomarker discovery studies. The vast dynamic range and high complexity of the plasma proteome have, however, proven to be serious challenges and have often led to unacceptable tradeoffs between depth of coverage and sample throughput. We present an optimized sample-processing pipeline for analysis of the human plasma proteome that provides greatly increased depth of detection, improved quantitative precision and much higher sample analysis throughput as compared with prior methods. The process includes abundant protein depletion, isobaric labeling at the peptide level for multiplexed relative quantification and ultra-high-performance liquid chromatography coupled to accurate-mass, high-resolution tandem mass spectrometry analysis of peptides fractionated off-line by basic pH reversed-phase (bRP) chromatography. The overall reproducibility of the process, including immunoaffinity depletion, is high, with a process replicate coefficient of variation (CV) of <12%. Using isobaric tags for relative and absolute quantitation (iTRAQ) 4-plex, >4,500 proteins are detected and quantified per patient sample on average, with two or more peptides per protein and starting from as little as 200 μl of plasma. The approach can be multiplexed up to 10-plex using tandem mass tags (TMT) reagents, further increasing throughput, albeit with some decrease in the number of proteins quantified. In addition, we provide a rapid protocol for analysis of nonfractionated depleted plasma samples analyzed in 10-plex. This provides ∼600 quantified proteins for each of the ten samples in ∼5 h of instrument time.

  10. CrossCheck: an open-source web tool for high-throughput screen data analysis.

    PubMed

    Najafov, Jamil; Najafov, Ayaz

    2017-07-19

    Modern high-throughput screening methods allow researchers to generate large datasets that potentially contain important biological information. However, oftentimes, picking relevant hits from such screens and generating testable hypotheses requires training in bioinformatics and the skills to efficiently perform database mining. There are currently no tools available to general public that allow users to cross-reference their screen datasets with published screen datasets. To this end, we developed CrossCheck, an online platform for high-throughput screen data analysis. CrossCheck is a centralized database that allows effortless comparison of the user-entered list of gene symbols with 16,231 published datasets. These datasets include published data from genome-wide RNAi and CRISPR screens, interactome proteomics and phosphoproteomics screens, cancer mutation databases, low-throughput studies of major cell signaling mediators, such as kinases, E3 ubiquitin ligases and phosphatases, and gene ontological information. Moreover, CrossCheck includes a novel database of predicted protein kinase substrates, which was developed using proteome-wide consensus motif searches. CrossCheck dramatically simplifies high-throughput screen data analysis and enables researchers to dig deep into the published literature and streamline data-driven hypothesis generation. CrossCheck is freely accessible as a web-based application at http://proteinguru.com/crosscheck.

  11. Spatially-Resolved Proteomics: Rapid Quantitative Analysis of Laser Capture Microdissected Alveolar Tissue Samples

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Clair, Geremy; Piehowski, Paul D.; Nicola, Teodora

    Global proteomics approaches allow characterization of whole tissue lysates to an impressive depth. However, it is now increasingly recognized that to better understand the complexity of multicellular organisms, global protein profiling of specific spatially defined regions/substructures of tissues (i.e. spatially-resolved proteomics) is essential. Laser capture microdissection (LCM) enables microscopic isolation of defined regions of tissues preserving crucial spatial information. However, current proteomics workflows entail several manual sample preparation steps and are challenged by the microscopic mass-limited samples generated by LCM, and that impact measurement robustness, quantification, and throughput. Here, we coupled LCM with a fully automated sample preparation workflow thatmore » with a single manual step allows: protein extraction, tryptic digestion, peptide cleanup and LC-MS/MS analysis of proteomes from microdissected tissues. Benchmarking against the current state of the art in ultrasensitive global proteomic analysis, our approach demonstrated significant improvements in quantification and throughput. Using our LCM-SNaPP proteomics approach, we characterized to a depth of more than 3,400 proteins, the ontogeny of protein changes during normal lung development in laser capture microdissected alveolar tissue containing ~4,000 cells per sample. Importantly, the data revealed quantitative changes for 350 low abundance transcription factors and signaling molecules, confirming earlier transcript-level observations and defining seven modules of coordinated transcription factor/signaling molecule expression patterns, suggesting that a complex network of temporal regulatory control directs normal lung development with epigenetic regulation fine-tuning pre-natal developmental processes. Our LCM-proteomics approach facilitates efficient, spatially-resolved, ultrasensitive global proteomics analyses in high-throughput that will be enabling for several clinical and biological applications.« less

  12. Efficient visualization of high-throughput targeted proteomics experiments: TAPIR.

    PubMed

    Röst, Hannes L; Rosenberger, George; Aebersold, Ruedi; Malmström, Lars

    2015-07-15

    Targeted mass spectrometry comprises a set of powerful methods to obtain accurate and consistent protein quantification in complex samples. To fully exploit these techniques, a cross-platform and open-source software stack based on standardized data exchange formats is required. We present TAPIR, a fast and efficient Python visualization software for chromatograms and peaks identified in targeted proteomics experiments. The input formats are open, community-driven standardized data formats (mzML for raw data storage and TraML encoding the hierarchical relationships between transitions, peptides and proteins). TAPIR is scalable to proteome-wide targeted proteomics studies (as enabled by SWATH-MS), allowing researchers to visualize high-throughput datasets. The framework integrates well with existing automated analysis pipelines and can be extended beyond targeted proteomics to other types of analyses. TAPIR is available for all computing platforms under the 3-clause BSD license at https://github.com/msproteomicstools/msproteomicstools. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  13. Development of Droplet Microfluidics Enabling High-Throughput Single-Cell Analysis.

    PubMed

    Wen, Na; Zhao, Zhan; Fan, Beiyuan; Chen, Deyong; Men, Dong; Wang, Junbo; Chen, Jian

    2016-07-05

    This article reviews recent developments in droplet microfluidics enabling high-throughput single-cell analysis. Five key aspects in this field are included in this review: (1) prototype demonstration of single-cell encapsulation in microfluidic droplets; (2) technical improvements of single-cell encapsulation in microfluidic droplets; (3) microfluidic droplets enabling single-cell proteomic analysis; (4) microfluidic droplets enabling single-cell genomic analysis; and (5) integrated microfluidic droplet systems enabling single-cell screening. We examine the advantages and limitations of each technique and discuss future research opportunities by focusing on key performances of throughput, multifunctionality, and absolute quantification.

  14. Recent advances on multidimensional liquid chromatography-mass spectrometry for proteomics: from qualitative to quantitative analysis--a review.

    PubMed

    Wu, Qi; Yuan, Huiming; Zhang, Lihua; Zhang, Yukui

    2012-06-20

    With the acceleration of proteome research, increasing attention has been paid to multidimensional liquid chromatography-mass spectrometry (MDLC-MS) due to its high peak capacity and separation efficiency. Recently, many efforts have been put to improve MDLC-based strategies including "top-down" and "bottom-up" to enable highly sensitive qualitative and quantitative analysis of proteins, as well as accelerate the whole analytical procedure. Integrated platforms with combination of sample pretreatment, multidimensional separations and identification were also developed to achieve high throughput and sensitive detection of proteomes, facilitating highly accurate and reproducible quantification. This review summarized the recent advances of such techniques and their applications in qualitative and quantitative analysis of proteomes. Copyright © 2012 Elsevier B.V. All rights reserved.

  15. FunRich proteomics software analysis, let the fun begin!

    PubMed

    Benito-Martin, Alberto; Peinado, Héctor

    2015-08-01

    Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. The UniProtKB guide to the human proteome

    PubMed Central

    Breuza, Lionel; Poux, Sylvain; Estreicher, Anne; Famiglietti, Maria Livia; Magrane, Michele; Tognolli, Michael; Bridge, Alan; Baratin, Delphine; Redaschi, Nicole

    2016-01-01

    Advances in high-throughput and advanced technologies allow researchers to routinely perform whole genome and proteome analysis. For this purpose, they need high-quality resources providing comprehensive gene and protein sets for their organisms of interest. Using the example of the human proteome, we will describe the content of a complete proteome in the UniProt Knowledgebase (UniProtKB). We will show how manual expert curation of UniProtKB/Swiss-Prot is complemented by expert-driven automatic annotation to build a comprehensive, high-quality and traceable resource. We will also illustrate how the complexity of the human proteome is captured and structured in UniProtKB. Database URL: www.uniprot.org PMID:26896845

  17. High-Throughput Cloning and Expression Library Creation for Functional Proteomics

    PubMed Central

    Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua

    2013-01-01

    The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particular important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047

  18. High-throughput cloning and expression library creation for functional proteomics.

    PubMed

    Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua

    2013-05-01

    The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single-gene experiments, creating the need for fast, flexible, and reliable cloning systems. These collections of ORF clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial, we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator(TM) DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This tutorial is part of the International Proteomics Tutorial Programme (IPTP12). © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Detection of dysregulated protein-association networks by high-throughput proteomics predicts cancer vulnerabilities.

    PubMed

    Lapek, John D; Greninger, Patricia; Morris, Robert; Amzallag, Arnaud; Pruteanu-Malinici, Iulian; Benes, Cyril H; Haas, Wilhelm

    2017-10-01

    The formation of protein complexes and the co-regulation of the cellular concentrations of proteins are essential mechanisms for cellular signaling and for maintaining homeostasis. Here we use isobaric-labeling multiplexed proteomics to analyze protein co-regulation and show that this allows the identification of protein-protein associations with high accuracy. We apply this 'interactome mapping by high-throughput quantitative proteome analysis' (IMAHP) method to a panel of 41 breast cancer cell lines and show that deviations of the observed protein co-regulations in specific cell lines from the consensus network affects cellular fitness. Furthermore, these aberrant interactions serve as biomarkers that predict the drug sensitivity of cell lines in screens across 195 drugs. We expect that IMAHP can be broadly used to gain insight into how changing landscapes of protein-protein associations affect the phenotype of biological systems.

  20. Proteomic Analysis of Metabolic Responses to Biofuels and Chemicals in Photosynthetic Cyanobacteria.

    PubMed

    Sun, T; Chen, L; Zhang, W

    2017-01-01

    Recent progresses in various "omics" technologies have enabled quantitative measurements of biological molecules in a high-throughput manner. Among them, high-throughput proteomics is a rapidly advancing field that offers a new means to quantify metabolic changes at protein level, which has significantly facilitated our understanding of cellular process, such as protein synthesis, posttranslational modifications, and degradation in responding to environmental perturbations. Cyanobacteria are autotrophic prokaryotes that can perform oxygenic photosynthesis and have recently attracted significant attentions as one promising alternative to traditionally biomass-based "microbial cell factories" to produce green fuels and chemicals. However, early studies have shown that the low tolerance to toxic biofuels and chemicals represented one major hurdle for further improving productivity of the cyanobacterial production systems. To address the issue, metabolic responses and their regulation of cyanobacterial cells to toxic end-products need to be defined. In this chapter, we discuss recent progresses in interpreting cyanobacterial responses to biofuels and chemicals using high-throughput proteomics approach, aiming to provide insights and guidelines on how to enhance tolerance and productivity of biofuels or chemicals in the renewable cyanobacteria systems in the future. © 2017 Elsevier Inc. All rights reserved.

  1. Application of targeted quantitative proteomics analysis in human cerebrospinal fluid using a liquid chromatography matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometer (LC MALDI TOF/TOF) platform.

    PubMed

    Pan, Sheng; Rush, John; Peskind, Elaine R; Galasko, Douglas; Chung, Kathryn; Quinn, Joseph; Jankovic, Joseph; Leverenz, James B; Zabetian, Cyrus; Pan, Catherine; Wang, Yan; Oh, Jung Hun; Gao, Jean; Zhang, Jianpeng; Montine, Thomas; Zhang, Jing

    2008-02-01

    Targeted quantitative proteomics by mass spectrometry aims to selectively detect one or a panel of peptides/proteins in a complex sample and is particularly appealing for novel biomarker verification/validation because it does not require specific antibodies. Here, we demonstrated the application of targeted quantitative proteomics in searching, identifying, and quantifying selected peptides in human cerebrospinal spinal fluid (CSF) using a matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometer (MALDI TOF/TOF)-based platform. The approach involved two major components: the use of isotopic-labeled synthetic peptides as references for targeted identification and quantification and a highly selective mass spectrometric analysis based on the unique characteristics of the MALDI instrument. The platform provides high confidence for targeted peptide detection in a complex system and can potentially be developed into a high-throughput system. Using the liquid chromatography (LC) MALDI TOF/TOF platform and the complementary identification strategy, we were able to selectively identify and quantify a panel of targeted peptides in the whole proteome of CSF without prior depletion of abundant proteins. The effectiveness and robustness of the approach associated with different sample complexity, sample preparation strategies, as well as mass spectrometric quantification were evaluated. Other issues related to chromatography separation and the feasibility for high-throughput analysis were also discussed. Finally, we applied targeted quantitative proteomics to analyze a subset of previously identified candidate markers in CSF samples of patients with Parkinson's disease (PD) at different stages and Alzheimer's disease (AD) along with normal controls.

  2. TRIC: an automated alignment strategy for reproducible protein quantification in targeted proteomics

    PubMed Central

    Röst, Hannes L.; Liu, Yansheng; D’Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C.; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi

    2016-01-01

    Large scale, quantitative proteomic studies have become essential for the analysis of clinical cohorts, large perturbation experiments and systems biology studies. While next-generation mass spectrometric techniques such as SWATH-MS have substantially increased throughput and reproducibility, ensuring consistent quantification of thousands of peptide analytes across multiple LC-MS/MS runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we have developed the TRIC software which utilizes fragment ion data to perform cross-run alignment, consistent peak-picking and quantification for high throughput targeted proteomics. TRIC uses a graph-based alignment strategy based on non-linear retention time correction to integrate peak elution information from all LC-MS/MS runs acquired in a study. When compared to state-of-the-art SWATH-MS data analysis, the algorithm was able to reduce the identification error by more than 3-fold at constant recall, while correcting for highly non-linear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem (iPS) cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups and substantially increased the quantitative completeness and biological information in the data, providing insights into protein dynamics of iPS cells. Overall, this study demonstrates the importance of consistent quantification in highly challenging experimental setups, and proposes an algorithm to automate this task, constituting the last missing piece in a pipeline for automated analysis of massively parallel targeted proteomics datasets. PMID:27479329

  3. Accounting for isotopic clustering in Fourier transform mass spectrometry data analysis for clinical diagnostic studies.

    PubMed

    Kakourou, Alexia; Vach, Werner; Nicolardi, Simone; van der Burgt, Yuri; Mertens, Bart

    2016-10-01

    Mass spectrometry based clinical proteomics has emerged as a powerful tool for high-throughput protein profiling and biomarker discovery. Recent improvements in mass spectrometry technology have boosted the potential of proteomic studies in biomedical research. However, the complexity of the proteomic expression introduces new statistical challenges in summarizing and analyzing the acquired data. Statistical methods for optimally processing proteomic data are currently a growing field of research. In this paper we present simple, yet appropriate methods to preprocess, summarize and analyze high-throughput MALDI-FTICR mass spectrometry data, collected in a case-control fashion, while dealing with the statistical challenges that accompany such data. The known statistical properties of the isotopic distribution of the peptide molecules are used to preprocess the spectra and translate the proteomic expression into a condensed data set. Information on either the intensity level or the shape of the identified isotopic clusters is used to derive summary measures on which diagnostic rules for disease status allocation will be based. Results indicate that both the shape of the identified isotopic clusters and the overall intensity level carry information on the class outcome and can be used to predict the presence or absence of the disease.

  4. iAB-RBC-283: A proteomically derived knowledge-base of erythrocyte metabolism that can be used to simulate its physiological and patho-physiological states.

    PubMed

    Bordbar, Aarash; Jamshidi, Neema; Palsson, Bernhard O

    2011-07-12

    The development of high-throughput technologies capable of whole cell measurements of genes, proteins, and metabolites has led to the emergence of systems biology. Integrated analysis of the resulting omic data sets has proved to be hard to achieve. Metabolic network reconstructions enable complex relationships amongst molecular components to be represented formally in a biologically relevant manner while respecting physical constraints. In silico models derived from such reconstructions can then be queried or interrogated through mathematical simulations. Proteomic profiling studies of the mature human erythrocyte have shown more proteins present related to metabolic function than previously thought; however the significance and the causal consequences of these findings have not been explored. Erythrocyte proteomic data was used to reconstruct the most expansive description of erythrocyte metabolism to date, following extensive manual curation, assessment of the literature, and functional testing. The reconstruction contains 281 enzymes representing functions from glycolysis to cofactor and amino acid metabolism. Such a comprehensive view of erythrocyte metabolism implicates the erythrocyte as a potential biomarker for different diseases as well as a 'cell-based' drug-screening tool. The analysis shows that 94 erythrocyte enzymes are implicated in morbid single nucleotide polymorphisms, representing 142 pathologies. In addition, over 230 FDA-approved and experimental pharmaceuticals have enzymatic targets in the erythrocyte. The advancement of proteomic technologies and increased generation of high-throughput proteomic data have created the need for a means to analyze these data in a coherent manner. Network reconstructions provide a systematic means to integrate and analyze proteomic data in a biologically meaning manner. Analysis of the red cell proteome has revealed an unexpected level of complexity in the functional capabilities of human erythrocyte metabolism.

  5. Computational approaches to protein inference in shotgun proteomics

    PubMed Central

    2012-01-01

    Shotgun proteomics has recently emerged as a powerful approach to characterizing proteomes in biological samples. Its overall objective is to identify the form and quantity of each protein in a high-throughput manner by coupling liquid chromatography with tandem mass spectrometry. As a consequence of its high throughput nature, shotgun proteomics faces challenges with respect to the analysis and interpretation of experimental data. Among such challenges, the identification of proteins present in a sample has been recognized as an important computational task. This task generally consists of (1) assigning experimental tandem mass spectra to peptides derived from a protein database, and (2) mapping assigned peptides to proteins and quantifying the confidence of identified proteins. Protein identification is fundamentally a statistical inference problem with a number of methods proposed to address its challenges. In this review we categorize current approaches into rule-based, combinatorial optimization and probabilistic inference techniques, and present them using integer programing and Bayesian inference frameworks. We also discuss the main challenges of protein identification and propose potential solutions with the goal of spurring innovative research in this area. PMID:23176300

  6. Learning from Heterogeneous Data Sources: An Application in Spatial Proteomics

    PubMed Central

    Breckels, Lisa M.; Holden, Sean B.; Wojnar, David; Mulvey, Claire M.; Christoforou, Andy; Groen, Arnoud; Trotter, Matthew W. B.; Kohlbacher, Oliver; Lilley, Kathryn S.; Gatto, Laurent

    2016-01-01

    Sub-cellular localisation of proteins is an essential post-translational regulatory mechanism that can be assayed using high-throughput mass spectrometry (MS). These MS-based spatial proteomics experiments enable us to pinpoint the sub-cellular distribution of thousands of proteins in a specific system under controlled conditions. Recent advances in high-throughput MS methods have yielded a plethora of experimental spatial proteomics data for the cell biology community. Yet, there are many third-party data sources, such as immunofluorescence microscopy or protein annotations and sequences, which represent a rich and vast source of complementary information. We present a unique transfer learning classification framework that utilises a nearest-neighbour or support vector machine system, to integrate heterogeneous data sources to considerably improve on the quantity and quality of sub-cellular protein assignment. We demonstrate the utility of our algorithms through evaluation of five experimental datasets, from four different species in conjunction with four different auxiliary data sources to classify proteins to tens of sub-cellular compartments with high generalisation accuracy. We further apply the method to an experiment on pluripotent mouse embryonic stem cells to classify a set of previously unknown proteins, and validate our findings against a recent high resolution map of the mouse stem cell proteome. The methodology is distributed as part of the open-source Bioconductor pRoloc suite for spatial proteomics data analysis. PMID:27175778

  7. Systems Proteomics for Translational Network Medicine

    PubMed Central

    Arrell, D. Kent; Terzic, Andre

    2012-01-01

    Universal principles underlying network science, and their ever-increasing applications in biomedicine, underscore the unprecedented capacity of systems biology based strategies to synthesize and resolve massive high throughput generated datasets. Enabling previously unattainable comprehension of biological complexity, systems approaches have accelerated progress in elucidating disease prediction, progression, and outcome. Applied to the spectrum of states spanning health and disease, network proteomics establishes a collation, integration, and prioritization algorithm to guide mapping and decoding of proteome landscapes from large-scale raw data. Providing unparalleled deconvolution of protein lists into global interactomes, integrative systems proteomics enables objective, multi-modal interpretation at molecular, pathway, and network scales, merging individual molecular components, their plurality of interactions, and functional contributions for systems comprehension. As such, network systems approaches are increasingly exploited for objective interpretation of cardiovascular proteomics studies. Here, we highlight network systems proteomic analysis pipelines for integration and biological interpretation through protein cartography, ontological categorization, pathway and functional enrichment and complex network analysis. PMID:22896016

  8. PeptideDepot: flexible relational database for visual analysis of quantitative proteomic data and integration of existing protein information.

    PubMed

    Yu, Kebing; Salomon, Arthur R

    2009-12-01

    Recently, dramatic progress has been achieved in expanding the sensitivity, resolution, mass accuracy, and scan rate of mass spectrometers able to fragment and identify peptides through MS/MS. Unfortunately, this enhanced ability to acquire proteomic data has not been accompanied by a concomitant increase in the availability of flexible tools allowing users to rapidly assimilate, explore, and analyze this data and adapt to various experimental workflows with minimal user intervention. Here we fill this critical gap by providing a flexible relational database called PeptideDepot for organization of expansive proteomic data sets, collation of proteomic data with available protein information resources, and visual comparison of multiple quantitative proteomic experiments. Our software design, built upon the synergistic combination of a MySQL database for safe warehousing of proteomic data with a FileMaker-driven graphical user interface for flexible adaptation to diverse workflows, enables proteomic end-users to directly tailor the presentation of proteomic data to the unique analysis requirements of the individual proteomics lab. PeptideDepot may be deployed as an independent software tool or integrated directly with our high throughput autonomous proteomic pipeline used in the automated acquisition and post-acquisition analysis of proteomic data.

  9. Proteome data to explore the impact of pBClin15 on Bacillus cereus ATCC 14579.

    PubMed

    Madeira, Jean-Paul; Alpha-Bazin, Béatrice; Armengaud, Jean; Omer, Hélène; Duport, Catherine

    2016-09-01

    This data article reports changes in the cellular and exoproteome of B. cereus cured from pBClin15.Time-course changes of proteins were assessed by high-throughput nanoLC-MS/MS. We report all the peptides and proteins identified and quantified in B. cereus with and without pBClin15. Proteins were classified into functional groups using the information available in the KEGG classification and we reported their abundance in term of normalized spectral abundance factor. The repertoire of experimentally confirmed proteins of B. cereus presented here is the largest ever reported, and provides new insights into the interplay between pBClin15 and its host B. cereus ATCC 14579. The data reported here is related to a published shotgun proteomics analysis regarding the role of pBClin15, "Deciphering the interactions between the Bacillus cereus linear plasmid, pBClin15, and its host by high-throughput comparative proteomics" Madeira et al. [1]. All the associated mass spectrometry data have been deposited in the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository (http://www.ebi.ac.uk/pride/), with the dataset identifier PRIDE: PXD001568, PRIDE: PXD002788 and PRIDE: PXD002789.

  10. VESPA: Software to Facilitate Genomic Annotation of Prokaryotic Organisms Through Integration of Proteomic and Transcriptomic Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peterson, Elena S.; McCue, Lee Ann; Rutledge, Alexandra C.

    2012-04-25

    Visual Exploration and Statistics to Promote Annotation (VESPA) is an interactive visual analysis software tool that facilitates the discovery of structural mis-annotations in prokaryotic genomes. VESPA integrates high-throughput peptide-centric proteomics data and oligo-centric or RNA-Seq transcriptomics data into a genomic context. The data may be interrogated via visual analysis across multiple levels of genomic resolution, linked searches, exports and interaction with BLAST to rapidly identify location of interest within the genome and evaluate potential mis-annotations.

  11. Identification of functional modules using network topology and high-throughput data.

    PubMed

    Ulitsky, Igor; Shamir, Ron

    2007-01-26

    With the advent of systems biology, biological knowledge is often represented today by networks. These include regulatory and metabolic networks, protein-protein interaction networks, and many others. At the same time, high-throughput genomics and proteomics techniques generate very large data sets, which require sophisticated computational analysis. Usually, separate and different analysis methodologies are applied to each of the two data types. An integrated investigation of network and high-throughput information together can improve the quality of the analysis by accounting simultaneously for topological network properties alongside intrinsic features of the high-throughput data. We describe a novel algorithmic framework for this challenge. We first transform the high-throughput data into similarity values, (e.g., by computing pairwise similarity of gene expression patterns from microarray data). Then, given a network of genes or proteins and similarity values between some of them, we seek connected sub-networks (or modules) that manifest high similarity. We develop algorithms for this problem and evaluate their performance on the osmotic shock response network in S. cerevisiae and on the human cell cycle network. We demonstrate that focused, biologically meaningful and relevant functional modules are obtained. In comparison with extant algorithms, our approach has higher sensitivity and higher specificity. We have demonstrated that our method can accurately identify functional modules. Hence, it carries the promise to be highly useful in analysis of high throughput data.

  12. Strong and oriented immobilization of single domain antibodies from crude bacterial lysates for high-throughput compatible cost-effective antibody array generation

    PubMed Central

    Even-Desrumeaux, Klervi; Baty, Daniel; Chames, Patrick

    2010-01-01

    Antibodies microarrays are among the novel class of rapidly emerging proteomic technologies that will allow us to efficiently perform specific diagnosis and proteome analysis. Recombinant antibody fragments are especially suited for this approach but their stability is often a limiting factor. Camelids produce functional antibodies devoid of light chains (HCAbs) of which the single N-terminal domain is fully capable of antigen binding. When produced as an independent domain, these so-called single domain antibody fragments (sdAbs) have several advantages for biotechnological applications thanks to their unique properties of size (15 kDa), stability, solubility, and expression yield. These features should allow sdAbs to outperform other antibody formats in a number of applications, notably as capture molecule for antibody arrays. In this study, we have produced antibody microarrays using direct and oriented immobilization of sdAbs produced in crude bacterial lysates to generate proof-of-principle of a high-throughput compatible array design. Several sdAb immobilization strategies have been explored. Immobilization of in vivo biotinylated sdAbs by direct spotting of bacterial lysate on streptavidin and sandwich detection was developed to achieve high sensitivity and specificity, whereas immobilization of “multi-tagged” sdAbs via anti-tag antibodies and direct labeled sample detection strategy was optimized for the design of high-density antibody arrays for high-throughput proteomics and identification of potential biomarkers. PMID:20859568

  13. A complete mass spectrometric map for the analysis of the yeast proteome and its application to quantitative trait analysis

    PubMed Central

    Picotti, Paola; Clement-Ziza, Mathieu; Lam, Henry; Campbell, David S.; Schmidt, Alexander; Deutsch, Eric W.; Röst, Hannes; Sun, Zhi; Rinner, Oliver; Reiter, Lukas; Shen, Qin; Michaelson, Jacob J.; Frei, Andreas; Alberti, Simon; Kusebauch, Ulrike; Wollscheid, Bernd; Moritz, Robert; Beyer, Andreas; Aebersold, Ruedi

    2013-01-01

    Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map one supporting discovery- (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways. PMID:23334424

  14. PeptideDepot: Flexible Relational Database for Visual Analysis of Quantitative Proteomic Data and Integration of Existing Protein Information

    PubMed Central

    Yu, Kebing; Salomon, Arthur R.

    2010-01-01

    Recently, dramatic progress has been achieved in expanding the sensitivity, resolution, mass accuracy, and scan rate of mass spectrometers able to fragment and identify peptides through tandem mass spectrometry (MS/MS). Unfortunately, this enhanced ability to acquire proteomic data has not been accompanied by a concomitant increase in the availability of flexible tools allowing users to rapidly assimilate, explore, and analyze this data and adapt to a variety of experimental workflows with minimal user intervention. Here we fill this critical gap by providing a flexible relational database called PeptideDepot for organization of expansive proteomic data sets, collation of proteomic data with available protein information resources, and visual comparison of multiple quantitative proteomic experiments. Our software design, built upon the synergistic combination of a MySQL database for safe warehousing of proteomic data with a FileMaker-driven graphical user interface for flexible adaptation to diverse workflows, enables proteomic end-users to directly tailor the presentation of proteomic data to the unique analysis requirements of the individual proteomics lab. PeptideDepot may be deployed as an independent software tool or integrated directly with our High Throughput Autonomous Proteomic Pipeline (HTAPP) used in the automated acquisition and post-acquisition analysis of proteomic data. PMID:19834895

  15. Biochemical Markers of Brain Injury: An Integrated Proteomics-Based Approach

    DTIC Science & Technology

    2006-02-01

    Anthony J Williams, X-C May Lu, Renwu Chen, Zhilin Liao, Rebeca Connors, Kevin K Wang, Ron L Hayes, Frank C Tortella, Jitendra R Dave. High throughput... YANG , A., et al. (2002). Evalu- ation of two-dimensional differential gel electrophoresis for proteomic expression analysis of a model breast cancer cell...apoptosis. J. Biol. Chem. 279, 1030–1039. Kuida K., Zheng T. S., Na S., Kuan C., Yang D., Karasuyama H., Rakic P. and Flavell R. A. (1996) Decreased apoptosis

  16. [Techniques for rapid production of monoclonal antibodies for use with antibody technology].

    PubMed

    Kamada, Haruhiko

    2012-01-01

    A monoclonal antibody (Mab), due to its specific binding ability to a target protein, can potentially be one of the most useful tools for the functional analysis of proteins in recent proteomics-based research. However, the production of Mab is a very time-consuming and laborious process (i.e., preparation of recombinant antigens, immunization of animals, preparation of hybridomas), making it the rate-limiting step in using Mabs in high-throughput proteomics research, which heavily relies on comprehensive and rapid methods. Therefore, there is a great demand for new methods to efficiently generate Mabs against a group of proteins identified by proteome analysis. Here, we describe a useful method called "Antibody proteomic technique" for the rapid generations of Mabs to pharmaceutical target, which were identified by proteomic analyses of disease samples (ex. tumor tissue, etc.). We also introduce another method to find profitable targets on vasculature, which is called "Vascular proteomic technique". Our results suggest that this method for the rapid generation of Mabs to proteins may be very useful in proteomics-based research as well as in clinical applications.

  17. Proteome Studies of Filamentous Fungi

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baker, Scott E.; Panisko, Ellen A.

    2011-04-20

    The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide breadth of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, non-gel basedmore » proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of different variations on the general method and technologies for identifying peptides in a given sample. We present a method that can serve as a “baseline” for proteomic studies of fungi.« less

  18. Clinical proteomics-driven precision medicine for targeted cancer therapy: current overview and future perspectives.

    PubMed

    Zhou, Li; Wang, Kui; Li, Qifu; Nice, Edouard C; Zhang, Haiyuan; Huang, Canhua

    2016-01-01

    Cancer is a common disease that is a leading cause of death worldwide. Currently, early detection and novel therapeutic strategies are urgently needed for more effective management of cancer. Importantly, protein profiling using clinical proteomic strategies, with spectacular sensitivity and precision, offer excellent promise for the identification of potential biomarkers that would direct the development of targeted therapeutic anticancer drugs for precision medicine. In particular, clinical sample sources, including tumor tissues and body fluids (blood, feces, urine and saliva), have been widely investigated using modern high-throughput mass spectrometry-based proteomic approaches combined with bioinformatic analysis, to pursue the possibilities of precision medicine for targeted cancer therapy. Discussed in this review are the current advantages and limitations of clinical proteomics, the available strategies of clinical proteomics for the management of precision medicine, as well as the challenges and future perspectives of clinical proteomics-driven precision medicine for targeted cancer therapy.

  19. Proteome studies of filamentous fungi.

    PubMed

    Baker, Scott E; Panisko, Ellen A

    2011-01-01

    The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide variety of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, nongel-based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of variations on the general methods and technologies for identifying peptides in a given sample. We present a method that can serve as a "baseline" for proteomic studies of fungi.

  20. A high-throughput, multi-channel photon-counting detector with picosecond timing

    NASA Astrophysics Data System (ADS)

    Lapington, J. S.; Fraser, G. W.; Miller, G. M.; Ashton, T. J. R.; Jarron, P.; Despeisse, M.; Powolny, F.; Howorth, J.; Milnes, J.

    2009-06-01

    High-throughput photon counting with high time resolution is a niche application area where vacuum tubes can still outperform solid-state devices. Applications in the life sciences utilizing time-resolved spectroscopies, particularly in the growing field of proteomics, will benefit greatly from performance enhancements in event timing and detector throughput. The HiContent project is a collaboration between the University of Leicester Space Research Centre, the Microelectronics Group at CERN, Photek Ltd., and end-users at the Gray Cancer Institute and the University of Manchester. The goal is to develop a detector system specifically designed for optical proteomics, capable of high content (multi-parametric) analysis at high throughput. The HiContent detector system is being developed to exploit this niche market. It combines multi-channel, high time resolution photon counting in a single miniaturized detector system with integrated electronics. The combination of enabling technologies; small pore microchannel plate devices with very high time resolution, and high-speed multi-channel ASIC electronics developed for the LHC at CERN, provides the necessary building blocks for a high-throughput detector system with up to 1024 parallel counting channels and 20 ps time resolution. We describe the detector and electronic design, discuss the current status of the HiContent project and present the results from a 64-channel prototype system. In the absence of an operational detector, we present measurements of the electronics performance using a pulse generator to simulate detector events. Event timing results from the NINO high-speed front-end ASIC captured using a fast digital oscilloscope are compared with data taken with the proposed electronic configuration which uses the multi-channel HPTDC timing ASIC.

  1. Advanced proteomic liquid chromatography

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xie, Fang; Smith, Richard D.; Shen, Yufeng

    2012-10-26

    Liquid chromatography coupled with mass spectrometry is the predominant platform used to analyze proteomics samples consisting of large numbers of proteins and their proteolytic products (e.g., truncated polypeptides) and spanning a wide range of relative concentrations. This review provides an overview of advanced capillary liquid chromatography techniques and methodologies that greatly improve separation resolving power and proteomics analysis coverage, sensitivity, and throughput.

  2. Analyzing large-scale proteomics projects with latent semantic indexing.

    PubMed

    Klie, Sebastian; Martens, Lennart; Vizcaíno, Juan Antonio; Côté, Richard; Jones, Phil; Apweiler, Rolf; Hinneburg, Alexander; Hermjakob, Henning

    2008-01-01

    Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.

  3. Advanced proteomic liquid chromatography

    PubMed Central

    Xie, Fang; Smith, Richard D.; Shen, Yufeng

    2012-01-01

    Liquid chromatography coupled with mass spectrometry is the predominant platform used to analyze proteomics samples consisting of large numbers of proteins and their proteolytic products (e.g., truncated polypeptides) and spanning a wide range of relative concentrations. This review provides an overview of advanced capillary liquid chromatography techniques and methodologies that greatly improve separation resolving power and proteomics analysis coverage, sensitivity, and throughput. PMID:22840822

  4. Fast liquid chromatography combined with mass spectrometry for the analysis of metabolites and proteins in human body fluids.

    PubMed

    Kortz, Linda; Helmschrodt, Christin; Ceglarek, Uta

    2011-03-01

    In the last decade various analytical strategies have been established to enhance separation speed and efficiency in high performance liquid chromatography applications. Chromatographic supports based on monolithic material, small porous particles, and porous layer beads have been developed and commercialized to improve throughput and separation efficiency. This paper provides an overview of current developments in fast chromatography combined with mass spectrometry for the analysis of metabolites and proteins in clinical applications. Advances and limitations of fast chromatography for the combination with mass spectrometry are discussed. Practical aspects of, recent developments in, and the present status of high-throughput analysis of human body fluids for therapeutic drug monitoring, toxicology, clinical metabolomics, and proteomics are presented.

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kolker, Eugene

    Our project focused primarily on analysis of different types of data produced by global high-throughput technologies, data integration of gene annotation, and gene and protein expression information, as well as on getting a better functional annotation of Shewanella genes. Specifically, four of our numerous major activities and achievements include the development of: statistical models for identification and expression proteomics, superior to currently available approaches (including our own earlier ones); approaches to improve gene annotations on the whole-organism scale; standards for annotation, transcriptomics and proteomics approaches; and generalized approaches for data integration of gene annotation, gene and protein expression information.

  6. Microfluidics for the analysis of membrane proteins: how do we get there?

    PubMed

    Battle, Katrina N; Uba, Franklin I; Soper, Steven A

    2014-08-01

    The development of fully automated and high-throughput systems for proteomics is now in demand because of the need to generate new protein-based disease biomarkers. Unfortunately, it is difficult to identify protein biomarkers that are low abundant when in the presence of highly abundant proteins, especially in complex biological samples such as serum, cell lysates, and other biological fluids. Membrane proteins, which are in many cases of low abundance compared to the cytosolic proteins, have various functions and can provide insight into the state of a disease and serve as targets for new drugs making them attractive biomarker candidates. Traditionally, proteins are identified through the use of gel electrophoretic techniques, which are not always suitable for particular protein samples such as membrane proteins. Microfluidics offers the potential as a fully automated platform for the efficient and high-throughput analysis of complex samples, such as membrane proteins, and do so with performance metrics that exceed their bench-top counterparts. In recent years, there have been various improvements to microfluidics and their use for proteomic analysis as reported in the literature. Consequently, this review presents an overview of the traditional proteomic-processing pipelines for membrane proteins and insights into new technological developments with a focus on the applicability of microfluidics for the analysis of membrane proteins. Sample preparation techniques will be discussed in detail and novel interfacing strategies as it relates to MS will be highlighted. Lastly, some general conclusions and future perspectives are presented. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. Active site specificity profiling datasets of matrix metalloproteinases (MMPs) 1, 2, 3, 7, 8, 9, 12, 13 and 14.

    PubMed

    Eckhard, Ulrich; Huesgen, Pitter F; Schilling, Oliver; Bellac, Caroline L; Butler, Georgina S; Cox, Jennifer H; Dufour, Antoine; Goebeler, Verena; Kappelhoff, Reinhild; Auf dem Keller, Ulrich; Klein, Theo; Lange, Philipp F; Marino, Giada; Morrison, Charlotte J; Prudova, Anna; Rodriguez, David; Starr, Amanda E; Wang, Yili; Overall, Christopher M

    2016-06-01

    The data described provide a comprehensive resource for the family-wide active site specificity portrayal of the human matrix metalloproteinase family. We used the high-throughput proteomic technique PICS (Proteomic Identification of protease Cleavage Sites) to comprehensively assay 9 different MMPs. We identified more than 4300 peptide cleavage sites, spanning both the prime and non-prime sides of the scissile peptide bond allowing detailed subsite cooperativity analysis. The proteomic cleavage data were expanded by kinetic analysis using a set of 6 quenched-fluorescent peptide substrates designed using these results. These datasets represent one of the largest specificity profiling efforts with subsequent structural follow up for any protease family and put the spotlight on the specificity similarities and differences of the MMP family. A detailed analysis of this data may be found in Eckhard et al. (2015) [1]. The raw mass spectrometry data and the corresponding metadata have been deposited in PRIDE/ProteomeXchange with the accession number PXD002265.

  8. NMR in the SPINE Structural Proteomics project.

    PubMed

    Ab, E; Atkinson, A R; Banci, L; Bertini, I; Ciofi-Baffoni, S; Brunner, K; Diercks, T; Dötsch, V; Engelke, F; Folkers, G E; Griesinger, C; Gronwald, W; Günther, U; Habeck, M; de Jong, R N; Kalbitzer, H R; Kieffer, B; Leeflang, B R; Loss, S; Luchinat, C; Marquardsen, T; Moskau, D; Neidig, K P; Nilges, M; Piccioli, M; Pierattelli, R; Rieping, W; Schippmann, T; Schwalbe, H; Travé, G; Trenner, J; Wöhnert, J; Zweckstetter, M; Kaptein, R

    2006-10-01

    This paper describes the developments, role and contributions of the NMR spectroscopy groups in the Structural Proteomics In Europe (SPINE) consortium. Focusing on the development of high-throughput (HTP) pipelines for NMR structure determinations of proteins, all aspects from sample preparation, data acquisition, data processing, data analysis to structure determination have been improved with respect to sensitivity, automation, speed, robustness and validation. Specific highlights are protonless (13)C-direct detection methods and inferential structure determinations (ISD). In addition to technological improvements, these methods have been applied to deliver over 60 NMR structures of proteins, among which are five that failed to crystallize. The inclusion of NMR spectroscopy in structural proteomics pipelines improves the success rate for protein structure determinations.

  9. Quantitative proteomics in cardiovascular research: global and targeted strategies

    PubMed Central

    Shen, Xiaomeng; Young, Rebeccah; Canty, John M.; Qu, Jun

    2014-01-01

    Extensive technical advances in the past decade have substantially expanded quantitative proteomics in cardiovascular research. This has great promise for elucidating the mechanisms of cardiovascular diseases (CVD) and the discovery of cardiac biomarkers used for diagnosis and treatment evaluation. Global and targeted proteomics are the two major avenues of quantitative proteomics. While global approaches enable unbiased discovery of altered proteins via relative quantification at the proteome level, targeted techniques provide higher sensitivity and accuracy, and are capable of multiplexed absolute quantification in numerous clinical/biological samples. While promising, technical challenges need to be overcome to enable full utilization of these techniques in cardiovascular medicine. Here we discuss recent advances in quantitative proteomics and summarize applications in cardiovascular research with an emphasis on biomarker discovery and elucidating molecular mechanisms of disease. We propose the integration of global and targeted strategies as a high-throughput pipeline for cardiovascular proteomics. Targeted approaches enable rapid, extensive validation of biomarker candidates discovered by global proteomics. These approaches provide a promising alternative to immunoassays and other low-throughput means currently used for limited validation. PMID:24920501

  10. File formats commonly used in mass spectrometry proteomics.

    PubMed

    Deutsch, Eric W

    2012-12-01

    The application of mass spectrometry (MS) to the analysis of proteomes has enabled the high-throughput identification and abundance measurement of hundreds to thousands of proteins per experiment. However, the formidable informatics challenge associated with analyzing MS data has required a wide variety of data file formats to encode the complex data types associated with MS workflows. These formats encompass the encoding of input instruction for instruments, output products of the instruments, and several levels of information and results used by and produced by the informatics analysis tools. A brief overview of the most common file formats in use today is presented here, along with a discussion of related topics.

  11. COMPASS: a suite of pre- and post-search proteomics software tools for OMSSA

    PubMed Central

    Wenger, Craig D.; Phanstiel, Douglas H.; Lee, M. Violet; Bailey, Derek J.; Coon, Joshua J.

    2011-01-01

    Here we present the Coon OMSSA Proteomic Analysis Software Suite (COMPASS): a free and open-source software pipeline for high-throughput analysis of proteomics data, designed around the Open Mass Spectrometry Search Algorithm. We detail a synergistic set of tools for protein database generation, spectral reduction, peptide false discovery rate analysis, peptide quantitation via isobaric labeling, protein parsimony and protein false discovery rate analysis, and protein quantitation. We strive for maximum ease of use, utilizing graphical user interfaces and working with data files in the original instrument vendor format. Results are stored in plain text comma-separated values files, which are easy to view and manipulate with a text editor or spreadsheet program. We illustrate the operation and efficacy of COMPASS through the use of two LC–MS/MS datasets. The first is a dataset of a highly annotated mixture of standard proteins and manually validated contaminants that exhibits the identification workflow. The second is a dataset of yeast peptides, labeled with isobaric stable isotope tags and mixed in known ratios, to demonstrate the quantitative workflow. For these two datasets, COMPASS performs equivalently or better than the current de facto standard, the Trans-Proteomic Pipeline. PMID:21298793

  12. Translational Research and Plasma Proteomic in Cancer.

    PubMed

    Santini, Annamaria Chiara; Giovane, Giancarlo; Auletta, Adelaide; Di Carlo, Angelina; Fiorelli, Alfonso; Cito, Letizia; Astarita, Carlo; Giordano, Antonio; Alfano, Roberto; Feola, Antonia; Di Domenico, Marina

    2016-04-01

    Proteomics is a recent field of research in molecular biology that can help in the fight against cancer through the search for biomarkers that can detect this disease in the early stages of its development. Proteomic is a speedily growing technology, also thanks to the development of even more sensitive and fast mass spectrometry analysis. Although this technique is the most widespread for the discovery of new cancer biomarkers, it still suffers of a poor sensitivity and insufficient reproducibility, essentially due to the tumor heterogeneity. Common technical shortcomings include limitations in the sensitivity of detecting low abundant biomarkers and possible systematic biases in the observed data. Current research attempts are trying to develop high-resolution proteomic instrumentation for high-throughput monitoring of protein changes that occur in cancer. In this review, we describe the basic features of the proteomic tools which have proven to be useful in cancer research, showing their advantages and disadvantages. The application of these proteomic tools could provide early biomarkers detection in various cancer types and could improve the understanding the mechanisms of tumor growth and dissemination. © 2015 Wiley Periodicals, Inc.

  13. Proteomics of Plant Pathogenic Fungi

    PubMed Central

    González-Fernández, Raquel; Prats, Elena; Jorrín-Novo, Jesús V.

    2010-01-01

    Plant pathogenic fungi cause important yield losses in crops. In order to develop efficient and environmental friendly crop protection strategies, molecular studies of the fungal biological cycle, virulence factors, and interaction with its host are necessary. For that reason, several approaches have been performed using both classical genetic, cell biology, and biochemistry and the modern, holistic, and high-throughput, omic techniques. This work briefly overviews the tools available for studying Plant Pathogenic Fungi and is amply focused on MS-based Proteomics analysis, based on original papers published up to December 2009. At a methodological level, different steps in a proteomic workflow experiment are discussed. Separate sections are devoted to fungal descriptive (intracellular, subcellular, extracellular) and differential expression proteomics and interactomics. From the work published we can conclude that Proteomics, in combination with other techniques, constitutes a powerful tool for providing important information about pathogenicity and virulence factors, thus opening up new possibilities for crop disease diagnosis and crop protection. PMID:20589070

  14. Proteomics of plant pathogenic fungi.

    PubMed

    González-Fernández, Raquel; Prats, Elena; Jorrín-Novo, Jesús V

    2010-01-01

    Plant pathogenic fungi cause important yield losses in crops. In order to develop efficient and environmental friendly crop protection strategies, molecular studies of the fungal biological cycle, virulence factors, and interaction with its host are necessary. For that reason, several approaches have been performed using both classical genetic, cell biology, and biochemistry and the modern, holistic, and high-throughput, omic techniques. This work briefly overviews the tools available for studying Plant Pathogenic Fungi and is amply focused on MS-based Proteomics analysis, based on original papers published up to December 2009. At a methodological level, different steps in a proteomic workflow experiment are discussed. Separate sections are devoted to fungal descriptive (intracellular, subcellular, extracellular) and differential expression proteomics and interactomics. From the work published we can conclude that Proteomics, in combination with other techniques, constitutes a powerful tool for providing important information about pathogenicity and virulence factors, thus opening up new possibilities for crop disease diagnosis and crop protection.

  15. Recent advances in micro-scale and nano-scale high-performance liquid-phase chromatography for proteome research.

    PubMed

    Tao, Dingyin; Zhang, Lihua; Shan, Yichu; Liang, Zhen; Zhang, Yukui

    2011-01-01

    High-performance liquid chromatography-electrospray ionization tandem mass spectrometry (HPLC-ESI-MS-MS) is regarded as one of the most powerful techniques for separation and identification of proteins. Recently, much effort has been made to improve the separation capacity, detection sensitivity, and analysis throughput of micro- and nano-HPLC, by increasing column length, reducing column internal diameter, and using integrated techniques. Development of HPLC columns has also been rapid, as a result of the use of submicrometer packing materials and monolithic columns. All these innovations result in clearly improved performance of micro- and nano-HPLC for proteome research.

  16. Solid-Phase Extraction Strategies to Surmount Body Fluid Sample Complexity in High-Throughput Mass Spectrometry-Based Proteomics

    PubMed Central

    Bladergroen, Marco R.; van der Burgt, Yuri E. M.

    2015-01-01

    For large-scale and standardized applications in mass spectrometry- (MS-) based proteomics automation of each step is essential. Here we present high-throughput sample preparation solutions for balancing the speed of current MS-acquisitions and the time needed for analytical workup of body fluids. The discussed workflows reduce body fluid sample complexity and apply for both bottom-up proteomics experiments and top-down protein characterization approaches. Various sample preparation methods that involve solid-phase extraction (SPE) including affinity enrichment strategies have been automated. Obtained peptide and protein fractions can be mass analyzed by direct infusion into an electrospray ionization (ESI) source or by means of matrix-assisted laser desorption ionization (MALDI) without further need of time-consuming liquid chromatography (LC) separations. PMID:25692071

  17. Label-free proteomic analysis to confirm the predicted proteome of Corynebacterium pseudotuberculosis under nitrosative stress mediated by nitric oxide.

    PubMed

    Silva, Wanderson M; Carvalho, Rodrigo D; Soares, Siomar C; Bastos, Isabela Fs; Folador, Edson L; Souza, Gustavo Hmf; Le Loir, Yves; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco

    2014-12-04

    Corynebacterium pseudotuberculosis biovar ovis is a facultative intracellular pathogen, and the etiological agent of caseous lymphadenitis in small ruminants. During the infection process, the bacterium is subjected to several stress conditions, including nitrosative stress, which is caused by nitric oxide (NO). In silico analysis of the genome of C. pseudotuberculosis ovis 1002 predicted several genes that could influence the resistance of this pathogen to nitrosative stress. Here, we applied high-throughput proteomics using high definition mass spectrometry to characterize the functional genome of C. pseudotuberculosis ovis 1002 in the presence of NO-donor Diethylenetriamine/nitric oxide adduct (DETA/NO), with the aim of identifying proteins involved in nitrosative stress resistance. We characterized 835 proteins, representing approximately 41% of the predicted proteome of C. pseudotuberculosis ovis 1002, following exposure to nitrosative stress. In total, 102 proteins were exclusive to the proteome of DETA/NO-induced cells, and a further 58 proteins were differentially regulated between the DETA/NO and control conditions. An interactomic analysis of the differential proteome of C. pseudotuberculosis in response to nitrosative stress was also performed. Our proteomic data set suggested the activation of both a general stress response and a specific nitrosative stress response, as well as changes in proteins involved in cellular metabolism, detoxification, transcriptional regulation, and DNA synthesis and repair. Our proteomic analysis validated previously-determined in silico data for C. pseudotuberculosis ovis 1002. In addition, proteomic screening performed in the presence of NO enabled the identification of a set of factors that can influence the resistance and survival of C. pseudotuberculosis during exposure to nitrosative stress.

  18. Methods, Tools and Current Perspectives in Proteogenomics *

    PubMed Central

    Ruggles, Kelly V.; Krug, Karsten; Wang, Xiaojing; Clauser, Karl R.; Wang, Jing; Payne, Samuel H.; Fenyö, David; Zhang, Bing; Mani, D. R.

    2017-01-01

    With combined technological advancements in high-throughput next-generation sequencing and deep mass spectrometry-based proteomics, proteogenomics, i.e. the integrative analysis of proteomic and genomic data, has emerged as a new research field. Early efforts in the field were focused on improving protein identification using sample-specific genomic and transcriptomic sequencing data. More recently, integrative analysis of quantitative measurements from genomic and proteomic studies have identified novel insights into gene expression regulation, cell signaling, and disease. Many methods and tools have been developed or adapted to enable an array of integrative proteogenomic approaches and in this article, we systematically classify published methods and tools into four major categories, (1) Sequence-centric proteogenomics; (2) Analysis of proteogenomic relationships; (3) Integrative modeling of proteogenomic data; and (4) Data sharing and visualization. We provide a comprehensive review of methods and available tools in each category and highlight their typical applications. PMID:28456751

  19. Proteomic analysis of formalin-fixed paraffin embedded tissue by MALDI imaging mass spectrometry

    PubMed Central

    Casadonte, Rita; Caprioli, Richard M

    2012-01-01

    Archived formalin-fixed paraffin-embedded (FFPE) tissue collections represent a valuable informational resource for proteomic studies. Multiple FFPE core biopsies can be assembled in a single block to form tissue microarrays (TMAs). We describe a protocol for analyzing protein in FFPE -TMAs using matrix-assisted laser desorption/ionization (MAL DI) imaging mass spectrometry (IMS). The workflow incorporates an antigen retrieval step following deparaffinization, in situ trypsin digestion, matrix application and then mass spectrometry signal acquisition. The direct analysis of FFPE -TMA tissue using IMS allows direct analysis of multiple tissue samples in a single experiment without extraction and purification of proteins. The advantages of high speed and throughput, easy sample handling and excellent reproducibility make this technology a favorable approach for the proteomic analysis of clinical research cohorts with large sample numbers. For example, TMA analysis of 300 FFPE cores would typically require 6 h of total time through data acquisition, not including data analysis. PMID:22011652

  20. A software suite for the generation and comparison of peptide arrays from sets of data collected by liquid chromatography-mass spectrometry.

    PubMed

    Li, Xiao-jun; Yi, Eugene C; Kemp, Christopher J; Zhang, Hui; Aebersold, Ruedi

    2005-09-01

    There is an increasing interest in the quantitative proteomic measurement of the protein contents of substantially similar biological samples, e.g. for the analysis of cellular response to perturbations over time or for the discovery of protein biomarkers from clinical samples. Technical limitations of current proteomic platforms such as limited reproducibility and low throughput make this a challenging task. A new LC-MS-based platform is able to generate complex peptide patterns from the analysis of proteolyzed protein samples at high throughput and represents a promising approach for quantitative proteomics. A crucial component of the LC-MS approach is the accurate evaluation of the abundance of detected peptides over many samples and the identification of peptide features that can stratify samples with respect to their genetic, physiological, or environmental origins. We present here a new software suite, SpecArray, that generates a peptide versus sample array from a set of LC-MS data. A peptide array stores the relative abundance of thousands of peptide features in many samples and is in a format identical to that of a gene expression microarray. A peptide array can be subjected to an unsupervised clustering analysis to stratify samples or to a discriminant analysis to identify discriminatory peptide features. We applied the SpecArray to analyze two sets of LC-MS data: one was from four repeat LC-MS analyses of the same glycopeptide sample, and another was from LC-MS analysis of serum samples of five male and five female mice. We demonstrate through these two study cases that the SpecArray software suite can serve as an effective software platform in the LC-MS approach for quantitative proteomics.

  1. Selected reaction monitoring mass spectrometry: a methodology overview.

    PubMed

    Ebhardt, H Alexander

    2014-01-01

    Moving past the discovery phase of proteomics, the term targeted proteomics combines multiple approaches investigating a certain set of proteins in more detail. One such targeted proteomics approach is the combination of liquid chromatography and selected or multiple reaction monitoring mass spectrometry (SRM, MRM). SRM-MS requires prior knowledge of the fragmentation pattern of peptides, as the presence of the analyte in a sample is determined by measuring the m/z values of predefined precursor and fragment ions. Using scheduled SRM-MS, many analytes can robustly be monitored allowing for high-throughput sample analysis of the same set of proteins over many conditions. In this chapter, fundaments of SRM-MS are explained as well as an optimized SRM pipeline from assay generation to data analyzed.

  2. Protein identification and quantification from riverbank grape, Vitis riparia: Comparing SDS-PAGE and FASP-GPF techniques for shotgun proteomic analysis.

    PubMed

    George, Iniga S; Fennell, Anne Y; Haynes, Paul A

    2015-09-01

    Protein sample preparation optimisation is critical for establishing reproducible high throughput proteomic analysis. In this study, two different fractionation sample preparation techniques (in-gel digestion and in-solution digestion) for shotgun proteomics were used to quantitatively compare proteins identified in Vitis riparia leaf samples. The total number of proteins and peptides identified were compared between filter aided sample preparation (FASP) coupled with gas phase fractionation (GPF) and SDS-PAGE methods. There was a 24% increase in the total number of reproducibly identified proteins when FASP-GPF was used. FASP-GPF is more reproducible, less expensive and a better method than SDS-PAGE for shotgun proteomics of grapevine samples as it significantly increases protein identification across biological replicates. Total peptide and protein information from the two fractionation techniques is available in PRIDE with the identifier PXD001399 (http://proteomecentral.proteomexchange.org/dataset/PXD001399). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. The proteomic landscape of triple-negative breast cancer.

    PubMed

    Lawrence, Robert T; Perez, Elizabeth M; Hernández, Daniel; Miller, Chris P; Haas, Kelsey M; Irie, Hanna Y; Lee, Su-In; Blau, C Anthony; Villén, Judit

    2015-04-28

    Triple-negative breast cancer is a heterogeneous disease characterized by poor clinical outcomes and a shortage of targeted treatment options. To discover molecular features of triple-negative breast cancer, we performed quantitative proteomics analysis of twenty human-derived breast cell lines and four primary breast tumors to a depth of more than 12,000 distinct proteins. We used this data to identify breast cancer subtypes at the protein level and demonstrate the precise quantification of biomarkers, signaling proteins, and biological pathways by mass spectrometry. We integrated proteomics data with exome sequence resources to identify genomic aberrations that affect protein expression. We performed a high-throughput drug screen to identify protein markers of drug sensitivity and understand the mechanisms of drug resistance. The genome and proteome provide complementary information that, when combined, yield a powerful engine for therapeutic discovery. This resource is available to the cancer research community to catalyze further analysis and investigation. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  4. File Formats Commonly Used in Mass Spectrometry Proteomics*

    PubMed Central

    Deutsch, Eric W.

    2012-01-01

    The application of mass spectrometry (MS) to the analysis of proteomes has enabled the high-throughput identification and abundance measurement of hundreds to thousands of proteins per experiment. However, the formidable informatics challenge associated with analyzing MS data has required a wide variety of data file formats to encode the complex data types associated with MS workflows. These formats encompass the encoding of input instruction for instruments, output products of the instruments, and several levels of information and results used by and produced by the informatics analysis tools. A brief overview of the most common file formats in use today is presented here, along with a discussion of related topics. PMID:22956731

  5. YPED: An Integrated Bioinformatics Suite and Database for Mass Spectrometry-based Proteomics Research

    PubMed Central

    Colangelo, Christopher M.; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L.; Carriero, Nicholas J.; Gulcicek, Erol E.; Lam, TuKiet T.; Wu, Terence; Bjornson, Robert D.; Bruce, Can; Nairn, Angus C.; Rinehart, Jesse; Miller, Perry L.; Williams, Kenneth R.

    2015-01-01

    We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry (LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED’s database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. PMID:25712262

  6. YPED: an integrated bioinformatics suite and database for mass spectrometry-based proteomics research.

    PubMed

    Colangelo, Christopher M; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L; Carriero, Nicholas J; Gulcicek, Erol E; Lam, TuKiet T; Wu, Terence; Bjornson, Robert D; Bruce, Can; Nairn, Angus C; Rinehart, Jesse; Miller, Perry L; Williams, Kenneth R

    2015-02-01

    We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography-tandem mass spectrometry (LC-MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

  7. TimeXNet Web: Identifying cellular response networks from diverse omics time-course data.

    PubMed

    Tan, Phit Ling; López, Yosvany; Nakai, Kenta; Patil, Ashwini

    2018-05-14

    Condition-specific time-course omics profiles are frequently used to study cellular response to stimuli and identify associated signaling pathways. However, few online tools allow users to analyze multiple types of high-throughput time-course data. TimeXNet Web is a web server that extracts a time-dependent gene/protein response network from time-course transcriptomic, proteomic or phospho-proteomic data, and an input interaction network. It classifies the given genes/proteins into time-dependent groups based on the time of their highest activity and identifies the most probable paths connecting genes/proteins in consecutive groups. The response sub-network is enriched in activated genes/proteins and contains novel regulators that do not show any observable change in the input data. Users can view the resultant response network and analyze it for functional enrichment. TimeXNet Web supports the analysis of high-throughput data from multiple species by providing high quality, weighted protein-protein interaction networks for 12 model organisms. http://txnet.hgc.jp/. ashwini@hgc.jp. Supplementary data are available at Bioinformatics online.

  8. Global analysis of the rat and human platelet proteome – the molecular blueprint for illustrating multi-functional platelets and cross-species function evolution

    PubMed Central

    Yu, Yanbao; Leng, Taohua; Yun, Dong; Liu, Na; Yao, Jun; Dai, Ying; Yang, Pengyuan; Chen, Xian

    2013-01-01

    Emerging evidences indicate that blood platelets function in multiple biological processes including immune response, bone metastasis and liver regeneration in addition to their known roles in hemostasis and thrombosis. Global elucidation of platelet proteome will provide the molecular base of these platelet functions. Here, we set up a high throughput platform for maximum exploration of the rat/human platelet proteome using integrated proteomics technologies, and then applied to identify the largest number of the proteins expressed in both rat and human platelets. After stringent statistical filtration, a total of 837 unique proteins matched with at least two unique peptides were precisely identified, making it the first comprehensive protein database so far for rat platelets. Meanwhile, quantitative analyses of the thrombin-stimulated platelets offered great insights into the biological functions of platelet proteins and therefore confirmed our global profiling data. A comparative proteomic analysis between rat and human platelets was also conducted, which revealed not only a significant similarity, but also an across-species evolutionary link that the orthologous proteins representing ‘core proteome’, and the ‘evolutionary proteome’ is actually a relatively static proteome. PMID:20443191

  9. Systems biology definition of the core proteome of metabolism and expression is consistent with high-throughput data.

    PubMed

    Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O

    2015-08-25

    Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.

  10. Definitive screening design enables optimization of LC-ESI-MS/MS parameters in proteomics.

    PubMed

    Aburaya, Shunsuke; Aoki, Wataru; Minakuchi, Hiroyoshi; Ueda, Mitsuyoshi

    2017-12-01

    In proteomics, more than 100,000 peptides are generated from the digestion of human cell lysates. Proteome samples have a broad dynamic range in protein abundance; therefore, it is critical to optimize various parameters of LC-ESI-MS/MS to comprehensively identify these peptides. However, there are many parameters for LC-ESI-MS/MS analysis. In this study, we applied definitive screening design to simultaneously optimize 14 parameters in the operation of monolithic capillary LC-ESI-MS/MS to increase the number of identified proteins and/or the average peak area of MS1. The simultaneous optimization enabled the determination of two-factor interactions between LC and MS. Finally, we found two parameter sets of monolithic capillary LC-ESI-MS/MS that increased the number of identified proteins by 8.1% or the average peak area of MS1 by 67%. The definitive screening design would be highly useful for high-throughput analysis of the best parameter set in LC-ESI-MS/MS systems.

  11. Studies of a biochemical factory: tomato trichome deep expressed sequence tag sequencing and proteomics.

    PubMed

    Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L

    2010-07-01

    Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.

  12. Studies of a Biochemical Factory: Tomato Trichome Deep Expressed Sequence Tag Sequencing and Proteomics1[W][OA

    PubMed Central

    Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.

    2010-01-01

    Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087

  13. A comprehensive and scalable database search system for metaproteomics.

    PubMed

    Chatterjee, Sandip; Stupp, Gregory S; Park, Sung Kyu Robin; Ducom, Jean-Christophe; Yates, John R; Su, Andrew I; Wolan, Dennis W

    2016-08-16

    Mass spectrometry-based shotgun proteomics experiments rely on accurate matching of experimental spectra against a database of protein sequences. Existing computational analysis methods are limited in the size of their sequence databases, which severely restricts the proteomic sequencing depth and functional analysis of highly complex samples. The growing amount of public high-throughput sequencing data will only exacerbate this problem. We designed a broadly applicable metaproteomic analysis method (ComPIL) that addresses protein database size limitations. Our approach to overcome this significant limitation in metaproteomics was to design a scalable set of sequence databases assembled for optimal library querying speeds. ComPIL was integrated with a modified version of the search engine ProLuCID (termed "Blazmass") to permit rapid matching of experimental spectra. Proof-of-principle analysis of human HEK293 lysate with a ComPIL database derived from high-quality genomic libraries was able to detect nearly all of the same peptides as a search with a human database (~500x fewer peptides in the database), with a small reduction in sensitivity. We were also able to detect proteins from the adenovirus used to immortalize these cells. We applied our method to a set of healthy human gut microbiome proteomic samples and showed a substantial increase in the number of identified peptides and proteins compared to previous metaproteomic analyses, while retaining a high degree of protein identification accuracy and allowing for a more in-depth characterization of the functional landscape of the samples. The combination of ComPIL with Blazmass allows proteomic searches to be performed with database sizes much larger than previously possible. These large database searches can be applied to complex meta-samples with unknown composition or proteomic samples where unexpected proteins may be identified. The protein database, proteomic search engine, and the proteomic data files for the 5 microbiome samples characterized and discussed herein are open source and available for use and additional analysis.

  14. Computer aided manual validation of mass spectrometry-based proteomic data.

    PubMed

    Curran, Timothy G; Bryson, Bryan D; Reigelhaupt, Michael; Johnson, Hannah; White, Forest M

    2013-06-15

    Advances in mass spectrometry-based proteomic technologies have increased the speed of analysis and the depth provided by a single analysis. Computational tools to evaluate the accuracy of peptide identifications from these high-throughput analyses have not kept pace with technological advances; currently the most common quality evaluation methods are based on statistical analysis of the likelihood of false positive identifications in large-scale data sets. While helpful, these calculations do not consider the accuracy of each identification, thus creating a precarious situation for biologists relying on the data to inform experimental design. Manual validation is the gold standard approach to confirm accuracy of database identifications, but is extremely time-intensive. To palliate the increasing time required to manually validate large proteomic datasets, we provide computer aided manual validation software (CAMV) to expedite the process. Relevant spectra are collected, catalogued, and pre-labeled, allowing users to efficiently judge the quality of each identification and summarize applicable quantitative information. CAMV significantly reduces the burden associated with manual validation and will hopefully encourage broader adoption of manual validation in mass spectrometry-based proteomics. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. PACOM: A Versatile Tool for Integrating, Filtering, Visualizing, and Comparing Multiple Large Mass Spectrometry Proteomics Data Sets.

    PubMed

    Martínez-Bartolomé, Salvador; Medina-Aunon, J Alberto; López-García, Miguel Ángel; González-Tejedo, Carmen; Prieto, Gorka; Navajas, Rosana; Salazar-Donate, Emilio; Fernández-Costa, Carolina; Yates, John R; Albar, Juan Pablo

    2018-04-06

    Mass-spectrometry-based proteomics has evolved into a high-throughput technology in which numerous large-scale data sets are generated from diverse analytical platforms. Furthermore, several scientific journals and funding agencies have emphasized the storage of proteomics data in public repositories to facilitate its evaluation, inspection, and reanalysis. (1) As a consequence, public proteomics data repositories are growing rapidly. However, tools are needed to integrate multiple proteomics data sets to compare different experimental features or to perform quality control analysis. Here, we present a new Java stand-alone tool, Proteomics Assay COMparator (PACOM), that is able to import, combine, and simultaneously compare numerous proteomics experiments to check the integrity of the proteomic data as well as verify data quality. With PACOM, the user can detect source of errors that may have been introduced in any step of a proteomics workflow and that influence the final results. Data sets can be easily compared and integrated, and data quality and reproducibility can be visually assessed through a rich set of graphical representations of proteomics data features as well as a wide variety of data filters. Its flexibility and easy-to-use interface make PACOM a unique tool for daily use in a proteomics laboratory. PACOM is available at https://github.com/smdb21/pacom .

  16. Content Is King: Databases Preserve the Collective Information of Science.

    PubMed

    Yates, John R

    2018-04-01

    Databases store sequence information experimentally gathered to create resources that further science. In the last 20 years databases have become critical components of fields like proteomics where they provide the basis for large-scale and high-throughput proteomic informatics. Amos Bairoch, winner of the Association of Biomolecular Resource Facilities Frederick Sanger Award, has created some of the important databases proteomic research depends upon for accurate interpretation of data.

  17. Analysis of Protein Expression in Cell Microarrays: A Tool for Antibody-based Proteomics

    PubMed Central

    Andersson, Ann-Catrin; Strömberg, Sara; Bäckvall, Helena; Kampf, Caroline; Uhlen, Mathias; Wester, Kenneth; Pontén, Fredrik

    2006-01-01

    Tissue microarray (TMA) technology provides a possibility to explore protein expression patterns in a multitude of normal and disease tissues in a high-throughput setting. Although TMAs have been used for analysis of tissue samples, robust methods for studying in vitro cultured cell lines and cell aspirates in a TMA format have been lacking. We have adopted a technique to homogeneously distribute cells in an agarose gel matrix, creating an artificial tissue. This enables simultaneous profiling of protein expression in suspension- and adherent-grown cell samples assembled in a microarray. In addition, the present study provides an optimized strategy for the basic laboratory steps to efficiently produce TMAs. Presented modifications resulted in an improved quality of specimens and a higher section yield compared with standard TMA production protocols. Sections from the generated cell TMAs were tested for immunohistochemical staining properties using 20 well-characterized antibodies. Comparison of immunoreactivity in cultured dispersed cells and corresponding cells in tissue samples showed congruent results for all tested antibodies. We conclude that a modified TMA technique, including cell samples, provides a valuable tool for high-throughput analysis of protein expression, and that this technique can be used for global approaches to explore the human proteome. PMID:16957166

  18. High throughput proteomic analysis of the secretome in an explant model of articular cartilage inflammation

    PubMed Central

    Clutterbuck, Abigail L.; Smith, Julia R.; Allaway, David; Harris, Pat; Liddell, Susan; Mobasheri, Ali

    2011-01-01

    This study employed a targeted high-throughput proteomic approach to identify the major proteins present in the secretome of articular cartilage. Explants from equine metacarpophalangeal joints were incubated alone or with interleukin-1beta (IL-1β, 10 ng/ml), with or without carprofen, a non-steroidal anti-inflammatory drug, for six days. After tryptic digestion of culture medium supernatants, resulting peptides were separated by HPLC and detected in a Bruker amaZon ion trap instrument. The five most abundant peptides in each MS scan were fragmented and the fragmentation patterns compared to mammalian entries in the Swiss-Prot database, using the Mascot search engine. Tryptic peptides originating from aggrecan core protein, cartilage oligomeric matrix protein (COMP), fibronectin, fibromodulin, thrombospondin-1 (TSP-1), clusterin (CLU), cartilage intermediate layer protein-1 (CILP-1), chondroadherin (CHAD) and matrix metalloproteinases MMP-1 and MMP-3 were detected. Quantitative western blotting confirmed the presence of CILP-1, CLU, MMP-1, MMP-3 and TSP-1. Treatment with IL-1β increased MMP-1, MMP-3 and TSP-1 and decreased the CLU precursor but did not affect CILP-1 and CLU levels. Many of the proteins identified have well-established extracellular matrix functions and are involved in early repair/stress responses in cartilage. This high throughput approach may be used to study the changes that occur in the early stages of osteoarthritis. PMID:21354348

  19. Comparative proteome analysis of Milnesium tardigradum in early embryonic state versus adults in active and anhydrobiotic state.

    PubMed

    Schokraie, Elham; Warnken, Uwe; Hotz-Wagenblatt, Agnes; Grohme, Markus A; Hengherr, Steffen; Förster, Frank; Schill, Ralph O; Frohme, Marcus; Dandekar, Thomas; Schnölzer, Martina

    2012-01-01

    Tardigrades have fascinated researchers for more than 300 years because of their extraordinary capability to undergo cryptobiosis and survive extreme environmental conditions. However, the survival mechanisms of tardigrades are still poorly understood mainly due to the absence of detailed knowledge about the proteome and genome of these organisms. Our study was intended to provide a basis for the functional characterization of expressed proteins in different states of tardigrades. High-throughput, high-accuracy proteomics in combination with a newly developed tardigrade specific protein database resulted in the identification of more than 3000 proteins in three different states: early embryonic state and adult animals in active and anhydrobiotic state. This comprehensive proteome resource includes protein families such as chaperones, antioxidants, ribosomal proteins, cytoskeletal proteins, transporters, protein channels, nutrient reservoirs, and developmental proteins. A comparative analysis of protein families in the different states was performed by calculating the exponentially modified protein abundance index which classifies proteins in major and minor components. This is the first step to analyzing the proteins involved in early embryonic development, and furthermore proteins which might play an important role in the transition into the anhydrobiotic state.

  20. Comparative proteome analysis of Milnesium tardigradum in early embryonic state versus adults in active and anhydrobiotic state

    PubMed Central

    Schokraie, Elham; Warnken, Uwe; Hotz-Wagenblatt, Agnes; Grohme, Markus A.; Hengherr, Steffen; Förster, Frank; Schill, Ralph O.; Frohme, Marcus; Dandekar, Thomas; Schnölzer, Martina

    2012-01-01

    Tardigrades have fascinated researchers for more than 300 years because of their extraordinary capability to undergo cryptobiosis and survive extreme environmental conditions. However, the survival mechanisms of tardigrades are still poorly understood mainly due to the absence of detailed knowledge about the proteome and genome of these organisms. Our study was intended to provide a basis for the functional characterization of expressed proteins in different states of tardigrades. High-throughput, high-accuracy proteomics in combination with a newly developed tardigrade specific protein database resulted in the identification of more than 3000 proteins in three different states: early embryonic state and adult animals in active and anhydrobiotic state. This comprehensive proteome resource includes protein families such as chaperones, antioxidants, ribosomal proteins, cytoskeletal proteins, transporters, protein channels, nutrient reservoirs, and developmental proteins. A comparative analysis of protein families in the different states was performed by calculating the exponentially modified protein abundance index which classifies proteins in major and minor components. This is the first step to analyzing the proteins involved in early embryonic development, and furthermore proteins which might play an important role in the transition into the anhydrobiotic state. PMID:23029181

  1. Proteomics: a new approach to the study of disease.

    PubMed

    Chambers, G; Lawrie, L; Cash, P; Murray, G I

    2000-11-01

    The global analysis of cellular proteins has recently been termed proteomics and is a key area of research that is developing in the post-genome era. Proteomics uses a combination of sophisticated techniques including two-dimensional (2D) gel electrophoresis, image analysis, mass spectrometry, amino acid sequencing, and bio-informatics to resolve comprehensively, to quantify, and to characterize proteins. The application of proteomics provides major opportunities to elucidate disease mechanisms and to identify new diagnostic markers and therapeutic targets. This review aims to explain briefly the background to proteomics and then to outline proteomic techniques. Applications to the study of human disease conditions ranging from cancer to infectious diseases are reviewed. Finally, possible future advances are briefly considered, especially those which may lead to faster sample throughput and increased sensitivity for the detection of individual proteins. Copyright 2000 John Wiley & Sons, Ltd.

  2. MS-REDUCE: an ultrafast technique for reduction of big mass spectrometry data for high-throughput processing.

    PubMed

    Awan, Muaaz Gul; Saeed, Fahad

    2016-05-15

    Modern proteomics studies utilize high-throughput mass spectrometers which can produce data at an astonishing rate. These big mass spectrometry (MS) datasets can easily reach peta-scale level creating storage and analytic problems for large-scale systems biology studies. Each spectrum consists of thousands of peaks which have to be processed to deduce the peptide. However, only a small percentage of peaks in a spectrum are useful for peptide deduction as most of the peaks are either noise or not useful for a given spectrum. This redundant processing of non-useful peaks is a bottleneck for streaming high-throughput processing of big MS data. One way to reduce the amount of computation required in a high-throughput environment is to eliminate non-useful peaks. Existing noise removing algorithms are limited in their data-reduction capability and are compute intensive making them unsuitable for big data and high-throughput environments. In this paper we introduce a novel low-complexity technique based on classification, quantization and sampling of MS peaks. We present a novel data-reductive strategy for analysis of Big MS data. Our algorithm, called MS-REDUCE, is capable of eliminating noisy peaks as well as peaks that do not contribute to peptide deduction before any peptide deduction is attempted. Our experiments have shown up to 100× speed up over existing state of the art noise elimination algorithms while maintaining comparable high quality matches. Using our approach we were able to process a million spectra in just under an hour on a moderate server. The developed tool and strategy has been made available to wider proteomics and parallel computing community and the code can be found at https://github.com/pcdslab/MSREDUCE CONTACT: : fahad.saeed@wmich.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  3. The application of proteomics in different aspects of hepatocellular carcinoma research.

    PubMed

    Xing, Xiaohua; Liang, Dong; Huang, Yao; Zeng, Yongyi; Han, Xiao; Liu, Xiaolong; Liu, Jingfeng

    2016-08-11

    Hepatocellular carcinoma (HCC) is one of the most common malignant tumors, which is causing the second leading cancer-related death worldwide. With the significant advances of high-throughput protein analysis techniques, the proteomics offered an extremely useful and versatile analytical platform for biomedical researches. In recent years, different proteomic strategies have been widely applied in the various aspects of HCC studies, ranging from screening the early diagnostic and prognostic biomarkers to in-depth investigating the underlying molecular mechanisms. In this review, we would like to systematically summarize the current applications of proteomics in hepatocellular carcinoma study, and discuss the challenges of applying proteomics in study clinical samples, as well as discuss the possible application of proteomics in precision medicine. In this review, we have systematically summarized the current applications of proteomics in hepatocellular carcinoma study, ranging from screening biomarkers to in-depth investigating the underlying molecular mechanisms. In addition, we have discussed the challenges of applying proteomics in study clinical samples, as well as the possible applications of proteomics in precision medicine. We believe that this review would help readers to be better familiar with the recent progresses of clinical proteomics, especially in the field of hepatocellular carcinoma research. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. DOGMA: domain-based transcriptome and proteome quality assessment.

    PubMed

    Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten

    2016-09-01

    Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. Synovial fluid proteomics in the pursuit of arthritis mediators: An evolving field of novel biomarker discovery.

    PubMed

    Mahendran, Shalini M; Oikonomopoulou, Katerina; Diamandis, Eleftherios P; Chandran, Vinod

    Synovial fluid (SF) is a protein-rich fluid produced into the joint cavity by cells of the synovial membrane. Due to its direct contact with articular cartilage, surfaces of the bone, and the synoviocytes of the inner membrane, it provides a promising reflection of the biochemical state of the joint under varying physiological and pathophysiological conditions. This property of SF has been exploited within numerous studies in search of unique biomarkers of joint pathologies with the ultimate goal of developing minimally invasive clinical assays to detect and/or monitor disease states. Several proteomic methodologies have been employed to mine the SF proteome. From elementary immunoassays to high-throughput analyses using mass spectrometry-based techniques, each has demonstrated distinct advantages and disadvantages in the identification and quantification of SF proteins. This review will explore the role of SF in the elucidation of the arthritis proteome and the extent to which high-throughput techniques have facilitated the discovery and validation of protein biomarkers from osteoarthritis (OA), rheumatoid arthritis (RA), psoriatic arthritis (PsA), and juvenile idiopathic arthritis (JIA) patients.

  6. Microbial genomics, transcriptomics and proteomics: new discoveries in decomposition research using complementary methods.

    PubMed

    Baldrian, Petr; López-Mondéjar, Rubén

    2014-02-01

    Molecular methods for the analysis of biomolecules have undergone rapid technological development in the last decade. The advent of next-generation sequencing methods and improvements in instrumental resolution enabled the analysis of complex transcriptome, proteome and metabolome data, as well as a detailed annotation of microbial genomes. The mechanisms of decomposition by model fungi have been described in unprecedented detail by the combination of genome sequencing, transcriptomics and proteomics. The increasing number of available genomes for fungi and bacteria shows that the genetic potential for decomposition of organic matter is widespread among taxonomically diverse microbial taxa, while expression studies document the importance of the regulation of expression in decomposition efficiency. Importantly, high-throughput methods of nucleic acid analysis used for the analysis of metagenomes and metatranscriptomes indicate the high diversity of decomposer communities in natural habitats and their taxonomic composition. Today, the metaproteomics of natural habitats is of interest. In combination with advanced analytical techniques to explore the products of decomposition and the accumulation of information on the genomes of environmentally relevant microorganisms, advanced methods in microbial ecophysiology should increase our understanding of the complex processes of organic matter transformation.

  7. Remodeling Cildb, a popular database for cilia and links for ciliopathies

    PubMed Central

    2014-01-01

    Background New generation technologies in cell and molecular biology generate large amounts of data hard to exploit for individual proteins. This is particularly true for ciliary and centrosomal research. Cildb is a multi–species knowledgebase gathering high throughput studies, which allows advanced searches to identify proteins involved in centrosome, basal body or cilia biogenesis, composition and function. Combined to localization of genetic diseases on human chromosomes given by OMIM links, candidate ciliopathy proteins can be compiled through Cildb searches. Methods Othology between recent versions of the whole proteomes was computed using Inparanoid and ciliary high throughput studies were remapped on these recent versions. Results Due to constant evolution of the ciliary and centrosomal field, Cildb has been recently upgraded twice, with new species whole proteomes and new ciliary studies, and the latter version displays a novel BioMart interface, much more intuitive than the previous ones. Conclusions This already popular database is designed now for easier use and is up to date in regard to high throughput ciliary studies. PMID:25422781

  8. Free Flow Zonal Electrophoresis for Fractionation of Plant Membrane Compartments Prior to Proteomic Analysis.

    PubMed

    Barkla, Bronwyn J

    2018-01-01

    Free flow zonal electrophoresis (FFZE) is a versatile, reproducible, and potentially high-throughput technique for the separation of plant organelles and membranes by differences in membrane surface charge. It offers considerable benefits over traditional fractionation techniques, such as density gradient centrifugation and two-phase partitioning, as it is relatively fast, sample recovery is high, and the method provides unparalleled sample purity. It has been used to successfully purify chloroplasts and mitochondria from plants but also, to obtain highly pure fractions of plasma membrane, tonoplast, ER, Golgi, and thylakoid membranes. Application of the technique can significantly improve protein coverage in large-scale proteomics studies by decreasing sample complexity. Here, we describe the method for the fractionation of plant cellular membranes from leaves by FFZE.

  9. The amino acid's backup bone - storage solutions for proteomics facilities.

    PubMed

    Meckel, Hagen; Stephan, Christian; Bunse, Christian; Krafzik, Michael; Reher, Christopher; Kohl, Michael; Meyer, Helmut Erich; Eisenacher, Martin

    2014-01-01

    Proteomics methods, especially high-throughput mass spectrometry analysis have been continually developed and improved over the years. The analysis of complex biological samples produces large volumes of raw data. Data storage and recovery management pose substantial challenges to biomedical or proteomic facilities regarding backup and archiving concepts as well as hardware requirements. In this article we describe differences between the terms backup and archive with regard to manual and automatic approaches. We also introduce different storage concepts and technologies from transportable media to professional solutions such as redundant array of independent disks (RAID) systems, network attached storages (NAS) and storage area network (SAN). Moreover, we present a software solution, which we developed for the purpose of long-term preservation of large mass spectrometry raw data files on an object storage device (OSD) archiving system. Finally, advantages, disadvantages, and experiences from routine operations of the presented concepts and technologies are evaluated and discussed. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013. Published by Elsevier B.V.

  10. Metabolomics and proteomics technologies to explore the herbal preparation affecting metabolic disorders using high resolution mass spectrometry.

    PubMed

    Zhang, Aihua; Zhou, Xiaohang; Zhao, Hongwei; Zou, Shiyu; Ma, Chung Wah; Liu, Qi; Sun, Hui; Liu, Liang; Wang, Xijun

    2017-01-31

    An integrative metabolomics and proteomics approach can provide novel insights in the understanding of biological systems. We have integrated proteome and metabolome data sets for a holistic view of the molecular mechanisms in disease. Using quantitative iTRAQ-LC-MS/MS proteomics coupled with UPLC-Q-TOF-HDMS based metabolomics, we determined the protein and metabolite expression changes in the kidney-yang deficiency syndrome (KYDS) rat model and further investigated the intervention effects of the Jinkui Shenqi Pill (JSP). The VIP-plot of the orthogonal PLS-DA (OPLS-DA) was used for discovering the potential biomarkers to clarify the therapeutic mechanisms of JSP in treating KYDS. The results showed that JSP can alleviate the kidney impairment induced by KYDS. Sixty potential biomarkers, including 5-l-glutamyl-taurine, phenylacetaldehyde, 4,6-dihydroxyquinoline, and xanthurenic acid etc., were definitely up- or down-regulated. The regulatory effect of JSP on the disturbed metabolic pathways was proved by the established metabonomic method. Using pathway analyses, we identified the disturbed metabolic pathways such as taurine and hypotaurine metabolism, pyrimidine metabolism, tyrosine metabolism, tryptophan metabolism, histidine metabolism, steroid hormone biosynthesis, etc. Furthermore, using iTRAQ-based quantitative proteomics analysis, seventeen differential proteins were identified and significantly altered by the JSP treatment. These proteins appear to be involved in Wnt, chemokine, PPAR, and MAPK signaling pathways, etc. Functional pathway analysis revealed that most of the proteins were found to play a key role in the regulation of metabolism pathways. Bioinformatics analysis with the IPA software found that these differentially-expressed moleculars had a strong correlation with the α-adrenergic signaling, FGF signaling, etc. Our data indicate that high-throughput metabolomics and proteomics can provide an insight on the herbal preparations affecting the metabolic disorders using high resolution mass spectrometry.

  11. Applicability of a high-throughput shotgun plasma protein screening approach in understanding maternal biological pathways relevant to infant birth weight outcome.

    PubMed

    Kumarathasan, P; Vincent, R; Das, D; Mohottalage, S; Blais, E; Blank, K; Karthikeyan, S; Vuong, N Q; Arbuckle, T E; Fraser, W D

    2014-04-04

    There are reports linking maternal nutritional status, smoking and environmental chemical exposures to adverse pregnancy outcomes. However, biological bases for association between some of these factors and birth outcomes are yet to be established. The objective of this preliminary work is to test the capability of a new high-throughput shotgun plasma proteomic screening in identifying maternal changes relevant to pregnancy outcome. A subset of third trimester plasma samples (N=12) associated with normal and low-birth weight infants were fractionated, tryptic-digested and analyzed for global proteomic changes using a MALDI-TOF-TOF-MS methodology. Mass spectral data were mined for candidate biomarkers using bioinformatic and statistical tools. Maternal plasma profiles of cytokines (e.g. IL8, TNF-α), chemokines (e.g. MCP-1) and cardiovascular endpoints (e.g. ET-1, MMP-9) were analyzed by a targeted approach using multiplex protein array and HPLC-Fluorescence methods. Target and global plasma proteomic markers were used to identify protein interaction networks and maternal biological pathways relevant to low infant birth weight. Our results exhibited the potential to discriminate specific maternal physiologies relevant to risk of adverse birth outcomes. This proteomic approach can be valuable in understanding the impacts of maternal factors such as environmental contaminant exposures and nutrition on birth outcomes in future work. We demonstrate here the fitness of mass spectrometry-based shot-gun proteomics for surveillance of biological changes in mothers, and for adverse pathway analysis in combination with target biomarker information. This approach has potential for enabling early detection of mothers at risk for low infant birth weight and preterm birth, and thus early intervention for mitigation and prevention of adverse pregnancy outcomes. This article is part of a Special Issue entitled: Can Proteomics Fill the Gap Between Genomics and Phenotypes? Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

  12. VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data.

    PubMed

    Peterson, Elena S; McCue, Lee Ann; Schrimpe-Rutledge, Alexandra C; Jensen, Jeffrey L; Walker, Hyunjoo; Kobold, Markus A; Webb, Samantha R; Payne, Samuel H; Ansong, Charles; Adkins, Joshua N; Cannon, William R; Webb-Robertson, Bobbie-Jo M

    2012-04-05

    The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php.

  13. VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data

    PubMed Central

    2012-01-01

    Background The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. Results VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. Conclusions VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php. PMID:22480257

  14. Multicapillary SDS-gel electrophoresis for the analysis of fluorescently labeled mAb preparations: a high throughput quality control process for the production of QuantiPlasma and PlasmaScan mAb libraries.

    PubMed

    Székely, Andrea; Szekrényes, Akos; Kerékgyártó, Márta; Balogh, Attila; Kádas, János; Lázár, József; Guttman, András; Kurucz, István; Takács, László

    2014-08-01

    Molecular heterogeneity of mAb preparations is the result of various co- and post-translational modifications and to contaminants related to the production process. Changes in molecular composition results in alterations of functional performance, therefore quality control and validation of therapeutic or diagnostic protein products is essential. A special case is the consistent production of mAb libraries (QuantiPlasma™ and PlasmaScan™) for proteome profiling, quality control of which represents a challenge because of high number of mAbs (>1000). Here, we devise a generally applicable multicapillary SDS-gel electrophoresis process for the analysis of fluorescently labeled mAb preparations for the high throughput quality control of mAbs of the QuantiPlasma™ and PlasmaScan™ libraries. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. High-coverage quantitative proteomics using amine-specific isotopic labeling.

    PubMed

    Melanson, Jeremy E; Avery, Steven L; Pinto, Devanand M

    2006-08-01

    Peptide dimethylation with isotopically coded formaldehydes was evaluated as a potential alternative to techniques such as the iTRAQ method for comparative proteomics. The isotopic labeling strategy and custom-designed protein quantitation software were tested using protein standards and then applied to measure proteins levels associated with Alzheimer's disease (AD). The method provided high accuracy (10% error), precision (14% RSD) and coverage (70%) when applied to the analysis of a standard solution of BSA by LC-MS/MS. The technique was then applied to measure protein abundance levels in brain tissue afflicted with AD relative to normal brain tissue. 2-D LC-MS analysis identified 548 unique proteins (p<0.05). Of these, 349 were quantified with two or more peptides that met the statistical criteria used in this study. Several classes of proteins exhibited significant changes in abundance. For example, elevated levels of antioxidant proteins and decreased levels of mitochondrial electron transport proteins were observed. The results demonstrate the utility of the labeling method for high-throughput quantitative analysis.

  16. Advances in targeted proteomics and applications to biomedical research

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shi, Tujin; Song, Ehwang; Nie, Song

    Targeted proteomics technique has emerged as a powerful protein quantification tool in systems biology, biomedical research, and increasing for clinical applications. The most widely used targeted proteomics approach, selected reaction monitoring (SRM), also known as multiple reaction monitoring (MRM), can be used for quantification of cellular signaling networks and preclinical verification of candidate protein biomarkers. As an extension to our previous review on advances in SRM sensitivity (Shi et al., Proteomics, 12, 1074–1092, 2012) herein we review recent advances in the method and technology for further enhancing SRM sensitivity (from 2012 to present), and highlighting its broad biomedical applications inmore » human bodily fluids, tissue and cell lines. Furthermore, we also review two recently introduced targeted proteomics approaches, parallel reaction monitoring (PRM) and data-independent acquisition (DIA) with targeted data extraction on fast scanning high-resolution accurate-mass (HR/AM) instruments. Such HR/AM targeted quantification with monitoring all target product ions addresses SRM limitations effectively in specificity and multiplexing; whereas when compared to SRM, PRM and DIA are still in the infancy with a limited number of applications. Thus, for HR/AM targeted quantification we focus our discussion on method development, data processing and analysis, and its advantages and limitations in targeted proteomics. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale quantification of hundreds of target proteins are discussed.« less

  17. GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies

    PubMed Central

    Zhang, Bing; Schmoyer, Denise; Kirov, Stefan; Snoddy, Jay

    2004-01-01

    Background Microarray and other high-throughput technologies are producing large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in the gene sets. Results We have created a web-based tool for data analysis and data visualization for sets of genes called GOTree Machine (GOTM). This tool was originally intended to analyze sets of co-regulated genes identified from microarray analysis but is adaptable for use with other gene sets from other high-throughput analyses. GOTree Machine generates a GOTree, a tree-like structure to navigate the Gene Ontology Directed Acyclic Graph for input gene sets. This system provides user friendly data navigation and visualization. Statistical analysis helps users to identify the most important Gene Ontology categories for the input gene sets and suggests biological areas that warrant further study. GOTree Machine is available online at . Conclusion GOTree Machine has a broad application in functional genomic, proteomic and other high-throughput methods that generate large sets of interesting genes; its primary purpose is to help users sort for interesting patterns in gene sets. PMID:14975175

  18. Serum proteome profiling in canine idiopathic dilated cardiomyopathy using TMT-based quantitative proteomics approach.

    PubMed

    Bilić, Petra; Guillemin, Nicolas; Kovačević, Alan; Beer Ljubić, Blanka; Jović, Ines; Galan, Asier; Eckersall, Peter David; Burchmore, Richard; Mrljak, Vladimir

    2018-05-15

    Idiopathic dilated cardiomyopathy (iDCM) is a primary myocardial disorder with an unknown aetiology, characterized by reduced contractility and ventricular dilation of the left or both ventricles. Naturally occurring canine iDCM was used herein to identify serum proteomic signature of the disease compared to the healthy state, providing an insight into underlying mechanisms and revealing proteins with biomarker potential. To achieve this, we used high-throughput label-based quantitative LC-MS/MS proteomics approach and bioinformatics analysis of the in silico inferred interactome protein network created from the initial list of differential proteins. To complement the proteomic analysis, serum biochemical parameters and levels of know biomarkers of cardiac function were measured. Several proteins with biomarker potential were identified, such as inter-alpha-trypsin inhibitor heavy chain H4, microfibril-associated glycoprotein 4 and apolipoprotein A-IV, which were validated using an independent method (Western blotting) and showed high specificity and sensitivity according to the receiver operating characteristic curve analysis. Bioinformatics analysis revealed involvement of different pathways in iDCM, such as complement cascade activation, lipoprotein particles dynamics, elastic fibre formation, GPCR signalling and respiratory electron transport chain. Idiopathic dilated cardiomyopathy is a severe primary myocardial disease of unknown cause, affecting both humans and dogs. This study is a contribution to the canine heart disease research by means of proteomic and bioinformatic state of the art analyses, following similar approach in human iDCM research. Importantly, we used serum as non-invasive and easily accessible biological source of information and contributed to the scarce data on biofluid proteome research on this topic. Bioinformatics analysis revealed biological pathways modulated in canine iDCM with potential of further targeted research. Also, several proteins with biomarker potential have been identified and successfully validated. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. Comprehensive Analysis of Cancer-Proteogenome to Identify Biomarkers for the Early Diagnosis and Prognosis of Cancer.

    PubMed

    Shukla, Hem D

    2017-10-25

    During the past century, our understanding of cancer diagnosis and treatment has been based on a monogenic approach, and as a consequence our knowledge of the clinical genetic underpinnings of cancer is incomplete. Since the completion of the human genome in 2003, it has steered us into therapeutic target discovery, enabling us to mine the genome using cutting edge proteogenomics tools. A number of novel and promising cancer targets have emerged from the genome project for diagnostics, therapeutics, and prognostic markers, which are being used to monitor response to cancer treatment. The heterogeneous nature of cancer has hindered progress in understanding the underlying mechanisms that lead to abnormal cellular growth. Since, the start of The Cancer Genome Atlas (TCGA), and the International Genome consortium projects, there has been tremendous progress in genome sequencing and immense numbers of cancer genomes have been completed, and this approach has transformed our understanding of the diagnosis and treatment of different types of cancers. By employing Genomics and proteomics technologies, an immense amount of genomic data is being generated on clinical tumors, which has transformed the cancer landscape and has the potential to transform cancer diagnosis and prognosis. A complete molecular view of the cancer landscape is necessary for understanding the underlying mechanisms of cancer initiation to improve diagnosis and prognosis, which ultimately will lead to personalized treatment. Interestingly, cancer proteome analysis has also allowed us to identify biomarkers to monitor drug and radiation resistance in patients undergoing cancer treatment. Further, TCGA-funded studies have allowed for the genomic and transcriptomic characterization of targeted cancers, this analysis aiding the development of targeted therapies for highly lethal malignancy. High-throughput technologies, such as complete proteome, epigenome, protein-protein interaction, and pharmacogenomics data, are indispensable to glean into the cancer genome and proteome and these approaches have generated multidimensional universal studies of genes and proteins (OMICS) data which has the potential to facilitate precision medicine. However, due to slow progress in computational technologies, the translation of big omics data into their clinical aspects have been slow. In this review, attempts have been made to describe the role of high-throughput genomic and proteomic technologies in identifying a panel of biomarkers which could be used for the early diagnosis and prognosis of cancer.

  20. WholePathwayScope: a comprehensive pathway-based analysis tool for high-throughput data

    PubMed Central

    Yi, Ming; Horton, Jay D; Cohen, Jonathan C; Hobbs, Helen H; Stephens, Robert M

    2006-01-01

    Background Analysis of High Throughput (HTP) Data such as microarray and proteomics data has provided a powerful methodology to study patterns of gene regulation at genome scale. A major unresolved problem in the post-genomic era is to assemble the large amounts of data generated into a meaningful biological context. We have developed a comprehensive software tool, WholePathwayScope (WPS), for deriving biological insights from analysis of HTP data. Result WPS extracts gene lists with shared biological themes through color cue templates. WPS statistically evaluates global functional category enrichment of gene lists and pathway-level pattern enrichment of data. WPS incorporates well-known biological pathways from KEGG (Kyoto Encyclopedia of Genes and Genomes) and Biocarta, GO (Gene Ontology) terms as well as user-defined pathways or relevant gene clusters or groups, and explores gene-term relationships within the derived gene-term association networks (GTANs). WPS simultaneously compares multiple datasets within biological contexts either as pathways or as association networks. WPS also integrates Genetic Association Database and Partial MedGene Database for disease-association information. We have used this program to analyze and compare microarray and proteomics datasets derived from a variety of biological systems. Application examples demonstrated the capacity of WPS to significantly facilitate the analysis of HTP data for integrative discovery. Conclusion This tool represents a pathway-based platform for discovery integration to maximize analysis power. The tool is freely available at . PMID:16423281

  1. ms_lims, a simple yet powerful open source laboratory information management system for MS-driven proteomics.

    PubMed

    Helsens, Kenny; Colaert, Niklaas; Barsnes, Harald; Muth, Thilo; Flikka, Kristian; Staes, An; Timmerman, Evy; Wortelkamp, Steffi; Sickmann, Albert; Vandekerckhove, Joël; Gevaert, Kris; Martens, Lennart

    2010-03-01

    MS-based proteomics produces large amounts of mass spectra that require processing, identification and possibly quantification before interpretation can be undertaken. High-throughput studies require automation of these various steps, and management of the data in association with the results obtained. We here present ms_lims (http://genesis.UGent.be/ms_lims), a freely available, open-source system based on a central database to automate data management and processing in MS-driven proteomics analyses.

  2. Determination of burn patient outcome by large-scale quantitative discovery proteomics

    PubMed Central

    Finnerty, Celeste C.; Jeschke, Marc G.; Qian, Wei-Jun; Kaushal, Amit; Xiao, Wenzhong; Liu, Tao; Gritsenko, Marina A.; Moore, Ronald J.; Camp, David G.; Moldawer, Lyle L.; Elson, Constance; Schoenfeld, David; Gamelli, Richard; Gibran, Nicole; Klein, Matthew; Arnoldo, Brett; Remick, Daniel; Smith, Richard D.; Davis, Ronald; Tompkins, Ronald G.; Herndon, David N.

    2013-01-01

    Objective Emerging proteomics techniques can be used to establish proteomic outcome signatures and to identify candidate biomarkers for survival following traumatic injury. We applied high-resolution liquid chromatography-mass spectrometry (LC-MS) and multiplex cytokine analysis to profile the plasma proteome of survivors and non-survivors of massive burn injury to determine the proteomic survival signature following a major burn injury. Design Proteomic discovery study. Setting Five burn hospitals across the U.S. Patients Thirty-two burn patients (16 non-survivors and 16 survivors), 19–89 years of age, were admitted within 96 h of injury to the participating hospitals with burns covering >20% of the total body surface area and required at least one surgical intervention. Interventions None. Measurements and Main Results We found differences in circulating levels of 43 proteins involved in the acute phase response, hepatic signaling, the complement cascade, inflammation, and insulin resistance. Thirty-two of the proteins identified were not previously known to play a role in the response to burn. IL-4, IL-8, GM-CSF, MCP-1, and β2-microglobulin correlated well with survival and may serve as clinical biomarkers. Conclusions These results demonstrate the utility of these techniques for establishing proteomic survival signatures and for use as a discovery tool to identify candidate biomarkers for survival. This is the first clinical application of a high-throughput, large-scale LC-MS-based quantitative plasma proteomic approach for biomarker discovery for the prediction of patient outcome following burn, trauma or critical illness. PMID:23507713

  3. Multiplex High-Throughput Targeted Proteomic Assay To Identify Induced Pluripotent Stem Cells.

    PubMed

    Baud, Anna; Wessely, Frank; Mazzacuva, Francesca; McCormick, James; Camuzeaux, Stephane; Heywood, Wendy E; Little, Daniel; Vowles, Jane; Tuefferd, Marianne; Mosaku, Olukunbi; Lako, Majlinda; Armstrong, Lyle; Webber, Caleb; Cader, M Zameel; Peeters, Pieter; Gissen, Paul; Cowley, Sally A; Mills, Kevin

    2017-02-21

    Induced pluripotent stem cells have great potential as a human model system in regenerative medicine, disease modeling, and drug screening. However, their use in medical research is hampered by laborious reprogramming procedures that yield low numbers of induced pluripotent stem cells. For further applications in research, only the best, competent clones should be used. The standard assays for pluripotency are based on genomic approaches, which take up to 1 week to perform and incur significant cost. Therefore, there is a need for a rapid and cost-effective assay able to distinguish between pluripotent and nonpluripotent cells. Here, we describe a novel multiplexed, high-throughput, and sensitive peptide-based multiple reaction monitoring mass spectrometry assay, allowing for the identification and absolute quantitation of multiple core transcription factors and pluripotency markers. This assay provides simpler and high-throughput classification into either pluripotent or nonpluripotent cells in 7 min analysis while being more cost-effective than conventional genomic tests.

  4. Image analysis tools and emerging algorithms for expression proteomics

    PubMed Central

    English, Jane A.; Lisacek, Frederique; Morris, Jeffrey S.; Yang, Guang-Zhong; Dunn, Michael J.

    2012-01-01

    Since their origins in academic endeavours in the 1970s, computational analysis tools have matured into a number of established commercial packages that underpin research in expression proteomics. In this paper we describe the image analysis pipeline for the established 2-D Gel Electrophoresis (2-DE) technique of protein separation, and by first covering signal analysis for Mass Spectrometry (MS), we also explain the current image analysis workflow for the emerging high-throughput ‘shotgun’ proteomics platform of Liquid Chromatography coupled to MS (LC/MS). The bioinformatics challenges for both methods are illustrated and compared, whilst existing commercial and academic packages and their workflows are described from both a user’s and a technical perspective. Attention is given to the importance of sound statistical treatment of the resultant quantifications in the search for differential expression. Despite wide availability of proteomics software, a number of challenges have yet to be overcome regarding algorithm accuracy, objectivity and automation, generally due to deterministic spot-centric approaches that discard information early in the pipeline, propagating errors. We review recent advances in signal and image analysis algorithms in 2-DE, MS, LC/MS and Imaging MS. Particular attention is given to wavelet techniques, automated image-based alignment and differential analysis in 2-DE, Bayesian peak mixture models and functional mixed modelling in MS, and group-wise consensus alignment methods for LC/MS. PMID:21046614

  5. Ion channel drug discovery and research: the automated Nano-Patch-Clamp technology.

    PubMed

    Brueggemann, A; George, M; Klau, M; Beckler, M; Steindl, J; Behrends, J C; Fertig, N

    2004-01-01

    Unlike the genomics revolution, which was largely enabled by a single technological advance (high throughput sequencing), rapid advancement in proteomics will require a broader effort to increase the throughput of a number of key tools for functional analysis of different types of proteins. In the case of ion channels -a class of (membrane) proteins of great physiological importance and potential as drug targets- the lack of adequate assay technologies is felt particularly strongly. The available, indirect, high throughput screening methods for ion channels clearly generate insufficient information. The best technology to study ion channel function and screen for compound interaction is the patch clamp technique, but patch clamping suffers from low throughput, which is not acceptable for drug screening. A first step towards a solution is presented here. The nano patch clamp technology, which is based on a planar, microstructured glass chip, enables automatic whole cell patch clamp measurements. The Port-a-Patch is an automated electrophysiology workstation, which uses planar patch clamp chips. This approach enables high quality and high content ion channel and compound evaluation on a one-cell-at-a-time basis. The presented automation of the patch process and its scalability to an array format are the prerequisites for any higher throughput electrophysiology instruments.

  6. Identifying the missing proteins in human proteome by biological language model.

    PubMed

    Dong, Qiwen; Wang, Kai; Liu, Xuan

    2016-12-23

    With the rapid development of high-throughput sequencing technology, the proteomics research becomes a trendy field in the post genomics era. It is necessary to identify all the native-encoding protein sequences for further function and pathway analysis. Toward that end, the Human Proteome Organization lunched the Human Protein Project in 2011. However many proteins are hard to be detected by experiment methods, which becomes one of the bottleneck in Human Proteome Project. In consideration of the complicatedness of detecting these missing proteins by using wet-experiment approach, here we use bioinformatics method to pre-filter the missing proteins. Since there are analogy between the biological sequences and natural language, the n-gram models from Natural Language Processing field has been used to filter the missing proteins. The dataset used in this study contains 616 missing proteins from the "uncertain" category of the neXtProt database. There are 102 proteins deduced by the n-gram model, which have high probability to be native human proteins. We perform a detail analysis on the predicted structure and function of these missing proteins and also compare the high probability proteins with other mass spectrum datasets. The evaluation shows that the results reported here are in good agreement with those obtained by other well-established databases. The analysis shows that 102 proteins may be native gene-coding proteins and some of the missing proteins are membrane or natively disordered proteins which are hard to be detected by experiment methods.

  7. Urine Sample Preparation in 96-Well Filter Plates for Quantitative Clinical Proteomics

    PubMed Central

    2015-01-01

    Urine is an important, noninvasively collected body fluid source for the diagnosis and prognosis of human diseases. Liquid chromatography mass spectrometry (LC-MS) based shotgun proteomics has evolved as a sensitive and informative technique to discover candidate disease biomarkers from urine specimens. Filter-aided sample preparation (FASP) generates peptide samples from protein mixtures of cell lysate or body fluid origin. Here, we describe a FASP method adapted to 96-well filter plates, named 96FASP. Soluble urine concentrates containing ∼10 μg of total protein were processed by 96FASP and LC-MS resulting in 700–900 protein identifications at a 1% false discovery rate (FDR). The experimental repeatability, as assessed by label-free quantification and Pearson correlation analysis for shared proteins among replicates, was high (R ≥ 0.97). Application to urinary pellet lysates which is of particular interest in the context of urinary tract infection analysis was also demonstrated. On average, 1700 proteins (±398) were identified in five experiments. In a pilot study using 96FASP for analysis of eight soluble urine samples, we demonstrated that protein profiles of technical replicates invariably clustered; the protein profiles for distinct urine donors were very different from each other. Robust, highly parallel methods to generate peptide mixtures from urine and other body fluids are critical to increase cost-effectiveness in clinical proteomics projects. This 96FASP method has potential to become a gold standard for high-throughput quantitative clinical proteomics. PMID:24797144

  8. Proteomics and Systems Biology: Current and Future Applications in the Nutritional Sciences1

    PubMed Central

    Moore, J. Bernadette; Weeks, Mark E.

    2011-01-01

    In the last decade, advances in genomics, proteomics, and metabolomics have yielded large-scale datasets that have driven an interest in global analyses, with the objective of understanding biological systems as a whole. Systems biology integrates computational modeling and experimental biology to predict and characterize the dynamic properties of biological systems, which are viewed as complex signaling networks. Whereas the systems analysis of disease-perturbed networks holds promise for identification of drug targets for therapy, equally the identified critical network nodes may be targeted through nutritional intervention in either a preventative or therapeutic fashion. As such, in the context of the nutritional sciences, it is envisioned that systems analysis of normal and nutrient-perturbed signaling networks in combination with knowledge of underlying genetic polymorphisms will lead to a future in which the health of individuals will be improved through predictive and preventative nutrition. Although high-throughput transcriptomic microarray data were initially most readily available and amenable to systems analysis, recent technological and methodological advances in MS have contributed to a linear increase in proteomic investigations. It is now commonplace for combined proteomic technologies to generate complex, multi-faceted datasets, and these will be the keystone of future systems biology research. This review will define systems biology, outline current proteomic methodologies, highlight successful applications of proteomics in nutrition research, and discuss the challenges for future applications of systems biology approaches in the nutritional sciences. PMID:22332076

  9. Tissue matrix arrays for high throughput screening and systems analysis of cell function

    PubMed Central

    Beachley, Vince Z.; Wolf, Matthew T.; Sadtler, Kaitlyn; Manda, Srikanth S.; Jacobs, Heather; Blatchley, Michael; Bader, Joel S.; Pandey, Akhilesh; Pardoll, Drew; Elisseeff, Jennifer H.

    2015-01-01

    Cell and protein arrays have demonstrated remarkable utility in the high-throughput evaluation of biological responses; however, they lack the complexity of native tissue and organs. Here, we describe tissue extracellular matrix (ECM) arrays for screening biological outputs and systems analysis. We spotted processed tissue ECM particles as two-dimensional arrays or incorporated them with cells to generate three-dimensional cell-matrix microtissue arrays. We then investigated the response of human stem, cancer, and immune cells to tissue ECM arrays originating from 11 different tissues, and validated the 2D and 3D arrays as representative of the in vivo microenvironment through quantitative analysis of tissue-specific cellular responses, including matrix production, adhesion and proliferation, and morphological changes following culture. The biological outputs correlated with tissue proteomics, and network analysis identified several proteins linked to cell function. Our methodology enables broad screening of ECMs to connect tissue-specific composition with biological activity, providing a new resource for biomaterials research and translation. PMID:26480475

  10. Proteomic Workflows for Biomarker Identification Using Mass Spectrometry — Technical and Statistical Considerations during Initial Discovery

    PubMed Central

    Orton, Dennis J.; Doucette, Alan A.

    2013-01-01

    Identification of biomarkers capable of differentiating between pathophysiological states of an individual is a laudable goal in the field of proteomics. Protein biomarker discovery generally employs high throughput sample characterization by mass spectrometry (MS), being capable of identifying and quantifying thousands of proteins per sample. While MS-based technologies have rapidly matured, the identification of truly informative biomarkers remains elusive, with only a handful of clinically applicable tests stemming from proteomic workflows. This underlying lack of progress is attributed in large part to erroneous experimental design, biased sample handling, as well as improper statistical analysis of the resulting data. This review will discuss in detail the importance of experimental design and provide some insight into the overall workflow required for biomarker identification experiments. Proper balance between the degree of biological vs. technical replication is required for confident biomarker identification. PMID:28250400

  11. P2P proteomics -- data sharing for enhanced protein identification

    PubMed Central

    2012-01-01

    Background In order to tackle the important and challenging problem in proteomics of identifying known and new protein sequences using high-throughput methods, we propose a data-sharing platform that uses fully distributed P2P technologies to share specifications of peer-interaction protocols and service components. By using such a platform, information to be searched is no longer centralised in a few repositories but gathered from experiments in peer proteomics laboratories, which can subsequently be searched by fellow researchers. Methods The system distributively runs a data-sharing protocol specified in the Lightweight Communication Calculus underlying the system through which researchers interact via message passing. For this, researchers interact with the system through particular components that link to database querying systems based on BLAST and/or OMSSA and GUI-based visualisation environments. We have tested the proposed platform with data drawn from preexisting MS/MS data reservoirs from the 2006 ABRF (Association of Biomolecular Resource Facilities) test sample, which was extensively tested during the ABRF Proteomics Standards Research Group 2006 worldwide survey. In particular we have taken the data available from a subset of proteomics laboratories of Spain's National Institute for Proteomics, ProteoRed, a network for the coordination, integration and development of the Spanish proteomics facilities. Results and Discussion We performed queries against nine databases including seven ProteoRed proteomics laboratories, the NCBI Swiss-Prot database and the local database of the CSIC/UAB Proteomics Laboratory. A detailed analysis of the results indicated the presence of a protein that was supported by other NCBI matches and highly scored matches in several proteomics labs. The analysis clearly indicated that the protein was a relatively high concentrated contaminant that could be present in the ABRF sample. This fact is evident from the information that could be derived from the proposed P2P proteomics system, however it is not straightforward to arrive to the same conclusion by conventional means as it is difficult to discard organic contamination of samples. The actual presence of this contaminant was only stated after the ABRF study of all the identifications reported by the laboratories. PMID:22293032

  12. High-Throughput Quantitative Proteomic Analysis of Dengue Virus Type 2 Infected A549 Cells

    PubMed Central

    Chiu, Han-Chen; Hannemann, Holger; Heesom, Kate J.; Matthews, David A.; Davidson, Andrew D.

    2014-01-01

    Disease caused by dengue virus is a global health concern with up to 390 million individuals infected annually worldwide. There are no vaccines or antiviral compounds available to either prevent or treat dengue disease which may be fatal. To increase our understanding of the interaction of dengue virus with the host cell, we analyzed changes in the proteome of human A549 cells in response to dengue virus type 2 infection using stable isotope labelling in cell culture (SILAC) in combination with high-throughput mass spectrometry (MS). Mock and infected A549 cells were fractionated into nuclear and cytoplasmic extracts before analysis to identify proteins that redistribute between cellular compartments during infection and reduce the complexity of the analysis. We identified and quantified 3098 and 2115 proteins in the cytoplasmic and nuclear fractions respectively. Proteins that showed a significant alteration in amount during infection were examined using gene enrichment, pathway and network analysis tools. The analyses revealed that dengue virus infection modulated the amounts of proteins involved in the interferon and unfolded protein responses, lipid metabolism and the cell cycle. The SILAC-MS results were validated for a select number of proteins over a time course of infection by Western blotting and immunofluorescence microscopy. Our study demonstrates for the first time the power of SILAC-MS for identifying and quantifying novel changes in cellular protein amounts in response to dengue virus infection. PMID:24671231

  13. A Method for Label-Free, Differential Top-Down Proteomics.

    PubMed

    Ntai, Ioanna; Toby, Timothy K; LeDuc, Richard D; Kelleher, Neil L

    2016-01-01

    Biomarker discovery in the translational research has heavily relied on labeled and label-free quantitative bottom-up proteomics. Here, we describe a new approach to biomarker studies that utilizes high-throughput top-down proteomics and is the first to offer whole protein characterization and relative quantitation within the same experiment. Using yeast as a model, we report procedures for a label-free approach to quantify the relative abundance of intact proteins ranging from 0 to 30 kDa in two different states. In this chapter, we describe the integrated methodology for the large-scale profiling and quantitation of the intact proteome by liquid chromatography-mass spectrometry (LC-MS) without the need for metabolic or chemical labeling. This recent advance for quantitative top-down proteomics is best implemented with a robust and highly controlled sample preparation workflow before data acquisition on a high-resolution mass spectrometer, and the application of a hierarchical linear statistical model to account for the multiple levels of variance contained in quantitative proteomic comparisons of samples for basic and clinical research.

  14. Proteomics Analysis of the Nucleolus in Adenovirus-infected Cells

    PubMed Central

    Lam, Yun W.; Evans, Vanessa C.; Heesom, Kate J.; Lamond, Angus I.; Matthews, David A.

    2010-01-01

    Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined. PMID:19812395

  15. Proteomics analysis of the nucleolus in adenovirus-infected cells.

    PubMed

    Lam, Yun W; Evans, Vanessa C; Heesom, Kate J; Lamond, Angus I; Matthews, David A

    2010-01-01

    Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined.

  16. Broadband ion mobility deconvolution for rapid analysis of complex mixtures.

    PubMed

    Pettit, Michael E; Brantley, Matthew R; Donnarumma, Fabrizio; Murray, Kermit K; Solouki, Touradj

    2018-05-04

    High resolving power ion mobility (IM) allows for accurate characterization of complex mixtures in high-throughput IM mass spectrometry (IM-MS) experiments. We previously demonstrated that pure component IM-MS data can be extracted from IM unresolved post-IM/collision-induced dissociation (CID) MS data using automated ion mobility deconvolution (AIMD) software [Matthew Brantley, Behrooz Zekavat, Brett Harper, Rachel Mason, and Touradj Solouki, J. Am. Soc. Mass Spectrom., 2014, 25, 1810-1819]. In our previous reports, we utilized a quadrupole ion filter for m/z-isolation of IM unresolved monoisotopic species prior to post-IM/CID MS. Here, we utilize a broadband IM-MS deconvolution strategy to remove the m/z-isolation requirement for successful deconvolution of IM unresolved peaks. Broadband data collection has throughput and multiplexing advantages; hence, elimination of the ion isolation step reduces experimental run times and thus expands the applicability of AIMD to high-throughput bottom-up proteomics. We demonstrate broadband IM-MS deconvolution of two separate and unrelated pairs of IM unresolved isomers (viz., a pair of isomeric hexapeptides and a pair of isomeric trisaccharides) in a simulated complex mixture. Moreover, we show that broadband IM-MS deconvolution improves high-throughput bottom-up characterization of a proteolytic digest of rat brain tissue. To our knowledge, this manuscript is the first to report successful deconvolution of pure component IM and MS data from an IM-assisted data-independent analysis (DIA) or HDMSE dataset.

  17. MAPPI-DAT: data management and analysis for protein-protein interaction data from the high-throughput MAPPIT cell microarray platform.

    PubMed

    Gupta, Surya; De Puysseleyr, Veronic; Van der Heyden, José; Maddelein, Davy; Lemmens, Irma; Lievens, Sam; Degroeve, Sven; Tavernier, Jan; Martens, Lennart

    2017-05-01

    Protein-protein interaction (PPI) studies have dramatically expanded our knowledge about cellular behaviour and development in different conditions. A multitude of high-throughput PPI techniques have been developed to achieve proteome-scale coverage for PPI studies, including the microarray based Mammalian Protein-Protein Interaction Trap (MAPPIT) system. Because such high-throughput techniques typically report thousands of interactions, managing and analysing the large amounts of acquired data is a challenge. We have therefore built the MAPPIT cell microArray Protein Protein Interaction-Data management & Analysis Tool (MAPPI-DAT) as an automated data management and analysis tool for MAPPIT cell microarray experiments. MAPPI-DAT stores the experimental data and metadata in a systematic and structured way, automates data analysis and interpretation, and enables the meta-analysis of MAPPIT cell microarray data across all stored experiments. MAPPI-DAT is developed in Python, using R for data analysis and MySQL as data management system. MAPPI-DAT is cross-platform and can be ran on Microsoft Windows, Linux and OS X/macOS. The source code and a Microsoft Windows executable are freely available under the permissive Apache2 open source license at https://github.com/compomics/MAPPI-DAT. jan.tavernier@vib-ugent.be or lennart.martens@vib-ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  18. Advances in targeted proteomics and applications to biomedical research

    PubMed Central

    Shi, Tujin; Song, Ehwang; Nie, Song; Rodland, Karin D.; Liu, Tao; Qian, Wei-Jun; Smith, Richard D.

    2016-01-01

    Targeted proteomics technique has emerged as a powerful protein quantification tool in systems biology, biomedical research, and increasing for clinical applications. The most widely used targeted proteomics approach, selected reaction monitoring (SRM), also known as multiple reaction monitoring (MRM), can be used for quantification of cellular signaling networks and preclinical verification of candidate protein biomarkers. As an extension to our previous review on advances in SRM sensitivity herein we review recent advances in the method and technology for further enhancing SRM sensitivity (from 2012 to present), and highlighting its broad biomedical applications in human bodily fluids, tissue and cell lines. Furthermore, we also review two recently introduced targeted proteomics approaches, parallel reaction monitoring (PRM) and data-independent acquisition (DIA) with targeted data extraction on fast scanning high-resolution accurate-mass (HR/AM) instruments. Such HR/AM targeted quantification with monitoring all target product ions addresses SRM limitations effectively in specificity and multiplexing; whereas when compared to SRM, PRM and DIA are still in the infancy with a limited number of applications. Thus, for HR/AM targeted quantification we focus our discussion on method development, data processing and analysis, and its advantages and limitations in targeted proteomics. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale quantification of hundreds of target proteins are discussed. PMID:27302376

  19. Proteomic Cinderella: Customized analysis of bulky MS/MS data in one night.

    PubMed

    Kiseleva, Olga; Poverennaya, Ekaterina; Shargunov, Alexander; Lisitsa, Andrey

    2018-02-01

    Proteomic challenges, stirred up by the advent of high-throughput technologies, produce large amount of MS data. Nowadays, the routine manual search does not satisfy the "speed" of modern science any longer. In our work, the necessity of single-thread analysis of bulky data emerged during interpretation of HepG2 proteome profiling results for proteoforms searching. We compared the contribution of each of the eight search engines (X!Tandem, MS-GF[Formula: see text], MS Amanda, MyriMatch, Comet, Tide, Andromeda, and OMSSA) integrated in an open-source graphical user interface SearchGUI ( http://searchgui.googlecode.com ) into total result of proteoforms identification and optimized set of engines working simultaneously. We also compared the results of our search combination with Mascot results using protein kit UPS2, containing 48 human proteins. We selected combination of X!Tandem, MS-GF[Formula: see text] and OMMSA as the most time-efficient and productive combination of search. We added homemade java-script to automatize pipeline from file picking to report generation. These settings resulted in rise of the efficiency of our customized pipeline unobtainable by manual scouting: the analysis of 192 files searched against human proteome (42153 entries) downloaded from UniProt took 11[Formula: see text]h.

  20. Application of Large-Scale Aptamer-Based Proteomic Profiling to Planned Myocardial Infarctions.

    PubMed

    Jacob, Jaison; Ngo, Debby; Finkel, Nancy; Pitts, Rebecca; Gleim, Scott; Benson, Mark D; Keyes, Michelle J; Farrell, Laurie A; Morgan, Thomas; Jennings, Lori L; Gerszten, Robert E

    2018-03-20

    Emerging proteomic technologies using novel affinity-based reagents allow for efficient multiplexing with high-sample throughput. To identify early biomarkers of myocardial injury, we recently applied an aptamer-based proteomic profiling platform that measures 1129 proteins to samples from patients undergoing septal alcohol ablation for hypertrophic cardiomyopathy, a human model of planned myocardial injury. Here, we examined the scalability of this approach using a markedly expanded platform to study a far broader range of human proteins in the context of myocardial injury. We applied a highly multiplexed, expanded proteomic technique that uses single-stranded DNA aptamers to assay 4783 human proteins (4137 distinct human gene targets) to derivation and validation cohorts of planned myocardial injury, individuals with spontaneous myocardial infarction, and at-risk controls. We found 376 target proteins that significantly changed in the blood after planned myocardial injury in a derivation cohort (n=20; P <1.05E-05, 1-way repeated measures analysis of variance, Bonferroni threshold). Two hundred forty-seven of these proteins were validated in an independent planned myocardial injury cohort (n=15; P <1.33E-04, 1-way repeated measures analysis of variance); >90% were directionally consistent and reached nominal significance in the validation cohort. Among the validated proteins that were increased within 1 hour after planned myocardial injury, 29 were also elevated in patients with spontaneous myocardial infarction (n=63; P <6.17E-04). Many of the novel markers identified in our study are intracellular proteins not previously identified in the peripheral circulation or have functional roles relevant to myocardial injury. For example, the cardiac LIM protein, cysteine- and glycine-rich protein 3, is thought to mediate cardiac mechanotransduction and stress responses, whereas the mitochondrial ATP synthase F 0 subunit component is a vasoactive peptide on its release from cells. Last, we performed aptamer-affinity enrichment coupled with mass spectrometry to technically verify aptamer specificity for a subset of the new biomarkers. Our results demonstrate the feasibility of large-scale aptamer multiplexing at a level that has not previously been reported and with sample throughput that greatly exceeds other existing proteomic methods. The expanded aptamer-based proteomic platform provides a unique opportunity for biomarker and pathway discovery after myocardial injury. © 2017 American Heart Association, Inc.

  1. LOCATE: a mouse protein subcellular localization database

    PubMed Central

    Fink, J. Lynn; Aturaliya, Rajith N.; Davis, Melissa J.; Zhang, Fasheng; Hanson, Kelly; Teasdale, Melvena S.; Kai, Chikatoshi; Kawai, Jun; Carninci, Piero; Hayashizaki, Yoshihide; Teasdale, Rohan D.

    2006-01-01

    We present here LOCATE, a curated, web-accessible database that houses data describing the membrane organization and subcellular localization of proteins from the FANTOM3 Isoform Protein Sequence set. Membrane organization is predicted by the high-throughput, computational pipeline MemO. The subcellular locations of selected proteins from this set were determined by a high-throughput, immunofluorescence-based assay and by manually reviewing >1700 peer-reviewed publications. LOCATE represents the first effort to catalogue the experimentally verified subcellular location and membrane organization of mammalian proteins using a high-throughput approach and provides localization data for ∼40% of the mouse proteome. It is available at . PMID:16381849

  2. Trends in mass spectrometry instrumentation for proteomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smith, Richard D.

    2002-12-01

    Mass spectrometry has become a primary tool for proteomics due to its capabilities for rapid and sensitive protein identification and quantitation. It is now possible to identify thousands of proteins from microgram sample quantities in a single day and to quantify relative protein abundances. However, the needs for increased capabilities for proteome measurements are immense and are now driving both new strategies and instrument advances. These developments include those based on integration with multi-dimensional liquid separations and high accuracy mass measurements, and promise more than order of magnitude improvements in sensitivity, dynamic range, and throughput for proteomic analyses in themore » near future.« less

  3. Mass spectrometry-based proteomics for translational research: a technical overview.

    PubMed

    Paulo, Joao A; Kadiyala, Vivek; Banks, Peter A; Steen, Hanno; Conwell, Darwin L

    2012-03-01

    Mass spectrometry-based investigation of clinical samples enables the high-throughput identification of protein biomarkers. We provide an overview of mass spectrometry-based proteomic techniques that are applicable to the investigation of clinical samples. We address sample collection, protein extraction and fractionation, mass spectrometry modalities, and quantitative proteomics. Finally, we examine the limitations and further potential of such technologies. Liquid chromatography fractionation coupled with tandem mass spectrometry is well suited to handle mixtures of hundreds or thousands of proteins. Mass spectrometry-based proteome elucidation can reveal potential biomarkers and aid in the development of hypotheses for downstream investigation of the molecular mechanisms of disease.

  4. Mass Spectrometry-Based Proteomics for Translational Research: A Technical Overview

    PubMed Central

    Paulo, Joao A.; Kadiyala, Vivek; Banks, Peter A.; Steen, Hanno; Conwell, Darwin L.

    2012-01-01

    Mass spectrometry-based investigation of clinical samples enables the high-throughput identification of protein biomarkers. We provide an overview of mass spectrometry-based proteomic techniques that are applicable to the investigation of clinical samples. We address sample collection, protein extraction and fractionation, mass spectrometry modalities, and quantitative proteomics. Finally, we examine the limitations and further potential of such technologies. Liquid chromatography fractionation coupled with tandem mass spectrometry is well suited to handle mixtures of hundreds or thousands of proteins. Mass spectrometry-based proteome elucidation can reveal potential biomarkers and aid in the development of hypotheses for downstream investigation of the molecular mechanisms of disease. PMID:22461744

  5. Comparison of Normal and Breast Cancer Cell lines using Proteome, Genome and Interactome data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Patwardhan, Anil J.; Strittmatter, Eric F.; Camp, David G.

    2005-12-01

    Normal and cancer cell line proteomes were profiled using high throughput mass spectrometry techniques. Application of both protein-level and peptide-level sample fractionation combined with LC-MS/MS analysis enabled the confident identification of 2,235 unmodified proteins representing a broad range of functional and compartmental classes. An iterative multi-step search strategy was used to identify post-translational modifications and detected several proteins that are preferentially modified in cancer cells. Information regarding both unmodified and modified protein forms was combined with publicly available gene expression and protein-protein interaction data. The resulting integrated dataset revealed several functionally related proteins that are differentially regulated between normal andmore » cancer cell lines.« less

  6. Fractal-like Distributions over the Rational Numbers in High-throughput Biological and Clinical Data

    NASA Astrophysics Data System (ADS)

    Trifonov, Vladimir; Pasqualucci, Laura; Dalla-Favera, Riccardo; Rabadan, Raul

    2011-12-01

    Recent developments in extracting and processing biological and clinical data are allowing quantitative approaches to studying living systems. High-throughput sequencing (HTS), expression profiles, proteomics, and electronic health records (EHR) are some examples of such technologies. Extracting meaningful information from those technologies requires careful analysis of the large volumes of data they produce. In this note, we present a set of fractal-like distributions that commonly appear in the analysis of such data. The first set of examples are drawn from a HTS experiment. Here, the distributions appear as part of the evaluation of the error rate of the sequencing and the identification of tumorogenic genomic alterations. The other examples are obtained from risk factor evaluation and analysis of relative disease prevalence and co-mordbidity as these appear in EHR. The distributions are also relevant to identification of subclonal populations in tumors and the study of quasi-species and intrahost diversity of viral populations.

  7. The Scottish Structural Proteomics Facility: targets, methods and outputs.

    PubMed

    Oke, Muse; Carter, Lester G; Johnson, Kenneth A; Liu, Huanting; McMahon, Stephen A; Yan, Xuan; Kerou, Melina; Weikart, Nadine D; Kadi, Nadia; Sheikh, Md Arif; Schmelz, Stefan; Dorward, Mark; Zawadzki, Michal; Cozens, Christopher; Falconer, Helen; Powers, Helen; Overton, Ian M; van Niekerk, C A Johannes; Peng, Xu; Patel, Prakash; Garrett, Roger A; Prangishvili, David; Botting, Catherine H; Coote, Peter J; Dryden, David T F; Barton, Geoffrey J; Schwarz-Linek, Ulrich; Challis, Gregory L; Taylor, Garry L; White, Malcolm F; Naismith, James H

    2010-06-01

    The Scottish Structural Proteomics Facility was funded to develop a laboratory scale approach to high throughput structure determination. The effort was successful in that over 40 structures were determined. These structures and the methods harnessed to obtain them are reported here. This report reflects on the value of automation but also on the continued requirement for a high degree of scientific and technical expertise. The efficiency of the process poses challenges to the current paradigm of structural analysis and publication. In the 5 year period we published ten peer-reviewed papers reporting structural data arising from the pipeline. Nevertheless, the number of structures solved exceeded our ability to analyse and publish each new finding. By reporting the experimental details and depositing the structures we hope to maximize the impact of the project by allowing others to follow up the relevant biology.

  8. DAnTE: a statistical tool for quantitative analysis of –omics data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Polpitiya, Ashoka D.; Qian, Weijun; Jaitly, Navdeep

    2008-05-03

    DAnTE (Data Analysis Tool Extension) is a statistical tool designed to address challenges unique to quantitative bottom-up, shotgun proteomics data. This tool has also been demonstrated for microarray data and can easily be extended to other high-throughput data types. DAnTE features selected normalization methods, missing value imputation algorithms, peptide to protein rollup methods, an extensive array of plotting functions, and a comprehensive ANOVA scheme that can handle unbalanced data and random effects. The Graphical User Interface (GUI) is designed to be very intuitive and user friendly.

  9. Chipster: user-friendly analysis software for microarray and other high-throughput data.

    PubMed

    Kallio, M Aleksi; Tuimala, Jarno T; Hupponen, Taavi; Klemelä, Petri; Gentile, Massimiliano; Scheinin, Ilari; Koski, Mikko; Käki, Janne; Korpelainen, Eija I

    2011-10-14

    The growth of high-throughput technologies such as microarrays and next generation sequencing has been accompanied by active research in data analysis methodology, producing new analysis methods at a rapid pace. While most of the newly developed methods are freely available, their use requires substantial computational skills. In order to enable non-programming biologists to benefit from the method development in a timely manner, we have created the Chipster software. Chipster (http://chipster.csc.fi/) brings a powerful collection of data analysis methods within the reach of bioscientists via its intuitive graphical user interface. Users can analyze and integrate different data types such as gene expression, miRNA and aCGH. The analysis functionality is complemented with rich interactive visualizations, allowing users to select datapoints and create new gene lists based on these selections. Importantly, users can save the performed analysis steps as reusable, automatic workflows, which can also be shared with other users. Being a versatile and easily extendable platform, Chipster can be used for microarray, proteomics and sequencing data. In this article we describe its comprehensive collection of analysis and visualization tools for microarray data using three case studies. Chipster is a user-friendly analysis software for high-throughput data. Its intuitive graphical user interface enables biologists to access a powerful collection of data analysis and integration tools, and to visualize data interactively. Users can collaborate by sharing analysis sessions and workflows. Chipster is open source, and the server installation package is freely available.

  10. Chipster: user-friendly analysis software for microarray and other high-throughput data

    PubMed Central

    2011-01-01

    Background The growth of high-throughput technologies such as microarrays and next generation sequencing has been accompanied by active research in data analysis methodology, producing new analysis methods at a rapid pace. While most of the newly developed methods are freely available, their use requires substantial computational skills. In order to enable non-programming biologists to benefit from the method development in a timely manner, we have created the Chipster software. Results Chipster (http://chipster.csc.fi/) brings a powerful collection of data analysis methods within the reach of bioscientists via its intuitive graphical user interface. Users can analyze and integrate different data types such as gene expression, miRNA and aCGH. The analysis functionality is complemented with rich interactive visualizations, allowing users to select datapoints and create new gene lists based on these selections. Importantly, users can save the performed analysis steps as reusable, automatic workflows, which can also be shared with other users. Being a versatile and easily extendable platform, Chipster can be used for microarray, proteomics and sequencing data. In this article we describe its comprehensive collection of analysis and visualization tools for microarray data using three case studies. Conclusions Chipster is a user-friendly analysis software for high-throughput data. Its intuitive graphical user interface enables biologists to access a powerful collection of data analysis and integration tools, and to visualize data interactively. Users can collaborate by sharing analysis sessions and workflows. Chipster is open source, and the server installation package is freely available. PMID:21999641

  11. Knowledge-based analysis of microarrays for the discovery of transcriptional regulation relationships

    PubMed Central

    2010-01-01

    Background The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. Results In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. Conclusion High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data. PMID:20122245

  12. Knowledge-based analysis of microarrays for the discovery of transcriptional regulation relationships.

    PubMed

    Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong

    2010-01-18

    The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data.

  13. A Proteomic Workflow Using High-Throughput De Novo Sequencing Towards Complementation of Genome Information for Improved Comparative Crop Science.

    PubMed

    Turetschek, Reinhard; Lyon, David; Desalegn, Getinet; Kaul, Hans-Peter; Wienkoop, Stefanie

    2016-01-01

    The proteomic study of non-model organisms, such as many crop plants, is challenging due to the lack of comprehensive genome information. Changing environmental conditions require the study and selection of adapted cultivars. Mutations, inherent to cultivars, hamper protein identification and thus considerably complicate the qualitative and quantitative comparison in large-scale systems biology approaches. With this workflow, cultivar-specific mutations are detected from high-throughput comparative MS analyses, by extracting sequence polymorphisms with de novo sequencing. Stringent criteria are suggested to filter for confidential mutations. Subsequently, these polymorphisms complement the initially used database, which is ready to use with any preferred database search algorithm. In our example, we thereby identified 26 specific mutations in two cultivars of Pisum sativum and achieved an increased number (17 %) of peptide spectrum matches.

  14. Open source libraries and frameworks for mass spectrometry based proteomics: A developer's perspective☆

    PubMed Central

    Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio

    2014-01-01

    Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. PMID:23467006

  15. Open source libraries and frameworks for mass spectrometry based proteomics: a developer's perspective.

    PubMed

    Perez-Riverol, Yasset; Wang, Rui; Hermjakob, Henning; Müller, Markus; Vesada, Vladimir; Vizcaíno, Juan Antonio

    2014-01-01

    Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.

  16. A Customizable Flow Injection System for Automated, High Throughput, and Time Sensitive Ion Mobility Spectrometry and Mass Spectrometry Measurements

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Orton, Daniel J.; Tfaily, Malak M.; Moore, Ronald J.

    To better understand disease conditions and environmental perturbations, multi-omic studies (i.e. proteomic, lipidomic, metabolomic, etc. analyses) are vastly increasing in popularity. In a multi-omic study, a single sample is typically extracted in multiple ways and numerous analyses are performed using different instruments. Thus, one sample becomes many analyses, making high throughput and reproducible evaluations a necessity. One way to address the numerous samples and varying instrumental conditions is to utilize a flow injection analysis (FIA) system for rapid sample injection. While some FIA systems have been created to address these challenges, many have limitations such as high consumable costs, lowmore » pressure capabilities, limited pressure monitoring and fixed flow rates. To address these limitations, we created an automated, customizable FIA system capable of operating at diverse flow rates (~50 nL/min to 500 µL/min) to accommodate low- and high-flow instrument sources. This system can also operate at varying analytical throughputs from 24 to 1200 samples per day to enable different MS analysis approaches. Applications ranging from native protein analyses to molecular library construction were performed using the FIA system. The results from these studies showed a highly robust platform, providing consistent performance over many days without carryover as long as washing buffers specific to each molecular analysis were utilized.« less

  17. Next-Generation Technologies for Multiomics Approaches Including Interactome Sequencing

    PubMed Central

    Ohashi, Hiroyuki; Miyamoto-Sato, Etsuko

    2015-01-01

    The development of high-speed analytical techniques such as next-generation sequencing and microarrays allows high-throughput analysis of biological information at a low cost. These techniques contribute to medical and bioscience advancements and provide new avenues for scientific research. Here, we outline a variety of new innovative techniques and discuss their use in omics research (e.g., genomics, transcriptomics, metabolomics, proteomics, and interactomics). We also discuss the possible applications of these methods, including an interactome sequencing technology that we developed, in future medical and life science research. PMID:25649523

  18. Top-down proteomics reveals a unique protein S-thiolation switch in Salmonella Typimurium in response to infection-like conditions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ansong, Charles; Wu, Si; Meng, Da

    Characterization of the mature protein complement in cells is crucial for a better understanding of cellular processes on a systems-wide scale. Bottom-up proteomic approaches often lead to loss of critical information about an endogenous protein’s actual state due to post translational modifications (PTMs) and other processes. Top-down approaches that involve analysis of the intact protein can address this concern but present significant analytical challenges related to the separation quality needed, measurement sensitivity, and speed that result in low throughput and limited coverage. Here we used single-dimension ultra high pressure liquid chromatography mass spectrometry to investigate the comprehensive ‘intact’ proteome ofmore » the Gram negative bacterial pathogen Salmonella Typhimurium. Top-down proteomics analysis revealed 563 unique proteins including 1665 proteoforms generated by PTMs, representing the largest microbial top-down dataset reported to date. Our analysis not only confirmed several previously recognized aspects of Salmonella biology and bacterial PTMs in general, but also revealed several novel biological insights. Of particular interest was differential utilization of the protein S-thiolation forms S-glutathionylation and S-cysteinylation in response to infection-like conditions versus basal conditions, which was corroborated by changes in corresponding biosynthetic pathways. This differential utilization highlights underlying metabolic mechanisms that modulate changes in cellular signaling, and represents to our knowledge the first report of S-cysteinylation in Gram negative bacteria. The demonstrated utility of our simple proteome-wide intact protein level measurement strategy for gaining biological insight should promote broader adoption and applications of top-down proteomics approaches.« less

  19. Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

    PubMed

    Keich, Uri; Kertesz-Farkas, Attila; Noble, William Stafford

    2015-08-07

    Interpreting the potentially vast number of hypotheses generated by a shotgun proteomics experiment requires a valid and accurate procedure for assigning statistical confidence estimates to identified tandem mass spectra. Despite the crucial role such procedures play in most high-throughput proteomics experiments, the scientific literature has not reached a consensus about the best confidence estimation methodology. In this work, we evaluate, using theoretical and empirical analysis, four previously proposed protocols for estimating the false discovery rate (FDR) associated with a set of identified tandem mass spectra: two variants of the target-decoy competition protocol (TDC) of Elias and Gygi and two variants of the separate target-decoy search protocol of Käll et al. Our analysis reveals significant biases in the two separate target-decoy search protocols. Moreover, the one TDC protocol that provides an unbiased FDR estimate among the target PSMs does so at the cost of forfeiting a random subset of high-scoring spectrum identifications. We therefore propose the mix-max procedure to provide unbiased, accurate FDR estimates in the presence of well-calibrated scores. The method avoids biases associated with the two separate target-decoy search protocols and also avoids the propensity for target-decoy competition to discard a random subset of high-scoring target identifications.

  20. Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics

    PubMed Central

    2016-01-01

    Interpreting the potentially vast number of hypotheses generated by a shotgun proteomics experiment requires a valid and accurate procedure for assigning statistical confidence estimates to identified tandem mass spectra. Despite the crucial role such procedures play in most high-throughput proteomics experiments, the scientific literature has not reached a consensus about the best confidence estimation methodology. In this work, we evaluate, using theoretical and empirical analysis, four previously proposed protocols for estimating the false discovery rate (FDR) associated with a set of identified tandem mass spectra: two variants of the target-decoy competition protocol (TDC) of Elias and Gygi and two variants of the separate target-decoy search protocol of Käll et al. Our analysis reveals significant biases in the two separate target-decoy search protocols. Moreover, the one TDC protocol that provides an unbiased FDR estimate among the target PSMs does so at the cost of forfeiting a random subset of high-scoring spectrum identifications. We therefore propose the mix-max procedure to provide unbiased, accurate FDR estimates in the presence of well-calibrated scores. The method avoids biases associated with the two separate target-decoy search protocols and also avoids the propensity for target-decoy competition to discard a random subset of high-scoring target identifications. PMID:26152888

  1. Serial isoelectric focusing as an effective and economic way to obtain maximal resolution and high-throughput in 2D-based comparative proteomics of scarce samples: proof-of-principle.

    PubMed

    Farhoud, Murtada H; Wessels, Hans J C T; Wevers, Ron A; van Engelen, Baziel G; van den Heuvel, Lambert P; Smeitink, Jan A

    2005-01-01

    In 2D-based comparative proteomics of scarce samples, such as limited patient material, established methods for prefractionation and subsequent use of different narrow range IPG strips to increase overall resolution are difficult to apply. Also, a high number of samples, a prerequisite for drawing meaningful conclusions when pathological and control samples are considered, will increase the associated amount of work almost exponentially. Here, we introduce a novel, effective, and economic method designed to obtain maximum 2D resolution while maintaining the high throughput necessary to perform large-scale comparative proteomics studies. The method is based on connecting different IPG strips serially head-to-tail so that a complete line of different IPG strips with sequential pH regions can be focused in the same experiment. We show that when 3 IPG strips (covering together the pH range of 3-11) are connected head-to-tail an optimal resolution is achieved along the whole pH range. Sample consumption, time required, and associated costs are reduced by almost 70%, and the workload is reduced significantly.

  2. Study of cellular oncometabolism via multidimensional protein identification technology.

    PubMed

    Aukim-Hastie, Claire; Garbis, Spiros D

    2014-01-01

    Cellular proteomics is becoming a widespread clinical application, matching the definition of bench-to-bedside translation. Among various fields of investigation, this approach can be applied to the study of the metabolic alterations that accompany oncogenesis and tumor progression, which are globally referred to as oncometabolism. Here, we describe a multidimensional protein identification technology (MuDPIT)-based strategy that can be employed to study the cellular proteome of malignant cells and tissues. This method has previously been shown to be compatible with the reproducible, in-depth analysis of up to a thousand proteins in clinical samples. The possibility to employ this technique to study clinical specimens demonstrates its robustness. MuDPIT is advantageous as compared to other approaches because it is direct, highly sensitive, and reproducible, it provides high resolution with ultra-high mass accuracy, it allows for relative quantifications, and it is compatible with multiplexing (thus limiting costs).This method enables the direct assessment of the proteomic profile of neoplastic cells and tissues and could be employed in the near future as a high-throughput, rapid, quantitative, and cost-effective screening platform for clinical samples. © 2014 Elsevier Inc. All rights reserved.

  3. Abundant Lysine Methylation and N-Terminal Acetylation in Sulfolobus islandicus Revealed by Bottom-Up and Top-Down Proteomics*

    PubMed Central

    Vorontsov, Egor A.; Rensen, Elena; Prangishvili, David; Krupovic, Mart; Chamot-Rooke, Julia

    2016-01-01

    Protein post-translational methylation has been reported to occur in archaea, including members of the genus Sulfolobus, but has never been characterized on a proteome-wide scale. Among important Sulfolobus proteins carrying such modification are the chromatin proteins that have been described to be methylated on lysine side chains, resembling eukaryotic histones in that aspect. To get more insight into the extent of this modification and its dynamics during the different growth steps of the thermoacidophylic archaeon S. islandicus LAL14/1, we performed a global and deep proteomic analysis using a combination of high-throughput bottom-up and top-down approaches on a single high-resolution mass spectrometer. 1,931 methylation sites on 751 proteins were found by the bottom-up analysis, with methylation sites on 526 proteins monitored throughout three cell culture growth stages: early-exponential, mid-exponential, and stationary. The top-down analysis revealed 3,978 proteoforms arising from 681 proteins, including 292 methylated proteoforms, 85 of which were comprehensively characterized. Methylated proteoforms of the five chromatin proteins (Alba1, Alba2, Cren7, Sul7d1, Sul7d2) were fully characterized by a combination of bottom-up and top-down data. The top-down analysis also revealed an increase of methylation during cell growth for two chromatin proteins, which had not been evidenced by bottom-up. These results shed new light on the ubiquitous lysine methylation throughout the S. islandicus proteome. Furthermore, we found that S. islandicus proteins are frequently acetylated at the N terminus, following the removal of the N-terminal methionine. This study highlights the great value of combining bottom-up and top-down proteomics for obtaining an unprecedented level of accuracy in detecting differentially modified intact proteoforms. The data have been deposited to the ProteomeXchange with identifiers PXD003074 and PXD004179. PMID:27555370

  4. Respiratory Toxicity Biomarkers

    EPA Science Inventory

    The advancement in high throughput genomic, proteomic and metabolomic techniques have accelerated pace of lung biomarker discovery. A recent growth in the discovery of new lung toxicity/disease biomarkers have led to significant advances in our understanding of pathological proce...

  5. Machine learning in computational biology to accelerate high-throughput protein expression.

    PubMed

    Sastry, Anand; Monk, Jonathan; Tegel, Hanna; Uhlen, Mathias; Palsson, Bernhard O; Rockberg, Johan; Brunk, Elizabeth

    2017-08-15

    The Human Protein Atlas (HPA) enables the simultaneous characterization of thousands of proteins across various tissues to pinpoint their spatial location in the human body. This has been achieved through transcriptomics and high-throughput immunohistochemistry-based approaches, where over 40 000 unique human protein fragments have been expressed in E. coli. These datasets enable quantitative tracking of entire cellular proteomes and present new avenues for understanding molecular-level properties influencing expression and solubility. Combining computational biology and machine learning identifies protein properties that hinder the HPA high-throughput antibody production pipeline. We predict protein expression and solubility with accuracies of 70% and 80%, respectively, based on a subset of key properties (aromaticity, hydropathy and isoelectric point). We guide the selection of protein fragments based on these characteristics to optimize high-throughput experimentation. We present the machine learning workflow as a series of IPython notebooks hosted on GitHub (https://github.com/SBRG/Protein_ML). The workflow can be used as a template for analysis of further expression and solubility datasets. ebrunk@ucsd.edu or johanr@biotech.kth.se. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  6. Microchip-Based Single-Cell Functional Proteomics for Biomedical Applications

    PubMed Central

    Lu, Yao; Yang, Liu; Wei, Wei; Shi, Qihui

    2017-01-01

    Cellular heterogeneity has been widely recognized but only recently have single cell tools become available that allow characterizing heterogeneity at the genomic and proteomic levels. We review the technological advances in microchip-based toolkits for single-cell functional proteomics. Each of these tools has distinct advantages and limitations, and a few have advanced toward being applied to address biological or clinical problems that fail to be addressed by traditional population-based methods. High-throughput single-cell proteomic assays generate high-dimensional data sets that contain new information and thus require developing new analytical framework to extract new biology. In this review article, we highlight a few biological and clinical applications in which the microchip-based single-cell proteomic tools provide unique advantages. The examples include resolving functional heterogeneity and dynamics of immune cells, dissecting cell-cell interaction by creating well-contolled on-chip microenvironment, capturing high-resolution snapshots of immune system functions in patients for better immunotherapy and elucidating phosphoprotein signaling networks in cancer cells for guiding effective molecularly targeted therapies. PMID:28280819

  7. High-throughput protein analysis integrating bioinformatics and experimental assays

    PubMed Central

    del Val, Coral; Mehrle, Alexander; Falkenhahn, Mechthild; Seiler, Markus; Glatting, Karl-Heinz; Poustka, Annemarie; Suhai, Sandor; Wiemann, Stefan

    2004-01-01

    The wealth of transcript information that has been made publicly available in recent years requires the development of high-throughput functional genomics and proteomics approaches for its analysis. Such approaches need suitable data integration procedures and a high level of automation in order to gain maximum benefit from the results generated. We have designed an automatic pipeline to analyse annotated open reading frames (ORFs) stemming from full-length cDNAs produced mainly by the German cDNA Consortium. The ORFs are cloned into expression vectors for use in large-scale assays such as the determination of subcellular protein localization or kinase reaction specificity. Additionally, all identified ORFs undergo exhaustive bioinformatic analysis such as similarity searches, protein domain architecture determination and prediction of physicochemical characteristics and secondary structure, using a wide variety of bioinformatic methods in combination with the most up-to-date public databases (e.g. PRINTS, BLOCKS, INTERPRO, PROSITE SWISSPROT). Data from experimental results and from the bioinformatic analysis are integrated and stored in a relational database (MS SQL-Server), which makes it possible for researchers to find answers to biological questions easily, thereby speeding up the selection of targets for further analysis. The designed pipeline constitutes a new automatic approach to obtaining and administrating relevant biological data from high-throughput investigations of cDNAs in order to systematically identify and characterize novel genes, as well as to comprehensively describe the function of the encoded proteins. PMID:14762202

  8. Statistical characterization of multiple-reaction monitoring mass spectrometry (MRM-MS) assays for quantitative proteomics

    PubMed Central

    2012-01-01

    Multiple reaction monitoring mass spectrometry (MRM-MS) with stable isotope dilution (SID) is increasingly becoming a widely accepted assay for the quantification of proteins and peptides. These assays have shown great promise in relatively high throughput verification of candidate biomarkers. While the use of MRM-MS assays is well established in the small molecule realm, their introduction and use in proteomics is relatively recent. As such, statistical and computational methods for the analysis of MRM-MS data from proteins and peptides are still being developed. Based on our extensive experience with analyzing a wide range of SID-MRM-MS data, we set forth a methodology for analysis that encompasses significant aspects ranging from data quality assessment, assay characterization including calibration curves, limits of detection (LOD) and quantification (LOQ), and measurement of intra- and interlaboratory precision. We draw upon publicly available seminal datasets to illustrate our methods and algorithms. PMID:23176545

  9. Statistical characterization of multiple-reaction monitoring mass spectrometry (MRM-MS) assays for quantitative proteomics.

    PubMed

    Mani, D R; Abbatiello, Susan E; Carr, Steven A

    2012-01-01

    Multiple reaction monitoring mass spectrometry (MRM-MS) with stable isotope dilution (SID) is increasingly becoming a widely accepted assay for the quantification of proteins and peptides. These assays have shown great promise in relatively high throughput verification of candidate biomarkers. While the use of MRM-MS assays is well established in the small molecule realm, their introduction and use in proteomics is relatively recent. As such, statistical and computational methods for the analysis of MRM-MS data from proteins and peptides are still being developed. Based on our extensive experience with analyzing a wide range of SID-MRM-MS data, we set forth a methodology for analysis that encompasses significant aspects ranging from data quality assessment, assay characterization including calibration curves, limits of detection (LOD) and quantification (LOQ), and measurement of intra- and interlaboratory precision. We draw upon publicly available seminal datasets to illustrate our methods and algorithms.

  10. Enrichment of plasma membrane proteins using nanoparticle pellicles: comparison between silica and higher density nanoparticles

    PubMed Central

    Choksawangkarn, Waeowalee; Kim, Sung-Kyoung; Cannon, Joe R.; Edwards, Nathan J.; Lee, Sang Bok; Fenselau, Catherine

    2013-01-01

    Proteomic and other characterization of plasma membrane proteins is made difficult by their low abundance, hydrophobicity, frequent carboxylation and dynamic population. We and others have proposed that underrepresentation in LC-MS/MS analysis can be partially compensated by enriching the plasma membrane and its proteins using cationic nanoparticle pellicles. The nanoparticles increase the density of plasma membrane sheets and thus enhance separation by centrifugation from other lysed cellular components. Herein we test the hypothesis that the use of nanoparticles with increased densities can provide enhanced enrichment of plasma membrane proteins for proteomic analysis. Multiple myeloma cells were grown and coated in suspension with three different pellicles of three different densities and both pellicle coated and uncoated suspensions analyzed by high-throughput LC-MS/MS. Enrichment was evaluated by the total number and the spectral counts of identified plasma membrane proteins. PMID:23289353

  11. Comparative proteomics reveals recruitment patterns of some protein families in the venoms of Cnidaria.

    PubMed

    Jaimes-Becerra, Adrian; Chung, Ray; Morandini, André C; Weston, Andrew J; Padilla, Gabriel; Gacesa, Ranko; Ward, Malcolm; Long, Paul F; Marques, Antonio C

    2017-10-01

    Cnidarians are probably the oldest group of animals to be venomous, yet our current picture of cnidarian venom evolution is highly imbalanced due to limited taxon sampling. High-throughput tandem mass spectrometry was used to determine venom composition of the scyphozoan Chrysaora lactea and two cubozoans Tamoya haplonema and Chiropsalmus quadrumanus. Protein recruitment patterns were then compared against 5 other cnidarian venom proteomes taken from the literature. A total of 28 putative toxin protein families were identified, many for the first time in Cnidaria. Character mapping analysis revealed that 17 toxin protein families with predominantly cytolytic biological activities were likely recruited into the cnidarian venom proteome before the lineage split between Anthozoa and Medusozoa. Thereafter, venoms of Medusozoa and Anthozoa differed during subsequent divergence of cnidarian classes. Recruitment and loss of toxin protein families did not correlate with accepted phylogenetic patterns of Cnidaria. Selective pressures that drive toxin diversification independent of taxonomic positioning have yet to be identified in Cnidaria and now warrant experimental consideration. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Unbiased Protein Association Study on the Public Human Proteome Reveals Biological Connections between Co-Occurring Protein Pairs

    PubMed Central

    2017-01-01

    Mass-spectrometry-based, high-throughput proteomics experiments produce large amounts of data. While typically acquired to answer specific biological questions, these data can also be reused in orthogonal ways to reveal new biological knowledge. We here present a novel method for such orthogonal data reuse of public proteomics data. Our method elucidates biological relationships between proteins based on the co-occurrence of these proteins across human experiments in the PRIDE database. The majority of the significantly co-occurring protein pairs that were detected by our method have been successfully mapped to existing biological knowledge. The validity of our novel method is substantiated by the extremely few pairs that can be mapped to existing knowledge based on random associations between the same set of proteins. Moreover, using literature searches and the STRING database, we were able to derive meaningful biological associations for unannotated protein pairs that were detected using our method, further illustrating that as-yet unknown associations present highly interesting targets for follow-up analysis. PMID:28480704

  13. High-throughput sperm differential proteomics suggests that epigenetic alterations contribute to failed assisted reproduction.

    PubMed

    Azpiazu, Rubén; Amaral, Alexandra; Castillo, Judit; Estanyol, Josep Maria; Guimerà, Marta; Ballescà, Josep Lluís; Balasch, Juan; Oliva, Rafael

    2014-06-01

    Are there quantitative alterations in the proteome of normozoospermic sperm samples that are able to complete IVF but whose female partner does not achieve pregnancy? Normozoospermic sperm samples with different IVF outcomes (pregnancy versus no pregnancy) differed in the levels of at least 66 proteins. The analysis of the proteome of sperm samples with distinct fertilization capacity using low-throughput proteomic techniques resulted in the detection of a few differential proteins. Current high-throughput mass spectrometry approaches allow the identification and quantification of a substantially higher number of proteins. This was a case-control study including 31 men with normozoospermic sperm and their partners who underwent IVF with successful fertilization recruited between 2007 and 2008. Normozoospermic sperm samples from 15 men whose female partners did not achieve pregnancy after IVF (no pregnancy) and 16 men from couples that did achieve pregnancy after IVF (pregnancy) were included in this study. To perform the differential proteomic experiments, 10 no pregnancy samples and 10 pregnancy samples were separately pooled and subsequently used for tandem mass tags (TMT) protein labelling, sodium dodecyl sulphate-polyacrylamide gel electrophoresis, liquid chromatography tandem mass spectrometry (LC-MS/MS) identification and peak intensity relative protein quantification. Bioinformatic analyses were performed using UniProt Knowledgebase, DAVID and Reactome. Individual samples (n = 5 no pregnancy samples; n = 6 pregnancy samples) and aliquots from the above TMT pools were used for western blotting. By using TMT labelling and LC-MS/MS, we have detected 31 proteins present at lower abundance (ratio no pregnancy/pregnancy < 0.67) and 35 at higher abundance (ratio no pregnancy/pregnancy > 1.5) in the no pregnancy group. Bioinformatic analyses showed that the proteins with differing abundance are involved in chromatin assembly and lipoprotein metabolism (P values < 0.05). In addition, the differential abundance of one of the proteins (SRSF protein kinase 1) was further validated by western blotting using independent samples (P value < 0.01). For individual samples the amount of recovered sperm not used for IVF was low and in most of the cases insufficient for MS analysis, therefore pools of samples had to be used to this end. Alterations in the proteins involved in chromatin assembly and metabolism may result in epigenetic errors during spermatogenesis, leading to inaccurate sperm epigenetic signatures, which could ultimately prevent embryonic development. These sperm proteins may thus possibly have clinical relevance. This work was supported by the Spanish Ministry of Economy and Competitiveness (Ministerio de Economia y Competividad; FEDER BFU 2009-07118 and PI13/00699) and Fundación Salud 2000 SERONO13-015. There are no competing interests to declare.

  14. Comparative analysis of cerebrospinal fluid from the meningo-encephalitic stage of T. b. gambiense and rhodesiense sleeping sickness patients using TMT quantitative proteomics.

    PubMed

    Tiberti, Natalia; Sanchez, Jean-Charles

    2015-09-01

    The quantitative proteomics data here reported are part of a research article entitled "Increased acute immune response during the meningo-encephalitic stage of Trypanosoma brucei rhodesiense sleeping sickness compared to Trypanosoma brucei gambiense", published by Tiberti et al., 2015. Transl. Proteomics 6, 1-9. Sleeping sickness (human African trypanosomiasis - HAT) is a deadly neglected tropical disease affecting mainly rural communities in sub-Saharan Africa. This parasitic disease is caused by the Trypanosoma brucei (T. b.) parasite, which is transmitted to the human host through the bite of the tse-tse fly. Two parasite sub-species, T. b. rhodesiense and T. b. gambiense, are responsible for two clinically different and geographically separated forms of sleeping sickness. The objective of the present study was to characterise and compare the cerebrospinal fluid (CSF) proteome of stage 2 (meningo-encephalitic stage) HAT patients suffering from T. b. gambiense or T. b. rhodesiense disease using high-throughput quantitative proteomics and the Tandem Mass Tag (TMT(®)) isobaric labelling. In order to evaluate the CSF proteome in the context of HAT pathophysiology, the protein dataset was then submitted to gene ontology and pathway analysis. Two significantly differentially expressed proteins (C-reactive protein and orosomucoid 1) were further verified on a larger population of patients (n=185) by ELISA, confirming the mass spectrometry results. By showing a predominant involvement of the acute immune response in rhodesiense HAT, the proteomics results obtained in this work will contribute to further understand the mechanisms of pathology occurring in HAT and to propose new biomarkers of potential clinical utility. The mass spectrometry raw data are available in the Pride Archive via ProteomeXchange through the identifier PXD001082.

  15. Overview of proteomics studies in obstructive sleep apnea

    PubMed Central

    Feliciano, Amélia; Torres, Vukosava Milic; Vaz, Fátima; Carvalho, Ana Sofia; Matthiesen, Rune; Pinto, Paula; Malhotra, Atul; Bárbara, Cristina; Penque, Deborah

    2015-01-01

    Obstructive sleep apnea (OSA) is an underdiagnosed common public health concern causing deleterious effects on metabolic and cardiovascular health. Although much has been learned regarding the pathophysiology and consequences of OSA in the past decades, the molecular mechanisms associated with such processes remain poorly defined. The advanced high-throughput proteomics-based technologies have become a fundamental approach for identifying novel disease mediators as potential diagnostic and therapeutic targets for many diseases, including OSA. Here, we briefly review OSA pathophysiology and the technological advances in proteomics and the first results of its application to address critical issues in the OSA field. PMID:25770042

  16. Development of an Efficient Virus Induced Gene Silencing Strategy in the Non-Model Wild Ginger-Zingiber zerumbet and Investigation of Associated Proteome Changes

    PubMed Central

    Mahadevan, Chidambareswaren; Jaleel, Abdul; Deb, Lokesh; Thomas, George; Sakuntala, Manjula

    2015-01-01

    Zingiber zerumbet (Zingiberaceae) is a wild, tropical medicinal herb that shows a high degree of resistance to diseases affecting cultivated ginger. Barley stripe mosaic virus (BSMV) silencing vectors containing an endogenous phytoene desaturase (PDS) gene fragment were agroinfiltrated into young leaves of Z. zerumbet under controlled growth conditions to effect virus-induced gene silencing (VIGS). Infiltrated leaves as well as newly emerged leaves and tillers showed visual signs of PDS silencing after 30 days. Replication and systemic movement of the viral vectors in silenced plants were confirmed by RT-PCR. Real-time quantitative PCR analysis verified significant down-regulation of PDS transcripts in the silenced tissues. Label-free proteomic analysis was conducted in leaves with established PDS transcript down regulation and buffer-infiltrated (mock) leaves. A total of 474 proteins were obtained, which were up-regulated, down-regulated or modulated de novo during VIGS. Most of these proteins were localized to the chloroplast, as revealed by UniprotKB analysis, and among the up-regulated proteins there were abiotic stress responsive, photosynthetic, metabolic and membrane proteins. Moreover, the demonstration of viral proteins together with host proteins proved successful viral infection. We report for the first time the establishment of a high-throughput gene functional analysis platform using BSMV-mediated VIGS in Z. zerumbet, as well as proteomic changes associated with VIGS. PMID:25918840

  17. Generation and analyses of human synthetic antibody libraries and their application for protein microarrays.

    PubMed

    Säll, Anna; Walle, Maria; Wingren, Christer; Müller, Susanne; Nyman, Tomas; Vala, Andrea; Ohlin, Mats; Borrebaeck, Carl A K; Persson, Helena

    2016-10-01

    Antibody-based proteomics offers distinct advantages in the analysis of complex samples for discovery and validation of biomarkers associated with disease. However, its large-scale implementation requires tools and technologies that allow development of suitable antibody or antibody fragments in a high-throughput manner. To address this we designed and constructed two human synthetic antibody fragment (scFv) libraries denoted HelL-11 and HelL-13. By the use of phage display technology, in total 466 unique scFv antibodies specific for 114 different antigens were generated. The specificities of these antibodies were analyzed in a variety of immunochemical assays and a subset was further evaluated for functionality in protein microarray applications. This high-throughput approach demonstrates the ability to rapidly generate a wealth of reagents not only for proteome research, but potentially also for diagnostics and therapeutics. In addition, this work provides a great example on how a synthetic approach can be used to optimize library designs. By having precise control of the diversity introduced into the antigen-binding sites, synthetic libraries offer increased understanding of how different diversity contributes to antibody binding reactivity and stability, thereby providing the key to future library optimization. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. SteinerNet: a web server for integrating ‘omic’ data to discover hidden components of response pathways

    PubMed Central

    Tuncbag, Nurcan; McCallum, Scott; Huang, Shao-shan Carol; Fraenkel, Ernest

    2012-01-01

    High-throughput technologies including transcriptional profiling, proteomics and reverse genetics screens provide detailed molecular descriptions of cellular responses to perturbations. However, it is difficult to integrate these diverse data to reconstruct biologically meaningful signaling networks. Previously, we have established a framework for integrating transcriptional, proteomic and interactome data by searching for the solution to the prize-collecting Steiner tree problem. Here, we present a web server, SteinerNet, to make this method available in a user-friendly format for a broad range of users with data from any species. At a minimum, a user only needs to provide a set of experimentally detected proteins and/or genes and the server will search for connections among these data from the provided interactomes for yeast, human, mouse, Drosophila melanogaster and Caenorhabditis elegans. More advanced users can upload their own interactome data as well. The server provides interactive visualization of the resulting optimal network and downloadable files detailing the analysis and results. We believe that SteinerNet will be useful for researchers who would like to integrate their high-throughput data for a specific condition or cellular response and to find biologically meaningful pathways. SteinerNet is accessible at http://fraenkel.mit.edu/steinernet. PMID:22638579

  19. The clinical impact of recent advances in LC-MS for cancer biomarker discovery and verification

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Hui; Shi, Tujin; Qian, Wei-Jun

    2015-12-04

    Mass spectrometry-based proteomics has become an indispensable tool in biomedical research with broad applications ranging from fundamental biology, systems biology, and biomarker discovery. Recent advances in LC-MS have made it become a major technology in clinical applications, especially in cancer biomarker discovery and verification. To overcome the challenges associated with the analysis of clinical samples, such as extremely wide dynamic range of protein concentrations in biofluids and the need to perform high throughput and accurate quantification, significant efforts have been devoted to improve the overall performance of LC-MS bases clinical proteomics. In this review, we summarize the recent advances inmore » LC-MS in the aspect of cancer biomarker discovery and quantification, and discuss its potentials, limitations, and future perspectives.« less

  20. Searching for microbial protein over-expression in a complex matrix using automated high throughput MS-based proteomics tools.

    PubMed

    Akeroyd, Michiel; Olsthoorn, Maurien; Gerritsma, Jort; Gutker-Vermaas, Diana; Ekkelkamp, Laurens; van Rij, Tjeerd; Klaassen, Paul; Plugge, Wim; Smit, Ed; Strupat, Kerstin; Wenzel, Thibaut; van Tilborg, Marcel; van der Hoeven, Rob

    2013-03-10

    In the discovery of new enzymes genomic and cDNA expression libraries containing thousands of differential clones are generated to obtain biodiversity. These libraries need to be screened for the activity of interest. Removing so-called empty and redundant clones significantly reduces the size of these expression libraries and therefore speeds up new enzyme discovery. Here, we present a sensitive, generic workflow for high throughput screening of successful microbial protein over-expression in microtiter plates containing a complex matrix based on mass spectrometry techniques. MALDI-LTQ-Orbitrap screening followed by principal component analysis and peptide mass fingerprinting was developed to obtain a throughput of ∼12,000 samples per week. Alternatively, a UHPLC-MS(2) approach including MS(2) protein identification was developed for microorganisms with a complex protein secretome with a throughput of ∼2000 samples per week. TCA-induced protein precipitation enhanced by addition of bovine serum albumin is used for protein purification prior to MS detection. We show that this generic workflow can effectively reduce large expression libraries from fungi and bacteria to their minimal size by detection of successful protein over-expression using MS. Copyright © 2012 Elsevier B.V. All rights reserved.

  1. Fungal proteomics: from identification to function.

    PubMed

    Doyle, Sean

    2011-08-01

    Some fungi cause disease in humans and plants, while others have demonstrable potential for the control of insect pests. In addition, fungi are also a rich reservoir of therapeutic metabolites and industrially useful enzymes. Detailed analysis of fungal biochemistry is now enabled by multiple technologies including protein mass spectrometry, genome and transcriptome sequencing and advances in bioinformatics. Yet, the assignment of function to fungal proteins, encoded either by in silico annotated, or unannotated genes, remains problematic. The purpose of this review is to describe the strategies used by many researchers to reveal protein function in fungi, and more importantly, to consolidate the nomenclature of 'unknown function protein' as opposed to 'hypothetical protein' - once any protein has been identified by protein mass spectrometry. A combination of approaches including comparative proteomics, pathogen-induced protein expression and immunoproteomics are outlined, which, when used in combination with a variety of other techniques (e.g. functional genomics, microarray analysis, immunochemical and infection model systems), appear to yield comprehensive and definitive information on protein function in fungi. The relative advantages of proteomic, as opposed to transcriptomic-only, analyses are also described. In the future, combined high-throughput, quantitative proteomics, allied to transcriptomic sequencing, are set to reveal much about protein function in fungi. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  2. Quantitative proteomic analysis of microdissected oral epithelium for cancer biomarker discovery.

    PubMed

    Xiao, Hua; Langerman, Alexander; Zhang, Yan; Khalid, Omar; Hu, Shen; Cao, Cheng-Xi; Lingen, Mark W; Wong, David T W

    2015-11-01

    Specific biomarkers are urgently needed for the detection and progression of oral cancer. The objective of this study was to discover cancer biomarkers from oral epithelium through utilizing high throughput quantitative proteomics approaches. Morphologically malignant, epithelial dysplasia, and adjacent normal epithelial tissues were laser capture microdissected (LCM) from 19 patients and used for proteomics analysis. Total proteins from each group were extracted, digested and then labelled with corresponding isobaric tags for relative and absolute quantitation (iTRAQ). Labelled peptides from each sample were combined and analyzed by liquid chromatography-mass spectrometry (LC-MS/MS) for protein identification and quantification. In total, 500 proteins were identified and 425 of them were quantified. When compared with adjacent normal oral epithelium, 17 and 15 proteins were consistently up-regulated or down-regulated in malignant and epithelial dysplasia, respectively. Half of these candidate biomarkers were discovered for oral cancer for the first time. Cornulin was initially confirmed in tissue protein extracts and was further validated in tissue microarray. Its presence in the saliva of oral cancer patients was also explored. Myoglobin and S100A8 were pre-validated by tissue microarray. These data demonstrated that the proteomic biomarkers discovered through this strategy are potential targets for oral cancer detection and salivary diagnostics. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Microfluidic-Mass Spectrometry Interfaces for Translational Proteomics.

    PubMed

    Pedde, R Daniel; Li, Huiyan; Borchers, Christoph H; Akbari, Mohsen

    2017-10-01

    Interfacing mass spectrometry (MS) with microfluidic chips (μchip-MS) holds considerable potential to transform a clinician's toolbox, providing translatable methods for the early detection, diagnosis, monitoring, and treatment of noncommunicable diseases by streamlining and integrating laborious sample preparation workflows on high-throughput, user-friendly platforms. Overcoming the limitations of competitive immunoassays - currently the gold standard in clinical proteomics - μchip-MS can provide unprecedented access to complex proteomic assays having high sensitivity and specificity, but without the labor, costs, and complexities associated with conventional MS sample processing. This review surveys recent μchip-MS systems for clinical applications and examines their emerging role in streamlining the development and translation of MS-based proteomic assays by alleviating many of the challenges that currently inhibit widespread clinical adoption. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.

  4. Proteomic evaluation of genetically modified crops: current status and challenges

    PubMed Central

    Gong, Chun Yan; Wang, Tai

    2013-01-01

    Hectares of genetically modified (GM) crops have increased exponentially since 1996, when such crops began to be commercialized. GM biotechnology, together with conventional breeding, has become the main approach to improving agronomic traits of crops. However, people are concerned about the safety of GM crops, especially GM-derived food and feed. Many efforts have been made to evaluate the unintended effects caused by the introduction of exogenous genes. “Omics” techniques have advantages over targeted analysis in evaluating such crops because of their use of high-throughput screening. Proteins are key players in gene function and are directly involved in metabolism and cellular development or have roles as toxins, antinutrients, or allergens, which are essential for human health. Thus, proteomics can be expected to become one of the most useful tools in safety assessment. This review assesses the potential of proteomics in evaluating various GM crops. We further describe the challenges in ensuring homogeneity and sensitivity in detection techniques. PMID:23471542

  5. Proteomic evaluation of genetically modified crops: current status and challenges.

    PubMed

    Gong, Chun Yan; Wang, Tai

    2013-01-01

    Hectares of genetically modified (GM) crops have increased exponentially since 1996, when such crops began to be commercialized. GM biotechnology, together with conventional breeding, has become the main approach to improving agronomic traits of crops. However, people are concerned about the safety of GM crops, especially GM-derived food and feed. Many efforts have been made to evaluate the unintended effects caused by the introduction of exogenous genes. "Omics" techniques have advantages over targeted analysis in evaluating such crops because of their use of high-throughput screening. Proteins are key players in gene function and are directly involved in metabolism and cellular development or have roles as toxins, antinutrients, or allergens, which are essential for human health. Thus, proteomics can be expected to become one of the most useful tools in safety assessment. This review assesses the potential of proteomics in evaluating various GM crops. We further describe the challenges in ensuring homogeneity and sensitivity in detection techniques.

  6. Plasma proteomic changes during hypothermic and normothermic cardiopulmonary bypass in aortic surgeries

    PubMed Central

    ODA, TEIJI; YAMAGUCHI, AKANE; YOKOYAMA, MASAO; SHIMIZU, KOJI; TOYOTA, KOSAKU; NIKAI, TETSURO; MATSUMOTO, KEN-ICHI

    2014-01-01

    Deep hypothermic circulatory arrest (DHCA) is a protective method against brain ischemia in aortic surgery. However, the possible effects of DHCA on the plasma proteins remain to be determined. In the present study, we used novel high-throughput technology to compare the plasma proteomes during DHCA (22°C) with selective cerebral perfusion (SCP, n=7) to those during normothermic cardiopulmonary bypass (CPB, n=7). Three plasma samples per patient were obtained during CPB: T1, prior to cooling; T2, during hypothermia; T3, after rewarming for the DHCA group and three corresponding points for the normothermic group. A proteomic analysis was performed using isobaric tag for relative and absolute quantification (iTRAQ) labeling tandem mass spectrometry to assess quantitative protein changes. In total, the analysis identified 262 proteins. The bioinformatics analysis revealed a significant upregulation of complement activation at T2 in normothermic CPB, which was suppressed in DHCA. These findings were confirmed by the changes of the terminal complement complex (SC5b-9) levels. At T3, however, the level of SC5b-9 showed a greater increase in DHCA compared to normothermic CPB, while 48 proteins were significantly downregulated in DHCA. The results demonstrated that DHCA and rewarming potentially exert a significant effect on the plasma proteome in patients undergoing aortic surgery. PMID:25050567

  7. Cloud CPFP: a shotgun proteomics data analysis pipeline using cloud and high performance computing.

    PubMed

    Trudgian, David C; Mirzaei, Hamid

    2012-12-07

    We have extended the functionality of the Central Proteomics Facilities Pipeline (CPFP) to allow use of remote cloud and high performance computing (HPC) resources for shotgun proteomics data processing. CPFP has been modified to include modular local and remote scheduling for data processing jobs. The pipeline can now be run on a single PC or server, a local cluster, a remote HPC cluster, and/or the Amazon Web Services (AWS) cloud. We provide public images that allow easy deployment of CPFP in its entirety in the AWS cloud. This significantly reduces the effort necessary to use the software, and allows proteomics laboratories to pay for compute time ad hoc, rather than obtaining and maintaining expensive local server clusters. Alternatively the Amazon cloud can be used to increase the throughput of a local installation of CPFP as necessary. We demonstrate that cloud CPFP allows users to process data at higher speed than local installations but with similar cost and lower staff requirements. In addition to the computational improvements, the web interface to CPFP is simplified, and other functionalities are enhanced. The software is under active development at two leading institutions and continues to be released under an open-source license at http://cpfp.sourceforge.net.

  8. Recent 5-year Findings and Technological Advances in the Proteomic Study of HIV-associated Disorders.

    PubMed

    Zhang, Lijun; Jia, Xiaofang; Jin, Jun-O; Lu, Hongzhou; Tan, Zhimi

    2017-04-01

    Human immunodeficiency virus-1 (HIV-1) mainly relies on host factors to complete its life cycle. Hence, it is very important to identify HIV-regulated host proteins. Proteomics is an excellent technique for this purpose because of its high throughput and sensitivity. In this review, we summarized current technological advances in proteomics, including general isobaric tags for relative and absolute quantitation (iTRAQ) and stable isotope labeling by amino acids in cell culture (SILAC), as well as subcellular proteomics and investigation of posttranslational modifications. Furthermore, we reviewed the applications of proteomics in the discovery of HIV-related diseases and HIV infection mechanisms. Proteins identified by proteomic studies might offer new avenues for the diagnosis and treatment of HIV infection and the related diseases. Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.

  9. Quantitative RNA-seq analysis of the Campylobacter jejuni transcriptome

    PubMed Central

    Chaudhuri, Roy R.; Yu, Lu; Kanji, Alpa; Perkins, Timothy T.; Gardner, Paul P.; Choudhary, Jyoti; Maskell, Duncan J.

    2011-01-01

    Campylobacter jejuni is the most common bacterial cause of foodborne disease in the developed world. Its general physiology and biochemistry, as well as the mechanisms enabling it to colonize and cause disease in various hosts, are not well understood, and new approaches are required to understand its basic biology. High-throughput sequencing technologies provide unprecedented opportunities for functional genomic research. Recent studies have shown that direct Illumina sequencing of cDNA (RNA-seq) is a useful technique for the quantitative and qualitative examination of transcriptomes. In this study we report RNA-seq analyses of the transcriptomes of C. jejuni (NCTC11168) and its rpoN mutant. This has allowed the identification of hitherto unknown transcriptional units, and further defines the regulon that is dependent on rpoN for expression. The analysis of the NCTC11168 transcriptome was supplemented by additional proteomic analysis using liquid chromatography-MS. The transcriptomic and proteomic datasets represent an important resource for the Campylobacter research community. PMID:21816880

  10. OncoBinder facilitates interpretation of proteomic interaction data by capturing coactivation pairs in cancer.

    PubMed

    Van Coillie, Samya; Liang, Lunxi; Zhang, Yao; Wang, Huanbin; Fang, Jing-Yuan; Xu, Jie

    2016-04-05

    High-throughput methods such as co-immunoprecipitationmass spectrometry (coIP-MS) and yeast 2 hybridization (Y2H) have suggested a broad range of unannotated protein-protein interactions (PPIs), and interpretation of these PPIs remains a challenging task. The advancements in cancer genomic researches allow for the inference of "coactivation pairs" in cancer, which may facilitate the identification of PPIs involved in cancer. Here we present OncoBinder as a tool for the assessment of proteomic interaction data based on the functional synergy of oncoproteins in cancer. This decision tree-based method combines gene mutation, copy number and mRNA expression information to infer the functional status of protein-coding genes. We applied OncoBinder to evaluate the potential binders of EGFR and ERK2 proteins based on the gastric cancer dataset of The Cancer Genome Atlas (TCGA). As a result, OncoBinder identified high confidence interactions (annotated by Kyoto Encyclopedia of Genes and Genomes (KEGG) or validated by low-throughput assays) more efficiently than co-expression based method. Taken together, our results suggest that evaluation of gene functional synergy in cancer may facilitate the interpretation of proteomic interaction data. The OncoBinder toolbox for Matlab is freely accessible online.

  11. BioInfra.Prot: A comprehensive proteomics workflow including data standardization, protein inference, expression analysis and data publication.

    PubMed

    Turewicz, Michael; Kohl, Michael; Ahrens, Maike; Mayer, Gerhard; Uszkoreit, Julian; Naboulsi, Wael; Bracht, Thilo; Megger, Dominik A; Sitek, Barbara; Marcus, Katrin; Eisenacher, Martin

    2017-11-10

    The analysis of high-throughput mass spectrometry-based proteomics data must address the specific challenges of this technology. To this end, the comprehensive proteomics workflow offered by the de.NBI service center BioInfra.Prot provides indispensable components for the computational and statistical analysis of this kind of data. These components include tools and methods for spectrum identification and protein inference, protein quantification, expression analysis as well as data standardization and data publication. All particular methods of the workflow which address these tasks are state-of-the-art or cutting edge. As has been shown in previous publications, each of these methods is adequate to solve its specific task and gives competitive results. However, the methods included in the workflow are continuously reviewed, updated and improved to adapt to new scientific developments. All of these particular components and methods are available as stand-alone BioInfra.Prot services or as a complete workflow. Since BioInfra.Prot provides manifold fast communication channels to get access to all components of the workflow (e.g., via the BioInfra.Prot ticket system: bioinfraprot@rub.de) users can easily benefit from this service and get support by experts. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  12. A comparative proteomic strategy for subcellular proteome research: ICAT approach coupled with bioinformatics prediction to ascertain rat liver mitochondrial proteins and indication of mitochondrial localization for catalase.

    PubMed

    Jiang, Xiao-Sheng; Dai, Jie; Sheng, Quan-Hu; Zhang, Lei; Xia, Qi-Chang; Wu, Jia-Rui; Zeng, Rong

    2005-01-01

    Subcellular proteomics, as an important step to functional proteomics, has been a focus in proteomic research. However, the co-purification of "contaminating" proteins has been the major problem in all the subcellular proteomic research including all kinds of mitochondrial proteome research. It is often difficult to conclude whether these "contaminants" represent true endogenous partners or artificial associations induced by cell disruption or incomplete purification. To solve such a problem, we applied a high-throughput comparative proteome experimental strategy, ICAT approach performed with two-dimensional LC-MS/MS analysis, coupled with combinational usage of different bioinformatics tools, to study the proteome of rat liver mitochondria prepared with traditional centrifugation (CM) or further purified with a Nycodenz gradient (PM). A total of 169 proteins were identified and quantified convincingly in the ICAT analysis, in which 90 proteins have an ICAT ratio of PM:CM>1.0, while another 79 proteins have an ICAT ratio of PM:CM<1.0. Almost all the proteins annotated as mitochondrial according to Swiss-Prot annotation, bioinformatics prediction, and literature reports have a ratio of PM:CM>1.0, while proteins annotated as extracellular or secreted, cytoplasmic, endoplasmic reticulum, ribosomal, and so on have a ratio of PM:CM<1.0. Catalase and AP endonuclease 1, which have been known as peroxisomal and nuclear, respectively, have shown a ratio of PM:CM>1.0, confirming the reports about their mitochondrial location. Moreover, the 125 proteins with subcellular location annotation have been used as a testing dataset to evaluate the efficiency for ascertaining mitochondrial proteins by ICAT analysis and the bioinformatics tools such as PSORT, TargetP, SubLoc, MitoProt, and Predotar. The results indicated that ICAT analysis coupled with combinational usage of different bioinformatics tools could effectively ascertain mitochondrial proteins and distinguish contaminant proteins and even multilocation proteins. Using such a strategy, many novel proteins, known proteins without subcellular location annotation, and even known proteins that have been annotated as other locations have been strongly indicated for their mitochondrial location.

  13. Pressurized Pepsin Digestion in Proteomics

    PubMed Central

    López-Ferrer, Daniel; Petritis, Konstantinos; Robinson, Errol W.; Hixson, Kim K.; Tian, Zhixin; Lee, Jung Hwa; Lee, Sang-Won; Tolić, Nikola; Weitz, Karl K.; Belov, Mikhail E.; Smith, Richard D.; Paša-Tolić, Ljiljana

    2011-01-01

    Integrated top-down bottom-up proteomics combined with on-line digestion has great potential to improve the characterization of protein isoforms in biological systems and is amendable to high throughput proteomics experiments. Bottom-up proteomics ultimately provides the peptide sequences derived from the tandem MS analyses of peptides after the proteome has been digested. Top-down proteomics conversely entails the MS analyses of intact proteins for more effective characterization of genetic variations and/or post-translational modifications. Herein, we describe recent efforts toward efficient integration of bottom-up and top-down LC-MS-based proteomics strategies. Since most proteomics separations utilize acidic conditions, we exploited the compatibility of pepsin (where the optimal digestion conditions are at low pH) for integration into bottom-up and top-down proteomics work flows. Pressure-enhanced pepsin digestions were successfully performed and characterized with several standard proteins in either an off-line mode using a Barocycler or an on-line mode using a modified high pressure LC system referred to as a fast on-line digestion system (FOLDS). FOLDS was tested using pepsin and a whole microbial proteome, and the results were compared against traditional trypsin digestions on the same platform. Additionally, FOLDS was integrated with a RePlay configuration to demonstrate an ultrarapid integrated bottom-up top-down proteomics strategy using a standard mixture of proteins and a monkey pox virus proteome. PMID:20627868

  14. Highly Efficient Proteolysis Accelerated by Electromagnetic Waves for Peptide Mapping

    PubMed Central

    Chen, Qiwen; Liu, Ting; Chen, Gang

    2011-01-01

    Proteomics will contribute greatly to the understanding of gene functions in the post-genomic era. In proteome research, protein digestion is a key procedure prior to mass spectrometry identification. During the past decade, a variety of electromagnetic waves have been employed to accelerate proteolysis. This review focuses on the recent advances and the key strategies of these novel proteolysis approaches for digesting and identifying proteins. The subjects covered include microwave-accelerated protein digestion, infrared-assisted proteolysis, ultraviolet-enhanced protein digestion, laser-assisted proteolysis, and future prospects. It is expected that these novel proteolysis strategies accelerated by various electromagnetic waves will become powerful tools in proteome research and will find wide applications in high throughput protein digestion and identification. PMID:22379392

  15. The Urine Proteome as a Biomarker of Radiation Injury: Submitted to Proteomics- Clinical Applications Special Issue: "Renal and Urinary Proteomics (Thongboonkerd)"

    PubMed

    Sharma, Mukut; Halligan, Brian D; Wakim, Bassam T; Savin, Virginia J; Cohen, Eric P; Moulder, John E

    2008-06-18

    Terrorist attacks or nuclear accidents could expose large numbers of people to ionizing radiation, and early biomarkers of radiation injury would be critical for triage, treatment and follow-up of such individuals. However, no such biomarkers have yet been proven to exist. We tested the potential of high throughput proteomics to identify protein biomarkers of radiation injury after total body X-ray irradiation in a rat model. Subtle functional changes in the kidney are suggested by an increased glomerular permeability for macromolecules measured within 24 hours after TBI. Ultrastructural changes in glomerular podocytes include partial loss of the interdigitating organization of foot processes. Analysis of urine by LC-MS/MS and 2D-GE showed significant changes in the urine proteome within 24 hours after TBI. Tissue kallikrein 1-related peptidase, cysteine proteinase inhibitor cystatin C and oxidized histidine were found to be increased while a number of proteinase inhibitors including kallikrein-binding protein and albumin were found to be decreased post-irradiation. Thus, TBI causes immediately detectable changes in renal structure and function and in the urinary protein profile. This suggests that both systemic and renal changes are induced by radiation and it may be possible to identify a set of biomarkers unique to radiation injury.

  16. Potential for proteomic approaches in determining efficacy biomarkers following administration of fish oils rich in omega-3 fatty acids: application in pancreatic cancers.

    PubMed

    Runau, Franscois; Arshad, Ali; Isherwood, John; Norris, Leonie; Howells, Lynne; Metcalfe, Matthew; Dennison, Ashley

    2015-06-01

    Pancreatic cancer is a disease with a significantly poor prognosis. Despite modern advances in other medical, surgical, and oncologic therapy, the outcome from pancreatic cancer has improved little over the last 40 years. To improve the management of this difficult disease, trials investigating the use of dietary and parenteral fish oils rich in omega-3 (ω-3) fatty acids, exhibiting proven anti-inflammatory and anticarcinogenic properties, have revealed favorable results in pancreatic cancers. Proteomics is the large-scale study of proteins that attempts to characterize the complete set of proteins encoded by the genome of an organism and that, with the use of sensitive mass spectrometric-based techniques, has allowed high-throughput analysis of the proteome to aid identification of putative biomarkers pertinent to given disease states. These biomarkers provide useful insight into potentially discovering new markers for early detection or elucidating the efficacy of treatment on pancreatic cancers. Here, our review identifies potential proteomic-based biomarkers in pancreatic cancer relating to apoptosis, cell proliferation, angiogenesis, and metabolic regulation in clinical studies. We also reviewed proteomic biomarkers from the administration of ω-3 fatty acids that act on similar anticarcinogenic pathways as above and reflect that proteomic studies on the effect of ω-3 fatty acids in pancreatic cancer will yield favorable results. © 2015 American Society for Parenteral and Enteral Nutrition.

  17. Laser Capture Microdissection of Pancreatic Acinar Cells to Identify Proteomic Alterations in a Murine Model of Caerulein-Induced Pancreatitis

    PubMed Central

    Shapiro, John P; Komar, Hannah M; Hancioglu, Baris; Yu, Lianbo; Jin, Ming; Ogata, Yuko; Hart, Phil A; Cruz-Monserrate, Zobeida; Lesinski, Gregory B; Conwell, Darwin L

    2017-01-01

    Objectives: Chronic pancreatitis (CP) is characterized by inflammation and fibrosis of the pancreas, leading to pain, parenchymal damage, and loss of exocrine and endocrine function. There are currently no curative therapies; diagnosis remains difficult and aspects of pathogenesis remain unclear. Thus, there is a need to identify novel biomarkers to improve diagnosis and understand pathophysiology. We hypothesize that pancreatic acinar regions contain proteomic signatures relevant to disease processes, including secreted proteins that could be detected in biofluids. Methods: Acini from pancreata of mice injected with or without caerulein were collected using laser capture microdissection followed by mass spectrometry analysis. This protocol enabled high-throughput analysis that captured altered protein expression throughout the stages of CP. Results: Over 2,900 proteins were identified, whereas 331 were significantly changed ≥2-fold by mass spectrometry spectral count analysis. Consistent with pathogenesis, we observed increases in proteins related to fibrosis (e.g., collagen, P<0.001), several proteases (e.g., trypsin 1, P<0.001), and altered expression of proteins associated with diminished pancreas function (e.g., lipase, amylase, P<0.05). In comparison with proteomic data from a public data set of CP patients, a significant correlation was observed between proteomic changes in tissue from both the caerulein model and CP patients (r=0.725, P<0.001). CONCLUSIONS: This study illustrates the ability to characterize proteome changes of acinar cells isolated from pancreata of caerulein-treated mice and demonstrates a relationship between signatures from murine and human CP. PMID:28406494

  18. SwellGel: an affinity chromatography technology for high-capacity and high-throughput purification of recombinant-tagged proteins.

    PubMed

    Draveling, C; Ren, L; Haney, P; Zeisse, D; Qoronfleh, M W

    2001-07-01

    The revolution in genomics and proteomics is having a profound impact on drug discovery. Today's protein scientist demands a faster, easier, more reliable way to purify proteins. A high capacity, high-throughput new technology has been developed in Perbio Sciences for affinity protein purification. This technology utilizes selected chromatography media that are dehydrated to form uniform aggregates. The SwellGel aggregates will instantly rehydrate upon addition of the protein sample, allowing purification and direct performance of multiple assays in a variety of formats. SwellGel technology has greater stability and is easier to handle than standard wet chromatography resins. The microplate format of this technology provides high-capacity, high-throughput features, recovering milligram quantities of protein suitable for high-throughput screening or biophysical/structural studies. Data will be presented applying SwellGel technology to recombinant 6x His-tagged protein and glutathione-S-transferase (GST) fusion protein purification. Copyright 2001 Academic Press.

  19. Standard Reporting Requirements for Biological Samples in Metabolomics Experiments: Environmental Context

    EPA Science Inventory

    Metabolomic technologies are increasingly being applied to study biological questions in a range of different settings from clinical through to environmental. As with other high-throughput technologies, such as those used in transcriptomics and proteomics, metabolomics continues...

  20. Quantitative trait loci mapping of the mouse plasma proteome (pQTL).

    PubMed

    Holdt, Lesca M; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel

    2013-02-01

    A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F(2) intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins.

  1. Quantitative Trait Loci Mapping of the Mouse Plasma Proteome (pQTL)

    PubMed Central

    Holdt, Lesca M.; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel

    2013-01-01

    A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F2 intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins. PMID:23172855

  2. Enhancing Bottom-up and Top-down Proteomic Measurements with Ion Mobility Separations

    DOE PAGES

    Baker, Erin Shammel; Burnum-Johnson, Kristin E.; Ibrahim, Yehia M.; ...

    2015-07-03

    Proteomic measurements with greater throughput, sensitivity and additional structural information enhance the in-depth characterization of complex mixtures and targeted studies with additional information and higher confidence. While liquid chromatography separation coupled with mass spectrometry (LC-MS) measurements have provided information on thousands of proteins in different sample types, the additional of another rapid separation stage providing structural information has many benefits for analyses. Technical advances in ion funnels and multiplexing have enabled ion mobility separations to be easily and effectively coupled with LC-MS proteomics to enhance the information content of measurements. Finally, herein, we report on applications illustrating increased sensitivity, throughput,more » and structural information by utilizing IMS-MS and LC-IMS-MS measurements for both bottom-up and top-down proteomics measurements.« less

  3. Clinical validation of an ultra high-throughput spiral microfluidics for the detection and enrichment of viable circulating tumor cells.

    PubMed

    Khoo, Bee Luan; Warkiani, Majid Ebrahimi; Tan, Daniel Shao-Weng; Bhagat, Ali Asgar S; Irwin, Darryl; Lau, Dawn Pingxi; Lim, Alvin S T; Lim, Kiat Hon; Krisna, Sai Sakktee; Lim, Wan-Teck; Yap, Yoon Sim; Lee, Soo Chin; Soo, Ross A; Han, Jongyoon; Lim, Chwee Teck

    2014-01-01

    Circulating tumor cells (CTCs) are cancer cells that can be isolated via liquid biopsy from blood and can be phenotypically and genetically characterized to provide critical information for guiding cancer treatment. Current analysis of CTCs is hindered by the throughput, selectivity and specificity of devices or assays used in CTC detection and isolation. Here, we enriched and characterized putative CTCs from blood samples of patients with both advanced stage metastatic breast and lung cancers using a novel multiplexed spiral microfluidic chip. This system detected putative CTCs under high sensitivity (100%, n = 56) (Breast cancer samples: 12-1275 CTCs/ml; Lung cancer samples: 10-1535 CTCs/ml) rapidly from clinically relevant blood volumes (7.5 ml under 5 min). Blood samples were completely separated into plasma, CTCs and PBMCs components and each fraction were characterized with immunophenotyping (Pan-cytokeratin/CD45, CD44/CD24, EpCAM), fluorescence in-situ hybridization (FISH) (EML4-ALK) or targeted somatic mutation analysis. We used an ultra-sensitive mass spectrometry based system to highlight the presence of an EGFR-activating mutation in both isolated CTCs and plasma cell-free DNA (cf-DNA), and demonstrate concordance with the original tumor-biopsy samples. We have clinically validated our multiplexed microfluidic chip for the ultra high-throughput, low-cost and label-free enrichment of CTCs. Retrieved cells were unlabeled and viable, enabling potential propagation and real-time downstream analysis using next generation sequencing (NGS) or proteomic analysis.

  4. A Combined Proteomic and Transcriptomic Analysis on Sulfur Metabolism Pathways of Arabidopsis thaliana under Simulated Acid Rain

    PubMed Central

    Wang, Wenhua; Simon, Martin; Wu, Feihua; Hu, Wenjun; Chen, Juan B.; Zheng, Hailei

    2014-01-01

    With rapid economic development, most regions in southern China have suffered acid rain (AR) pollution. In our study, we analyzed the changes in sulfur metabolism in Arabidopsis under simulated AR stress which provide one of the first case studies, in which the systematic responses in sulfur metabolism were characterized by high-throughput methods at different levels including proteomic, genomic and physiological approaches. Generally, we found that all of the processes related to sulfur metabolism responded to AR stress, including sulfur uptake, activation and also synthesis of sulfur-containing amino acid and other secondary metabolites. Finally, we provided a catalogue of the detected sulfur metabolic changes and reconstructed the coordinating network of their mutual influences. This study can help us to understand the mechanisms of plants to adapt to AR stress. PMID:24595051

  5. Quantitative proteomics in teleost fish: insights and challenges for neuroendocrine and neurotoxicology research.

    PubMed

    Martyniuk, Christopher J; Popesku, Jason T; Chown, Brittany; Denslow, Nancy D; Trudeau, Vance L

    2012-05-01

    Neuroendocrine systems integrate both extrinsic and intrinsic signals to regulate virtually all aspects of an animal's physiology. In aquatic toxicology, studies have shown that pollutants are capable of disrupting the neuroendocrine system of teleost fish, and many chemicals found in the environment can also have a neurotoxic mode of action. Omics approaches are now used to better understand cell signaling cascades underlying fish neurophysiology and the control of pituitary hormone release, in addition to identifying adverse effects of pollutants in the teleostean central nervous system. For example, both high throughput genomics and proteomic investigations of molecular signaling cascades for both neurotransmitter and nuclear receptor agonists/antagonists have been reported. This review highlights recent studies that have utilized quantitative proteomics methods such as 2D differential in-gel electrophoresis (DIGE) and isobaric tagging for relative and absolute quantitation (iTRAQ) in neuroendocrine regions and uses these examples to demonstrate the challenges of using proteomics in neuroendocrinology and neurotoxicology research. To begin to characterize the teleost neuroproteome, we functionally annotated 623 unique proteins found in the fish hypothalamus and telencephalon. These proteins have roles in biological processes that include synaptic transmission, ATP production, receptor activity, cell structure and integrity, and stress responses. The biological processes most represented by proteins detected in the teleost neuroendocrine brain included transport (8.4%), metabolic process (5.5%), and glycolysis (4.8%). We provide an example of using sub-network enrichment analysis (SNEA) to identify protein networks in the fish hypothalamus in response to dopamine receptor signaling. Dopamine signaling altered the abundance of proteins that are binding partners of microfilaments, integrins, and intermediate filaments, consistent with data suggesting dopaminergic regulation of neuronal stability and structure. Lastly, for fish neuroendocrine studies using both high-throughput genomics and proteomics, we compare gene and protein relationships in the hypothalamus and demonstrate that correlation is often poor for single time point experiments. These studies highlight the need for additional time course analyses to better understand gene-protein relationships and adverse outcome pathways. This is important if both transcriptomics and proteomics are to be used together to investigate neuroendocrine signaling pathways or as bio-monitoring tools in ecotoxicology. Copyright © 2011 Elsevier Inc. All rights reserved.

  6. Identifier mapping performance for integrating transcriptomics and proteomics experimental results

    PubMed Central

    2011-01-01

    Background Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. Results We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. Conclusions The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging. PMID:21619611

  7. Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival.

    PubMed

    Nicolau, Monica; Levine, Arnold J; Carlsson, Gunnar

    2011-04-26

    High-throughput biological data, whether generated as sequencing, transcriptional microarrays, proteomic, or other means, continues to require analytic methods that address its high dimensional aspects. Because the computational part of data analysis ultimately identifies shape characteristics in the organization of data sets, the mathematics of shape recognition in high dimensions continues to be a crucial part of data analysis. This article introduces a method that extracts information from high-throughput microarray data and, by using topology, provides greater depth of information than current analytic techniques. The method, termed Progression Analysis of Disease (PAD), first identifies robust aspects of cluster analysis, then goes deeper to find a multitude of biologically meaningful shape characteristics in these data. Additionally, because PAD incorporates a visualization tool, it provides a simple picture or graph that can be used to further explore these data. Although PAD can be applied to a wide range of high-throughput data types, it is used here as an example to analyze breast cancer transcriptional data. This identified a unique subgroup of Estrogen Receptor-positive (ER(+)) breast cancers that express high levels of c-MYB and low levels of innate inflammatory genes. These patients exhibit 100% survival and no metastasis. No supervised step beyond distinction between tumor and healthy patients was used to identify this subtype. The group has a clear and distinct, statistically significant molecular signature, it highlights coherent biology but is invisible to cluster methods, and does not fit into the accepted classification of Luminal A/B, Normal-like subtypes of ER(+) breast cancers. We denote the group as c-MYB(+) breast cancer.

  8. pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification.

    PubMed

    Liu, Ming-Qi; Zeng, Wen-Feng; Fang, Pan; Cao, Wei-Qian; Liu, Chao; Yan, Guo-Quan; Zhang, Yang; Peng, Chao; Wu, Jian-Qiang; Zhang, Xiao-Jin; Tu, Hui-Jun; Chi, Hao; Sun, Rui-Xiang; Cao, Yong; Dong, Meng-Qiu; Jiang, Bi-Yun; Huang, Jiang-Ming; Shen, Hua-Li; Wong, Catherine C L; He, Si-Min; Yang, Peng-Yuan

    2017-09-05

    The precise and large-scale identification of intact glycopeptides is a critical step in glycoproteomics. Owing to the complexity of glycosylation, the current overall throughput, data quality and accessibility of intact glycopeptide identification lack behind those in routine proteomic analyses. Here, we propose a workflow for the precise high-throughput identification of intact N-glycopeptides at the proteome scale using stepped-energy fragmentation and a dedicated search engine. pGlyco 2.0 conducts comprehensive quality control including false discovery rate evaluation at all three levels of matches to glycans, peptides and glycopeptides, improving the current level of accuracy of intact glycopeptide identification. The N-glycoproteome of samples metabolically labeled with 15 N/ 13 C were analyzed quantitatively and utilized to validate the glycopeptide identification, which could be used as a novel benchmark pipeline to compare different search engines. Finally, we report a large-scale glycoproteome dataset consisting of 10,009 distinct site-specific N-glycans on 1988 glycosylation sites from 955 glycoproteins in five mouse tissues.Protein glycosylation is a heterogeneous post-translational modification that generates greater proteomic diversity that is difficult to analyze. Here the authors describe pGlyco 2.0, a workflow for the precise one step identification of intact N-glycopeptides at the proteome scale.

  9. Biofuels as a sustainable energy source: an update of the applications of proteomics in bioenergy crops and algae.

    PubMed

    Ndimba, Bongani Kaiser; Ndimba, Roya Janeen; Johnson, T Sudhakar; Waditee-Sirisattha, Rungaroon; Baba, Masato; Sirisattha, Sophon; Shiraiwa, Yoshihiro; Agrawal, Ganesh Kumar; Rakwal, Randeep

    2013-11-20

    Sustainable energy is the need of the 21st century, not because of the numerous environmental and political reasons but because it is necessary to human civilization's energy future. Sustainable energy is loosely grouped into renewable energy, energy conservation, and sustainable transport disciplines. In this review, we deal with the renewable energy aspect focusing on the biomass from bioenergy crops to microalgae to produce biofuels to the utilization of high-throughput omics technologies, in particular proteomics in advancing our understanding and increasing biofuel production. We look at biofuel production by plant- and algal-based sources, and the role proteomics has played therein. This article is part of a Special Issue entitled: Translational Plant Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Assessing the Exoproteome of Marine Bacteria, Lesson from a RTX-Toxin Abundantly Secreted by Phaeobacter Strain DSM 17395

    PubMed Central

    Durighello, Emie; Christie-Oleza, Joseph Alexander; Armengaud, Jean

    2014-01-01

    Bacteria from the Roseobacter clade are abundant in surface marine ecosystems as over 10% of bacterial cells in the open ocean and 20% in coastal waters belong to this group. In order to document how these marine bacteria interact with their environment, we analyzed the exoproteome of Phaeobacter strain DSM 17395. We grew the strain in marine medium, collected the exoproteome and catalogued its content with high-throughput nanoLC-MS/MS shotgun proteomics. The major component represented 60% of the total protein content but was refractory to either classical proteomic identification or proteogenomics. We de novo sequenced this abundant protein with high-resolution tandem mass spectra which turned out being the 53 kDa RTX-toxin ZP_02147451. It comprised a peptidase M10 serralysin domain. We explained its recalcitrance to trypsin proteolysis and proteomic identification by its unusual low number of basic residues. We found this is a conserved trait in RTX-toxins from Roseobacter strains which probably explains their persistence in the harsh conditions around bacteria. Comprehensive analysis of exoproteomes from environmental bacteria should take into account this proteolytic recalcitrance. PMID:24586966

  11. Assessing the exoproteome of marine bacteria, lesson from a RTX-toxin abundantly secreted by Phaeobacter strain DSM 17395.

    PubMed

    Durighello, Emie; Christie-Oleza, Joseph Alexander; Armengaud, Jean

    2014-01-01

    Bacteria from the Roseobacter clade are abundant in surface marine ecosystems as over 10% of bacterial cells in the open ocean and 20% in coastal waters belong to this group. In order to document how these marine bacteria interact with their environment, we analyzed the exoproteome of Phaeobacter strain DSM 17395. We grew the strain in marine medium, collected the exoproteome and catalogued its content with high-throughput nanoLC-MS/MS shotgun proteomics. The major component represented 60% of the total protein content but was refractory to either classical proteomic identification or proteogenomics. We de novo sequenced this abundant protein with high-resolution tandem mass spectra which turned out being the 53 kDa RTX-toxin ZP_02147451. It comprised a peptidase M10 serralysin domain. We explained its recalcitrance to trypsin proteolysis and proteomic identification by its unusual low number of basic residues. We found this is a conserved trait in RTX-toxins from Roseobacter strains which probably explains their persistence in the harsh conditions around bacteria. Comprehensive analysis of exoproteomes from environmental bacteria should take into account this proteolytic recalcitrance.

  12. Automation, parallelism, and robotics for proteomics.

    PubMed

    Alterovitz, Gil; Liu, Jonathan; Chow, Jijun; Ramoni, Marco F

    2006-07-01

    The speed of the human genome project (Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C. et al., Nature 2001, 409, 860-921) was made possible, in part, by developments in automation of sequencing technologies. Before these technologies, sequencing was a laborious, expensive, and personnel-intensive task. Similarly, automation and robotics are changing the field of proteomics today. Proteomics is defined as the effort to understand and characterize proteins in the categories of structure, function and interaction (Englbrecht, C. C., Facius, A., Comb. Chem. High Throughput Screen. 2005, 8, 705-715). As such, this field nicely lends itself to automation technologies since these methods often require large economies of scale in order to achieve cost and time-saving benefits. This article describes some of the technologies and methods being applied in proteomics in order to facilitate automation within the field as well as in linking proteomics-based information with other related research areas.

  13. SwissPalm: Protein Palmitoylation database.

    PubMed

    Blanc, Mathieu; David, Fabrice; Abrami, Laurence; Migliozzi, Daniel; Armand, Florence; Bürgi, Jérôme; van der Goot, Françoise Gisou

    2015-01-01

    Protein S-palmitoylation is a reversible post-translational modification that regulates many key biological processes, although the full extent and functions of protein S-palmitoylation remain largely unexplored. Recent developments of new chemical methods have allowed the establishment of palmitoyl-proteomes of a variety of cell lines and tissues from different species.  As the amount of information generated by these high-throughput studies is increasing, the field requires centralization and comparison of this information. Here we present SwissPalm ( http://swisspalm.epfl.ch), our open, comprehensive, manually curated resource to study protein S-palmitoylation. It currently encompasses more than 5000 S-palmitoylated protein hits from seven species, and contains more than 500 specific sites of S-palmitoylation. SwissPalm also provides curated information and filters that increase the confidence in true positive hits, and integrates predictions of S-palmitoylated cysteine scores, orthologs and isoform multiple alignments. Systems analysis of the palmitoyl-proteome screens indicate that 10% or more of the human proteome is susceptible to S-palmitoylation. Moreover, ontology and pathway analyses of the human palmitoyl-proteome reveal that key biological functions involve this reversible lipid modification. Comparative analysis finally shows a strong crosstalk between S-palmitoylation and other post-translational modifications. Through the compilation of data and continuous updates, SwissPalm will provide a powerful tool to unravel the global importance of protein S-palmitoylation.

  14. SwissPalm: Protein Palmitoylation database

    PubMed Central

    Abrami, Laurence; Migliozzi, Daniel; Armand, Florence; Bürgi, Jérôme; van der Goot, Françoise Gisou

    2015-01-01

    Protein S-palmitoylation is a reversible post-translational modification that regulates many key biological processes, although the full extent and functions of protein S-palmitoylation remain largely unexplored. Recent developments of new chemical methods have allowed the establishment of palmitoyl-proteomes of a variety of cell lines and tissues from different species.  As the amount of information generated by these high-throughput studies is increasing, the field requires centralization and comparison of this information. Here we present SwissPalm ( http://swisspalm.epfl.ch), our open, comprehensive, manually curated resource to study protein S-palmitoylation. It currently encompasses more than 5000 S-palmitoylated protein hits from seven species, and contains more than 500 specific sites of S-palmitoylation. SwissPalm also provides curated information and filters that increase the confidence in true positive hits, and integrates predictions of S-palmitoylated cysteine scores, orthologs and isoform multiple alignments. Systems analysis of the palmitoyl-proteome screens indicate that 10% or more of the human proteome is susceptible to S-palmitoylation. Moreover, ontology and pathway analyses of the human palmitoyl-proteome reveal that key biological functions involve this reversible lipid modification. Comparative analysis finally shows a strong crosstalk between S-palmitoylation and other post-translational modifications. Through the compilation of data and continuous updates, SwissPalm will provide a powerful tool to unravel the global importance of protein S-palmitoylation. PMID:26339475

  15. High-throughput molecular analysis in lung cancer: insights into biology and potential clinical applications.

    PubMed

    Ocak, S; Sos, M L; Thomas, R K; Massion, P P

    2009-08-01

    During the last decade, high-throughput technologies including genomic, epigenomic, transcriptomic and proteomic have been applied to further our understanding of the molecular pathogenesis of this heterogeneous disease, and to develop strategies that aim to improve the management of patients with lung cancer. Ultimately, these approaches should lead to sensitive, specific and noninvasive methods for early diagnosis, and facilitate the prediction of response to therapy and outcome, as well as the identification of potential novel therapeutic targets. Genomic studies were the first to move this field forward by providing novel insights into the molecular biology of lung cancer and by generating candidate biomarkers of disease progression. Lung carcinogenesis is driven by genetic and epigenetic alterations that cause aberrant gene function; however, the challenge remains to pinpoint the key regulatory control mechanisms and to distinguish driver from passenger alterations that may have a small but additive effect on cancer development. Epigenetic regulation by DNA methylation and histone modifications modulate chromatin structure and, in turn, either activate or silence gene expression. Proteomic approaches critically complement these molecular studies, as the phenotype of a cancer cell is determined by proteins and cannot be predicted by genomics or transcriptomics alone. The present article focuses on the technological platforms available and some proposed clinical applications. We illustrate herein how the "-omics" have revolutionised our approach to lung cancer biology and hold promise for personalised management of lung cancer.

  16. Nucleic Acids for Ultra-Sensitive Protein Detection

    PubMed Central

    Janssen, Kris P. F.; Knez, Karel; Spasic, Dragana; Lammertyn, Jeroen

    2013-01-01

    Major advancements in molecular biology and clinical diagnostics cannot be brought about strictly through the use of genomics based methods. Improved methods for protein detection and proteomic screening are an absolute necessity to complement to wealth of information offered by novel, high-throughput sequencing technologies. Only then will it be possible to advance insights into clinical processes and to characterize the importance of specific protein biomarkers for disease detection or the realization of “personalized medicine”. Currently however, large-scale proteomic information is still not as easily obtained as its genomic counterpart, mainly because traditional antibody-based technologies struggle to meet the stringent sensitivity and throughput requirements that are required whereas mass-spectrometry based methods might be burdened by significant costs involved. However, recent years have seen the development of new biodetection strategies linking nucleic acids with existing antibody technology or replacing antibodies with oligonucleotide recognition elements altogether. These advancements have unlocked many new strategies to lower detection limits and dramatically increase throughput of protein detection assays. In this review, an overview of these new strategies will be given. PMID:23337338

  17. SpirPro: A Spirulina proteome database and web-based tools for the analysis of protein-protein interactions at the metabolic level in Spirulina (Arthrospira) platensis C1.

    PubMed

    Senachak, Jittisak; Cheevadhanarak, Supapon; Hongsthong, Apiradee

    2015-07-29

    Spirulina (Arthrospira) platensis is the only cyanobacterium that in addition to being studied at the molecular level and subjected to gene manipulation, can also be mass cultivated in outdoor ponds for commercial use as a food supplement. Thus, encountering environmental changes, including temperature stresses, is common during the mass production of Spirulina. The use of cyanobacteria as an experimental platform, especially for photosynthetic gene manipulation in plants and bacteria, is becoming increasingly important. Understanding the mechanisms and protein-protein interaction networks that underlie low- and high-temperature responses is relevant to Spirulina mass production. To accomplish this goal, high-throughput techniques such as OMICs analyses are used. Thus, large datasets must be collected, managed and subjected to information extraction. Therefore, databases including (i) proteomic analysis and protein-protein interaction (PPI) data and (ii) domain/motif visualization tools are required for potential use in temperature response models for plant chloroplasts and photosynthetic bacteria. A web-based repository was developed including an embedded database, SpirPro, and tools for network visualization. Proteome data were analyzed integrated with protein-protein interactions and/or metabolic pathways from KEGG. The repository provides various information, ranging from raw data (2D-gel images) to associated results, such as data from interaction and/or pathway analyses. This integration allows in silico analyses of protein-protein interactions affected at the metabolic level and, particularly, analyses of interactions between and within the affected metabolic pathways under temperature stresses for comparative proteomic analysis. The developed tool, which is coded in HTML with CSS/JavaScript and depicted in Scalable Vector Graphics (SVG), is designed for interactive analysis and exploration of the constructed network. SpirPro is publicly available on the web at http://spirpro.sbi.kmutt.ac.th . SpirPro is an analysis platform containing an integrated proteome and PPI database that provides the most comprehensive data on this cyanobacterium at the systematic level. As an integrated database, SpirPro can be applied in various analyses, such as temperature stress response networking analysis in cyanobacterial models and interacting domain-domain analysis between proteins of interest.

  18. A set of ligation-independent in vitro translation vectors for eukaryotic protein production.

    PubMed

    Bardóczy, Viola; Géczi, Viktória; Sawasaki, Tatsuya; Endo, Yaeta; Mészáros, Tamás

    2008-03-27

    The last decade has brought the renaissance of protein studies and accelerated the development of high-throughput methods in all aspects of proteomics. Presently, most protein synthesis systems exploit the capacity of living cells to translate proteins, but their application is limited by several factors. A more flexible alternative protein production method is the cell-free in vitro protein translation. Currently available in vitro translation systems are suitable for high-throughput robotic protein production, fulfilling the requirements of proteomics studies. Wheat germ extract based in vitro translation system is likely the most promising method, since numerous eukaryotic proteins can be cost-efficiently synthesized in their native folded form. Although currently available vectors for wheat embryo in vitro translation systems ensure high productivity, they do not meet the requirements of state-of-the-art proteomics. Target genes have to be inserted using restriction endonucleases and the plasmids do not encode cleavable affinity purification tags. We designed four ligation independent cloning (LIC) vectors for wheat germ extract based in vitro protein translation. In these constructs, the RNA transcription is driven by T7 or SP6 phage polymerase and two TEV protease cleavable affinity tags can be added to aid protein purification. To evaluate our improved vectors, a plant mitogen activated protein kinase was cloned in all four constructs. Purification of this eukaryotic protein kinase demonstrated that all constructs functioned as intended: insertion of PCR fragment by LIC worked efficiently, affinity purification of translated proteins by GST-Sepharose or MagneHis particles resulted in high purity kinase, and the affinity tags could efficiently be removed under different reaction conditions. Furthermore, high in vitro kinase activity testified of proper folding of the purified protein. Four newly designed in vitro translation vectors have been constructed which allow fast and parallel cloning and protein purification, thus representing useful molecular tools for high-throughput production of eukaryotic proteins.

  19. Integrated network analysis and effective tools in plant systems biology

    PubMed Central

    Fukushima, Atsushi; Kanaya, Shigehiko; Nishida, Kozo

    2014-01-01

    One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms. PMID:25408696

  20. Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene-induced Signaling

    PubMed Central

    Huang, Shao-shan Carol; Clarke, David C.; Gosline, Sara J. C.; Labadorf, Adam; Chouinard, Candace R.; Gordon, William; Lauffenburger, Douglas A.; Fraenkel, Ernest

    2013-01-01

    Cellular signal transduction generally involves cascades of post-translational protein modifications that rapidly catalyze changes in protein-DNA interactions and gene expression. High-throughput measurements are improving our ability to study each of these stages individually, but do not capture the connections between them. Here we present an approach for building a network of physical links among these data that can be used to prioritize targets for pharmacological intervention. Our method recovers the critical missing links between proteomic and transcriptional data by relating changes in chromatin accessibility to changes in expression and then uses these links to connect proteomic and transcriptome data. We applied our approach to integrate epigenomic, phosphoproteomic and transcriptome changes induced by the variant III mutation of the epidermal growth factor receptor (EGFRvIII) in a cell line model of glioblastoma multiforme (GBM). To test the relevance of the network, we used small molecules to target highly connected nodes implicated by the network model that were not detected by the experimental data in isolation and we found that a large fraction of these agents alter cell viability. Among these are two compounds, ICG-001, targeting CREB binding protein (CREBBP), and PKF118–310, targeting β-catenin (CTNNB1), which have not been tested previously for effectiveness against GBM. At the level of transcriptional regulation, we used chromatin immunoprecipitation sequencing (ChIP-Seq) to experimentally determine the genome-wide binding locations of p300, a transcriptional co-regulator highly connected in the network. Analysis of p300 target genes suggested its role in tumorigenesis. We propose that this general method, in which experimental measurements are used as constraints for building regulatory networks from the interactome while taking into account noise and missing data, should be applicable to a wide range of high-throughput datasets. PMID:23408876

  1. Proteomic analysis reveals strong mitochondrial involvement in cytoplasmic male sterility of pepper (Capsicum annuum L.).

    PubMed

    Guo, Jinju; Wang, Peng; Cheng, Qing; Sun, Limin; Wang, Hongyu; Wang, Yutong; Kao, Lina; Li, Yanan; Qiu, Tuoyu; Yang, Wencai; Shen, Huolin

    2017-09-25

    Although cytoplasmic male sterility (CMS) is widely used for developing pepper hybrids, its molecular mechanism remains unclear. In this study, we used a high-throughput proteomics method called label-free to compare protein abundance across a pepper CMS line (A-line) and its isogenic maintainer line (B-line). Data are available via ProteomeXchange with identifier PXD006104. Approximately 324 differentially abundant protein species were identified and quantified; among which, 47 were up-accumulated and 140 were down-accumulated in the A-line; additionally, 75 and 62 protein species were specifically accumulated in the A-line and B-line, respectively. Protein species involved in pollen exine formation, pyruvate metabolic processes, the tricarboxylic acid cycle, the mitochondrial electron transport chain, and oxidative stress response were observed to be differentially accumulated between A-line and B-line, suggesting their potential roles in the regulation of pepper pollen abortion. Based on our data, we proposed a potential regulatory network for pepper CMS that unifies these processes. Artificial emasculation is a major obstacle in pepper hybrid breeding for its high labor cost and poor seed purity. While the use of cytoplasmic male sterility (CMS) in hybrid system is seriously frustrated because a long time is needed to cultivate male sterility line and its isogenic restore line. Transgenic technology is an effective and rapid method to obtain male sterility lines and its widely application has very important significance in speeding up breeding process in pepper. Although numerous studies have been conducted to select the genes related to male sterility, the molecular mechanism of cytoplasmic male sterility in pepper remains unknown. In this study, we used the high-throughput proteomic method called "label-free", coupled with liquid chromatography-quadrupole mass spectrometry (LC-MS/MS), to perform a novel comparison of expression profiles in a CMS pepper line and its maintainer line. Based on our results, we proposed a potential regulated protein network involved in pollen development as a novel mechanism of pepper CMS. Copyright © 2017. Published by Elsevier B.V.

  2. Wheat proteomics: proteome modulation and abiotic stress acclimation

    PubMed Central

    Komatsu, Setsuko; Kamal, Abu H. M.; Hossain, Zahed

    2014-01-01

    Cellular mechanisms of stress sensing and signaling represent the initial plant responses to adverse conditions. The development of high-throughput “Omics” techniques has initiated a new era of the study of plant molecular strategies for adapting to environmental changes. However, the elucidation of stress adaptation mechanisms in plants requires the accurate isolation and characterization of stress-responsive proteins. Because the functional part of the genome, namely the proteins and their post-translational modifications, are critical for plant stress responses, proteomic studies provide comprehensive information about the fine-tuning of cellular pathways that primarily involved in stress mitigation. This review summarizes the major proteomic findings related to alterations in the wheat proteomic profile in response to abiotic stresses. Moreover, the strengths and weaknesses of different sample preparation techniques, including subcellular protein extraction protocols, are discussed in detail. The continued development of proteomic approaches in combination with rapidly evolving bioinformatics tools and interactive databases will facilitate understanding of the plant mechanisms underlying stress tolerance. PMID:25538718

  3. msBiodat analysis tool, big data analysis for high-throughput experiments.

    PubMed

    Muñoz-Torres, Pau M; Rokć, Filip; Belužic, Robert; Grbeša, Ivana; Vugrek, Oliver

    2016-01-01

    Mass spectrometry (MS) are a group of a high-throughput techniques used to increase knowledge about biomolecules. They produce a large amount of data which is presented as a list of hundreds or thousands of proteins. Filtering those data efficiently is the first step for extracting biologically relevant information. The filtering may increase interest by merging previous data with the data obtained from public databases, resulting in an accurate list of proteins which meet the predetermined conditions. In this article we present msBiodat Analysis Tool, a web-based application thought to approach proteomics to the big data analysis. With this tool, researchers can easily select the most relevant information from their MS experiments using an easy-to-use web interface. An interesting feature of msBiodat analysis tool is the possibility of selecting proteins by its annotation on Gene Ontology using its Gene Id, ensembl or UniProt codes. The msBiodat analysis tool is a web-based application that allows researchers with any programming experience to deal with efficient database querying advantages. Its versatility and user-friendly interface makes easy to perform fast and accurate data screening by using complex queries. Once the analysis is finished, the result is delivered by e-mail. msBiodat analysis tool is freely available at http://msbiodata.irb.hr.

  4. Protein mass spectra data analysis for clinical biomarker discovery: a global review.

    PubMed

    Roy, Pascal; Truntzer, Caroline; Maucort-Boulch, Delphine; Jouve, Thomas; Molinari, Nicolas

    2011-03-01

    The identification of new diagnostic or prognostic biomarkers is one of the main aims of clinical cancer research. In recent years there has been a growing interest in using high throughput technologies for the detection of such biomarkers. In particular, mass spectrometry appears as an exciting tool with great potential. However, to extract any benefit from the massive potential of clinical proteomic studies, appropriate methods, improvement and validation are required. To better understand the key statistical points involved with such studies, this review presents the main data analysis steps of protein mass spectra data analysis, from the pre-processing of the data to the identification and validation of biomarkers.

  5. Comparative and Quantitative Global Proteomics Approaches: An Overview

    PubMed Central

    Deracinois, Barbara; Flahaut, Christophe; Duban-Deweer, Sophie; Karamanos, Yannis

    2013-01-01

    Proteomics became a key tool for the study of biological systems. The comparison between two different physiological states allows unravelling the cellular and molecular mechanisms involved in a biological process. Proteomics can confirm the presence of proteins suggested by their mRNA content and provides a direct measure of the quantity present in a cell. Global and targeted proteomics strategies can be applied. Targeted proteomics strategies limit the number of features that will be monitored and then optimise the methods to obtain the highest sensitivity and throughput for a huge amount of samples. The advantage of global proteomics strategies is that no hypothesis is required, other than a measurable difference in one or more protein species between the samples. Global proteomics methods attempt to separate quantify and identify all the proteins from a given sample. This review highlights only the different techniques of separation and quantification of proteins and peptides, in view of a comparative and quantitative global proteomics analysis. The in-gel and off-gel quantification of proteins will be discussed as well as the corresponding mass spectrometry technology. The overview is focused on the widespread techniques while keeping in mind that each approach is modular and often recovers the other. PMID:28250403

  6. Proteomics-based compositional analysis of complex cellulase-hemicellulase mixtures.

    PubMed

    Chundawat, Shishir P S; Lipton, Mary S; Purvine, Samuel O; Uppugundla, Nirmal; Gao, Dahai; Balan, Venkatesh; Dale, Bruce E

    2011-10-07

    Efficient deconstruction of cellulosic biomass to fermentable sugars for fuel and chemical production is accomplished by a complex mixture of cellulases, hemicellulases, and accessory enzymes (e.g., >50 extracellular proteins). Cellulolytic enzyme mixtures, produced industrially mostly using fungi like Trichoderma reesei, are poorly characterized in terms of their protein composition and its correlation to hydrolytic activity on cellulosic biomass. The secretomes of commercial glycosyl hydrolase-producing microbes was explored using a proteomics approach with high-throughput quantification using liquid chromatography-tandem mass spectrometry (LC-MS/MS). Here, we show that proteomics-based spectral counting approach is a reasonably accurate and rapid analytical technique that can be used to determine protein composition of complex glycosyl hydrolase mixtures that also correlates with the specific activity of individual enzymes present within the mixture. For example, a strong linear correlation was seen between Avicelase activity and total cellobiohydrolase content. Reliable, quantitative and cheaper analytical methods that provide insight into the cellulosic biomass degrading fungal and bacterial secretomes would lead to further improvements toward commercialization of plant biomass-derived fuels and chemicals.

  7. Assembling proteomics data as a prerequisite for the analysis of large scale experiments

    PubMed Central

    Schmidt, Frank; Schmid, Monika; Thiede, Bernd; Pleißner, Klaus-Peter; Böhme, Martina; Jungblut, Peter R

    2009-01-01

    Background Despite the complete determination of the genome sequence of a huge number of bacteria, their proteomes remain relatively poorly defined. Beside new methods to increase the number of identified proteins new database applications are necessary to store and present results of large- scale proteomics experiments. Results In the present study, a database concept has been developed to address these issues and to offer complete information via a web interface. In our concept, the Oracle based data repository system SQL-LIMS plays the central role in the proteomics workflow and was applied to the proteomes of Mycobacterium tuberculosis, Helicobacter pylori, Salmonella typhimurium and protein complexes such as 20S proteasome. Technical operations of our proteomics labs were used as the standard for SQL-LIMS template creation. By means of a Java based data parser, post-processed data of different approaches, such as LC/ESI-MS, MALDI-MS and 2-D gel electrophoresis (2-DE), were stored in SQL-LIMS. A minimum set of the proteomics data were transferred in our public 2D-PAGE database using a Java based interface (Data Transfer Tool) with the requirements of the PEDRo standardization. Furthermore, the stored proteomics data were extractable out of SQL-LIMS via XML. Conclusion The Oracle based data repository system SQL-LIMS played the central role in the proteomics workflow concept. Technical operations of our proteomics labs were used as standards for SQL-LIMS templates. Using a Java based parser, post-processed data of different approaches such as LC/ESI-MS, MALDI-MS and 1-DE and 2-DE were stored in SQL-LIMS. Thus, unique data formats of different instruments were unified and stored in SQL-LIMS tables. Moreover, a unique submission identifier allowed fast access to all experimental data. This was the main advantage compared to multi software solutions, especially if personnel fluctuations are high. Moreover, large scale and high-throughput experiments must be managed in a comprehensive repository system such as SQL-LIMS, to query results in a systematic manner. On the other hand, these database systems are expensive and require at least one full time administrator and specialized lab manager. Moreover, the high technical dynamics in proteomics may cause problems to adjust new data formats. To summarize, SQL-LIMS met the requirements of proteomics data handling especially in skilled processes such as gel-electrophoresis or mass spectrometry and fulfilled the PSI standardization criteria. The data transfer into a public domain via DTT facilitated validation of proteomics data. Additionally, evaluation of mass spectra by post-processing using MS-Screener improved the reliability of mass analysis and prevented storage of data junk. PMID:19166578

  8. Multiplexed evaluation of capture agent binding kinetics using arrays of silicon photonic microring resonators.

    PubMed

    Byeon, Ji-Yeon; Bailey, Ryan C

    2011-09-07

    High affinity capture agents recognizing biomolecular targets are essential in the performance of many proteomic detection methods. Herein, we report the application of a label-free silicon photonic biomolecular analysis platform for simultaneously determining kinetic association and dissociation constants for two representative protein capture agents: a thrombin-binding DNA aptamer and an anti-thrombin monoclonal antibody. The scalability and inherent multiplexing capability of the technology make it an attractive platform for simultaneously evaluating the binding characteristics of multiple capture agents recognizing the same target antigen, and thus a tool complementary to emerging high-throughput capture agent generation strategies.

  9. Characterization of Native Protein Complexes and Protein Isoform Variation Using Size-fractionation-based Quantitative Proteomics*

    PubMed Central

    Kirkwood, Kathryn J.; Ahmad, Yasmeen; Larance, Mark; Lamond, Angus I.

    2013-01-01

    Proteins form a diverse array of complexes that mediate cellular function and regulation. A largely unexplored feature of such protein complexes is the selective participation of specific protein isoforms and/or post-translationally modified forms. In this study, we combined native size-exclusion chromatography (SEC) with high-throughput proteomic analysis to characterize soluble protein complexes isolated from human osteosarcoma (U2OS) cells. Using this approach, we have identified over 71,500 peptides and 1,600 phosphosites, corresponding to over 8,000 proteins, distributed across 40 SEC fractions. This represents >50% of the predicted U2OS cell proteome, identified with a mean peptide sequence coverage of 27% per protein. Three biological replicates were performed, allowing statistical evaluation of the data and demonstrating a high degree of reproducibility in the SEC fractionation procedure. Specific proteins were detected interacting with multiple independent complexes, as typified by the separation of distinct complexes for the MRFAP1-MORF4L1-MRGBP interaction network. The data also revealed protein isoforms and post-translational modifications that selectively associated with distinct subsets of protein complexes. Surprisingly, there was clear enrichment for specific Gene Ontology terms associated with differential size classes of protein complexes. This study demonstrates that combined SEC/MS analysis can be used for the system-wide annotation of protein complexes and to predict potential isoform-specific interactions. All of these SEC data on the native separation of protein complexes have been integrated within the Encyclopedia of Proteome Dynamics, an online, multidimensional data-sharing resource available to the community. PMID:24043423

  10. Characterization of native protein complexes and protein isoform variation using size-fractionation-based quantitative proteomics.

    PubMed

    Kirkwood, Kathryn J; Ahmad, Yasmeen; Larance, Mark; Lamond, Angus I

    2013-12-01

    Proteins form a diverse array of complexes that mediate cellular function and regulation. A largely unexplored feature of such protein complexes is the selective participation of specific protein isoforms and/or post-translationally modified forms. In this study, we combined native size-exclusion chromatography (SEC) with high-throughput proteomic analysis to characterize soluble protein complexes isolated from human osteosarcoma (U2OS) cells. Using this approach, we have identified over 71,500 peptides and 1,600 phosphosites, corresponding to over 8,000 proteins, distributed across 40 SEC fractions. This represents >50% of the predicted U2OS cell proteome, identified with a mean peptide sequence coverage of 27% per protein. Three biological replicates were performed, allowing statistical evaluation of the data and demonstrating a high degree of reproducibility in the SEC fractionation procedure. Specific proteins were detected interacting with multiple independent complexes, as typified by the separation of distinct complexes for the MRFAP1-MORF4L1-MRGBP interaction network. The data also revealed protein isoforms and post-translational modifications that selectively associated with distinct subsets of protein complexes. Surprisingly, there was clear enrichment for specific Gene Ontology terms associated with differential size classes of protein complexes. This study demonstrates that combined SEC/MS analysis can be used for the system-wide annotation of protein complexes and to predict potential isoform-specific interactions. All of these SEC data on the native separation of protein complexes have been integrated within the Encyclopedia of Proteome Dynamics, an online, multidimensional data-sharing resource available to the community.

  11. The Scottish Structural Proteomics Facility: targets, methods and outputs

    PubMed Central

    Oke, Muse; Carter, Lester G.; Johnson, Kenneth A.; Liu, Huanting; McMahon, Stephen A.; Yan, Xuan; Kerou, Melina; Weikart, Nadine D.; Kadi, Nadia; Sheikh, Md. Arif; Schmelz, Stefan; Dorward, Mark; Zawadzki, Michal; Cozens, Christopher; Falconer, Helen; Powers, Helen; Overton, Ian M.; van Niekerk, C. A. Johannes; Peng, Xu; Patel, Prakash; Garrett, Roger A.; Prangishvili, David; Botting, Catherine H.; Coote, Peter J.; Dryden, David T. F.; Barton, Geoffrey J.; Schwarz-Linek, Ulrich; Challis, Gregory L.; Taylor, Garry L.; White, Malcolm F.

    2010-01-01

    The Scottish Structural Proteomics Facility was funded to develop a laboratory scale approach to high throughput structure determination. The effort was successful in that over 40 structures were determined. These structures and the methods harnessed to obtain them are reported here. This report reflects on the value of automation but also on the continued requirement for a high degree of scientific and technical expertise. The efficiency of the process poses challenges to the current paradigm of structural analysis and publication. In the 5 year period we published ten peer-reviewed papers reporting structural data arising from the pipeline. Nevertheless, the number of structures solved exceeded our ability to analyse and publish each new finding. By reporting the experimental details and depositing the structures we hope to maximize the impact of the project by allowing others to follow up the relevant biology. Electronic supplementary material The online version of this article (doi:10.1007/s10969-010-9090-y) contains supplementary material, which is available to authorized users. PMID:20419351

  12. Assessment of SRM, MRM(3) , and DIA for the targeted analysis of phosphorylation dynamics in non-small cell lung cancer.

    PubMed

    Schmidlin, Thierry; Garrigues, Luc; Lane, Catherine S; Mulder, T Celine; van Doorn, Sander; Post, Harm; de Graaf, Erik L; Lemeer, Simone; Heck, Albert J R; Altelaar, A F Maarten

    2016-08-01

    Hypothesis-driven MS-based targeted proteomics has gained great popularity in a relatively short timespan. Next to the widely established selected reaction monitoring (SRM) workflow, data-independent acquisition (DIA), also referred to as sequential window acquisition of all theoretical spectra (SWATH) was introduced as a high-throughput targeted proteomics method. DIA facilitates increased proteome coverage, however, does not yet reach the sensitivity obtained with SRM. Therefore, a well-informed method selection is crucial for designing a successful targeted proteomics experiment. This is especially the case when targeting less conventional peptides such as those that contain PTMs, as these peptides do not always adhere to the optimal fragmentation considerations for targeted assays. Here, we provide insight into the performance of DIA, SRM, and MRM cubed (MRM(3) ) in the analysis of phosphorylation dynamics throughout the phosphoinositide 3-kinase mechanistic target of rapamycin (PI3K-mTOR) and mitogen-activated protein kinase (MAPK) signaling network. We observe indeed that DIA is less sensitive when compared to SRM, however demonstrates increased flexibility, by postanalysis selection of alternative phosphopeptide precursors. Additionally, we demonstrate the added benefit of MRM(3) , allowing the quantification of two poorly accessible phosphosites. In total, targeted proteomics enabled the quantification of 42 PI3K-mTOR and MAPK phosphosites, gaining a so far unachieved in-depth view mTOR signaling events linked to tyrosine kinase inhibitor resistance in non-small cell lung cancer. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. Fusarium graminearum and Its Interactions with Cereal Heads: Studies in the Proteomics Era

    PubMed Central

    Yang, Fen; Jacobsen, Susanne; Jørgensen, Hans J. L.; Collinge, David B.; Svensson, Birte; Finnie, Christine

    2013-01-01

    The ascomycete fungal pathogen Fusarium graminearum (teleomorph stage: Gibberella zeae) is the causal agent of Fusarium head blight in wheat and barley. This disease leads to significant losses of crop yield, and especially quality through the contamination by diverse fungal mycotoxins, which constitute a significant threat to the health of humans and animals. In recent years, high-throughput proteomics, aiming at identifying a broad spectrum of proteins with a potential role in the pathogenicity and host resistance, has become a very useful tool in plant-fungus interaction research. In this review, we describe the progress in proteomics applications toward a better understanding of F. graminearum pathogenesis, virulence, and host defense mechanisms. The contribution of proteomics to the development of crop protection strategies against this pathogen is also discussed briefly. PMID:23450732

  14. [Applications of meta-analysis in multi-omics].

    PubMed

    Han, Mingfei; Zhu, Yunping

    2014-07-01

    As a statistical method integrating multi-features and multi-data, meta-analysis was introduced to the field of life science in the 1990s. With the rapid advances in high-throughput technologies, life omics, the core of which are genomics, transcriptomics and proteomics, is becoming the new hot spot of life science. Although the fast output of massive data has promoted the development of omics study, it results in excessive data that are difficult to integrate systematically. In this case, meta-analysis is frequently applied to analyze different types of data and is improved continuously. Here, we first summarize the representative meta-analysis methods systematically, and then study the current applications of meta-analysis in various omics fields, finally we discuss the still-existing problems and the future development of meta-analysis.

  15. Systematic Optimization of Long Gradient Chromatography Mass Spectrometry for Deep Analysis of Brain Proteome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Hong; Yang, Yanling; Li, Yuxin

    2015-02-06

    Development of high resolution liquid chromatography (LC) is essential for improving the sensitivity and throughput of mass spectrometry (MS)-based proteomics. Here we present systematic optimization of a long gradient LC-MS/MS platform to enhance protein identification from a complex mixture. The platform employed an in-house fabricated, reverse phase column (100 μm x 150 cm) coupled with Q Exactive MS. The column was capable of achieving a peak capacity of approximately 700 in a 720 min gradient of 10-45% acetonitrile. The optimal loading level was about 6 micrograms of peptides, although the column allowed loading as many as 20 micrograms. Gas phasemore » fractionation of peptide ions further increased the number of peptide identification by ~10%. Moreover, the combination of basic pH LC pre-fractionation with the long gradient LC-MS/MS platform enabled the identification of 96,127 peptides and 10,544 proteins at 1% protein false discovery rate in a postmortem brain sample of Alzheimer’s disease. As deep RNA sequencing of the same specimen suggested that ~16,000 genes were expressed, current analysis covered more than 60% of the expressed proteome. Further improvement strategies of the LC/LC-MS/MS platform were also discussed.« less

  16. Emerging proteomics biomarkers and prostate cancer burden in Africa

    PubMed Central

    Adeola, Henry A.; Blackburn, Jonathan M.; Rebbeck, Timothy R.; Zerbini, Luiz F.

    2017-01-01

    Various biomarkers have emerged via high throughput omics-based approaches for use in diagnosis, treatment, and monitoring of prostate cancer. Many of these have yet to be demonstrated as having value in routine clinical practice. Moreover, there is a dearth of information on validation of these emerging prostate biomarkers within African cohorts, despite the huge burden and aggressiveness of prostate cancer in men of African descent. This review focusses of the global landmark achievements in prostate cancer proteomics biomarker discovery and the potential for clinical implementation of these biomarkers in Africa. Biomarker validation processes at the preclinical, translational and clinical research level are discussed here, as are the challenges and prospects for the evaluation and use of novel proteomic prostate cancer biomarkers. PMID:28388542

  17. Emerging proteomics biomarkers and prostate cancer burden in Africa.

    PubMed

    Adeola, Henry A; Blackburn, Jonathan M; Rebbeck, Timothy R; Zerbini, Luiz F

    2017-06-06

    Various biomarkers have emerged via high throughput omics-based approaches for use in diagnosis, treatment, and monitoring of prostate cancer. Many of these have yet to be demonstrated as having value in routine clinical practice. Moreover, there is a dearth of information on validation of these emerging prostate biomarkers within African cohorts, despite the huge burden and aggressiveness of prostate cancer in men of African descent. This review focusses of the global landmark achievements in prostate cancer proteomics biomarker discovery and the potential for clinical implementation of these biomarkers in Africa. Biomarker validation processes at the preclinical, translational and clinical research level are discussed here, as are the challenges and prospects for the evaluation and use of novel proteomic prostate cancer biomarkers.

  18. Efficient Site-Specific Labeling of Proteins via Cysteines

    PubMed Central

    Kim, Younggyu; Ho, Sam O.; Gassman, Natalie R.; Korlann, You; Landorf, Elizabeth V.; Collart, Frank R.; Weiss, Shimon

    2011-01-01

    Methods for chemical modifications of proteins have been crucial for the advancement of proteomics. In particular, site-specific covalent labeling of proteins with fluorophores and other moieties has permitted the development of a multitude of assays for proteome analysis. A common approach for such a modification is solvent-accessible cysteine labeling using thiol-reactive dyes. Cysteine is very attractive for site-specific conjugation due to its relative rarity throughout the proteome and the ease of its introduction into a specific site along the protein's amino acid chain. This is achieved by site-directed mutagenesis, most often without perturbing the protein's function. Bottlenecks in this reaction, however, include the maintenance of reactive thiol groups without oxidation before the reaction, and the effective removal of unreacted molecules prior to fluorescence studies. Here, we describe an efficient, specific, and rapid procedure for cysteine labeling starting from well-reduced proteins in the solid state. The efficacy and specificity of the improved procedure are estimated using a variety of single-cysteine proteins and thiol-reactive dyes. Based on UV/vis absorbance spectra, coupling efficiencies are typically in the range 70–90%, and specificities are better than ~95%. The labeled proteins are evaluated using fluorescence assays, proving that the covalent modification does not alter their function. In addition to maleimide-based conjugation, this improved procedure may be used for other thiol-reactive conjugations such as haloacetyl, alkyl halide, and disulfide interchange derivatives. This facile and rapid procedure is well suited for high throughput proteome analysis. PMID:18275130

  19. Efficient site-specific labeling of proteins via cysteines.

    PubMed

    Kim, Younggyu; Ho, Sam O; Gassman, Natalie R; Korlann, You; Landorf, Elizabeth V; Collart, Frank R; Weiss, Shimon

    2008-03-01

    Methods for chemical modifications of proteins have been crucial for the advancement of proteomics. In particular, site-specific covalent labeling of proteins with fluorophores and other moieties has permitted the development of a multitude of assays for proteome analysis. A common approach for such a modification is solvent-accessible cysteine labeling using thiol-reactive dyes. Cysteine is very attractive for site-specific conjugation due to its relative rarity throughout the proteome and the ease of its introduction into a specific site along the protein's amino acid chain. This is achieved by site-directed mutagenesis, most often without perturbing the protein's function. Bottlenecks in this reaction, however, include the maintenance of reactive thiol groups without oxidation before the reaction, and the effective removal of unreacted molecules prior to fluorescence studies. Here, we describe an efficient, specific, and rapid procedure for cysteine labeling starting from well-reduced proteins in the solid state. The efficacy and specificity of the improved procedure are estimated using a variety of single-cysteine proteins and thiol-reactive dyes. Based on UV/vis absorbance spectra, coupling efficiencies are typically in the range 70-90%, and specificities are better than approximately 95%. The labeled proteins are evaluated using fluorescence assays, proving that the covalent modification does not alter their function. In addition to maleimide-based conjugation, this improved procedure may be used for other thiol-reactive conjugations such as haloacetyl, alkyl halide, and disulfide interchange derivatives. This facile and rapid procedure is well suited for high throughput proteome analysis.

  20. Proteomic analysis of grapevine resistance induced by Trichoderma harzianum T39 reveals specific defence pathways activated against downy mildew

    PubMed Central

    Perazzolli, Michele

    2012-01-01

    Downy mildew is caused by the oomycete Plasmopara viticola and is one of the most serious diseases of grapevine. The beneficial microorganism Trichoderma harzianum T39 (T39) has previously been shown to induce plant-mediated resistance and to reduce the severity of downy mildew in susceptible grapevines. In order to better understand the cellular processes associated with T39-induced resistance, the proteomic and histochemical changes activated by T39 in grapevine were investigated before and 1 day after P. viticola inoculation. A comprehensive proteomic analysis of T39-induced resistance in grapevine was performed using an eight-plex iTRAQ protocol, resulting in the identification and quantification of a total of 800 proteins. Most of the proteins directly affected by T39 were found to be involved in signal transduction, indicating activation of a complete microbial recognition machinery. Moreover, T39-induced resistance was associated with rapid accumulation of reactive oxygen species and callose at infection sites, as well as changes in abundance of proteins involved in response to stress and redox balance, indicating an active defence response to downy mildew. On the other hand, proteins affected by P. viticola in control plants mainly decreased in abundance, possibly reflecting the establishment of a compatible interaction. Finally, the high-throughput iTRAQ protocol allowed de novo peptide sequencing, which will be used to improve annotation of the Vitis vinifera cv. Pinot Noir proteome. PMID:23105132

  1. A New Mass Spectrometry-compatible Degradable Surfactant for Tissue Proteomics

    PubMed Central

    Chang, Ying-Hua; Gregorich, Zachery R.; Chen, Albert J.; Hwang, Leekyoung; Guner, Huseyin; Yu, Deyang; Zhang, Jianyi; Ge, Ying

    2015-01-01

    Tissue proteomics is increasingly recognized for its role in biomarker discovery and disease mechanism investigation. However, protein solubility remains a significant challenge in mass spectrometry (MS)-based tissue proteomics. Conventional surfactants such as sodium dodecyl sulfate (SDS), the preferred surfactant for protein solubilization, are not compatible with MS. Herein, we have screened a library of surfactant-like compounds and discovered an MS-compatible degradable surfactant (MaSDeS) for tissue proteomics that solubilizes all categories of proteins with performance comparable to SDS. The use of MaSDeS in the tissue extraction significantly improves the total number of protein identifications from commonly used tissues, including tissue from the heart, liver, and lung. Notably, MaSDeS significantly enriches membrane proteins, which are often under-represented in proteomics studies. The acid degradable nature of MaSDeS makes it amenable for high-throughput mass spectrometry-based proteomics. In addition, the thermostability of MaSDeS allows for its use in experiments requiring high temperature to facilitate protein extraction and solubilization. Furthermore, we have shown that MaSDeS outperforms the other MS-compatible surfactants in terms of overall protein solubility and the total number of identified proteins in tissue proteomics. Thus, the use of MaSDeS will greatly advance tissue proteomics and realize its potential in basic biomedical and clinical research. MaSDeS could be utilized in a variety of proteomics studies as well as general biochemical and biological experiments that employ surfactants for protein solubilization. PMID:25589168

  2. Mass spectrometry-assisted gel-based proteomics in cancer biomarker discovery: approaches and application

    PubMed Central

    Huang, Rongrong; Chen, Zhongsi; He, Lei; He, Nongyue; Xi, Zhijiang; Li, Zhiyang; Deng, Yan; Zeng, Xin

    2017-01-01

    There is a critical need for the discovery of novel biomarkers for early detection and targeted therapy of cancer, a major cause of deaths worldwide. In this respect, proteomic technologies, such as mass spectrometry (MS), enable the identification of pathologically significant proteins in various types of samples. MS is capable of high-throughput profiling of complex biological samples including blood, tissues, urine, milk, and cells. MS-assisted proteomics has contributed to the development of cancer biomarkers that may form the foundation for new clinical tests. It can also aid in elucidating the molecular mechanisms underlying cancer. In this review, we discuss MS principles and instrumentation as well as approaches in MS-based proteomics, which have been employed in the development of potential biomarkers. Furthermore, the challenges in validation of MS biomarkers for their use in clinical practice are also reviewed. PMID:28912895

  3. The Proteome Folding Project: Proteome-scale prediction of structure and function

    PubMed Central

    Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard

    2011-01-01

    The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995

  4. Proteomic Profiling of Mitochondrial Enzymes during Skeletal Muscle Aging.

    PubMed

    Staunton, Lisa; O'Connell, Kathleen; Ohlendieck, Kay

    2011-03-07

    Mitochondria are of central importance for energy generation in skeletal muscles. Expression changes or functional alterations in mitochondrial enzymes play a key role during myogenesis, fibre maturation, and various neuromuscular pathologies, as well as natural fibre aging. Mass spectrometry-based proteomics suggests itself as a convenient large-scale and high-throughput approach to catalogue the mitochondrial protein complement and determine global changes during health and disease. This paper gives a brief overview of the relatively new field of mitochondrial proteomics and discusses the findings from recent proteomic surveys of mitochondrial elements in aged skeletal muscles. Changes in the abundance, biochemical activity, subcellular localization, and/or posttranslational modifications in key mitochondrial enzymes might be useful as novel biomarkers of aging. In the long term, this may advance diagnostic procedures, improve the monitoring of disease progression, help in the testing of side effects due to new drug regimes, and enhance our molecular understanding of age-related muscle degeneration.

  5. Maize-Pathogen Interactions: An Ongoing Combat from a Proteomics Perspective.

    PubMed

    Pechanova, Olga; Pechan, Tibor

    2015-11-30

    Maize (Zea mays L.) is a host to numerous pathogenic species that impose serious diseases to its ear and foliage, negatively affecting the yield and the quality of the maize crop. A considerable amount of research has been carried out to elucidate mechanisms of maize-pathogen interactions with a major goal to identify defense-associated proteins. In this review, we summarize interactions of maize with its agriculturally important pathogens that were assessed at the proteome level. Employing differential analyses, such as the comparison of pathogen-resistant and susceptible maize varieties, as well as changes in maize proteomes after pathogen challenge, numerous proteins were identified as possible candidates in maize resistance. We describe findings of various research groups that used mainly mass spectrometry-based, high through-put proteomic tools to investigate maize interactions with fungal pathogens Aspergillus flavus, Fusarium spp., and Curvularia lunata, and viral agents Rice Black-streaked Dwarf Virus and Sugarcane Mosaic Virus.

  6. Maize-Pathogen Interactions: An Ongoing Combat from a Proteomics Perspective

    PubMed Central

    Pechanova, Olga; Pechan, Tibor

    2015-01-01

    Maize (Zea mays L.) is a host to numerous pathogenic species that impose serious diseases to its ear and foliage, negatively affecting the yield and the quality of the maize crop. A considerable amount of research has been carried out to elucidate mechanisms of maize-pathogen interactions with a major goal to identify defense-associated proteins. In this review, we summarize interactions of maize with its agriculturally important pathogens that were assessed at the proteome level. Employing differential analyses, such as the comparison of pathogen-resistant and susceptible maize varieties, as well as changes in maize proteomes after pathogen challenge, numerous proteins were identified as possible candidates in maize resistance. We describe findings of various research groups that used mainly mass spectrometry-based, high through-put proteomic tools to investigate maize interactions with fungal pathogens Aspergillus flavus, Fusarium spp., and Curvularia lunata, and viral agents Rice Black-streaked Dwarf Virus and Sugarcane Mosaic Virus. PMID:26633370

  7. The role of targeted chemical proteomics in pharmacology

    PubMed Central

    Sutton, Chris W

    2012-01-01

    Traditionally, proteomics is the high-throughput characterization of the global complement of proteins in a biological system using cutting-edge technologies (robotics and mass spectrometry) and bioinformatics tools (Internet-based search engines and databases). As the field of proteomics has matured, a diverse range of strategies have evolved to answer specific problems. Chemical proteomics is one such direction that provides the means to enrich and detect less abundant proteins (the ‘hidden’ proteome) from complex mixtures of wide dynamic range (the ‘deep’ proteome). In pharmacology, chemical proteomics has been utilized to determine the specificity of drugs and their analogues, for anticipated known targets, only to discover other proteins that bind and could account for side effects observed in preclinical and clinical trials. As a consequence, chemical proteomics provides a valuable accessory in refinement of second- and third-generation drug design for treatment of many diseases. However, determining definitive affinity capture of proteins by a drug immobilized on soft gel chromatography matrices has highlighted some of the challenges that remain to be addressed. Examples of the different strategies that have emerged using well-established drugs against pharmaceutically important enzymes, such as protein kinases, metalloproteases, PDEs, cytochrome P450s, etc., indicate the potential opportunity to employ chemical proteomics as an early-stage screening approach in the identification of new targets. PMID:22074351

  8. Innovative Tools and Technology for Analysis of Single Cells and Cell-Cell Interaction.

    PubMed

    Konry, Tania; Sarkar, Saheli; Sabhachandani, Pooja; Cohen, Noa

    2016-07-11

    Heterogeneity in single-cell responses and intercellular interactions results from complex regulation of cell-intrinsic and environmental factors. Single-cell analysis allows not only detection of individual cellular characteristics but also correlation of genetic content with phenotypic traits in the same cell. Technological advances in micro- and nanofabrication have benefited single-cell analysis by allowing precise control of the localized microenvironment, cell manipulation, and sensitive detection capabilities. Additionally, microscale techniques permit rapid, high-throughput, multiparametric screening that has become essential for -omics research. This review highlights innovative applications of microscale platforms in genetic, proteomic, and metabolic detection in single cells; cell sorting strategies; and heterotypic cell-cell interaction. We discuss key design aspects of single-cell localization and isolation in microfluidic systems, dynamic and endpoint analyses, and approaches that integrate highly multiplexed detection of various intracellular species.

  9. Architecture Mapping of the Inner Mitochondrial Membrane Proteome by Chemical Tools in Live Cells.

    PubMed

    Lee, Song-Yi; Kang, Myeong-Gyun; Shin, Sanghee; Kwak, Chulhwan; Kwon, Taejoon; Seo, Jeong Kon; Kim, Jong-Seo; Rhee, Hyun-Woo

    2017-03-15

    The inner mitochondrial membrane (IMM) proteome plays a central role in maintaining mitochondrial physiology and cellular metabolism. Various important biochemical reactions such as oxidative phosphorylation, metabolite production, and mitochondrial biogenesis are conducted by the IMM proteome, and mitochondria-targeted therapeutics have been developed for IMM proteins, which is deeply related for various human metabolic diseases including cancer and neurodegenerative diseases. However, the membrane topology of the IMM proteome remains largely unclear because of the lack of methods to evaluate it in live cells in a high-throughput manner. In this article, we reveal the in vivo topological direction of 135 IMM proteins, using an in situ-generated radical probe with genetically targeted peroxidase (APEX). Owing to the short lifetime of phenoxyl radicals generated in situ by submitochondrial targeted APEX and the impermeability of the IMM to small molecules, the solvent-exposed tyrosine residues of both the matrix and intermembrane space (IMS) sides of IMM proteins were exclusively labeled with the radical probe in live cells by Matrix-APEX and IMS-APEX, respectively and identified by mass spectrometry. From this analysis, we confirmed 58 IMM protein topologies and we could determine the topological direction of 77 IMM proteins whose topology at the IMM has not been fully characterized. We also found several IMM proteins (e.g., LETM1 and OXA1) whose topological information should be revised on the basis of our results. Overall, our identification of structural information on the mitochondrial inner-membrane proteome can provide valuable insights for the architecture and connectome of the IMM proteome in live cells.

  10. An LC-IMS-MS Platform Providing Increased Dynamic Range for High-Throughput Proteomic Studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baker, Erin Shammel; Livesay, Eric A.; Orton, Daniel J.

    2010-02-05

    A high-throughput approach and platform using 15 minute reversed-phase capillary liquid chromatography (RPLC) separations in conjunction with ion mobility spectrometry-mass spectrometry (IMS-MS) measurements was evaluated for the rapid analysis of complex proteomics samples. To test the separation quality of the short LC gradient, a sample was prepared by spiking twenty reference peptides at varying concentrations from 1 ng/mL to 10 µg/mL into a tryptic digest of mouse blood plasma and analyzed with both a LC-Linear Ion Trap Fourier Transform (FT) MS and LC-IMS-TOF MS. The LC-FT MS detected thirteen out of the twenty spiked peptides that had concentrations ≥100 ng/mL.more » In contrast, the drift time selected mass spectra from the LC-IMS-TOF MS analyses yielded identifications for nineteen of the twenty peptides with all spiking level present. The greater dynamic range of the LC-IMS-TOF MS system could be attributed to two factors. First, the LC-IMS-TOF MS system enabled drift time separation of the low concentration spiked peptides from the high concentration mouse peptide matrix components, reducing signal interference and background, and allowing species to be resolved that would otherwise be obscured by other components. Second, the automatic gain control (AGC) in the linear ion trap of the hybrid FT MS instrument limits the number of ions that are accumulated to reduce space charge effects, but in turn limits the achievable dynamic range compared to the TOF detector.« less

  11. Exploration of Panviral Proteome: High-Throughput Cloning and Functional Implications in Virus-host Interactions

    PubMed Central

    Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M.; LaBaer, Joshua

    2014-01-01

    Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies. PMID:24955142

  12. Exploration of panviral proteome: high-throughput cloning and functional implications in virus-host interactions.

    PubMed

    Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M; LaBaer, Joshua

    2014-01-01

    Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.

  13. ATAQS: A computational software tool for high throughput transition optimization and validation for selected reaction monitoring mass spectrometry

    PubMed Central

    2011-01-01

    Background Since its inception, proteomics has essentially operated in a discovery mode with the goal of identifying and quantifying the maximal number of proteins in a sample. Increasingly, proteomic measurements are also supporting hypothesis-driven studies, in which a predetermined set of proteins is consistently detected and quantified in multiple samples. Selected reaction monitoring (SRM) is a targeted mass spectrometric technique that supports the detection and quantification of specific proteins in complex samples at high sensitivity and reproducibility. Here, we describe ATAQS, an integrated software platform that supports all stages of targeted, SRM-based proteomics experiments including target selection, transition optimization and post acquisition data analysis. This software will significantly facilitate the use of targeted proteomic techniques and contribute to the generation of highly sensitive, reproducible and complete datasets that are particularly critical for the discovery and validation of targets in hypothesis-driven studies in systems biology. Result We introduce a new open source software pipeline, ATAQS (Automated and Targeted Analysis with Quantitative SRM), which consists of a number of modules that collectively support the SRM assay development workflow for targeted proteomic experiments (project management and generation of protein, peptide and transitions and the validation of peptide detection by SRM). ATAQS provides a flexible pipeline for end-users by allowing the workflow to start or end at any point of the pipeline, and for computational biologists, by enabling the easy extension of java algorithm classes for their own algorithm plug-in or connection via an external web site. This integrated system supports all steps in a SRM-based experiment and provides a user-friendly GUI that can be run by any operating system that allows the installation of the Mozilla Firefox web browser. Conclusions Targeted proteomics via SRM is a powerful new technique that enables the reproducible and accurate identification and quantification of sets of proteins of interest. ATAQS is the first open-source software that supports all steps of the targeted proteomics workflow. ATAQS also provides software API (Application Program Interface) documentation that enables the addition of new algorithms to each of the workflow steps. The software, installation guide and sample dataset can be found in http://tools.proteomecenter.org/ATAQS/ATAQS.html PMID:21414234

  14. Accelerating the design of biomimetic materials by integrating RNA-seq with proteomics and materials science.

    PubMed

    Guerette, Paul A; Hoon, Shawn; Seow, Yiqi; Raida, Manfred; Masic, Admir; Wong, Fong T; Ho, Vincent H B; Kong, Kiat Whye; Demirel, Melik C; Pena-Francesch, Abdon; Amini, Shahrouz; Tay, Gavin Z; Ding, Dawei; Miserez, Ali

    2013-10-01

    Efforts to engineer new materials inspired by biological structures are hampered by the lack of genomic data from many model organisms studied in biomimetic research. Here we show that biomimetic engineering can be accelerated by integrating high-throughput RNA-seq with proteomics and advanced materials characterization. This approach can be applied to a broad range of systems, as we illustrate by investigating diverse high-performance biological materials involved in embryo protection, adhesion and predation. In one example, we rapidly engineer recombinant squid sucker ring teeth proteins into a range of structural and functional materials, including nanopatterned surfaces and photo-cross-linked films that exceed the mechanical properties of most natural and synthetic polymers. Integrating RNA-seq with proteomics and materials science facilitates the molecular characterization of natural materials and the effective translation of their molecular designs into a wide range of bio-inspired materials.

  15. System-Wide Quantitative Proteomics of the Metabolic Syndrome in Mice: Genotypic and Dietary Effects.

    PubMed

    Terfve, Camille; Sabidó, Eduard; Wu, Yibo; Gonçalves, Emanuel; Choi, Meena; Vaga, Stefania; Vitek, Olga; Saez-Rodriguez, Julio; Aebersold, Ruedi

    2017-02-03

    Advances in mass spectrometry have made the quantitative measurement of proteins across multiple samples a reality, allowing for the study of complex biological systems such as the metabolic syndrome. Although the deregulation of lipid metabolism and increased hepatic storage of triacylglycerides are known to play a part in the onset of the metabolic syndrome, its molecular basis and dependency on dietary and genotypic factors are poorly characterized. Here, we used an experimental design with two different mouse strains and dietary and metabolic perturbations to generate a compendium of quantitative proteome data using three mass spectrometric techniques. The data reproduce known properties of the metabolic system and indicate differential molecular adaptation of the two mouse strains to perturbations, contributing to a better understanding of the metabolic syndrome. We show that high-quality, high-throughput proteomic data sets provide an unbiased broad overview of the behavior of complex systems after perturbation.

  16. Unparalleled sample treatment throughput for proteomics workflows relying on ultrasonic energy.

    PubMed

    Jorge, Susana; Araújo, J E; Pimentel-Santos, F M; Branco, Jaime C; Santos, Hugo M; Lodeiro, Carlos; Capelo, J L

    2018-02-01

    We report on the new microplate horn ultrasonic device as a powerful tool to speed proteomics workflows with unparalleled throughput. 96 complex proteomes were digested at the same time in 4min. Variables such as ultrasonication time, ultrasonication amplitude, and protein to enzyme ratio were optimized. The "classic" method relying on overnight protein digestion (12h) and the sonoreactor-based method were also employed for comparative purposes. We found the protein digestion efficiency homogeneously distributed in the entire microplate horn surface using the following conditions: 4min sonication time and 25% amplitude. Using this approach, patients with lymphoma and myeloma were classified using principal component analysis and a 2D gel-mass spectrometry based approach. Furthermore, we demonstrate the excellent performance by using MALDI-mass spectrometry based profiling as a fast way to classify patients with rheumatoid arthritis, systemic lupus erythematosus, and ankylosing spondylitis. Finally, the speed and simplicity of this method were demonstrated by clustering 90 patients with knee osteoarthritis disease (30), with a prosthesis (30, control group) and healthy individuals (30) with no history of joint disease. Overall, the new approach allows profiling a disease in just one week while allows to match the minimalism rules as outlined by Halls. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. A practical data processing workflow for multi-OMICS projects.

    PubMed

    Kohl, Michael; Megger, Dominik A; Trippler, Martin; Meckel, Hagen; Ahrens, Maike; Bracht, Thilo; Weber, Frank; Hoffmann, Andreas-Claudius; Baba, Hideo A; Sitek, Barbara; Schlaak, Jörg F; Meyer, Helmut E; Stephan, Christian; Eisenacher, Martin

    2014-01-01

    Multi-OMICS approaches aim on the integration of quantitative data obtained for different biological molecules in order to understand their interrelation and the functioning of larger systems. This paper deals with several data integration and data processing issues that frequently occur within this context. To this end, the data processing workflow within the PROFILE project is presented, a multi-OMICS project that aims on identification of novel biomarkers and the development of new therapeutic targets for seven important liver diseases. Furthermore, a software called CrossPlatformCommander is sketched, which facilitates several steps of the proposed workflow in a semi-automatic manner. Application of the software is presented for the detection of novel biomarkers, their ranking and annotation with existing knowledge using the example of corresponding Transcriptomics and Proteomics data sets obtained from patients suffering from hepatocellular carcinoma. Additionally, a linear regression analysis of Transcriptomics vs. Proteomics data is presented and its performance assessed. It was shown, that for capturing profound relations between Transcriptomics and Proteomics data, a simple linear regression analysis is not sufficient and implementation and evaluation of alternative statistical approaches are needed. Additionally, the integration of multivariate variable selection and classification approaches is intended for further development of the software. Although this paper focuses only on the combination of data obtained from quantitative Proteomics and Transcriptomics experiments, several approaches and data integration steps are also applicable for other OMICS technologies. Keeping specific restrictions in mind the suggested workflow (or at least parts of it) may be used as a template for similar projects that make use of different high throughput techniques. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Denef, Vincent; Shah, Manesh B; Verberkmoes, Nathan C

    The recent surge in microbial genomic sequencing, combined with the development of high-throughput liquid chromatography-mass-spectrometry-based (LC/LC-MS/MS) proteomics, has raised the question of the extent to which genomic information of one strain or environmental sample can be used to profile proteomes of related strains or samples. Even with decreasing sequencing costs, it remains impractical to obtain genomic sequence for every strain or sample analyzed. Here, we evaluate how shotgun proteomics is affected by amino acid divergence between the sample and the genomic database using a probability-based model and a random mutation simulation model constrained by experimental data. To assess the effectsmore » of nonrandom distribution of mutations, we also evaluated identification levels using in silico peptide data from sequenced isolates with average amino acid identities (AAI) varying between 76 and 98%. We compared the predictions to experimental protein identification levels for a sample that was evaluated using a database that included genomic information for the dominant organism and for a closely related variant (95% AAI). The range of models set the boundaries at which half of the proteins in a proteomic experiment can be identified to be 77-92% AAI between orthologs in the sample and database. Consistent with this prediction, experimental data indicated loss of half the identifiable proteins at 90% AAI. Additional analysis indicated a 6.4% reduction of the initial protein coverage per 1% amino acid divergence and total identification loss at 86% AAI. Consequently, shotgun proteomics is capable of cross-strain identifications but avoids most crossspecies false positives.« less

  19. Plasma protein absolute quantification by nano-LC Q-TOF UDMSE for clinical biomarker verification

    PubMed Central

    ILIES, MARIA; IUGA, CRISTINA ADELA; LOGHIN, FELICIA; DHOPLE, VISHNU MUKUND; HAMMER, ELKE

    2017-01-01

    Background and aims Proteome-based biomarker studies are targeting proteins that could serve as diagnostic, prognosis, and prediction molecules. In the clinical routine, immunoassays are currently used for the absolute quantification of such biomarkers, with the major limitation that only one molecule can be targeted per assay. The aim of our study was to test a mass spectrometry based absolute quantification method for the verification of plasma protein sets which might serve as reliable biomarker panels for the clinical practice. Methods Six EDTA plasma samples were analyzed after tryptic digestion using a high throughput data independent acquisition nano-LC Q-TOF UDMSE proteomics approach. Synthetic Escherichia coli standard peptides were spiked in each sample for the absolute quantification. Data analysis was performed using ProgenesisQI v2.0 software (Waters Corporation). Results Our method ensured absolute quantification of 242 non redundant plasma proteins in a single run analysis. The dynamic range covered was 105. 86% were represented by classical plasma proteins. The overall median coefficient of variation was 0.36, while a set of 63 proteins was found to be highly stable. Absolute protein concentrations strongly correlated with values reviewed in the literature. Conclusions Nano-LC Q-TOF UDMSE proteomic analysis can be used for a simple and rapid determination of absolute amounts of plasma proteins. A large number of plasma proteins could be analyzed, while a wide dynamic range was covered with low coefficient of variation at protein level. The method proved to be a reliable tool for the quantification of protein panel for biomarker verification in the clinical practice. PMID:29151793

  20. Advances in Quantitative Proteomics of Microbes and Microbial Communities

    NASA Astrophysics Data System (ADS)

    Waldbauer, J.; Zhang, L.; Rizzo, A. I.

    2015-12-01

    Quantitative measurements of gene expression are key to developing a mechanistic, predictive understanding of how microbial metabolism drives many biogeochemical fluxes and responds to environmental change. High-throughput RNA-sequencing can afford a wealth of information about transcript-level expression patterns, but it is becoming clear that expression dynamics are often very different at the protein level where biochemistry actually occurs. These divergent dynamics between levels of biological organization necessitate quantitative proteomic measurements to address many biogeochemical questions. The protein-level expression changes that underlie shifts in the magnitude, or even the direction, of metabolic and biogeochemical fluxes can be quite subtle and test the limits of current quantitative proteomics techniques. Here we describe methodologies for high-precision, whole-proteome quantification that are applicable to both model organisms of biogeochemical interest that may not be genetically tractable, and to complex community samples from natural environments. Employing chemical derivatization of peptides with multiple isotopically-coded tags, this strategy is rapid and inexpensive, can be implemented on a wide range of mass spectrometric instrumentation, and is relatively insensitive to chromatographic variability. We demonstrate the utility of this quantitative proteomics approach in application to both isolates and natural communities of sulfur-metabolizing and photosynthetic microbes.

  1. Proteomics technique opens new frontiers in mobilome research.

    PubMed

    Davidson, Andrew D; Matthews, David A; Maringer, Kevin

    2017-01-01

    A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti ). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.

  2. CIAN - Cell Imaging and Analysis Network at the Biology Department of McGill University

    PubMed Central

    Lacoste, J.; Lesage, G.; Bunnell, S.; Han, H.; Küster-Schöck, E.

    2010-01-01

    CF-31 The Cell Imaging and Analysis Network (CIAN) provides services and tools to researchers in the field of cell biology from within or outside Montreal's McGill University community. CIAN is composed of six scientific platforms: Cell Imaging (confocal and fluorescence microscopy), Proteomics (2-D protein gel electrophoresis and DiGE, fluorescent protein analysis), Automation and High throughput screening (Pinning robot and liquid handler), Protein Expression for Antibody Production, Genomics (real-time PCR), and Data storage and analysis (cluster, server, and workstations). Users submit project proposals, and can obtain training and consultation in any aspect of the facility, or initiate projects with the full-service platforms. CIAN is designed to facilitate training, enhance interactions, as well as share and maintain resources and expertise.

  3. Mass Spectrometry Based Lipidomics: An Overview of Technological Platforms

    PubMed Central

    Köfeler, Harald C.; Fauland, Alexander; Rechberger, Gerald N.; Trötzmüller, Martin

    2012-01-01

    One decade after the genomic and the proteomic life science revolution, new ‘omics’ fields are emerging. The metabolome encompasses the entity of small molecules—Most often end products of a catalytic process regulated by genes and proteins—with the lipidome being its fat soluble subdivision. Within recent years, lipids are more and more regarded not only as energy storage compounds but also as interactive players in various cellular regulation cycles and thus attain rising interest in the bio-medical community. The field of lipidomics is, on one hand, fuelled by analytical technology advances, particularly mass spectrometry and chromatography, but on the other hand new biological questions also drive analytical technology developments. Compared to fairly standardized genomic or proteomic high-throughput protocols, the high degree of molecular heterogeneity adds a special analytical challenge to lipidomic analysis. In this review, we will take a closer look at various mass spectrometric platforms for lipidomic analysis. We will focus on the advantages and limitations of various experimental setups like ‘shotgun lipidomics’, liquid chromatography—Mass spectrometry (LC-MS) and matrix assisted laser desorption ionization-time of flight (MALDI-TOF) based approaches. We will also examine available software packages for data analysis, which nowadays is in fact the rate limiting step for most ‘omics’ workflows. PMID:24957366

  4. Mass spectrometry based lipidomics: an overview of technological platforms.

    PubMed

    Köfeler, Harald C; Fauland, Alexander; Rechberger, Gerald N; Trötzmüller, Martin

    2012-01-05

    One decade after the genomic and the proteomic life science revolution, new 'omics' fields are emerging. The metabolome encompasses the entity of small molecules-Most often end products of a catalytic process regulated by genes and proteins-with the lipidome being its fat soluble subdivision. Within recent years, lipids are more and more regarded not only as energy storage compounds but also as interactive players in various cellular regulation cycles and thus attain rising interest in the bio-medical community. The field of lipidomics is, on one hand, fuelled by analytical technology advances, particularly mass spectrometry and chromatography, but on the other hand new biological questions also drive analytical technology developments. Compared to fairly standardized genomic or proteomic high-throughput protocols, the high degree of molecular heterogeneity adds a special analytical challenge to lipidomic analysis. In this review, we will take a closer look at various mass spectrometric platforms for lipidomic analysis. We will focus on the advantages and limitations of various experimental setups like 'shotgun lipidomics', liquid chromatography-Mass spectrometry (LC-MS) and matrix assisted laser desorption ionization-time of flight (MALDI-TOF) based approaches. We will also examine available software packages for data analysis, which nowadays is in fact the rate limiting step for most 'omics' workflows.

  5. Universal Solid-phase Reversible Sample-Prep for Concurrent Proteome and N-glycome Characterization

    PubMed Central

    Zhou, Hui; Morley, Samantha; Kostel, Stephen; Freeman, Michael R.; Joshi, Vivek; Brewster, David; Lee, Richard S.

    2017-01-01

    SUMMARY We describe a novel Solid-phase Reversible Sample-Prep (SRS) platform, which enables rapid sample preparation for concurrent proteome and N-glycome characterization by mass spectrometry. SRS utilizes a uniquely functionalized, silica-based bead that has strong affinity toward proteins with minimal-to-no affinity for peptides and other small molecules. By leveraging the inherent size difference between, SRS permits high-capacity binding of proteins, rapid removal of small molecules (detergents, metabolites, salts, etc.), extensive manipulation including enzymatic and chemical treatments on beads-bound proteins, and easy recovery of N-glycans and peptides. The efficacy of SRS was evaluated in a wide range of biological samples including single glycoprotein, whole cell lysate, murine tissues, and human urine. To further demonstrate the SRS platform, we coupled a quantitative strategy to SRS to investigate the differences between DU145 prostate cancer cells and its DIAPH3-silenced counterpart. Our previous studies suggested that DIAPH3 silencing in DU145 prostate cancer cells induced transition to an amoeboid phenotype that correlated with tumor progression and metastasis. In this analysis we identified distinct proteomic and N-glycomic alterations between the two cells. Intriguingly, a metastasis-associated tyrosine kinase receptor ephrin-type-A receptor (EPHA2) was highly upregulated in DIAPH3-silenced cells, indicating underling connection between EPHA2 and DIAPH3. Moreover, distinct alterations in the N-glycome were identified, suggesting a cross-link between DIAPH3 and glycosyltransferase networks. Overall, SRS is an enabling universal sample preparation strategy that is not size limited and has the capability to efficiently prepare and clean peptides and N-glycans concurrently from nearly all sample types. Conceptually, SRS can be utilized for the analysis of other posttranslational modifications, and the unique surface chemistry can be further transformed for high-throughput automation. The technical simplicity, robustness, and modularity of SRS make it a highly promising technology with great potential in proteomic-based research. PMID:26791391

  6. Nanobiocatalysis for protein digestion in proteomic analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Jungbae; Kim, Byoung Chan; Lopez-Ferrer, Daniel

    2010-02-01

    The process of protein digestion is a critical step for successful protein identification in the bottom-up proteomic analysis. To substitute the present practice of in-solution protein digestion, which is long, tedious, and difficult to automate, a lot of efforts have been dedicated for the development of a rapid, recyclable and automated digestion system. Recent advances of nanobiocatalytic approaches have improved the performance of protein digestion by using various nanomaterials such as nanoporous materials, magnetic nanoparticles, and polymer nanofibers. Especially, the unprecedented success of trypsin stabilization in the form of trypsin-coated nanofibers, showing no activity decrease under repeated uses for onemore » year and retaining good resistance to proteolysis, has demonstrated its great potential to be employed in the development of automated, high-throughput, and on-line digestion systems. This review discusses recent developments of nanobiocatalytic approaches for the improved performance of protein digestion in speed, detection sensitivity, recyclability, and trypsin stability. In addition, we also introduce the protein digestions under unconventional energy inputs for protein denaturation and the development of microfluidic enzyme reactors that can benefit from recent successes of these nanobiocatalytic approaches.« less

  7. The effect of electroacupuncture on proteomic changes in the motor cortex of 6-OHDA Parkinsonian rats.

    PubMed

    Li, Min; Li, Lijuan; Wang, Ke; Su, Wenting; Jia, Jun; Wang, Xiaomin

    2017-10-15

    Electroacupuncture (EA) has been reported to alleviate motor deficits in Parkinson's disease (PD) patients, and PD animal models. However, the mechanisms by which EA improves motor function have not been investigated. We have employed a 6-hydroxydopamine (6-OHDA) unilateral injection induced PD model to investigate whether EA alters protein expression in the motor cortex. We found that 4weeks of EA treatment significantly improved spontaneous floor plane locomotion and rotarod performance. High-throughput proteomic analysis in the motor cortex was employed. The expression of 54 proteins were altered in the unlesioned motor cortex, and 102 protein expressions were altered in the lesioned motor cortex of 6-OHDA rats compared to sham rats. Compared to non-treatment PD control, EA treatment reversed 6 proteins in unlesioned and 19 proteins in lesioned motor cortex. The present study demonstrated that PD induces proteomic changes in the motor cortex, some of which are rescued by EA treatment. These targeted proteins were mainly involved in increasing autophagy, mRNA processing and ATP binding and maintaining the balance of neurotransmitters. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Capillary electrophoresis interfaced with a mass spectrometer (CE-MS): technical considerations and applicability for biomarker studies in animals.

    PubMed

    Albalat, Amaya; Husi, Holger; Siwy, Justyna; Nally, Jarlath E; McLauglin, Mark; Eckersall, Peter D; Mullen, William

    2014-02-01

    Proteomics is a growing field that has the potential to be applied to many biology-related disciplines. However, the study of the proteome has proven to be very challenging due to its high level of complexity when compared to genome and transcriptome data. In order to analyse this level of complexity, high resolution separation of peptides/proteins are needed together with high resolution analysers. Currently, liquid chromatography and capillary electrophoresis (CE) are the two most widely used separation techniques that can be coupled on-line with a mass spectrometer (MS). In CE, proteins/ peptides are separated according to their size, charge and shape leading to high resolving power. Although further progress in the area of sensitivity, throughput and proteome coverage are expected, MS-based proteomics have developed to a level at which they are habitually applied to study a wide range of biological questions. The aim of this review is to present CE-MS as a proteomic analytical platform for biomarker research that could be used in farm animal and veterinary studies. This is a MS-analytical platform that has been widely used for biomarker research in the biomedical field but its application in animal proteomic studies is relatively novel. The review will focus on introducing the CE-MS platform and the primary considerations for its application to biomarker research. Furthermore, current applications but more importantly potential application in the field of farm animals and veterinary science will be presented and discussed.

  9. Quantitative proteomics reveals the central changes of wheat in response to powdery mildew.

    PubMed

    Fu, Ying; Zhang, Hong; Mandal, Siddikun Nabi; Wang, Changyou; Chen, Chunhuan; Ji, Wanquan

    2016-01-01

    Powdery mildew (Pm), caused by Blumeria graminis f. sp. tritici (Bgt), is one of the most important crop diseases, causing severe economic losses to wheat production worldwide. However, there are few reports about the proteomic response to Bgt infection in resistant wheat. Hence, quantitative proteomic analysis of N9134, a resistant wheat line, was performed to explore the molecular mechanism of wheat in defense against Bgt. Comparing the leaf proteins of Bgt-inoculated N9134 with that of mock-inoculated controls, a total of 2182 protein-species were quantified by iTRAQ at 24, 48 and 72h postinoculation (hpi) with Bgt, of which 394 showed differential accumulation. These differentially accumulated protein-species (DAPs) mainly included pathogenesis-related (PR) polypeptides, oxidative stress responsive proteins and components involved in primary metabolic pathways. KEGG enrichment analysis showed that phenylpropanoid biosynthesis, phenylalanine metabolism and photosynthesis-antenna proteins were the key pathways in response to Bgt infection. InterProScan 5 and the Gibbs Motif Sampler cluster 394 DAPs into eight conserved motifs, which shared leucine repeats and histidine sites in the sequence motifs. Moreover, eight separate protein-protein interaction (PPI) networks were predicted from STRING database. This study provides a powerful platform for further exploration of the molecular mechanism underlying resistant wheat responding to Bgt. Powdery mildew, caused by Blumeria graminis f. sp. tritici (Bgt), is a destructive pathogenic disease in wheat-producing regions worldwide, resulting in severe yield reductions. Although many resistant wheat varieties have been cultivated, there are few reports about the proteomic response to Bgt infection in resistant wheat. Therefore, an iTRAQ-based quantitative proteomic analysis of a resistant wheat line (N9134) in response to Bgt infection has been performed. This paper provides new insights into the underlying molecular mechanism of wheat in response to Bgt. The proteomic analysis can significantly narrow the field of potential defense-related protein-species, and is conducive to recognize the critical or effector protein under Bgt infection more precisely. Taken together, large amounts of high-throughput data provide a powerful platform for further exploration of the molecular mechanism on wheat-Bgt interactions. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Proteomics: from hypothesis to quantitative assay on a single platform. Guidelines for developing MRM assays using ion trap mass spectrometers.

    PubMed

    Han, Bomie; Higgs, Richard E

    2008-09-01

    High-throughput HPLC-mass spectrometry (HPLC-MS) is routinely used to profile biological samples for potential protein markers of disease, drug efficacy and toxicity. The discovery technology has advanced to the point where translating hypotheses from proteomic profiling studies into clinical use is the bottleneck to realizing the full potential of these approaches. The first step in this translation is the development and analytical validation of a higher throughput assay with improved sensitivity and selectivity relative to typical profiling assays. Multiple reaction monitoring (MRM) assays are an attractive approach for this stage of biomarker development given their improved sensitivity and specificity, the speed at which the assays can be developed and the quantitative nature of the assay. While the profiling assays are performed with ion trap mass spectrometers, MRM assays are traditionally developed in quadrupole-based mass spectrometers. Development of MRM assays from the same instrument used in the profiling analysis enables a seamless and rapid transition from hypothesis generation to validation. This report provides guidelines for rapidly developing an MRM assay using the same mass spectrometry platform used for profiling experiments (typically ion traps) and reviews methodological and analytical validation considerations. The analytical validation guidelines presented are drawn from existing practices on immunological assays and are applicable to any mass spectrometry platform technology.

  11. Stage-specific Proteomes from Onchocerca ochengi, Sister Species of the Human River Blindness Parasite, Uncover Adaptations to a Nodular Lifestyle.

    PubMed

    Armstrong, Stuart D; Xia, Dong; Bah, Germanus S; Krishna, Ritesh; Ngangyung, Henrietta F; LaCourse, E James; McSorley, Henry J; Kengne-Ouafo, Jonas A; Chounna-Ndongmo, Patrick W; Wanji, Samuel; Enyong, Peter A; Taylor, David W; Blaxter, Mark L; Wastling, Jonathan M; Tanya, Vincent N; Makepeace, Benjamin L

    2016-08-01

    Despite 40 years of control efforts, onchocerciasis (river blindness) remains one of the most important neglected tropical diseases, with 17 million people affected. The etiological agent, Onchocerca volvulus, is a filarial nematode with a complex lifecycle involving several distinct stages in the definitive host and blackfly vector. The challenges of obtaining sufficient material have prevented high-throughput studies and the development of novel strategies for disease control and diagnosis. Here, we utilize the closest relative of O. volvulus, the bovine parasite Onchocerca ochengi, to compare stage-specific proteomes and host-parasite interactions within the secretome. We identified a total of 4260 unique O. ochengi proteins from adult males and females, infective larvae, intrauterine microfilariae, and fluid from intradermal nodules. In addition, 135 proteins were detected from the obligate Wolbachia symbiont. Observed protein families that were enriched in all whole body extracts relative to the complete search database included immunoglobulin-domain proteins, whereas redox and detoxification enzymes and proteins involved in intracellular transport displayed stage-specific overrepresentation. Unexpectedly, the larval stages exhibited enrichment for several mitochondrial-related protein families, including members of peptidase family M16 and proteins which mediate mitochondrial fission and fusion. Quantification of proteins across the lifecycle using the Hi-3 approach supported these qualitative analyses. In nodule fluid, we identified 94 O. ochengi secreted proteins, including homologs of transforming growth factor-β and a second member of a novel 6-ShK toxin domain family, which was originally described from a model filarial nematode (Litomosoides sigmodontis). Strikingly, the 498 bovine proteins identified in nodule fluid were strongly dominated by antimicrobial proteins, especially cathelicidins. This first high-throughput analysis of an Onchocerca spp. proteome across the lifecycle highlights its profound complexity and emphasizes the extremely close relationship between O. ochengi and O. volvulus The insights presented here provide new candidates for vaccine development, drug targeting and diagnostic biomarkers. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  12. Stage-specific Proteomes from Onchocerca ochengi, Sister Species of the Human River Blindness Parasite, Uncover Adaptations to a Nodular Lifestyle*

    PubMed Central

    Armstrong, Stuart D.; Xia, Dong; Bah, Germanus S.; Krishna, Ritesh; Ngangyung, Henrietta F.; LaCourse, E. James; McSorley, Henry J.; Kengne-Ouafo, Jonas A.; Chounna-Ndongmo, Patrick W.; Wanji, Samuel; Enyong, Peter A.; Taylor, David W.; Blaxter, Mark L.; Wastling, Jonathan M.; Tanya, Vincent N.; Makepeace, Benjamin L.

    2016-01-01

    Despite 40 years of control efforts, onchocerciasis (river blindness) remains one of the most important neglected tropical diseases, with 17 million people affected. The etiological agent, Onchocerca volvulus, is a filarial nematode with a complex lifecycle involving several distinct stages in the definitive host and blackfly vector. The challenges of obtaining sufficient material have prevented high-throughput studies and the development of novel strategies for disease control and diagnosis. Here, we utilize the closest relative of O. volvulus, the bovine parasite Onchocerca ochengi, to compare stage-specific proteomes and host-parasite interactions within the secretome. We identified a total of 4260 unique O. ochengi proteins from adult males and females, infective larvae, intrauterine microfilariae, and fluid from intradermal nodules. In addition, 135 proteins were detected from the obligate Wolbachia symbiont. Observed protein families that were enriched in all whole body extracts relative to the complete search database included immunoglobulin-domain proteins, whereas redox and detoxification enzymes and proteins involved in intracellular transport displayed stage-specific overrepresentation. Unexpectedly, the larval stages exhibited enrichment for several mitochondrial-related protein families, including members of peptidase family M16 and proteins which mediate mitochondrial fission and fusion. Quantification of proteins across the lifecycle using the Hi-3 approach supported these qualitative analyses. In nodule fluid, we identified 94 O. ochengi secreted proteins, including homologs of transforming growth factor-β and a second member of a novel 6-ShK toxin domain family, which was originally described from a model filarial nematode (Litomosoides sigmodontis). Strikingly, the 498 bovine proteins identified in nodule fluid were strongly dominated by antimicrobial proteins, especially cathelicidins. This first high-throughput analysis of an Onchocerca spp. proteome across the lifecycle highlights its profound complexity and emphasizes the extremely close relationship between O. ochengi and O. volvulus. The insights presented here provide new candidates for vaccine development, drug targeting and diagnostic biomarkers. PMID:27226403

  13. ComplexQuant: high-throughput computational pipeline for the global quantitative analysis of endogenous soluble protein complexes using high resolution protein HPLC and precision label-free LC/MS/MS.

    PubMed

    Wan, Cuihong; Liu, Jian; Fong, Vincent; Lugowski, Andrew; Stoilova, Snejana; Bethune-Waddell, Dylan; Borgeson, Blake; Havugimana, Pierre C; Marcotte, Edward M; Emili, Andrew

    2013-04-09

    The experimental isolation and characterization of stable multi-protein complexes are essential to understanding the molecular systems biology of a cell. To this end, we have developed a high-throughput proteomic platform for the systematic identification of native protein complexes based on extensive fractionation of soluble protein extracts by multi-bed ion exchange high performance liquid chromatography (IEX-HPLC) combined with exhaustive label-free LC/MS/MS shotgun profiling. To support these studies, we have built a companion data analysis software pipeline, termed ComplexQuant. Proteins present in the hundreds of fractions typically collected per experiment are first identified by exhaustively interrogating MS/MS spectra using multiple database search engines within an integrative probabilistic framework, while accounting for possible post-translation modifications. Protein abundance is then measured across the fractions based on normalized total spectral counts and precursor ion intensities using a dedicated tool, PepQuant. This analysis allows co-complex membership to be inferred based on the similarity of extracted protein co-elution profiles. Each computational step has been optimized for processing large-scale biochemical fractionation datasets, and the reliability of the integrated pipeline has been benchmarked extensively. This article is part of a Special Issue entitled: From protein structures to clinical applications. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. Monitoring Peptidase Activities in Complex Proteomes by MALDI-TOF Mass Spectrometry

    PubMed Central

    Villanueva, Josep; Nazarian, Arpi; Lawlor, Kevin; Tempst, Paul

    2009-01-01

    Measuring enzymatic activities in biological fluids is a form of activity-based proteomics and may be utilized as a means of developing disease biomarkers. Activity-based assays allow amplification of output signals, thus potentially visualizing low-abundant enzymes on a virtually transparent whole-proteome background. The protocol presented here describes a semi-quantitative in vitro assay of proteolytic activities in complex proteomes by monitoring breakdown of designer peptide-substrates using robotic extraction and a MALDI-TOF mass spectrometric read-out. Relative quantitation of the peptide metabolites is done by comparison with spiked internal standards, followed by statistical analysis of the resulting mini-peptidome. Partial automation provides reproducibility and throughput essential for comparing large sample sets. The approach may be employed for diagnostic or predictive purposes and enables profiling of 96 samples in 30 hours. It could be tailored to many diagnostic and pharmaco-dynamic purposes, as a read-out of catalytic and metabolic activities in body fluids or tissues. PMID:19617888

  15. Single-cell-type quantitative proteomic and ionomic analysis of epidermal bladder cells from the halophyte model plant Mesembryanthemum crystallinum to identify salt-responsive proteins.

    PubMed

    Barkla, Bronwyn J; Vera-Estrella, Rosario; Raymond, Carolyn

    2016-05-10

    Epidermal bladder cells (EBC) are large single-celled, specialized, and modified trichomes found on the aerial parts of the halophyte Mesembryanthemum crystallinum. Recent development of a simple but high throughput technique to extract the contents from these cells has provided an opportunity to conduct detailed single-cell-type analyses of their molecular characteristics at high resolution to gain insight into the role of these cells in the salt tolerance of the plant. In this study, we carry out large-scale complementary quantitative proteomic studies using both a label (DIGE) and label-free (GeLC-MS) approach to identify salt-responsive proteins in the EBC extract. Additionally we perform an ionomics analysis (ICP-MS) to follow changes in the amounts of 27 different elements. Using these methods, we were able to identify 54 proteins and nine elements that showed statistically significant changes in the EBC from salt-treated plants. GO enrichment analysis identified a large number of transport proteins but also proteins involved in photosynthesis, primary metabolism and Crassulacean acid metabolism (CAM). Validation of results by western blot, confocal microscopy and enzyme analysis helped to strengthen findings and further our understanding into the role of these specialized cells. As expected EBC accumulated large quantities of sodium, however, the most abundant element was chloride suggesting the sequestration of this ion into the EBC vacuole is just as important for salt tolerance. This single-cell type omics approach shows that epidermal bladder cells of M. crystallinum are metabolically active modified trichomes, with primary metabolism supporting cell growth, ion accumulation, compatible solute synthesis and CAM. Data are available via ProteomeXchange with identifier PXD004045.

  16. Mass spectrometry-based proteomics: from cancer biology to protein biomarkers, drug targets, and clinical applications.

    PubMed

    Jimenez, Connie R; Verheul, Henk M W

    2014-01-01

    Proteomics is optimally suited to bridge the gap between genomic information on the one hand and biologic functions and disease phenotypes at the other, since it studies the expression and/or post-translational modification (especially phosphorylation) of proteins--the major cellular players bringing about cellular functions--at a global level in biologic specimens. Mass spectrometry technology and (bio)informatic tools have matured to the extent that they can provide high-throughput, comprehensive, and quantitative protein inventories of cells, tissues, and biofluids in clinical samples at low level. In this article, we focus on next-generation proteomics employing nanoliquid chromatography coupled to high-resolution tandem mass spectrometry for in-depth (phospho)protein profiling of tumor tissues and (proximal) biofluids, with a focus on studies employing clinical material. In addition, we highlight emerging proteogenomic approaches for the identification of tumor-specific protein variants, and targeted multiplex mass spectrometry strategies for large-scale biomarker validation. Below we provide a discussion of recent progress, some research highlights, and challenges that remain for clinical translation of proteomic discoveries.

  17. Emerging techniques for the discovery and validation of therapeutic targets for skeletal diseases.

    PubMed

    Cho, Christine H; Nuttall, Mark E

    2002-12-01

    Advances in genomics and proteomics have revolutionised the drug discovery process and target validation. Identification of novel therapeutic targets for chronic skeletal diseases is an extremely challenging process based on the difficulty of obtaining high-quality human diseased versus normal tissue samples. The quality of tissue and genomic information obtained from the sample is critical to identifying disease-related genes. Using a genomics-based approach, novel genes or genes with similar homology to existing genes can be identified from cDNA libraries generated from normal versus diseased tissue. High-quality cDNA libraries are prepared from uncontaminated homogeneous cell populations harvested from tissue sections of interest. Localised gene expression analysis and confirmation are obtained through in situ hybridisation or immunohistochemical studies. Cells overexpressing the recombinant protein are subsequently designed for primary cell-based high-throughput assays that are capable of screening large compound banks for potential hits. Afterwards, secondary functional assays are used to test promising compounds. The same overexpressing cells are used in the secondary assay to test protein activity and functionality as well as screen for small-molecule agonists or antagonists. Once a hit is generated, a structure-activity relationship of the compound is optimised for better oral bioavailability and pharmacokinetics allowing the compound to progress into development. Parallel efforts from proteomics, as well as genetics/transgenics, bioinformatics and combinatorial chemistry, and improvements in high-throughput automation technologies, allow the drug discovery process to meet the demands of the medicinal market. This review discusses and illustrates how different approaches are incorporated into the discovery and validation of novel targets and, consequently, the development of potentially therapeutic agents in the areas of osteoporosis and osteoarthritis. While current treatments exist in the form of hormone replacement therapy, antiresorptive and anabolic agents for osteoporosis, there are no disease-modifying therapies for the treatment of the most common human joint disease, osteoarthritis. A massive market potential for improved options with better safety and efficacy still remains. Therefore, the application of genomics and proteomics for both diseases should provide much needed novel therapeutic approaches to treating these major world health problems.

  18. Bacterial Survival under Extreme UV Radiation: A Comparative Proteomics Study of Rhodobacter sp., Isolated from High Altitude Wetlands in Chile

    PubMed Central

    Pérez, Vilma; Hengst, Martha; Kurte, Lenka; Dorador, Cristina; Jeffrey, Wade H.; Wattiez, Ruddy; Molina, Veronica; Matallana-Surget, Sabine

    2017-01-01

    Salar de Huasco, defined as a polyextreme environment, is a high altitude saline wetland in the Chilean Altiplano (3800 m.a.s.l.), permanently exposed to the highest solar radiation doses registered in the world. We present here the first comparative proteomics study of a photoheterotrophic bacterium, Rhodobacter sp., isolated from this remote and hostile habitat. We developed an innovative experimental approach using different sources of radiation (in situ sunlight and UVB lamps), cut-off filters (Mylar, Lee filters) and a high-throughput, label-free quantitative proteomics method to comprehensively analyze the effect of seven spectral bands on protein regulation. A hierarchical cluster analysis of 40 common proteins revealed that all conditions containing the most damaging UVB radiation induced similar pattern of protein regulation compared with UVA and visible light spectral bands. Moreover, it appeared that the cellular adaptation of Rhodobacter sp. to osmotic stress encountered in the hypersaline environment from which it was originally isolated, might further a higher resistance to damaging UV radiation. Indeed, proteins involved in the synthesis and transport of key osmoprotectants, such as glycine betaine and inositol, were found in very high abundance under UV radiation compared to the dark control, suggesting the function of osmolytes as efficient reactive oxygen scavengers. Our study also revealed a RecA-independent response and a tightly regulated network of protein quality control involving proteases and chaperones to selectively degrade misfolded and/or damaged proteins. PMID:28694800

  19. Integrated automation for continuous high-throughput synthetic chromosome assembly and transformation to identify improved yeast strains for industrial production of biofuels and bio-based chemicals

    USDA-ARS?s Scientific Manuscript database

    An exponential increase in our understanding of genomes, proteomes, and metabolomes provides greater impetus to address critical biotechnological issues such as sustainable production of biofuels and bio-based chemicals and, in particular, the development of improved microbial biocatalysts for use i...

  20. Computational biology for ageing

    PubMed Central

    Wieser, Daniela; Papatheodorou, Irene; Ziehm, Matthias; Thornton, Janet M.

    2011-01-01

    High-throughput genomic and proteomic technologies have generated a wealth of publicly available data on ageing. Easy access to these data, and their computational analysis, is of great importance in order to pinpoint the causes and effects of ageing. Here, we provide a description of the existing databases and computational tools on ageing that are available for researchers. We also describe the computational approaches to data interpretation in the field of ageing including gene expression, comparative and pathway analyses, and highlight the challenges for future developments. We review recent biological insights gained from applying bioinformatics methods to analyse and interpret ageing data in different organisms, tissues and conditions. PMID:21115530

  1. LXtoo: an integrated live Linux distribution for the bioinformatics community

    PubMed Central

    2012-01-01

    Background Recent advances in high-throughput technologies dramatically increase biological data generation. However, many research groups lack computing facilities and specialists. This is an obstacle that remains to be addressed. Here, we present a Linux distribution, LXtoo, to provide a flexible computing platform for bioinformatics analysis. Findings Unlike most of the existing live Linux distributions for bioinformatics limiting their usage to sequence analysis and protein structure prediction, LXtoo incorporates a comprehensive collection of bioinformatics software, including data mining tools for microarray and proteomics, protein-protein interaction analysis, and computationally complex tasks like molecular dynamics. Moreover, most of the programs have been configured and optimized for high performance computing. Conclusions LXtoo aims to provide well-supported computing environment tailored for bioinformatics research, reducing duplication of efforts in building computing infrastructure. LXtoo is distributed as a Live DVD and freely available at http://bioinformatics.jnu.edu.cn/LXtoo. PMID:22813356

  2. LXtoo: an integrated live Linux distribution for the bioinformatics community.

    PubMed

    Yu, Guangchuang; Wang, Li-Gen; Meng, Xiao-Hua; He, Qing-Yu

    2012-07-19

    Recent advances in high-throughput technologies dramatically increase biological data generation. However, many research groups lack computing facilities and specialists. This is an obstacle that remains to be addressed. Here, we present a Linux distribution, LXtoo, to provide a flexible computing platform for bioinformatics analysis. Unlike most of the existing live Linux distributions for bioinformatics limiting their usage to sequence analysis and protein structure prediction, LXtoo incorporates a comprehensive collection of bioinformatics software, including data mining tools for microarray and proteomics, protein-protein interaction analysis, and computationally complex tasks like molecular dynamics. Moreover, most of the programs have been configured and optimized for high performance computing. LXtoo aims to provide well-supported computing environment tailored for bioinformatics research, reducing duplication of efforts in building computing infrastructure. LXtoo is distributed as a Live DVD and freely available at http://bioinformatics.jnu.edu.cn/LXtoo.

  3. Cloud-based solution to identify statistically significant MS peaks differentiating sample categories.

    PubMed

    Ji, Jun; Ling, Jeffrey; Jiang, Helen; Wen, Qiaojun; Whitin, John C; Tian, Lu; Cohen, Harvey J; Ling, Xuefeng B

    2013-03-23

    Mass spectrometry (MS) has evolved to become the primary high throughput tool for proteomics based biomarker discovery. Until now, multiple challenges in protein MS data analysis remain: large-scale and complex data set management; MS peak identification, indexing; and high dimensional peak differential analysis with the concurrent statistical tests based false discovery rate (FDR). "Turnkey" solutions are needed for biomarker investigations to rapidly process MS data sets to identify statistically significant peaks for subsequent validation. Here we present an efficient and effective solution, which provides experimental biologists easy access to "cloud" computing capabilities to analyze MS data. The web portal can be accessed at http://transmed.stanford.edu/ssa/. Presented web application supplies large scale MS data online uploading and analysis with a simple user interface. This bioinformatic tool will facilitate the discovery of the potential protein biomarkers using MS.

  4. Astragaloside IV Attenuates Glutamate-Induced Neurotoxicity in PC12 Cells through Raf-MEK-ERK Pathway.

    PubMed

    Yue, Rongcai; Li, Xia; Chen, Bingyang; Zhao, Jing; He, Weiwei; Yuan, Hu; Yuan, Xing; Gao, Na; Wu, Guozhen; Jin, Huizi; Shan, Lei; Zhang, Weidong

    2015-01-01

    Astragaloside IV (AGS-IV) is a main active ingredient of Astragalus membranaceus Bunge, a medicinal herb prescribed as an immunostimulant, hepatoprotective, antiperspirant, a diuretic or a tonic as documented in Chinese Materia Medica. In the present study, we employed a high-throughput comparative proteomic approach based on 2D-nano-LC-MS/MS to investigate the possible mechanism of action involved in the neuroprotective effect of AGS-IV against glutamate-induced neurotoxicity in PC12 cells. Differential proteins were identified, among which 13 proteins survived the stringent filter criteria and were further included for functional discussion. Two proteins (vimentin and Gap43) were randomly selected, and their expression levels were further confirmed by western blots analysis. The results matched well with those of proteomics. Furthermore, network analysis of protein-protein interactions (PPI) and pathways enrichment with AGS-IV associated proteins were carried out to illustrate its underlying molecular mechanism. Proteins associated with signal transduction, immune system, signaling molecules and interaction, and energy metabolism play important roles in neuroprotective effect of AGS-IV and Raf-MEK-ERK pathway was involved in the neuroprotective effect of AGS-IV against glutamate-induced neurotoxicity in PC12 cells. This study demonstrates that comparative proteomics based on shotgun approach is a valuable tool for molecular mechanism studies, since it allows the simultaneously evaluate the global proteins alterations.

  5. Proteomics Quality Control: Quality Control Software for MaxQuant Results.

    PubMed

    Bielow, Chris; Mastrobuoni, Guido; Kempa, Stefan

    2016-03-04

    Mass spectrometry-based proteomics coupled to liquid chromatography has matured into an automatized, high-throughput technology, producing data on the scale of multiple gigabytes per instrument per day. Consequently, an automated quality control (QC) and quality analysis (QA) capable of detecting measurement bias, verifying consistency, and avoiding propagation of error is paramount for instrument operators and scientists in charge of downstream analysis. We have developed an R-based QC pipeline called Proteomics Quality Control (PTXQC) for bottom-up LC-MS data generated by the MaxQuant software pipeline. PTXQC creates a QC report containing a comprehensive and powerful set of QC metrics, augmented with automated scoring functions. The automated scores are collated to create an overview heatmap at the beginning of the report, giving valuable guidance also to nonspecialists. Our software supports a wide range of experimental designs, including stable isotope labeling by amino acids in cell culture (SILAC), tandem mass tags (TMT), and label-free data. Furthermore, we introduce new metrics to score MaxQuant's Match-between-runs (MBR) functionality by which peptide identifications can be transferred across Raw files based on accurate retention time and m/z. Last but not least, PTXQC is easy to install and use and represents the first QC software capable of processing MaxQuant result tables. PTXQC is freely available at https://github.com/cbielow/PTXQC .

  6. Reduced dimensionality (3,2)D NMR experiments and their automated analysis: implications to high-throughput structural studies on proteins.

    PubMed

    Reddy, Jithender G; Kumar, Dinesh; Hosur, Ramakrishna V

    2015-02-01

    Protein NMR spectroscopy has expanded dramatically over the last decade into a powerful tool for the study of their structure, dynamics, and interactions. The primary requirement for all such investigations is sequence-specific resonance assignment. The demand now is to obtain this information as rapidly as possible and in all types of protein systems, stable/unstable, soluble/insoluble, small/big, structured/unstructured, and so on. In this context, we introduce here two reduced dimensionality experiments – (3,2)D-hNCOcanH and (3,2)D-hNcoCAnH – which enhance the previously described 2D NMR-based assignment methods quite significantly. Both the experiments can be recorded in just about 2-3 h each and hence would be of immense value for high-throughput structural proteomics and drug discovery research. The applicability of the method has been demonstrated using alpha-helical bovine apo calbindin-D9k P43M mutant (75 aa) protein. Automated assignment of this data using AUTOBA has been presented, which enhances the utility of these experiments. The backbone resonance assignments so derived are utilized to estimate secondary structures and the backbone fold using Web-based algorithms. Taken together, we believe that the method and the protocol proposed here can be used for routine high-throughput structural studies of proteins. Copyright © 2014 John Wiley & Sons, Ltd.

  7. Bioanalysis in microfluidic devices.

    PubMed

    Khandurina, Julia; Guttman, András

    2002-01-18

    Microfabricated bioanalytical devices (also referred to as laboratory-on-a-chip or micro-TAS) offer highly efficient platforms for simultaneous analysis of a large number of biologically important molecules, possessing great potential for genome, proteome and metabolome studies. Development and implementation of microfluidic-based bioanalytical tools involves both established and evolving technologies, including microlithography, micromachining, micro-electromechanical systems technology and nanotechnology. This article provides an overview of the latest developments in the key device subject areas and the basic interdisciplinary technologies. Important aspects of DNA and protein analysis, interfacing issues and system integration are all thoroughly discussed, along with applications for this novel "synergized" technology in high-throughput separations of biologically important molecules. This review also gives a better understanding of how to utilize these technologies as well as to provide appropriate technical solutions to problems perceived as being more fundamental.

  8. Five years later: the current status of the use of proteomics and transcriptomics in EMF research.

    PubMed

    Leszczynski, Dariusz; de Pomerai, David; Koczan, Dirk; Stoll, Dieter; Franke, Helmut; Albar, Juan Pablo

    2012-08-01

    The World Health Organization's and Radiation and Nuclear Safety Authority's "Workshop on Application of Proteomics and Transcriptomics in Electromagnetic Fields Research" was held in Helsinki in the October/November 2005. As a consequence of this meeting, Proteomics journal published in 2006 a special issue "Application of Proteomics and Transcriptomics in EMF Research" (Vol. 6 No. 17; Guest Editor: D. Leszczynski). This Proteomics issue presented the status of research, of the effects of electromagnetic fields (EMF) using proteomics and transcriptomics methods, present in 2005. The current overview/opinion article presents the status of research in this area by reviewing all studies that were published by the end of 2010. The review work was a part of the European Cooperation in the Field of Scientific and Technical Research (COST) Action BM0704 that created a structure in which researchers in the field of EMF and health shared knowledge and information. The review was prepared by the members of the COST Action BM0704 task group on the high-throughput screening techniques and electromagnetic fields (TG-HTST-EMF). © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Beyond the Natural Proteome: Nondegenerate Saturation Mutagenesis-Methodologies and Advantages.

    PubMed

    Ferreira Amaral, M M; Frigotto, L; Hine, A V

    2017-01-01

    Beyond the natural proteome, high-throughput mutagenesis offers the protein engineer an opportunity to "tweak" the wild-type activity of a protein to create a recombinant protein with required attributes. Of the various approaches available, saturation mutagenesis is one of the core techniques employed by protein engineers, and in recent times, nondegenerate saturation mutagenesis is emerging as the approach of choice. This review compares the current methodologies available for conducting nondegenerate saturation mutagenesis with traditional, degenerate saturation and briefly outlines the options available for screening the resulting libraries, to discover a novel protein with the required activity and/or specificity. © 2017 Elsevier Inc. All rights reserved.

  10. Analyses of Brucella Pathogenesis, Host Immunity, and Vaccine Targets using Systems Biology and Bioinformatics

    PubMed Central

    He, Yongqun

    2011-01-01

    Brucella is a Gram-negative, facultative intracellular bacterium that causes zoonotic brucellosis in humans and various animals. Out of 10 classified Brucella species, B. melitensis, B. abortus, B. suis, and B. canis are pathogenic to humans. In the past decade, the mechanisms of Brucella pathogenesis and host immunity have been extensively investigated using the cutting edge systems biology and bioinformatics approaches. This article provides a comprehensive review of the applications of Omics (including genomics, transcriptomics, and proteomics) and bioinformatics technologies for the analysis of Brucella pathogenesis, host immune responses, and vaccine targets. Based on more than 30 sequenced Brucella genomes, comparative genomics is able to identify gene variations among Brucella strains that help to explain host specificity and virulence differences among Brucella species. Diverse transcriptomics and proteomics gene expression studies have been conducted to analyze gene expression profiles of wild type Brucella strains and mutants under different laboratory conditions. High throughput Omics analyses of host responses to infections with virulent or attenuated Brucella strains have been focused on responses by mouse and cattle macrophages, bovine trophoblastic cells, mouse and boar splenocytes, and ram buffy coat. Differential serum responses in humans and rams to Brucella infections have been analyzed using high throughput serum antibody screening technology. The Vaxign reverse vaccinology has been used to predict many Brucella vaccine targets. More than 180 Brucella virulence factors and their gene interaction networks have been identified using advanced literature mining methods. The recent development of community-based Vaccine Ontology and Brucellosis Ontology provides an efficient way for Brucella data integration, exchange, and computer-assisted automated reasoning. PMID:22919594

  11. Analyses of Brucella pathogenesis, host immunity, and vaccine targets using systems biology and bioinformatics.

    PubMed

    He, Yongqun

    2012-01-01

    Brucella is a Gram-negative, facultative intracellular bacterium that causes zoonotic brucellosis in humans and various animals. Out of 10 classified Brucella species, B. melitensis, B. abortus, B. suis, and B. canis are pathogenic to humans. In the past decade, the mechanisms of Brucella pathogenesis and host immunity have been extensively investigated using the cutting edge systems biology and bioinformatics approaches. This article provides a comprehensive review of the applications of Omics (including genomics, transcriptomics, and proteomics) and bioinformatics technologies for the analysis of Brucella pathogenesis, host immune responses, and vaccine targets. Based on more than 30 sequenced Brucella genomes, comparative genomics is able to identify gene variations among Brucella strains that help to explain host specificity and virulence differences among Brucella species. Diverse transcriptomics and proteomics gene expression studies have been conducted to analyze gene expression profiles of wild type Brucella strains and mutants under different laboratory conditions. High throughput Omics analyses of host responses to infections with virulent or attenuated Brucella strains have been focused on responses by mouse and cattle macrophages, bovine trophoblastic cells, mouse and boar splenocytes, and ram buffy coat. Differential serum responses in humans and rams to Brucella infections have been analyzed using high throughput serum antibody screening technology. The Vaxign reverse vaccinology has been used to predict many Brucella vaccine targets. More than 180 Brucella virulence factors and their gene interaction networks have been identified using advanced literature mining methods. The recent development of community-based Vaccine Ontology and Brucellosis Ontology provides an efficient way for Brucella data integration, exchange, and computer-assisted automated reasoning.

  12. RaftProt: mammalian lipid raft proteome database.

    PubMed

    Shah, Anup; Chen, David; Boda, Akash R; Foster, Leonard J; Davis, Melissa J; Hill, Michelle M

    2015-01-01

    RaftProt (http://lipid-raft-database.di.uq.edu.au/) is a database of mammalian lipid raft-associated proteins as reported in high-throughput mass spectrometry studies. Lipid rafts are specialized membrane microdomains enriched in cholesterol and sphingolipids thought to act as dynamic signalling and sorting platforms. Given their fundamental roles in cellular regulation, there is a plethora of information on the size, composition and regulation of these membrane microdomains, including a large number of proteomics studies. To facilitate the mining and analysis of published lipid raft proteomics studies, we have developed a searchable database RaftProt. In addition to browsing the studies, performing basic queries by protein and gene names, searching experiments by cell, tissue and organisms; we have implemented several advanced features to facilitate data mining. To address the issue of potential bias due to biochemical preparation procedures used, we have captured the lipid raft preparation methods and implemented advanced search option for methodology and sample treatment conditions, such as cholesterol depletion. Furthermore, we have identified a list of high confidence proteins, and enabled searching only from this list of likely bona fide lipid raft proteins. Given the apparent biological importance of lipid raft and their associated proteins, this database would constitute a key resource for the scientific community. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. A Network-Based Method to Assess the Statistical Significance of Mild Co-Regulation Effects

    PubMed Central

    Horvát, Emőke-Ágnes; Zhang, Jitao David; Uhlmann, Stefan; Sahin, Özgür; Zweig, Katharina Anna

    2013-01-01

    Recent development of high-throughput, multiplexing technology has initiated projects that systematically investigate interactions between two types of components in biological networks, for instance transcription factors and promoter sequences, or microRNAs (miRNAs) and mRNAs. In terms of network biology, such screening approaches primarily attempt to elucidate relations between biological components of two distinct types, which can be represented as edges between nodes in a bipartite graph. However, it is often desirable not only to determine regulatory relationships between nodes of different types, but also to understand the connection patterns of nodes of the same type. Especially interesting is the co-occurrence of two nodes of the same type, i.e., the number of their common neighbours, which current high-throughput screening analysis fails to address. The co-occurrence gives the number of circumstances under which both of the biological components are influenced in the same way. Here we present SICORE, a novel network-based method to detect pairs of nodes with a statistically significant co-occurrence. We first show the stability of the proposed method on artificial data sets: when randomly adding and deleting observations we obtain reliable results even with noise exceeding the expected level in large-scale experiments. Subsequently, we illustrate the viability of the method based on the analysis of a proteomic screening data set to reveal regulatory patterns of human microRNAs targeting proteins in the EGFR-driven cell cycle signalling system. Since statistically significant co-occurrence may indicate functional synergy and the mechanisms underlying canalization, and thus hold promise in drug target identification and therapeutic development, we provide a platform-independent implementation of SICORE with a graphical user interface as a novel tool in the arsenal of high-throughput screening analysis. PMID:24039936

  14. Differential Denaturation of Serum Proteome Reveals a Significant Amount of Hidden Information in Complex Mixtures of Proteins

    PubMed Central

    Polci, Maria Letizia; Rossi, Stefania; Cordella, Martina; Carlucci, Giuseppe; Marchetti, Paolo; Antonini-Cappellini, Giancarlo; Facchiano, Antonio; D'Arcangelo, Daniela; Facchiano, Francesco

    2013-01-01

    Recently developed proteomic technologies allow to profile thousands of proteins within a high-throughput approach towards biomarker discovery, although results are not as satisfactory as expected. In the present study we demonstrate that serum proteome denaturation is a key underestimated feature; in fact, a new differential denaturation protocol better discriminates serum proteins according to their electrophoretic mobility as compared to single-denaturation protocols. Sixty nine different denaturation treatments were tested and the 3 most discriminating ones were selected (TRIDENT analysis) and applied to human sera, showing a significant improvement of serum protein discrimination as confirmed by MALDI-TOF/MS and LC-MS/MS identification, depending on the type of denaturation applied. Thereafter sera from mice and patients carrying cutaneous melanoma were analyzed through TRIDENT. Nine and 8 protein bands were found differentially expressed in mice and human melanoma sera, compared to healthy controls (p<0.05); three of them were found, for the first time, significantly modulated: α2macroglobulin (down-regulated in melanoma, p<0.001), Apolipoprotein-E and Apolipoprotein-A1 (both up-regulated in melanoma, p<0.04), both in mice and humans. The modulation was confirmed by immunological methods. Other less abundant proteins (e.g. gelsolin) were found significantly modulated (p<0.05). Conclusions: i) serum proteome contains a large amount of information, still neglected, related to proteins folding; ii) a careful serum denaturation may significantly improve analytical procedures involving complex protein mixtures; iii) serum differential denaturation protocol highlights interesting proteomic differences between cancer and healthy sera. PMID:23533572

  15. Analysis of essential gene dynamics under antibiotic stress in Streptococcus sanguinis

    PubMed Central

    El-Rami, Fadi; Kong, Xiangzhen; Parikh, Hardik; Zhu, Bin; Stone, Victoria; Kitten, Todd; Xu, Ping

    2018-01-01

    The paradoxical response of Streptococcus sanguinis to drugs prescribed for dental and clinical practices has complicated treatment guidelines and raised the need for further investigation. We conducted a high throughput study on concomitant transcriptome and proteome dynamics in a time course to assess S. sanguinis behaviour under a sub-inhibitory concentration of ampicillin. Temporal changes at the transcriptome and proteome level were monitored to cover essential genes and proteins over a physiological map of intricate pathways. Our findings revealed that translation was the functional category in S. sanguinis that was most enriched in essential proteins. Moreover, essential proteins in this category demonstrated the greatest conservation across 2774 bacterial proteomes, in comparison to other essential functional categories like cell wall biosynthesis and energy production. In comparison to non-essential proteins, essential proteins were less likely to contain ‘degradation-prone’ amino acids at their N-terminal position, suggesting a longer half-life. Despite the ampicillin-induced stress, the transcriptional up-regulation of amino acid-tRNA synthetases and proteomic elevation of amino acid biosynthesis enzymes favoured the enriched components of essential proteins revealing ‘proteomic signatures’ that can be used to bridge the genotype–phenotype gap of S. sanguinis under ampicillin stress. Furthermore, we identified a significant correlation between the levels of mRNA and protein for essential genes and detected essential protein-enriched pathways differentially regulated through a persistent stress response pattern at late time points. We propose that the current findings will help characterize a bacterial model to study the dynamics of essential genes and proteins under clinically relevant stress conditions. PMID:29393020

  16. PTMScout, a Web Resource for Analysis of High Throughput Post-translational Proteomics Studies*

    PubMed Central

    Naegle, Kristen M.; Gymrek, Melissa; Joughin, Brian A.; Wagner, Joel P.; Welsch, Roy E.; Yaffe, Michael B.; Lauffenburger, Douglas A.; White, Forest M.

    2010-01-01

    The rate of discovery of post-translational modification (PTM) sites is increasing rapidly and is significantly outpacing our biological understanding of the function and regulation of those modifications. To help meet this challenge, we have created PTMScout, a web-based interface for viewing, manipulating, and analyzing high throughput experimental measurements of PTMs in an effort to facilitate biological understanding of protein modifications in signaling networks. PTMScout is constructed around a custom database of PTM experiments and contains information from external protein and post-translational resources, including gene ontology annotations, Pfam domains, and Scansite predictions of kinase and phosphopeptide binding domain interactions. PTMScout functionality comprises data set comparison tools, data set summary views, and tools for protein assignments of peptides identified by mass spectrometry. Analysis tools in PTMScout focus on informed subset selection via common criteria and on automated hypothesis generation through subset labeling derived from identification of statistically significant enrichment of other annotations in the experiment. Subset selection can be applied through the PTMScout flexible query interface available for quantitative data measurements and data annotations as well as an interface for importing data set groupings by external means, such as unsupervised learning. We exemplify the various functions of PTMScout in application to data sets that contain relative quantitative measurements as well as data sets lacking quantitative measurements, producing a set of interesting biological hypotheses. PTMScout is designed to be a widely accessible tool, enabling generation of multiple types of biological hypotheses from high throughput PTM experiments and advancing functional assignment of novel PTM sites. PTMScout is available at http://ptmscout.mit.edu. PMID:20631208

  17. Less is More: Membrane Protein Digestion Beyond Urea–Trypsin Solution for Next-level Proteomics*

    PubMed Central

    Zhang, Xi

    2015-01-01

    The goal of next-level bottom-up membrane proteomics is protein function investigation, via high-coverage high-throughput peptide-centric quantitation of expression, modifications and dynamic structures at systems scale. Yet efficient digestion of mammalian membrane proteins presents a daunting barrier, and prevalent day-long urea–trypsin in-solution digestion proved insufficient to reach this goal. Many efforts contributed incremental advances over past years, but involved protein denaturation that disconnected measurement from functional states. Beyond denaturation, the recent discovery of structure/proteomics omni-compatible detergent n-dodecyl-β-d-maltopyranoside, combined with pepsin and PNGase F columns, enabled breakthroughs in membrane protein digestion: a 2010 DDM-low-TCEP (DLT) method for H/D-exchange (HDX) using human G protein-coupled receptor, and a 2015 flow/detergent-facilitated protease and de-PTM digestions (FDD) for integrative deep sequencing and quantitation using full-length human ion channel complex. Distinguishing protein solubilization from denaturation, protease digestion reliability from theoretical specificity, and reduction from alkylation, these methods shifted day(s)-long paradigms into minutes, and afforded fully automatable (HDX)-protein-peptide-(tandem mass tag)-HPLC pipelines to instantly measure functional proteins at deep coverage, high peptide reproducibility, low artifacts and minimal leakage. Promoting—not destroying—structures and activities harnessed membrane proteins for the next-level streamlined functional proteomics. This review analyzes recent advances in membrane protein digestion methods and highlights critical discoveries for future proteomics. PMID:26081834

  18. Proteogenomic insights into uranium tolerance of a Chernobyl's Microbacterium bacterial isolate.

    PubMed

    Gallois, Nicolas; Alpha-Bazin, Béatrice; Ortet, Philippe; Barakat, Mohamed; Piette, Laurie; Long, Justine; Berthomieu, Catherine; Armengaud, Jean; Chapon, Virginie

    2018-04-15

    Microbacterium oleivorans A9 is a uranium-tolerant actinobacteria isolated from the trench T22 located near the Chernobyl nuclear power plant. This site is contaminated with different radionuclides including uranium. To observe the molecular changes at the proteome level occurring in this strain upon uranyl exposure and understand molecular mechanisms explaining its uranium tolerance, we established its draft genome and used this raw information to perform an in-depth proteogenomics study. High-throughput proteomics were performed on cells exposed or not to 10μM uranyl nitrate sampled at three previously identified phases of uranyl tolerance. We experimentally detected and annotated 1532 proteins and highlighted a total of 591 proteins for which abundances were significantly differing between conditions. Notably, proteins involved in phosphate and iron metabolisms show high dynamics. A large ratio of proteins more abundant upon uranyl stress, are distant from functionally-annotated known proteins, highlighting the lack of fundamental knowledge regarding numerous key molecular players from soil bacteria. Microbacterium oleivorans A9 is an interesting environmental model to understand biological processes engaged in tolerance to radionuclides. Using an innovative proteogenomics approach, we explored its molecular mechanisms involved in uranium tolerance. We sequenced its genome, interpreted high-throughput proteomic data against a six-reading frame ORF database deduced from the draft genome, annotated the identified proteins and compared protein abundances from cells exposed or not to uranyl stress after a cascade search. These data show that a complex cellular response to uranium occurs in Microbacterium oleivorans A9, where one third of the experimental proteome is modified. In particular, the uranyl stress perturbed the phosphate and iron metabolic pathways. Furthermore, several transporters have been identified to be specifically associated to uranyl stress, paving the way to the development of biotechnological tools for uranium decontamination. Copyright © 2017. Published by Elsevier B.V.

  19. Proteomics technique opens new frontiers in mobilome research

    PubMed Central

    Davidson, Andrew D.; Matthews, David A.

    2017-01-01

    ABSTRACT A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the “mobilome,” which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the “domestication” of transposon proteins for cellular functions. Although ‘omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called “proteomics informed by transcriptomics” (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease. PMID:28932623

  20. Dr. Janie Merkel is interviewed by Ryan Blum and Janice Friend.

    PubMed

    Merkel, Janie

    2007-12-01

    Dr. Janie Merkel is the director of Yale's Chemical Genomics Screening Facility, a high-throughput screening laboratory that is part of the Yale University Center for Genomics and Proteomics. The Screening Facility connects Yale researchers with industry-quality robotic machinery and a diverse group of compound libraries, which have been used successfully to link therapeutic targets with potential therapies.

  1. Proteomics of blood-based therapeutics: a promising tool for quality assurance in transfusion medicine.

    PubMed

    Thiele, Thomas; Steil, Leif; Völker, Uwe; Greinacher, Andreas

    2007-01-01

    Blood-based therapeutics are cellular or plasma components derived from human blood. Their production requires appropriate selection and treatment of the donor and processing of cells or plasma proteins. In contrast to clearly defined, chemically synthesized drugs, blood-derived therapeutics are highly complex mixtures of plasma proteins or even more complex cells. Pathogen transmission by the product as well as changes in the integrity of blood constituents resulting in loss of function or immune modulation are currently important issues in transfusion medicine. Protein modifications can occur during various steps of the production process, such as acquisition, enrichment of separate components (e.g. coagulation factors, cell populations), virus inactivation, conservation, and storage. Contemporary proteomic strategies allow a comprehensive assessment of protein modifications with high coverage, offer capabilities for qualitative and even quantitative analysis, and for high-throughput protein identification. Traditionally, proteomics approaches predominantly relied on two-dimensional gel electrophoresis (2-DE). Even if 2-DE is still state of the art, it has inherent limitations that are mainly based on the physicochemical properties of the proteins analyzed; for example, proteins with extremes in molecular mass and hydrophobicity (most membrane proteins) are difficult to assess by 2-DE. These limitations have fostered the development of mass spectrometry centered on non-gel-based separation approaches, which have proven to be highly successful and are thus complementing and even partially replacing 2-DE-based approaches. Although blood constituents have been extensively analyzed by proteomics, this technology has not been widely applied to assess or even improve blood-derived therapeutics, or to monitor the production processes. As proteomic technologies have the capacity to provide comprehensive information about changes occurring during processing and storage of blood products, proteomics can potentially guide improvement of pathogen inactivation procedures and engineering of stem cells, and may also allow a better understanding of factors influencing the immunogenicity of blood-derived therapeutics. An important development in proteomics is the reduction of inter-assay variability. This now allows the screening of samples taken from the same product over time or before and after processing. Optimized preparation procedures and storage conditions will reduce the risk of protein alterations, which in turn may contribute to better recovery, reduced exposure to allogeneic proteins, and increased transfusion safety.

  2. A robust mass spectrometry method for rapid profiling of erythrocyte ghost membrane proteomes.

    PubMed

    Fye, Haddy K S; Mrosso, Paul; Bruce, Lesley; Thézénas, Marie-Laëtitia; Davis, Simon; Fischer, Roman; Rwegasira, Gration L; Makani, Julie; Kessler, Benedikt M

    2018-01-01

    Red blood cell (RBC) physiology is directly linked to many human disorders associated with low tissue oxygen levels or anemia including chronic obstructive pulmonary disease, congenital heart disease, sleep apnea and sickle cell anemia. Parasites such as Plasmodium spp. and phylum Apicomplexa directly target RBCs, and surface molecules within the RBC membrane are critical for pathogen interactions. Proteomics of RBC membrane 'ghost' fractions has therefore been of considerable interest, but protocols described to date are either suboptimal or too extensive to be applicable to a larger set of clinical cohorts. Here, we describe an optimised erythrocyte isolation protocol from blood, tested for various storage conditions and explored using different fractionation conditions for isolating ghost RBC membranes. Liquid chromatography mass spectrometry (LC-MS) analysis on a Q-Exactive Orbitrap instrument was used to profile proteins isolated from the comparative conditions. Data analysis was run on the MASCOT and MaxQuant platforms to assess their scope and diversity. The results obtained demonstrate a robust method for membrane enrichment enabling consistent MS based characterisation of > 900 RBC membrane proteins in single LC-MS/MS analyses. Non-detergent based membrane solubilisation methods using the tissue and supernatant fractions of isolated ghost membranes are shown to offer effective haemoglobin removal as well as diverse recovery including erythrocyte membrane proteins of high and low abundance. The methods described in this manuscript propose a medium to high throughput framework for membrane proteome profiling by LC-MS of potential applicability to larger clinical cohorts in a variety of disease contexts.

  3. toxoMine: an integrated omics data warehouse for Toxoplasma gondii systems biology research

    PubMed Central

    Rhee, David B.; Croken, Matthew McKnight; Shieh, Kevin R.; Sullivan, Julie; Micklem, Gos; Kim, Kami; Golden, Aaron

    2015-01-01

    Toxoplasma gondii (T. gondii) is an obligate intracellular parasite that must monitor for changes in the host environment and respond accordingly; however, it is still not fully known which genetic or epigenetic factors are involved in regulating virulence traits of T. gondii. There are on-going efforts to elucidate the mechanisms regulating the stage transition process via the application of high-throughput epigenomics, genomics and proteomics techniques. Given the range of experimental conditions and the typical yield from such high-throughput techniques, a new challenge arises: how to effectively collect, organize and disseminate the generated data for subsequent data analysis. Here, we describe toxoMine, which provides a powerful interface to support sophisticated integrative exploration of high-throughput experimental data and metadata, providing researchers with a more tractable means toward understanding how genetic and/or epigenetic factors play a coordinated role in determining pathogenicity of T. gondii. As a data warehouse, toxoMine allows integration of high-throughput data sets with public T. gondii data. toxoMine is also able to execute complex queries involving multiple data sets with straightforward user interaction. Furthermore, toxoMine allows users to define their own parameters during the search process that gives users near-limitless search and query capabilities. The interoperability feature also allows users to query and examine data available in other InterMine systems, which would effectively augment the search scope beyond what is available to toxoMine. toxoMine complements the major community database ToxoDB by providing a data warehouse that enables more extensive integrative studies for T. gondii. Given all these factors, we believe it will become an indispensable resource to the greater infectious disease research community. Database URL: http://toxomine.org PMID:26130662

  4. Serum Proteome Analysis for Profiling Predictive Protein Markers Associated with the Severity of Skin Lesions Induced by Ionizing Radiation.

    PubMed

    Chaze, Thibault; Hornez, Louis; Chambon, Christophe; Haddad, Iman; Vinh, Joelle; Peyrat, Jean-Philippe; Benderitter, Marc; Guipaud, Olivier

    2013-07-10

    The finding of new diagnostic and prognostic markers of local radiation injury, and particularly of the cutaneous radiation syndrome, is crucial for its medical management, in the case of both accidental exposure and radiotherapy side effects. Especially, a fast high-throughput method is still needed for triage of people accidentally exposed to ionizing radiation. In this study, we investigated the impact of localized irradiation of the skin on the early alteration of the serum proteome of mice in an effort to discover markers associated with the exposure and severity of impending damage. Using two different large-scale quantitative proteomic approaches, 2D-DIGE-MS and SELDI-TOF-MS, we performed global analyses of serum proteins collected in the clinical latency phase (days 3 and 7) from non-irradiated and locally irradiated mice exposed to high doses of 20, 40 and 80 Gy which will develop respectively erythema, moist desquamation and necrosis. Unsupervised and supervised multivariate statistical analyses (principal component analysis, partial-least square discriminant analysis and Random Forest analysis) using 2D-DIGE quantitative protein data allowed us to discriminate early between non-irradiated and irradiated animals, and between uninjured/slightly injured animals and animals that will develop severe lesions. On the other hand, despite a high number of animal replicates, PLS-DA and Random Forest analyses of SELDI-TOF-MS data failed to reveal sets of MS peaks able to discriminate between the different groups of animals. Our results show that, unlike SELDI-TOF-MS, the 2D-DIGE approach remains a powerful and promising method for the discovery of sets of proteins that could be used for the development of clinical tests for triage and the prognosis of the severity of radiation-induced skin lesions. We propose a list of 15 proteins which constitutes a set of candidate proteins for triage and prognosis of skin lesion outcomes.

  5. Serum Proteome Analysis for Profiling Predictive Protein Markers Associated with the Severity of Skin Lesions Induced by Ionizing Radiation

    PubMed Central

    Chaze, Thibault; Hornez, Louis; Chambon, Christophe; Haddad, Iman; Vinh, Joelle; Peyrat, Jean-Philippe; Benderitter, Marc; Guipaud, Olivier

    2013-01-01

    The finding of new diagnostic and prognostic markers of local radiation injury, and particularly of the cutaneous radiation syndrome, is crucial for its medical management, in the case of both accidental exposure and radiotherapy side effects. Especially, a fast high-throughput method is still needed for triage of people accidentally exposed to ionizing radiation. In this study, we investigated the impact of localized irradiation of the skin on the early alteration of the serum proteome of mice in an effort to discover markers associated with the exposure and severity of impending damage. Using two different large-scale quantitative proteomic approaches, 2D-DIGE-MS and SELDI-TOF-MS, we performed global analyses of serum proteins collected in the clinical latency phase (days 3 and 7) from non-irradiated and locally irradiated mice exposed to high doses of 20, 40 and 80 Gy which will develop respectively erythema, moist desquamation and necrosis. Unsupervised and supervised multivariate statistical analyses (principal component analysis, partial-least square discriminant analysis and Random Forest analysis) using 2D-DIGE quantitative protein data allowed us to discriminate early between non-irradiated and irradiated animals, and between uninjured/slightly injured animals and animals that will develop severe lesions. On the other hand, despite a high number of animal replicates, PLS-DA and Random Forest analyses of SELDI-TOF-MS data failed to reveal sets of MS peaks able to discriminate between the different groups of animals. Our results show that, unlike SELDI-TOF-MS, the 2D-DIGE approach remains a powerful and promising method for the discovery of sets of proteins that could be used for the development of clinical tests for triage and the prognosis of the severity of radiation-induced skin lesions. We propose a list of 15 proteins which constitutes a set of candidate proteins for triage and prognosis of skin lesion outcomes. PMID:28250398

  6. High throughput techniques to reveal the molecular physiology and evolution of digestion in spiders.

    PubMed

    Fuzita, Felipe J; Pinkse, Martijn W H; Patane, José S L; Verhaert, Peter D E M; Lopes, Adriana R

    2016-09-07

    Spiders are known for their predatory efficiency and for their high capacity of digesting relatively large prey. They do this by combining both extracorporeal and intracellular digestion. Whereas many high throughput ("-omics") techniques focus on biomolecules in spider venom, so far this approach has not yet been applied to investigate the protein composition of spider midgut diverticula (MD) and digestive fluid (DF). We here report on our investigations of both MD and DF of the spider Nephilingis (Nephilengys) cruentata through the use of next generation sequencing and shotgun proteomics. This shows that the DF is composed of a variety of hydrolases including peptidases, carbohydrases, lipases and nuclease, as well as of toxins and regulatory proteins. We detect 25 astacins in the DF. Phylogenetic analysis of the corresponding transcript(s) in Arachnida suggests that astacins have acquired an unprecedented role for extracorporeal digestion in Araneae, with different orthologs used by each family. The results of a comparative study of spiders in distinct physiological conditions allow us to propose some digestion mechanisms in this interesting animal taxon. All the high throughput data allowed the demonstration that DF is a secretion originating from the MD. We identified enzymes involved in the extracellular and intracellular phases of digestion. Besides that, data analyses show a large gene duplication event in Araneae digestive process evolution, mainly of astacin genes. We were also able to identify proteins expressed and translated in the digestive system, which until now had been exclusively associated to venom glands.

  7. Characterization of Macrophage Endogenous S-Nitrosoproteome Using a Cysteine-Specific Phosphonate Adaptable Tag in Combination with TiO2 Chromatography.

    PubMed

    Ibáñez-Vea, María; Huang, Honggang; Martínez de Morentin, Xabier; Pérez, Estela; Gato, Maria; Zuazo, Miren; Arasanz, Hugo; Fernández-Irigoyen, Joaquin; Santamaría, Enrique; Fernandez-Hinojal, Gonzalo; Larsen, Martin R; Escors, David; Kochan, Grazyna

    2018-03-02

    Protein S-nitrosylation is a cysteine post-translational modification mediated by nitric oxide. An increasing number of studies highlight S-nitrosylation as an important regulator of signaling involved in numerous cellular processes. Despite the significant progress in the development of redox proteomic methods, identification and quantification of endogeneous S-nitrosylation using high-throughput mass-spectrometry-based methods is a technical challenge because this modification is highly labile. To overcome this drawback, most methods induce S-nitrosylation chemically in proteins using nitrosylating compounds before analysis, with the risk of introducing nonphysiological S-nitrosylation. Here we present a novel method to efficiently identify endogenous S-nitrosopeptides in the macrophage total proteome. Our approach is based on the labeling of S-nitrosopeptides reduced by ascorbate with a cysteine specific phosphonate adaptable tag (CysPAT), followed by titanium dioxide (TiO 2 ) chromatography enrichment prior to nLC-MS/MS analysis. To test our procedure, we performed a large-scale analysis of this low-abundant modification in a murine macrophage cell line. We identified 569 endogeneous S-nitrosylated proteins compared with 795 following exogenous chemically induced S-nitrosylation. Importantly, we discovered 579 novel S-nitrosylation sites. The large number of identified endogenous S-nitrosylated peptides allowed the definition of two S-nitrosylation consensus sites, highlighting protein translation and redox processes as key S-nitrosylation targets in macrophages.

  8. Characterisation of the Manduca sexta sperm proteome: Genetic novelty underlying sperm composition in Lepidoptera.

    PubMed

    Whittington, Emma; Zhao, Qian; Borziak, Kirill; Walters, James R; Dorus, Steve

    2015-07-01

    The application of mass spectrometry based proteomics to sperm biology has greatly accelerated progress in understanding the molecular composition and function of spermatozoa. To date, these approaches have been largely restricted to model organisms, all of which produce a single sperm morph capable of oocyte fertilisation. Here we apply high-throughput mass spectrometry proteomic analysis to characterise sperm composition in Manduca sexta, the tobacco hornworm moth, which produce heteromorphic sperm, including one fertilisation competent (eupyrene) and one incompetent (apyrene) sperm type. This resulted in the high confidence identification of 896 proteins from a co-mixed sample of both sperm types, of which 167 are encoded by genes with strict one-to-one orthology in Drosophila melanogaster. Importantly, over half (55.1%) of these orthologous proteins have previously been identified in the D. melanogaster sperm proteome and exhibit significant conservation in quantitative protein abundance in sperm between the two species. Despite the complex nature of gene expression across spermatogenic stages, a significant correlation was also observed between sperm protein abundance and testis gene expression. Lepidopteran-specific sperm proteins (e.g., proteins with no homology to proteins in non-Lepidopteran taxa) were present in significantly greater abundance on average than those with homology outside the Lepidoptera. Given the disproportionate production of apyrene sperm (96% of all mature sperm in Manduca) relative to eupyrene sperm, these evolutionarily novel and highly abundant proteins are candidates for possessing apyrene-specific functions. Lastly, comparative genomic analyses of testis-expressed, ovary-expressed and sperm genes identified a concentration of novel sperm proteins shared amongst Lepidoptera of potential relevance to the evolutionary origin of heteromorphic spermatogenesis. As the first published Lepidopteran sperm proteome, this whole-cell proteomic characterisation will facilitate future evolutionary genetic and developmental studies of heteromorphic sperm production and parasperm function. Furthermore, the analyses presented here provide useful annotation information regarding sex-biased gene expression, novel Lepidopteran genes and gene function in the male gamete to complement the newly sequenced and annotated Manduca genome. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. FDRAnalysis: a tool for the integrated analysis of tandem mass spectrometry identification results from multiple search engines.

    PubMed

    Wedge, David C; Krishna, Ritesh; Blackhurst, Paul; Siepen, Jennifer A; Jones, Andrew R; Hubbard, Simon J

    2011-04-01

    Confident identification of peptides via tandem mass spectrometry underpins modern high-throughput proteomics. This has motivated considerable recent interest in the postprocessing of search engine results to increase confidence and calculate robust statistical measures, for example through the use of decoy databases to calculate false discovery rates (FDR). FDR-based analyses allow for multiple testing and can assign a single confidence value for both sets and individual peptide spectrum matches (PSMs). We recently developed an algorithm for combining the results from multiple search engines, integrating FDRs for sets of PSMs made by different search engine combinations. Here we describe a web-server and a downloadable application that makes this routinely available to the proteomics community. The web server offers a range of outputs including informative graphics to assess the confidence of the PSMs and any potential biases. The underlying pipeline also provides a basic protein inference step, integrating PSMs into protein ambiguity groups where peptides can be matched to more than one protein. Importantly, we have also implemented full support for the mzIdentML data standard, recently released by the Proteomics Standards Initiative, providing users with the ability to convert native formats to mzIdentML files, which are available to download.

  10. FDRAnalysis: A tool for the integrated analysis of tandem mass spectrometry identification results from multiple search engines

    PubMed Central

    Wedge, David C; Krishna, Ritesh; Blackhurst, Paul; Siepen, Jennifer A; Jones, Andrew R.; Hubbard, Simon J.

    2013-01-01

    Confident identification of peptides via tandem mass spectrometry underpins modern high-throughput proteomics. This has motivated considerable recent interest in the post-processing of search engine results to increase confidence and calculate robust statistical measures, for example through the use of decoy databases to calculate false discovery rates (FDR). FDR-based analyses allow for multiple testing and can assign a single confidence value for both sets and individual peptide spectrum matches (PSMs). We recently developed an algorithm for combining the results from multiple search engines, integrating FDRs for sets of PSMs made by different search engine combinations. Here we describe a web-server, and a downloadable application, which makes this routinely available to the proteomics community. The web server offers a range of outputs including informative graphics to assess the confidence of the PSMs and any potential biases. The underlying pipeline provides a basic protein inference step, integrating PSMs into protein ambiguity groups where peptides can be matched to more than one protein. Importantly, we have also implemented full support for the mzIdentML data standard, recently released by the Proteomics Standards Initiative, providing users with the ability to convert native formats to mzIdentML files, which are available to download. PMID:21222473

  11. The Urine Proteome as a Biomarker of Radiation Injury

    PubMed Central

    Sharma, Mukut; Halligan, Brian D.; Wakim, Bassam T.; Savin, Virginia J.; Cohen, Eric P.; Moulder, John E.

    2009-01-01

    Terrorist attacks or nuclear accidents could expose large numbers of people to ionizing radiation, and early biomarkers of radiation injury would be critical for triage, treatment and follow-up of such individuals. However, no such biomarkers have yet been proven to exist. We tested the potential of high throughput proteomics to identify protein biomarkers of radiation injury after total body X-ray irradiation in a rat model. Subtle functional changes in the kidney are suggested by an increased glomerular permeability for macromolecules measured within 24 hours after TBI. Ultrastructural changes in glomerular podocytes include partial loss of the interdigitating organization of foot processes. Analysis of urine by LC-MS/MS and 2D-GE showed significant changes in the urine proteome within 24 hours after TBI. Tissue kallikrein 1-related peptidase, cysteine proteinase inhibitor cystatin C and oxidized histidine were found to be increased while a number of proteinase inhibitors including kallikrein-binding protein and albumin were found to be decreased post-irradiation. Thus, TBI causes immediately detectable changes in renal structure and function and in the urinary protein profile. This suggests that both systemic and renal changes are induced by radiation and it may be possible to identify a set of biomarkers unique to radiation injury. PMID:19746194

  12. Optimization strategies for a fluorescent dye with bimodal excitation spectra: application to semiautomated proteomics

    NASA Astrophysics Data System (ADS)

    Patton, Wayne F.; Berggren, Kiera N.; Lopez, Mary F.

    2001-04-01

    Facilities engaged in proteome analysis differ significantly in the degree that they implement automated systems for high-throughput protein characterization. Though automated workstation environments are becoming more routine in the biotechnology and pharmaceutical sectors of industry, university-based laboratories often perform these tasks manually, submitting protein spots excised from polyacrylamide gels to institutional core facilities for identification. For broad compatibility with imaging platforms, an optimized fluorescent dye developed for proteomics applications should be designed taking into account that laser scanners use visible light excitation and that charge-coupled device camera systems and gas discharge transilluminators rely upon UV excitation. The luminescent ruthenium metal complex, SYPRO Ruby protein gel stain, is compatible with a variety of excitation sources since it displays intense UV (280 nm) and visible (470 nm) absorption maxima. Localization is achieved by noncovalent, electrostatic and hydrophobic binding of dye to proteins, with signal being detected at 610 nm. Since proteins are not covalently modified by the dye, compatibility with downstream microchemical characterization techniques such as matrix-assisted laser desorption/ionization-mass spectrometry is assured. Protocols have been devised for optimizing fluorophore intensity. SYPRO Ruby dye outperforms alternatives such as silver staining in terms of quantitative capabilities, compatibility with mass spectrometry and ease of integration into automated work environments.

  13. Quantitative Proteomics of Sleep-Deprived Mouse Brains Reveals Global Changes in Mitochondrial Proteins

    PubMed Central

    Li, Tie-Mei; Zhang, Ju-en; Lin, Rui; Chen, She; Luo, Minmin; Dong, Meng-Qiu

    2016-01-01

    Sleep is a ubiquitous, tightly regulated, and evolutionarily conserved behavior observed in almost all animals. Prolonged sleep deprivation can be fatal, indicating that sleep is a physiological necessity. However, little is known about its core function. To gain insight into this mystery, we used advanced quantitative proteomics technology to survey the global changes in brain protein abundance. Aiming to gain a comprehensive profile, our proteomics workflow included filter-aided sample preparation (FASP), which increased the coverage of membrane proteins; tandem mass tag (TMT) labeling, for relative quantitation; and high resolution, high mass accuracy, high throughput mass spectrometry (MS). In total, we obtained the relative abundance ratios of 9888 proteins encoded by 6070 genes. Interestingly, we observed significant enrichment for mitochondrial proteins among the differentially expressed proteins. This finding suggests that sleep deprivation strongly affects signaling pathways that govern either energy metabolism or responses to mitochondrial stress. Additionally, the differentially-expressed proteins are enriched in pathways implicated in age-dependent neurodegenerative diseases, including Parkinson’s, Huntington’s, and Alzheimer’s, hinting at possible connections between sleep loss, mitochondrial stress, and neurodegeneration. PMID:27684481

  14. Identification of lactoferricin B intracellular targets using an Escherichia coli proteome chip.

    PubMed

    Tu, Yu-Hsuan; Ho, Yu-Hsuan; Chuang, Ying-Chih; Chen, Po-Chung; Chen, Chien-Sheng

    2011-01-01

    Lactoferricin B (LfcinB) is a well-known antimicrobial peptide. Several studies have indicated that it can inhibit bacteria by affecting intracellular activities, but the intracellular targets of this antimicrobial peptide have not been identified. Therefore, we used E. coli proteome chips to identify the intracellular target proteins of LfcinB in a high-throughput manner. We probed LfcinB with E. coli proteome chips and further conducted normalization and Gene Ontology (GO) analyses. The results of the GO analyses showed that the identified proteins were associated with metabolic processes. Moreover, we validated the interactions between LfcinB and chip assay-identified proteins with fluorescence polarization (FP) assays. Sixteen proteins were identified, and an E. coli interaction database (EcID) analysis revealed that the majority of the proteins that interact with these 16 proteins affected the tricarboxylic acid (TCA) cycle. Knockout assays were conducted to further validate the FP assay results. These results showed that phosphoenolpyruvate carboxylase was a target of LfcinB, indicating that one of its mechanisms of action may be associated with pyruvate metabolism. Thus, we used pyruvate assays to conduct an in vivo validation of the relationship between LfcinB and pyruvate level in E. coli. These results showed that E. coli exposed to LfcinB had abnormal pyruvate amounts, indicating that LfcinB caused an accumulation of pyruvate. In conclusion, this study successfully revealed the intracellular targets of LfcinB using an E. coli proteome chip approach.

  15. Identification of Lactoferricin B Intracellular Targets Using an Escherichia coli Proteome Chip

    PubMed Central

    Chen, Po-Chung; Chen, Chien-Sheng

    2011-01-01

    Lactoferricin B (LfcinB) is a well-known antimicrobial peptide. Several studies have indicated that it can inhibit bacteria by affecting intracellular activities, but the intracellular targets of this antimicrobial peptide have not been identified. Therefore, we used E. coli proteome chips to identify the intracellular target proteins of LfcinB in a high-throughput manner. We probed LfcinB with E. coli proteome chips and further conducted normalization and Gene Ontology (GO) analyses. The results of the GO analyses showed that the identified proteins were associated with metabolic processes. Moreover, we validated the interactions between LfcinB and chip assay-identified proteins with fluorescence polarization (FP) assays. Sixteen proteins were identified, and an E. coli interaction database (EcID) analysis revealed that the majority of the proteins that interact with these 16 proteins affected the tricarboxylic acid (TCA) cycle. Knockout assays were conducted to further validate the FP assay results. These results showed that phosphoenolpyruvate carboxylase was a target of LfcinB, indicating that one of its mechanisms of action may be associated with pyruvate metabolism. Thus, we used pyruvate assays to conduct an in vivo validation of the relationship between LfcinB and pyruvate level in E. coli. These results showed that E. coli exposed to LfcinB had abnormal pyruvate amounts, indicating that LfcinB caused an accumulation of pyruvate. In conclusion, this study successfully revealed the intracellular targets of LfcinB using an E. coli proteome chip approach. PMID:22164243

  16. Fibre optical spectroscopy and sensing innovation at innoFSPEC Potsdam

    NASA Astrophysics Data System (ADS)

    Haynes, Roger; Reich, Oliver; Rambold, William; Hass, Roland; Janssen, Katja

    2010-07-01

    In October 2009, an interdisciplinary centre for fibre spectroscopy and sensing, innoFSPEC Potsdam, has been established as joint initiative of the Astrophysikalisches Institut Potsdam (AIP) and the Physical Chemistry group of Potsdam University (UPPC), Germany. The centre focuses on fundamental research in the two fields of fibre-coupled multi-channel spectroscopy and optical fibre-based sensing. Thanks to its interdisciplinary approach, the complementary methodologies of astrophysics on the one hand, and physical chemistry on the other hand, are expected to spawn synergies that otherwise would not normally become available in more standard research programmes. innoFSPEC Potsdam targets future innovations for next generation astrophysical instrumentation, environmental analysis, manufacturing control and process analysis, medical diagnostics, non-invasive imaging spectroscopy, biopsy, genomics/proteomics, high throughput screening, and related applications.

  17. The peripheral blood proteome signature of idiopathic pulmonary fibrosis is distinct from normal and is associated with novel immunological processes.

    PubMed

    O'Dwyer, David N; Norman, Katy C; Xia, Meng; Huang, Yong; Gurczynski, Stephen J; Ashley, Shanna L; White, Eric S; Flaherty, Kevin R; Martinez, Fernando J; Murray, Susan; Noth, Imre; Arnold, Kelly B; Moore, Bethany B

    2017-04-25

    Idiopathic pulmonary fibrosis (IPF) is a progressive and fatal interstitial pneumonia. The disease pathophysiology is poorly understood and the etiology remains unclear. Recent advances have generated new therapies and improved knowledge of the natural history of IPF. These gains have been brokered by advances in technology and improved insight into the role of various genes in mediating disease, but gene expression and protein levels do not always correlate. Thus, in this paper we apply a novel large scale high throughput aptamer approach to identify more than 1100 proteins in the peripheral blood of well-characterized IPF patients and normal volunteers. We use systems biology approaches to identify a unique IPF proteome signature and give insight into biological processes driving IPF. We found IPF plasma to be altered and enriched for proteins involved in defense response, wound healing and protein phosphorylation when compared to normal human plasma. Analysis also revealed a minimal protein signature that differentiated IPF patients from normal controls, which may allow for accurate diagnosis of IPF based on easily-accessible peripheral blood. This report introduces large scale unbiased protein discovery analysis to IPF and describes distinct biological processes that further inform disease biology.

  18. Sperm Proteome: What Is on the Horizon?

    PubMed

    Mohanty, Gayatri; Swain, Nirlipta; Samanta, Luna

    2015-06-01

    As the mammalian spermatozoa transcends from the testis to the end of the epididymal tubule, the functionally incompetent spermatozoa acquires its fertilizing capability. Molecular changes in the spermatozoa at the posttesticular level concern qualitative and quantitative modifications of proteins along with their sugar moieties and membranous lipids mostly associated with motility, egg binding, and penetration processes. Proteomic studies have identified numerous sperm-specific proteins, and recent reports have provided a further understanding of their function with respect to male fertility. High-throughput techniques such as mass spectrometry have shown drastic potential for the identification and study of sperm proteins. In fact, compelling evidence has provided that proteins are critically important in cellular remodeling event and that aberrant expression is associated with pronounced defects in sperm function. This review highlights the posttesticular functional transformation in the epididymis and female reproductive tract with due emphasis on proteomics. © The Author(s) 2014.

  19. Approaches for Defining the Hsp90-dependent Proteome

    PubMed Central

    Hartson, Steven D.; Matts, Robert L.

    2011-01-01

    Hsp90 is the target of ongoing drug discovery studies seeking new compounds to treat cancer, neurodegenerative diseases, and protein folding disorders. To better understand Hsp90’s roles in cellular pathologies and in normal cells, numerous studies have utilized proteomics assays and related high-throughput tools to characterize its physical and functional protein partnerships. This review surveys these studies, and summarizes the strengths and limitations of the individual attacks. We also include downloadable spreadsheets compiling all of the Hsp90-interacting proteins identified in more than 23 studies. These tools include cross-references among gene aliases, human homologues of yeast Hsp90-interacting proteins, hyperlinks to database entries, summaries of canonical pathways that are enriched in the Hsp90 interactome, and additional bioinformatic annotations. In addition to summarizing Hsp90 proteomics studies performed to date and the insights they have provided, we identify gaps in our current understanding of Hsp90-mediated proteostasis. PMID:21906632

  20. High-Resolution Enabled 12-Plex DiLeu Isobaric Tags for Quantitative Proteomics

    PubMed Central

    2015-01-01

    Multiplex isobaric tags (e.g., tandem mass tags (TMT) and isobaric tags for relative and absolute quantification (iTRAQ)) are a valuable tool for high-throughput mass spectrometry based quantitative proteomics. We have developed our own multiplex isobaric tags, DiLeu, that feature quantitative performance on par with commercial offerings but can be readily synthesized in-house as a cost-effective alternative. In this work, we achieve a 3-fold increase in the multiplexing capacity of the DiLeu reagent without increasing structural complexity by exploiting mass defects that arise from selective incorporation of 13C, 15N, and 2H stable isotopes in the reporter group. The inclusion of eight new reporter isotopologues that differ in mass from the existing four reporters by intervals of 6 mDa yields a 12-plex isobaric set that preserves the synthetic simplicity and quantitative performance of the original implementation. We show that the new reporter variants can be baseline-resolved in high-resolution higher-energy C-trap dissociation (HCD) spectra, and we demonstrate accurate 12-plex quantitation of a DiLeu-labeled Saccharomyces cerevisiae lysate digest via high-resolution nano liquid chromatography–tandem mass spectrometry (nanoLC–MS2) analysis on an Orbitrap Elite mass spectrometer. PMID:25405479

  1. Curated protein information in the Saccharomyces genome database.

    PubMed

    Hellerstedt, Sage T; Nash, Robert S; Weng, Shuai; Paskov, Kelley M; Wong, Edith D; Karra, Kalpana; Engel, Stacia R; Cherry, J Michael

    2017-01-01

    Due to recent advancements in the production of experimental proteomic data, the Saccharomyces genome database (SGD; www.yeastgenome.org ) has been expanding our protein curation activities to make new data types available to our users. Because of broad interest in post-translational modifications (PTM) and their importance to protein function and regulation, we have recently started incorporating expertly curated PTM information on individual protein pages. Here we also present the inclusion of new abundance and protein half-life data obtained from high-throughput proteome studies. These new data types have been included with the aim to facilitate cellular biology research. : www.yeastgenome.org. © The Author(s) 2017. Published by Oxford University Press.

  2. Effects of Perfluorooctanoic Acid on Metabolic Profiles in Brain and Liver of Mouse Revealed by a High-throughput Targeted Metabolomics Approach

    NASA Astrophysics Data System (ADS)

    Yu, Nanyang; Wei, Si; Li, Meiying; Yang, Jingping; Li, Kan; Jin, Ling; Xie, Yuwei; Giesy, John P.; Zhang, Xiaowei; Yu, Hongxia

    2016-04-01

    Perfluorooctanoic acid (PFOA), a perfluoroalkyl acid, can result in hepatotoxicity and neurobehavioral effects in animals. The metabolome, which serves as a connection among transcriptome, proteome and toxic effects, provides pathway-based insights into effects of PFOA. Since understanding of changes in the metabolic profile during hepatotoxicity and neurotoxicity were still incomplete, a high-throughput targeted metabolomics approach (278 metabolites) was used to investigate effects of exposure to PFOA for 28 d on brain and liver of male Balb/c mice. Results of multivariate statistical analysis indicated that PFOA caused alterations in metabolic pathways in exposed individuals. Pathway analysis suggested that PFOA affected metabolism of amino acids, lipids, carbohydrates and energetics. Ten and 18 metabolites were identified as potential unique biomarkers of exposure to PFOA in brain and liver, respectively. In brain, PFOA affected concentrations of neurotransmitters, including serotonin, dopamine, norepinephrine, and glutamate in brain, which provides novel insights into mechanisms of PFOA-induced neurobehavioral effects. In liver, profiles of lipids revealed involvement of β-oxidation and biosynthesis of saturated and unsaturated fatty acids in PFOA-induced hepatotoxicity, while alterations in metabolism of arachidonic acid suggesting potential of PFOA to cause inflammation response in liver. These results provide insight into the mechanism and biomarkers for PFOA-induced effects.

  3. Low-molecular-weight color pI markers to monitor on-line the peptide focusing process in OFFGEL fractionation.

    PubMed

    Michelland, Sylvie; Bourgoin-Voillard, Sandrine; Cunin, Valérie; Tollance, Axel; Bertolino, Pascal; Slais, Karel; Seve, Michel

    2017-08-01

    High-throughput mass spectrometry-based proteomic analysis requires peptide fractionation to simplify complex biological samples and increase proteome coverage. OFFGEL fractionation technology became a common method to separate peptides or proteins using isoelectric focusing in an immobilized pH gradient. However, the OFFGEL focusing process may be further optimized and controlled in terms of separation time and pI resolution. Here we evaluated OFFGEL technology to separate peptides from different samples in the presence of low-molecular-weight (LMW) color pI markers to visualize the focusing process. LMW color pI markers covering a large pH range were added to the peptide mixture before OFFGEL fractionation using a 24-wells device encompassing the pH range 3-10. We also explored the impact of LMW color pI markers on peptide fractionation labeled previously for iTRAQ. Then, fractionated peptides were separated by RP_HPLC prior to MS analysis using MALDI-TOF/TOF mass spectrometry in MS and MS/MS modes. Here we report the performance of the peptide focusing process in the presence of LMW color pI markers as on-line trackers during the OFFGEL process and the possibility to use them as pI controls for peptide focusing. This method improves the workflow for peptide fractionation in a bottom-up proteomic approach with or without iTRAQ labeling. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Microfluidic liquid chromatography system for proteomic applications and biomarker screening.

    PubMed

    Lazar, Iulia M; Trisiripisal, Phichet; Sarvaiya, Hetal A

    2006-08-01

    A microfluidic liquid chromatography (LC) system for proteomic investigations that integrates all the necessary components for stand-alone operation, i.e., pump, valve, separation column, and electrospray interface, is described in this paper. The overall size of the LC device is small enough to enable the integration of two fully functional separation systems on a 3 in. x 1 in. glass microchip. A multichannel architecture that uses electroosmotic pumping principles provides the necessary functionality for eluent propulsion and sample valving. The flow rates generated within these chips are fully consistent with the requirements of nano-LC platforms that are routinely used in proteomic applications. The microfluidic device was evaluated for the analysis of a protein digest obtained from the MCF7 breast cancer cell line. The cytosolic protein extract was processed according to a shotgun protocol, and after tryptic digestion and prefractionation using strong cation exchange chromatography (SCX), selected sample subfractions were analyzed with conventional and microfluidic LC platforms. Using similar experimental conditions, the performance of the microchip LC was comparable to that obtained with benchtop instrumentation, providing an overlap of 75% in proteins that were identified by more than two unique peptides. The microfluidic LC analysis of a protein-rich SCX fraction enabled the confident identification of 77 proteins by using conventional data filtering parameters, of 39 proteins with p < 0.001, and of 5 proteins that are known to be cancer-specific biomarkers, demonstrating thus the potential applicability of these chips for future high-throughput biomarker screening applications.

  5. High throughput gene expression profiling: a molecular approach to integrative physiology

    PubMed Central

    Liang, Mingyu; Cowley, Allen W; Greene, Andrew S

    2004-01-01

    Integrative physiology emphasizes the importance of understanding multiple pathways with overlapping, complementary, or opposing effects and their interactions in the context of intact organisms. The DNA microarray technology, the most commonly used method for high-throughput gene expression profiling, has been touted as an integrative tool that provides insights into regulatory pathways. However, the physiology community has been slow in acceptance of these techniques because of early failure in generating useful data and the lack of a cohesive theoretical framework in which experiments can be analysed. With recent advances in both technology and analysis, we propose a concept of multidimensional integration of physiology that incorporates data generated by DNA microarray and other functional, genomic, and proteomic approaches to achieve a truly integrative understanding of physiology. Analysis of several studies performed in simpler organisms or in mammalian model animals supports the feasibility of such multidimensional integration and demonstrates the power of DNA microarray as an indispensable molecular tool for such integration. Evaluation of DNA microarray techniques indicates that these techniques, despite limitations, have advanced to a point where the question-driven profiling research has become a feasible complement to the conventional, hypothesis-driven research. With a keen sense of homeostasis, global regulation, and quantitative analysis, integrative physiologists are uniquely positioned to apply these techniques to enhance the understanding of complex physiological functions. PMID:14678487

  6. Less is More: Membrane Protein Digestion Beyond Urea-Trypsin Solution for Next-level Proteomics.

    PubMed

    Zhang, Xi

    2015-09-01

    The goal of next-level bottom-up membrane proteomics is protein function investigation, via high-coverage high-throughput peptide-centric quantitation of expression, modifications and dynamic structures at systems scale. Yet efficient digestion of mammalian membrane proteins presents a daunting barrier, and prevalent day-long urea-trypsin in-solution digestion proved insufficient to reach this goal. Many efforts contributed incremental advances over past years, but involved protein denaturation that disconnected measurement from functional states. Beyond denaturation, the recent discovery of structure/proteomics omni-compatible detergent n-dodecyl-β-d-maltopyranoside, combined with pepsin and PNGase F columns, enabled breakthroughs in membrane protein digestion: a 2010 DDM-low-TCEP (DLT) method for H/D-exchange (HDX) using human G protein-coupled receptor, and a 2015 flow/detergent-facilitated protease and de-PTM digestions (FDD) for integrative deep sequencing and quantitation using full-length human ion channel complex. Distinguishing protein solubilization from denaturation, protease digestion reliability from theoretical specificity, and reduction from alkylation, these methods shifted day(s)-long paradigms into minutes, and afforded fully automatable (HDX)-protein-peptide-(tandem mass tag)-HPLC pipelines to instantly measure functional proteins at deep coverage, high peptide reproducibility, low artifacts and minimal leakage. Promoting-not destroying-structures and activities harnessed membrane proteins for the next-level streamlined functional proteomics. This review analyzes recent advances in membrane protein digestion methods and highlights critical discoveries for future proteomics. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. Lessons we learned from high-throughput and top-down systems biology analyses about glioma stem cells.

    PubMed

    Mock, Andreas; Chiblak, Sara; Herold-Mende, Christel

    2014-01-01

    A growing body of evidence suggests that glioma stem cells (GSCs) account for tumor initiation, therapy resistance, and the subsequent regrowth of gliomas. Thus, continuous efforts have been undertaken to further characterize this subpopulation of less differentiated tumor cells. Although we are able to enrich GSCs, we still lack a comprehensive understanding of GSC phenotypes and behavior. The advent of high-throughput technologies raised hope that incorporation of these newly developed platforms would help to tackle such questions. Since then a couple of comparative genome-, transcriptome- and proteome-wide studies on GSCs have been conducted giving new insights in GSC biology. However, lessons had to be learned in designing high-throughput experiments and some of the resulting conclusions fell short of expectations because they were performed on only a few GSC lines or at one molecular level instead of an integrative poly-omics approach. Despite these shortcomings, our knowledge of GSC biology has markedly expanded due to a number of survival-associated biomarkers as well as glioma-relevant signaling pathways and therapeutic targets being identified. In this article we review recent findings obtained by comparative high-throughput analyses of GSCs. We further summarize fundamental concepts of systems biology as well as its applications for glioma stem cell research.

  8. Computer-based fluorescence quantification: a novel approach to study nucleolar biology

    PubMed Central

    2011-01-01

    Background Nucleoli are composed of possibly several thousand different proteins and represent the most conspicuous compartments in the nucleus; they play a crucial role in the proper execution of many cellular processes. As such, nucleoli carry out ribosome biogenesis and sequester or associate with key molecules that regulate cell cycle progression, tumorigenesis, apoptosis and the stress response. Nucleoli are dynamic compartments that are characterized by a constant flux of macromolecules. Given the complex and dynamic composition of the nucleolar proteome, it is challenging to link modifications in nucleolar composition to downstream effects. Results In this contribution, we present quantitative immunofluorescence methods that rely on computer-based image analysis. We demonstrate the effectiveness of these techniques by monitoring the dynamic association of proteins and RNA with nucleoli under different physiological conditions. Thus, the protocols described by us were employed to study stress-dependent changes in the nucleolar concentration of endogenous and GFP-tagged proteins. Furthermore, our methods were applied to measure de novo RNA synthesis that is associated with nucleoli. We show that the techniques described here can be easily combined with automated high throughput screening (HTS) platforms, making it possible to obtain large data sets and analyze many of the biological processes that are located in nucleoli. Conclusions Our protocols set the stage to analyze in a quantitative fashion the kinetics of shuttling nucleolar proteins, both at the single cell level as well as for a large number of cells. Moreover, the procedures described here are compatible with high throughput image acquisition and analysis using HTS automated platforms, thereby providing the basis to quantify nucleolar components and activities for numerous samples and experimental conditions. Together with the growing amount of information obtained for the nucleolar proteome, improvements in quantitative microscopy as they are described here can be expected to produce new insights into the complex biological functions that are orchestrated by the nucleolus. PMID:21639891

  9. Proteomics of gliomas: Initial biomarker discovery and evolution of technology

    PubMed Central

    Kalinina, Juliya; Peng, Junmin; Ritchie, James C.; Van Meir, Erwin G.

    2011-01-01

    Gliomas are a group of aggressive brain tumors that diffusely infiltrate adjacent brain tissues, rendering them largely incurable, even with multiple treatment modalities and agents. Mostly asymptomatic at early stages, they present in several subtypes with astrocytic or oligodendrocytic features and invariably progress to malignant forms. Gliomas are difficult to classify precisely because of interobserver variability during histopathologic grading. Identifying biological signatures of each glioma subtype through protein biomarker profiling of tumor or tumor-proximal fluids is therefore of high priority. Such profiling not only may provide clues regarding tumor classification but may identify clinical biomarkers and pathologic targets for the development of personalized treatments. In the past decade, differential proteomic profiling techniques have utilized tumor, cerebrospinal fluid, and plasma from glioma patients to identify the first candidate diagnostic, prognostic, predictive, and therapeutic response markers, highlighting the potential for glioma biomarker discovery. The number of markers identified, however, has been limited, their reproducibility between studies is unclear, and none have been validated for clinical use. Recent technological advancements in methodologies for high-throughput profiling, which provide easy access, rapid screening, low sample consumption, and accurate protein identification, are anticipated to accelerate brain tumor biomarker discovery. Reliable tools for biomarker verification forecast translation of the biomarkers into clinical diagnostics in the foreseeable future. Herein we update the reader on the recent trends and directions in glioma proteomics, including key findings and established and emerging technologies for analysis, together with challenges we are still facing in identifying and verifying potential glioma biomarkers. PMID:21852429

  10. Differential proteomics reveals the hallmarks of seed development in common bean (Phaseolus vulgaris L.).

    PubMed

    Parreira, J R; Bouraada, J; Fitzpatrick, M A; Silvestre, S; Bernardes da Silva, A; Marques da Silva, J; Almeida, A M; Fevereiro, P; Altelaar, A F M; Araújo, S S

    2016-06-30

    Common bean (Phaseolus vulgaris L.) is one of the most consumed staple foods worldwide. Little is known about the molecular mechanisms controlling seed development. This study aims to comprehensively describe proteome dynamics during seed development of common bean. A high-throughput gel-free proteomics approach (LC-MS/MS) was conducted on seeds at 10, 20, 30 and 40days after anthesis, spanning from late embryogenesis until desiccation. Of the 418 differentially accumulated proteins identified, 255 were characterized, most belonging to protein metabolism. An accumulation of proteins belonging to the MapMan functional categories of "protein", "glycolysis", "TCA", "DNA", "RNA", "cell" and "stress" were found at early seed development stages, reflecting an extensive metabolic activity. In the mid stages, accumulation of storage, signaling, starch synthesis and cell wall-related proteins stood out. In the later stages, an increase in proteins related to redox, protein degradation/modification/folding and nucleic acid metabolisms reflect that seed desiccation-resistance mechanisms were activated. Our study unveils new clues to understand the regulation of seed development mediated by post-translational modifications and maintenance of genome integrity. This knowledge enhances the understanding on seed development molecular mechanisms that may be used in the design and selection of common bean seeds with desired quality traits. Common bean (P. vulgaris) is an important source of proteins and carbohydrates worldwide. Despite the agronomic and economic importance of this pulse, knowledge on common bean seed development is limited. Herein, a gel-free high throughput methodology was used to describe the proteome changes during P. vulgaris seed development. Data obtained will enhance the knowledge on the molecular mechanisms controlling this grain legume seed development and may be used in the design and selection of common bean seeds with desired quality traits. Results may be extrapolated to other pulses. Copyright © 2016 Elsevier B.V. All rights reserved.

  11. A Library of Phosphoproteomic and Chromatin Signatures for Characterizing Cellular Responses to Drug Perturbations.

    PubMed

    Litichevskiy, Lev; Peckner, Ryan; Abelin, Jennifer G; Asiedu, Jacob K; Creech, Amanda L; Davis, John F; Davison, Desiree; Dunning, Caitlin M; Egertson, Jarrett D; Egri, Shawn; Gould, Joshua; Ko, Tak; Johnson, Sarah A; Lahr, David L; Lam, Daniel; Liu, Zihan; Lyons, Nicholas J; Lu, Xiaodong; MacLean, Brendan X; Mungenast, Alison E; Officer, Adam; Natoli, Ted E; Papanastasiou, Malvina; Patel, Jinal; Sharma, Vagisha; Toder, Courtney; Tubelli, Andrew A; Young, Jennie Z; Carr, Steven A; Golub, Todd R; Subramanian, Aravind; MacCoss, Michael J; Tsai, Li-Huei; Jaffe, Jacob D

    2018-04-25

    Although the value of proteomics has been demonstrated, cost and scale are typically prohibitive, and gene expression profiling remains dominant for characterizing cellular responses to perturbations. However, high-throughput sentinel assays provide an opportunity for proteomics to contribute at a meaningful scale. We present a systematic library resource (90 drugs × 6 cell lines) of proteomic signatures that measure changes in the reduced-representation phosphoproteome (P100) and changes in epigenetic marks on histones (GCP). A majority of these drugs elicited reproducible signatures, but notable cell line- and assay-specific differences were observed. Using the "connectivity" framework, we compared signatures across cell types and integrated data across assays, including a transcriptional assay (L1000). Consistent connectivity among cell types revealed cellular responses that transcended lineage, and consistent connectivity among assays revealed unexpected associations between drugs. We further leveraged the resource against public data to formulate hypotheses for treatment of multiple myeloma and acute lymphocytic leukemia. This resource is publicly available at https://clue.io/proteomics. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  12. Intact mass detection, interpretation, and visualization to automate Top-Down proteomics on a large scale

    PubMed Central

    Durbin, Kenneth R.; Tran, John C.; Zamdborg, Leonid; Sweet, Steve M. M.; Catherman, Adam D.; Lee, Ji Eun; Li, Mingxi; Kellie, John F.; Kelleher, Neil L.

    2011-01-01

    Applying high-throughput Top-Down MS to an entire proteome requires a yet-to-be-established model for data processing. Since Top-Down is becoming possible on a large scale, we report our latest software pipeline dedicated to capturing the full value of intact protein data in automated fashion. For intact mass detection, we combine algorithms for processing MS1 data from both isotopically resolved (FT) and charge-state resolved (ion trap) LC-MS data, which are then linked to their fragment ions for database searching using ProSight. Automated determination of human keratin and tubulin isoforms is one result. Optimized for the intricacies of whole proteins, new software modules visualize proteome-scale data based on the LC retention time and intensity of intact masses and enable selective detection of PTMs to automatically screen for acetylation, phosphorylation, and methylation. Software functionality was demonstrated using comparative LC-MS data from yeast strains in addition to human cells undergoing chemical stress. We further these advances as a key aspect of realizing Top-Down MS on a proteomic scale. PMID:20848673

  13. Quantitative proteomic analysis in breast cancer.

    PubMed

    Tabchy, A; Hennessy, B T; Gonzalez-Angulo, A M; Bernstam, F M; Lu, Y; Mills, G B

    2011-02-01

    Much progress has recently been made in the genomic and transcriptional characterization of tumors. However, historically the characterization of cells at the protein level has suffered limitations in reproducibility, scalability and robustness. Recent technological advances have made it possible to accurately and reproducibly portray the global levels and active states of cellular proteins. Protein microarrays examine the native post-translational conformations of proteins including activated phosphorylated states, in a comprehensive high-throughput mode, and can map activated pathways and networks of proteins inside the cells. The reverse-phase protein microarray (RPPA) offers a unique opportunity to study signal transduction networks in small biological samples such as human biopsy material and can provide critical information for therapeutic decision-making and the monitoring of patients for targeted molecular medicine. By providing the key missing link to the story generated from genomic and gene expression characterization efforts, functional proteomics offer the promise of a comprehensive understanding of cancer. Several initial successes in breast cancer are showing that such information is clinically relevant. Copyright 2011 Prous Science, S.A.U. or its licensors. All rights reserved.

  14. A High-Resolution Proteomic Landscaping of Primary Human Dental Stem Cells: Identification of SHED- and PDLSC-Specific Biomarkers.

    PubMed

    Taraslia, Vasiliki; Lymperi, Stefania; Pantazopoulou, Vasiliki; Anagnostopoulos, Athanasios K; Papassideri, Issidora S; Basdra, Efthimia K; Bei, Marianna; Kontakiotis, Evangelos G; Tsangaris, George Th; Stravopodis, Dimitrios J; Anastasiadou, Ema

    2018-01-05

    Dental stem cells (DSCs) have emerged as a promising tool for basic research and clinical practice. A variety of adult stem cell (ASC) populations can be isolated from different areas within the dental tissue, which, due to their cellular and molecular characteristics, could give rise to different outcomes when used in potential applications. In this study, we performed a high-throughput molecular comparison of two primary human adult dental stem cell (hADSC) sub-populations: Stem Cells from Human Exfoliated Deciduous Teeth (SHEDs) and Periodontal Ligament Stem Cells (PDLSCs). A detailed proteomic mapping of SHEDs and PDLSCs, via employment of nano-LC tandem-mass spectrometry (MS/MS) revealed 2032 identified proteins in SHEDs and 3235 in PDLSCs. In total, 1516 proteins were expressed in both populations, while 517 were unique for SHEDs and 1721 were exclusively expressed in PDLSCs. Further analysis of the recorded proteins suggested that SHEDs predominantly expressed molecules that are involved in organizing the cytoskeletal network, cellular migration and adhesion, whereas PDLSCs are highly energy-producing cells, vastly expressing proteins that are implicated in various aspects of cell metabolism and proliferation. Applying the Rho-GDI signaling pathway as a paradigm, we propose potential biomarkers for SHEDs and for PDLSCs, reflecting their unique features, properties and engaged molecular pathways.

  15. A High-Resolution Proteomic Landscaping of Primary Human Dental Stem Cells: Identification of SHED- and PDLSC-Specific Biomarkers

    PubMed Central

    Taraslia, Vasiliki; Lymperi, Stefania; Pantazopoulou, Vasiliki; Anagnostopoulos, Athanasios K.; Basdra, Efthimia K.; Bei, Marianna; Kontakiotis, Evangelos G.; Tsangaris, George Th.; Stravopodis, Dimitrios J.; Anastasiadou, Ema

    2018-01-01

    Dental stem cells (DSCs) have emerged as a promising tool for basic research and clinical practice. A variety of adult stem cell (ASC) populations can be isolated from different areas within the dental tissue, which, due to their cellular and molecular characteristics, could give rise to different outcomes when used in potential applications. In this study, we performed a high-throughput molecular comparison of two primary human adult dental stem cell (hADSC) sub-populations: Stem Cells from Human Exfoliated Deciduous Teeth (SHEDs) and Periodontal Ligament Stem Cells (PDLSCs). A detailed proteomic mapping of SHEDs and PDLSCs, via employment of nano-LC tandem-mass spectrometry (MS/MS) revealed 2032 identified proteins in SHEDs and 3235 in PDLSCs. In total, 1516 proteins were expressed in both populations, while 517 were unique for SHEDs and 1721 were exclusively expressed in PDLSCs. Further analysis of the recorded proteins suggested that SHEDs predominantly expressed molecules that are involved in organizing the cytoskeletal network, cellular migration and adhesion, whereas PDLSCs are highly energy-producing cells, vastly expressing proteins that are implicated in various aspects of cell metabolism and proliferation. Applying the Rho-GDI signaling pathway as a paradigm, we propose potential biomarkers for SHEDs and for PDLSCs, reflecting their unique features, properties and engaged molecular pathways. PMID:29304003

  16. Advanced Mass Spectrometric Methods for the Rapid and Quantitative Characterization of Proteomes

    DOE PAGES

    Smith, Richard D.

    2002-01-01

    Progress is reviewedmore » towards the development of a global strategy that aims to extend the sensitivity, dynamic range, comprehensiveness and throughput of proteomic measurements based upon the use of high performance separations and mass spectrometry. The approach uses high accuracy mass measurements from Fourier transform ion cyclotron resonance mass spectrometry (FTICR) to validate peptide ‘accurate mass tags’ (AMTs) produced by global protein enzymatic digestions for a specific organism, tissue or cell type from ‘potential mass tags’ tentatively identified using conventional tandem mass spectrometry (MS/MS). This provides the basis for subsequent measurements without the need for MS/ MS. High resolution capillary liquid chromatography separations combined with high sensitivity, and high resolution accurate FTICR measurements are shown to be capable of characterizing peptide mixtures of more than 10 5 components. The strategy has been initially demonstrated using the microorganisms Saccharomyces cerevisiae and Deinococcus radiodurans. Advantages of the approach include the high confidence of protein identification, its broad proteome coverage, high sensitivity, and the capability for stableisotope labeling methods for precise relative protein abundance measurements. Abbreviations : LC, liquid chromatography; FTICR, Fourier transform ion cyclotron resonance; AMT, accurate mass tag; PMT, potential mass tag; MMA, mass measurement accuracy; MS, mass spectrometry; MS/MS, tandem mass spectrometry; ppm, parts per million.« less

  17. A high-throughput semi-automated preparation for filtered synaptoneurosomes.

    PubMed

    Murphy, Kathryn M; Balsor, Justin; Beshara, Simon; Siu, Caitlin; Pinto, Joshua G A

    2014-09-30

    Synaptoneurosomes have become an important tool for studying synaptic proteins. The filtered synaptoneurosomes preparation originally developed by Hollingsworth et al. (1985) is widely used and is an easy method to prepare synaptoneurosomes. The hand processing steps in that preparation, however, are labor intensive and have become a bottleneck for current proteomic studies using synaptoneurosomes. For this reason, we developed new steps for tissue homogenization and filtration that transform the preparation of synaptoneurosomes to a high-throughput, semi-automated process. We implemented a standardized protocol with easy to follow steps for homogenizing multiple samples simultaneously using a FastPrep tissue homogenizer (MP Biomedicals, LLC) and then filtering all of the samples in centrifugal filter units (EMD Millipore, Corp). The new steps dramatically reduce the time to prepare synaptoneurosomes from hours to minutes, increase sample recovery, and nearly double enrichment for synaptic proteins. These steps are also compatible with biosafety requirements for working with pathogen infected brain tissue. The new high-throughput semi-automated steps to prepare synaptoneurosomes are timely technical advances for studies of low abundance synaptic proteins in valuable tissue samples. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. Heat-Responsive Photosynthetic and Signaling Pathways in Plants: Insight from Proteomics.

    PubMed

    Wang, Xiaoli; Xu, Chenxi; Cai, Xiaofeng; Wang, Quanhua; Dai, Shaojun

    2017-10-20

    Heat stress is a major abiotic stress posing a serious threat to plants. Heat-responsive mechanisms in plants are complicated and fine-tuned. Heat signaling transduction and photosynthesis are highly sensitive. Therefore, a thorough understanding of the molecular mechanism in heat stressed-signaling transduction and photosynthesis is necessary to protect crop yield. Current high-throughput proteomics investigations provide more useful information for underlying heat-responsive signaling pathways and photosynthesis modulation in plants. Several signaling components, such as guanosine triphosphate (GTP)-binding protein, nucleoside diphosphate kinase, annexin, and brassinosteroid-insensitive I-kinase domain interacting protein 114, were proposed to be important in heat signaling transduction. Moreover, diverse protein patterns of photosynthetic proteins imply that the modulations of stomatal CO₂ exchange, photosystem II, Calvin cycle, ATP synthesis, and chlorophyll biosynthesis are crucial for plant heat tolerance.

  19. A knowledge-based T2-statistic to perform pathway analysis for quantitative proteomic data

    PubMed Central

    Chen, Yi-Hau

    2017-01-01

    Approaches to identify significant pathways from high-throughput quantitative data have been developed in recent years. Still, the analysis of proteomic data stays difficult because of limited sample size. This limitation also leads to the practice of using a competitive null as common approach; which fundamentally implies genes or proteins as independent units. The independent assumption ignores the associations among biomolecules with similar functions or cellular localization, as well as the interactions among them manifested as changes in expression ratios. Consequently, these methods often underestimate the associations among biomolecules and cause false positives in practice. Some studies incorporate the sample covariance matrix into the calculation to address this issue. However, sample covariance may not be a precise estimation if the sample size is very limited, which is usually the case for the data produced by mass spectrometry. In this study, we introduce a multivariate test under a self-contained null to perform pathway analysis for quantitative proteomic data. The covariance matrix used in the test statistic is constructed by the confidence scores retrieved from the STRING database or the HitPredict database. We also design an integrating procedure to retain pathways of sufficient evidence as a pathway group. The performance of the proposed T2-statistic is demonstrated using five published experimental datasets: the T-cell activation, the cAMP/PKA signaling, the myoblast differentiation, and the effect of dasatinib on the BCR-ABL pathway are proteomic datasets produced by mass spectrometry; and the protective effect of myocilin via the MAPK signaling pathway is a gene expression dataset of limited sample size. Compared with other popular statistics, the proposed T2-statistic yields more accurate descriptions in agreement with the discussion of the original publication. We implemented the T2-statistic into an R package T2GA, which is available at https://github.com/roqe/T2GA. PMID:28622336

  20. A knowledge-based T2-statistic to perform pathway analysis for quantitative proteomic data.

    PubMed

    Lai, En-Yu; Chen, Yi-Hau; Wu, Kun-Pin

    2017-06-01

    Approaches to identify significant pathways from high-throughput quantitative data have been developed in recent years. Still, the analysis of proteomic data stays difficult because of limited sample size. This limitation also leads to the practice of using a competitive null as common approach; which fundamentally implies genes or proteins as independent units. The independent assumption ignores the associations among biomolecules with similar functions or cellular localization, as well as the interactions among them manifested as changes in expression ratios. Consequently, these methods often underestimate the associations among biomolecules and cause false positives in practice. Some studies incorporate the sample covariance matrix into the calculation to address this issue. However, sample covariance may not be a precise estimation if the sample size is very limited, which is usually the case for the data produced by mass spectrometry. In this study, we introduce a multivariate test under a self-contained null to perform pathway analysis for quantitative proteomic data. The covariance matrix used in the test statistic is constructed by the confidence scores retrieved from the STRING database or the HitPredict database. We also design an integrating procedure to retain pathways of sufficient evidence as a pathway group. The performance of the proposed T2-statistic is demonstrated using five published experimental datasets: the T-cell activation, the cAMP/PKA signaling, the myoblast differentiation, and the effect of dasatinib on the BCR-ABL pathway are proteomic datasets produced by mass spectrometry; and the protective effect of myocilin via the MAPK signaling pathway is a gene expression dataset of limited sample size. Compared with other popular statistics, the proposed T2-statistic yields more accurate descriptions in agreement with the discussion of the original publication. We implemented the T2-statistic into an R package T2GA, which is available at https://github.com/roqe/T2GA.

  1. Towards High-throughput Immunomics for Infectious Diseases: Use of Next-generation Peptide Microarrays for Rapid Discovery and Mapping of Antigenic Determinants*

    PubMed Central

    Carmona, Santiago J.; Nielsen, Morten; Schafer-Nielsen, Claus; Mucci, Juan; Altcheh, Jaime; Balouz, Virginia; Tekiel, Valeria; Frasch, Alberto C.; Campetella, Oscar; Buscaglia, Carlos A.; Agüero, Fernán

    2015-01-01

    Complete characterization of antibody specificities associated to natural infections is expected to provide a rich source of serologic biomarkers with potential applications in molecular diagnosis, follow-up of chemotherapeutic treatments, and prioritization of targets for vaccine development. Here, we developed a highly-multiplexed platform based on next-generation high-density peptide microarrays to map these specificities in Chagas Disease, an exemplar of a human infectious disease caused by the protozoan Trypanosoma cruzi. We designed a high-density peptide microarray containing more than 175,000 overlapping 15mer peptides derived from T. cruzi proteins. Peptides were synthesized in situ on microarray slides, spanning the complete length of 457 parasite proteins with fully overlapped 15mers (1 residue shift). Screening of these slides with antibodies purified from infected patients and healthy donors demonstrated both a high technical reproducibility as well as epitope mapping consistency when compared with earlier low-throughput technologies. Using a conservative signal threshold to classify positive (reactive) peptides we identified 2,031 disease-specific peptides and 97 novel parasite antigens, effectively doubling the number of known antigens and providing a 10-fold increase in the number of fine mapped antigenic determinants for this disease. Finally, further analysis of the chip data showed that optimizing the amount of sequence overlap of displayed peptides can increase the protein space covered in a single chip by at least ∼threefold without sacrificing sensitivity. In conclusion, we show the power of high-density peptide chips for the discovery of pathogen-specific linear B-cell epitopes from clinical samples, thus setting the stage for high-throughput biomarker discovery screenings and proteome-wide studies of immune responses against pathogens. PMID:25922409

  2. Towards High-throughput Immunomics for Infectious Diseases: Use of Next-generation Peptide Microarrays for Rapid Discovery and Mapping of Antigenic Determinants.

    PubMed

    Carmona, Santiago J; Nielsen, Morten; Schafer-Nielsen, Claus; Mucci, Juan; Altcheh, Jaime; Balouz, Virginia; Tekiel, Valeria; Frasch, Alberto C; Campetella, Oscar; Buscaglia, Carlos A; Agüero, Fernán

    2015-07-01

    Complete characterization of antibody specificities associated to natural infections is expected to provide a rich source of serologic biomarkers with potential applications in molecular diagnosis, follow-up of chemotherapeutic treatments, and prioritization of targets for vaccine development. Here, we developed a highly-multiplexed platform based on next-generation high-density peptide microarrays to map these specificities in Chagas Disease, an exemplar of a human infectious disease caused by the protozoan Trypanosoma cruzi. We designed a high-density peptide microarray containing more than 175,000 overlapping 15 mer peptides derived from T. cruzi proteins. Peptides were synthesized in situ on microarray slides, spanning the complete length of 457 parasite proteins with fully overlapped 15 mers (1 residue shift). Screening of these slides with antibodies purified from infected patients and healthy donors demonstrated both a high technical reproducibility as well as epitope mapping consistency when compared with earlier low-throughput technologies. Using a conservative signal threshold to classify positive (reactive) peptides we identified 2,031 disease-specific peptides and 97 novel parasite antigens, effectively doubling the number of known antigens and providing a 10-fold increase in the number of fine mapped antigenic determinants for this disease. Finally, further analysis of the chip data showed that optimizing the amount of sequence overlap of displayed peptides can increase the protein space covered in a single chip by at least ∼ threefold without sacrificing sensitivity. In conclusion, we show the power of high-density peptide chips for the discovery of pathogen-specific linear B-cell epitopes from clinical samples, thus setting the stage for high-throughput biomarker discovery screenings and proteome-wide studies of immune responses against pathogens. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  3. Study design in high-dimensional classification analysis.

    PubMed

    Sánchez, Brisa N; Wu, Meihua; Song, Peter X K; Wang, Wen

    2016-10-01

    Advances in high throughput technology have accelerated the use of hundreds to millions of biomarkers to construct classifiers that partition patients into different clinical conditions. Prior to classifier development in actual studies, a critical need is to determine the sample size required to reach a specified classification precision. We develop a systematic approach for sample size determination in high-dimensional (large [Formula: see text] small [Formula: see text]) classification analysis. Our method utilizes the probability of correct classification (PCC) as the optimization objective function and incorporates the higher criticism thresholding procedure for classifier development. Further, we derive the theoretical bound of maximal PCC gain from feature augmentation (e.g. when molecular and clinical predictors are combined in classifier development). Our methods are motivated and illustrated by a study using proteomics markers to classify post-kidney transplantation patients into stable and rejecting classes. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  4. Systems-wide analysis of manganese deficiency-induced changes in gene activity of Arabidopsis roots

    PubMed Central

    Rodríguez-Celma, Jorge; Tsai, Yi-Hsiu; Wen, Tuan-Nan; Wu, Yu-Ching; Curie, Catherine; Schmidt, Wolfgang

    2016-01-01

    Manganese (Mn) is pivotal for plant growth and development, but little information is available regarding the strategies that evolved to improve Mn acquisition and cellular homeostasis of Mn. Using an integrated RNA-based transcriptomic and high-throughput shotgun proteomics approach, we generated a comprehensive inventory of transcripts and proteins that showed altered abundance in response to Mn deficiency in roots of the model plant Arabidopsis. A suite of 22,385 transcripts was consistently detected in three RNA-seq runs; LC-MS/MS-based iTRAQ proteomics allowed the unambiguous determination of 11,606 proteins. While high concordance between mRNA and protein expression (R = 0.87) was observed for transcript/protein pairs in which both gene products accumulated differentially upon Mn deficiency, only approximately 10% of the total alterations in the abundance of proteins could be attributed to transcription, indicating a large impact of protein-level regulation. Differentially expressed genes spanned a wide range of biological functions, including the maturation, translation, and transport of mRNAs, as well as primary and secondary metabolic processes. Metabolic analysis by UPLC-qTOF-MS revealed that the steady-state levels of several major glucosinolates were significantly altered upon Mn deficiency in both roots and leaves, possibly as a compensation for increased pathogen susceptibility under conditions of Mn deficiency. PMID:27804982

  5. Efficient Analysis of Mass Spectrometry Data Using the Isotope Wavelet

    NASA Astrophysics Data System (ADS)

    Hussong, Rene; Tholey, Andreas; Hildebrandt, Andreas

    2007-09-01

    Mass spectrometry (MS) has become today's de-facto standard for high-throughput analysis in proteomics research. Its applications range from toxicity analysis to MS-based diagnostics. Often, the time spent on the MS experiment itself is significantly less than the time necessary to interpret the measured signals, since the amount of data can easily exceed several gigabytes. In addition, automated analysis is hampered by baseline artifacts, chemical as well as electrical noise, and an irregular spacing of data points. Thus, filtering techniques originating from signal and image analysis are commonly employed to address these problems. Unfortunately, smoothing, base-line reduction, and in particular a resampling of data points can affect important characteristics of the experimental signal. To overcome these problems, we propose a new family of wavelet functions based on the isotope wavelet, which is hand-tailored for the analysis of mass spectrometry data. The resulting technique is theoretically well-founded and compares very well with standard peak picking tools, since it is highly robust against noise spoiling the data, but at the same time sufficiently sensitive to detect even low-abundant peptides.

  6. A Robust Two-Dimensional Separation of Intact Proteins for Bottom-Up Tandem Mass Spectrometry of the Human CSF Proteome

    PubMed Central

    Bora, Adriana; Anderson, Carol; Bachani, Muznabanu; Nath, Avindra; Cotter, Robert J.

    2012-01-01

    The cerebrospinal fluid (CSF) is produced in the brain by cells in the choroid plexus at a rate of 500mL/day. It is the only body fluid in direct contact with the brain. Thus, any changes in the CSF composition will reflect pathological processes and make CSF a potential source of biomarkers for different disease states. Proteomics offers a comprehensive view of the proteins found in CSF. In this study, we use a recently developed non-gel based method of sample preparation of CSF followed by liquid chromatography high accuracy mass spectrometry (LC-MS) for MS and MS/MS analyses, allowing unambiguous identification of peptides/proteins. Gel-eluted liquid fraction entrapment electrophoresis (Gelfree) is used to separate a CSF complex protein mixture in 12 user-selectable liquid-phase molecular weight fractions. Using this high throughput workflow we have been able to separate CSF intact proteins over a broad mass range 3.5 kDa-100 kDa with high resolution between 15 kDa and 100 kDa in 2 hours and 40 min. We have completely eliminated albumin and were able to interrogate the low abundance CSF proteins in a highly reproducible manner from different CSF samples in the same time. Using LC-MS as a downstream analysis, we identified 368 proteins using MidiTrap G-10 desalting columns and 166 proteins (including 57 unique proteins) using Zeba spin columns with 5% false discovery rate (FDR). Prostaglandin D2 synthase, Chromogranin A, Apolipoprotein E, Chromogranin B, Secretogranin III, Cystatin C, VGF nerve growth factor, Cadherin 2 are a few of the proteins that were characterized. The Gelfree-LC-MS is a robust method for the analysis of the human proteome that we will use to develop biomarkers for several neurodegenerative diseases and to quantitate these markers using multiple reaction monitoring. PMID:22537003

  7. innoFSPEC: fiber optical spectroscopy and sensing

    NASA Astrophysics Data System (ADS)

    Roth, Martin M.; Löhmannsröben, Hans-Gerd; Kelz, Andreas; Kumke, Michael

    2008-07-01

    innoFSPEC Potsdam is presently being established as in interdisciplinary innovation center for fiber-optical spectroscopy and sensing, hosted by Astrophysikalisches Institut Potsdam and the Physical Chemistry group of Potsdam University, Germany. The center focuses on fundamental research in the two fields of fiber-coupled multi-channel spectroscopy and optical fiber-based sensing. Thanks to its interdisciplinary approach, the complementary methodologies of astrophysics on the one hand, and physical chemistry on the other hand, are expected to spawn synergies that otherwise would not normally become available in more standard research programmes. innoFSPEC targets future innovations for next generation astrophysical instrumentation, environmental analysis, manufacturing control and process monitoring, medical diagnostics, non-invasive imaging spectroscopy, biopsy, genomics/proteomics, high-throughput screening, and related applications.

  8. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.

    PubMed

    Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi

    2007-10-04

    In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.

  9. Integrated Proteomic and Transcriptomic-Based Approaches to Identifying Signature Biomarkers and Pathways for Elucidation of Daoy and UW228 Subtypes.

    PubMed

    Higdon, Roger; Kala, Jessie; Wilkins, Devan; Yan, Julia Fangfei; Sethi, Manveen K; Lin, Liang; Liu, Siqi; Montague, Elizabeth; Janko, Imre; Choiniere, John; Kolker, Natali; Hancock, William S; Kolker, Eugene; Fanayan, Susan

    2017-02-03

    Medulloblastoma (MB) is the most common malignant pediatric brain tumor. Patient survival has remained largely the same for the past 20 years, with therapies causing significant health, cognitive, behavioral and developmental complications for those who survive the tumor. In this study, we profiled the total transcriptome and proteome of two established MB cell lines, Daoy and UW228, using high-throughput RNA sequencing (RNA-Seq) and label-free nano-LC-MS/MS-based quantitative proteomics, coupled with advanced pathway analysis. While Daoy has been suggested to belong to the sonic hedgehog (SHH) subtype, the exact UW228 subtype is not yet clearly established. Thus, a goal of this study was to identify protein markers and pathways that would help elucidate their subtype classification. A number of differentially expressed genes and proteins, including a number of adhesion, cytoskeletal and signaling molecules, were observed between the two cell lines. While several cancer-associated genes/proteins exhibited similar expression across the two cell lines, upregulation of a number of signature proteins and enrichment of key components of SHH and WNT signaling pathways were uniquely observed in Daoy and UW228, respectively. The novel information on differentially expressed genes/proteins and enriched pathways provide insights into the biology of MB, which could help elucidate their subtype classification.

  10. Proteomic data from human cell cultures refine mechanisms of chaperone-mediated protein homeostasis.

    PubMed

    Finka, Andrija; Goloubinoff, Pierre

    2013-09-01

    In the crowded environment of human cells, folding of nascent polypeptides and refolding of stress-unfolded proteins is error prone. Accumulation of cytotoxic misfolded and aggregated species may cause cell death, tissue loss, degenerative conformational diseases, and aging. Nevertheless, young cells effectively express a network of molecular chaperones and folding enzymes, termed here "the chaperome," which can prevent formation of potentially harmful misfolded protein conformers and use the energy of adenosine triphosphate (ATP) to rehabilitate already formed toxic aggregates into native functional proteins. In an attempt to extend knowledge of chaperome mechanisms in cellular proteostasis, we performed a meta-analysis of human chaperome using high-throughput proteomic data from 11 immortalized human cell lines. Chaperome polypeptides were about 10% of total protein mass of human cells, half of which were Hsp90s and Hsp70s. Knowledge of cellular concentrations and ratios among chaperome polypeptides provided a novel basis to understand mechanisms by which the Hsp60, Hsp70, Hsp90, and small heat shock proteins (HSPs), in collaboration with cochaperones and folding enzymes, assist de novo protein folding, import polypeptides into organelles, unfold stress-destabilized toxic conformers, and control the conformal activity of native proteins in the crowded environment of the cell. Proteomic data also provided means to distinguish between stable components of chaperone core machineries and dynamic regulatory cochaperones.

  11. Evaluation of analytical performance and reliability of direct nanoLC-nanoESI-high resolution mass spectrometry for profiling the (xeno)metabolome.

    PubMed

    Chetwynd, Andrew J; David, Arthur; Hill, Elizabeth M; Abdul-Sada, Alaa

    2014-10-01

    Mass spectrometry (MS) profiling techniques are used for analysing metabolites and xenobiotics in biofluids; however, detection of low abundance compounds using conventional MS techniques is poor. To counter this, nanoflow ultra-high-pressure liquid chromatography-nanoelectrospray ionization-time-of-flight MS (nUHPLC-nESI-TOFMS), which has been used primarily for proteomics, offers an innovative prospect for profiling small molecules. Compared to conventional UHPLC-ESI-TOFMS, nUHPLC-nESI-TOFMS enhanced detection limits of a variety of (xeno)metabolites by between 2 and 2000-fold. In addition, this study demonstrates for the first time excellent repeatability and reproducibility for analysis of urine and plasma samples using nUHPLC-nESI-TOFMS, supporting implementation of this platform as a novel approach for high-throughput (xeno)metabolomics. Copyright © 2014 John Wiley & Sons, Ltd.

  12. High-resolution Antibody Array Analysis of Childhood Acute Leukemia Cells*

    PubMed Central

    Kanderova, Veronika; Kuzilkova, Daniela; Stuchly, Jan; Vaskova, Martina; Brdicka, Tomas; Fiser, Karel; Hrusak, Ondrej; Lund-Johansen, Fridtjof

    2016-01-01

    Acute leukemia is a disease pathologically manifested at both genomic and proteomic levels. Molecular genetic technologies are currently widely used in clinical research. In contrast, sensitive and high-throughput proteomic techniques for performing protein analyses in patient samples are still lacking. Here, we used a technology based on size exclusion chromatography followed by immunoprecipitation of target proteins with an antibody bead array (Size Exclusion Chromatography-Microsphere-based Affinity Proteomics, SEC-MAP) to detect hundreds of proteins from a single sample. In addition, we developed semi-automatic bioinformatics tools to adapt this technology for high-content proteomic screening of pediatric acute leukemia patients. To confirm the utility of SEC-MAP in leukemia immunophenotyping, we tested 31 leukemia diagnostic markers in parallel by SEC-MAP and flow cytometry. We identified 28 antibodies suitable for both techniques. Eighteen of them provided excellent quantitative correlation between SEC-MAP and flow cytometry (p < 0.05). Next, SEC-MAP was applied to examine 57 diagnostic samples from patients with acute leukemia. In this assay, we used 632 different antibodies and detected 501 targets. Of those, 47 targets were differentially expressed between at least two of the three acute leukemia subgroups. The CD markers correlated with immunophenotypic categories as expected. From non-CD markers, we found DBN1, PAX5, or PTK2 overexpressed in B-cell precursor acute lymphoblastic leukemias, LAT, SH2D1A, or STAT5A overexpressed in T-cell acute lymphoblastic leukemias, and HCK, GLUD1, or SYK overexpressed in acute myeloid leukemias. In addition, OPAL1 overexpression corresponded to ETV6-RUNX1 chromosomal translocation. In summary, we demonstrated that SEC-MAP technology is a powerful tool for detecting hundreds of proteins in clinical samples obtained from pediatric acute leukemia patients. It provides information about protein size and reveals differences in protein expression between particular leukemia subgroups. Forty-seven of SEC-MAP identified targets were validated by other conventional method in this study. PMID:26785729

  13. Common bean proteomics: Present status and future strategies.

    PubMed

    Zargar, Sajad Majeed; Mahajan, Reetika; Nazir, Muslima; Nagar, Preeti; Kim, Sun Tae; Rai, Vandna; Masi, Antonio; Ahmad, Syed Mudasir; Shah, Riaz Ahmad; Ganai, Nazir Ahmad; Agrawal, Ganesh K; Rakwal, Randeep

    2017-10-03

    Common bean (Phaseolus vulgaris L.) is a legume of appreciable importance and usefulness worldwide to the human population providing food and feed. It is rich in high-quality protein, energy, fiber and micronutrients especially iron, zinc, and pro-vitamin A; and possesses potentially disease-preventing and health-promoting compounds. The recently published genome sequence of common bean is an important landmark in common bean research, opening new avenues for understanding its genetics in depth. This legume crop is affected by diverse biotic and abiotic stresses severely limiting its productivity. Looking at the trend of increasing world population and the need for food crops best suited to the health of humankind, the legumes will be in great demand, including the common bean mostly for its nutritive values. Hence the need for new research in understanding the biology of this crop brings us to utilize and apply high-throughput omics approaches. In this mini-review our focus will be on the need for proteomics studies in common bean, potential of proteomics for understanding genetic regulation under abiotic and biotic stresses and how proteogenomics will lead to nutritional improvement. We will also discuss future proteomics-based strategies that must be adopted to mine new genomic resources by identifying molecular switches regulating various biological processes. Common bean is regarded as "grain of hope" for the poor, being rich in high-quality protein, energy, fiber and micronutrients (iron, zinc, pro-vitamin A); and possesses potentially disease-preventing and health-promoting compounds. Increasing world population and the need for food crops best suited to the health of humankind, puts legumes into great demand, which includes the common bean mostly. An important landmark in common bean research was the recent publication of its genome sequence, opening new avenues for understanding its genetics in depth. This legume crop is affected by diverse biotic and abiotic stresses severely limiting its productivity. Therefore, the need for new research in understanding the biology of this crop brings us to utilize and apply high-throughput omics approaches. Proteomics can be used to track all the candidate proteins/genes responsible for a biological process under specific conditions in a particular tissue. The potential of proteomics will not only help in determining the functions of a large number of genes in a single experiment but will also be a useful tool to mine new genes that can provide solution to various problems (abiotic stress, biotic stress, nutritional improvement, etc). We believe that a combined approach including breeding along with omics tools will lead towards attaining sustainability in legumes, including common bean. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Domain selection combined with improved cloning strategy for high throughput expression of higher eukaryotic proteins

    PubMed Central

    Chen, Yunjia; Qiu, Shihong; Luan, Chi-Hao; Luo, Ming

    2007-01-01

    Background Expression of higher eukaryotic genes as soluble, stable recombinant proteins is still a bottleneck step in biochemical and structural studies of novel proteins today. Correct identification of stable domains/fragments within the open reading frame (ORF), combined with proper cloning strategies, can greatly enhance the success rate when higher eukaryotic proteins are expressed as these domains/fragments. Furthermore, a HTP cloning pipeline incorporated with bioinformatics domain/fragment selection methods will be beneficial to studies of structure and function genomics/proteomics. Results With bioinformatics tools, we developed a domain/domain boundary prediction (DDBP) method, which was trained by available experimental data. Combined with an improved cloning strategy, DDBP had been applied to 57 proteins from C. elegans. Expression and purification results showed there was a 10-fold increase in terms of obtaining purified proteins. Based on the DDBP method, the improved GATEWAY cloning strategy and a robotic platform, we constructed a high throughput (HTP) cloning pipeline, including PCR primer design, PCR, BP reaction, transformation, plating, colony picking and entry clones extraction, which have been successfully applied to 90 C. elegans genes, 88 Brucella genes, and 188 human genes. More than 97% of the targeted genes were obtained as entry clones. This pipeline has a modular design and can adopt different operations for a variety of cloning/expression strategies. Conclusion The DDBP method and improved cloning strategy were satisfactory. The cloning pipeline, combined with our recombinant protein HTP expression pipeline and the crystal screening robots, constitutes a complete platform for structure genomics/proteomics. This platform will increase the success rate of purification and crystallization dramatically and promote the further advancement of structure genomics/proteomics. PMID:17663785

  15. Proteomic Analyses of the Unexplored Sea Anemone Bunodactis verrucosa

    PubMed Central

    Campos, Alexandre; Turkina, Maria V.; Ribeiro, Tiago; Osorio, Hugo; Vasconcelos, Vítor; Antunes, Agostinho

    2018-01-01

    Cnidarian toxic products, particularly peptide toxins, constitute a promising target for biomedicine research. Indeed, cnidarians are considered as the largest phylum of generally toxic animals. However, research on peptides and toxins of sea anemones is still limited. Moreover, most of the toxins from sea anemones have been discovered by classical purification approaches. Recently, high-throughput methodologies have been used for this purpose but in other Phyla. Hence, the present work was focused on the proteomic analyses of whole-body extract from the unexplored sea anemone Bunodactis verrucosa. The proteomic analyses applied were based on two methods: two-dimensional gel electrophoresis combined with MALDI-TOF/TOF and shotgun proteomic approach. In total, 413 proteins were identified, but only eight proteins were identified from gel-based analyses. Such proteins are mainly involved in basal metabolism and biosynthesis of antibiotics as the most relevant pathways. In addition, some putative toxins including metalloproteinases and neurotoxins were also identified. These findings reinforce the significance of the production of antimicrobial compounds and toxins by sea anemones, which play a significant role in defense and feeding. In general, the present study provides the first proteome map of the sea anemone B. verrucosa stablishing a reference for future studies in the discovery of new compounds. PMID:29364843

  16. Proteomic Analyses of the Unexplored Sea Anemone Bunodactis verrucosa.

    PubMed

    Domínguez-Pérez, Dany; Campos, Alexandre; Alexei Rodríguez, Armando; Turkina, Maria V; Ribeiro, Tiago; Osorio, Hugo; Vasconcelos, Vítor; Antunes, Agostinho

    2018-01-24

    Cnidarian toxic products, particularly peptide toxins, constitute a promising target for biomedicine research. Indeed, cnidarians are considered as the largest phylum of generally toxic animals. However, research on peptides and toxins of sea anemones is still limited. Moreover, most of the toxins from sea anemones have been discovered by classical purification approaches. Recently, high-throughput methodologies have been used for this purpose but in other Phyla. Hence, the present work was focused on the proteomic analyses of whole-body extract from the unexplored sea anemone Bunodactis verrucosa . The proteomic analyses applied were based on two methods: two-dimensional gel electrophoresis combined with MALDI-TOF/TOF and shotgun proteomic approach. In total, 413 proteins were identified, but only eight proteins were identified from gel-based analyses. Such proteins are mainly involved in basal metabolism and biosynthesis of antibiotics as the most relevant pathways. In addition, some putative toxins including metalloproteinases and neurotoxins were also identified. These findings reinforce the significance of the production of antimicrobial compounds and toxins by sea anemones, which play a significant role in defense and feeding. In general, the present study provides the first proteome map of the sea anemone B. verrucosa stablishing a reference for future studies in the discovery of new compounds.

  17. Reprint of "pFind-Alioth: A novel unrestricted database search algorithm to improve the interpretation of high-resolution MS/MS data".

    PubMed

    Chi, Hao; He, Kun; Yang, Bing; Chen, Zhen; Sun, Rui-Xiang; Fan, Sheng-Bo; Zhang, Kun; Liu, Chao; Yuan, Zuo-Fei; Wang, Quan-Hui; Liu, Si-Qi; Dong, Meng-Qiu; He, Si-Min

    2015-11-03

    Database search is the dominant approach in high-throughput proteomic analysis. However, the interpretation rate of MS/MS spectra is very low in such a restricted mode, which is mainly due to unexpected modifications and irregular digestion types. In this study, we developed a new algorithm called Alioth, to be integrated into the search engine of pFind, for fast and accurate unrestricted database search on high-resolution MS/MS data. An ion index is constructed for both peptide precursors and fragment ions, by which arbitrary digestions and a single site of any modifications and mutations can be searched efficiently. A new re-ranking algorithm is used to distinguish the correct peptide-spectrum matches from random ones. The algorithm is tested on several HCD datasets and the interpretation rate of MS/MS spectra using Alioth is as high as 60%-80%. Peptides from semi- and non-specific digestions, as well as those with unexpected modifications or mutations, can be effectively identified using Alioth and confidently validated using other search engines. The average processing speed of Alioth is 5-10 times faster than some other unrestricted search engines and is comparable to or even faster than the restricted search algorithms tested.This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. An Efficient Semi-supervised Learning Approach to Predict SH2 Domain Mediated Interactions.

    PubMed

    Kundu, Kousik; Backofen, Rolf

    2017-01-01

    Src homology 2 (SH2) domain is an important subclass of modular protein domains that plays an indispensable role in several biological processes in eukaryotes. SH2 domains specifically bind to the phosphotyrosine residue of their binding peptides to facilitate various molecular functions. For determining the subtle binding specificities of SH2 domains, it is very important to understand the intriguing mechanisms by which these domains recognize their target peptides in a complex cellular environment. There are several attempts have been made to predict SH2-peptide interactions using high-throughput data. However, these high-throughput data are often affected by a low signal to noise ratio. Furthermore, the prediction methods have several additional shortcomings, such as linearity problem, high computational complexity, etc. Thus, computational identification of SH2-peptide interactions using high-throughput data remains challenging. Here, we propose a machine learning approach based on an efficient semi-supervised learning technique for the prediction of 51 SH2 domain mediated interactions in the human proteome. In our study, we have successfully employed several strategies to tackle the major problems in computational identification of SH2-peptide interactions.

  19. Detection of Biomarkers of Pathogenic Naegleria fowleri Through Mass Spectrometry and Proteomics

    PubMed Central

    Moura, Hercules; Izquierdo, Fernando; Woolfitt, Adrian R.; Wagner, Glauber; Pinto, Tatiana; del Aguila, Carmen; Barr, John R.

    2017-01-01

    Emerging methods based on mass spectrometry (MS) can be used in the rapid identification of microorganisms. Thus far, these practical and rapidly evolving methods have mainly been applied to characterize prokaryotes. We applied matrix-assisted laser-desorption-ionization-time-of-flight mass spectrometry MALDI-TOF MS in the analysis of whole cells of 18 N. fowleri isolates belonging to three genotypes. Fourteen originated from the cerebrospinal fluid or brain tissue of primary amoebic meningoencephalitis patients and four originated from water samples of hot springs, rivers, lakes or municipal water supplies. Whole Naegleria trophozoites grown in axenic cultures were washed and mixed with MALDI matrix. Mass spectra were acquired with a 4700 TOF-TOF instrument. MALDI-TOF MS yielded consistent patterns for all isolates examined. Using a combination of novel data processing methods for visual peak comparison, statistical analysis and proteomics database searching we were able to detect several biomarkers that can differentiate all species and isolates studied, along with common biomarkers for all N. fowleri isolates. Naegleria fowleri could be easily separated from other species within the genus Naegleria. A number of peaks detected were tentatively identified. MALDI-TOF MS fingerprinting is a rapid, reproducible, high-throughput alternative method for identifying Naegleria isolates. This method has potential for studying eukaryotic agents. PMID:25231600

  20. Comparative systems analysis of the secretome of the opportunistic pathogen Aspergillus fumigatus and other Aspergillus species.

    PubMed

    Vivek-Ananth, R P; Mohanraj, Karthikeyan; Vandanashree, Muralidharan; Jhingran, Anupam; Craig, James P; Samal, Areejit

    2018-04-26

    Aspergillus fumigatus and multiple other Aspergillus species cause a wide range of lung infections, collectively termed aspergillosis. Aspergilli are ubiquitous in environment with healthy immune systems routinely eliminating inhaled conidia, however, Aspergilli can become an opportunistic pathogen in immune-compromised patients. The aspergillosis mortality rate and emergence of drug-resistance reveals an urgent need to identify novel targets. Secreted and cell membrane proteins play a critical role in fungal-host interactions and pathogenesis. Using a computational pipeline integrating data from high-throughput experiments and bioinformatic predictions, we have identified secreted and cell membrane proteins in ten Aspergillus species known to cause aspergillosis. Small secreted and effector-like proteins similar to agents of fungal-plant pathogenesis were also identified within each secretome. A comparison with humans revealed that at least 70% of Aspergillus secretomes have no sequence similarity with the human proteome. An analysis of antigenic qualities of Aspergillus proteins revealed that the secretome is significantly more antigenic than cell membrane proteins or the complete proteome. Finally, overlaying an expression dataset, four A. fumigatus proteins upregulated during infection and with available structures, were found to be structurally similar to known drug target proteins in other organisms, and were able to dock in silico with the respective drug.

  1. MALDI-TOF mass spectrometry analysis of small molecular weight compounds (under 10 KDa) as biomarkers of rat hearts undergoing arecoline challenge.

    PubMed

    Chen, Tung-Sheng; Chang, Mu-Hsin; Kuo, Wei-Wen; Lin, Yueh-Min; Yeh, Yu-Lan; Day, Cecilia Hsuan; Lin, Chien-Chung; Tsai, Fuu-Jen; Tsai, Chang-Hai; Huang, Chih-Yang

    2013-04-01

    Statistical and clinical reports indicate that betel nut chewing is strongly associated with progression of oral cancer because some ingredients in betel nuts are potential cancer promoters, especially arecoline. Early diagnosis for cancer biomarkers is the best strategy for prevention of cancer progression. Several methods are suggested for investigating cancer biomarkers. Among these methods, gel-based proteomics approach is the most powerful and recommended tool for investigating biomarkers due to its high-throughput. However, this proteomics approach is not suitable for screening biomarkers with molecular weight under 10 KDa because of the characteristics of gel electrophoresis. This study investigated biomarkers with molecular weight under 10 KDa in rats with arecoline challenge. The centrifuging vials with membrane (10 KDa molecular weight cut-off) played a crucial role in this study. After centrifuging, the filtrate (containing compounds with molecular weight under 10 KDa) was collected and spotted on a sample plate for MALDI-TOF mass spectrometry analysis. Compared to control, three extra peaks (m/z values were 1553.1611, 1668.2097 and 1740.1832, respectively) were found in sera and two extra peaks were found in heart tissue samples (408.9719 and 524.9961, respectively). These small compounds should play important roles and may be potential biomarker candidates in rats with arecoline. This study successfully reports a mass-based method for investigating biomarker candidates with small molecular weight in different types of sample (including serum and tissue). In addition, this reported method is more time-efficient (1 working day) than gel-based proteomics approach (5~7 working days).

  2. MStern Blotting-High Throughput Polyvinylidene Fluoride (PVDF) Membrane-Based Proteomic Sample Preparation for 96-Well Plates.

    PubMed

    Berger, Sebastian T; Ahmed, Saima; Muntel, Jan; Cuevas Polo, Nerea; Bachur, Richard; Kentsis, Alex; Steen, Judith; Steen, Hanno

    2015-10-01

    We describe a 96-well plate compatible membrane-based proteomic sample processing method, which enables the complete processing of 96 samples (or multiples thereof) within a single workday. This method uses a large-pore hydrophobic PVDF membrane that efficiently adsorbs proteins, resulting in fast liquid transfer through the membrane and significantly reduced sample processing times. Low liquid transfer speeds have prevented the useful 96-well plate implementation of FASP as a widely used membrane-based proteomic sample processing method. We validated our approach on whole-cell lysate and urine and cerebrospinal fluid as clinically relevant body fluids. Without compromising peptide and protein identification, our method uses a vacuum manifold and circumvents the need for digest desalting, making our processing method compatible with standard liquid handling robots. In summary, our new method maintains the strengths of FASP and simultaneously overcomes one of the major limitations of FASP without compromising protein identification and quantification. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  3. Characterization of Medium Conditioned by Irradiated Cells Using Proteome-Wide, High-Throughput Mass Spectrometry

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Springer, David L.; Ahram, Mamoun; Adkins, Joshua N.

    Shedding, the release of cell surface proteins by regulated proteolysis, is a general cellular response to injury and is responsible for generating numerous bioactive molecules including growth factors and cytokines. The purpose of our work is to determine whether low doses of low-linear energy transfer (LET) radiation induce shedding of bioactive molecules. Using a mass spectrometry-based global proteomics method, we tested this hypothesis by analyzing for shed proteins in medium from irradiated human mammary epithelial cells (HMEC). Several hundred proteins were identified, including transforming growth factor beta (TGFB); however, no changes in protein abundances attributable to radiation exposure, based onmore » immunoblotting methods, were observed. These results demonstrate that our proteomic-based approach has the sensitivity to identify the kinds of proteins believed to be released after low-dose radiation exposure but that improvements in mass spectrometry-based protein quantification will be required to detect the small changes in abundance associated with this type of insult.« less

  4. MStern Blotting–High Throughput Polyvinylidene Fluoride (PVDF) Membrane-Based Proteomic Sample Preparation for 96-Well Plates*

    PubMed Central

    Berger, Sebastian T.; Ahmed, Saima; Muntel, Jan; Cuevas Polo, Nerea; Bachur, Richard; Kentsis, Alex; Steen, Judith; Steen, Hanno

    2015-01-01

    We describe a 96-well plate compatible membrane-based proteomic sample processing method, which enables the complete processing of 96 samples (or multiples thereof) within a single workday. This method uses a large-pore hydrophobic PVDF membrane that efficiently adsorbs proteins, resulting in fast liquid transfer through the membrane and significantly reduced sample processing times. Low liquid transfer speeds have prevented the useful 96-well plate implementation of FASP as a widely used membrane-based proteomic sample processing method. We validated our approach on whole-cell lysate and urine and cerebrospinal fluid as clinically relevant body fluids. Without compromising peptide and protein identification, our method uses a vacuum manifold and circumvents the need for digest desalting, making our processing method compatible with standard liquid handling robots. In summary, our new method maintains the strengths of FASP and simultaneously overcomes one of the major limitations of FASP without compromising protein identification and quantification. PMID:26223766

  5. Analysis of Proteins, Protein Complexes, and Organellar Proteomes Using Sheathless Capillary Zone Electrophoresis - Native Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Belov, Arseniy M.; Viner, Rosa; Santos, Marcia R.; Horn, David M.; Bern, Marshall; Karger, Barry L.; Ivanov, Alexander R.

    2017-12-01

    Native mass spectrometry (MS) is a rapidly advancing field in the analysis of proteins, protein complexes, and macromolecular species of various types. The majority of native MS experiments reported to-date has been conducted using direct infusion of purified analytes into a mass spectrometer. In this study, capillary zone electrophoresis (CZE) was coupled online to Orbitrap mass spectrometers using a commercial sheathless interface to enable high-performance separation, identification, and structural characterization of limited amounts of purified proteins and protein complexes, the latter with preserved non-covalent associations under native conditions. The performance of both bare-fused silica and polyacrylamide-coated capillaries was assessed using mixtures of protein standards known to form non-covalent protein-protein and protein-ligand complexes. High-efficiency separation of native complexes is demonstrated using both capillary types, while the polyacrylamide neutral-coated capillary showed better reproducibility and higher efficiency for more complex samples. The platform was then evaluated for the determination of monoclonal antibody aggregation and for analysis of proteomes of limited complexity using a ribosomal isolate from E. coli. Native CZE-MS, using accurate single stage and tandem-MS measurements, enabled identification of proteoforms and non-covalent complexes at femtomole levels. This study demonstrates that native CZE-MS can serve as an orthogonal and complementary technique to conventional native MS methodologies with the advantages of low sample consumption, minimal sample processing and losses, and high throughput and sensitivity. This study presents a novel platform for analysis of ribosomes and other macromolecular complexes and organelles, with the potential for discovery of novel structural features defining cellular phenotypes (e.g., specialized ribosomes). [Figure not available: see fulltext.

  6. Nucleic acid programmable protein array a just-in-time multiplexed protein expression and purification platform.

    PubMed

    Qiu, Ji; LaBaer, Joshua

    2011-01-01

    Systematic study of proteins requires the availability of thousands of proteins in functional format. However, traditional recombinant protein expression and purification methods have many drawbacks for such study at the proteome level. We have developed an innovative in situ protein expression and capture system, namely NAPPA (nucleic acid programmable protein array), where C-terminal tagged proteins are expressed using an in vitro expression system and efficiently captured/purified by antitag antibodies coprinted at each spot. The NAPPA technology presented in this chapter enable researchers to produce and display fresh proteins just in time in a multiplexed high-throughput fashion and utilize them for various downstream biochemical researches of interest. This platform could revolutionize the field of functional proteomics with it ability to produce thousands of spatially separated proteins in high density with narrow dynamic rand of protein concentrations, reproducibly and functionally. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. pyQms enables universal and accurate quantification of mass spectrometry data.

    PubMed

    Leufken, Johannes; Niehues, Anna; Sarin, L Peter; Wessel, Florian; Hippler, Michael; Leidel, Sebastian A; Fufezan, Christian

    2017-10-01

    Quantitative mass spectrometry (MS) is a key technique in many research areas (1), including proteomics, metabolomics, glycomics, and lipidomics. Because all of the corresponding molecules can be described by chemical formulas, universal quantification tools are highly desirable. Here, we present pyQms, an open-source software for accurate quantification of all types of molecules measurable by MS. pyQms uses isotope pattern matching that offers an accurate quality assessment of all quantifications and the ability to directly incorporate mass spectrometer accuracy. pyQms is, due to its universal design, applicable to every research field, labeling strategy, and acquisition technique. This opens ultimate flexibility for researchers to design experiments employing innovative and hitherto unexplored labeling strategies. Importantly, pyQms performs very well to accurately quantify partially labeled proteomes in large scale and high throughput, the most challenging task for a quantification algorithm. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  8. Large-Scale and Deep Quantitative Proteome Profiling Using Isobaric Labeling Coupled with Two-Dimensional LC-MS/MS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gritsenko, Marina A.; Xu, Zhe; Liu, Tao

    Comprehensive, quantitative information on abundances of proteins and their post-translational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labelling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification andmore » quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples, and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.« less

  9. Large-Scale and Deep Quantitative Proteome Profiling Using Isobaric Labeling Coupled with Two-Dimensional LC-MS/MS.

    PubMed

    Gritsenko, Marina A; Xu, Zhe; Liu, Tao; Smith, Richard D

    2016-01-01

    Comprehensive, quantitative information on abundances of proteins and their posttranslational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labeling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification and quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.

  10. Global Liver Proteome Analysis Using iTRAQ Reveals AMPK-mTOR-Autophagy Signaling Is Altered by Intrauterine Growth Restriction in Newborn Piglets.

    PubMed

    Long, Baisheng; Yin, Cong; Fan, Qiwen; Yan, Guokai; Wang, Zhichang; Li, Xiuzhi; Chen, Changqing; Yang, Xingya; Liu, Lu; Zheng, Zilong; Shi, Min; Yan, Xianghua

    2016-04-01

    Intrauterine growth restriction (IUGR) impairs fetal growth and development, perturbs nutrient metabolism, and increases the risk of developing diseases in postnatal life. However, the underlying mechanisms by which IUGR affects fetal liver development and metabolism remain incompletely understood. Here, we applied a high-throughput proteomics approach and biochemical analysis to investigate the impact of IUGR on the liver of newborn piglets. As a result, we identified 78 differentially expressed proteins in the three biological replicates, including 31 significantly up-regulated proteins and 47 significantly down-regulated proteins. Among them, a majority of differentially expressed proteins were related to nutrient metabolism and mitochondrial function. Additionally, many significantly down-regulated proteins participated in the mTOR signaling pathway and the phagosome maturation signaling pathway. Further analysis suggested that glucose concentration and hepatic glycogen storage were both reduced in IUGR newborn piglets, which may contribute to AMPK activation and mTORC1 inhibition. However, AMPK activation and mTORC1 inhibition failed to induce autophagy in the liver of IUGR neonatal pigs. A possible reason is that PP2Ac, a potential candidate in autophagy regulation, is significantly down-regulated in the liver of IUGR newborn piglets. These findings may provide implications for preventing and treating IUGR in human beings and domestic animals.

  11. Vemurafenib resistance signature by proteome analysis offers new strategies and rational therapeutic concepts.

    PubMed

    Paulitschke, Verena; Berger, Walter; Paulitschke, Philipp; Hofstätter, Elisabeth; Knapp, Bernhard; Dingelmaier-Hovorka, Ruth; Födinger, Dagmar; Jäger, Walter; Szekeres, Thomas; Meshcheryakova, Anastasia; Bileck, Andrea; Pirker, Christine; Pehamberger, Hubert; Gerner, Christopher; Kunstfeld, Rainer

    2015-03-01

    The FDA-approved BRAF inhibitor vemurafenib achieves outstanding clinical response rates in patients with melanoma, but early resistance is common. Understanding the pathologic mechanisms of drug resistance and identification of effective therapeutic alternatives are key scientific challenges in the melanoma setting. Using proteomic techniques, including shotgun analysis and 2D-gel electrophoresis, we identified a comprehensive signature of the vemurafenib-resistant M24met in comparison with the vemurafenib-sensitive A375 melanoma cell line. The resistant cells were characterized by loss of differentiation, induction of transformation, enhanced expression of the lysosomal compartment, increased potential for metastasis, migration, adherence and Ca2(+) ion binding, enhanced expression of the MAPK pathway and extracellular matrix proteins, and epithelial-mesenchymal transformation. The main features were verified by shotgun analysis with QEXACTIVE orbitrap MS, electron microscopy, lysosomal staining, Western blotting, and adherence assay in a VM-1 melanoma cell line with acquired vemurafenib resistance. On the basis of the resistance profile, we were able to successfully predict that a novel resveratrol-derived COX-2 inhibitor, M8, would be active against the vemurafenib-resistant but not the vemurafenib-sensitive melanoma cells. Using high-throughput methods for cell line and drug characterization may thus offer a new way to identify key features of vemurafenib resistance, facilitating the design of effective rational therapeutic alternatives. ©2015 American Association for Cancer Research.

  12. High Dynamic Range Characterization of the Trauma Patient Plasma Proteome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liu, Tao; Qian, Weijun; Gritsenko, Marina A.

    2006-06-08

    While human plasma represents an attractive sample for disease biomarker discovery, the extreme complexity and large dynamic range in protein concentrations present significant challenges for characterization, candidate biomarker discovery, and validation. Herein, we describe a strategy that combines immunoaffinity subtraction and chemical fractionation based on cysteinyl peptide and N-glycopeptide captures with 2D-LC-MS/MS to increase the dynamic range of analysis for plasma. Application of this ''divide-and-conquer'' strategy to trauma patient plasma significantly improved the overall dynamic range of detection and resulted in confident identification of 22,267 unique peptides from four different peptide populations (cysteinyl peptides, non-cysteinyl peptides, N-glycopeptides, and non-glycopeptides) thatmore » covered 3654 nonredundant proteins. Numerous low-abundance proteins were identified, exemplified by 78 ''classic'' cytokines and cytokine receptors and by 136 human cell differentiation molecules. Additionally, a total of 2910 different N-glycopeptides that correspond to 662 N-glycoproteins and 1553 N-glycosylation sites were identified. A panel of the proteins identified in this study is known to be involved in inflammation and immune responses. This study established an extensive reference protein database for trauma patients, which provides a foundation for future high-throughput quantitative plasma proteomic studies designed to elucidate the mechanisms that underlie systemic inflammatory responses.« less

  13. Microgravity-driven remodeling of the proteome reveals insights into molecular mechanisms and signal networks involved in response to the space flight environment.

    PubMed

    Rea, Giuseppina; Cristofaro, Francesco; Pani, Giuseppe; Pascucci, Barbara; Ghuge, Sandip A; Corsetto, Paola Antonia; Imbriani, Marcello; Visai, Livia; Rizzo, Angela M

    2016-03-30

    Space is a hostile environment characterized by high vacuum, extreme temperatures, meteoroids, space debris, ionospheric plasma, microgravity and space radiation, which all represent risks for human health. A deep understanding of the biological consequences of exposure to the space environment is required to design efficient countermeasures to minimize their negative impact on human health. Recently, proteomic approaches have received a significant amount of attention in the effort to further study microgravity-induced physiological changes. In this review, we summarize the current knowledge about the effects of microgravity on microorganisms (in particular Cupriavidus metallidurans CH34, Bacillus cereus and Rhodospirillum rubrum S1H), plants (whole plants, organs, and cell cultures), mammalian cells (endothelial cells, bone cells, chondrocytes, muscle cells, thyroid cancer cells, immune system cells) and animals (invertebrates, vertebrates and mammals). Herein, we describe their proteome's response to microgravity, focusing on proteomic discoveries and their future potential applications in space research. Space experiments and operational flight experience have identified detrimental effects on human health and performance because of exposure to weightlessness, even when currently available countermeasures are implemented. Many experimental tools and methods have been developed to study microgravity induced physiological changes. Recently, genomic and proteomic approaches have received a significant amount of attention. This review summarizes the recent research studies of the proteome response to microgravity inmicroorganisms, plants, mammalians cells and animals. Current proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of all proteomes. Understanding gene and/or protein expression is the key to unlocking the mechanisms behind microgravity-induced problems and to finding effective countermeasures to spaceflight-induced alterations but also for the study of diseases on earth. Future perspectives are also highlighted. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Statistical Design for Biospecimen Cohort Size in Proteomics-based Biomarker Discovery and Verification Studies

    PubMed Central

    Skates, Steven J.; Gillette, Michael A.; LaBaer, Joshua; Carr, Steven A.; Anderson, N. Leigh; Liebler, Daniel C.; Ransohoff, David; Rifai, Nader; Kondratovich, Marina; Težak, Živana; Mansfield, Elizabeth; Oberg, Ann L.; Wright, Ian; Barnes, Grady; Gail, Mitchell; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Boja, Emily S.

    2014-01-01

    Protein biomarkers are needed to deepen our understanding of cancer biology and to improve our ability to diagnose, monitor and treat cancers. Important analytical and clinical hurdles must be overcome to allow the most promising protein biomarker candidates to advance into clinical validation studies. Although contemporary proteomics technologies support the measurement of large numbers of proteins in individual clinical specimens, sample throughput remains comparatively low. This problem is amplified in typical clinical proteomics research studies, which routinely suffer from a lack of proper experimental design, resulting in analysis of too few biospecimens to achieve adequate statistical power at each stage of a biomarker pipeline. To address this critical shortcoming, a joint workshop was held by the National Cancer Institute (NCI), National Heart, Lung and Blood Institute (NHLBI), and American Association for Clinical Chemistry (AACC), with participation from the U.S. Food and Drug Administration (FDA). An important output from the workshop was a statistical framework for the design of biomarker discovery and verification studies. Herein, we describe the use of quantitative clinical judgments to set statistical criteria for clinical relevance, and the development of an approach to calculate biospecimen sample size for proteomic studies in discovery and verification stages prior to clinical validation stage. This represents a first step towards building a consensus on quantitative criteria for statistical design of proteomics biomarker discovery and verification research. PMID:24063748

  15. Statistical design for biospecimen cohort size in proteomics-based biomarker discovery and verification studies.

    PubMed

    Skates, Steven J; Gillette, Michael A; LaBaer, Joshua; Carr, Steven A; Anderson, Leigh; Liebler, Daniel C; Ransohoff, David; Rifai, Nader; Kondratovich, Marina; Težak, Živana; Mansfield, Elizabeth; Oberg, Ann L; Wright, Ian; Barnes, Grady; Gail, Mitchell; Mesri, Mehdi; Kinsinger, Christopher R; Rodriguez, Henry; Boja, Emily S

    2013-12-06

    Protein biomarkers are needed to deepen our understanding of cancer biology and to improve our ability to diagnose, monitor, and treat cancers. Important analytical and clinical hurdles must be overcome to allow the most promising protein biomarker candidates to advance into clinical validation studies. Although contemporary proteomics technologies support the measurement of large numbers of proteins in individual clinical specimens, sample throughput remains comparatively low. This problem is amplified in typical clinical proteomics research studies, which routinely suffer from a lack of proper experimental design, resulting in analysis of too few biospecimens to achieve adequate statistical power at each stage of a biomarker pipeline. To address this critical shortcoming, a joint workshop was held by the National Cancer Institute (NCI), National Heart, Lung, and Blood Institute (NHLBI), and American Association for Clinical Chemistry (AACC) with participation from the U.S. Food and Drug Administration (FDA). An important output from the workshop was a statistical framework for the design of biomarker discovery and verification studies. Herein, we describe the use of quantitative clinical judgments to set statistical criteria for clinical relevance and the development of an approach to calculate biospecimen sample size for proteomic studies in discovery and verification stages prior to clinical validation stage. This represents a first step toward building a consensus on quantitative criteria for statistical design of proteomics biomarker discovery and verification research.

  16. Contribution of proteomics to the study of plant pathogenic fungi.

    PubMed

    Gonzalez-Fernandez, Raquel; Jorrin-Novo, Jesus V

    2012-01-01

    Phytopathogenic fungi are one of the most damaging plant parasitic organisms, and can cause serious diseases and important yield losses in crops. The study of the biology of these microorganisms and the interaction with their hosts has experienced great advances in recent years due to the development of moderm, holistic and high-throughput -omic techniques, together with the increasing number of genome sequencing projects and the development of mutants and reverse genetics tools. We highlight among these -omic techniques the importance of proteomics, which has become a relevant tool in plant-fungus pathosystem research. Proteomics intends to identify gene products with a key role in pathogenicity and virulence. These studies would help in the search of key protein targets and in the development of agrochemicals, which may open new ways for crop disease diagnosis and protection. In this review, we made an overview on the contribution of proteomics to the knowledge of life cycle, infection mechanisms, and virulence of the plant pathogenic fungi. Data from current, innovative literature, according to both methodological and experimental systems, were summarized and discussed. Specific sections were devoted to the most studied fungal phytopathogens: Botrytis cinerea, Sclerotinia sclerotiorum, and Fusarium graminearum.

  17. The Present and Future of Biomarkers in Prostate Cancer: Proteomics, Genomics, and Immunology Advancements

    PubMed Central

    Gaudreau, Pierre-Olivier; Stagg, John; Soulières, Denis; Saad, Fred

    2016-01-01

    Prostate cancer (PC) is the second most common form of cancer in men worldwide. Biomarkers have emerged as essential tools for treatment and assessment since the variability of disease behavior, the cost and diversity of treatments, and the related impairment of quality of life have given rise to a need for a personalized approach. High-throughput technology platforms in proteomics and genomics have accelerated the development of biomarkers. Furthermore, recent successes of several new agents in PC, including immunotherapy, have stimulated the search for predictors of response and resistance and have improved the understanding of the biological mechanisms at work. This review provides an overview of currently established biomarkers in PC, as well as a selection of the most promising biomarkers within these particular fields of development. PMID:27168728

  18. The diverse and expanding role of mass spectrometry in structural and molecular biology.

    PubMed

    Lössl, Philip; van de Waterbeemd, Michiel; Heck, Albert Jr

    2016-12-15

    The emergence of proteomics has led to major technological advances in mass spectrometry (MS). These advancements not only benefitted MS-based high-throughput proteomics but also increased the impact of mass spectrometry on the field of structural and molecular biology. Here, we review how state-of-the-art MS methods, including native MS, top-down protein sequencing, cross-linking-MS, and hydrogen-deuterium exchange-MS, nowadays enable the characterization of biomolecular structures, functions, and interactions. In particular, we focus on the role of mass spectrometry in integrated structural and molecular biology investigations of biological macromolecular complexes and cellular machineries, highlighting work on CRISPR-Cas systems and eukaryotic transcription complexes. © 2016 The Authors. Published under the terms of the CC BY NC ND 4.0 license.

  19. The clinical impact of recent advances in LC-MS for cancer biomarker discovery and verification.

    PubMed

    Wang, Hui; Shi, Tujin; Qian, Wei-Jun; Liu, Tao; Kagan, Jacob; Srivastava, Sudhir; Smith, Richard D; Rodland, Karin D; Camp, David G

    2016-01-01

    Mass spectrometry (MS) -based proteomics has become an indispensable tool with broad applications in systems biology and biomedical research. With recent advances in liquid chromatography (LC) and MS instrumentation, LC-MS is making increasingly significant contributions to clinical applications, especially in the area of cancer biomarker discovery and verification. To overcome challenges associated with analyses of clinical samples (for example, a wide dynamic range of protein concentrations in bodily fluids and the need to perform high throughput and accurate quantification of candidate biomarker proteins), significant efforts have been devoted to improve the overall performance of LC-MS-based clinical proteomics platforms. Reviewed here are the recent advances in LC-MS and its applications in cancer biomarker discovery and quantification, along with the potentials, limitations and future perspectives.

  20. Multi-component immunoaffinity subtraction chromatography: an innovative step towards a comprehensive survey of the human plasma proteome.

    PubMed

    Pieper, Rembert; Su, Qin; Gatlin, Christine L; Huang, Shih-Ting; Anderson, N Leigh; Steiner, Sandra

    2003-04-01

    In order to discover novel protein markers indicative of disease processes or drug effects, the proteomics technology platform most commonly used consists of high resolution protein separation by two-dimensional electrophoresis (2-DE), mass spectrometric identification of proteins from stained gel spots and a bioinformatic data analysis process supported by statistics. This approach has been more successful in profiling proteins and their disease- or treatment-related quantitative changes in tissue homogenates than in plasma samples. Plasma protein display and quantitation suffer from several disadvantages: very high abundance of a few proteins; high heterogeneity of many proteins resulting in long charge trains; crowding of 2-DE separated protein spots in the molecular mass range between 45-80 kD and in the isoelectric point range between 4.5 and 6. Therefore, proteomic technologies are needed that address these problems and particularly allow accurate quantitation of a larger number of less abundant proteins in plasma and other body fluids. The immunoaffinity-based protein subtraction chromatography (IASC) described here removes multiple proteins present in plasma and serum in high concentrations effectively and reproducibly. Applying IASC as an upfront plasma sample preparation process for 2-DE, the protein spot pattern observed in gels changes dramatically and at least 350 additional lower abundance proteins are visualized. Affinity-purified polyclonal antibodies (pAbs) are the immunoaffinity reagents used to specifically remove the abundant proteins such as albumin, immunoglobulin G, immunoglobulin A, transferrin, haptoglobin, alpha-1-antitrypsin, hemopexin, transthyretin, alpha-2-HS glycoprotein, alpha-1-acid glycoprotein, alpha-2-macroglobulin and fibrinogen from human plasma samples. To render the immunoaffinity subtraction procedure recyclable, the pAbs are immobilized and cross-linked on chromatographic matrices. Antibody-coupled matrices specific for one protein each can be pooled to form mixed-bed IASC columns. We show that up to ten affinity-bound plasma proteins with similar solubility characteristics are eluted from a mixed-bed column in one step. This facilitates automated chromatographic processing of plasma samples in high throughput, which is desirable in proteomic disease marker discovery projects.

  1. Biomarker Discovery by Novel Sensors Based on Nanoproteomics Approaches

    PubMed Central

    Dasilva, Noelia; Díez, Paula; Matarraz, Sergio; González-González, María; Paradinas, Sara; Orfao, Alberto; Fuentes, Manuel

    2012-01-01

    During the last years, proteomics has facilitated biomarker discovery by coupling high-throughput techniques with novel nanosensors. In the present review, we focus on the study of label-based and label-free detection systems, as well as nanotechnology approaches, indicating their advantages and applications in biomarker discovery. In addition, several disease biomarkers are shown in order to display the clinical importance of the improvement of sensitivity and selectivity by using nanoproteomics approaches as novel sensors. PMID:22438764

  2. SubCellProt: predicting protein subcellular localization using machine learning approaches.

    PubMed

    Garg, Prabha; Sharma, Virag; Chaudhari, Pradeep; Roy, Nilanjan

    2009-01-01

    High-throughput genome sequencing projects continue to churn out enormous amounts of raw sequence data. However, most of this raw sequence data is unannotated and, hence, not very useful. Among the various approaches to decipher the function of a protein, one is to determine its localization. Experimental approaches for proteome annotation including determination of a protein's subcellular localizations are very costly and labor intensive. Besides the available experimental methods, in silico methods present alternative approaches to accomplish this task. Here, we present two machine learning approaches for prediction of the subcellular localization of a protein from the primary sequence information. Two machine learning algorithms, k Nearest Neighbor (k-NN) and Probabilistic Neural Network (PNN) were used to classify an unknown protein into one of the 11 subcellular localizations. The final prediction is made on the basis of a consensus of the predictions made by two algorithms and a probability is assigned to it. The results indicate that the primary sequence derived features like amino acid composition, sequence order and physicochemical properties can be used to assign subcellular localization with a fair degree of accuracy. Moreover, with the enhanced accuracy of our approach and the definition of a prediction domain, this method can be used for proteome annotation in a high throughput manner. SubCellProt is available at www.databases.niper.ac.in/SubCellProt.

  3. Dimension reduction techniques for the integrative analysis of multi-omics data

    PubMed Central

    Zeleznik, Oana A.; Thallinger, Gerhard G.; Kuster, Bernhard; Gholami, Amin M.

    2016-01-01

    State-of-the-art next-generation sequencing, transcriptomics, proteomics and other high-throughput ‘omics' technologies enable the efficient generation of large experimental data sets. These data may yield unprecedented knowledge about molecular pathways in cells and their role in disease. Dimension reduction approaches have been widely used in exploratory analysis of single omics data sets. This review will focus on dimension reduction approaches for simultaneous exploratory analyses of multiple data sets. These methods extract the linear relationships that best explain the correlated structure across data sets, the variability both within and between variables (or observations) and may highlight data issues such as batch effects or outliers. We explore dimension reduction techniques as one of the emerging approaches for data integration, and how these can be applied to increase our understanding of biological systems in normal physiological function and disease. PMID:26969681

  4. Proteome- and transcriptome-driven reconstruction of the human myocyte metabolic network and its use for identification of markers for diabetes.

    PubMed

    Väremo, Leif; Scheele, Camilla; Broholm, Christa; Mardinoglu, Adil; Kampf, Caroline; Asplund, Anna; Nookaew, Intawat; Uhlén, Mathias; Pedersen, Bente Klarlund; Nielsen, Jens

    2015-05-12

    Skeletal myocytes are metabolically active and susceptible to insulin resistance and are thus implicated in type 2 diabetes (T2D). This complex disease involves systemic metabolic changes, and their elucidation at the systems level requires genome-wide data and biological networks. Genome-scale metabolic models (GEMs) provide a network context for the integration of high-throughput data. We generated myocyte-specific RNA-sequencing data and investigated their correlation with proteome data. These data were then used to reconstruct a comprehensive myocyte GEM. Next, we performed a meta-analysis of six studies comparing muscle transcription in T2D versus healthy subjects. Transcriptional changes were mapped on the myocyte GEM, revealing extensive transcriptional regulation in T2D, particularly around pyruvate oxidation, branched-chain amino acid catabolism, and tetrahydrofolate metabolism, connected through the downregulated dihydrolipoamide dehydrogenase. Strikingly, the gene signature underlying this metabolic regulation successfully classifies the disease state of individual samples, suggesting that regulation of these pathways is a ubiquitous feature of myocytes in response to T2D. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  5. Genome-derived vaccines.

    PubMed

    De Groot, Anne S; Rappuoli, Rino

    2004-02-01

    Vaccine research entered a new era when the complete genome of a pathogenic bacterium was published in 1995. Since then, more than 97 bacterial pathogens have been sequenced and at least 110 additional projects are now in progress. Genome sequencing has also dramatically accelerated: high-throughput facilities can draft the sequence of an entire microbe (two to four megabases) in 1 to 2 days. Vaccine developers are using microarrays, immunoinformatics, proteomics and high-throughput immunology assays to reduce the truly unmanageable volume of information available in genome databases to a manageable size. Vaccines composed by novel antigens discovered from genome mining are already in clinical trials. Within 5 years we can expect to see a novel class of vaccines composed by genome-predicted, assembled and engineered T- and Bcell epitopes. This article addresses the convergence of three forces--microbial genome sequencing, computational immunology and new vaccine technologies--that are shifting genome mining for vaccines onto the forefront of immunology research.

  6. Exploring the potential of laser capture microdissection technology in integrated oral biosciences.

    PubMed

    Thennavan, A; Sharma, M; Chandrashekar, C; Hunter, K; Radhakrishnan, R

    2017-09-01

    Laser capture microdissection (LCM) is a high-end research and diagnostic technology that helps in obtaining pure cell populations for the purpose of cell- or lesion-specific genomic and proteomic analysis. Literature search on the application of LCM in oral tissues was made through PubMed. There is ample evidence to substantiate the utility of LCM in understanding the underlying molecular mechanism involving an array of oral physiological and pathological processes, including odontogenesis, taste perception, eruptive tooth movement, oral microbes, and cancers of the mouth and jaw tumors. This review is aimed at exploring the potential application of LCM in oral tissues as a high-throughput tool for integrated oral sciences. The indispensable application of LCM in the construction of lesion-specific genomic libraries with emphasis on some of the novel molecular markers thus discovered is also highlighted. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Redox proteomics screening cellular factors associated with oxidative stress in hepatocarcinogenesis.

    PubMed

    Zhou, Li; Wen, Ji; Huang, Zhao; Nice, Edouard C; Huang, Canhua; Zhang, Haiyuan; Li, Qifu

    2017-03-01

    Liver cancer is a major global health problem being the sixth most common cancer and the third cause of cancer-related death, with hepatocellular carcinoma (HCC) representing more than 90% of primary liver cancers. Mounting evidence suggests that, compared with their normal counterparts, many types of cancer cell have increased levels of ROS. Therefore, cancer cells need to combat high levels of ROS, especially at early stages of tumor development. Recent studies have revealed that ROS-mediated regulation of redox-sensitive proteins (redox sensors) is involved in the pathogenesis and/or progression of many human diseases, including cancer. Unraveling the altered functions of redox sensors and the underlying mechanisms in hepatocarcinogenesis is critical for the development of novel cancer therapeutics. For this reason, redox proteomics has been developed for the high-throughput screening of redox sensors, which will benefit the development of novel therapeutic strategies for the treatment of HCC. In this review, we will briefly introduce several novel redox proteomics techniques that are currently available to study various oxidative modifications in hepatocarcinogenesis and summarize the most important discoveries in the study of redox processes related to the development and progression of HCC. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Accurate, Sensitive, and Precise Multiplexed Proteomics Using the Complement Reporter Ion Cluster

    DOE PAGES

    Sonnett, Matthew; Yeung, Eyan; Wuhr, Martin

    2018-03-09

    We present that quantitative analysis of proteomes across multiple time points, organelles, and perturbations is essential for understanding both fundamental biology and disease states. The development of isobaric tags (e.g. TMT) have enabled the simultaneous measurement of peptide abundances across several different conditions. These multiplexed approaches are promising in principle because of advantages in throughput and measurement quality. However, in practice existing multiplexing approaches suffer from key limitations. In its simple implementation (TMT-MS2), measurements are distorted by chemical noise leading to poor measurement accuracy. The current state-of-the-art (TMT-MS3) addresses this, but requires specialized quadrupole-iontrap-Orbitrap instrumentation. The complement reporter ion approachmore » (TMTc) produces high accuracy measurements and is compatible with many more instruments, like quadrupole-Orbitraps. However, the required deconvolution of the TMTc cluster leads to poor measurement precision. Here, we introduce TMTc+, which adds the modeling of the MS2-isolation step into the deconvolution algorithm. The resulting measurements are comparable in precision to TMT-MS3/MS2. The improved duty cycle, and lower filtering requirements make TMTc+ more sensitive than TMT-MS3 and comparable with TMT-MS2. At the same time, unlike TMT-MS2, TMTc+ is exquisitely able to distinguish signal from chemical noise even outperforming TMT-MS3. Lastly, we compare TMTc+ to quantitative label-free proteomics of total HeLa lysate and find that TMTc+ quantifies 7.8k versus 3.9k proteins in a 5-plex sample. At the same time the median coefficient of variation improves from 13% to 4%. Furthermore, TMTc+ advances quantitative proteomics by enabling accurate, sensitive, and precise multiplexed experiments on more commonly used instruments.« less

  9. Accurate, Sensitive, and Precise Multiplexed Proteomics Using the Complement Reporter Ion Cluster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sonnett, Matthew; Yeung, Eyan; Wuhr, Martin

    We present that quantitative analysis of proteomes across multiple time points, organelles, and perturbations is essential for understanding both fundamental biology and disease states. The development of isobaric tags (e.g. TMT) have enabled the simultaneous measurement of peptide abundances across several different conditions. These multiplexed approaches are promising in principle because of advantages in throughput and measurement quality. However, in practice existing multiplexing approaches suffer from key limitations. In its simple implementation (TMT-MS2), measurements are distorted by chemical noise leading to poor measurement accuracy. The current state-of-the-art (TMT-MS3) addresses this, but requires specialized quadrupole-iontrap-Orbitrap instrumentation. The complement reporter ion approachmore » (TMTc) produces high accuracy measurements and is compatible with many more instruments, like quadrupole-Orbitraps. However, the required deconvolution of the TMTc cluster leads to poor measurement precision. Here, we introduce TMTc+, which adds the modeling of the MS2-isolation step into the deconvolution algorithm. The resulting measurements are comparable in precision to TMT-MS3/MS2. The improved duty cycle, and lower filtering requirements make TMTc+ more sensitive than TMT-MS3 and comparable with TMT-MS2. At the same time, unlike TMT-MS2, TMTc+ is exquisitely able to distinguish signal from chemical noise even outperforming TMT-MS3. Lastly, we compare TMTc+ to quantitative label-free proteomics of total HeLa lysate and find that TMTc+ quantifies 7.8k versus 3.9k proteins in a 5-plex sample. At the same time the median coefficient of variation improves from 13% to 4%. Furthermore, TMTc+ advances quantitative proteomics by enabling accurate, sensitive, and precise multiplexed experiments on more commonly used instruments.« less

  10. Interclonal proteomic responses to predator exposure in Daphnia magna may depend on predator composition of habitats.

    PubMed

    Otte, Kathrin A; Schrank, Isabella; Fröhlich, Thomas; Arnold, Georg J; Laforsch, Christian

    2015-08-01

    Phenotypic plasticity, the ability of one genotype to express different phenotypes in response to changing environmental conditions, is one of the most common phenomena characterizing the living world and is not only relevant for the ecology but also for the evolution of species. Daphnia, the water flea, is a textbook example for predator-induced phenotypic plastic defences; however, the analysis of molecular mechanisms underlying these inducible defences is still in its early stages. We exposed Daphnia magna to chemical cues of the predator Triops cancriformis to identify key processes underlying plastic defensive trait formation. To get a more comprehensive idea of this phenomenon, we studied four genotypes with five biological replicates each, originating from habitats characterized by different predator composition, ranging from predator-free habitats to habitats containing T. cancriformis. We analysed the morphologies as well as proteomes of predator-exposed and control animals. Three genotypes showed morphological changes when the predator was present. Using a high-throughput proteomics approach, we found 294 proteins which were significantly altered in their abundance after predator exposure in a general or genotype-dependent manner. Proteins connected to genotype-dependent responses were related to the cuticle, protein synthesis and calcium binding, whereas the yolk protein vitellogenin increased in abundance in all genotypes, indicating their involvement in a more general response. Furthermore, genotype-dependent responses at the proteome level were most distinct for the only genotype that shares its habitat with Triops. Altogether, our study provides new insights concerning genotype-dependent and general molecular processes involved in predator-induced phenotypic plasticity in D. magna. © 2015 John Wiley & Sons Ltd.

  11. Integration of Transcriptome, Proteome and Metabolism Data Reveals the Alkaloids Biosynthesis in Macleaya cordata and Macleaya microcarpa

    PubMed Central

    Liu, Fuqing; Huang, Peng; Zhu, Pengcheng; Chen, Jinjun; Shi, Mingming; Guo, Fang; Cheng, Pi; Zeng, Jing; Liao, Yifang; Gong, Jing; Zhang, Hong-Mei; Wang, Depeng; Guo, An-Yuan; Xiong, Xingyao

    2013-01-01

    Background The Macleaya spp., including Macleaya cordata and Macleaya microcarpa, are traditional anti-virus, inflammation eliminating, and insecticide herb medicines for their isoquinoline alkaloids. They are also known as the basis of the popular natural animal food addictive in Europe. However, few studies especially at genomics level were conducted on them. Hence, we performed the Macleaya spp. transcriptome and integrated it with iTRAQ proteome analysis in order to identify potential genes involved in alkaloids biosynthesis. Methodology and Principal Findings We elaborately designed the transcriptome, proteome and metabolism profiling for 10 samples of both species to explore their alkaloids biosynthesis. From the transcriptome data, we obtained 69367 and 78255 unigenes for M. cordata and M. microcarpa, in which about two thirds of them were similar to sequences in public databases. By metabolism profiling, reverse patterns for alkaloids sanguinarine, chelerythrine, protopine, and allocryptopine were observed in different organs of two species. We characterized the expressions of enzymes in alkaloid biosynthesis pathways. We also identified more than 1000 proteins from iTRAQ proteome data. Our results strongly suggest that the root maybe the organ for major alkaloids biosynthesis of Macleaya spp. Except for biosynthesis, the alkaloids storage and transport were also important for their accumulation. The ultrastructure of laticifers by SEM helps us to prove the alkaloids maybe accumulated in the mature roots. Conclusions/Significance To our knowledge this is the first study to elucidate the genetic makeup of Macleaya spp. This work provides clues to the identification of the potential modulate genes involved in alkaloids biosynthesis in Macleaya spp., and sheds light on researches for non-model medicinal plants by integrating different high-throughput technologies. PMID:23326424

  12. ATP-Sensitive K+ Channel Knockout Induces Cardiac Proteome Remodeling Predictive of Heart Disease Susceptibility

    PubMed Central

    Arrell, D. Kent; Zlatkovic, Jelena; Kane, Garvan C.; Yamada, Satsuki; Terzic, Andre

    2010-01-01

    Forecasting disease susceptibility requires detection of maladaptive signatures prior to onset of overt symptoms. A case-in-point are cardiac ATP-sensitive K+ (KATP) channelopathies, for which the substrate underlying disease vulnerability remains to be identified. Resolving molecular pathobiology, even for single genetic defects, mandates a systems platform to reliably diagnose disease predisposition. High-throughput proteomic analysis was here integrated with network biology to decode consequences of Kir6.2 KATP channel pore deletion. Differential two-dimensional gel electrophoresis reproducibly resolved > 800 protein species from hearts of asymptomatic wild-type and Kir6.2-knockout counterparts. KATP channel ablation remodeled the cardiac proteome, significantly altering 71 protein spots, from which 102 unique identities were assigned following hybrid linear ion trap quadrupole-Orbitrap tandem mass spectrometry. Ontological annotation stratified the KATP channel-dependent protein cohort into a predominant bioenergetic module (63 resolved identities), with additional focused sets representing signaling molecules (6), oxidoreductases (8), chaperones (6), and proteins involved in catabolism (6), cytostructure (8), and transcription and translation (5). Protein interaction mapping, in conjunction with expression level changes, localized a KATP channel-associated subproteome within a nonstochastic scale-free network. Global assessment of the KATP channel deficient environment verified the primary impact on metabolic pathways and revealed overrepresentation of markers associated with cardiovascular disease. Experimental imposition of graded stress precipitated exaggerated structural and functional myocardial defects in the Kir6.2-knockout, decreasing survivorship and validating the forecast of disease susceptibility. Proteomic cartography thus provides an integral view of molecular remodeling in the heart induced by KATP channel deletion, establishing a systems approach that predicts outcome at a presymptomatic stage. PMID:19673485

  13. An Unsupervised kNN Method to Systematically Detect Changes in Protein Localization in High-Throughput Microscopy Images.

    PubMed

    Lu, Alex Xijie; Moses, Alan M

    2016-01-01

    Despite the importance of characterizing genes that exhibit subcellular localization changes between conditions in proteome-wide imaging experiments, many recent studies still rely upon manual evaluation to assess the results of high-throughput imaging experiments. We describe and demonstrate an unsupervised k-nearest neighbours method for the detection of localization changes. Compared to previous classification-based supervised change detection methods, our method is much simpler and faster, and operates directly on the feature space to overcome limitations in needing to manually curate training sets that may not generalize well between screens. In addition, the output of our method is flexible in its utility, generating both a quantitatively ranked list of localization changes that permit user-defined cut-offs, and a vector for each gene describing feature-wise direction and magnitude of localization changes. We demonstrate that our method is effective at the detection of localization changes using the Δrpd3 perturbation in Saccharomyces cerevisiae, where we capture 71.4% of previously known changes within the top 10% of ranked genes, and find at least four new localization changes within the top 1% of ranked genes. The results of our analysis indicate that simple unsupervised methods may be able to identify localization changes in images without laborious manual image labelling steps.

  14. At the Tipping Point

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wiley, H. S.

    There comes a time in every field of science when things suddenly change. While it might not be immediately apparent that things are different, a tipping point has occurred. Biology is now at such a point. The reason is the introduction of high-throughput genomics-based technologies. I am not talking about the consequences of the sequencing of the human genome (and every other genome within reach). The change is due to new technologies that generate an enormous amount of data about the molecular composition of cells. These include proteomics, transcriptional profiling by sequencing, and the ability to globally measure microRNAs andmore » post-translational modifications of proteins. These mountains of digital data can be mapped to a common frame of reference: the organism’s genome. With the new high-throughput technologies, we can generate tens of thousands of data points from each sample. Data are now measured in terabytes and the time necessary to analyze data can now require years. Obviously, we can’t wait to interpret the data fully before the next experiment. In fact, we might never be able to even look at all of it, much less understand it. This volume of data requires sophisticated computational and statistical methods for its analysis and is forcing biologists to approach data interpretation as a collaborative venture.« less

  15. Mass spectrometry-based targeted quantitative proteomics: achieving sensitive and reproducible detection of proteins.

    PubMed

    Boja, Emily S; Rodriguez, Henry

    2012-04-01

    Traditional shotgun proteomics used to detect a mixture of hundreds to thousands of proteins through mass spectrometric analysis, has been the standard approach in research to profile protein content in a biological sample which could lead to the discovery of new (and all) protein candidates with diagnostic, prognostic, and therapeutic values. In practice, this approach requires significant resources and time, and does not necessarily represent the goal of the researcher who would rather study a subset of such discovered proteins (including their variations or posttranslational modifications) under different biological conditions. In this context, targeted proteomics is playing an increasingly important role in the accurate measurement of protein targets in biological samples in the hope of elucidating the molecular mechanism of cellular function via the understanding of intricate protein networks and pathways. One such (targeted) approach, selected reaction monitoring (or multiple reaction monitoring) mass spectrometry (MRM-MS), offers the capability of measuring multiple proteins with higher sensitivity and throughput than shotgun proteomics. Developing and validating MRM-MS-based assays, however, is an extensive and iterative process, requiring a coordinated and collaborative effort by the scientific community through the sharing of publicly accessible data and datasets, bioinformatic tools, standard operating procedures, and well characterized reagents. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Design and Initial Characterization of the SC-200 Proteomics Standard Mixture

    PubMed Central

    Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald

    2011-01-01

    Abstract High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels. PMID:21250827

  17. Design and initial characterization of the SC-200 proteomics standard mixture.

    PubMed

    Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald; Kolker, Eugene

    2011-01-01

    High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels.

  18. GProX, a user-friendly platform for bioinformatics analysis and visualization of quantitative proteomics data.

    PubMed

    Rigbolt, Kristoffer T G; Vanselow, Jens T; Blagoev, Blagoy

    2011-08-01

    Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data we developed the Graphical Proteomics Data Explorer (GProX)(1). The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data is available and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net.

  19. GProX, a User-Friendly Platform for Bioinformatics Analysis and Visualization of Quantitative Proteomics Data*

    PubMed Central

    Rigbolt, Kristoffer T. G.; Vanselow, Jens T.; Blagoev, Blagoy

    2011-01-01

    Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data we developed the Graphical Proteomics Data Explorer (GProX)1. The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data is available and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net. PMID:21602510

  20. High throughput profile-profile based fold recognition for the entire human proteome.

    PubMed

    McGuffin, Liam J; Smith, Richard T; Bryson, Kevin; Sørensen, Søren-Aksel; Jones, David T

    2006-06-07

    In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power. In this paper we describe and benchmark the JYDE (Job Yield Distribution Environment) system, which is a meta-scheduler designed to work above cluster schedulers, such as Sun Grid Engine (SGE) or Condor. We demonstrate the ability of JYDE to distribute the load of genomic-scale fold recognition across multiple independent Grid domains. We use the most recent profile-profile version of our mGenTHREADER software in order to annotate the latest version of the Human proteome against the latest sequence and structure databases in as short a time as possible. We show that our JYDE system is able to scale to large numbers of intensive fold recognition jobs running across several independent computer clusters. Using our JYDE system we have been able to annotate 99.9% of the protein sequences within the Human proteome in less than 24 hours, by harnessing over 500 CPUs from 3 independent Grid domains. This study clearly demonstrates the feasibility of carrying out on demand high quality structural annotations for the proteomes of major eukaryotic organisms. Specifically, we have shown that it is now possible to provide complete regular updates of profile-profile based fold recognition models for entire eukaryotic proteomes, through the use of Grid middleware such as JYDE.

  1. High-Throughput Analysis of Age-Dependent Protein Changes in Layer II/III of the Human Orbitofrontal Cortex

    NASA Astrophysics Data System (ADS)

    Kapadia, Fenika

    Studies on the orbitofrontal cortex (OFC) during normal aging have shown a decline in cognitive functions, a loss of spines/synapses in layer III and gene expression changes related to neural communication. Biological changes during the course of normal aging are summarized into 9 hallmarks based on aging in peripheral tissue. Whether these hallmarks apply to non-dividing brain tissue is not known. Therefore, we opted to perform large-scale proteomic profiling of the OFC layer II/III during normal aging from 15 young and 18 old male subjects. MaxQuant was utilized for label-free quantification and statistical analysis by the Random Intercept Model (RIM) identified 118 differentially expressed (DE) age-related proteins. Altered neural communication was the most represented hallmark of aging (54% of DE proteins), highlighting the importance of communication in the brain. Functional analysis showed enrichment in GABA/glutamate signaling and pro-inflammatory responses. The former may contribute to alterations in excitation/inhibition, leading to cognitive decline during aging.

  2. Structural Analysis of PTM Hotspots (SAPH-ire) – A Quantitative Informatics Method Enabling the Discovery of Novel Regulatory Elements in Protein Families*

    PubMed Central

    Dewhurst, Henry M.; Choudhury, Shilpa; Torres, Matthew P.

    2015-01-01

    Predicting the biological function potential of post-translational modifications (PTMs) is becoming increasingly important in light of the exponential increase in available PTM data from high-throughput proteomics. We developed structural analysis of PTM hotspots (SAPH-ire)—a quantitative PTM ranking method that integrates experimental PTM observations, sequence conservation, protein structure, and interaction data to allow rank order comparisons within or between protein families. Here, we applied SAPH-ire to the study of PTMs in diverse G protein families, a conserved and ubiquitous class of proteins essential for maintenance of intracellular structure (tubulins) and signal transduction (large and small Ras-like G proteins). A total of 1728 experimentally verified PTMs from eight unique G protein families were clustered into 451 unique hotspots, 51 of which have a known and cited biological function or response. Using customized software, the hotspots were analyzed in the context of 598 unique protein structures. By comparing distributions of hotspots with known versus unknown function, we show that SAPH-ire analysis is predictive for PTM biological function. Notably, SAPH-ire revealed high-ranking hotspots for which a functional impact has not yet been determined, including phosphorylation hotspots in the N-terminal tails of G protein gamma subunits—conserved protein structures never before reported as regulators of G protein coupled receptor signaling. To validate this prediction we used the yeast model system for G protein coupled receptor signaling, revealing that gamma subunit–N-terminal tail phosphorylation is activated in response to G protein coupled receptor stimulation and regulates protein stability in vivo. These results demonstrate the utility of integrating protein structural and sequence features into PTM prioritization schemes that can improve the analysis and functional power of modification-specific proteomics data. PMID:26070665

  3. BIG: a large-scale data integration tool for renal physiology.

    PubMed

    Zhao, Yue; Yang, Chin-Rang; Raghuram, Viswanathan; Parulekar, Jaya; Knepper, Mark A

    2016-10-01

    Due to recent advances in high-throughput techniques, we and others have generated multiple proteomic and transcriptomic databases to describe and quantify gene expression, protein abundance, or cellular signaling on the scale of the whole genome/proteome in kidney cells. The existence of so much data from diverse sources raises the following question: "How can researchers find information efficiently for a given gene product over all of these data sets without searching each data set individually?" This is the type of problem that has motivated the "Big-Data" revolution in Data Science, which has driven progress in fields such as marketing. Here we present an online Big-Data tool called BIG (Biological Information Gatherer) that allows users to submit a single online query to obtain all relevant information from all indexed databases. BIG is accessible at http://big.nhlbi.nih.gov/.

  4. CPTAC Releases Largest-Ever Ovarian Cancer Proteome Dataset from Previously Genome Characterized Tumors | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have just released a comprehensive dataset of the proteomic analysis of high grade serous ovarian tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA).  This is one of the largest public datasets covering the proteome, phosphoproteome and glycoproteome with complementary deep genomic sequencing data on the same tumor.

  5. The Representation of Heart Development in the Gene Ontology

    PubMed Central

    Khodiyar, Varsha K.; Hill, David P.; Howe, Doug; Berardini, Tanya Z.; Tweedie, Susan; Talmud, Philippa J.; Breckenridge, Ross; Bhattarcharya, Shoumo; Riley, Paul; Scambler, Peter; Lovering, Ruth C.

    2012-01-01

    An understanding of heart development is critical in any systems biology approach to cardiovascular disease. The interpretation of data generated from high-throughput technologies (such as microarray and proteomics) is also essential to this approach. However, characterizing the role of genes in the processes underlying heart development and cardiovascular disease involves the non-trivial task of data analysis and integration of previous knowledge. The Gene Ontology (GO) Consortium provides structured controlled biological vocabularies that are used to summarize previous functional knowledge for gene products across all species. One aspect of GO describes biological processes, such as development and signaling. In order to support high-throughput cardiovascular research, we have initiated an effort to fully describe heart development in GO; expanding the number of GO terms describing heart development from 12 to over 280. This new ontology describes heart morphogenesis, the differentiation of specific cardiac cell types, and the involvement of signaling pathways in heart development and aligns GO with the current views of the heart development research community and its representation in the literature. This extension of GO allows gene product annotators to comprehensively capture the genetic program leading to the developmental progression of the heart. This will enable users to integrate heart development data across species, resulting in the comprehensive retrieval of information about this subject. The revised GO structure, combined with gene product annotations, should improve the interpretation of data from high-throughput methods in a variety of cardiovascular research areas, including heart development, congenital cardiac disease, and cardiac stem cell research. Additionally, we invite the heart development community to contribute to the expansion of this important dataset for the benefit of future research in this area. PMID:21419760

  6. High throughput, cell type-specific analysis of key proteins in human endometrial biopsies of women from fertile and infertile couples

    PubMed Central

    Leach, Richard E.; Jessmon, Philip; Coutifaris, Christos; Kruger, Michael; Myers, Evan R.; Ali-Fehmi, Rouba; Carson, Sandra A.; Legro, Richard S.; Schlaff, William D.; Carr, Bruce R.; Steinkampf, Michael P.; Silva, Susan; Leppert, Phyllis C.; Giudice, Linda; Diamond, Michael P.; Armant, D. Randall

    2012-01-01

    BACKGROUND Although histological dating of endometrial biopsies provides little help for prediction or diagnosis of infertility, analysis of individual endometrial proteins, proteomic profiling and transcriptome analysis have suggested several biomarkers with altered expression arising from intrinsic abnormalities, inadequate stimulation by or in response to gonadal steroids or altered function due to systemic disorders. The objective of this study was to delineate the developmental dynamics of potentially important proteins in the secretory phase of the menstrual cycle, utilizing a collection of endometrial biopsies from women of fertile (n = 89) and infertile (n = 89) couples. METHODS AND RESULTS Progesterone receptor-B (PGR-B), leukemia inhibitory factor, glycodelin/progestagen-associated endometrial protein (PAEP), homeobox A10, heparin-binding EGF-like growth factor, calcitonin and chemokine ligand 14 (CXCL14) were measured using a high-throughput, quantitative immunohistochemical method. Significant cyclic and tissue-specific regulation was documented for each protein, as well as their dysregulation in women of infertile couples. Infertile patients demonstrated a delay early in the secretory phase in the decline of PGR-B (P < 0.05) and premature mid-secretory increases in PAEP (P < 0.05) and CXCL14 (P < 0.05), suggesting that the implantation interval could be closing early. Correlation analysis identified potential interactions among certain proteins that were disrupted by infertility. CONCLUSIONS This approach overcomes the limitations of a small sample number. Protein expression and localization provided important insights into the potential roles of these proteins in normal and pathological development of the endometrium that is not attainable from transcriptome analysis, establishing a basis for biomarker, diagnostic and targeted drug development for women with infertility. PMID:22215622

  7. MINER: exploratory analysis of gene interaction networks by machine learning from expression data.

    PubMed

    Kadupitige, Sidath Randeni; Leung, Kin Chun; Sellmeier, Julia; Sivieng, Jane; Catchpoole, Daniel R; Bain, Michael E; Gaëta, Bruno A

    2009-12-03

    The reconstruction of gene regulatory networks from high-throughput "omics" data has become a major goal in the modelling of living systems. Numerous approaches have been proposed, most of which attempt only "one-shot" reconstruction of the whole network with no intervention from the user, or offer only simple correlation analysis to infer gene dependencies. We have developed MINER (Microarray Interactive Network Exploration and Representation), an application that combines multivariate non-linear tree learning of individual gene regulatory dependencies, visualisation of these dependencies as both trees and networks, and representation of known biological relationships based on common Gene Ontology annotations. MINER allows biologists to explore the dependencies influencing the expression of individual genes in a gene expression data set in the form of decision, model or regression trees, using their domain knowledge to guide the exploration and formulate hypotheses. Multiple trees can then be summarised in the form of a gene network diagram. MINER is being adopted by several of our collaborators and has already led to the discovery of a new significant regulatory relationship with subsequent experimental validation. Unlike most gene regulatory network inference methods, MINER allows the user to start from genes of interest and build the network gene-by-gene, incorporating domain expertise in the process. This approach has been used successfully with RNA microarray data but is applicable to other quantitative data produced by high-throughput technologies such as proteomics and "next generation" DNA sequencing.

  8. Elucidating structural and molecular mechanisms of β-arrestin-biased agonism at GPCRs via MS-based proteomics.

    PubMed

    Xiao, Kunhong; Sun, Jinpeng

    2018-01-01

    The discovery of β-arrestin-dependent GPCR signaling has led to an exciting new field in GPCR pharmacology: to develop "biased agonists" that can selectively target a specific downstream signaling pathway that elicits beneficial therapeutic effects without activating other pathways that elicit negative side effects. This new trend in GPCR drug discovery requires us to understand the structural and molecular mechanisms of β-arrestin-biased agonism, which largely remain unclear. We have used cutting-edge mass spectrometry (MS)-based proteomics, combined with systems, chemical and structural biology to study protein function, macromolecular interaction, protein expression and posttranslational modifications in the β-arrestin-dependent GPCR signaling. These high-throughput proteomic studies have provided a systems view of β-arrestin-biased agonism from several perspectives: distinct receptor phosphorylation barcode, multiple receptor conformations, distinct β-arrestin conformations, and ligand-specific signaling. The information obtained from these studies offers new insights into the molecular basis of GPCR regulation by β-arrestin and provides a potential platform for developing novel therapeutic interventions through GPCRs. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. A Researcher's Guide to Mass Spectrometry-Based Proteomics

    PubMed Central

    Savaryn, John P.; Toby, Timothy K.; Kelleher, Neil L.

    2016-01-01

    Mass spectrometry (MS) is widely recognized as a powerful analytical tool for molecular research. MS is used by researchers around the globe to identify, quantify, and characterize biomolecules like proteins from any number of biological conditions or sample types. As instrumentation has advanced, and with the coupling of liquid chromatography (LC) for high-throughput LC-MS/MS, a proteomics experiment measuring hundreds to thousands of proteins/protein groups is now commonplace. While expert practitioners who best understand the operation of LC-MS systems tend to have strong backgrounds in physics and engineering, consumers of proteomics data and technology are not exposed to the physio-chemical principles underlying the information they seek. Since articles and reviews tend not to focus on bridging this divide, our goal here is to span this gap and translate MS ion physics into language intuitive to the general reader active in basic or applied biomedical research. Here, we visually describe what happens to ions as they enter and move around inside a mass spectrometer. We describe basic MS principles, including electric current, ion optics, ion traps, quadrupole mass filters, and Orbitrap FT-analyzers. PMID:27553853

  10. Detection of biomarkers of pathogenic Naegleria fowleri through mass spectrometry and proteomics.

    PubMed

    Moura, Hercules; Izquierdo, Fernando; Woolfitt, Adrian R; Wagner, Glauber; Pinto, Tatiana; del Aguila, Carmen; Barr, John R

    2015-01-01

    Emerging methods based on mass spectrometry (MS) can be used in the rapid identification of microorganisms. Thus far, these practical and rapidly evolving methods have mainly been applied to characterize prokaryotes. We applied matrix-assisted laser-desorption-ionization-time-of-flight mass spectrometry MALDI-TOF MS in the analysis of whole cells of 18 N. fowleri isolates belonging to three genotypes. Fourteen originated from the cerebrospinal fluid or brain tissue of primary amoebic meningoencephalitis patients and four originated from water samples of hot springs, rivers, lakes or municipal water supplies. Whole Naegleria trophozoites grown in axenic cultures were washed and mixed with MALDI matrix. Mass spectra were acquired with a 4700 TOF-TOF instrument. MALDI-TOF MS yielded consistent patterns for all isolates examined. Using a combination of novel data processing methods for visual peak comparison, statistical analysis and proteomics database searching we were able to detect several biomarkers that can differentiate all species and isolates studied, along with common biomarkers for all N. fowleri isolates. Naegleria fowleri could be easily separated from other species within the genus Naegleria. A number of peaks detected were tentatively identified. MALDI-TOF MS fingerprinting is a rapid, reproducible, high-throughput alternative method for identifying Naegleria isolates. This method has potential for studying eukaryotic agents. © 2014 The Author(s) Journal of Eukaryotic Microbiology © 2014 International Society of Protistologists.

  11. Quantitative proteomic analysis for high-throughput screening of differential glycoproteins in hepatocellular carcinoma serum

    PubMed Central

    Gao, Hua-Jun; Chen, Ya-Jing; Zuo, Duo; Xiao, Ming-Ming; Li, Ying; Guo, Hua; Zhang, Ning; Chen, Rui-Bing

    2015-01-01

    Objective Hepatocellular carcinoma (HCC) is a leading cause of cancer-related deaths. Novel serum biomarkers are required to increase the sensitivity and specificity of serum screening for early HCC diagnosis. This study employed a quantitative proteomic strategy to analyze the differential expression of serum glycoproteins between HCC and normal control serum samples. Methods Lectin affinity chromatography (LAC) was used to enrich glycoproteins from the serum samples. Quantitative mass spectrometric analysis combined with stable isotope dimethyl labeling and 2D liquid chromatography (LC) separations were performed to examine the differential levels of the detected proteins between HCC and control serum samples. Western blot was used to analyze the differential expression levels of the three serum proteins. Results A total of 2,280 protein groups were identified in the serum samples from HCC patients by using the 2D LC-MS/MS method. Up to 36 proteins were up-regulated in the HCC serum, whereas 19 proteins were down-regulated. Three differential glycoproteins, namely, fibrinogen gamma chain (FGG), FOS-like antigen 2 (FOSL2), and α-1,6-mannosylglycoprotein 6-β-N-acetylglucosaminyltransferase B (MGAT5B) were validated by Western blot. All these three proteins were up-regulated in the HCC serum samples. Conclusion A quantitative glycoproteomic method was established and proven useful to determine potential novel biomarkers for HCC. PMID:26487969

  12. Reproducible Tissue Homogenization and Protein Extraction for Quantitative Proteomics Using MicroPestle-Assisted Pressure-Cycling Technology.

    PubMed

    Shao, Shiying; Guo, Tiannan; Gross, Vera; Lazarev, Alexander; Koh, Ching Chiek; Gillessen, Silke; Joerger, Markus; Jochum, Wolfram; Aebersold, Ruedi

    2016-06-03

    The reproducible and efficient extraction of proteins from biopsy samples for quantitative analysis is a critical step in biomarker and translational research. Recently, we described a method consisting of pressure-cycling technology (PCT) and sequential windowed acquisition of all theoretical fragment ions-mass spectrometry (SWATH-MS) for the rapid quantification of thousands of proteins from biopsy-size tissue samples. As an improvement of the method, we have incorporated the PCT-MicroPestle into the PCT-SWATH workflow. The PCT-MicroPestle is a novel, miniaturized, disposable mechanical tissue homogenizer that fits directly into the microTube sample container. We optimized the pressure-cycling conditions for tissue lysis with the PCT-MicroPestle and benchmarked the performance of the system against the conventional PCT-MicroCap method using mouse liver, heart, brain, and human kidney tissues as test samples. The data indicate that the digestion of the PCT-MicroPestle-extracted proteins yielded 20-40% more MS-ready peptide mass from all tissues tested with a comparable reproducibility when compared to the conventional PCT method. Subsequent SWATH-MS analysis identified a higher number of biologically informative proteins from a given sample. In conclusion, we have developed a new device that can be seamlessly integrated into the PCT-SWATH workflow, leading to increased sample throughput and improved reproducibility at both the protein extraction and proteomic analysis levels when applied to the quantitative proteomic analysis of biopsy-level samples.

  13. Nonlinear mixed effects dose response modeling in high throughput drug screens: application to melanoma cell line analysis.

    PubMed

    Ding, Kuan-Fu; Petricoin, Emanuel F; Finlay, Darren; Yin, Hongwei; Hendricks, William P D; Sereduk, Chris; Kiefer, Jeffrey; Sekulic, Aleksandar; LoRusso, Patricia M; Vuori, Kristiina; Trent, Jeffrey M; Schork, Nicholas J

    2018-01-12

    Cancer cell lines are often used in high throughput drug screens (HTS) to explore the relationship between cell line characteristics and responsiveness to different therapies. Many current analysis methods infer relationships by focusing on one aspect of cell line drug-specific dose-response curves (DRCs), the concentration causing 50% inhibition of a phenotypic endpoint (IC 50 ). Such methods may overlook DRC features and do not simultaneously leverage information about drug response patterns across cell lines, potentially increasing false positive and negative rates in drug response associations. We consider the application of two methods, each rooted in nonlinear mixed effects (NLME) models, that test the relationship relationships between estimated cell line DRCs and factors that might mitigate response. Both methods leverage estimation and testing techniques that consider the simultaneous analysis of different cell lines to draw inferences about any one cell line. One of the methods is designed to provide an omnibus test of the differences between cell line DRCs that is not focused on any one aspect of the DRC (such as the IC 50 value). We simulated different settings and compared the different methods on the simulated data. We also compared the proposed methods against traditional IC 50 -based methods using 40 melanoma cell lines whose transcriptomes, proteomes, and, importantly, BRAF and related mutation profiles were available. Ultimately, we find that the NLME-based methods are more robust, powerful and, for the omnibus test, more flexible, than traditional methods. Their application to the melanoma cell lines reveals insights into factors that may be clinically useful.

  14. Transcriptome and Proteome Exploration to Provide a Resource for the Study of Agrocybe aegerita

    PubMed Central

    Jiang, Shuai; Chen, Yijie; Yin, Yalin; Pan, Yongfu; Yu, Guojun; Li, Yamu; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui

    2013-01-01

    Background Agrocybe aegerita, the black poplar mushroom, has been highly valued as a functional food for its medicinal and nutritional benefits. Several bioactive extracts from A. aegerita have been found to exhibit antitumor and antioxidant activities. However, limited genetic resources for A. aegerita have hindered exploration of this species. Methodology/Principal Findings To facilitate the research on A. aegerita, we established a deep survey of the transcriptome and proteome of this mushroom. We applied high-throughput sequencing technology (Illumina) to sequence A. aegerita transcriptomes from mycelium and fruiting body. The raw clean reads were de novo assembled into a total of 36,134 expressed sequences tags (ESTs) with an average length of 663 bp. These ESTs were annotated and classified according to Gene Ontology (GO), Clusters of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways. Gene expression profile analysis showed that 18,474 ESTs were differentially expressed, with 10,131 up-regulated in mycelium and 8,343 up-regulated in fruiting body. Putative genes involved in polysaccharide and steroid biosynthesis were identified from A. aegerita transcriptome, and these genes were differentially expressed at the two stages of A. aegerita. Based on one-dimensional gel electrophoresis (1-DGE) coupled with electrospray ionization liquid chromatography tandem MS (LC-ESI-MS/MS), we identified a total of 309 non-redundant proteins. And many metabolic enzymes involved in glycolysis were identified in the protein database. Conclusions/Significance This is the first study on transcriptome and proteome analyses of A. aegerita. The data in this study serve as a resource of A. aegerita transcripts and proteins, and offer clues to the applications of this mushroom in nutrition, pharmacy and industry. PMID:23418592

  15. Transcriptome and proteome exploration to provide a resource for the study of Agrocybe aegerita.

    PubMed

    Wang, Man; Gu, Bianli; Huang, Jie; Jiang, Shuai; Chen, Yijie; Yin, Yalin; Pan, Yongfu; Yu, Guojun; Li, Yamu; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui

    2013-01-01

    Agrocybe aegerita, the black poplar mushroom, has been highly valued as a functional food for its medicinal and nutritional benefits. Several bioactive extracts from A. aegerita have been found to exhibit antitumor and antioxidant activities. However, limited genetic resources for A. aegerita have hindered exploration of this species. To facilitate the research on A. aegerita, we established a deep survey of the transcriptome and proteome of this mushroom. We applied high-throughput sequencing technology (Illumina) to sequence A. aegerita transcriptomes from mycelium and fruiting body. The raw clean reads were de novo assembled into a total of 36,134 expressed sequences tags (ESTs) with an average length of 663 bp. These ESTs were annotated and classified according to Gene Ontology (GO), Clusters of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways. Gene expression profile analysis showed that 18,474 ESTs were differentially expressed, with 10,131 up-regulated in mycelium and 8,343 up-regulated in fruiting body. Putative genes involved in polysaccharide and steroid biosynthesis were identified from A. aegerita transcriptome, and these genes were differentially expressed at the two stages of A. aegerita. Based on one-dimensional gel electrophoresis (1-DGE) coupled with electrospray ionization liquid chromatography tandem MS (LC-ESI-MS/MS), we identified a total of 309 non-redundant proteins. And many metabolic enzymes involved in glycolysis were identified in the protein database. This is the first study on transcriptome and proteome analyses of A. aegerita. The data in this study serve as a resource of A. aegerita transcripts and proteins, and offer clues to the applications of this mushroom in nutrition, pharmacy and industry.

  16. Combined comparative and chemical proteomics on the mechanisms of levo-tetrahydropalmatine-induced antinociception in the formalin test.

    PubMed

    Wang, Chen; Zhou, Jiangrui; Wang, Shuowen; Ye, Mingliang; Jiang, Chunlei; Fan, Guorong; Zou, Hanfa

    2010-06-04

    This study investigated the mechanisms involved in the antinociceptive action induced by levo-tetrahydropalmatine (l-THP) in the formalin test by combined comparative and chemical proteomics. Rats were pretreated with l-THP by the oral route (40 mg/kg) 1 h before formalin injection. The antinociceptive effect of l-THP was shown in the first and second phases of the formalin test. To address the mechanisms by which l-THP inhibits formalin-induced nociception in rats, the combined comparative and chemical proteomics were applied. A novel high-throughput comparative proteomic approach based on 2D-nano-LC-MS/MS was applied to simultaneously evaluate the deregulated proteins involved in the response of l-THP treatment in formalin-induced pain rats. Thousands of proteins were identified, among which 17 proteins survived the stringent filter criteria and were further included for functional discussion. Two proteins (Neurabin-1 and Calcium-dependent secretion activator 1) were randomly selected, and their expression levels were further confirmed by Western Blots. The results matched well with those of proteomics. In the present study, we also described the development and application of l-THP immobilized beads to bind the targets. Following incubation with cellular lysates, the proteome interacting with the fixed l-THP was identified. The results of comparative and chemical proteomics were quite complementary. Although the precise roles of these identified moleculars in l-THP-induced antinociception need further study, the combined results indicated that proteins associated with signal transduction, vesicular trafficking and neurotransmitter release, energy metabolism, and ion transport play important roles in l-THP-induced antinociception in the formalin test.

  17. Unraveling sterol-dependent membrane phenotypes by analysis of protein abundance-ratio distributions in different membrane fractions under biochemical and endogenous sterol depletion.

    PubMed

    Zauber, Henrik; Szymanski, Witold; Schulze, Waltraud X

    2013-12-01

    During the last decade, research on plasma membrane focused increasingly on the analysis of so-called microdomains. It has been shown that function of many membrane-associated proteins involved in signaling and transport depends on their conditional segregation within sterol-enriched membrane domains. High throughput proteomic analysis of sterol-protein interactions are often based on analyzing detergent resistant membrane fraction enriched in sterols and associated proteins, which also contain proteins from these microdomain structures. Most studies so far focused exclusively on the characterization of detergent resistant membrane protein composition and abundances. This approach has received some criticism because of its unspecificity and many co-purifying proteins. In this study, by a label-free quantitation approach, we extended the characterization of membrane microdomains by particularly studying distributions of each protein between detergent resistant membrane and detergent-soluble fractions (DSF). This approach allows a more stringent definition of dynamic processes between different membrane phases and provides a means of identification of co-purifying proteins. We developed a random sampling algorithm, called Unicorn, allowing for robust statistical testing of alterations in the protein distribution ratios of the two different fractions. Unicorn was validated on proteomic data from methyl-β-cyclodextrin treated plasma membranes and the sterol biosynthesis mutant smt1. Both, chemical treatment and sterol-biosynthesis mutation affected similar protein classes in their membrane phase distribution and particularly proteins with signaling and transport functions.

  18. Unraveling Sterol-dependent Membrane Phenotypes by Analysis of Protein Abundance-ratio Distributions in Different Membrane Fractions Under Biochemical and Endogenous Sterol Depletion*

    PubMed Central

    Zauber, Henrik; Szymanski, Witold; Schulze, Waltraud X.

    2013-01-01

    During the last decade, research on plasma membrane focused increasingly on the analysis of so-called microdomains. It has been shown that function of many membrane-associated proteins involved in signaling and transport depends on their conditional segregation within sterol-enriched membrane domains. High throughput proteomic analysis of sterol-protein interactions are often based on analyzing detergent resistant membrane fraction enriched in sterols and associated proteins, which also contain proteins from these microdomain structures. Most studies so far focused exclusively on the characterization of detergent resistant membrane protein composition and abundances. This approach has received some criticism because of its unspecificity and many co-purifying proteins. In this study, by a label-free quantitation approach, we extended the characterization of membrane microdomains by particularly studying distributions of each protein between detergent resistant membrane and detergent-soluble fractions (DSF). This approach allows a more stringent definition of dynamic processes between different membrane phases and provides a means of identification of co-purifying proteins. We developed a random sampling algorithm, called Unicorn, allowing for robust statistical testing of alterations in the protein distribution ratios of the two different fractions. Unicorn was validated on proteomic data from methyl-β-cyclodextrin treated plasma membranes and the sterol biosynthesis mutant smt1. Both, chemical treatment and sterol-biosynthesis mutation affected similar protein classes in their membrane phase distribution and particularly proteins with signaling and transport functions. PMID:24030099

  19. Proteomic Screening of Antigenic Proteins from the Hard Tick, Haemaphysalis longicornis (Acari: Ixodidae)

    PubMed Central

    Kim, Young-Ha; slam, Mohammad Saiful; You, Myung-Jo

    2015-01-01

    Proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of proteome. For detection of antigens from Haemaphysalis longicornis, 1-dimensional electrophoresis (1-DE) quantitative immunoblotting technique combined with 2-dimensional electrophoresis (2-DE) immunoblotting was used for whole body proteins from unfed and partially fed female ticks. Reactivity bands and 2-DE immunoblotting were performed following 2-DE electrophoresis to identify protein spots. The proteome of the partially fed female had a larger number of lower molecular weight proteins than that of the unfed female tick. The total number of detected spots was 818 for unfed and 670 for partially fed female ticks. The 2-DE immunoblotting identified 10 antigenic spots from unfed females and 8 antigenic spots from partially fed females. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry (MALDI-TOF) of relevant spots identified calreticulin, putative secreted WC salivary protein, and a conserved hypothetical protein from the National Center for Biotechnology Information and Swiss Prot protein sequence databases. These findings indicate that most of the whole body components of these ticks are non-immunogenic. The data reported here will provide guidance in the identification of antigenic proteins to prevent infestation and diseases transmitted by H. longicornis. PMID:25748713

  20. Unlocking the proteomic information encoded in MALDI-TOF-MS data used for microbial identification and characterization.

    PubMed

    Fagerquist, Clifton K

    2017-01-01

    Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) is increasingly utilized as a rapid technique to identify microorganisms including pathogenic bacteria. However, little attention has been paid to the significant proteomic information encoded in the MS peaks that collectively constitute the MS 'fingerprint'. This review/perspective is intended to explore this topic in greater detail in the hopes that it may spur interest and further research in this area. Areas covered: This paper examines the recent literature on utilizing MALDI-TOF for bacterial identification. Critical works highlighting protein biomarker identification of bacteria, arguments for and against protein biomarker identification, proteomic approaches to biomarker identification, emergence of MALDI-TOF-TOF platforms and their use for top-down proteomic identification of bacterial proteins, protein denaturation and its effect on protein ion fragmentation, collision cross-sections and energy deposition during desorption/ionization are also explored. Expert commentary: MALDI-TOF and TOF-TOF mass spectrometry platforms will continue to provide chemical analyses that are rapid, cost-effective and high throughput. These instruments have proven their utility in the taxonomic identification of pathogenic bacteria at the genus and species level and are poised to more fully characterize these microorganisms to the benefit of clinical microbiology, food safety and other fields.

  1. The cerebrospinal fluid proteome in HIV infection: change associated with disease severity.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Angel, Thomas E.; Jacobs, Jon M.; Spudich, Serena S.

    2012-03-20

    Central nervous system (CNS) infection is a constant feature of systemic HIV infection with a clinical spectrum that ranges from chronic asymptomatic infection to severe cognitive and motor dysfunction. Analysis of cerebrospinal fluid (CSF) has played an important part in defining the character of this evolving infection and response to treatment. To further characterize CNS HIV infection and its effects, we applied advanced high-throughput proteomic methods to CSF to identify novel proteins and their changes with disease progression and treatment. After establishing an accurate mass and time (AMT) tag database containing 23,141 AMT tags for CSF peptides, we analyzed 91more » CSF samples by LC-MS from 12 HIV-uninfected and 14 HIV-infected subjects studied in the context of initiation of antiretroviral and correlated abundances of identified proteins (a) within and between subjects, (b) with all other proteins across the entire sample set, and (c) with 'external' CSF biomarkers of infection (HIV RNA), immune activation (neopterin) and neural injury (neurofilament light chain protein, NFL). We identified a mean of 2,333 +/- 328 (SD) peptides covering 307 +/-16 proteins in the 91 CSF sample set. Protein abundances differed both between and within subjects sampled at different time points and readily separated those with and without HIV infection. Proteins also showed inter-correlations across the sample set that were associated with biologically relevant dynamic processes. One-hundred and fifty proteins showed correlations with the external biomarkers. For example, using a threshold of cross correlation coefficient (Pearson's) {le}0.3 and {ge}0.3 for potentially meaningful relationships, a total of 99 proteins correlated with CSF neopterin (43 negative and 56 positive correlations) and related principally to neuronal plasticity and survival and to innate immunity. Pathway analysis defined several networks connecting the identified proteins, including one with amyloid precursor protein as a central node. Advanced CSF proteomic analysis enabled the identification of an array of novel protein changes across the spectrum of CNS HIV infection and disease. This initial analysis clearly demonstrated the value of contemporary state-of-the-art proteomic CSF analysis as a discovery tool in HIV infection with likely similar application to other neurological inflammatory and degenerative diseases.« less

  2. The cerebrospinal fluid proteome in HIV infection: change associated with disease severity

    PubMed Central

    2012-01-01

    Background Central nervous system (CNS) infection is a nearly universal feature of untreated systemic HIV infection with a clinical spectrum that ranges from chronic asymptomatic infection to severe cognitive and motor dysfunction. Analysis of cerebrospinal fluid (CSF) has played an important part in defining the character of this evolving infection and response to treatment. To further characterize CNS HIV infection and its effects, we applied advanced high-throughput proteomic methods to CSF to identify novel proteins and their changes with disease progression and treatment. Results After establishing an accurate mass and time (AMT) tag database containing 23,141 AMT tags for CSF peptides, we analyzed 91 CSF samples by LC-MS from 12 HIV-uninfected and 14 HIV-infected subjects studied in the context of initiation of antiretroviral therapy and correlated abundances of identified proteins a) within and between subjects, b) with all other proteins across the entire sample set, and c) with "external" CSF biomarkers of infection (HIV RNA), immune activation (neopterin) and neural injury (neurofilament light chain protein, NFL). We identified a mean of 2,333 +/- 328 (SD) peptides covering 307 +/-16 proteins in the 91 CSF sample set. Protein abundances differed both between and within subjects sampled at different time points and readily separated those with and without HIV infection. Proteins also showed inter-correlations across the sample set that were associated with biologically relevant dynamic processes. One-hundred and fifty proteins showed correlations with the external biomarkers. For example, using a threshold of cross correlation coefficient (Pearson's) ≤ -0.3 and ≥0.3 for potentially meaningful relationships, a total of 99 proteins correlated with CSF neopterin (43 negative and 56 positive correlations) and related principally to neuronal plasticity and survival and to innate immunity. Pathway analysis defined several networks connecting the identified proteins, including one with amyloid precursor protein as a central node. Conclusions Advanced CSF proteomic analysis enabled the identification of an array of novel protein changes across the spectrum of CNS HIV infection and disease. This initial analysis clearly demonstrated the value of contemporary state-of-the-art proteomic CSF analysis as a discovery tool in HIV infection with likely similar application to other neurological inflammatory and degenerative diseases. PMID:22433316

  3. Analytical challenges translating mass spectrometry-based phosphoproteomics from discovery to clinical applications

    PubMed Central

    Iliuk, Anton B.; Arrington, Justine V.; Tao, Weiguo Andy

    2014-01-01

    Phosphoproteomics is the systematic study of one of the most common protein modifications in high throughput with the aim of providing detailed information of the control, response, and communication of biological systems in health and disease. Advances in analytical technologies and strategies, in particular the contributions of high-resolution mass spectrometers, efficient enrichments of phosphopeptides, and fast data acquisition and annotation, have catalyzed dramatic expansion of signaling landscapes in multiple systems during the past decade. While phosphoproteomics is an essential inquiry to map high-resolution signaling networks and to find relevant events among the apparently ubiquitous and widespread modifications of proteome, it presents tremendous challenges in separation sciences to translate it from discovery to clinical practice. In this mini-review, we summarize the analytical tools currently utilized for phosphoproteomic analysis (with focus on MS), progresses made on deciphering clinically relevant kinase-substrate networks, MS uses for biomarker discovery and validation, and the potential of phosphoproteomics for disease diagnostics and personalized medicine. PMID:24890697

  4. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant

    DOE PAGES

    Brooks, Brandon; Mueller, R. S.; Young, Jacque C.; ...

    2015-07-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13 21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains ofmore » Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential.« less

  5. LipidHome: a database of theoretical lipids optimized for high throughput mass spectrometry lipidomics.

    PubMed

    Foster, Joseph M; Moreno, Pablo; Fabregat, Antonio; Hermjakob, Henning; Steinbeck, Christoph; Apweiler, Rolf; Wakelam, Michael J O; Vizcaíno, Juan Antonio

    2013-01-01

    Protein sequence databases are the pillar upon which modern proteomics is supported, representing a stable reference space of predicted and validated proteins. One example of such resources is UniProt, enriched with both expertly curated and automatic annotations. Taken largely for granted, similar mature resources such as UniProt are not available yet in some other "omics" fields, lipidomics being one of them. While having a seasoned community of wet lab scientists, lipidomics lies significantly behind proteomics in the adoption of data standards and other core bioinformatics concepts. This work aims to reduce the gap by developing an equivalent resource to UniProt called 'LipidHome', providing theoretically generated lipid molecules and useful metadata. Using the 'FASTLipid' Java library, a database was populated with theoretical lipids, generated from a set of community agreed upon chemical bounds. In parallel, a web application was developed to present the information and provide computational access via a web service. Designed specifically to accommodate high throughput mass spectrometry based approaches, lipids are organised into a hierarchy that reflects the variety in the structural resolution of lipid identifications. Additionally, cross-references to other lipid related resources and papers that cite specific lipids were used to annotate lipid records. The web application encompasses a browser for viewing lipid records and a 'tools' section where an MS1 search engine is currently implemented. LipidHome can be accessed at http://www.ebi.ac.uk/apweiler-srv/lipidhome.

  6. Identification of widespread adenosine nucleotide binding in Mycobacterium tuberculosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ansong, Charles; Ortega, Corrie; Payne, Samuel H.

    The annotation of protein function is almost completely performed by in silico approaches. However, computational prediction of protein function is frequently incomplete and error prone. In Mycobacterium tuberculosis (Mtb), ~25% of all genes have no predicted function and are annotated as hypothetical proteins. This lack of functional information severely limits our understanding of Mtb pathogenicity. Current tools for experimental functional annotation are limited and often do not scale to entire protein families. Here, we report a generally applicable chemical biology platform to functionally annotate bacterial proteins by combining activity-based protein profiling (ABPP) and quantitative LC-MS-based proteomics. As an example ofmore » this approach for high-throughput protein functional validation and discovery, we experimentally annotate the families of ATP-binding proteins in Mtb. Our data experimentally validate prior in silico predictions of >250 ATPases and adenosine nucleotide-binding proteins, and reveal 73 hypothetical proteins as novel ATP-binding proteins. We identify adenosine cofactor interactions with many hypothetical proteins containing a diversity of unrelated sequences, providing a new and expanded view of adenosine nucleotide binding in Mtb. Furthermore, many of these hypothetical proteins are both unique to Mycobacteria and essential for infection, suggesting specialized functions in mycobacterial physiology and pathogenicity. Thus, we provide a generally applicable approach for high throughput protein function discovery and validation, and highlight several ways in which application of activity-based proteomics data can improve the quality of functional annotations to facilitate novel biological insights.« less

  7. A systems approach to bone pathophysiology.

    PubMed

    Weiss, Aaron J; Lipshtat, Azi; Mechanick, Jeffrey I

    2010-11-01

    With evolving interest in multiscalar biological systems one could assume that reductionist approaches may not fully describe biological complexity. Instead, tools such as mathematical modeling, network analysis, and other multiplexed clinical- and research-oriented tests enable rapid analyses of high-throughput data parsed at the genomic, proteomic, metabolomic, and physiomic levels. A physiomic-level approach allows for recursive horizontal and vertical integration of subsystem coupling across and within spatiotemporal scales. Additionally, this methodology recognizes previously ignored subsystems and the strong, nonintuitively obvious and indirect connections among physiological events that potentially account for the uncertainties in medicine. In this review, we flip the reductionist research paradigm and review the concept of systems biology and its applications to bone pathophysiology. Specifically, a bone-centric physiome model is presented that incorporates systemic-level processes with their respective therapeutic implications. © 2010 New York Academy of Sciences.

  8. Deciphering the functions of O-GlcNAc glycosylation in the brain: The role of site-specific quantitative O-GlcNAcomics.

    PubMed

    Thompson, John W; Sorum, Alexander W; Hsieh-Wilson, Linda C

    2018-06-23

    The dynamic posttranslational modification O-linked β-N-acetylglucosamine glycosylation (O-GlcNAcylation) is present on thousands of intracellular proteins in the brain. Like phosphorylation, O-GlcNAcylation is inducible and plays important functional roles in both physiology and disease. Recent advances in mass spectrometry (MS) and bioconjugation methods are now enabling the mapping of O-GlcNAcylation events to individual sites in proteins. However, our understanding of which glycosylation events are necessary for regulating protein function and controlling specific processes, phenotypes, or diseases remains in its infancy. Given the sheer number of O-GlcNAc sites, methods are greatly needed to identify promising sites and prioritize them for time- and resource-intensive functional studies. Revealing sites that are dynamically altered by different stimuli or disease states will likely to go a long way in this regard. Here, we describe advanced methods for identifying O-GlcNAc sites on individual proteins and across the proteome, and for determining their stoichiometry in vivo. We also highlight emerging technologies for quantitative, site-specific MS-based O-GlcNAc proteomics (O-GlcNAcomics), which allow proteome-wide tracking of O-GlcNAcylation dynamics at individual sites. These cutting-edge technologies are beginning to bridge the gap between the high-throughput cataloging of O-GlcNAcylated proteins and the relatively low-throughput study of individual proteins. By uncovering the O-GlcNAcylation events that change in specific physiological and disease contexts, these new approaches are providing key insights into the regulatory functions of O-GlcNAc in the brain, including their roles in neuroprotection, neuronal signaling, learning and memory, and neurodegenerative diseases.

  9. Derivative component analysis for mass spectral serum proteomic profiles.

    PubMed

    Han, Henry

    2014-01-01

    As a promising way to transform medicine, mass spectrometry based proteomics technologies have seen a great progress in identifying disease biomarkers for clinical diagnosis and prognosis. However, there is a lack of effective feature selection methods that are able to capture essential data behaviors to achieve clinical level disease diagnosis. Moreover, it faces a challenge from data reproducibility, which means that no two independent studies have been found to produce same proteomic patterns. Such reproducibility issue causes the identified biomarker patterns to lose repeatability and prevents it from real clinical usage. In this work, we propose a novel machine-learning algorithm: derivative component analysis (DCA) for high-dimensional mass spectral proteomic profiles. As an implicit feature selection algorithm, derivative component analysis examines input proteomics data in a multi-resolution approach by seeking its derivatives to capture latent data characteristics and conduct de-noising. We further demonstrate DCA's advantages in disease diagnosis by viewing input proteomics data as a profile biomarker via integrating it with support vector machines to tackle the reproducibility issue, besides comparing it with state-of-the-art peers. Our results show that high-dimensional proteomics data are actually linearly separable under proposed derivative component analysis (DCA). As a novel multi-resolution feature selection algorithm, DCA not only overcomes the weakness of the traditional methods in subtle data behavior discovery, but also suggests an effective resolution to overcoming proteomics data's reproducibility problem and provides new techniques and insights in translational bioinformatics and machine learning. The DCA-based profile biomarker diagnosis makes clinical level diagnostic performances reproducible across different proteomic data, which is more robust and systematic than the existing biomarker discovery based diagnosis. Our findings demonstrate the feasibility and power of the proposed DCA-based profile biomarker diagnosis in achieving high sensitivity and conquering the data reproducibility issue in serum proteomics. Furthermore, our proposed derivative component analysis suggests the subtle data characteristics gleaning and de-noising are essential in separating true signals from red herrings for high-dimensional proteomic profiles, which can be more important than the conventional feature selection or dimension reduction. In particular, our profile biomarker diagnosis can be generalized to other omics data for derivative component analysis (DCA)'s nature of generic data analysis.

  10. Real-time Full-spectral Imaging and Affinity Measurements from 50 Microfluidic Channels using Nanohole Surface Plasmon Resonance†

    PubMed Central

    Lee, Si Hoon; Lindquist, Nathan C.; Wittenberg, Nathan J.; Jordan, Luke R.; Oh, Sang-Hyun

    2012-01-01

    With recent advances in high-throughput proteomics and systems biology, there is a growing demand for new instruments that can precisely quantify a wide range of receptor-ligand binding kinetics in a high-throughput fashion. Here we demonstrate a surface plasmon resonance (SPR) imaging spectroscopy instrument capable of extracting binding kinetics and affinities from 50 parallel microfluidic channels simultaneously. The instrument utilizes large-area (~cm2) metallic nanohole arrays as SPR sensing substrates and combines a broadband light source, a high-resolution imaging spectrometer and a low-noise CCD camera to extract spectral information from every channel in real time with a refractive index resolution of 7.7 × 10−6. To demonstrate the utility of our instrument for quantifying a wide range of biomolecular interactions, each parallel microfluidic channel is coated with a biomimetic supported lipid membrane containing ganglioside (GM1) receptors. The binding kinetics of cholera toxin b (CTX-b) to GM1 are then measured in a single experiment from 50 channels. By combining the highly parallel microfluidic device with large-area periodic nanohole array chips, our SPR imaging spectrometer system enables high-throughput, label-free, real-time SPR biosensing, and its full-spectral imaging capability combined with nanohole arrays could enable integration of SPR imaging with concurrent surface-enhanced Raman spectroscopy. PMID:22895607

  11. A draft map of the human ovarian proteome for tissue engineering and clinical applications.

    PubMed

    Ouni, Emna; Vertommen, Didier; Chiti, Maria Costanza; Dolmans, Marie-Madeleine; Amorim, Christiani Andrade

    2018-02-23

    Fertility preservation research in women today is increasingly taking advantage of bioengineering techniques to develop new biomimetic materials and solutions to safeguard ovarian cell function and microenvironment in vitro and in vivo. However, available data on the human ovary are limited and fundamental differences between animal models and humans are hampering researchers in their quest for more extensive knowledge of human ovarian physiology and key reproductive proteins that need to be preserved. We therefore turned to multi-dimensional label-free mass spectrometry to analyze human ovarian cortex, as it is a high-throughput and conclusive technique providing information on the proteomic composition of complex tissues like the ovary. In-depth proteomic profiling through two-dimensional liquid chromatography-mass spectrometry, western blot, histological and immunohistochemical analyses, and data mining helped us to confidently identify 1,508 proteins. Moreover, our method allowed us to chart the most complete representation so far of the ovarian matrisome, defined as the ensemble of extracellular matrix proteins and associated factors, including more than 80 proteins. In conclusion, this study will provide a better understanding of ovarian proteomics, with a detailed characterization of the ovarian follicle microenvironment, in order to enable bioengineers to create biomimetic scaffolds for transplantation and three-dimensional in vitro culture. By publishing our proteomic data, we also hope to contribute to accelerating biomedical research into ovarian health and disease in general. Published under license by The American Society for Biochemistry and Molecular Biology, Inc.

  12. A flexible statistical model for alignment of label-free proteomics data – incorporating ion mobility and product ion information

    PubMed Central

    2013-01-01

    Background The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing - the matching of peptide measurements across samples. Results We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Conclusions Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods. PMID:24341404

  13. A flexible statistical model for alignment of label-free proteomics data--incorporating ion mobility and product ion information.

    PubMed

    Benjamin, Ashlee M; Thompson, J Will; Soderblom, Erik J; Geromanos, Scott J; Henao, Ricardo; Kraus, Virginia B; Moseley, M Arthur; Lucas, Joseph E

    2013-12-16

    The goal of many proteomics experiments is to determine the abundance of proteins in biological samples, and the variation thereof in various physiological conditions. High-throughput quantitative proteomics, specifically label-free LC-MS/MS, allows rapid measurement of thousands of proteins, enabling large-scale studies of various biological systems. Prior to analyzing these information-rich datasets, raw data must undergo several computational processing steps. We present a method to address one of the essential steps in proteomics data processing--the matching of peptide measurements across samples. We describe a novel method for label-free proteomics data alignment with the ability to incorporate previously unused aspects of the data, particularly ion mobility drift times and product ion information. We compare the results of our alignment method to PEPPeR and OpenMS, and compare alignment accuracy achieved by different versions of our method utilizing various data characteristics. Our method results in increased match recall rates and similar or improved mismatch rates compared to PEPPeR and OpenMS feature-based alignment. We also show that the inclusion of drift time and product ion information results in higher recall rates and more confident matches, without increases in error rates. Based on the results presented here, we argue that the incorporation of ion mobility drift time and product ion information are worthy pursuits. Alignment methods should be flexible enough to utilize all available data, particularly with recent advancements in experimental separation methods.

  14. Automated Interpretation of Subcellular Patterns in Fluorescence Microscope Images for Location Proteomics

    PubMed Central

    Chen, Xiang; Velliste, Meel; Murphy, Robert F.

    2010-01-01

    Proteomics, the large scale identification and characterization of many or all proteins expressed in a given cell type, has become a major area of biological research. In addition to information on protein sequence, structure and expression levels, knowledge of a protein’s subcellular location is essential to a complete understanding of its functions. Currently subcellular location patterns are routinely determined by visual inspection of fluorescence microscope images. We review here research aimed at creating systems for automated, systematic determination of location. These employ numerical feature extraction from images, feature reduction to identify the most useful features, and various supervised learning (classification) and unsupervised learning (clustering) methods. These methods have been shown to perform significantly better than human interpretation of the same images. When coupled with technologies for tagging large numbers of proteins and high-throughput microscope systems, the computational methods reviewed here enable the new subfield of location proteomics. This subfield will make critical contributions in two related areas. First, it will provide structured, high-resolution information on location to enable Systems Biology efforts to simulate cell behavior from the gene level on up. Second, it will provide tools for Cytomics projects aimed at characterizing the behaviors of all cell types before, during and after the onset of various diseases. PMID:16752421

  15. Transcriptomic and Proteomic Responses of Sweetpotato Whitefly, Bemisia tabaci, to Thiamethoxam

    PubMed Central

    Yang, Nina; Xie, Wen; Yang, Xin; Wang, Shaoli; Wu, Qingjun; Li, Rumei; Pan, Huipeng; Liu, Baiming; Shi, Xiaobin; Fang, Yong; Xu, Baoyun; Zhou, Xuguo; Zhang, Youjun

    2013-01-01

    Background The sweetpotato whitefly, Bemisia tabaci (Hemiptera: Aleyrodidae), is one of the most widely distributed agricultural pests. Although it has developed resistance to many registered insecticides including the neonicotinoid insecticide thiamethoxam, the mechanisms that regulate the resistance are poorly understood. To understand the molecular basis of thiamethoxam resistance, “omics” analyses were carried out to examine differences between resistant and susceptible B. tabaci at both transcriptional and translational levels. Results A total of 1,338 mRNAs and 52 proteins were differentially expressed between resistant and susceptible B. tabaci. Among them, 11 transcripts had concurrent transcription and translation profiles. KEGG analysis mapped 318 and 35 differentially expressed genes and proteins, respectively, to 160 and 59 pathways (p<0.05). Thiamethoxam treatment activated metabolic pathways (e.g., drug metabolism), in which 118 transcripts were putatively linked to insecticide resistance, including up-regulated glutathione-S-transferase, UDP glucuronosyltransferase, glucosyl/glucuronosyl transferase, and cytochrome P450. Gene Ontology analysis placed these genes and proteins into protein complex, metabolic process, cellular process, signaling, and response to stimulus categories. Quantitative real-time PCR analysis validated “omics” response, and suggested a highly overexpressed P450, CYP6CX1, as a candidate molecular basis for the mechanistic study of thiamethoxam resistance in whiteflies. Finally, enzymatic activity assays showed elevated detoxification activities in the resistant B. tabaci. Conclusions This study demonstrates the applicability of high-throughput omics tools for identifying molecular candidates related to thiamethoxam resistance in an agricultural important insect pest. In addition, transcriptomic and proteomic analyses provide a solid foundation for future functional investigations into the complex molecular mechanisms governing the neonicotinoid resistance in whiteflies. PMID:23671574

  16. Combining high-throughput MALDI-TOF mass spectrometry and isoelectric focusing gel electrophoresis for virtual 2D gel-based proteomics.

    PubMed

    Lohnes, Karen; Quebbemann, Neil R; Liu, Kate; Kobzeff, Fred; Loo, Joseph A; Ogorzalek Loo, Rachel R

    2016-07-15

    The virtual two-dimensional gel electrophoresis/mass spectrometry (virtual 2D gel/MS) technology combines the premier, high-resolution capabilities of 2D gel electrophoresis with the sensitivity and high mass accuracy of mass spectrometry (MS). Intact proteins separated by isoelectric focusing (IEF) gel electrophoresis are imaged from immobilized pH gradient (IPG) polyacrylamide gels (the first dimension of classic 2D-PAGE) by matrix-assisted laser desorption/ionization (MALDI) MS. Obtaining accurate intact masses from sub-picomole-level proteins embedded in 2D-PAGE gels or in IPG strips is desirable to elucidate how the protein of one spot identified as protein 'A' on a 2D gel differs from the protein of another spot identified as the same protein, whenever tryptic peptide maps fail to resolve the issue. This task, however, has been extremely challenging. Virtual 2D gel/MS provides access to these intact masses. Modifications to our matrix deposition procedure improve the reliability with which IPG gels can be prepared; the new procedure is described. Development of this MALDI MS imaging (MSI) method for high-throughput MS with integrated 'top-down' MS to elucidate protein isoforms from complex biological samples is described and it is demonstrated that a 4-cm IPG gel segment can now be imaged in approximately 5min. Gel-wide chemical and enzymatic methods with further interrogation by MALDI MS/MS provide identifications, sequence-related information, and post-translational/transcriptional modification information. The MSI-based virtual 2D gel/MS platform may potentially link the benefits of 'top-down' and 'bottom-up' proteomics. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. BIG: a large-scale data integration tool for renal physiology

    PubMed Central

    Zhao, Yue; Yang, Chin-Rang; Raghuram, Viswanathan; Parulekar, Jaya

    2016-01-01

    Due to recent advances in high-throughput techniques, we and others have generated multiple proteomic and transcriptomic databases to describe and quantify gene expression, protein abundance, or cellular signaling on the scale of the whole genome/proteome in kidney cells. The existence of so much data from diverse sources raises the following question: “How can researchers find information efficiently for a given gene product over all of these data sets without searching each data set individually?” This is the type of problem that has motivated the “Big-Data” revolution in Data Science, which has driven progress in fields such as marketing. Here we present an online Big-Data tool called BIG (Biological Information Gatherer) that allows users to submit a single online query to obtain all relevant information from all indexed databases. BIG is accessible at http://big.nhlbi.nih.gov/. PMID:27279488

  18. Systematic cloning of an ORFeome using the Gateway system.

    PubMed

    Matsuyama, Akihisa; Yoshida, Minoru

    2009-01-01

    With the completion of the genome projects, there are increasing demands on the experimental systems that enable to exploit the entire set of protein-coding open reading frames (ORFs), viz. ORFeome, en masse. Systematic proteomic studies based on cloned ORFeomes are called "reverse proteomics," and have been launched in many organisms in recent years. Cloning of an ORFeome is such an attractive way for comprehensive understanding of biological phenomena, but is a challenging and daunting task. However, recent advances in techniques for DNA cloning using site-specific recombination and for high-throughput experimental techniques have made it feasible to clone an ORFeome with the minimum of exertion. The Gateway system is one of such the approaches, employing the recombination reaction of the bacteriophage lambda. Combining traditional DNA manipulation methods with modern technique of the recombination-based cloning system, it is possible to clone an ORFeome of an organism on an individual level.

  19. Lignases and aldo-keto reductases for conversion of lignin-containing materials to fermentable products

    DOEpatents

    Scharf, Michael; Sethi, Amit

    2016-09-13

    Termites have specialized digestive systems that overcome the lignin barrier in wood to release fermentable simple sugars. Using the termite Reticulitermes flavipes and its gut symbionts, high-throughput titanium pyrosequencing and proteomics approaches experimentally compared the effects of lignin-containing diets on host-symbiont digestome composition. Proteomic investigations and functional digestive studies with recombinant lignocellulases conducted in parallel provided strong evidence of congruence at the transcription and translational levels and provide enzymatic strategies for overcoming recalcitrant lignin barriers in biofuel feedstocks. Briefly described, therefore, the disclosure provides a system for generating a fermentable product from a lignified plant material, the system comprising a cooperating series of at least two catalytically active polypeptides, where said catalytically active polypeptides are selected from the group consisting of: cellulase Cell-1, .beta.-glu cellulase, an aldo-keto-reductase, a catalase, a laccase, and an endo-xylanase.

  20. Development of Proteomics-Based Fungicides: New Strategies for Environmentally Friendly Control of Fungal Plant Diseases

    PubMed Central

    Acero, Francisco Javier Fernández; Carbú, María; El-Akhal, Mohamed Rabie; Garrido, Carlos; González-Rodríguez, Victoria E.; Cantoral, Jesús M.

    2011-01-01

    Proteomics has become one of the most relevant high-throughput technologies. Several approaches have been used for studying, for example, tumor development, biomarker discovery, or microbiology. In this “post-genomic” era, the relevance of these studies has been highlighted as the phenotypes determined by the proteins and not by the genotypes encoding them that is responsible for the final phenotypes. One of the most interesting outcomes of these technologies is the design of new drugs, due to the discovery of new disease factors that may be candidates for new therapeutic targets. To our knowledge, no commercial fungicides have been developed from targeted molecular research, this review will shed some light on future prospects. We will summarize previous research efforts and discuss future innovations, focused on the fight against one of the main agents causing a devastating crops disease, fungal phytopathogens. PMID:21340014

  1. MCAM: multiple clustering analysis methodology for deriving hypotheses and insights from high-throughput proteomic datasets.

    PubMed

    Naegle, Kristen M; Welsch, Roy E; Yaffe, Michael B; White, Forest M; Lauffenburger, Douglas A

    2011-07-01

    Advances in proteomic technologies continue to substantially accelerate capability for generating experimental data on protein levels, states, and activities in biological samples. For example, studies on receptor tyrosine kinase signaling networks can now capture the phosphorylation state of hundreds to thousands of proteins across multiple conditions. However, little is known about the function of many of these protein modifications, or the enzymes responsible for modifying them. To address this challenge, we have developed an approach that enhances the power of clustering techniques to infer functional and regulatory meaning of protein states in cell signaling networks. We have created a new computational framework for applying clustering to biological data in order to overcome the typical dependence on specific a priori assumptions and expert knowledge concerning the technical aspects of clustering. Multiple clustering analysis methodology ('MCAM') employs an array of diverse data transformations, distance metrics, set sizes, and clustering algorithms, in a combinatorial fashion, to create a suite of clustering sets. These sets are then evaluated based on their ability to produce biological insights through statistical enrichment of metadata relating to knowledge concerning protein functions, kinase substrates, and sequence motifs. We applied MCAM to a set of dynamic phosphorylation measurements of the ERRB network to explore the relationships between algorithmic parameters and the biological meaning that could be inferred and report on interesting biological predictions. Further, we applied MCAM to multiple phosphoproteomic datasets for the ERBB network, which allowed us to compare independent and incomplete overlapping measurements of phosphorylation sites in the network. We report specific and global differences of the ERBB network stimulated with different ligands and with changes in HER2 expression. Overall, we offer MCAM as a broadly-applicable approach for analysis of proteomic data which may help increase the current understanding of molecular networks in a variety of biological problems. © 2011 Naegle et al.

  2. Multiplexed Post-Experimental Monoisotopic Mass Refinement ( m PE-MMR) to Increase Sensitivity and Accuracy in Peptide Identifications from Tandem Mass Spectra of Cofragmentation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Madar, Inamul Hasan; Ko, Seung-Ik; Kim, Hokeun

    Mass spectrometry (MS)-based proteomics, which uses high-resolution hybrid mass spectrometers such as the quadrupole-orbitrap mass spectrometer, can yield tens of thousands of tandem mass (MS/MS) spectra of high resolution during a routine bottom-up experiment. Despite being a fundamental and key step in MS-based proteomics, the accurate determination and assignment of precursor monoisotopic masses to the MS/MS spectra remains difficult. The difficulties stem from imperfect isotopic envelopes of precursor ions, inaccurate charge states for precursor ions, and cofragmentation. We describe a composite method of utilizing MS data to assign accurate monoisotopic masses to MS/MS spectra, including those subject to cofragmentation. Themore » method, “multiplexed post-experiment monoisotopic mass refinement” (mPE-MMR), consists of the following: multiplexing of precursor masses to assign multiple monoisotopic masses of cofragmented peptides to the corresponding multiplexed MS/MS spectra, multiplexing of charge states to assign correct charges to the precursor ions of MS/ MS spectra with no charge information, and mass correction for inaccurate monoisotopic peak picking. When combined with MS-GF+, a database search algorithm based on fragment mass difference, mPE-MMR effectively increases both sensitivity and accuracy in peptide identification from complex high-throughput proteomics data compared to conventional methods.« less

  3. Individual Biomarkers Using Molecular Personalized Medicine Approaches.

    PubMed

    Zenner, Hans P

    2017-01-01

    Molecular personalized medicine tries to generate individual predictive biomarkers to assist doctors in their decision making. These are thought to improve the efficacy and lower the toxicity of a treatment. The molecular basis of the desired high-precision prediction is modern "omex" technologies providing high-throughput bioanalytical methods. These include genomics and epigenomics, transcriptomics, proteomics, metabolomics, microbiomics, imaging, and functional analyses. In most cases, producing big data also requires a complex biomathematical analysis. Using molecular personalized medicine, the conventional physician's check of biomarker results may no longer be sufficient. By contrast, the physician may need to cooperate with the biomathematician to achieve the desired prediction on the basis of the analysis of individual big data typically produced by omex technologies. Identification of individual biomarkers using molecular personalized medicine approaches is thought to allow a decision-making for the precise use of a targeted therapy, selecting the successful therapeutic tool from a panel of preexisting drugs or medical products. This should avoid the treatment of nonresponders and responders that produces intolerable unwanted effects. © 2017 S. Karger AG, Basel.

  4. Size-Sorting Combined with Improved Nanocapillary-LC-MS for Identification of Intact Proteins up to 80 kDa

    PubMed Central

    Vellaichamy, Adaikkalam; Tran, John C.; Catherman, Adam D.; Lee, Ji Eun; Kellie, John F.; Sweet, Steve M.M.; Zamdborg, Leonid; Thomas, Paul M.; Ahlf, Dorothy R.; Durbin, Kenneth R.; Valaskovic, Gary A.; Kelleher, Neil L.

    2010-01-01

    Despite the availability of ultra-high resolution mass spectrometers, methods for separation and detection of intact proteins for proteome-scale analyses are still in a developmental phase. Here we report robust protocols for on-line LC-MS to drive high-throughput top-down proteomics in a fashion similar to bottom-up. Comparative work on protein standards showed that a polymeric stationary phase led to superior sensitivity over a silica-based medium in reversed-phase nanocapillary-LC, with detection of proteins >50 kDa routinely accomplished in the linear ion trap of a hybrid Fourier-Transform mass spectrometer. Protein identification was enabled by nozzle-skimmer dissociation (NSD) and detection of fragment ions with <5 ppm mass accuracy for highly-specific database searching using custom software. This overall approach led to identification of proteins up to 80 kDa, with 10-60 proteins identified in single LC-MS runs of samples from yeast and human cell lines pre-fractionated by their molecular weight using a gel-based sieving system. PMID:20073486

  5. Protein 3-Nitrotyrosine in Complex Biological Samples: Quantification by High-Pressure Liquid Chromatography/Electrochemical Detection and Emergence of Proteomic Approaches for Unbiased Identification of Modification Sites

    PubMed Central

    Nuriel, Tal; Deeb, Ruba S.; Hajjar, David P.; Gross, Steven S.

    2008-01-01

    Nitration of tyrosine residues by nitric oxide (NO)-derived species results in the accumulation of 3-nitrotyrosine in proteins, a hallmark of nitrosative stress in cells and tissues. Tyrosine nitration is recognized as one of the multiple signaling modalities used by NO-derived species for the regulation of protein structure and function in health and disease. Various methods have been described for the quantification of protein 3-nitrotyrosine residues, and several strategies have been presented toward the goal of proteome-wide identification of protein tyrosine modification sites. This chapter details a useful protocol for the quantification of 3-nitrotyrosine in cells and tissues using high-pressure liquid chromatography with electrochemical detection. Additionally, this chapter describes a novel biotin-tagging strategy for specific enrichment of 3-nitrotyrosine-containing peptides. Application of this strategy, in conjunction with high-throughput MS/MS-based peptide sequencing, is anticipated to fuel efforts in developing comprehensive inventories of nitrosative stress-induced protein-tyrosine modification sites in cells and tissues. PMID:18554526

  6. Analysis of mass spectrometry data from the secretome of an explant model of articular cartilage exposed to pro-inflammatory and anti-inflammatory stimuli using machine learning

    PubMed Central

    2013-01-01

    Background Osteoarthritis (OA) is an inflammatory disease of synovial joints involving the loss and degeneration of articular cartilage. The gold standard for evaluating cartilage loss in OA is the measurement of joint space width on standard radiographs. However, in most cases the diagnosis is made well after the onset of the disease, when the symptoms are well established. Identification of early biomarkers of OA can facilitate earlier diagnosis, improve disease monitoring and predict responses to therapeutic interventions. Methods This study describes the bioinformatic analysis of data generated from high throughput proteomics for identification of potential biomarkers of OA. The mass spectrometry data was generated using a canine explant model of articular cartilage treated with the pro-inflammatory cytokine interleukin 1 β (IL-1β). The bioinformatics analysis involved the application of machine learning and network analysis to the proteomic mass spectrometry data. A rule based machine learning technique, BioHEL, was used to create a model that classified the samples into their relevant treatment groups by identifying those proteins that separated samples into their respective groups. The proteins identified were considered to be potential biomarkers. Protein networks were also generated; from these networks, proteins pivotal to the classification were identified. Results BioHEL correctly classified eighteen out of twenty-three samples, giving a classification accuracy of 78.3% for the dataset. The dataset included the four classes of control, IL-1β, carprofen, and IL-1β and carprofen together. This exceeded the other machine learners that were used for a comparison, on the same dataset, with the exception of another rule-based method, JRip, which performed equally well. The proteins that were most frequently used in rules generated by BioHEL were found to include a number of relevant proteins including matrix metalloproteinase 3, interleukin 8 and matrix gla protein. Conclusions Using this protocol, combining an in vitro model of OA with bioinformatics analysis, a number of relevant extracellular matrix proteins were identified, thereby supporting the application of these bioinformatics tools for analysis of proteomic data from in vitro models of cartilage degradation. PMID:24330474

  7. Novel potential serological prostate cancer biomarkers using CT100+ cancer antigen microarray platform in a multi-cultural South African cohort

    PubMed Central

    Adeola, Henry A.; Smith, Muneerah; Kaestner, Lisa; Blackburn, Jonathan M.; Zerbini, Luiz F.

    2016-01-01

    There is a growing need for high throughput diagnostic tools for early diagnosis and treatment monitoring of prostate cancer (PCa) in Africa. The role of cancer-testis antigens (CTAs) in PCa in men of African descent is poorly researched. Hence, we aimed to elucidate the role of 123 Tumour Associated Antigens (TAAs) using antigen microarray platform in blood samples (N = 67) from a South African PCa, Benign prostatic hyperplasia (BPH) and disease control (DC) cohort. Linear (fold-over-cutoff) and differential expression quantitation of autoantibody signal intensities were performed. Molecular signatures of candidate PCa antigen biomarkers were identified and analyzed for ethnic group variation. Potential cancer diagnostic and immunotherapeutic inferences were drawn. We identified a total of 41 potential diagnostic/therapeutic antigen biomarkers for PCa. By linear quantitation, four antigens, GAGE1, ROPN1, SPANXA1 and PRKCZ were found to have higher autoantibody titres in PCa serum as compared with BPH where MAGEB1 and PRKCZ were highly expressed. Also, p53 S15A and p53 S46A were found highly expressed in the disease control group. Statistical analysis by differential expression revealed twenty-four antigens as upregulated in PCa samples, while 11 were downregulated in comparison to BPH and DC (FDR = 0.01). FGFR2, COL6A1and CALM1 were verifiable biomarkers of PCa analysis using urinary shotgun proteomics. Functional pathway annotation of identified biomarkers revealed similar enrichment both at genomic and proteomic level and ethnic variations were observed. Cancer antigen arrays are emerging useful in potential diagnostic and immunotherapeutic antigen biomarker discovery. PMID:26885621

  8. Omics and multi-omics approaches to study the biosynthesis of secondary metabolites in microorganisms.

    PubMed

    Palazzotto, Emilia; Weber, Tilmann

    2018-04-12

    Natural products produced by microorganisms represent the main source of bioactive molecules. The development of high-throughput (omics) techniques have importantly contributed to the renaissance of new antibiotic discovery increasing our understanding of complex mechanisms controlling the expression of biosynthetic gene clusters (BGCs) encoding secondary metabolites. In this context this review highlights recent progress in the use and integration of 'omics' approaches with focuses on genomics, transcriptomics, proteomics metabolomics meta-omics and combined omics as powerful strategy to discover new antibiotics. Copyright © 2018 Elsevier Ltd. All rights reserved.

  9. GeneLab: NASA's Open Access, Collaborative Platform for Systems Biology and Space Medicine

    NASA Technical Reports Server (NTRS)

    Berrios, Daniel C.; Thompson, Terri G.; Fogle, Homer W.; Rask, Jon C.; Coughlan, Joseph C.

    2015-01-01

    NASA is investing in GeneLab1 (http:genelab.nasa.gov), a multi-year effort to maximize utilization of the limited resources to conduct biological and medical research in space, principally aboard the International Space Station (ISS). High-throughput genomic, transcriptomic, proteomic or other omics analyses from experiments conducted on the ISS will be stored in the GeneLab Data Systems (GLDS), an open-science information system that will also include a biocomputation platform with collaborative science capabilities, to enable the discovery and validation of molecular networks.

  10. Metaproteomics of Colonic Microbiota Unveils Discrete Protein Functions among Colitic Mice and Control Groups.

    PubMed

    Moon, Clara; Stupp, Gregory S; Su, Andrew I; Wolan, Dennis W

    2018-02-01

    Metaproteomics can greatly assist established high-throughput sequencing methodologies to provide systems biological insights into the alterations of microbial protein functionalities correlated with disease-associated dysbiosis of the intestinal microbiota. Here, the authors utilize the well-characterized murine T cell transfer model of colitis to find specific changes within the intestinal luminal proteome associated with inflammation. MS proteomic analysis of colonic samples permitted the identification of ≈10 000-12 000 unique peptides that corresponded to 5610 protein clusters identified across three groups, including the colitic Rag1 -/- T cell recipients, isogenic Rag1 -/- controls, and wild-type mice. The authors demonstrate that the colitic mice exhibited a significant increase in Proteobacteria and Verrucomicrobia and show that such alterations in the microbial communities contributed to the enrichment of specific proteins with transcription and translation gene ontology terms. In combination with 16S sequencing, the authors' metaproteomics-based microbiome studies provide a foundation for assessing alterations in intestinal luminal protein functionalities in a robust and well-characterized mouse model of colitis, and set the stage for future studies to further explore the functional mechanisms of altered protein functionalities associated with dysbiosis and inflammation. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. A scalable strategy for high-throughput GFP tagging of endogenous human proteins.

    PubMed

    Leonetti, Manuel D; Sekine, Sayaka; Kamiyama, Daichi; Weissman, Jonathan S; Huang, Bo

    2016-06-21

    A central challenge of the postgenomic era is to comprehensively characterize the cellular role of the ∼20,000 proteins encoded in the human genome. To systematically study protein function in a native cellular background, libraries of human cell lines expressing proteins tagged with a functional sequence at their endogenous loci would be very valuable. Here, using electroporation of Cas9 nuclease/single-guide RNA ribonucleoproteins and taking advantage of a split-GFP system, we describe a scalable method for the robust, scarless, and specific tagging of endogenous human genes with GFP. Our approach requires no molecular cloning and allows a large number of cell lines to be processed in parallel. We demonstrate the scalability of our method by targeting 48 human genes and show that the resulting GFP fluorescence correlates with protein expression levels. We next present how our protocols can be easily adapted for the tagging of a given target with GFP repeats, critically enabling the study of low-abundance proteins. Finally, we show that our GFP tagging approach allows the biochemical isolation of native protein complexes for proteomic studies. Taken together, our results pave the way for the large-scale generation of endogenously tagged human cell lines for the proteome-wide analysis of protein localization and interaction networks in a native cellular context.

  12. Quantitative Proteomic Analysis of Staphylococcus aureus Treated With Punicalagin, a Natural Antibiotic From Pomegranate That Disrupts Iron Homeostasis and Induces SOS.

    PubMed

    Cooper, Bret; Islam, Nazrul; Xu, Yunfeng; Beard, Hunter S; Garrett, Wesley M; Gu, Ganyu; Nou, Xiangwu

    2018-05-01

    Staphylococcus aureus, a bacterial, food-borne pathogen of humans, can contaminate raw fruits and vegetables. While physical and chemical methods are available to control S. aureus, scientists are searching for inhibitory phytochemicals from plants. One promising compound from pomegranate is punicalagin, a natural antibiotic. To get a broader understanding of the inhibitory effect of punicalagin on S. aureus growth, high-throughput mass spectrometry and quantitative isobaric labeling was used to investigate the proteome of S. aureus after exposure to a sublethal dose of punicalagin. Nearly half of the proteins encoded by the small genome were interrogated, and nearly half of those exhibited significant changes in accumulation. Punicalagin treatment altered the accumulation of proteins and enzymes needed for iron acquisition, and it altered amounts of enzymes for glycolysis, citric acid cycling, protein biosynthesis, and purine and pyrimidine biosynthesis. Punicalagin treatment also induced an SOS cellular response to damaged DNA. Transcriptional comparison of marker genes shows that the punicalagin-induced iron starvation and SOS responses resembles those produced by EDTA and ciprofloxacin. These results show that punicalagin adversely alters bacterial growth by disrupting iron homeostasis and that it induces SOS, possibly through DNA biosynthesis inhibition. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. Characterising the enzymatic profile of crude tentacle extracts from the South Atlantic jellyfish Olindias sambaquiensis (Cnidaria: Hydrozoa).

    PubMed

    Knittel, Paloma S; Long, Paul F; Brammall, Lucas; Marques, Antonio C; Almeida, Michelle T; Padilla, Gabriel; Moura-da-Silva, Ana M

    2016-09-01

    Jellyfish venoms are of medical and biotechnological importance, with toxins displaying antimicrobial, analgesic and anti-tumor activities. Although proteolytic enzymes have also been described, detailed characterisation of these proteins is scant in Olindias spp. High throughput mass spectrometry profiling of cnidarian venoms has become increasingly popular since the first description of the proteomic profile of putative toxins isolated from nematocysts of the hydrozoan jellyfish Olindias sambaquiensis describing the presence of orthologous enzymes as presented in venoms of advanced species as snakes. Rigorous bioinformatics analyses can aid functional annotation, but biochemical assays are prerequisite to unambiguously assign toxic function to a peptide or protein. Here we present results that experimentally confirm previously predicted proteomic analysis that crude venom extracts from tentacles of O. sambaquiensis are composed of polypeptides with metalloproteinase, serine proteinase and phospholipases A2 activities. Surprisingly, levels of serine proteinase and phospholipase A2 activities were comparable to those observed in venoms of Bothrops snakes which were used as positive controls in this study. Hence, these data offer new opportunities to explore serine proteinase and phospholipase A2 activities in the clinical sequelae following O. sambaquiensis envenomation, with future possible biopharmaceutical applications. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. RepExplore: addressing technical replicate variance in proteomics and metabolomics data analysis.

    PubMed

    Glaab, Enrico; Schneider, Reinhard

    2015-07-01

    High-throughput omics datasets often contain technical replicates included to account for technical sources of noise in the measurement process. Although summarizing these replicate measurements by using robust averages may help to reduce the influence of noise on downstream data analysis, the information on the variance across the replicate measurements is lost in the averaging process and therefore typically disregarded in subsequent statistical analyses.We introduce RepExplore, a web-service dedicated to exploit the information captured in the technical replicate variance to provide more reliable and informative differential expression and abundance statistics for omics datasets. The software builds on previously published statistical methods, which have been applied successfully to biomedical omics data but are difficult to use without prior experience in programming or scripting. RepExplore facilitates the analysis by providing a fully automated data processing and interactive ranking tables, whisker plot, heat map and principal component analysis visualizations to interpret omics data and derived statistics. Freely available at http://www.repexplore.tk enrico.glaab@uni.lu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  15. In Vitro Identification of Histatin 5 Salivary Complexes

    PubMed Central

    Moffa, Eduardo B.; Machado, Maria A. A. M.; Mussi, Maria C. M.; Xiao, Yizhi; Garrido, Saulo S.; Giampaolo, Eunice T.; Siqueira, Walter L.

    2015-01-01

    With recent progress in the analysis of the salivary proteome, the number of salivary proteins identified has increased dramatically. However, the physiological functions of many of the newly discovered proteins remain unclear. Closely related to the study of a protein’s function is the identification of its interaction partners. Although in saliva some proteins may act primarily as single monomeric units, a significant percentage of all salivary proteins, if not the majority, appear to act in complexes with partners to execute their diverse functions. Coimmunoprecipitation (Co-IP) and pull-down assays were used to identify the heterotypic complexes between histatin 5, a potent natural antifungal protein, and other salivary proteins in saliva. Classical protein–protein interaction methods in combination with high-throughput mass spectrometric techniques were carried out. Co-IP using protein G magnetic Sepharose TM beads suspension was able to capture salivary complexes formed between histatin 5 and its salivary protein partners. Pull-down assay was used to confirm histatin 5 protein partners. A total of 52 different proteins were identified to interact with histatin 5. The present study used proteomic approaches in conjunction with classical biochemical methods to investigate protein–protein interaction in human saliva. Our study demonstrated that when histatin 5 is complexed with salivary amylase, one of the 52 proteins identified as a histatin 5 partner, the antifungal activity of histatin 5 is reduced. We expected that our proteomic approach could serve as a basis for future studies on the mechanism and structural-characterization of those salivary protein interactions to understand their clinical significance. PMID:26544073

  16. Analysis of high accuracy, quantitative proteomics data in the MaxQB database.

    PubMed

    Schaab, Christoph; Geiger, Tamar; Stoehr, Gabriele; Cox, Juergen; Mann, Matthias

    2012-03-01

    MS-based proteomics generates rapidly increasing amounts of precise and quantitative information. Analysis of individual proteomic experiments has made great strides, but the crucial ability to compare and store information across different proteome measurements still presents many challenges. For example, it has been difficult to avoid contamination of databases with low quality peptide identifications, to control for the inflation in false positive identifications when combining data sets, and to integrate quantitative data. Although, for example, the contamination with low quality identifications has been addressed by joint analysis of deposited raw data in some public repositories, we reasoned that there should be a role for a database specifically designed for high resolution and quantitative data. Here we describe a novel database termed MaxQB that stores and displays collections of large proteomics projects and allows joint analysis and comparison. We demonstrate the analysis tools of MaxQB using proteome data of 11 different human cell lines and 28 mouse tissues. The database-wide false discovery rate is controlled by adjusting the project specific cutoff scores for the combined data sets. The 11 cell line proteomes together identify proteins expressed from more than half of all human genes. For each protein of interest, expression levels estimated by label-free quantification can be visualized across the cell lines. Similarly, the expression rank order and estimated amount of each protein within each proteome are plotted. We used MaxQB to calculate the signal reproducibility of the detected peptides for the same proteins across different proteomes. Spearman rank correlation between peptide intensity and detection probability of identified proteins was greater than 0.8 for 64% of the proteome, whereas a minority of proteins have negative correlation. This information can be used to pinpoint false protein identifications, independently of peptide database scores. The information contained in MaxQB, including high resolution fragment spectra, is accessible to the community via a user-friendly web interface at http://www.biochem.mpg.de/maxqb.

  17. AT_CHLORO, a comprehensive chloroplast proteome database with subplastidial localization and curated information on envelope proteins.

    PubMed

    Ferro, Myriam; Brugière, Sabine; Salvi, Daniel; Seigneurin-Berny, Daphné; Court, Magali; Moyet, Lucas; Ramus, Claire; Miras, Stéphane; Mellal, Mourad; Le Gall, Sophie; Kieffer-Jaquinod, Sylvie; Bruley, Christophe; Garin, Jérôme; Joyard, Jacques; Masselon, Christophe; Rolland, Norbert

    2010-06-01

    Recent advances in the proteomics field have allowed a series of high throughput experiments to be conducted on chloroplast samples, and the data are available in several public databases. However, the accurate localization of many chloroplast proteins often remains hypothetical. This is especially true for envelope proteins. We went a step further into the knowledge of the chloroplast proteome by focusing, in the same set of experiments, on the localization of proteins in the stroma, the thylakoids, and envelope membranes. LC-MS/MS-based analyses first allowed building the AT_CHLORO database (http://www.grenoble.prabi.fr/protehome/grenoble-plant-proteomics/), a comprehensive repertoire of the 1323 proteins, identified by 10,654 unique peptide sequences, present in highly purified chloroplasts and their subfractions prepared from Arabidopsis thaliana leaves. This database also provides extensive proteomics information (peptide sequences and molecular weight, chromatographic retention times, MS/MS spectra, and spectral count) for a unique chloroplast protein accurate mass and time tag database gathering identified peptides with their respective and precise analytical coordinates, molecular weight, and retention time. We assessed the partitioning of each protein in the three chloroplast compartments by using a semiquantitative proteomics approach (spectral count). These data together with an in-depth investigation of the literature were compiled to provide accurate subplastidial localization of previously known and newly identified proteins. A unique knowledge base containing extensive information on the proteins identified in envelope fractions was thus obtained, allowing new insights into this membrane system to be revealed. Altogether, the data we obtained provide unexpected information about plastidial or subplastidial localization of some proteins that were not suspected to be associated to this membrane system. The spectral counting-based strategy was further validated as the compartmentation of well known pathways (for instance, photosynthesis and amino acid, fatty acid, or glycerolipid biosynthesis) within chloroplasts could be dissected. It also allowed revisiting the compartmentation of the chloroplast metabolism and functions.

  18. High-throughput Crystallography for Structural Genomics

    PubMed Central

    Joachimiak, Andrzej

    2009-01-01

    Protein X-ray crystallography recently celebrated its 50th anniversary. The structures of myoglobin and hemoglobin determined by Kendrew and Perutz provided the first glimpses into the complex protein architecture and chemistry. Since then, the field of structural molecular biology has experienced extraordinary progress and now over 53,000 proteins structures have been deposited into the Protein Data Bank. In the past decade many advances in macromolecular crystallography have been driven by world-wide structural genomics efforts. This was made possible because of third-generation synchrotron sources, structure phasing approaches using anomalous signal and cryo-crystallography. Complementary progress in molecular biology, proteomics, hardware and software for crystallographic data collection, structure determination and refinement, computer science, databases, robotics and automation improved and accelerated many processes. These advancements provide the robust foundation for structural molecular biology and assure strong contribution to science in the future. In this report we focus mainly on reviewing structural genomics high-throughput X-ray crystallography technologies and their impact. PMID:19765976

  19. High-throughput microscopy must re-invent the microscope rather than speed up its functions

    PubMed Central

    Oheim, M

    2007-01-01

    Knowledge gained from the revolutions in genomics and proteomics has helped to identify many of the key molecules involved in cellular signalling. Researchers, both in academia and in the pharmaceutical industry, now screen, at a sub-cellular level, where and when these proteins interact. Fluorescence imaging and molecular labelling combine to provide a powerful tool for real-time functional biochemistry with molecular resolution. However, they traditionally have been work-intensive, required trained personnel, and suffered from low through-put due to sample preparation, loading and handling. The need for speeding up microscopy is apparent from the tremendous complexity of cellular signalling pathways, the inherent biological variability, as well as the possibility that the same molecule plays different roles in different sub-cellular compartments. Research institutes and companies have teamed up to develop imaging cytometers of ever-increasing complexity. However, to truly go high-speed, sub-cellular imaging must free itself from the rigid framework of current microscopes. PMID:17603553

  20. Automated solid-phase subcloning based on beads brought into proximity by magnetic force.

    PubMed

    Hudson, Elton P; Nikoshkov, Andrej; Uhlen, Mathias; Rockberg, Johan

    2012-01-01

    In the fields of proteomics, metabolic engineering and synthetic biology there is a need for high-throughput and reliable cloning methods to facilitate construction of expression vectors and genetic pathways. Here, we describe a new approach for solid-phase cloning in which both the vector and the gene are immobilized to separate paramagnetic beads and brought into proximity by magnetic force. Ligation events were directly evaluated using fluorescent-based microscopy and flow cytometry. The highest ligation efficiencies were obtained when gene- and vector-coated beads were brought into close contact by application of a magnet during the ligation step. An automated procedure was developed using a laboratory workstation to transfer genes into various expression vectors and more than 95% correct clones were obtained in a number of various applications. The method presented here is suitable for efficient subcloning in an automated manner to rapidly generate a large number of gene constructs in various vectors intended for high throughput applications.

  1. Automated Solid-Phase Subcloning Based on Beads Brought into Proximity by Magnetic Force

    PubMed Central

    Hudson, Elton P.; Nikoshkov, Andrej; Uhlen, Mathias; Rockberg, Johan

    2012-01-01

    In the fields of proteomics, metabolic engineering and synthetic biology there is a need for high-throughput and reliable cloning methods to facilitate construction of expression vectors and genetic pathways. Here, we describe a new approach for solid-phase cloning in which both the vector and the gene are immobilized to separate paramagnetic beads and brought into proximity by magnetic force. Ligation events were directly evaluated using fluorescent-based microscopy and flow cytometry. The highest ligation efficiencies were obtained when gene- and vector-coated beads were brought into close contact by application of a magnet during the ligation step. An automated procedure was developed using a laboratory workstation to transfer genes into various expression vectors and more than 95% correct clones were obtained in a number of various applications. The method presented here is suitable for efficient subcloning in an automated manner to rapidly generate a large number of gene constructs in various vectors intended for high throughput applications. PMID:22624028

  2. Network-Based Analysis of Nutraceuticals in Human Hepatocellular Carcinomas Reveals Mechanisms of Chemopreventive Action

    PubMed Central

    Michailidou, M; Melas, IN; Messinis, DE; Klamt, S; Alexopoulos, LG; Kolisis, FN; Loutrari, H

    2015-01-01

    Chronic inflammation is associated with the development of human hepatocellular carcinoma (HCC), an essentially incurable cancer. Anti-inflammatory nutraceuticals have emerged as promising candidates against HCC, yet the mechanisms through which they influence the cell signaling machinery to impose phenotypic changes remain unresolved. Herein we implemented a systems biology approach in HCC cells, based on the integration of cytokine release and phospoproteomic data from high-throughput xMAP Luminex assays to elucidate the action mode of prominent nutraceuticals in terms of topology alterations of HCC-specific signaling networks. An optimization algorithm based on SigNetTrainer, an Integer Linear Programming formulation, was applied to construct networks linking signal transduction to cytokine secretion by combining prior knowledge of protein connectivity with proteomic data. Our analysis identified the most probable target phosphoproteins of interrogated compounds and predicted translational control as a new mechanism underlying their anticytokine action. Induced alterations corroborated with inhibition of HCC-driven angiogenesis and metastasis. PMID:26225263

  3. Electrochemistry-Assisted Top-Down Characterization of Disulfide-Containing Proteins

    PubMed Central

    Zhang, Yun; Cui, Weidong; Zhang, Hao; Dewald, Howard D.; Chen, Hao

    2013-01-01

    Covalent disulfide bond linkage in a protein represents an important challenge for mass spectrometry (MS)-based top-down protein structure analysis as it reduces the backbone cleavage efficiency for MS/MS dissociation. This study presents a strategy for solving this critical issue via integrating electrochemistry (EC) online with top-down MS approach. In this approach, proteins undergo electrolytic reduction in an electrochemical cell to break disulfide bonds and then online ionized into gaseous ions for analysis by electron-capture dissociation (ECD) and collision-induced dissociation (CID). The electrochemical reduction of proteins allows to remove disulfide bond constraints and also leads to increased charge numbers of the resulting protein ions. As a result, sequence coverage was significantly enhanced, as exemplified by β-lactoglobulin A (24 vs. 73 backbone cleavages before and after electrolytic reduction, respectively) and lysozyme (5 vs. 66 backbone cleavages before and after electrolytic reduction, respectively). This methodology is fast and does not need chemical reductants, which would have an important impact in high-throughput proteomics research. PMID:22448817

  4. Electrochemistry-assisted top-down characterization of disulfide-containing proteins.

    PubMed

    Zhang, Yun; Cui, Weidong; Zhang, Hao; Dewald, Howard D; Chen, Hao

    2012-04-17

    Covalent disulfide bond linkage in a protein represents an important challenge for mass spectrometry (MS)-based top-down protein structure analysis as it reduces the backbone cleavage efficiency for MS/MS dissociation. This study presents a strategy for solving this critical issue via integrating electrochemistry (EC) online with a top-down MS approach. In this approach, proteins undergo electrolytic reduction in an electrochemical cell to break disulfide bonds and then undergo online ionization into gaseous ions for analysis by electron-capture dissociation (ECD) and collision-induced dissociation (CID). The electrochemical reduction of proteins allows one to remove disulfide bond constraints and also leads to increased charge numbers of the resulting protein ions. As a result, sequence coverage was significantly enhanced, as exemplified by β-lactoglobulin A (24 vs 75 backbone cleavages before and after electrolytic reduction, respectively) and lysozyme (5 vs 66 backbone cleavages before and after electrolytic reduction, respectively). This methodology is fast and does not need chemical reductants, which would have an important impact in high-throughput proteomics research.

  5. New technology and resources for cryptococcal research

    PubMed Central

    Zhang, Nannan; Park, Yoon-Dong; Williamson, Peter R.

    2014-01-01

    Rapid advances in molecular biology and genome sequencing have enabled the generation of new technology and resources for cryptococcal research. RNAi-mediated specific gene knock down has become routine and more efficient by utilizing modified shRNA plasmids and convergent promoter RNAi constructs. This system was recently applied in a high-throughput screen to identify genes involved in host-pathogen interactions. Gene deletion efficiencies have also been improved by increasing rates of homologous recombination through a number of approaches, including a combination of double-joint PCR with split-marker transformation, the use of dominant selectable markers and the introduction of Cre-Loxp systems into Cryptococcus. Moreover, visualization of cryptococcal proteins has become more facile using fusions with codon-optimized fluorescent tags, such as green or red fluorescent proteins or, mCherry. Using recent genome-wide analytical tools, new transcriptional factors and regulatory proteins have been identified in novel virulence-related signaling pathways by employing microarray analysis, RNA-sequencing and proteomic analysis. PMID:25460849

  6. Quantifying protein-protein interactions in high throughput using protein domain microarrays.

    PubMed

    Kaushansky, Alexis; Allen, John E; Gordus, Andrew; Stiffler, Michael A; Karp, Ethan S; Chang, Bryan H; MacBeath, Gavin

    2010-04-01

    Protein microarrays provide an efficient way to identify and quantify protein-protein interactions in high throughput. One drawback of this technique is that proteins show a broad range of physicochemical properties and are often difficult to produce recombinantly. To circumvent these problems, we have focused on families of protein interaction domains. Here we provide protocols for constructing microarrays of protein interaction domains in individual wells of 96-well microtiter plates, and for quantifying domain-peptide interactions in high throughput using fluorescently labeled synthetic peptides. As specific examples, we will describe the construction of microarrays of virtually every human Src homology 2 (SH2) and phosphotyrosine binding (PTB) domain, as well as microarrays of mouse PDZ domains, all produced recombinantly in Escherichia coli. For domains that mediate high-affinity interactions, such as SH2 and PTB domains, equilibrium dissociation constants (K(D)s) for their peptide ligands can be measured directly on arrays by obtaining saturation binding curves. For weaker binding domains, such as PDZ domains, arrays are best used to identify candidate interactions, which are then retested and quantified by fluorescence polarization. Overall, protein domain microarrays provide the ability to rapidly identify and quantify protein-ligand interactions with minimal sample consumption. Because entire domain families can be interrogated simultaneously, they provide a powerful way to assess binding selectivity on a proteome-wide scale and provide an unbiased perspective on the connectivity of protein-protein interaction networks.

  7. Comparative Proteomics Reveals a Significant Bias Toward Alternative Protein Isoforms with Conserved Structure and Function

    PubMed Central

    Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.

    2012-01-01

    Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints. PMID:22446687

  8. Plant Abiotic Stress Proteomics: The Major Factors Determining Alterations in Cellular Proteome

    PubMed Central

    Kosová, Klára; Vítámvás, Pavel; Urban, Milan O.; Prášil, Ilja T.; Renaut, Jenny

    2018-01-01

    HIGHLIGHTS: Major environmental and genetic factors determining stress-related protein abundance are discussed.Major aspects of protein biological function including protein isoforms and PTMs, cellular localization and protein interactions are discussed.Functional diversity of protein isoforms and PTMs is discussed. Abiotic stresses reveal profound impacts on plant proteomes including alterations in protein relative abundance, cellular localization, post-transcriptional and post-translational modifications (PTMs), protein interactions with other protein partners, and, finally, protein biological functions. The main aim of the present review is to discuss the major factors determining stress-related protein accumulation and their final biological functions. A dynamics of stress response including stress acclimation to altered ambient conditions and recovery after the stress treatment is discussed. The results of proteomic studies aimed at a comparison of stress response in plant genotypes differing in stress adaptability reveal constitutively enhanced levels of several stress-related proteins (protective proteins, chaperones, ROS scavenging- and detoxification-related enzymes) in the tolerant genotypes with respect to the susceptible ones. Tolerant genotypes can efficiently adjust energy metabolism to enhanced needs during stress acclimation. Stress tolerance vs. stress susceptibility are relative terms which can reflect different stress-coping strategies depending on the given stress treatment. The role of differential protein isoforms and PTMs with respect to their biological functions in different physiological constraints (cellular compartments and interacting partners) is discussed. The importance of protein functional studies following high-throughput proteome analyses is presented in a broader context of plant biology. In summary, the manuscript tries to provide an overview of the major factors which have to be considered when interpreting data from proteomic studies on stress-treated plants. PMID:29472941

  9. Systems Approaches to Biology and Disease Enable Translational Systems Medicine

    PubMed Central

    Hood, Leroy; Tian, Qiang

    2012-01-01

    The development and application of systems strategies to biology and disease are transforming medical research and clinical practice in an unprecedented rate. In the foreseeable future, clinicians, medical researchers, and ultimately the consumers and patients will be increasingly equipped with a deluge of personal health information, e.g., whole genome sequences, molecular profiling of diseased tissues, and periodic multi-analyte blood testing of biomarker panels for disease and wellness. The convergence of these practices will enable accurate prediction of disease susceptibility and early diagnosis for actionable preventive schema and personalized treatment regimes tailored to each individual. It will also entail proactive participation from all major stakeholders in the health care system. We are at the dawn of predictive, preventive, personalized, and participatory (P4) medicine, the fully implementation of which requires marrying basic and clinical researches through advanced systems thinking and the employment of high-throughput technologies in genomics, proteomics, nanofluidics, single-cell analysis, and computation strategies in a highly-orchestrated discipline we termed translational systems medicine. PMID:23084773

  10. Organic matrix-related mineralization of sea urchin spicules, spines, test and teeth.

    PubMed

    Veis, Arthur

    2011-06-01

    The camarodont echinoderms have five distinct mineralized skeletal elements: embryonic spicules, mature test, spines, lantern stereom and teeth. The spicules are transient structural elements whereas the spines, and test plates are permanent. The teeth grow continuously. The mineral is a high magnesium calcite, but the magnesium content is different in each type of skeletal element, varying from 5 to 40 mole% Mg. The organic matrix creates the spaces and environments for crystal initiation and growth. The detailed mechanisms of crystal regulation are not known, but acidic and phosphorylated matrix proteins may be of special importance. Biochemical studies, sequencing of the complete genome, and high-throughput proteomic analysis have not yet provided insight into the mechanisms of crystallization, calcite composition, and orientation applicable to all skeletal elements. The embryonic spicules are not representative of the mature skeletal elements. The next phase of research will have to focus on the specific localization of the proteins and individual biochemistries of each system with regard to mineral content and placement.

  11. Identification of IGFBP2 and IGFBP3 As Compensatory Biomarkers for CA19-9 in Early-Stage Pancreatic Cancer Using a Combination of Antibody-Based and LC-MS/MS-Based Proteomics

    PubMed Central

    Yoneyama, Toshihiro; Ohtsuki, Sumio; Honda, Kazufumi; Kobayashi, Makoto; Iwasaki, Motoki; Uchida, Yasuo; Okusaka, Takuji; Nakamori, Shoji; Shimahara, Masashi; Ueno, Takaaki; Tsuchida, Akihiko; Sata, Naohiro; Ioka, Tatsuya; Yasunami, Yohichi; Kosuge, Tomoo; Kaneda, Takashi; Kato, Takao; Yagihara, Kazuhiro; Fujita, Shigeyuki; Huang, Wilber; Yamada, Tesshi; Tachikawa, Masanori; Terasaki, Tetsuya

    2016-01-01

    Pancreatic cancer is one of the most lethal tumors, and reliable detection of early-stage pancreatic cancer and risk diseases for pancreatic cancer is essential to improve the prognosis. As 260 genes were previously reported to be upregulated in invasive ductal adenocarcinoma of pancreas (IDACP) cells, quantification of the corresponding proteins in plasma might be useful for IDACP diagnosis. Therefore, the purpose of the present study was to identify plasma biomarkers for early detection of IDACP by using two proteomics strategies: antibody-based proteomics and liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based proteomics. Among the 260 genes, we focused on 130 encoded proteins with known function for which antibodies were available. Twenty-three proteins showed values of the area under the curve (AUC) of more than 0.8 in receiver operating characteristic (ROC) analysis of reverse-phase protein array (RPPA) data of IDACP patients compared with healthy controls, and these proteins were selected as biomarker candidates. We then used our high-throughput selected reaction monitoring or multiple reaction monitoring (SRM/MRM) methodology, together with an automated sample preparation system, micro LC and auto analysis system, to quantify these candidate proteins in plasma from healthy controls and IDACP patients on a large scale. The results revealed that insulin-like growth factor-binding protein (IGFBP)2 and IGFBP3 have the ability to discriminate IDACP patients at an early stage from healthy controls, and IGFBP2 appeared to be increased in risk diseases of pancreatic malignancy, such as intraductal papillary mucinous neoplasms (IPMNs). Furthermore, diagnosis of IDACP using the combination of carbohydrate antigen 19–9 (CA19-9), IGFBP2 and IGFBP3 is significantly more effective than CA19-9 alone. This suggests that IGFBP2 and IGFBP3 may serve as compensatory biomarkers for CA19-9. Early diagnosis with this marker combination may improve the prognosis of IDACP patients. PMID:27579675

  12. Unraveling the proteomic profile of mice testis during the initiation of meiosis.

    PubMed

    Shao, Binbin; Guo, Yueshuai; Wang, Lei; Zhou, Quan; Gao, Tingting; Zheng, Bo; Zheng, Haoyu; Zhou, Tao; Zhou, Zuomin; Guo, Xuejiang; Huang, Xiaoyan; Sha, Jiahao

    2015-04-29

    In mice, once primordial germ cells (PGCs) are generated, they continue to proliferate and migrate to eventually reach the future gonads. They initiate sexual differentiation after their colonization of the gonads. During this process, retinoic acid (RA) induces meiosis in the female germ cells, which proceeds to the diplotene stage of meiotic prophase I, whereas the male germ cells initiate growth arrest. After birth, meiosis is initiated in mice spermatogonia by their conversion to preleptotene spermatocytes. There are evidences showing the roles of RA in the regulation of spermatogonial differentiation and meiosis initiation. However, it is still not well known on what responds to RA and how RA signaling engages meiosis. Thus, we constructed a proteomic profile of proteins associated with meiosis onset during testis development in mouse and identified 104 differentially expressed proteins (≥1.5 folds). Bioinformatic analysis showed proteins functioning in specific cell processes. The expression patterns of five selected proteins were verified via Western blot, of which we found that Tfrc gene was RA responsive, with a RA responsive element, and could be up regulated by RA in spermatogonial stem cell (SSC) line. Taken together, the results provide an important reference profile for further functional study of meiosis initiation. Spermatogenesis involves mitosis of spermatogonia, meiosis of spermatocytes and spermiogenesis, in which meiosis is a unique event to germ cells, and not in the somatic cells. Till now, the detailed molecular mechanisms of the transition from mitosis to meiosis are still not elucidated. With high-throughput proteomic technology, it is now possible to systemically identify proteins possibly involved. With TMT-6plex based quantification, we identified 104 proteins differentially between testes without meiosis (day 8.5) and those that were meiosis initiated (day 10.5). And a well-known protein essential for meiosis initiation, stra8, was identified to be differentially expressed in the study. And bioinformatic analysis and functional studies revealed several proteins regulated by retinoic acid, a chemical known to regulate the meiosis initiation. Thus, this quantitative proteomic approach can identify meiosis initiation regulating proteins, and further functional studies of these proteins will help elucidate the mechanisms of meiosis initiation. Copyright © 2015. Published by Elsevier B.V.

  13. Jellyfish venomics and venom gland transcriptomics analysis of Stomolophus meleagris to reveal the toxins associated with sting.

    PubMed

    Li, Rongfeng; Yu, Huahua; Xue, Wei; Yue, Yang; Liu, Song; Xing, Ronge; Li, Pengcheng

    2014-06-25

    Jellyfish Stomolophus meleagris is a very dangerous animal because of its strong toxicity. However, the composition of the venom is still unclear. Both proteomics and transcriptomics approaches were applied in present study to investigate the major components and their possible relationships to the sting. The proteomics of the venom from S. meleagris was conducted by tryptic digestion of the crude venom followed by RP-HPLC separation and MS/MS analysis of the tryptic peptides. The venom gland transcriptome was analyzed using a high-throughput Illumina sequencing platform HiSeq 2000 with de novo assembly. A total of 218 toxins were identified including C-type lectin, phospholipase A₂ (PLA₂), potassium channel inhibitor, protease inhibitor, metalloprotease, hemolysin and other toxins, most of which should be responsible for the sting. Among them, serine protease inhibitor, PLA₂, potassium channel inhibitor and metalloprotease are predominant, representing 28.44%, 21.56%, 16.06% and 15.14% of the identified venom proteins, respectively. Overall, our combined proteomics and transcriptomics approach provides a systematic overview of the toxins in the venom of jellyfish S. meleagris and it will be significant to understand the mechanism of the sting. Jellyfish Stomolophus meleagris is a very dangerous animal because of its strong toxicity. It often bloomed in the coast of China in recent years and caused thousands of people stung and even deaths every year. However, the components which caused sting are still unknown yet. In addition, no study about the venomics of jellyfish S. meleagris has been reported. In the present study, both proteomics and transcriptomics approaches were applied to investigate the major components related to the sting. The result showed that major component included C-type lectin, phospholipase A₂, potassium channel inhibitor, protease inhibitor, metalloprotease, hemolysin and other toxins, which should be responsible for the effect of sting. This is the first research about the venomics of jellyfish S. meleagris. It will be significant to understand the mechanism of the biological effects and helpful to develop ways to deal with the sting. Copyright © 2014 Elsevier B.V. All rights reserved.

  14. PROTEOMICS IN ECOTOXICOLOGY: PROTEIN EXPRESSION PROFILING TO SCREEN CHEMICALS FOR ENDOCRINE ACTIVITY

    EPA Science Inventory

    Abstract for poster.

    Current endocrine testing methods are animal intensive and lack the throughput necessary to screen large numbers of environmental chemicals for adverse effects. In this study, Matrix Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry...

  15. Proteomic analysis of adducted butyrylcholinesterase for biomonitoring organophosphorus exposures

    PubMed Central

    Marsillach, Judit; Hsieh, Edward J.; Richter, Rebecca J.; MacCoss, Michael J.; Furlong, Clement E.

    2014-01-01

    Organophosphorus (OP) compounds include a broad group of toxic chemicals such as insecticides, chemical warfare agents and antiwear agents. The liver cytochromes P450 bioactivate many OPs to potent inhibitors of serine hydrolases. Cholinesterases were the first OP targets discovered and are the most studied. They are used to monitor human exposures to OP compounds. However, the assay that is currently used has limitations. The mechanism of action of OP compounds is the inhibition of serine hydrolases by covalently modifying their active-site serine. After structural rearrangement, the complex OP inhibitor-enzyme is irreversible and will remain in circulation until the modified enzyme is degraded. Mass spectrometry is a sensitive technology for analyzing protein modifications, such as OP-adducted enzymes. These analyses also provide some information about the nature of the OP adduct. Our aim is to develop high-throughput protocols for monitoring OP exposures using mass spectrometry. PMID:23123252

  16. A multichannel nanosensor for instantaneous readout of cancer drug mechanisms

    NASA Astrophysics Data System (ADS)

    Rana, Subinoy; Le, Ngoc D. B.; Mout, Rubul; Saha, Krishnendu; Tonga, Gulen Yesilbag; Bain, Robert E. S.; Miranda, Oscar R.; Rotello, Caren M.; Rotello, Vincent M.

    2015-01-01

    Screening methods that use traditional genomic, transcriptional, proteomic and metabonomic signatures to characterize drug mechanisms are known. However, they are time consuming and require specialized equipment. Here, we present a high-throughput multichannel sensor platform that can profile the mechanisms of various chemotherapeutic drugs in minutes. The sensor consists of a gold nanoparticle complexed with three different fluorescent proteins that can sense drug-induced physicochemical changes on cell surfaces. In the presence of cells, fluorescent proteins are rapidly displaced from the gold nanoparticle surface and fluorescence is restored. Fluorescence ‘turn on’ of the fluorescent proteins depends on the drug-induced cell surface changes, generating patterns that identify specific mechanisms of cell death induced by drugs. The nanosensor is generalizable to different cell types and does not require processing steps before analysis, offering an effective way to expedite research in drug discovery, toxicology and cell-based sensing.

  17. Microfluidic array platform for simultaneous lipid bilayer membrane formation.

    PubMed

    Zagnoni, M; Sandison, M E; Morgan, H

    2009-01-01

    In recent years, protein array technologies have found widespread applications in proteomics. However, new methods for high-throughput analysis of protein-protein and protein-compound interactions are still required. In this paper, an array of lipid bilayer membranes formed within a microfluidic system with integrated electrodes is presented. The system is comprised of three layers that are clamped together, thus rendering the device cleanable and reusable. The device microfluidics enable the simultaneous formation of an array of lipid bilayers using a previously developed air-exposure technique, thereby avoiding the need to manually form individual bilayers. The Ag/AgCl electrodes allow for ion channel measurements, each of the sites being independently addressable. Typically, a 50% yield in simultaneous lipid bilayer formation over 12 sites was obtained and ion channel recordings have been acquired over multiple sites. This system has great potential for the development of an automatable platform of suspended lipid bilayer arrays.

  18. Growing trend of CE at the omics level: the frontier of systems biology--an update.

    PubMed

    Ban, Eunmi; Park, Soo Hyun; Kang, Min-Jung; Lee, Hyun-Jung; Song, Eun Joo; Yoo, Young Sook

    2012-01-01

    Omics is the study of proteins, peptides, genes, and metabolites in living organisms. Systems biology aims to understand the system through the study of the relationship between elements such as genes and proteins in biological system. Recently, systems biology emerged as the result of the advanced development of high-throughput analysis technologies such as DNA sequencers, DNA arrays, and mass spectrometry for omics studies. Among a number of analytical tools and technologies, CE and CE coupled to MS are promising and relatively rapidly developing tools with the potential to provide qualitative and quantitative analyses of biological molecules. With an emphasis on CE for systems biology, this review summarizes the method developments and applications of CE for the genomic, transcriptomic, proteomic, and metabolomic studies focusing on the drug discovery and disease diagnosis and therapies since 2009. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. neXtProt: organizing protein knowledge in the context of human proteome projects.

    PubMed

    Gaudet, Pascale; Argoud-Puy, Ghislaine; Cusin, Isabelle; Duek, Paula; Evalet, Olivier; Gateau, Alain; Gleizes, Anne; Pereira, Mario; Zahn-Zabal, Monique; Zwahlen, Catherine; Bairoch, Amos; Lane, Lydie

    2013-01-04

    About 5000 (25%) of the ~20400 human protein-coding genes currently lack any experimental evidence at the protein level. For many others, there is only little information relative to their abundance, distribution, subcellular localization, interactions, or cellular functions. The aim of the HUPO Human Proteome Project (HPP, www.thehpp.org ) is to collect this information for every human protein. HPP is based on three major pillars: mass spectrometry (MS), antibody/affinity capture reagents (Ab), and bioinformatics-driven knowledge base (KB). To meet this objective, the Chromosome-Centric Human Proteome Project (C-HPP) proposes to build this catalog chromosome-by-chromosome ( www.c-hpp.org ) by focusing primarily on proteins that currently lack MS evidence or Ab detection. These are termed "missing proteins" by the HPP consortium. The lack of observation of a protein can be due to various factors including incorrect and incomplete gene annotation, low or restricted expression, or instability. neXtProt ( www.nextprot.org ) is a new web-based knowledge platform specific for human proteins that aims to complement UniProtKB/Swiss-Prot ( www.uniprot.org ) with detailed information obtained from carefully selected high-throughput experiments on genomic variation, post-translational modifications, as well as protein expression in tissues and cells. This article describes how neXtProt contributes to prioritize C-HPP efforts and integrates C-HPP results with other research efforts to create a complete human proteome catalog.

  20. Display technologies: application for the discovery of drug and gene delivery agents

    PubMed Central

    Sergeeva, Anna; Kolonin, Mikhail G.; Molldrem, Jeffrey J.; Pasqualini, Renata; Arap, Wadih

    2007-01-01

    Recognition of molecular diversity of cell surface proteomes in disease is essential for the development of targeted therapies. Progress in targeted therapeutics requires establishing effective approaches for high-throughput identification of agents specific for clinically relevant cell surface markers. Over the past decade, a number of platform strategies have been developed to screen polypeptide libraries for ligands targeting receptors selectively expressed in the context of various cell surface proteomes. Streamlined procedures for identification of ligand-receptor pairs that could serve as targets in disease diagnosis, profiling, imaging and therapy have relied on the display technologies, in which polypeptides with desired binding profiles can be serially selected, in a process called biopanning, based on their physical linkage with the encoding nucleic acid. These technologies include virus/phage display, cell display, ribosomal display, mRNA display and covalent DNA display (CDT), with phage display being by far the most utilized. The scope of this review is the recent advancements in the display technologies with a particular emphasis on molecular mapping of cell surface proteomes with peptide phage display. Prospective applications of targeted compounds derived from display libraries in the discovery of targeted drugs and gene therapy vectors are discussed. PMID:17123658

  1. Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size

    PubMed Central

    Zhang, Jingshan; Maslov, Sergei; Shakhnovich, Eugene I

    2008-01-01

    Crowded intracellular environments present a challenge for proteins to form functional specific complexes while reducing non-functional interactions with promiscuous non-functional partners. Here we show how the need to minimize the waste of resources to non-functional interactions limits the proteome diversity and the average concentration of co-expressed and co-localized proteins. Using the results of high-throughput Yeast 2-Hybrid experiments, we estimate the characteristic strength of non-functional protein–protein interactions. By combining these data with the strengths of specific interactions, we assess the fraction of time proteins spend tied up in non-functional interactions as a function of their overall concentration. This allows us to sketch the phase diagram for baker's yeast cells using the experimentally measured concentrations and subcellular localization of their proteins. The positions of yeast compartments on the phase diagram are consistent with our hypothesis that the yeast proteome has evolved to operate closely to the upper limit of its size, whereas keeping individual protein concentrations sufficiently low to reduce non-functional interactions. These findings have implication for conceptual understanding of intracellular compartmentalization, multicellularity and differentiation. PMID:18682700

  2. The Role of Proteomics in the Diagnosis and Treatment of Women's Cancers: Current Trends in Technology and Future Opportunities

    PubMed Central

    Breuer, Eun-Kyoung Yim; Murph, Mandi M.

    2011-01-01

    Technological and scientific innovations over the last decade have greatly contributed to improved diagnostics, predictive models, and prognosis among cancers affecting women. In fact, an explosion of information in these areas has almost assured future generations that outcomes in cancer will continue to improve. Herein we discuss the current status of breast, cervical, and ovarian cancers as it relates to screening, disease diagnosis, and treatment options. Among the differences in these cancers, it is striking that breast cancer has multiple predictive tests based upon tumor biomarkers and sophisticated, individualized options for prescription therapeutics while ovarian cancer lacks these tools. In addition, cervical cancer leads the way in innovative, cancer-preventative vaccines and multiple screening options to prevent disease progression. For each of these malignancies, emerging proteomic technologies based upon mass spectrometry, stable isotope labeling with amino acids, high-throughput ELISA, tissue or protein microarray techniques, and click chemistry in the pursuit of activity-based profiling can pioneer the next generation of discovery. We will discuss six of the latest techniques to understand proteomics in cancer and highlight research utilizing these techniques with the goal of improvement in the management of women's cancers. PMID:21886869

  3. Adolescent social isolation affects schizophrenia-like behavior and astrocyte biomarkers in the PFC of adult rats.

    PubMed

    Sun, Lan; Min, Li; Zhou, Hao; Li, Man; Shao, Feng; Wang, Weiwen

    2017-08-30

    Social isolation is regarded as a cause of schizophrenia spectrum disorders. Animal models of schizophrenia are constructed by repeated early environment deprivation as an important paradigm to reveal its pathological mechanism. Male Sprague Dawley rats were assigned to either social-rearing (SR) or isolated-rearing (IR) groups during postnatal days (PNDs) 21-34. On PND 56, all rats underwent behavioral testing including locomotor activity, anxiety-related behaviors in an open field and prepulse inhibition (PPI). Then, the rats were sacrificed and prefrontal cortex (PFC) tissues were separated for high-throughput proteomics analysis and Western blot validation. Rats of the IR group showed increased spontaneous locomotion, increased anxiety-like behavior and disrupted PPI compared with rats of the SR group. Based on proteomics analysis, a total of 124 PFC proteins were found to be significantly differentially expressed between the SR group and the IR group, the most remarkable of which were glial fibrillary acidic protein (GFAP), Annexin A2 (ANXA2) and vimentin (VIM), three astrocyte biomarkers. Further Western blot measurement confirmed that the levels of GFAP, ANXA2 and VIM were increased significantly in IR rats. Adolescent social isolation induced schizophrenia-like behaviors and significantly different expression of 124 PFC proteins in adult rats, especially GFAP, ANXA2 and VIM, which suggests that astrocyte development might be involved in the neural mechanism of schizophrenia. Copyright © 2017. Published by Elsevier B.V.

  4. Quantitative label-free proteomic analysis of human urine to identify novel candidate protein biomarkers for schistosomiasis.

    PubMed

    Onile, Olugbenga Samson; Calder, Bridget; Soares, Nelson C; Anumudu, Chiaka I; Blackburn, Jonathan M

    2017-11-01

    Schistosomiasis is a chronic neglected tropical disease that is characterized by continued inflammatory challenges to the exposed population and it has been established as a possible risk factor in the aetiology of bladder cancer. Improved diagnosis of schistosomiasis and its associated pathology is possible through mass spectrometry to identify biomarkers among the infected population, which will influence early detection of the disease and its subtle morbidity. A high-throughput proteomic approach was used to analyse human urine samples for 49 volunteers from Eggua, a schistosomiasis endemic community in South-West, Nigeria. The individuals were previously screened for Schistosoma haematobium and structural bladder pathologies via microscopy and ultrasonography respectively. Samples were categorised into schistosomiasis, schistosomiasis with bladder pathology, bladder pathology, and a normal healthy control group. These samples were analysed to identify potential protein biomarkers. A total of 1306 proteins and 9701 unique peptides were observed in this study (FDR = 0.01). Fifty-four human proteins were found to be potential biomarkers for schistosomiasis and bladder pathologies due to schistosomiasis by label-free quantitative comparison between groups. Thirty-six (36) parasite-derived potential biomarkers were also identified, which include some existing putative schistosomiasis biomarkers that have been previously reported. Some of these proteins include Elongation factor 1 alpha, phosphopyruvate hydratase, histone H4 and heat shock proteins (HSP 60, HSP 70). These findings provide an in-depth analysis of potential schistosoma and human host protein biomarkers for diagnosis of chronic schistosomiasis caused by Schistosoma haematobium and its pathogenesis.

  5. Reconstructing the regulatory circuit of cell fate determination in yeast mating response.

    PubMed

    Shao, Bin; Yuan, Haiyu; Zhang, Rongfei; Wang, Xuan; Zhang, Shuwen; Ouyang, Qi; Hao, Nan; Luo, Chunxiong

    2017-07-01

    Massive technological advances enabled high-throughput measurements of proteomic changes in biological processes. However, retrieving biological insights from large-scale protein dynamics data remains a challenging task. Here we used the mating differentiation in yeast Saccharomyces cerevisiae as a model and developed integrated experimental and computational approaches to analyze the proteomic dynamics during the process of cell fate determination. When exposed to a high dose of mating pheromone, the yeast cell undergoes growth arrest and forms a shmoo-like morphology; however, at intermediate doses, chemotropic elongated growth is initialized. To understand the gene regulatory networks that control this differentiation switch, we employed a high-throughput microfluidic imaging system that allows real-time and simultaneous measurements of cell growth and protein expression. Using kinetic modeling of protein dynamics, we classified the stimulus-dependent changes in protein abundance into two sources: global changes due to physiological alterations and gene-specific changes. A quantitative framework was proposed to decouple gene-specific regulatory modes from the growth-dependent global modulation of protein abundance. Based on the temporal patterns of gene-specific regulation, we established the network architectures underlying distinct cell fates using a reverse engineering method and uncovered the dose-dependent rewiring of gene regulatory network during mating differentiation. Furthermore, our results suggested a potential crosstalk between the pheromone response pathway and the target of rapamycin (TOR)-regulated ribosomal biogenesis pathway, which might underlie a cell differentiation switch in yeast mating response. In summary, our modeling approach addresses the distinct impacts of the global and gene-specific regulation on the control of protein dynamics and provides new insights into the mechanisms of cell fate determination. We anticipate that our integrated experimental and modeling strategies could be widely applicable to other biological systems.

  6. The mammary gland in domestic ruminants: a systems biology perspective.

    PubMed

    Ferreira, Ana M; Bislev, Stine L; Bendixen, Emøke; Almeida, André M

    2013-12-06

    Milk and dairy products are central elements in the human diet. It is estimated that 108kg of milk per year are consumed per person worldwide. Therefore, dairy production represents a relevant fraction of the economies of many countries, being cattle, sheep, goat, water buffalo, and other ruminants the main species used worldwide. An adequate management of dairy farming cannot be achieved without the knowledge on the biological mechanisms behind lactation in ruminants. Thus, understanding the morphology, development and regulation of the mammary gland in health, disease and production is crucial. Presently, innovative and high-throughput technologies such as genomics, transcriptomics, proteomics and metabolomics allow a much broader and detailed knowledge on such issues. Additionally, the application of a systems biology approach to animal science is vastly growing, as new advances in one field of specialization or animal species lead to new lines of research in other areas or/and are expanded to other species. This article addresses how modern research approaches may help us understand long-known issues in mammary development, lactation biology and dairy production. Dairy production depends upon the knowledge of the morphology and regulation of the mammary gland and lactation. High-throughput technologies allow a much broader and detailed knowledge on the biology of the mammary gland. This paper reviews the major contributions that genomics, transcriptomics, metabolomics and proteomics approaches have provided to understand the regulation of the mammary gland in health, disease and production. In the context of mammary gland "omics"-based research, the integration of results using a Systems Biology Approach is of key importance. © 2013.

  7. Shotgun proteomics of plant plasma membrane and microdomain proteins using nano-LC-MS/MS.

    PubMed

    Takahashi, Daisuke; Li, Bin; Nakayama, Takato; Kawamura, Yukio; Uemura, Matsuo

    2014-01-01

    Shotgun proteomics allows the comprehensive analysis of proteins extracted from plant cells, subcellular organelles, and membranes. Previously, two-dimensional gel electrophoresis-based proteomics was used for mass spectrometric analysis of plasma membrane proteins. In order to get comprehensive proteome profiles of the plasma membrane including highly hydrophobic proteins with a number of transmembrane domains, a mass spectrometry-based shotgun proteomics method using nano-LC-MS/MS for proteins from the plasma membrane proteins and plasma membrane microdomain fraction is described. The results obtained are easily applicable to label-free protein semiquantification.

  8. Finding Biomass Degrading Enzymes Through an Activity-Correlated Quantitative Proteomics Platform (ACPP).

    PubMed

    Ma, Hongyan; Delafield, Daniel G; Wang, Zhe; You, Jianlan; Wu, Si

    2017-04-01

    The microbial secretome, known as a pool of biomass (i.e., plant-based materials) degrading enzymes, can be utilized to discover industrial enzyme candidates for biofuel production. Proteomics approaches have been applied to discover novel enzyme candidates through comparing protein expression profiles with enzyme activity of the whole secretome under different growth conditions. However, the activity measurement of each enzyme candidate is needed for confident "active" enzyme assignments, which remains to be elucidated. To address this challenge, we have developed an Activity-Correlated Quantitative Proteomics Platform (ACPP) that systematically correlates protein-level enzymatic activity patterns and protein elution profiles using a label-free quantitative proteomics approach. The ACPP optimized a high performance anion exchange separation for efficiently fractionating complex protein samples while preserving enzymatic activities. The detected enzymatic activity patterns in sequential fractions using microplate-based assays were cross-correlated with protein elution profiles using a customized pattern-matching algorithm with a correlation R-score. The ACPP has been successfully applied to the identification of two types of "active" biomass-degrading enzymes (i.e., starch hydrolysis enzymes and cellulose hydrolysis enzymes) from Aspergillus niger secretome in a multiplexed fashion. By determining protein elution profiles of 156 proteins in A. niger secretome, we confidently identified the 1,4-α-glucosidase as the major "active" starch hydrolysis enzyme (R = 0.96) and the endoglucanase as the major "active" cellulose hydrolysis enzyme (R = 0.97). The results demonstrated that the ACPP facilitated the discovery of bioactive enzymes from complex protein samples in a high-throughput, multiplexing, and untargeted fashion. Graphical Abstract ᅟ.

  9. Finding Biomass Degrading Enzymes Through an Activity-Correlated Quantitative Proteomics Platform (ACPP)

    NASA Astrophysics Data System (ADS)

    Ma, Hongyan; Delafield, Daniel G.; Wang, Zhe; You, Jianlan; Wu, Si

    2017-04-01

    The microbial secretome, known as a pool of biomass (i.e., plant-based materials) degrading enzymes, can be utilized to discover industrial enzyme candidates for biofuel production. Proteomics approaches have been applied to discover novel enzyme candidates through comparing protein expression profiles with enzyme activity of the whole secretome under different growth conditions. However, the activity measurement of each enzyme candidate is needed for confident "active" enzyme assignments, which remains to be elucidated. To address this challenge, we have developed an Activity-Correlated Quantitative Proteomics Platform (ACPP) that systematically correlates protein-level enzymatic activity patterns and protein elution profiles using a label-free quantitative proteomics approach. The ACPP optimized a high performance anion exchange separation for efficiently fractionating complex protein samples while preserving enzymatic activities. The detected enzymatic activity patterns in sequential fractions using microplate-based assays were cross-correlated with protein elution profiles using a customized pattern-matching algorithm with a correlation R-score. The ACPP has been successfully applied to the identification of two types of "active" biomass-degrading enzymes (i.e., starch hydrolysis enzymes and cellulose hydrolysis enzymes) from Aspergillus niger secretome in a multiplexed fashion. By determining protein elution profiles of 156 proteins in A. niger secretome, we confidently identified the 1,4-α-glucosidase as the major "active" starch hydrolysis enzyme (R = 0.96) and the endoglucanase as the major "active" cellulose hydrolysis enzyme (R = 0.97). The results demonstrated that the ACPP facilitated the discovery of bioactive enzymes from complex protein samples in a high-throughput, multiplexing, and untargeted fashion.

  10. A Comprehensive, Open-source Platform for Mass Spectrometry-based Glycoproteomics Data Analysis.

    PubMed

    Liu, Gang; Cheng, Kai; Lo, Chi Y; Li, Jun; Qu, Jun; Neelamegham, Sriram

    2017-11-01

    Glycosylation is among the most abundant and diverse protein post-translational modifications (PTMs) identified to date. The structural analysis of this PTM is challenging because of the diverse monosaccharides which are not conserved among organisms, the branched nature of glycans, their isomeric structures, and heterogeneity in the glycan distribution at a given site. Glycoproteomics experiments have adopted the traditional high-throughput LC-MS n proteomics workflow to analyze site-specific glycosylation. However, comprehensive computational platforms for data analyses are scarce. To address this limitation, we present a comprehensive, open-source, modular software for glycoproteomics data analysis called GlycoPAT (GlycoProteomics Analysis Toolbox; freely available from www.VirtualGlycome.org/glycopat). The program includes three major advances: (1) "SmallGlyPep," a minimal linear representation of glycopeptides for MS n data analysis. This format allows facile serial fragmentation of both the peptide backbone and PTM at one or more locations. (2) A novel scoring scheme based on calculation of the "Ensemble Score (ES)," a measure that scores and rank-orders MS/MS spectrum for N- and O-linked glycopeptides using cross-correlation and probability based analyses. (3) A false discovery rate (FDR) calculation scheme where decoy glycopeptides are created by simultaneously scrambling the amino acid sequence and by introducing artificial monosaccharides by perturbing the original sugar mass. Parallel computing facilities and user-friendly GUIs (Graphical User Interfaces) are also provided. GlycoPAT is used to catalogue site-specific glycosylation on simple glycoproteins, standard protein mixtures and human plasma cryoprecipitate samples in three common MS/MS fragmentation modes: CID, HCD and ETD. It is also used to identify 960 unique glycopeptides in cell lysates from prostate cancer cells. The results show that the simultaneous consideration of peptide and glycan fragmentation is necessary for high quality MS n spectrum annotation in CID and HCD fragmentation modes. Additionally, they confirm the suitability of GlycoPAT to analyze shotgun glycoproteomics data. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  11. Proteomics and Metabolomics: Two Emerging Areas for Legume Improvement

    PubMed Central

    Ramalingam, Abirami; Kudapa, Himabindu; Pazhamala, Lekha T.; Weckwerth, Wolfram; Varshney, Rajeev K.

    2015-01-01

    The crop legumes such as chickpea, common bean, cowpea, peanut, pigeonpea, soybean, etc. are important sources of nutrition and contribute to a significant amount of biological nitrogen fixation (>20 million tons of fixed nitrogen) in agriculture. However, the production of legumes is constrained due to abiotic and biotic stresses. It is therefore imperative to understand the molecular mechanisms of plant response to different stresses and identify key candidate genes regulating tolerance which can be deployed in breeding programs. The information obtained from transcriptomics has facilitated the identification of candidate genes for the given trait of interest and utilizing them in crop breeding programs to improve stress tolerance. However, the mechanisms of stress tolerance are complex due to the influence of multi-genes and post-transcriptional regulations. Furthermore, stress conditions greatly affect gene expression which in turn causes modifications in the composition of plant proteomes and metabolomes. Therefore, functional genomics involving various proteomics and metabolomics approaches have been obligatory for understanding plant stress tolerance. These approaches have also been found useful to unravel different pathways related to plant and seed development as well as symbiosis. Proteome and metabolome profiling using high-throughput based systems have been extensively applied in the model legume species, Medicago truncatula and Lotus japonicus, as well as in the model crop legume, soybean, to examine stress signaling pathways, cellular and developmental processes and nodule symbiosis. Moreover, the availability of protein reference maps as well as proteomics and metabolomics databases greatly support research and understanding of various biological processes in legumes. Protein-protein interaction techniques, particularly the yeast two-hybrid system have been advantageous for studying symbiosis and stress signaling in legumes. In this review, several studies on proteomics and metabolomics in model and crop legumes have been discussed. Additionally, applications of advanced proteomics and metabolomics approaches have also been included in this review for future applications in legume research. The integration of these “omics” approaches will greatly support the identification of accurate biomarkers in legume smart breeding programs. PMID:26734026

  12. Identifying and quantifying proteolytic events and the natural N terminome by terminal amine isotopic labeling of substrates.

    PubMed

    Kleifeld, Oded; Doucet, Alain; Prudova, Anna; auf dem Keller, Ulrich; Gioia, Magda; Kizhakkedathu, Jayachandran N; Overall, Christopher M

    2011-09-22

    Analysis of the sequence and nature of protein N termini has many applications. Defining the termini of proteins for proteome annotation in the Human Proteome Project is of increasing importance. Terminomics analysis of protease cleavage sites in degradomics for substrate discovery is a key new application. Here we describe the step-by-step procedures for performing terminal amine isotopic labeling of substrates (TAILS), a 2- to 3-d (depending on method of labeling) high-throughput method to identify and distinguish protease-generated neo-N termini from mature protein N termini with all natural modifications with high confidence. TAILS uses negative selection to enrich for all N-terminal peptides and uses primary amine labeling-based quantification as the discriminating factor. Labeling is versatile and suited to many applications, including biochemical and cell culture analyses in vitro; in vivo analyses using tissue samples from animal and human sources can also be readily performed. At the protein level, N-terminal and lysine amines are blocked by dimethylation (formaldehyde/sodium cyanoborohydride) and isotopically labeled by incorporating heavy and light dimethylation reagents or stable isotope labeling with amino acids in cell culture labels. Alternatively, easy multiplex sample analysis can be achieved using amine blocking and labeling with isobaric tags for relative and absolute quantification, also known as iTRAQ. After tryptic digestion, N-terminal peptide separation is achieved using a high-molecular-weight dendritic polyglycerol aldehyde polymer that binds internal tryptic and C-terminal peptides that now have N-terminal alpha amines. The unbound naturally blocked (acetylation, cyclization, methylation and so on) or labeled mature N-terminal and neo-N-terminal peptides are recovered by ultrafiltration and analyzed by tandem mass spectrometry (MS/MS). Hierarchical substrate winnowing discriminates substrates from the background proteolysis products and non-cleaved proteins by peptide isotope quantification and bioinformatics search criteria.

  13. Proteomic Analysis of Virus-Host Interactions in an Infectious Context Using Recombinant Viruses*

    PubMed Central

    Komarova, Anastassia V.; Combredet, Chantal; Meyniel-Schicklin, Laurène; Chapelle, Manuel; Caignard, Grégory; Camadro, Jean-Michel; Lotteau, Vincent; Vidalain, Pierre-Olivier; Tangy, Frédéric

    2011-01-01

    RNA viruses exhibit small-sized genomes encoding few proteins, but still establish complex networks of interactions with host cell components to achieve replication and spreading. Ideally, these virus-host protein interactions should be mapped directly in infected cell culture, but such a high standard is often difficult to reach when using conventional approaches. We thus developed a new strategy based on recombinant viruses expressing tagged viral proteins to capture both direct and indirect physical binding partners during infection. As a proof of concept, we engineered a recombinant measles virus (MV) expressing one of its virulence factors, the MV-V protein, with a One-STrEP amino-terminal tag. This allowed virus-host protein complex analysis directly from infected cells by combining modified tandem affinity chromatography and mass spectrometry analysis. Using this approach, we established a prosperous list of 245 cellular proteins interacting either directly or indirectly with MV-V, and including four of the nine already known partners of this viral factor. These interactions were highly specific of MV-V because they were not recovered when the nucleoprotein MV-N, instead of MV-V, was tagged. Besides key components of the antiviral response, cellular proteins from mitochondria, ribosomes, endoplasmic reticulum, protein phosphatase 2A, and histone deacetylase complex were identified for the first time as prominent targets of MV-V and the critical role of the later protein family in MV replication was addressed. Most interestingly, MV-V showed some preferential attachment to essential proteins in the human interactome network, as assessed by centrality and interconnectivity measures. Furthermore, the list of MV-V interactors also showed a massive enrichment for well-known targets of other viruses. Altogether, this clearly supports our approach based on reverse genetics of viruses combined with high-throughput proteomics to probe the interaction network that viruses establish in infected cells. PMID:21911578

  14. Preprocessing and Analysis of LC-MS-Based Proteomic Data

    PubMed Central

    Tsai, Tsung-Heng; Wang, Minkun; Ressom, Habtom W.

    2016-01-01

    Liquid chromatography coupled with mass spectrometry (LC-MS) has been widely used for profiling protein expression levels. This chapter is focused on LC-MS data preprocessing, which is a crucial step in the analysis of LC-MS based proteomics. We provide a high-level overview, highlight associated challenges, and present a step-by-step example for analysis of data from LC-MS based untargeted proteomic study. Furthermore, key procedures and relevant issues with the subsequent analysis by multiple reaction monitoring (MRM) are discussed. PMID:26519169

  15. Comprehensive Analysis of Protein Modifications by Top-down Mass Spectrometry

    PubMed Central

    Zhang, Han; Ge, Ying

    2012-01-01

    Mass spectrometry (MS)-based proteomics is playing an increasingly important role in cardiovascular research. Proteomics includes not only identification and quantification of proteins, but also the characterization of protein modifications such as post-translational modifications and sequence variants. The conventional bottom-up approach, involving proteolytic digestion of proteins into small peptides prior to MS analysis, is routinely used for protein identification and quantification with high throughput and automation. Nevertheless, it has limitations in the analysis of protein modifications mainly due to the partial sequence coverage and loss of connections among modifications on disparate portions of a protein. An alternative approach, top-down MS, has emerged as a powerful tool for the analysis of protein modifications. The top-down approach analyzes whole proteins directly, providing a “bird’s eye” view of all existing modifications. Subsequently, each modified protein form can be isolated and fragmented in the mass spectrometer to locate the modification site. The incorporation of the non-ergodic dissociation methods such as electron capture dissociation (ECD) greatly enhances the top-down capabilities. ECD is especially useful for mapping labile post-translational modifications which are well-preserved during the ECD fragmentation process. Top-down MS with ECD has been successfully applied to cardiovascular research with the unique advantages in unraveling the molecular complexity, quantifying modified protein forms, complete mapping of modifications with full sequence coverage, discovering unexpected modifications, and identifying and quantifying positional isomers and determining the order of multiple modifications. Nevertheless, top-down MS still needs to overcome some technical challenges to realize its full potential. Herein, we reviewed the advantages and challenges of top-down methodology with a focus on its application in cardiovascular research. PMID:22187450

  16. Identification of Antibody Targets for Tuberculosis Serology using High-Density Nucleic Acid Programmable Protein Arrays*

    PubMed Central

    Song, Lusheng; Wallstrom, Garrick; Yu, Xiaobo; Hopper, Marika; Van Duine, Jennifer; Steel, Jason; Park, Jin; Wiktor, Peter; Kahn, Peter; Brunner, Al; Wilson, Douglas; Jenny-Avital, Elizabeth R.; Qiu, Ji; Labaer, Joshua; Magee, D. Mitchell; Achkar, Jacqueline M.

    2017-01-01

    Better and more diverse biomarkers for the development of simple point-of-care tests for active tuberculosis (TB), a clinically heterogeneous disease, are urgently needed. We generated a proteomic Mycobacterium tuberculosis (Mtb) High-Density Nucleic Acid Programmable Protein Array (HD-NAPPA) that used a novel multiplexed strategy for expedited high-throughput screening for antibody responses to the Mtb proteome. We screened sera from HIV uninfected and coinfected TB patients and controls (n = 120) from the US and South Africa (SA) using the multiplex HD-NAPPA for discovery, followed by deconvolution and validation through single protein HD-NAPPA with biologically independent samples (n = 124). We verified the top proteins with enzyme-linked immunosorbent assays (ELISA) using the original screening and validation samples (n = 244) and heretofore untested samples (n = 41). We identified 8 proteins with TB biomarker value; four (Rv0054, Rv0831c, Rv2031c and Rv0222) of these were previously identified in serology studies, and four (Rv0948c, Rv2853, Rv3405c, Rv3544c) were not known to elicit antibody responses. Using ELISA data, we created classifiers that could discriminate patients' TB status according to geography (US or SA) and HIV (HIV- or HIV+) status. With ROC curve analysis under cross validation, the classifiers performed with an AUC for US/HIV- at 0.807; US/HIV+ at 0.782; SA/HIV- at 0.868; and SA/HIV+ at 0.723. With this study we demonstrate a new platform for biomarker/antibody screening and delineate its utility to identify previously unknown immunoreactive proteins. PMID:28223349

  17. SASD: the Synthetic Alternative Splicing Database for identifying novel isoform from proteomics

    PubMed Central

    2013-01-01

    Background Alternative splicing is an important and widespread mechanism for generating protein diversity and regulating protein expression. High-throughput identification and analysis of alternative splicing in the protein level has more advantages than in the mRNA level. The combination of alternative splicing database and tandem mass spectrometry provides a powerful technique for identification, analysis and characterization of potential novel alternative splicing protein isoforms from proteomics. Therefore, based on the peptidomic database of human protein isoforms for proteomics experiments, our objective is to design a new alternative splicing database to 1) provide more coverage of genes, transcripts and alternative splicing, 2) exclusively focus on the alternative splicing, and 3) perform context-specific alternative splicing analysis. Results We used a three-step pipeline to create a synthetic alternative splicing database (SASD) to identify novel alternative splicing isoforms and interpret them at the context of pathway, disease, drug and organ specificity or custom gene set with maximum coverage and exclusive focus on alternative splicing. First, we extracted information on gene structures of all genes in the Ensembl Genes 71 database and incorporated the Integrated Pathway Analysis Database. Then, we compiled artificial splicing transcripts. Lastly, we translated the artificial transcripts into alternative splicing peptides. The SASD is a comprehensive database containing 56,630 genes (Ensembl gene IDs), 95,260 transcripts (Ensembl transcript IDs), and 11,919,779 Alternative Splicing peptides, and also covering about 1,956 pathways, 6,704 diseases, 5,615 drugs, and 52 organs. The database has a web-based user interface that allows users to search, display and download a single gene/transcript/protein, custom gene set, pathway, disease, drug, organ related alternative splicing. Moreover, the quality of the database was validated with comparison to other known databases and two case studies: 1) in liver cancer and 2) in breast cancer. Conclusions The SASD provides the scientific community with an efficient means to identify, analyze, and characterize novel Exon Skipping and Intron Retention protein isoforms from mass spectrometry and interpret them at the context of pathway, disease, drug and organ specificity or custom gene set with maximum coverage and exclusive focus on alternative splicing. PMID:24267658

  18. MASH Suite Pro: A Comprehensive Software Tool for Top-Down Proteomics*

    PubMed Central

    Cai, Wenxuan; Guner, Huseyin; Gregorich, Zachery R.; Chen, Albert J.; Ayaz-Guner, Serife; Peng, Ying; Valeja, Santosh G.; Liu, Xiaowen; Ge, Ying

    2016-01-01

    Top-down mass spectrometry (MS)-based proteomics is arguably a disruptive technology for the comprehensive analysis of all proteoforms arising from genetic variation, alternative splicing, and posttranslational modifications (PTMs). However, the complexity of top-down high-resolution mass spectra presents a significant challenge for data analysis. In contrast to the well-developed software packages available for data analysis in bottom-up proteomics, the data analysis tools in top-down proteomics remain underdeveloped. Moreover, despite recent efforts to develop algorithms and tools for the deconvolution of top-down high-resolution mass spectra and the identification of proteins from complex mixtures, a multifunctional software platform, which allows for the identification, quantitation, and characterization of proteoforms with visual validation, is still lacking. Herein, we have developed MASH Suite Pro, a comprehensive software tool for top-down proteomics with multifaceted functionality. MASH Suite Pro is capable of processing high-resolution MS and tandem MS (MS/MS) data using two deconvolution algorithms to optimize protein identification results. In addition, MASH Suite Pro allows for the characterization of PTMs and sequence variations, as well as the relative quantitation of multiple proteoforms in different experimental conditions. The program also provides visualization components for validation and correction of the computational outputs. Furthermore, MASH Suite Pro facilitates data reporting and presentation via direct output of the graphics. Thus, MASH Suite Pro significantly simplifies and speeds up the interpretation of high-resolution top-down proteomics data by integrating tools for protein identification, quantitation, characterization, and visual validation into a customizable and user-friendly interface. We envision that MASH Suite Pro will play an integral role in advancing the burgeoning field of top-down proteomics. PMID:26598644

  19. Functional Analysis of OMICs Data and Small Molecule Compounds in an Integrated "Knowledge-Based" Platform.

    PubMed

    Dubovenko, Alexey; Nikolsky, Yuri; Rakhmatulin, Eugene; Nikolskaya, Tatiana

    2017-01-01

    Analysis of NGS and other sequencing data, gene variants, gene expression, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high fidelity annotated knowledgebase of protein interactions, pathways, and functional ontologies. This knowledgebase has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here, we present MetaCore™ and Key Pathway Advisor (KPA), an integrated platform for functional data analysis. On the content side, MetaCore and KPA encompass a comprehensive database of molecular interactions of different types, pathways, network models, and ten functional ontologies covering human, mouse, and rat genes. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical "interactome" tool for the identification of over- and under-connected proteins in the dataset, and a biological network analysis module made up of network generation algorithms and filters. The suite also features Advanced Search, an application for combinatorial search of the database content, as well as a Java-based tool called Pathway Map Creator for drawing and editing custom pathway maps. Applications of MetaCore and KPA include molecular mode of action of disease research, identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds and clinical applications (analysis of large cohorts of patients, and translational and personalized medicine).

  20. Organic Matrix-related mineralization of sea urchin spicules, spines, test and teeth

    PubMed Central

    Veis, Arthur

    2012-01-01

    The camarodont echinoderms have five distinct mineralized skeletal elements: the embryonic spicules and mature test; spines, lantern stereom and teeth. The embryonic spicules are transient structural elements of the larval skeleton whereas the spines and test plates are permanent structural elements. The teeth are continuously growing structures, matching wear at the incisal adoral end to the rate of new production at the aboral plumula. The mineral in all cases is a high magnesium calcite, but the magnesium content, crystal shape and growth pattern is different in each type of skeletal element. The crystal shape and organization into macro structures depends on the presence of an organic matrix which creates the spaces and controls the environments for crystal initiation and growth. The detailed mechanisms of crystal regulation are not known, but much work has been done on defining the proteins which appear to be involved. Phosphorylated matrix proteins may be of special importance. Biochemical isolation of proteins, construction and analysis of cDNA libraries, and most recently high-throughput proteomic analysis in conjunction with the sequencing of the complete genome have yielded a detailed list of protein components likely to be involved in the mineralization processes. However, the proteome-genome analyses have not yet provided insight into the mechanisms of crystallization, calcite composition, and orientation applicable to all skeletal elements. Although the embryonic pluteus and their spicules are the best studied system, it appears that spicule is not representative of the mature skeletal elements. Now armed with the compositions of most of the proteins involved, the next phase of research will have to focus on the specific localization of the proteins and individual biochemistries of each system with regard to mineral content and placement. PMID:21622194

  1. Rapid analysis of protein backbone resonance assignments using cryogenic probes, a distributed Linux-based computing architecture, and an integrated set of spectral analysis tools.

    PubMed

    Monleón, Daniel; Colson, Kimberly; Moseley, Hunter N B; Anklin, Clemens; Oswald, Robert; Szyperski, Thomas; Montelione, Gaetano T

    2002-01-01

    Rapid data collection, spectral referencing, processing by time domain deconvolution, peak picking and editing, and assignment of NMR spectra are necessary components of any efficient integrated system for protein NMR structure analysis. We have developed a set of software tools designated AutoProc, AutoPeak, and AutoAssign, which function together with the data processing and peak-picking programs NMRPipe and Sparky, to provide an integrated software system for rapid analysis of protein backbone resonance assignments. In this paper we demonstrate that these tools, together with high-sensitivity triple resonance NMR cryoprobes for data collection and a Linux-based computer cluster architecture, can be combined to provide nearly complete backbone resonance assignments and secondary structures (based on chemical shift data) for a 59-residue protein in less than 30 hours of data collection and processing time. In this optimum case of a small protein providing excellent spectra, extensive backbone resonance assignments could also be obtained using less than 6 hours of data collection and processing time. These results demonstrate the feasibility of high throughput triple resonance NMR for determining resonance assignments and secondary structures of small proteins, and the potential for applying NMR in large scale structural proteomics projects.

  2. The future of targeted peptidomics.

    PubMed

    Findeisen, Peter

    2013-12-01

    Targeted MS is becoming increasingly important for sensitive and specific quantitative detection of proteins and respective PTMs. In this article, Ceglarek et al. [Proteomics Clin. Appl. 2013, 7, 794-801] present an LC-MS-based method for simultaneous quantitation of seven apolipoproteins in serum specimens. The assay fulfills many necessities of routine diagnostic applications, namely, low cost, high throughput, and good reproducibility. We anticipate that validation of new biomarkers will speed up with this technology and the palette of laboratory-based diagnostic tools will hopefully be augmented significantly in the near future. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. Alternative polyadenylation: New insights from global analyses

    PubMed Central

    Shi, Yongsheng

    2012-01-01

    Recent studies have revealed widespread mRNA alternative polyadenylation (APA) in eukaryotes and its dynamic spatial and temporal regulation. APA not only generates proteomic and functional diversity, but also plays important roles in regulating gene expression. Global deregulation of APA has been demonstrated in a variety of human diseases. Recent exciting advances in the field have been made possible in a large part by high throughput analyses using newly developed experimental tools. Here I review the recent progress in global studies of APA and the insights that have emerged from these and other studies that use more conventional methods. PMID:23097429

  4. [Development and Application of Metabonomics in Forensic Toxicology].

    PubMed

    Yan, Hui; Shen, Min

    2015-06-01

    Metabonomics is an important branch of system biology following the development of genomics, transcriptomics and proteomics. It can perform high-throughput detection and data processing with multiple parameters, potentially enabling the identification and quantification of all small metabolites in a biological system. It can be used to provide comprehensive information on the toxicity effects, toxicological mechanisms and biomarkers, sensitively finding the unusual metabolic changes caused by poison. This article mainly reviews application of metabonomics in toxicological studies of abused drugs, pesticides, poisonous plants and poisonous animals, and also illustrates the new direction of forensic toxicology research.

  5. MannDB: A microbial annotation database for protein characterization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, C; Lam, M; Smith, J

    2006-05-19

    MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-sourcemore » tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports.« less

  6. Structural Analysis of PTM Hotspots (SAPH-ire)--A Quantitative Informatics Method Enabling the Discovery of Novel Regulatory Elements in Protein Families.

    PubMed

    Dewhurst, Henry M; Choudhury, Shilpa; Torres, Matthew P

    2015-08-01

    Predicting the biological function potential of post-translational modifications (PTMs) is becoming increasingly important in light of the exponential increase in available PTM data from high-throughput proteomics. We developed structural analysis of PTM hotspots (SAPH-ire)--a quantitative PTM ranking method that integrates experimental PTM observations, sequence conservation, protein structure, and interaction data to allow rank order comparisons within or between protein families. Here, we applied SAPH-ire to the study of PTMs in diverse G protein families, a conserved and ubiquitous class of proteins essential for maintenance of intracellular structure (tubulins) and signal transduction (large and small Ras-like G proteins). A total of 1728 experimentally verified PTMs from eight unique G protein families were clustered into 451 unique hotspots, 51 of which have a known and cited biological function or response. Using customized software, the hotspots were analyzed in the context of 598 unique protein structures. By comparing distributions of hotspots with known versus unknown function, we show that SAPH-ire analysis is predictive for PTM biological function. Notably, SAPH-ire revealed high-ranking hotspots for which a functional impact has not yet been determined, including phosphorylation hotspots in the N-terminal tails of G protein gamma subunits--conserved protein structures never before reported as regulators of G protein coupled receptor signaling. To validate this prediction we used the yeast model system for G protein coupled receptor signaling, revealing that gamma subunit-N-terminal tail phosphorylation is activated in response to G protein coupled receptor stimulation and regulates protein stability in vivo. These results demonstrate the utility of integrating protein structural and sequence features into PTM prioritization schemes that can improve the analysis and functional power of modification-specific proteomics data. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. Identification and Characterization of Human Proteoforms by Top-Down LC-21 Tesla FT-ICR Mass Spectrometry.

    PubMed

    Anderson, Lissa C; DeHart, Caroline J; Kaiser, Nathan K; Fellers, Ryan T; Smith, Donald F; Greer, Joseph B; LeDuc, Richard D; Blakney, Greg T; Thomas, Paul M; Kelleher, Neil L; Hendrickson, Christopher L

    2017-02-03

    Successful high-throughput characterization of intact proteins from complex biological samples by mass spectrometry requires instrumentation capable of high mass resolving power, mass accuracy, sensitivity, and spectral acquisition rate. These limitations often necessitate the performance of hundreds of LC-MS/MS experiments to obtain reasonable coverage of the targeted proteome, which is still typically limited to molecular weights below 30 kDa. The National High Magnetic Field Laboratory (NHMFL) recently installed a 21 T FT-ICR mass spectrometer, which is part of the NHMFL FT-ICR User Facility and available to all qualified users. Here we demonstrate top-down LC-21 T FT-ICR MS/MS of intact proteins derived from human colorectal cancer cell lysate. We identified a combined total of 684 unique protein entries observed as 3238 unique proteoforms at a 1% false discovery rate, based on rapid, data-dependent acquisition of collision-induced and electron-transfer dissociation tandem mass spectra from just 40 LC-MS/MS experiments. Our identifications included 372 proteoforms with molecular weights over 30 kDa detected at isotopic resolution, which substantially extends the accessible mass range for high-throughput top-down LC-MS/MS.

  8. ESCAPE: database for integrating high-content published data collected from human and mouse embryonic stem cells.

    PubMed

    Xu, Huilei; Baroukh, Caroline; Dannenfelser, Ruth; Chen, Edward Y; Tan, Christopher M; Kou, Yan; Kim, Yujin E; Lemischka, Ihor R; Ma'ayan, Avi

    2013-01-01

    High content studies that profile mouse and human embryonic stem cells (m/hESCs) using various genome-wide technologies such as transcriptomics and proteomics are constantly being published. However, efforts to integrate such data to obtain a global view of the molecular circuitry in m/hESCs are lagging behind. Here, we present an m/hESC-centered database called Embryonic Stem Cell Atlas from Pluripotency Evidence integrating data from many recent diverse high-throughput studies including chromatin immunoprecipitation followed by deep sequencing, genome-wide inhibitory RNA screens, gene expression microarrays or RNA-seq after knockdown (KD) or overexpression of critical factors, immunoprecipitation followed by mass spectrometry proteomics and phosphoproteomics. The database provides web-based interactive search and visualization tools that can be used to build subnetworks and to identify known and novel regulatory interactions across various regulatory layers. The web-interface also includes tools to predict the effects of combinatorial KDs by additive effects controlled by sliders, or through simulation software implemented in MATLAB. Overall, the Embryonic Stem Cell Atlas from Pluripotency Evidence database is a comprehensive resource for the stem cell systems biology community. Database URL: http://www.maayanlab.net/ESCAPE

  9. Measuring molecular biomarkers in epidemiologic studies: laboratory techniques and biospecimen considerations.

    PubMed

    Erickson, Heidi S

    2012-09-28

    The future of personalized medicine depends on the ability to efficiently and rapidly elucidate a reliable set of disease-specific molecular biomarkers. High-throughput molecular biomarker analysis methods have been developed to identify disease risk, diagnostic, prognostic, and therapeutic targets in human clinical samples. Currently, high throughput screening allows us to analyze thousands of markers from one sample or one marker from thousands of samples and will eventually allow us to analyze thousands of markers from thousands of samples. Unfortunately, the inherent nature of current high throughput methodologies, clinical specimens, and cost of analysis is often prohibitive for extensive high throughput biomarker analysis. This review summarizes the current state of high throughput biomarker screening of clinical specimens applicable to genetic epidemiology and longitudinal population-based studies with a focus on considerations related to biospecimens, laboratory techniques, and sample pooling. Copyright © 2012 John Wiley & Sons, Ltd.

  10. A Systems Level Analysis Reveals Transcriptomic and Proteomic Complexity in Ixodes Ricinus Midgut and Salivary Glands During Early Attachment and Feeding*

    PubMed Central

    Schwarz, Alexandra; Tenzer, Stefan; Hackenberg, Michael; Erhart, Jan; Gerhold-Ay, Aslihan; Mazur, Johanna; Kuharev, Jörg; Ribeiro, José M. C.; Kotsyfakis, Michail

    2014-01-01

    Although pathogens are usually transmitted within the first 24–48 h of attachment of the castor bean tick Ixodes ricinus, little is known about the tick's biological responses at these earliest phases of attachment. Tick midgut and salivary glands are the main tissues involved in tick blood feeding and pathogen transmission but the limited genomic information for I. ricinus delays the application of high-throughput methods to study their physiology. We took advantage of the latest advances in the fields of Next Generation RNA-Sequencing and Label-free Quantitative Proteomics to deliver an unprecedented, quantitative description of the gene expression dynamics in the midgut and salivary glands of this disease vector upon attachment to the vertebrate host. A total of 373 of 1510 identified proteins had higher expression in the salivary glands, but only 110 had correspondingly high transcript levels in the same tissue. Furthermore, there was midgut-specific expression of 217 genes at both the transcriptome and proteome level. Tissue-dependent transcript, but not protein, accumulation was revealed for 552 of 885 genes. Moreover, we discovered the enrichment of tick salivary glands in proteins involved in gene transcription and translation, which agrees with the secretory role of this tissue; this finding also agrees with our finding of lower tick t-RNA representation in the salivary glands when compared with the midgut. The midgut, in turn, is enriched in metabolic components and proteins that support its mechanical integrity in order to accommodate and metabolize the ingested blood. Beyond understanding the physiological events that support hematophagy by arthropod ectoparasites, we discovered more than 1500 proteins located at the interface between ticks, the vertebrate host, and the tick-borne pathogens. Thus, our work significantly improves the knowledge of the genetics underlying the transmission lifecycle of this tick species, which is an essential step for developing alternative methods to better control tick-borne diseases. PMID:25048707

  11. Cytomics - importance of multimodal analysis of cell function and proliferation in oncology.

    PubMed

    Tárnok, A; Bocsi, J; Brockhoff, G

    2006-12-01

    Cancer is a highly complex and heterogeneous disease involving a succession of genetic changes (frequently caused or accompanied by exogenous trauma), and resulting in a molecular phenotype that in turn results in a malignant specification. The development of malignancy has been described as a multistep process involving self-sufficiency in growth signals, insensitivity to antigrowth signals, evasion of apoptosis, limitless replicative potential, sustained angiogenesis, and finally tissue invasion and metastasis. The quantitative analysis of networking molecules within the cells might be applied to understand native-state tissue signalling biology, complex drug actions and dysfunctional signalling in transformed cells, that is, in cancer cells. High-content and high-throughput single-cell analysis can lead to systems biology and cytomics. The application of cytomics in cancer research and diagnostics is very broad, ranging from the better understanding of the tumour cell biology to the identification of residual tumour cells after treatment, to drug discovery. The ultimate goal is to pinpoint in detail these processes on the molecular, cellular and tissue level. A comprehensive knowledge of these will require tissue analysis, which is multiplex and functional; thus, vast amounts of data are being collected from current genomic and proteomic platforms for integration and interpretation as well as for new varieties of updated cytomics technology. This overview will briefly highlight the most important aspects of this continuously developing field.

  12. Matrix metalloproteinase proteomics: substrates, targets, and therapy.

    PubMed

    Morrison, Charlotte J; Butler, Georgina S; Rodríguez, David; Overall, Christopher M

    2009-10-01

    Proteomics encompasses powerful techniques termed 'degradomics' for unbiased high-throughput protease substrate discovery screens that have been applied to an important family of extracellular proteases, the matrix metalloproteinases (MMPs). Together with the data generated from genetic deletion and transgenic mouse models and genomic profiling, these screens can uncover the diverse range of MMP functions, reveal which MMPs and MMP-mediated pathways exacerbate pathology, and which are involved in protection and the resolution of disease. This information can be used to identify and validate candidate drug targets and antitargets, and is critical for the development of new inhibitors of MMP function. Such inhibitors may target either the MMP directly in a specific manner or pathways upstream and downstream of MMP activity that are mediating deleterious effects in disease. Since MMPs do not operate alone but are part of the 'protease web', it is necessary to use system-wide approaches to understand MMP proteolysis in vivo, to discover new biological roles and their potential for therapeutic modification.

  13. Integration of Proteomic, Transcriptional, and Interactome Data Reveals Hidden Signaling Components

    PubMed Central

    Huang, Shao-shan Carol; Fraenkel, Ernest

    2009-01-01

    Cellular signaling and regulatory networks underlie fundamental biological processes such as growth, differentiation, and response to the environment. Although there are now various high-throughput methods for studying these processes, knowledge of them remains fragmentary. Typically, the vast majority of hits identified by transcriptional, proteomic, and genetic assays lie outside of the expected pathways. These unexpected components of the cellular response are often the most interesting, because they can provide new insights into biological processes and potentially reveal new therapeutic approaches. However, they are also the most difficult to interpret. We present a technique, based on the Steiner tree problem, that uses previously reported protein-protein and protein-DNA interactions to determine how these hits are organized into functionally coherent pathways, revealing many components of the cellular response that are not readily apparent in the original data. Applied simultaneously to phosphoproteomic and transcriptional data for the yeast pheromone response, it identifies changes in diverse cellular processes that extend far beyond the expected pathways. PMID:19638617

  14. A cell death assay for assessing the mitochondrial targeting of proteins.

    PubMed

    Camara Teixeira, Daniel; Cordonier, Elizabeth L; Wijeratne, Subhashinee S K; Huebbe, Patricia; Jamin, Augusta; Jarecke, Sarah; Wiebe, Matthew; Zempleni, Janos

    2018-06-01

    The mitochondrial proteome comprises 1000 to 1500 proteins, in addition to proteins for which the mitochondrial localization is uncertain. About 800 diseases have been linked with mutations in mitochondrial proteins. We devised a cell survival assay for assessing the mitochondrial localization in a high-throughput format. This protocol allows us to assess the mitochondrial localization of proteins and their mutants, and to identify drugs and nutrients that modulate the mitochondrial targeting of proteins. The assay works equally well for proteins directed to the outer mitochondrial membrane, inner mitochondrial membrane mitochondrial and mitochondrial matrix, as demonstrated by assessing the mitochondrial targeting of the following proteins: carnitine palmitoyl transferase 1 (consensus sequence and R123C mutant), acetyl-CoA carboxylase 2, uncoupling protein 1 and holocarboxylase synthetase. Our screen may be useful for linking the mitochondrial proteome with rare diseases and for devising drug- and nutrition-based strategies for altering the mitochondrial targeting of proteins. Copyright © 2018 Elsevier Inc. All rights reserved.

  15. Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework

    PubMed Central

    2012-01-01

    Background For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. Results We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. Conclusion The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources. PMID:23216909

  16. Isocitrate dehydrogenase mutations confer dasatinib hypersensitivity and SRC-dependence in intrahepatic cholangiocarcinoma

    PubMed Central

    Saha, Supriya K.; Gordan, John D.; Kleinstiver, Benjamin P.; Vu, Phuong; Najem, Mortada S.; Yeo, Jia-Chi; Shi, Lei; Kato, Yasutaka; Levin, Rebecca S.; Webber, James T.; Damon, Leah J.; Egan, Regina K.; Greninger, Patricia; McDermott, Ultan; Garnett, Mathew J.; Jenkins, Roger L.; Rieger-Christ, Kimberly M.; Sullivan, Travis B.; Hezel, Aram F.; Liss, Andrew S.; Mizukami, Yusuke; Goyal, Lipika; Ferrone, Cristina R.; Zhu, Andrew X.; Joung, J. Keith; Shokat, Kevan M.; Benes, Cyril H.; Bardeesy, Nabeel

    2017-01-01

    Intrahepatic cholangiocarcinoma (ICC) is an aggressive liver bile duct malignancy exhibiting frequent isocitrate dehydrogenase (IDH1/IDH2) mutations. Through a high-throughput drug screen of a large panel of cancer cell lines including 17 biliary tract cancers, we found that IDH mutant (IDHm) ICC cells demonstrate a striking response to the multi-kinase inhibitor dasatinib, with the highest sensitivity among 682 solid tumor cell lines. Using unbiased proteomics to capture the activated kinome and CRISPR/Cas9-based genome editing to introduce dasatinib-resistant ‘gatekeeper’ mutant kinases, we identified SRC as a critical dasatinib target in IDHm ICC. Importantly, dasatinib-treated IDHm xenografts exhibited pronounced apoptosis and tumor regression. Our results show that IDHm ICC cells have a unique dependency on SRC and suggest that dasatinib may have therapeutic benefit against IDHm ICC. Moreover, these proteomic and genome-editing strategies provide a systematic and broadly applicable approach to define targets of kinase inhibitors underlying drug responsiveness. PMID:27231123

  17. Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework.

    PubMed

    Lewis, Steven; Csordas, Attila; Killcoyne, Sarah; Hermjakob, Henning; Hoopmann, Michael R; Moritz, Robert L; Deutsch, Eric W; Boyle, John

    2012-12-05

    For shotgun mass spectrometry based proteomics the most computationally expensive step is in matching the spectra against an increasingly large database of sequences and their post-translational modifications with known masses. Each mass spectrometer can generate data at an astonishingly high rate, and the scope of what is searched for is continually increasing. Therefore solutions for improving our ability to perform these searches are needed. We present a sequence database search engine that is specifically designed to run efficiently on the Hadoop MapReduce distributed computing framework. The search engine implements the K-score algorithm, generating comparable output for the same input files as the original implementation. The scalability of the system is shown, and the architecture required for the development of such distributed processing is discussed. The software is scalable in its ability to handle a large peptide database, numerous modifications and large numbers of spectra. Performance scales with the number of processors in the cluster, allowing throughput to expand with the available resources.

  18. Proteome-wide covalent ligand discovery in native biological systems

    PubMed Central

    Backus, Keriann M.; Correia, Bruno E.; Lum, Kenneth M.; Forli, Stefano; Horning, Benjamin D.; González-Páez, Gonzalo E.; Chatterjee, Sandip; Lanning, Bryan R.; Teijaro, John R.; Olson, Arthur J.; Wolan, Dennis W.; Cravatt, Benjamin F.

    2016-01-01

    Small molecules are powerful tools for investigating protein function and can serve as leads for new therapeutics. Most human proteins, however, lack small-molecule ligands, and entire protein classes are considered “undruggable” 1,2. Fragment-based ligand discovery (FBLD) can identify small-molecule probes for proteins that have proven difficult to target using high-throughput screening of complex compound libraries 1,3. Although reversibly binding ligands are commonly pursued, covalent fragments provide an alternative route to small-molecule probes 4–10, including those that can access regions of proteins that are difficult to access through binding affinity alone 5,10,11. In this manuscript, we report a quantitative analysis of cysteine-reactive small-molecule fragments screened against thousands of proteins. Covalent ligands were identified for >700 cysteines found in both druggable proteins and proteins deficient in chemical probes, including transcription factors, adaptor/scaffolding proteins, and uncharacterized proteins. Among the atypical ligand-protein interactions discovered were compounds that react preferentially with pro- (inactive) caspases. We used these ligands to distinguish extrinsic apoptosis pathways in human cell lines versus primary human T-cells, showing that the former is largely mediated by caspase-8 while the latter depends on both caspase-8 and −10. Fragment-based covalent ligand discovery provides a greatly expanded portrait of the ligandable proteome and furnishes compounds that can illuminate protein functions in native biological systems. PMID:27309814

  19. Novel ageing-biomarker discovery using data-intensive technologies.

    PubMed

    Griffiths, H R; Augustyniak, E M; Bennett, S J; Debacq-Chainiaux, F; Dunston, C R; Kristensen, P; Melchjorsen, C J; Navarrete, Santos A; Simm, A; Toussaint, O

    2015-11-01

    Ageing is accompanied by many visible characteristics. Other biological and physiological markers are also well-described e.g. loss of circulating sex hormones and increased inflammatory cytokines. Biomarkers for healthy ageing studies are presently predicated on existing knowledge of ageing traits. The increasing availability of data-intensive methods enables deep-analysis of biological samples for novel biomarkers. We have adopted two discrete approaches in MARK-AGE Work Package 7 for biomarker discovery; (1) microarray analyses and/or proteomics in cell systems e.g. endothelial progenitor cells or T cell ageing including a stress model; and (2) investigation of cellular material and plasma directly from tightly-defined proband subsets of different ages using proteomic, transcriptomic and miR array. The first approach provided longitudinal insight into endothelial progenitor and T cell ageing. This review describes the strategy and use of hypothesis-free, data-intensive approaches to explore cellular proteins, miR, mRNA and plasma proteins as healthy ageing biomarkers, using ageing models and directly within samples from adults of different ages. It considers the challenges associated with integrating multiple models and pilot studies as rational biomarkers for a large cohort study. From this approach, a number of high-throughput methods were developed to evaluate novel, putative biomarkers of ageing in the MARK-AGE cohort. Crown Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.

  20. Comparison of the adolescent and adult mouse prefrontal cortex proteome

    PubMed Central

    Small, Amanda T.; Spanos, Marina; Burrus, Brainard M.

    2017-01-01

    Adolescence is a developmental period characterized by unique behavioral phenotypes (increased novelty seeking, risk taking, sociability and impulsivity) and increased risk for destructive behaviors, impaired decision making and psychiatric illness. Adaptive and maladaptive adolescent traits have been associated with development of the medial prefrontal cortex (mPFC), a brain region that mediates regulatory control of behavior. However, the molecular changes that underlie brain development and behavioral vulnerability have not been fully characterized. Using high-throughput 2D DIGE spot profiling with identification by MALDI-TOF mass spectrometry, we identified 62 spots in the PFC that exhibited age-dependent differences in expression. Identified proteins were associated with diverse cellular functions, including intracellular signaling, synaptic plasticity, cellular organization and metabolism. Separate Western blot analyses confirmed age-related changes in DPYSL2, DNM1, STXBP1 and CFL1 in the mPFC and expanded these findings to the dorsal striatum, nucleus accumbens, motor cortex, amygdala and ventral tegmental area. Ingenuity Pathway Analysis (IPA) identified functional interaction networks enriched with proteins identified in the proteomics screen, linking age-related alterations in protein expression to cellular assembly and development, cell signaling and behavior, and psychiatric illness. These results provide insight into potential molecular components of adolescent cortical development, implicating structural processes that begin during embryonic development as well as plastic adaptations in signaling that may work in concert to bring the cortex, and other brain regions, into maturity. PMID:28570644

Top