An object model and database for functional genomics.
Jones, Andrew; Hunt, Ela; Wastling, Jonathan M; Pizarro, Angel; Stoeckert, Christian J
2004-07-10
Large-scale functional genomics analysis is now feasible and presents significant challenges in data analysis, storage and querying. Data standards are required to enable the development of public data repositories and to improve data sharing. There is an established data format for microarrays (microarray gene expression markup language, MAGE-ML) and a draft standard for proteomics (PEDRo). We believe that all types of functional genomics experiments should be annotated in a consistent manner, and we hope to open up new ways of comparing multiple datasets used in functional genomics. We have created a functional genomics experiment object model (FGE-OM), developed from the microarray model, MAGE-OM and two models for proteomics, PEDRo and our own model (Gla-PSI-Glasgow Proposal for the Proteomics Standards Initiative). FGE-OM comprises three namespaces representing (i) the parts of the model common to all functional genomics experiments; (ii) microarray-specific components; and (iii) proteomics-specific components. We believe that FGE-OM should initiate discussion about the contents and structure of the next version of MAGE and the future of proteomics standards. A prototype database called RNA And Protein Abundance Database (RAPAD), based on FGE-OM, has been implemented and populated with data from microbial pathogenesis. FGE-OM and the RAPAD schema are available from http://www.gusdb.org/fge.html, along with a set of more detailed diagrams. RAPAD can be accessed by registration at the site.
Schröder, Christoph; Jacob, Anette; Tonack, Sarah; Radon, Tomasz P.; Sill, Martin; Zucknick, Manuela; Rüffer, Sven; Costello, Eithne; Neoptolemos, John P.; Crnogorac-Jurcevic, Tatjana; Bauer, Andrea; Fellenberg, Kurt; Hoheisel, Jörg D.
2010-01-01
Antibody microarrays have the potential to enable comprehensive proteomic analysis of small amounts of sample material. Here, protocols are presented for the production, quality assessment, and reproducible application of antibody microarrays in a two-color mode with an array of 1,800 features, representing 810 antibodies that were directed at 741 cancer-related proteins. In addition to measures of array quality, we implemented indicators for the accuracy and significance of dual-color detection. Dual-color measurements outperform a single-color approach concerning assay reproducibility and discriminative power. In the analysis of serum samples, depletion of high-abundance proteins did not improve technical assay quality. On the contrary, depletion introduced a strong bias in protein representation. In an initial study, we demonstrated the applicability of the protocols to proteins derived from urine samples. We identified differences between urine samples from pancreatic cancer patients and healthy subjects and between sexes. This study demonstrates that biomedically relevant data can be produced. As demonstrated by the thorough quality analysis, the dual-color antibody array approach proved to be competitive with other proteomic techniques and comparable in performance to transcriptional microarray analyses. PMID:20164060
Seliger, Barbara; Dressler, Sven P.; Wang, Ena; Kellner, Roland; Recktenwald, Christian V.; Lottspeich, Friedrich; Marincola, Francesco M.; Baumgärtner, Maja; Atkins, Derek; Lichtenfels, Rudolf
2012-01-01
Results obtained from expression profiling of renal cell carcinoma using different “ome”-based approaches and comprehensive data analysis demonstrated that proteome-based technologies and cDNA microarray analyses complement each other during the discovery phase for disease-related candidate biomarkers. The integration of the respective data revealed the uniqueness and complementarity of the different technologies. While comparative cDNA microarray analyses, though restricted to upregulated targets, largely revealed genes involved in controlling gene/protein expression (19%) and signal transduction processes (13%), proteomics/PROTEOMEX-defined candidate biomarkers include enzymes of the cellular metabolism (36%), transport proteins (12%) and cell motility/structural molecules (10%). Candidate biomarkers defined by proteomics and PROTEOMEX are frequently shared, whereas the sharing rate between cDNA microarray and proteome-based profiling is limited. Putative candidate biomarkers provide insights into their cellular (dys)function and their diagnostic/prognostic value but still warrant further validation in larger patient numbers. Because merely 3 candidate biomarkers were shared by all applied technologies, namely annexin A4, tubulin alpha-1A chain and ubiquitin carboxyl-terminal hydrolase L1, analysis at a single hierarchical level of biological regulation seems to provide only limited results, emphasizing the importance and benefit of combinatorial screenings, which can complement the standard clinical predictors. PMID:19235166
Direct labeling of serum proteins by fluorescent dye for antibody microarray.
Klimushina, M V; Gumanova, N G; Metelskaya, V A
2017-05-06
Analysis of the serum proteome by antibody microarray is used to identify novel biomarkers and to study signaling pathways, including protein phosphorylation and protein-protein interactions. Labeling of serum proteins is important for optimal performance of the antibody microarray. Proper choice of fluorescent label and optimal concentration of protein loaded on the microarray ensure good-quality images that can be reliably scanned and processed by the software. We have optimized direct serum protein labeling using the fluorescent dye Arrayit Green 540 (Arrayit Corporation, USA) for antibody microarray. The optimized procedure produces high-quality images that can be readily scanned and used for statistical analysis of the protein composition of the serum. Copyright © 2017 Elsevier Inc. All rights reserved.
ArrayNinja: An Open Source Platform for Unified Planning and Analysis of Microarray Experiments.
Dickson, B M; Cornett, E M; Ramjan, Z; Rothbart, S B
2016-01-01
Microarray-based proteomic platforms have emerged as valuable tools for studying various aspects of protein function, particularly in the field of chromatin biochemistry. Microarray technology itself is largely unrestricted in regard to printable material and platform design, and efficient multidimensional optimization of assay parameters requires fluidity in the design and analysis of custom print layouts. This motivates the need for streamlined software infrastructure that facilitates the combined planning and analysis of custom microarray experiments. To this end, we have developed ArrayNinja as a portable, open source, and interactive application that unifies the planning and visualization of microarray experiments and provides maximum flexibility to end users. Array experiments can be planned, stored to a private database, and merged with the imaged results for a level of data interaction and centralization that is not currently attainable with available microarray informatics tools. © 2016 Elsevier Inc. All rights reserved.
Trivedi, Prinal; Edwards, Jode W; Wang, Jelai; Gadbury, Gary L; Srinivasasainagendra, Vinodh; Zakharkin, Stanislav O; Kim, Kyoungmi; Mehta, Tapan; Brand, Jacob P L; Patki, Amit; Page, Grier P; Allison, David B
2005-04-06
Many efforts in microarray data analysis are focused on providing tools and methods for the qualitative analysis of microarray data. HDBStat! (High-Dimensional Biology-Statistics) is a software package designed for the analysis of high-dimensional biology data such as microarray data. It was initially developed for the analysis of microarray gene expression data, but it can also be used for some applications in proteomics and other aspects of genomics. HDBStat! provides statisticians and biologists with a flexible and easy-to-use interface to analyze complex microarray data using a variety of methods for data preprocessing, quality control analysis and hypothesis testing. Results generated by the data preprocessing, quality control analysis and hypothesis testing methods are output in the form of Excel CSV tables, graphs and an HTML report summarizing the data analysis. HDBStat! is platform-independent software that is freely available to academic institutions and non-profit organizations. It can be downloaded from our website http://www.soph.uab.edu/ssg_content.asp?id=1164.
Pérez-Bercoff, Lena; Valentini, Davide; Gaseitsiwe, Simani; Mahdavifar, Shahnaz; Schutkowski, Mike; Poiret, Thomas; Pérez-Bercoff, Åsa; Ljungman, Per; Maeurer, Markus J.
2014-01-01
Cytomegalovirus (CMV) infection represents a vital complication after Hematopoietic Stem Cell Transplantation (HSCT). We screened the entire CMV proteome to visualize the humoral target epitope-focus profile in serum after HSCT. IgG profiling from four patient groups (donor and/or recipient +/− for CMV) was performed at 6, 12 and 24 months after HSCT using microarray slides containing 17,174 15-mer peptides overlapping by 4 aa and covering 214 proteins from CMV. Data were analyzed using maSigPro, PAM and the ‘exclusive recognition analysis (ERA)’ to identify unique CMV epitope responses for each patient group. The ‘exclusive recognition analysis’ of serum epitope patterns segregated best 12 months after HSCT for the D+/R+ group (versus D−/R−). Epitopes were derived from UL123 (IE1), UL99 (pp28) and UL32 (pp150); this changed at 24 months to 2 strongly recognized peptides from UL123 and UL100. CMV targets strongly recognized by IgG also elicited robust cytokine production in T-cells from patients after HSCT, as defined by intracellular cytokine staining (IL-2, TNF, IFN and IL-17). High-content peptide microarrays allow epitope profiling of entire viral proteomes; this approach can be useful to map relevant targets for diagnostics and therapy in patients with well-defined clinical endpoints. Peptide microarray analysis visualizes the breadth of B-cell immune reconstitution after HSCT and provides a useful tool to gauge immune reconstitution. PMID:24740411
Stare, Tjaša; Stare, Katja; Weckwerth, Wolfram; Wienkoop, Stefanie; Gruden, Kristina
2017-07-06
Plant diseases caused by viral infection affect all major crops. Because viruses are obligate intracellular organisms, chemical control of these pathogens is so far not applied in the field, except to control the insect vectors of the viruses. Understanding the molecular responses of plant immunity is therefore economically important, guiding the enforcement of crop resistance. To disentangle the complex regulatory mechanisms of plant immune responses, understanding the system as a whole is a must. However, integrating data from different molecular analyses (transcriptomics, proteomics, metabolomics, smallRNA regulation etc.) is not straightforward. We evaluated the response of potato (Solanum tuberosum L.) following infection with potato virus Y (PVY). The response was analyzed on two molecular levels, with microarray transcriptome analysis and mass spectrometry-based proteomics. Within this report, we performed a detailed analysis of the results on both levels and compared two different approaches for the analysis of proteomic data (spectral count versus MaxQuant). To link the data on different molecular levels, each protein was mapped to the corresponding potato transcript according to StNIB paralogue grouping. Only 33% of the proteins mapped to microarray probes in a one-to-one relation, and many additionally showed discordance between detected protein levels and those of the corresponding transcripts. We discussed the functional importance of true biological differences between both levels and showed that the discordance between transcript and protein abundance stems partly from the complexity and structure of the biological regulation of the proteome and transcriptome and partly from technical issues.
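The one-to-one mapping criterion mentioned in this abstract can be illustrated with a minimal sketch. It assumes a hypothetical dictionary from protein identifiers to the set of microarray probes they map to; the identifiers and the function name `one_to_one_fraction` are illustrative and are not taken from the study, whose actual mapping relied on StNIB paralogue grouping.

```python
from collections import defaultdict


def one_to_one_fraction(protein_to_probes):
    """Return the fraction of proteins that map to exactly one probe
    which is, in turn, mapped by no other protein.

    protein_to_probes: dict of protein id -> set of probe ids
    (a hypothetical input format, assumed for illustration).
    """
    if not protein_to_probes:
        raise ValueError("no proteins supplied")

    # Invert the mapping so we can see how many proteins share each probe.
    probe_to_proteins = defaultdict(set)
    for protein, probes in protein_to_probes.items():
        for probe in probes:
            probe_to_proteins[probe].add(protein)

    one_to_one = 0
    for protein, probes in protein_to_probes.items():
        if len(probes) != 1:
            continue  # protein maps to zero or several probes
        (probe,) = probes
        if len(probe_to_proteins[probe]) == 1:
            one_to_one += 1  # probe is not shared with another protein

    return one_to_one / len(protein_to_probes)


# Toy data: only P1 has an unshared, single probe.
toy_mapping = {
    "P1": {"probeA"},
    "P2": {"probeB", "probeC"},
    "P3": {"probeD"},
    "P4": {"probeD"},
}
fraction = one_to_one_fraction(toy_mapping)
```

Counting only unambiguous pairs in this way is one possible reading of "one-to-one relation"; a real analysis would also need a rule for proteins with no matching probe.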
Stare, Tjaša; Stare, Katja; Weckwerth, Wolfram; Wienkoop, Stefanie
2017-01-01
Plant diseases caused by viral infection affect all major crops. Because viruses are obligate intracellular organisms, chemical control of these pathogens is so far not applied in the field, except to control the insect vectors of the viruses. Understanding the molecular responses of plant immunity is therefore economically important, guiding the enforcement of crop resistance. To disentangle the complex regulatory mechanisms of plant immune responses, understanding the system as a whole is a must. However, integrating data from different molecular analyses (transcriptomics, proteomics, metabolomics, smallRNA regulation etc.) is not straightforward. We evaluated the response of potato (Solanum tuberosum L.) following infection with potato virus Y (PVY). The response was analyzed on two molecular levels, with microarray transcriptome analysis and mass spectrometry-based proteomics. Within this report, we performed a detailed analysis of the results on both levels and compared two different approaches for the analysis of proteomic data (spectral count versus MaxQuant). To link the data on different molecular levels, each protein was mapped to the corresponding potato transcript according to StNIB paralogue grouping. Only 33% of the proteins mapped to microarray probes in a one-to-one relation, and many additionally showed discordance between detected protein levels and those of the corresponding transcripts. We discussed the functional importance of true biological differences between both levels and showed that the discordance between transcript and protein abundance stems partly from the complexity and structure of the biological regulation of the proteome and transcriptome and partly from technical issues. PMID:28684682
Quantitative proteomic analysis of microdissected oral epithelium for cancer biomarker discovery.
Xiao, Hua; Langerman, Alexander; Zhang, Yan; Khalid, Omar; Hu, Shen; Cao, Cheng-Xi; Lingen, Mark W; Wong, David T W
2015-11-01
Specific biomarkers are urgently needed for the detection and progression of oral cancer. The objective of this study was to discover cancer biomarkers from oral epithelium through utilizing high throughput quantitative proteomics approaches. Morphologically malignant, epithelial dysplasia, and adjacent normal epithelial tissues were laser capture microdissected (LCM) from 19 patients and used for proteomics analysis. Total proteins from each group were extracted, digested and then labelled with corresponding isobaric tags for relative and absolute quantitation (iTRAQ). Labelled peptides from each sample were combined and analyzed by liquid chromatography-mass spectrometry (LC-MS/MS) for protein identification and quantification. In total, 500 proteins were identified and 425 of them were quantified. When compared with adjacent normal oral epithelium, 17 and 15 proteins were consistently up-regulated or down-regulated in malignant and epithelial dysplasia, respectively. Half of these candidate biomarkers were discovered for oral cancer for the first time. Cornulin was initially confirmed in tissue protein extracts and was further validated in tissue microarray. Its presence in the saliva of oral cancer patients was also explored. Myoglobin and S100A8 were pre-validated by tissue microarray. These data demonstrated that the proteomic biomarkers discovered through this strategy are potential targets for oral cancer detection and salivary diagnostics. Copyright © 2015 Elsevier Ltd. All rights reserved.
Li, Lingyun; Li, Qingbo; Rohlin, Lars; Kim, UnMi; Salmon, Kirsty; Rejtar, Tomas; Gunsalus, Robert P.; Karger, Barry L.; Ferry, James G.
2008-01-01
Methanosarcina acetivorans strain C2A is an acetate- and methanol-utilizing methane-producing organism for which the genome, the largest yet sequenced among the Archaea, reveals extensive physiological diversity. LC linear ion trap-FTICR mass spectrometry was employed to analyze acetate- vs. methanol-grown cells metabolically labeled with 14N vs. 15N, respectively, to obtain quantitative protein abundance ratios. DNA microarray analyses of acetate- vs. methanol-grown cells were also performed to determine gene expression ratios. The combined approaches were highly complementary, extending the physiological understanding of growth and methanogenesis. Of the 1081 proteins detected, 255 were ≥ 3-fold differentially abundant. DNA microarray analysis revealed 410 genes that were ≥ 2.5-fold differentially expressed out of 1972 genes with detected expression. The ratios of differentially abundant proteins were in good agreement with expression ratios of the encoding genes. Taken together, the results suggest several novel roles for electron transport components specific to acetate-grown cells, including two flavodoxins each specific for growth on acetate or methanol. Protein abundance ratios indicated that duplicate CO dehydrogenase/acetyl-CoA complexes function in the conversion of acetate to methane. Surprisingly, the protein abundance and gene expression ratios indicated a general stress response in acetate- vs. methanol-grown cells that included enzymes specific for polyphosphate accumulation and oxidative stress. The microarray analysis identified transcripts of several genes encoding regulatory proteins with identity to the PhoU, MarR, GlnK, and TetR families commonly found in the Bacteria domain. An analysis of neighboring genes suggested roles in controlling phosphate metabolism (PhoU), ammonia assimilation (GlnK), and molybdopterin cofactor biosynthesis (TetR). Finally, the proteomic and microarray results suggested roles for two-component regulatory systems specific for each growth substrate. PMID:17269732
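The fold-change thresholds quoted in this abstract (≥ 3-fold for proteins, ≥ 2.5-fold for transcripts) can be read as symmetric cut-offs on an abundance ratio. The following is a minimal sketch under that assumption; the function name `is_differential` and the treatment of ratios below 1 are illustrative choices, not the authors' documented procedure.

```python
def is_differential(ratio, fold_threshold):
    """Return True if an abundance ratio differs from 1 by at least
    `fold_threshold`-fold in either direction.

    ratio: abundance in condition A divided by abundance in condition B
           (for example, acetate-grown over methanol-grown cells).
    fold_threshold: 3.0 for proteins or 2.5 for transcripts, following
           the cut-offs quoted in the abstract.
    """
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    # Treat a 4-fold decrease (ratio 0.25) the same as a 4-fold increase.
    fold = max(ratio, 1.0 / ratio)
    return fold >= fold_threshold


# Example calls with made-up ratios.
protein_hit = is_differential(0.25, 3.0)     # 4-fold lower in condition A
transcript_miss = is_differential(2.0, 2.5)  # only 2-fold higher
```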
Analysis of Protein Expression in Cell Microarrays: A Tool for Antibody-based Proteomics
Andersson, Ann-Catrin; Strömberg, Sara; Bäckvall, Helena; Kampf, Caroline; Uhlen, Mathias; Wester, Kenneth; Pontén, Fredrik
2006-01-01
Tissue microarray (TMA) technology provides a possibility to explore protein expression patterns in a multitude of normal and disease tissues in a high-throughput setting. Although TMAs have been used for analysis of tissue samples, robust methods for studying in vitro cultured cell lines and cell aspirates in a TMA format have been lacking. We have adopted a technique to homogeneously distribute cells in an agarose gel matrix, creating an artificial tissue. This enables simultaneous profiling of protein expression in suspension- and adherent-grown cell samples assembled in a microarray. In addition, the present study provides an optimized strategy for the basic laboratory steps to efficiently produce TMAs. Presented modifications resulted in an improved quality of specimens and a higher section yield compared with standard TMA production protocols. Sections from the generated cell TMAs were tested for immunohistochemical staining properties using 20 well-characterized antibodies. Comparison of immunoreactivity in cultured dispersed cells and corresponding cells in tissue samples showed congruent results for all tested antibodies. We conclude that a modified TMA technique, including cell samples, provides a valuable tool for high-throughput analysis of protein expression, and that this technique can be used for global approaches to explore the human proteome. PMID:16957166
Identification of new autoantigens for primary biliary cirrhosis using human proteome microarrays.
Hu, Chao-Jun; Song, Guang; Huang, Wei; Liu, Guo-Zhen; Deng, Chui-Wen; Zeng, Hai-Pan; Wang, Li; Zhang, Feng-Chun; Zhang, Xuan; Jeong, Jun Seop; Blackshaw, Seth; Jiang, Li-Zhi; Zhu, Heng; Wu, Lin; Li, Yong-Zhe
2012-09-01
Primary biliary cirrhosis (PBC) is a chronic cholestatic liver disease of unknown etiology and is considered to be an autoimmune disease. Autoantibodies are important tools for accurate diagnosis of PBC. Here, we employed serum profiling analysis using a human proteome microarray composed of about 17,000 full-length unique proteins and identified 23 proteins that correlated with PBC. To validate these results, we fabricated a PBC-focused microarray with 21 of these newly identified candidates and nine additional known PBC antigens. By screening the PBC microarrays with additional cohorts of 191 PBC patients and 321 controls (43 autoimmune hepatitis, 55 hepatitis B virus, 31 hepatitis C virus, 48 rheumatoid arthritis, 45 systemic lupus erythematosus, 49 systemic sclerosis, and 50 healthy), six proteins were confirmed as novel PBC autoantigens with high sensitivities and specificities, including hexokinase-1 (isoforms I and II), Kelch-like protein 7, Kelch-like protein 12, zinc finger and BTB domain-containing protein 2, and eukaryotic translation initiation factor 2C, subunit 1. To facilitate clinical diagnosis, we developed ELISA for Kelch-like protein 12 and zinc finger and BTB domain-containing protein 2 and tested large cohorts (297 PBC and 637 control sera) to confirm the sensitivities and specificities observed in the microarray-based assays. In conclusion, our research showed that a strategy using a high-content protein microarray combined with a smaller but more focused protein microarray can effectively identify and validate novel PBC-specific autoantigens and has the capacity to be translated to clinical diagnosis by means of an ELISA-based method.
Weniger, Markus; Engelmann, Julia C; Schultz, Jörg
2007-01-01
Background: Regulation of gene expression is relevant to many areas of biology and medicine, in the study of treatments, diseases, and developmental stages. Microarrays can be used to measure the expression level of thousands of mRNAs at the same time, allowing insight into or comparison of different cellular conditions. Data derived from microarray experiments are high-dimensional and often noisy, and interpretation of the results can become intricate. Although programs for the statistical analysis of microarray data exist, most of them lack an integration of analysis results and biological interpretation. Results: We have developed GEPAT, Genome Expression Pathway Analysis Tool, offering an analysis of gene expression data in a genomic, proteomic and metabolic context. We provide an integration of statistical methods for data import and data analysis together with a biological interpretation for subsets of probes or single probes on the chip. GEPAT imports various types of oligonucleotide and cDNA array data formats. Different normalization methods can be applied to the data; afterwards, data annotation is performed. After import, GEPAT offers various statistical data analysis methods, such as hierarchical, k-means and PCA clustering, a linear model based t-test or chromosomal profile comparison. The results of the analysis can be interpreted by enrichment of biological terms, pathway analysis or interaction networks. Different biological databases are included to give various information for each probe on the chip. GEPAT does not impose a linear workflow, but allows any subset of probes and samples to be used as the start of a new data analysis. GEPAT relies on established data analysis packages, offers a modular approach for easy extension, and can be run on a computer grid to support a large number of users. It is freely available under the LGPL open source license for academic and commercial users at .
Conclusion: GEPAT is a modular, scalable and professional-grade software integrating analysis and interpretation of microarray gene expression data. An installation available for academic users can be found at . PMID:17543125
Statistical design of quantitative mass spectrometry-based proteomic experiments.
Oberg, Ann L; Vitek, Olga
2009-05-01
We review the fundamental principles of statistical experimental design, and their application to quantitative mass spectrometry-based proteomics. We focus on class comparison using Analysis of Variance (ANOVA), and discuss how randomization, replication and blocking help avoid systematic biases due to the experimental procedure, and help optimize our ability to detect true quantitative changes between groups. We also discuss the issues of pooling multiple biological specimens for a single mass analysis, and calculation of the number of replicates in a future study. When applicable, we emphasize the parallels between designing quantitative proteomic experiments and experiments with gene expression microarrays, and give examples from that area of research. We illustrate the discussion using theoretical considerations, and using real-data examples of profiling of disease.
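The interplay of randomization, replication and blocking described in this abstract can be sketched as a randomized complete block layout. The sketch below assumes each block (for example, one day of instrument time) holds exactly one sample per group, and that run order is randomized within each block; the function name `randomized_block_design` and the sample labels are hypothetical and do not come from the review.

```python
import random


def randomized_block_design(groups, n_blocks, seed=0):
    """Assign one sample per group to each block, in random run order.

    groups: dict of group name -> list of sample ids (assumed format).
    n_blocks: number of blocks, e.g. days or chromatography batches.
    seed: fixed seed so the layout can be reproduced and documented.
    Returns a list of blocks; each block is a list of (group, sample).
    """
    for name, ids in groups.items():
        if len(ids) < n_blocks:
            raise ValueError(f"group {name!r} has fewer samples than blocks")

    rng = random.Random(seed)
    # Randomize which biological replicate of each group goes to which block.
    shuffled = {name: rng.sample(ids, len(ids)) for name, ids in groups.items()}

    blocks = []
    for b in range(n_blocks):
        block = [(name, shuffled[name][b]) for name in groups]
        rng.shuffle(block)  # randomize run order within the block
        blocks.append(block)
    return blocks


# Toy example: three replicates per group spread over three blocks.
toy_groups = {
    "disease": ["d1", "d2", "d3"],
    "control": ["c1", "c2", "c3"],
}
design = randomized_block_design(toy_groups, n_blocks=3)
```

Because every block contains both groups, a block effect such as day-to-day instrument drift is not confounded with the group comparison, which is the point the abstract makes about avoiding systematic bias.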
Wagner, Wolfgang; Feldmann, Robert E; Seckinger, Anja; Maurer, Martin H; Wein, Frederik; Blake, Jonathon; Krause, Ulf; Kalenka, Armin; Bürgers, Heinrich F; Saffrich, Rainer; Wuchter, Patrick; Kuschinsky, Wolfgang; Ho, Anthony D
2006-04-01
Mesenchymal stem cells (MSC) raise high hopes in clinical applications. However, the lack of common standards and a precise definition of MSC preparations remains a major obstacle in research and application of MSC. Whereas surface antigen markers have failed to precisely define this population, a combination of proteomic data and microarray data provides a new dimension for the definition of MSC preparations. In our continuing effort to characterize MSC, we have analyzed the differential transcriptome and proteome expression profiles of MSC preparations isolated from human bone marrow under two different expansion media (BM-MSC-M1 and BM-MSC-M2). In proteomics, 136 protein spots were unambiguously identified by MALDI-TOF-MS and corresponding cDNA spots were selected on our "Human Transcriptome cDNA Microarray." Combination of datasets revealed a correlation in differential gene expression and protein expression of BM-MSC-M1 vs BM-MSC-M2. Genes involved in metabolism were more highly expressed in BM-MSC-M1, whereas genes involved in development, morphogenesis, extracellular matrix, and differentiation were more highly expressed in BM-MSC-M2. Interchanging culture conditions for 8 days revealed that differential expression was retained in several genes whereas it was altered in others. Our results have provided evidence that homogeneous BM-MSC preparations can reproducibly be isolated under standardized conditions, whereas culture conditions exert a prominent impact on transcriptome, proteome, and cellular organization of BM-MSC.
Quintela, T; Marcelino, H; Deery, M J; Feret, R; Howard, J; Lilley, K S; Albuquerque, T; Gonçalves, I; Duarte, A C; Santos, C R A
2016-01-01
The choroid plexus (CP) epithelium is a unique structure in the brain that forms an interface between the peripheral blood and the cerebrospinal fluid (CSF), which is mostly produced by the CP itself. Because the CP transcriptome is regulated by the sex hormone background, the present study compared gene/protein expression profiles in the CP and CSF from male and female rats aiming to better understand sex-related differences in CP functions and brain physiology. We used data previously obtained by cDNA microarrays to compare the CP transcriptome between male and female rats, and complemented these data with the proteomic analysis of the CSF of castrated and sham-operated males and females. Microarray analysis showed that 17 128 and 17 002 genes are expressed in the male and female CP, which allowed the functional annotation of 141 and 134 pathways, respectively. Among the most expressed genes, canonical pathways associated with mitochondrial dysfunctions and oxidative phosphorylation were the most prominent, whereas the most relevant molecular and cellular functions annotated were protein synthesis, cellular growth and proliferation, cell death and survival, molecular transport, and protein trafficking. No significant differences were found between males and females regarding these pathways. Seminal functions of the CP differentially regulated between sexes were circadian rhythm signalling, as well as several canonical pathways related to stem cell differentiation, metabolism and the barrier function of the CP. The proteomic analysis identified five down-regulated proteins in the CSF samples from male rats compared to females and seven proteins exhibiting marked variation in the CSF of gonadectomised males compared to sham animals, whereas no differences were found between sham and ovariectomised females. These data clearly show sex-related differences in CP gene expression and CSF protein composition that may impact upon neurological diseases. 
© 2015 British Society for Neuroendocrinology.
Gregori, Josep; Villarreal, Laura; Sánchez, Alex; Baselga, José; Villanueva, Josep
2013-12-16
The microarray community has shown that the low reproducibility observed in gene expression-based biomarker discovery studies is partially due to relying solely on p-values to get the lists of differentially expressed genes. Their conclusions recommended complementing the p-value cutoff with the use of effect-size criteria. The aim of this work was to evaluate the influence of such an effect-size filter on spectral counting-based comparative proteomic analysis. The results proved that the filter increased the number of true positives and decreased the number of false positives and the false discovery rate of the dataset. These results were confirmed by simulation experiments where the effect size filter was used to evaluate systematically variable fractions of differentially expressed proteins. Our results suggest that relaxing the p-value cut-off followed by a post-test filter based on effect size and signal level thresholds can increase the reproducibility of statistical results obtained in comparative proteomic analysis. Based on our work, we recommend using a filter consisting of a minimum absolute log2 fold change of 0.8 and a minimum signal of 2-4 SpC on the most abundant condition for the general practice of comparative proteomics. The implementation of feature filtering approaches could improve proteomic biomarker discovery initiatives by increasing the reproducibility of the results obtained among independent laboratories and MS platforms. Quality control analysis of microarray-based gene expression studies pointed out that the low reproducibility observed in the lists of differentially expressed genes could be partially attributed to the fact that these lists are generated relying solely on p-values. Our study has established that the implementation of an effect size post-test filter improves the statistical results of spectral count-based quantitative proteomics. 
The results proved that the filter increased the number of true positives while decreasing the number of false positives and the false discovery rate of the datasets. The results presented here prove that a post-test filter applying reasonable effect size and signal level thresholds helps to increase the reproducibility of statistical results in comparative proteomic analysis. Furthermore, the implementation of feature filtering approaches could improve proteomic biomarker discovery initiatives by increasing the reproducibility of results obtained among independent laboratories and MS platforms. This article is part of a Special Issue entitled: Standardization and Quality Control in Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.
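The post-test filter recommended in this abstract (minimum absolute log2 fold change of 0.8 and a minimum signal of 2-4 SpC in the most abundant condition) can be sketched as follows. Several details are assumptions made only for illustration: the function name `passes_filter`, the relaxed p-value cut-off of 0.05, the use of mean spectral counts per condition, and the pseudocount of 0.5 that avoids division by zero. None of these are stated in the abstract.

```python
import math


def passes_filter(mean_spc_a, mean_spc_b, p_value,
                  p_cutoff=0.05, min_abs_log2fc=0.8,
                  min_signal=2.0, pseudocount=0.5):
    """Apply a p-value cut-off followed by effect-size and signal filters.

    mean_spc_a, mean_spc_b: mean spectral counts of one protein in the
        two conditions being compared (assumed summary statistic).
    p_value: p-value from whatever statistical test was used upstream.
    p_cutoff, pseudocount: illustrative assumptions, not from the source.
    min_abs_log2fc: 0.8, as recommended in the abstract.
    min_signal: lower end of the 2-4 SpC range given in the abstract.
    """
    log2fc = math.log2((mean_spc_a + pseudocount) / (mean_spc_b + pseudocount))
    significant = p_value <= p_cutoff
    large_effect = abs(log2fc) >= min_abs_log2fc
    # The signal threshold is checked on the most abundant condition.
    enough_signal = max(mean_spc_a, mean_spc_b) >= min_signal
    return significant and large_effect and enough_signal


# Made-up examples: a clear change, a small change, and a low-signal change.
kept = passes_filter(10.0, 4.0, p_value=0.03)
small_effect = passes_filter(10.0, 8.0, p_value=0.01)
low_signal = passes_filter(1.5, 0.2, p_value=0.01)
```

Raising `min_signal` toward 4 makes the filter stricter for low-count proteins, which is where spectral counting is least reliable.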
Proteomic analysis of formalin-fixed paraffin embedded tissue by MALDI imaging mass spectrometry
Casadonte, Rita; Caprioli, Richard M
2012-01-01
Archived formalin-fixed paraffin-embedded (FFPE) tissue collections represent a valuable informational resource for proteomic studies. Multiple FFPE core biopsies can be assembled in a single block to form tissue microarrays (TMAs). We describe a protocol for analyzing protein in FFPE-TMAs using matrix-assisted laser desorption/ionization (MALDI) imaging mass spectrometry (IMS). The workflow incorporates an antigen retrieval step following deparaffinization, in situ trypsin digestion, matrix application and then mass spectrometry signal acquisition. The direct analysis of FFPE-TMA tissue using IMS allows direct analysis of multiple tissue samples in a single experiment without extraction and purification of proteins. The advantages of high speed and throughput, easy sample handling and excellent reproducibility make this technology a favorable approach for the proteomic analysis of clinical research cohorts with large sample numbers. For example, TMA analysis of 300 FFPE cores would typically require 6 h of total time through data acquisition, not including data analysis. PMID:22011652
Glaros, Trevor G; Blancett, Candace D; Bell, Todd M; Natesan, Mohan; Ulrich, Robert G
2015-01-01
The bacterium Burkholderia mallei is the etiological agent of glanders, a highly contagious, often fatal zoonotic infectious disease that is also a biodefense concern. Clinical laboratory assays that analyze blood or other biological fluids are the highest priority because these specimens can be collected with minimal risk to the patient. However, progress in developing sensitive assays for monitoring B. mallei infection is hampered by a shortage of useful biomarkers. Reasoning that there should be a strong correlation between the proteomes of infected tissues and circulating serum, we employed imaging mass spectrometry (IMS) of thin-sectioned tissues from Chlorocebus aethiops (African green) monkeys infected with B. mallei to localize host and pathogen proteins that were associated with abscesses. Using laser-capture microdissection of specific regions identified by IMS and histology within the tissue sections, a more extensive proteomic analysis was performed by a technique that combined the physical separation capabilities of liquid chromatography (LC) with the sensitive mass analysis capabilities of mass spectrometry (LC-MS/MS). By examining standard formalin-fixed, paraffin-embedded tissue sections, this strategy resulted in the identification of several proteins that were associated with lung and skin abscesses, including the host protein calprotectin and the pathogen protein GroEL. Elevated levels of calprotectin detected by ELISA and antibody responses to GroEL, measured by a microarray of the bacterial proteome, were subsequently detected in the sera of C. aethiops, Macaca mulatta, and Macaca fascicularis primates infected with B. mallei. Our results demonstrate that a combination of multidimensional MS analysis of traditional histology specimens with high-content protein microarrays can be used to discover lead pairs of host-pathogen biomarkers of infection that are identifiable in biological fluids.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chin, Mark H.; Qian, Weijun; Wang, Haixing
2008-02-10
The molecular mechanisms underlying the changes in the nigrostriatal pathway in Parkinson disease (PD) are not completely understood. Here we use mass spectrometry and microarrays to study the proteomic and transcriptomic changes in the striatum of two mouse models of PD, induced by the distinct neurotoxins 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) and methamphetamine (METH). Proteomic analyses resulted in the identification and relative quantification of 912 proteins with two or more unique peptides and 85 proteins with significant abundance changes following neurotoxin treatment. Similarly, microarray analyses revealed 181 genes with significant changes in mRNA following neurotoxin treatment. The combined protein and gene list provides a clearer picture of the potential mechanisms underlying neurodegeneration observed in PD. Functional analysis of this combined list revealed a number of significant categories, including mitochondrial dysfunction, oxidative stress response and apoptosis. Additionally, codon usage and miRNAs may play an important role in translational control in the striatum. These results constitute one of the largest datasets integrating protein and transcript changes for these neurotoxin models with many similar endpoint phenotypes but distinct mechanisms.
Gupta, Surya; De Puysseleyr, Veronic; Van der Heyden, José; Maddelein, Davy; Lemmens, Irma; Lievens, Sam; Degroeve, Sven; Tavernier, Jan; Martens, Lennart
2017-05-01
Protein-protein interaction (PPI) studies have dramatically expanded our knowledge about cellular behaviour and development in different conditions. A multitude of high-throughput PPI techniques have been developed to achieve proteome-scale coverage for PPI studies, including the microarray based Mammalian Protein-Protein Interaction Trap (MAPPIT) system. Because such high-throughput techniques typically report thousands of interactions, managing and analysing the large amounts of acquired data is a challenge. We have therefore built the MAPPIT cell microArray Protein Protein Interaction-Data management & Analysis Tool (MAPPI-DAT) as an automated data management and analysis tool for MAPPIT cell microarray experiments. MAPPI-DAT stores the experimental data and metadata in a systematic and structured way, automates data analysis and interpretation, and enables the meta-analysis of MAPPIT cell microarray data across all stored experiments. MAPPI-DAT is developed in Python, using R for data analysis and MySQL as the data management system. MAPPI-DAT is cross-platform and can be run on Microsoft Windows, Linux and OS X/macOS. The source code and a Microsoft Windows executable are freely available under the permissive Apache2 open source license at https://github.com/compomics/MAPPI-DAT. Contact: jan.tavernier@vib-ugent.be or lennart.martens@vib-ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Quantitative Proteomics Analysis of the cAMP/Protein Kinase A Signaling Pathway
2012-01-01
To define the proteins whose expression is regulated by cAMP and protein kinase A (PKA), we used a quantitative proteomics approach in studies of wild-type (WT) and kin- (PKA-null) S49 murine T lymphoma cells. We also compared the impact of endogenous increases in the level of cAMP [by forskolin (Fsk) and the phosphodiesterase inhibitor isobutylmethylxanthine (IBMX)] or by a cAMP analogue (8-CPT-cAMP). We identified 1056 proteins in WT and kin- S49 cells and found that 8-CPT-cAMP and Fsk with IBMX produced differences in protein expression. WT S49 cells had a correlation coefficient of 0.41 between DNA microarray data and the proteomics analysis in cells incubated with 8-CPT-cAMP for 24 h and a correlation coefficient of 0.42 between the DNA microarray data obtained at 6 h and the changes in protein expression after incubation with 8-CPT-cAMP for 24 h. Glutathione reductase (Gsr) had a higher level of basal expression in kin- S49 cells than in WT cells. Consistent with this finding, kin- cells are less sensitive to cell killing and generation of malondialdehyde than are WT cells incubated with H2O2. Cyclic AMP acting via PKA thus has a broad impact on protein expression in mammalian cells, including in the regulation of Gsr and oxidative stress. PMID:23110364
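The correlation coefficients quoted above compare paired microarray and proteomics measurements for the same genes. As a minimal sketch, assuming the two datasets have already been matched gene by gene and expressed as log ratios, a Pearson coefficient can be computed as below. The abstract does not state which correlation measure was used, so Pearson is an assumption here, and the function name is hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-products and sums of squares about the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)
```

A call such as `pearson(mrna_log_ratios, protein_log_ratios)` would take one value per gene from each platform, in the same gene order; both argument names are placeholders.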
2013-01-01
Background Triglyceride deposit cardiomyovasculopathy (TGCV) is a rare disease, characterized by the massive accumulation of triglyceride (TG) in multiple tissues, especially skeletal muscle, heart muscle and the coronary artery. TGCV is caused by mutation of adipose triglyceride lipase, which is an essential molecule for the hydrolysis of TG. Patients with TGCV are at high risk for skeletal myopathy and heart dysfunction, and therefore premature death. Development of therapeutic methods for TGCV is highly desirable. This study aims to discover specific molecules responsible for TGCV pathogenesis. Methods To identify differentially expressed proteins in TGCV patient cells, the stable isotope labeling with amino acids in cell culture (SILAC) method coupled with LC-MS/MS was performed using skin fibroblast cells derived from two TGCV patients and three healthy volunteers. Altered protein expression in TGCV cells was confirmed using the selected reaction monitoring (SRM) method. Microarray-based transcriptome analysis was simultaneously performed to identify changes in gene expression in TGCV cells. Results Using SILAC proteomics, 4033 proteins were quantified, 53 of which showed significantly altered expression in both TGCV patient cells. Twenty altered proteins were chosen and confirmed using SRM. SRM analysis successfully quantified 14 proteins, 13 of which showed the same trend as SILAC proteomics. The altered protein expression data set was used in Ingenuity Pathway Analysis (IPA), and significant networks were identified. Several of these proteins have been previously implicated in lipid metabolism, while others represent new therapeutic targets or markers for TGCV. Microarray analysis quantified 20743 transcripts, and 252 genes showed significantly altered expression in both TGCV patient cells. Ten altered genes were chosen, 9 of which were successfully confirmed using quantitative RT-PCR. Biological networks of altered genes were analyzed using an IPA search.
Conclusions We performed the SILAC- and SRM-based identification-through-confirmation study using skin fibroblast cells derived from TGCV patients, and first identified altered proteins specific for TGCV. Microarray analysis also identified changes in gene expression. The functional networks of the altered proteins and genes are discussed. Our findings will be exploited to elucidate the pathogenesis of TGCV and discover clinically relevant molecules for TGCV in the near future. PMID:24360150
USDA-ARS's Scientific Manuscript database
In addition to microarray technology, which provides a robust method to study protein function in a rapid, economical, and proteome-wide fashion, plasmid-based functional proteomics is an important technology for rapidly obtaining large quantities of protein and determining protein function across a...
Implementation of proteomics for cancer research: past, present, and future.
Karimi, Parisa; Shahrokni, Armin; Ranjbar, Mohammad R Nezami
2014-01-01
Cancer is the leading cause of death, accounting for about 13% of all annual deaths worldwide. Many different fields of science are collaborating to study cancer, to improve our knowledge of this lethal disease and to find better solutions for diagnosis and treatment. Proteomics is one of the most recent and rapidly growing areas in molecular biology and helps in understanding cancer from an omics data analysis point of view. The human proteome project was officially initiated in 2008. Proteomics enables scientists to interrogate a variety of biospecimens for their protein contents and measure the concentrations of these proteins. Current necessary equipment and technologies for cancer proteomics are mass spectrometry, protein microarrays, nanotechnology and bioinformatics. In this paper, we provide a brief review of proteomics and its application in cancer research. After a brief introduction including its definition, we summarize the history of major previous work conducted by researchers, followed by an overview of the role of proteomics in cancer studies. We also provide a list of different utilities in cancer proteomics and investigate their advantages and shortcomings from theoretical and practical angles. Finally, we explore some of the main challenges and conclude the paper with future directions in this field.
2014-01-01
Background KIAA1199 is a recently identified novel gene that is up-regulated in human cancer with poor survival. Our proteomic study on signaling polarity in chemotactic cells revealed KIAA1199 as a novel protein target that may be involved in cellular chemotaxis and motility. In the present study, we examined the functional significance of KIAA1199 expression in breast cancer growth, motility and invasiveness. Methods We validated the previous microarray observation by tissue microarray immunohistochemistry using a TMA slide containing 12 breast tumor tissue cores and 12 corresponding normal tissues. We performed the shRNA-mediated knockdown of KIAA1199 in MDA-MB-231 and HS578T cells to study the role of this protein in cell proliferation, migration and apoptosis in vitro. We studied the effects of KIAA1199 knockdown in vivo in two groups of mice (n = 5). We carried out the SILAC LC-MS/MS based proteomic studies on the involvement of KIAA1199 in breast cancer. Results KIAA1199 mRNA and protein were significantly overexpressed in breast tumor specimens and cell lines as compared with non-neoplastic breast tissues from large-scale microarray and studies of breast cancer cell lines and tumors. To gain deeper insights into the novel role of KIAA1199 in breast cancer, we modulated KIAA1199 expression using shRNA-mediated knockdown in two breast cancer cell lines (MDA-MB-231 and HS578T), expressing higher levels of KIAA1199. The KIAA1199 knockdown cells showed reduced motility and cell proliferation in vitro. Moreover, when the knockdown cells were injected into the mammary fat pads of female athymic nude mice, there was a significant decrease in tumor incidence and growth. In addition, quantitative proteomic analysis revealed that knockdown of KIAA1199 in breast cancer (MDA-MB-231) cells affected a broad range of cellular functions including apoptosis, metabolism and cell motility.
Conclusions Our findings indicate that KIAA1199 may play an important role in breast tumor growth and invasiveness, and that it may represent a novel target for biomarker development and a novel therapeutic target for breast cancer. PMID:24628760
Jain, K K
2001-02-01
Cambridge Healthtech Institute's Third Annual Conference on Lab-on-a-Chip and Microarray technology covered the latest advances in this technology and applications in life sciences. Highlights of the meetings are reported briefly with emphasis on applications in genomics, drug discovery and molecular diagnostics. There was an emphasis on microfluidics because of the wide applications in laboratory and drug discovery. The lab-on-a-chip provides the facilities of a complete laboratory in a hand-held miniature device. Several microarray systems have been used for hybridisation and detection techniques. Oligonucleotide scanning arrays provide a versatile tool for the analysis of nucleic acid interactions and provide a platform for improving the array-based methods for investigation of antisense therapeutics. A method for analysing combinatorial DNA arrays using oligonucleotide-modified gold nanoparticle probes and a conventional scanner has considerable potential in molecular diagnostics. Various applications of microarray technology for high-throughput screening in drug discovery and single nucleotide polymorphisms (SNP) analysis were discussed. Protein chips have important applications in proteomics. With the considerable amount of data generated by the different technologies using microarrays, it is obvious that the reading of the information and its interpretation and management through the use of bioinformatics is essential. Various techniques for data analysis were presented. Biochip and microarray technology has an essential role to play in the evolving trends in healthcare, which integrate diagnosis with prevention/treatment and emphasise personalised medicines.
Application of proteomics to ecology and population biology.
Karr, T L
2008-02-01
Proteomics is a relatively new scientific discipline that merges protein biochemistry, genome biology and bioinformatics to determine the spatial and temporal expression of proteins in cells, tissues and whole organisms. There has been very little application of proteomics to the fields of behavioral genetics, evolution, ecology and population dynamics, and it has only recently been effectively applied to the closely allied fields of molecular evolution and genetics. However, there exists considerable potential for proteomics to have an impact in areas related to functional ecology; this review will introduce the general concepts and methodologies that define the field of proteomics and compare and contrast its advantages and disadvantages with those of other methods. Examples of how proteomics can aid, complement and indeed extend the study of functional ecology will be discussed, including the main tool of ecological studies, population genetics, with an emphasis on metapopulation structure analysis. Because proteomic analyses provide a direct measure of gene expression, they obviate some of the limitations associated with other genomic approaches, such as microarray and EST analyses. Likewise, in conjunction with associated bioinformatics and molecular evolutionary tools, proteomics can provide the foundation of a systems-level integration approach that can enhance ecological studies. It can be envisioned that proteomics will provide important new information on issues specific to metapopulation biology and adaptive processes in nature. A specific example of the application of proteomics to sperm ageing is provided to illustrate the potential utility of the approach.
Profiling the Aspergillus fumigatus Proteome in Response to Caspofungin
Cagas, Steven E.; Jain, Mohit Raja; Li, Hong; Perlin, David S.
2011-01-01
The proteomic response of Aspergillus fumigatus to caspofungin was evaluated by gel-free isobaric tagging for relative and absolute quantitation (iTRAQ) as a means to determine potential biomarkers of drug action. A cell fractionation approach yielding 4 subcellular compartment fractions was used to enhance the resolution of proteins for proteomic analysis. Using iTRAQ, a total of 471 unique proteins were identified in soluble and cell wall/plasma membrane fractions at 24 and 48 h of growth in rich media in a wild-type drug-susceptible strain. A total of 122 proteins showed at least a 2-fold change in relative abundance following exposure to caspofungin (CSF) at just below the minimum effective concentration (0.12 μg/ml). The largest changes were seen in the mitochondrial hypoxia response domain protein (AFUA_1G12250), the level of which decreased >16-fold in the secreted fraction, and ChiA1, the level of which decreased 12.1-fold in the cell wall/plasma membrane fraction. The level of the major allergen and cytotoxin AspF1 was also shown to decrease by 12.1-fold upon the addition of drug. A subsequent iTRAQ analysis of an echinocandin-resistant strain (fks1-S678P) was used to validate proteins specific to drug action. A total of 103 proteins in the 2 fractions tested by iTRAQ were differentially expressed in the wild-type susceptible strain but not significantly changed in the resistant strain. Of these potential biomarkers, 11 had levels that changed at least 12-fold. Microarray analysis of the susceptible strain was performed to evaluate the correlation between proteomics and genomics, with a total of 117 genes found to be changing at least 2-fold. Of these, a total of 22 proteins with significant changes identified by iTRAQ also showed significant gene expression level changes by microarray. Overall, these data have the potential to identify biomarkers that assess the relative efficacy of echinocandin drug therapy. PMID:20974863
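The comparison described above, selecting proteins and genes that change at least 2-fold and then looking for entries found by both platforms, can be sketched as a thresholding step followed by a set intersection. This is a minimal illustration only: the function names and the dictionary layout (identifier mapped to a treated/untreated abundance ratio) are assumptions, and the study's significance testing is not reproduced.

```python
def changed_ids(ratios, min_fold=2.0):
    """Return identifiers whose ratio is up or down by at least min_fold.

    `ratios` maps an identifier to a positive treated/untreated ratio.
    """
    if min_fold < 1.0:
        raise ValueError("min_fold must be >= 1")
    changed = set()
    for identifier, ratio in ratios.items():
        if ratio <= 0:
            raise ValueError(f"ratio must be positive: {identifier}")
        if ratio >= min_fold or ratio <= 1.0 / min_fold:
            changed.add(identifier)
    return changed


def shared_changes(protein_ratios, transcript_ratios, min_fold=2.0):
    """Identifiers that pass the fold-change threshold on both platforms."""
    return changed_ids(protein_ratios, min_fold) & changed_ids(transcript_ratios, min_fold)
```

Note that this sketch does not check whether the protein and the transcript change in the same direction; a stricter comparison would also compare the sign of the log ratio.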
2010-01-01
Background The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. Results In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. Conclusion High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data. PMID:20122245
Seok, Junhee; Kaushal, Amit; Davis, Ronald W; Xiao, Wenzhong
2010-01-18
The large amount of high-throughput genomic data has facilitated the discovery of the regulatory relationships between transcription factors and their target genes. While early methods for discovery of transcriptional regulation relationships from microarray data often focused on the high-throughput experimental data alone, more recent approaches have explored the integration of external knowledge bases of gene interactions. In this work, we develop an algorithm that provides improved performance in the prediction of transcriptional regulatory relationships by supplementing the analysis of microarray data with a new method of integrating information from an existing knowledge base. Using a well-known dataset of yeast microarrays and the Yeast Proteome Database, a comprehensive collection of known information of yeast genes, we show that knowledge-based predictions demonstrate better sensitivity and specificity in inferring new transcriptional interactions than predictions from microarray data alone. We also show that comprehensive, direct and high-quality knowledge bases provide better prediction performance. Comparison of our results with ChIP-chip data and growth fitness data suggests that our predicted genome-wide regulatory pairs in yeast are reasonable candidates for follow-up biological verification. High quality, comprehensive, and direct knowledge bases, when combined with appropriate bioinformatic algorithms, can significantly improve the discovery of gene regulatory relationships from high throughput gene expression data.
Recent advances in proteomic applications for schistosomiasis research: potential clinical impact.
Sotillo, Javier; Doolan, Denise; Loukas, Alex
2017-02-01
Schistosomiasis is a neglected tropical disease affecting hundreds of millions of people worldwide. Recent advances in the field of proteomics and the development of new and highly sensitive mass spectrometers and quantitative techniques have provided new tools for advancing the molecular biology, cell biology, diagnosis and vaccine development for public health threats such as schistosomiasis. Areas covered: In this review we describe the latest advances in research that utilizes proteomics-based tools to address some of the key challenges to developing effective interventions against schistosomiasis. We also provide information about the potential of extracellular vesicles to advance the fight against this devastating disease. Expert commentary: Different proteins are already being tested as vaccines against schistosomiasis with promising results. The re-analysis of the Schistosoma spp. proteomes using new and more sensitive mass spectrometers as well as better separation approaches will help identify more vaccine targets in a rational and informed manner. In addition, the recent development of new proteome microarrays will facilitate characterisation of novel markers of infection as well as new vaccine and diagnostic candidate antigens.
Proteomics and Systems Biology: Current and Future Applications in the Nutritional Sciences
Moore, J. Bernadette; Weeks, Mark E.
2011-01-01
In the last decade, advances in genomics, proteomics, and metabolomics have yielded large-scale datasets that have driven an interest in global analyses, with the objective of understanding biological systems as a whole. Systems biology integrates computational modeling and experimental biology to predict and characterize the dynamic properties of biological systems, which are viewed as complex signaling networks. Whereas the systems analysis of disease-perturbed networks holds promise for identification of drug targets for therapy, equally the identified critical network nodes may be targeted through nutritional intervention in either a preventative or therapeutic fashion. As such, in the context of the nutritional sciences, it is envisioned that systems analysis of normal and nutrient-perturbed signaling networks in combination with knowledge of underlying genetic polymorphisms will lead to a future in which the health of individuals will be improved through predictive and preventative nutrition. Although high-throughput transcriptomic microarray data were initially most readily available and amenable to systems analysis, recent technological and methodological advances in MS have contributed to a linear increase in proteomic investigations. It is now commonplace for combined proteomic technologies to generate complex, multi-faceted datasets, and these will be the keystone of future systems biology research. This review will define systems biology, outline current proteomic methodologies, highlight successful applications of proteomics in nutrition research, and discuss the challenges for future applications of systems biology approaches in the nutritional sciences. PMID:22332076
Chipster: user-friendly analysis software for microarray and other high-throughput data.
Kallio, M Aleksi; Tuimala, Jarno T; Hupponen, Taavi; Klemelä, Petri; Gentile, Massimiliano; Scheinin, Ilari; Koski, Mikko; Käki, Janne; Korpelainen, Eija I
2011-10-14
The growth of high-throughput technologies such as microarrays and next generation sequencing has been accompanied by active research in data analysis methodology, producing new analysis methods at a rapid pace. While most of the newly developed methods are freely available, their use requires substantial computational skills. In order to enable non-programming biologists to benefit from the method development in a timely manner, we have created the Chipster software. Chipster (http://chipster.csc.fi/) brings a powerful collection of data analysis methods within the reach of bioscientists via its intuitive graphical user interface. Users can analyze and integrate different data types such as gene expression, miRNA and aCGH. The analysis functionality is complemented with rich interactive visualizations, allowing users to select datapoints and create new gene lists based on these selections. Importantly, users can save the performed analysis steps as reusable, automatic workflows, which can also be shared with other users. Being a versatile and easily extendable platform, Chipster can be used for microarray, proteomics and sequencing data. In this article we describe its comprehensive collection of analysis and visualization tools for microarray data using three case studies. Chipster is a user-friendly analysis software for high-throughput data. Its intuitive graphical user interface enables biologists to access a powerful collection of data analysis and integration tools, and to visualize data interactively. Users can collaborate by sharing analysis sessions and workflows. Chipster is open source, and the server installation package is freely available.
Chipster: user-friendly analysis software for microarray and other high-throughput data
2011-01-01
Background The growth of high-throughput technologies such as microarrays and next generation sequencing has been accompanied by active research in data analysis methodology, producing new analysis methods at a rapid pace. While most of the newly developed methods are freely available, their use requires substantial computational skills. In order to enable non-programming biologists to benefit from the method development in a timely manner, we have created the Chipster software. Results Chipster (http://chipster.csc.fi/) brings a powerful collection of data analysis methods within the reach of bioscientists via its intuitive graphical user interface. Users can analyze and integrate different data types such as gene expression, miRNA and aCGH. The analysis functionality is complemented with rich interactive visualizations, allowing users to select datapoints and create new gene lists based on these selections. Importantly, users can save the performed analysis steps as reusable, automatic workflows, which can also be shared with other users. Being a versatile and easily extendable platform, Chipster can be used for microarray, proteomics and sequencing data. In this article we describe its comprehensive collection of analysis and visualization tools for microarray data using three case studies. Conclusions Chipster is a user-friendly analysis software for high-throughput data. Its intuitive graphical user interface enables biologists to access a powerful collection of data analysis and integration tools, and to visualize data interactively. Users can collaborate by sharing analysis sessions and workflows. Chipster is open source, and the server installation package is freely available. PMID:21999641
Red blood cell (RBC) membrane proteomics--Part I: Proteomics and RBC physiology.
Pasini, Erica M; Lutz, Hans U; Mann, Matthias; Thomas, Alan W
2010-01-03
Membrane proteomics is concerned with accurately and sensitively identifying molecules involved in cell compartmentalisation, including those controlling the interface between the cell and the outside world. The high lipid content of the environment in which these proteins are found often causes a particular set of problems that must be overcome when isolating the required material before effective HPLC-MS approaches can be performed. The membrane is an unusually dynamic cellular structure since it interacts with an ever changing environment. A full understanding of this critical cell component will ultimately require, in addition to proteomics, lipidomics, glycomics, interactomics and study of post-translational modifications. Devoid of nucleus and organelles in mammalian species other than camelids, and constantly in motion in the blood stream, red blood cells (RBCs) are the sole mammalian oxygen transporter. The fact that mature mammalian RBCs have no internal membrane-bound organelles somewhat simplifies proteomics analysis of the plasma membrane, and the fact that they have no nucleus disqualifies microarray-based methods. Proteomics has the potential to provide a better understanding of this critical interface, and thereby assist in identifying new approaches to diseases. (c) 2009 Elsevier B.V. All rights reserved.
Quantitative proteomic analysis in breast cancer.
Tabchy, A; Hennessy, B T; Gonzalez-Angulo, A M; Bernstam, F M; Lu, Y; Mills, G B
2011-02-01
Much progress has recently been made in the genomic and transcriptional characterization of tumors. However, historically the characterization of cells at the protein level has suffered limitations in reproducibility, scalability and robustness. Recent technological advances have made it possible to accurately and reproducibly portray the global levels and active states of cellular proteins. Protein microarrays examine the native post-translational conformations of proteins including activated phosphorylated states, in a comprehensive high-throughput mode, and can map activated pathways and networks of proteins inside the cells. The reverse-phase protein microarray (RPPA) offers a unique opportunity to study signal transduction networks in small biological samples such as human biopsy material and can provide critical information for therapeutic decision-making and the monitoring of patients for targeted molecular medicine. By providing the key missing link to the story generated from genomic and gene expression characterization efforts, functional proteomics offer the promise of a comprehensive understanding of cancer. Several initial successes in breast cancer are showing that such information is clinically relevant. Copyright 2011 Prous Science, S.A.U. or its licensors. All rights reserved.
Time-series analysis of the transcriptome and proteome of Escherichia coli upon glucose repression.
Borirak, Orawan; Rolfe, Matthew D; de Koning, Leo J; Hoefsloot, Huub C J; Bekker, Martijn; Dekker, Henk L; Roseboom, Winfried; Green, Jeffrey; de Koster, Chris G; Hellingwerf, Klaas J
2015-10-01
Time-series transcript- and protein-profiles were measured upon initiation of carbon catabolite repression in Escherichia coli, in order to investigate the extent of post-transcriptional control in this prototypical response. A glucose-limited chemostat culture was used as the CCR-free reference condition. Stopping the pump and simultaneously adding a pulse of glucose, that saturated the cells for at least 1h, was used to initiate the glucose response. Samples were collected and subjected to quantitative time-series analysis of both the transcriptome (using microarray analysis) and the proteome (through a combination of 15N-metabolic labeling and mass spectrometry). Changes in the transcriptome and corresponding proteome were analyzed using statistical procedures designed specifically for time-series data. By comparison of the two sets of data, a total of 96 genes were identified that are post-transcriptionally regulated. This gene list provides candidates for future in-depth investigation of the molecular mechanisms involved in post-transcriptional regulation during carbon catabolite repression in E. coli, like the involvement of small RNAs. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
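The comparison of transcript and protein time series described above can be illustrated with a deliberately simple filter. This is only a sketch under stated assumptions, not the time-series statistics used in the study: the function name, the thresholds `quiet` and `changed`, and the example values are all hypothetical. It flags genes whose protein level shifts while the transcript stays nearly flat.

```python
def post_transcriptional_candidates(mrna, protein, quiet=0.5, changed=1.0):
    """Return genes whose protein profile shifts while the mRNA stays nearly flat.

    mrna, protein: dicts mapping gene name -> list of log2 ratios versus the
    reference condition, one value per time point.
    quiet, changed: hypothetical thresholds on the largest absolute log2 ratio.
    """
    hits = []
    for gene in sorted(mrna.keys() & protein.keys()):
        mrna_shift = max(abs(v) for v in mrna[gene])
        protein_shift = max(abs(v) for v in protein[gene])
        if mrna_shift < quiet and protein_shift >= changed:
            hits.append(gene)
    return hits


# Illustrative values only; these are not data from the study.
mrna = {"geneA": [0.1, -0.2, 0.1], "geneB": [0.5, 1.5, 2.0]}
protein = {"geneA": [0.2, 1.4, 1.8], "geneB": [0.3, 1.2, 1.9]}
print(post_transcriptional_candidates(mrna, protein))
```

A threshold filter like this ignores measurement noise and time ordering, which is why dedicated time-series statistics are preferable for real data.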
Enhancing Results of Microarray Hybridizations Through Microagitation
Toegl, Andreas; Kirchner, Roland; Gauer, Christoph; Wixforth, Achim
2003-01-01
Protein and DNA microarrays have become a standard tool in proteomics/genomics research. In order to guarantee fast and reproducible hybridization results, the diffusion limit must be overcome. Surface acoustic wave (SAW) micro-agitation chips efficiently agitate the smallest sample volumes (down to 10 μL and below) without introducing any dead volume. The advantages are reduced reaction time, increased signal-to-noise ratio, improved homogeneity across the microarray, and better slide-to-slide reproducibility. The SAW micromixer chips are the heart of the Advalytix ArrayBooster, which is compatible with all microarrays based on the microscope slide format. PMID:13678150
Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.
Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J
2008-06-18
Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. 
This study shows that SCC is an alternative to the Pearson correlation coefficient and the SD-weighted correlation coefficient, and is particularly useful for clustering replicated microarray data. This computational approach should be generally useful for proteomic data or other high-throughput analysis methodology.
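For orientation, the baseline that SCC is compared against can be sketched as follows: a Pearson correlation computed on replicate-averaged profiles and converted to a distance for clustering. This sketch does not implement SCC, whose formula is not given in the abstract; the helper names and the example values are assumptions made for illustration.

```python
from math import sqrt


def replicate_mean(replicates):
    """Average replicate profiles (a list of equal-length lists) per condition."""
    return [sum(column) / len(column) for column in zip(*replicates)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length, non-constant profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def correlation_distance(reps_a, reps_b):
    """Distance 1 - r between two genes, using replicate-averaged profiles."""
    return 1.0 - pearson(replicate_mean(reps_a), replicate_mean(reps_b))


# Illustrative values: two replicates per gene, three conditions.
gene1 = [[1.0, 2.0, 3.0], [1.2, 2.1, 2.9]]
gene2 = [[2.0, 4.0, 6.0], [2.0, 4.0, 6.0]]
gene3 = [[3.0, 2.0, 1.0], [3.0, 2.0, 1.0]]
print(correlation_distance(gene1, gene2), correlation_distance(gene1, gene3))
```

Averaging replicates discards the within-group variance and the number of replicates, which is the information the abstract says SCC takes into account.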
2009-08-27
aeruginosa, Salmonella typhimurium, Shigella flexneri, and Escherichia coli, all pathogenic Gram-negative bacteria. These antibodies enabled detection of ... broad range of O and H strains), Shigella flexneri (recognizing Shigella dysenteriae, Shigella boydii, and S ... treated with antibiotics. In addition, sera were also obtained from rhesus monkeys vaccinated with the anthrax vaccine B. anthracis recombinant
Yang, Wei; Kim, Yongsoo; Kim, Taek-Kyun; Keay, Susan K; Kim, Kwang Pyo; Steen, Hanno; Freeman, Michael R; Hwang, Daehee; Kim, Jayoung
2012-12-01
What's known on the subject? and What does the study add? Interstitial cystitis (IC) is a prevalent and debilitating pelvic disorder generally accompanied by chronic pain combined with chronic urinating problems. Over one million Americans are affected, especially middle-aged women. However, its aetiology or mechanism remains unclear. No efficient drug has been provided to patients. Several urinary biomarker candidates have been identified for IC; among the most promising is antiproliferative factor (APF), whose biological activity is detectable in urine specimens from >94% of patients with both ulcerative and non-ulcerative IC. The present study identified several important mediators of the effect of APF on bladder cell physiology, suggesting several candidate drug targets against IC. In an attempt to identify potential proteins and genes regulated by APF in vivo, and to possibly expand the APF-regulated network identified by stable isotope labelling by amino acids in cell culture (SILAC), we performed an integration analysis of our own SILAC data and the microarray data of Gamper et al. (2009) BMC Genomics 10: 199. Notably, two of the proteins (i.e. MAPKSP1 and GSPT1) that are down-regulated by APF are involved in the activation of mTORC1, suggesting that the mammalian target of rapamycin (mTOR) pathway is potentially a critical pathway regulated by APF in vivo. Several components of the mTOR pathway are currently being studied as potential therapeutic targets in other diseases. Our analysis suggests that this pathway might also be relevant in the design of diagnostic tools and medications targeting IC. • To enhance our understanding of the interstitial cystitis urine biomarker antiproliferative factor (APF), as well as interstitial cystitis biology more generally at the systems level, we reanalyzed recently published large-scale quantitative proteomics and in vivo transcriptomics data sets using an integration analysis tool that we have developed. 
• To identify more differentially expressed genes with a lower false discovery rate from a previously published microarray data set, an integrative hypothesis-testing statistical approach was applied. • For validation experiments, expression and phosphorylation levels of select proteins were evaluated by western blotting. • Integration analysis of this transcriptomics data set with our own quantitative proteomics data set identified 10 genes that are potentially regulated by APF in vivo from 4140 differentially expressed genes identified with a false discovery rate of 1%. • Of these, five (i.e. JUP, MAPKSP1, GSPT1, PTGS2/COX-2 and XPOT) were found to be prominent after network modelling of the common genes identified in the proteomics and microarray studies. • This molecular signature reflects the biological processes of cell adhesion, cell proliferation and inflammation, which is consistent with the known physiological effects of APF. • Lastly, we found the mammalian target of rapamycin pathway was down-regulated in response to APF. • This unbiased integration analysis of in vitro quantitative proteomics data with in vivo quantitative transcriptomics data led to the identification of potential downstream mediators of the APF signal transduction pathway. © 2012 THE AUTHORS. BJU INTERNATIONAL © 2012 BJU INTERNATIONAL.
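The two steps summarised above, selecting differentially expressed genes at a fixed false discovery rate and intersecting them with a proteomics hit list, can be sketched as below. The Benjamini-Hochberg procedure is used here only as a familiar stand-in for FDR control; the study describes its own integrative hypothesis-testing approach, which this sketch does not reproduce. Gene names and p-values are hypothetical.

```python
def benjamini_hochberg(pvalues, alpha=0.01):
    """Return sorted indices of hypotheses rejected at FDR level alpha."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        # Keep the largest rank whose p-value is below its step-up threshold.
        if pvalues[idx] <= alpha * rank / m:
            cutoff_rank = rank
    return sorted(order[:cutoff_rank])


# Hypothetical microarray p-values and a hypothetical proteomics hit list.
genes = ["g1", "g2", "g3", "g4", "g5"]
pvalues = [0.001, 0.2, 0.004, 0.03, 0.0001]
proteomics_hits = {"g3", "g4", "g5"}

significant = {genes[i] for i in benjamini_hochberg(pvalues, alpha=0.01)}
shared = sorted(significant & proteomics_hits)
print(shared)
```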
Kume, Hideaki; Muraoka, Satoshi; Kuga, Takahisa; Adachi, Jun; Narumi, Ryohei; Watanabe, Shio; Kuwano, Masayoshi; Kodera, Yoshio; Matsushita, Kazuyuki; Fukuoka, Junya; Masuda, Takeshi; Ishihama, Yasushi; Matsubara, Hisahiro; Nomura, Fumio; Tomonaga, Takeshi
2014-01-01
Recent advances in quantitative proteomic technology have enabled the large-scale validation of biomarkers. We here performed a quantitative proteomic analysis of membrane fractions from colorectal cancer tissue to discover biomarker candidates, and then extensively validated the candidate proteins identified. A total of 5566 proteins were identified in six tissue samples, each of which was obtained from polyps and cancer with and without metastasis. GO cellular component analysis predicted that 3087 of these proteins were membrane proteins, whereas TMHMM algorithm predicted that 1567 proteins had a transmembrane domain. Differences were observed in the expression of 159 membrane proteins and 55 extracellular proteins between polyps and cancer without metastasis, while the expression of 32 membrane proteins and 17 extracellular proteins differed between cancer with and without metastasis. A total of 105 of these biomarker candidates were quantitated using selected (or multiple) reaction monitoring (SRM/MRM) with stable synthetic isotope-labeled peptides as an internal control. The results obtained revealed differences in the expression of 69 of these proteins, and this was subsequently verified in an independent set of patient samples (polyps (n = 10), cancer without metastasis (n = 10), cancer with metastasis (n = 10)). Significant differences were observed in the expression of 44 of these proteins, including ITGA5, GPRC5A, PDGFRB, and TFRC, which have already been shown to be overexpressed in colorectal cancer, as well as proteins with unknown function, such as C8orf55. The expression of C8orf55 was also shown to be high not only in colorectal cancer, but also in several cancer tissues using a multicancer tissue microarray, which included 1150 cores from 14 cancer tissues. 
This is the largest verification study of biomarker candidate membrane proteins to date; our methods for biomarker discovery and subsequent validation using SRM/MRM will contribute to the identification of useful biomarker candidates for various cancers. Data are available via ProteomeXchange with identifier PXD000851. PMID:24687888
WholePathwayScope: a comprehensive pathway-based analysis tool for high-throughput data
Yi, Ming; Horton, Jay D; Cohen, Jonathan C; Hobbs, Helen H; Stephens, Robert M
2006-01-01
Background Analysis of High Throughput (HTP) Data such as microarray and proteomics data has provided a powerful methodology to study patterns of gene regulation at genome scale. A major unresolved problem in the post-genomic era is to assemble the large amounts of data generated into a meaningful biological context. We have developed a comprehensive software tool, WholePathwayScope (WPS), for deriving biological insights from analysis of HTP data. Result WPS extracts gene lists with shared biological themes through color cue templates. WPS statistically evaluates global functional category enrichment of gene lists and pathway-level pattern enrichment of data. WPS incorporates well-known biological pathways from KEGG (Kyoto Encyclopedia of Genes and Genomes) and Biocarta, GO (Gene Ontology) terms as well as user-defined pathways or relevant gene clusters or groups, and explores gene-term relationships within the derived gene-term association networks (GTANs). WPS simultaneously compares multiple datasets within biological contexts either as pathways or as association networks. WPS also integrates Genetic Association Database and Partial MedGene Database for disease-association information. We have used this program to analyze and compare microarray and proteomics datasets derived from a variety of biological systems. Application examples demonstrated the capacity of WPS to significantly facilitate the analysis of HTP data for integrative discovery. Conclusion This tool represents a pathway-based platform for discovery integration to maximize analysis power. The tool is freely available at . PMID:16423281
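Functional category enrichment of a gene list is commonly scored with a one-sided hypergeometric test. The abstract does not state which test WPS uses, so the following is a minimal sketch assuming that choice; the function name and the numbers in the example are illustrative.

```python
from math import comb


def enrichment_pvalue(population, category_size, list_size, overlap):
    """P(X >= overlap) when list_size genes are drawn without replacement
    from `population` genes, of which `category_size` belong to the category."""
    total = comb(population, list_size)
    upper = min(category_size, list_size)
    tail = sum(
        comb(category_size, i) * comb(population - category_size, list_size - i)
        for i in range(overlap, upper + 1)
    )
    return tail / total


# Illustrative numbers: 10 genes overall, 4 in the pathway,
# a 3-gene list in which all 3 genes fall in the pathway.
print(enrichment_pvalue(10, 4, 3, 3))
```

When many categories are tested at once, the resulting p-values would normally be corrected for multiple testing.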
The quest of the human proteome and the missing proteins: digging deeper.
Reddy, Panga Jaipal; Ray, Sandipan; Srivastava, Sanjeeva
2015-05-01
Given the diverse range of transcriptional and post-transcriptional mechanisms of gene regulation, estimates of the human proteome are likely subject to scientific surprises as the field of proteomics has gained momentum worldwide. In this regard, the establishment of the "Human Proteome Draft" using high-resolution mass spectrometry (MS), tissue microarrays, and immunohistochemistry by three independent research groups (laboratories of Pandey, Kuster, and Uhlen) accelerated the pace of proteomics research. The Chromosome Centric Human Proteome Project (C-HPP) has taken initiative towards the completion of the Human Proteome Project (HPP) so as to understand the proteomics correlates of common complex human diseases and biological diversity, not to mention person-to-person and population differences in response to drugs, nutrition, vaccines, and other health interventions and host-environment interactions. Although high-resolution MS-based and antibody microarray approaches have shown enormous promise, we are still unable to map the whole human proteome due to the presence of numerous "missing proteins." In December 2014, at the Indian Institute of Technology Bombay, Mumbai the 6th Annual Meeting of the Proteomics Society, India (PSI) and the International Proteomics Conference was held. As part of this interdisciplinary summit, a panel discussion session on "The Quest of the Human Proteome and Missing Proteins" was organized. Eminent scientists in the field of proteomics and systems biology, including Akhilesh Pandey, Gilbert S. Omenn, Mark S. Baker, and Robert L. Moritz, shed light on different aspects of the human proteome drafts and missing proteins. Importantly, the possible reasons for the "missing proteins" in shotgun MS workflow were identified and debated by experts as low tissue expression, lack of enzymatic digestion site, or protein lost during extraction, among other contributing factors. 
To capture the missing proteins, the experts' collective view was to study the wider tissue range with multiple digesting enzymes and follow targeted proteomics workflow in particular. On the innovation trajectory from the proteomics laboratory to novel proteomics diagnostics and therapeutics in society, we will also need new conceptual frames for translation science and innovation strategy in proteomics. These will embody both technical as well as rigorous social science and humanities considerations to understand the correlates of the proteome from cell to society.
Proteomic identification of rhythmic proteins in rice seedlings.
Hwang, Heeyoun; Cho, Man-Ho; Hahn, Bum-Soo; Lim, Hyemin; Kwon, Yong-Kook; Hahn, Tae-Ryong; Bhoo, Seong Hee
2011-04-01
Many aspects of plant metabolism that are involved in plant growth and development are influenced by light-regulated diurnal rhythms as well as endogenous clock-regulated circadian rhythms. To identify the rhythmic proteins in rice, periodically grown (12h light/12h dark cycle) seedlings were harvested for three days at six-hour intervals. Continuous dark-adapted plants were also harvested for two days. Among approximately 3000 reproducible protein spots on each gel, proteomic analysis ascertained 354 spots (~12%) as light-regulated rhythmic proteins, in which 53 spots showed prolonged rhythm under continuous dark conditions. Of these 354 ascertained rhythmic protein spots, 74 diurnal spots and 10 prolonged rhythmic spots under continuous dark were identified by MALDI-TOF MS analysis. The rhythmic proteins were functionally classified into photosynthesis, central metabolism, protein synthesis, nitrogen metabolism, stress resistance, signal transduction and unknown. Comparative analysis of our proteomic data with the public microarray database (the Plant DIURNAL Project) and RT-PCR analysis of rhythmic proteins showed differences in rhythmic expression phases between mRNA and protein, suggesting that the clock-regulated proteins in rice are modulated by not only transcriptional but also post-transcriptional, translational, and/or post-translational processes. 2011 Elsevier B.V. All rights reserved.
Elamin, Ashraf; Titz, Bjoern; Dijon, Sophie; Merg, Celine; Geertz, Marcel; Schneider, Thomas; Martin, Florian; Schlage, Walter K; Frentzel, Stefan; Talamo, Fabio; Phillips, Blaine; Veljkovic, Emilija; Ivanov, Nikolai V; Vanscheeuwijck, Patrick; Peitsch, Manuel C; Hoeng, Julia
2016-08-11
Smoking is associated with several serious diseases, such as lung cancer and chronic obstructive pulmonary disease (COPD). Within our systems toxicology framework, we are assessing whether potential modified risk tobacco products (MRTP) can reduce smoking-related health risks compared to conventional cigarettes. In this article, we evaluated to what extent 2D-PAGE/MALDI MS/MS (2D-PAGE) can complement the iTRAQ LC-MS/MS results from a previously reported mouse inhalation study, in which we assessed a prototypic MRTP (pMRTP). Selected differentially expressed proteins identified by both LC-MS/MS and 2D-PAGE approaches were further verified using reverse-phase protein microarrays. LC-MS/MS captured the effects of cigarette smoke (CS) on the lung proteome more comprehensively than 2D-PAGE. However, an integrated analysis of both proteomics data sets showed that 2D-PAGE data complement the LC-MS/MS results by supporting the overall trend of lower effects of pMRTP aerosol than CS on the lung proteome. Biological effects of CS exposure supported by both methods included increases in immune-related, surfactant metabolism, proteasome, and actin cytoskeleton protein clusters. Overall, while 2D-PAGE has its value, especially as a complementary method for the analysis of effects on intact proteins, LC-MS/MS approaches will likely be the method of choice for proteome analysis in systems toxicology investigations. Quantitative proteomics is anticipated to play a growing role within systems toxicology assessment frameworks in the future. To further understand how different proteomics technologies can contribute to toxicity assessment, we conducted a quantitative proteomics analysis using 2D-PAGE and isobaric tag-based LC-MS/MS approaches and compared the results produced from the 2 approaches. 
Using a prototypic modified risk tobacco product (pMRTP) as our test item, we show how 2D-PAGE results can complement and support LC-MS/MS data, demonstrating the much lower effects of pMRTP aerosol than cigarette smoke on the mouse lung proteome. The combined analysis of 2D-PAGE and LC-MS/MS data identified an effect of cigarette smoke on the proteasome and actin cytoskeleton in the lung. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Li, Nan; Stein, Richard S L; He, Wei; Komives, Elizabeth; Wang, Wei
2013-10-01
Methylation is one of the important post-translational modifications that play critical roles in regulating protein functions. Proteomic identification of this post-translational modification and understanding how it affects protein activity remain great challenges. We tackled this problem from the aspect of methylation mediating protein-protein interaction. Using the chromodomain of human chromobox protein homolog 6 as a model system, we developed a systematic approach that integrates structure modeling, bioinformatics analysis, and peptide microarray experiments to identify lysine residues that are methylated and recognized by the chromodomain in the human proteome. Given the important role of chromobox protein homolog 6 as a reader of histone modifications, it was interesting to find that the majority of its interacting partners identified via this approach function in chromatin remodeling and transcriptional regulation. Our study not only illustrates a novel angle for identifying methyllysines on a proteome-wide scale and elucidating their potential roles in regulating protein function, but also suggests possible strategies for engineering the chromodomain-peptide interface to enhance the recognition of and manipulate the signal transduction mediated by such interactions.
High throughput gene expression profiling: a molecular approach to integrative physiology
Liang, Mingyu; Cowley, Allen W; Greene, Andrew S
2004-01-01
Integrative physiology emphasizes the importance of understanding multiple pathways with overlapping, complementary, or opposing effects and their interactions in the context of intact organisms. The DNA microarray technology, the most commonly used method for high-throughput gene expression profiling, has been touted as an integrative tool that provides insights into regulatory pathways. However, the physiology community has been slow in acceptance of these techniques because of early failure in generating useful data and the lack of a cohesive theoretical framework in which experiments can be analysed. With recent advances in both technology and analysis, we propose a concept of multidimensional integration of physiology that incorporates data generated by DNA microarray and other functional, genomic, and proteomic approaches to achieve a truly integrative understanding of physiology. Analysis of several studies performed in simpler organisms or in mammalian model animals supports the feasibility of such multidimensional integration and demonstrates the power of DNA microarray as an indispensable molecular tool for such integration. Evaluation of DNA microarray techniques indicates that these techniques, despite limitations, have advanced to a point where the question-driven profiling research has become a feasible complement to the conventional, hypothesis-driven research. With a keen sense of homeostasis, global regulation, and quantitative analysis, integrative physiologists are uniquely positioned to apply these techniques to enhance the understanding of complex physiological functions. PMID:14678487
Gandhi, Deepa; Tarale, Prashant; Naoghare, Pravin K; Bafana, Amit; Kannan, Krishnamurthi; Sivanesan, Saravanadevi
2016-01-01
Endosulfan, an organochlorine pesticide, is known to induce multiple disorders/abnormalities including neuro-degenerative disorders in many animal species. However, the molecular mechanism of endosulfan induced neuronal alterations is still not well understood. In the present study, the effect of sub-lethal concentration of endosulfan (3 μM) on human neuroblastoma cells (SH-SY5Y) was investigated using genomic and proteomic approaches. Microarray and 2D-PAGE followed by MALDI-TOF-MS analysis revealed differential expression of 831 transcripts and 16 proteins in exposed cells. A gene ontology enrichment analysis revealed that the differentially expressed genes and proteins were involved in variety of cellular events such as neuronal developmental pathway, immune response, cell differentiation, apoptosis, transmission of nerve impulse, axonogenesis, etc. The present study attempted to explore the possible molecular mechanism of endosulfan induced neuronal alterations in SH-SY5Y cells using an integrated genomic and proteomic approach. Based on the gene and protein profile possible mechanisms underlying endosulfan neurotoxicity were predicted. Copyright © 2015 Elsevier B.V. All rights reserved.
Even-Desrumeaux, Klervi; Baty, Daniel; Chames, Patrick
2010-01-01
Antibody microarrays are among the novel class of rapidly emerging proteomic technologies that will allow us to efficiently perform specific diagnosis and proteome analysis. Recombinant antibody fragments are especially suited for this approach but their stability is often a limiting factor. Camelids produce functional antibodies devoid of light chains (HCAbs) of which the single N-terminal domain is fully capable of antigen binding. When produced as an independent domain, these so-called single domain antibody fragments (sdAbs) have several advantages for biotechnological applications thanks to their unique properties of size (15 kDa), stability, solubility, and expression yield. These features should allow sdAbs to outperform other antibody formats in a number of applications, notably as capture molecules for antibody arrays. In this study, we have produced antibody microarrays using direct and oriented immobilization of sdAbs produced in crude bacterial lysates to generate proof-of-principle of a high-throughput compatible array design. Several sdAb immobilization strategies have been explored. Immobilization of in vivo biotinylated sdAbs by direct spotting of bacterial lysate on streptavidin and sandwich detection was developed to achieve high sensitivity and specificity, whereas immobilization of “multi-tagged” sdAbs via anti-tag antibodies and direct labeled sample detection strategy was optimized for the design of high-density antibody arrays for high-throughput proteomics and identification of potential biomarkers. PMID:20859568
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fukunaga, Satoki; Environmental Health Science Laboratory, Sumitomo Chemical Co., Ltd., 3-1-98 Kasugade-Naka, Konohana-ku, Osaka 554-8558; Kakehashi, Anna
To determine miRNAs and their predicted target proteins regulatory networks which are potentially involved in onset of pulmonary fibrosis in the bleomycin rat model, we conducted integrative miRNA microarray and iTRAQ-coupled LC-MS/MS proteomic analyses, and evaluated the significance of altered biological functions and pathways. We observed that alterations of miRNAs and proteins are associated with the early phase of bleomycin-induced pulmonary fibrosis, and identified potential target pairs by using ingenuity pathway analysis. Using the data set of these alterations, it was demonstrated that those miRNAs, in association with their predicted target proteins, are potentially involved in canonical pathways reflective of initial epithelial injury and fibrogenic processes, and biofunctions related to induction of cellular development, movement, growth, and proliferation. Prediction of activated functions suggested that lung cells acquire proliferative, migratory, and invasive capabilities, and resistance to cell death especially in the very early phase of bleomycin-induced pulmonary fibrosis. The present study will provide new insights for understanding the molecular pathogenesis of idiopathic pulmonary fibrosis. - Highlights: • We analyzed bleomycin-induced pulmonary fibrosis in the rat. • Integrative analyses of miRNA microarray and proteomics were conducted. • We determined the alterations of miRNAs and their potential target proteins. • The alterations may control biological functions and pathways in pulmonary fibrosis. • Our result may provide new insights of pulmonary fibrosis.
Identifying novel glioma associated pathways based on systems biology level meta-analysis.
Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong
2013-01-01
Recent advances in microarray technology across genomics, proteomics, and metabolomics pose a great challenge: integrating these "-omics" data to analyze complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecular mechanisms underlying glioma remains very important. To date, most studies have focused on detecting differentially expressed genes in glioma. However, meta-analysis at the pathway level based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach that integrates three types of omics data to identify common pathways in glioma. First, a meta-analysis was performed to study the overlap of signatures at different levels based on microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways in the GeneGO database were found to be shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for further pathway enrichment analysis. As a result, we suggest that 5 of these pathways could serve as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that meta-analysis at the systems biology level provides a more useful approach to studying the molecular mechanisms of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggests some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention in glioma.
Fungal proteomics: from identification to function.
Doyle, Sean
2011-08-01
Some fungi cause disease in humans and plants, while others have demonstrable potential for the control of insect pests. In addition, fungi are also a rich reservoir of therapeutic metabolites and industrially useful enzymes. Detailed analysis of fungal biochemistry is now enabled by multiple technologies including protein mass spectrometry, genome and transcriptome sequencing and advances in bioinformatics. Yet, the assignment of function to fungal proteins, encoded either by in silico annotated, or unannotated genes, remains problematic. The purpose of this review is to describe the strategies used by many researchers to reveal protein function in fungi, and more importantly, to consolidate the nomenclature of 'unknown function protein' as opposed to 'hypothetical protein' - once any protein has been identified by protein mass spectrometry. A combination of approaches including comparative proteomics, pathogen-induced protein expression and immunoproteomics are outlined, which, when used in combination with a variety of other techniques (e.g. functional genomics, microarray analysis, immunochemical and infection model systems), appear to yield comprehensive and definitive information on protein function in fungi. The relative advantages of proteomic, as opposed to transcriptomic-only, analyses are also described. In the future, combined high-throughput, quantitative proteomics, allied to transcriptomic sequencing, are set to reveal much about protein function in fungi. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
DAnTE: a statistical tool for quantitative analysis of –omics data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Polpitiya, Ashoka D.; Qian, Weijun; Jaitly, Navdeep
2008-05-03
DAnTE (Data Analysis Tool Extension) is a statistical tool designed to address challenges unique to quantitative bottom-up, shotgun proteomics data. This tool has also been demonstrated for microarray data and can easily be extended to other high-throughput data types. DAnTE features selected normalization methods, missing value imputation algorithms, peptide to protein rollup methods, an extensive array of plotting functions, and a comprehensive ANOVA scheme that can handle unbalanced data and random effects. The Graphical User Interface (GUI) is designed to be very intuitive and user friendly.
Yao, Chenxi; Wang, Tao; Zhang, Buqing; He, Dacheng; Na, Na; Ouyang, Jin
2015-11-01
The interaction between bioactive small molecule ligands and proteins is one of the important research areas in proteomics. Herein, a simple and rapid method is established to screen small ligands that bind to proteins. We designed an agarose slide to immobilize different proteins. The protein microarrays were allowed to interact with different small ligands, and after washing, the microarrays were screened by desorption electrospray ionization mass spectrometry (DESI MS). This method can be applied to screen specific protein binding ligands and was shown for seven proteins and 34 known ligands for these proteins. In addition, a high-throughput screening was achieved, with the analysis requiring approximately 4 s for one sample spot. We then applied this method to determine the binding between the important protein matrix metalloproteinase-9 (MMP-9) and 88 small compounds. The molecular docking results confirmed the MS results, demonstrating that this method is suitable for the rapid and accurate screening of ligands binding to proteins.
Tips and Tricks for Successful Application of Statistical Methods to Biological Data.
Schlenker, Evelyn
2016-01-01
This chapter discusses experimental design and the use of statistics to describe characteristics of data (descriptive statistics) and inferential statistics that test the hypothesis posed by the investigator. Inferential statistics, based on probability distributions, depend upon the type and distribution of the data. For data that are continuous, randomly and independently selected, and normally distributed, more powerful parametric tests such as Student's t test and analysis of variance (ANOVA) can be used. For non-normally distributed or skewed data, transformation of the data (using logarithms) may normalize the data, allowing use of parametric tests. Alternatively, with skewed data nonparametric tests can be utilized, some of which rely on data that are ranked prior to statistical analysis. Experimental designs and analyses need to balance between committing type 1 errors (false positives) and type 2 errors (false negatives). For a variety of clinical studies that determine risk or benefit, relative risk ratios (randomized clinical trials and cohort studies) or odds ratios (case-control studies) are utilized. Although both use 2 × 2 tables, their premise and calculations differ. Finally, special statistical methods are applied to microarray and proteomics data, since the large number of genes or proteins evaluated increases the likelihood of false discoveries. Additional studies in separate samples are used to verify microarray and proteomic data. Examples in this chapter and references are available to help continued investigation of experimental designs and appropriate data analysis.
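As a rough illustration of two points in the abstract above, the following sketch contrasts the relative risk and odds ratio calculations on the same 2 × 2 table and shows one widely used false discovery correction, the Benjamini-Hochberg procedure. The chapter does not name a specific correction, so the choice of Benjamini-Hochberg is an assumption, and the function names and counts are hypothetical.

```python
def relative_risk(a, b, c, d):
    """Risk in the exposed group divided by risk in the unexposed group.

    a, b = exposed subjects with / without the outcome
    c, d = unexposed subjects with / without the outcome
    """
    return (a / (a + b)) / (c / (c + d))


def odds_ratio(a, b, c, d):
    """Odds of the outcome in the exposed group divided by odds in the unexposed group."""
    return (a * d) / (b * c)


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order.

    Assumption: this procedure stands in for the unspecified
    'special statistical methods' mentioned in the abstract.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical counts: 20 of 100 exposed and 10 of 100 unexposed have the outcome.
rr = relative_risk(20, 80, 10, 90)
oratio = odds_ratio(20, 80, 10, 90)
adj = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

With these made-up counts the two measures differ even though they come from the same table, which is the distinction the abstract draws between cohort-style and case-control designs.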
Tao, Zhihua; Gao, Peng; Liu, Hung-Wen
2009-12-15
Poly(ADP-ribosyl)ation of various nuclear proteins catalyzed by a family of NAD+-dependent enzymes, poly(ADP-ribose) polymerases (PARPs), is an important posttranslational modification reaction. PARP activity has been demonstrated in all types of eukaryotic cells with the exception of yeast, in which the expression of human PARP-1 was shown to lead to retarded cell growth. We investigated the yeast growth inhibition caused by human PARP-1 expression in Saccharomyces cerevisiae. Flow cytometry analysis reveals that PARP-1-expressing yeast cells accumulate in the G2/M stage of the cell cycle. Confocal microscopy analysis shows that human PARP-1 is distributed throughout the nucleus of yeast cells but is enriched in the nucleolus. Utilizing yeast proteome microarray screening, we identified 33 putative PARP-1 substrates, six of which are known to be involved in ribosome biogenesis. The poly(ADP-ribosyl)ation of three of these yeast proteins, together with two human homologues, was confirmed by an in vitro PARP-1 assay. Finally, a polysome profile analysis using sucrose gradient ultracentrifugation demonstrated that the ribosome levels in yeast cells expressing PARP-1 are lower than those in control yeast cells. Overall, our data suggest that human PARP-1 may affect ribosome biogenesis by modifying certain nucleolar proteins in yeast. The artificial PARP-1 pathway in yeast may be used as a simple platform to identify substrates and verify function of this important enzyme.
Multiplexed protein profiling on microarrays by rolling-circle amplification
Schweitzer, Barry; Roberts, Scott; Grimwade, Brian; Shao, Weiping; Wang, Minjuan; Fu, Qin; Shu, Quiping; Laroche, Isabelle; Zhou, Zhimin; Tchernev, Velizar T.; Christiansen, Jason; Velleca, Mark; Kingsmore, Stephen F.
2010-01-01
Fluorescent-sandwich immunoassays on microarrays hold appeal for proteomics studies, because equipment and antibodies are readily available, and assays are simple, scalable, and reproducible. The achievement of adequate sensitivity and specificity, however, requires a general method of immunoassay amplification. We describe coupling of isothermal rolling-circle amplification (RCA) to universal antibodies for this purpose. A total of 75 cytokines were measured simultaneously on glass arrays with signal amplification by RCA with high specificity, femtomolar sensitivity, 3 log quantitative range, and economy of sample consumption. A 51-feature RCA cytokine glass array was used to measure secretion from human dendritic cells (DCs) induced by lipopolysaccharide (LPS) or tumor necrosis factor-α (TNF-α). As expected, LPS induced rapid secretion of inflammatory cytokines such as macrophage inflammatory protein (MIP)-1β, interleukin (IL)-8, and interferon-inducible protein (IP)-10. We found that eotaxin-2 and I-309 were induced by LPS; in addition, macrophage-derived chemokine (MDC), thymus and activation-regulated chemokine (TARC), soluble interleukin 6 receptor (sIL-6R), and soluble tumor necrosis factor receptor I (sTNF-RI) were induced by TNF-α treatment. Because microarrays can accommodate ~1,000 sandwich immunoassays of this type, a relatively small number of RCA microarrays seem to offer a tractable approach for proteomic surveys. PMID:11923841
A systematic evaluation of normalization methods in quantitative label-free proteomics.
Välikangas, Tommi; Suomi, Tomi; Elo, Laura L
2018-01-01
To date, mass spectrometry (MS) data remain inherently biased for reasons ranging from sample handling to differences caused by the instrumentation. Normalization is the process that aims to account for the bias and make samples more comparable. The selection of a proper normalization method is a pivotal task for the reliability of the downstream analysis and results. Many normalization methods commonly used in proteomics have been adapted from DNA microarray techniques. Previous studies comparing normalization methods in proteomics have focused mainly on intragroup variation. In this study, several popular and widely used normalization methods representing different strategies in normalization are evaluated using three spike-in and one experimental mouse label-free proteomic data sets. The normalization methods are evaluated in terms of their ability to reduce variation between technical replicates, their effect on differential expression analysis and their effect on the estimation of logarithmic fold changes. Additionally, we examined whether normalizing the whole data globally or in segments for the differential expression analysis has an effect on the performance of the normalization methods. We found that variance stabilization normalization (Vsn) reduced variation the most between technical replicates in all examined data sets. Vsn also performed consistently well in the differential expression analysis. Linear regression normalization and local regression normalization also performed systematically well. Finally, we discuss the choice of a normalization method and some qualities of a suitable normalization method in the light of the results of our evaluation. © The Author 2016. Published by Oxford University Press.
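The first evaluation criterion named above, reduction of variation between technical replicates, can be sketched as follows. This is a minimal illustration that uses simple median centering in log2 space, not Vsn or the regression-based methods the study evaluates; the function names and intensities are hypothetical.

```python
import math
from statistics import median, pstdev


def log2_transform(samples):
    """samples: one inner list of raw intensities per replicate."""
    return [[math.log2(x) for x in s] for s in samples]


def median_center(logged):
    """Shift each replicate so that all replicates share the same median.

    Assumption: median centering is used here only because it is easy
    to follow; it is not the method the study found to perform best.
    """
    medians = [median(s) for s in logged]
    grand = sum(medians) / len(medians)
    return [[x - m + grand for x in s] for s, m in zip(logged, medians)]


def mean_feature_sd(samples):
    """Average per-feature standard deviation across replicates."""
    n_features = len(samples[0])
    sds = [pstdev([s[j] for s in samples]) for j in range(n_features)]
    return sum(sds) / n_features


# Hypothetical technical replicates; the second was loaded at twice the amount.
raw = [[100.0, 200.0, 400.0], [200.0, 400.0, 800.0]]
logged = log2_transform(raw)
sd_before = mean_feature_sd(logged)
sd_after = mean_feature_sd(median_center(logged))
```

In this toy case the only difference between replicates is a constant loading factor, so median centering removes it entirely; real data would also carry intensity-dependent bias, which is where the methods compared in the study differ.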
Novel Biomarker Candidates for Colorectal Cancer Metastasis: A Meta-analysis of In Vitro Studies
Long, Nguyen Phuoc; Lee, Wun Jun; Huy, Nguyen Truong; Lee, Seul Ji; Park, Jeong Hill; Kwon, Sung Won
2016-01-01
Colorectal cancer (CRC) is one of the most common and lethal cancers. Although numerous studies have evaluated potential biomarkers for early diagnosis, current biomarkers have failed to reach an acceptable level of accuracy for distant metastasis. In this paper, we performed a gene set meta-analysis of in vitro microarray studies and combined the results from this study with previously published proteomic data to validate and suggest prognostic candidates for CRC metastasis. Analysis of the two included microarray data sets identified 21 significant genes. Of these significant genes, ALDOA, IL8 (CXCL8), and PARP4 had strong potential as prognostic candidates. LAMB2, MCM7, CXCL23A, SERPINA3, ABCA3, ALDH3A2, and POLR2I also have potential. Other candidates were more controversial, possibly because of the biologic heterogeneity of tumor cells, which is a major obstacle to predicting metastasis. In conclusion, we demonstrated a meta-analysis approach and successfully suggested ten biomarker candidates for future investigation. PMID:27688707
Li, Yong-Fang; Mahalingam, Ramamurthy; Sunkar, Ramanjulu
2017-01-01
Alteration of gene expression is an essential mechanism that allows plants to respond and adapt to adverse environmental conditions. Transcriptome and proteome analyses in plants exposed to abiotic stresses revealed that protein levels are not correlated with the changes in corresponding mRNAs, indicating that regulation at the translational level is another major determinant of gene expression. Analysis of the translatome, which refers to all mRNAs associated with ribosomes, thus has the potential to bridge the gap between transcriptome and proteome. Polysomal RNA profiling and the recently developed ribosome profiling (Ribo-seq) are the two main methods for translatome analysis at the global level. Here, we describe the classical procedure for polysomal RNA isolation by sucrose gradient ultracentrifugation followed by high-throughput RNA-seq to identify genes regulated at the translational level. Polysomal RNA can be further used for a variety of downstream applications including Northern blot analysis, qRT-PCR, RNase protection assay, and microarray-based gene expression profiling.
The Proteomic Signature of Aspergillus fumigatus During Early Development
Cagas, Steven E.; Jain, Mohit Raja; Li, Hong; Perlin, David S.
2011-01-01
Aspergillus fumigatus is a saprophytic fungus that causes a range of diseases in humans including invasive aspergillosis. All forms of disease begin with the inhalation of conidia, which germinate and develop. Four stages of early development were evaluated using the gel free system of isobaric tagging for relative and absolute quantitation to determine the full proteomic profile of the pathogen. A total of 461 proteins were identified at 0, 4, 8, and 16 h and fold changes for each were established. Ten proteins including the hydrophobin rodlet protein RodA and a protein involved in melanin synthesis Abr2 were found to decrease relative to conidia. To generate a more comprehensive view of early development, a whole genome microarray analysis was performed comparing conidia to 8 and 16 h of growth. A total of 1871 genes were found to change significantly at 8 h with 1001 genes up-regulated and 870 down-regulated. At 16 h, 1235 genes changed significantly with 855 up-regulated and 380 down-regulated. When a comparison between the proteomics and microarray data was performed at 8 h, a total of 22 proteins with significant changes also had corresponding genes that changed significantly. When the same comparison was performed at 16 h, 12 protein and gene combinations were found. This study, the most comprehensive to date, provides insights into early pathways activated during growth and development of A. fumigatus. It reveals a pathogen that is gearing up for rapid growth by building translation machinery, generating ATP, and is very much committed to aerobic metabolism. PMID:21825280
Cross-platform method for identifying candidate network biomarkers for prostate cancer.
Jin, G; Zhou, X; Cui, K; Zhang, X-S; Chen, L; Wong, S T C
2009-11-01
Discovering biomarkers using mass spectrometry (MS) and microarray expression profiles is a promising strategy in molecular diagnosis. Here, the authors propose a new pipeline for biomarker discovery that integrates disease information for proteins and genes, expression profiles at both the genomic and proteomic levels, and protein-protein interactions (PPIs) to discover high-confidence network biomarkers. Using this pipeline, a total of 474 molecules (genes and proteins) related to prostate cancer were identified and a prostate-cancer-related network (PCRN) was derived from the integrative information. Thus, a set of candidate network biomarkers was identified from multiple expression profiles comprising eight microarray datasets and one proteomics dataset. The network biomarkers with PPIs can accurately distinguish prostate cancer patients from normal controls, and potentially provide more reliable biomarker candidates than conventional biomarker discovery methods.
Natural variation in floral nectar proteins of two Nicotiana attenuata accessions.
Seo, Pil Joon; Wielsch, Natalie; Kessler, Danny; Svatos, Ales; Park, Chung-Mo; Baldwin, Ian T; Kim, Sang-Gyu
2013-07-13
Floral nectar (FN) contains not only energy-rich compounds to attract pollinators, but also defense chemicals and several proteins. However, proteomic analysis of FN has been hampered by the lack of publicly available sequence information from nectar-producing plants. Here we used next-generation sequencing and advanced proteomics to profile FN proteins in the opportunistic outcrossing wild tobacco, Nicotiana attenuata. We constructed a transcriptome database of N. attenuata and characterized its nectar proteome using LC-MS/MS. The FN proteins of N. attenuata included nectarins, sugar-cleaving enzymes (glucosidase, galactosidase, and xylosidase), RNases, pathogen-related proteins, and lipid transfer proteins. Natural variation in FN proteins of eleven N. attenuata accessions revealed a negative relationship between the accumulation of two abundant proteins, nectarin1b and nectarin5. In addition, microarray analysis of nectary tissues revealed that protein accumulation in FN is not simply correlated with the accumulation of transcripts encoding FN proteins and identified a group of genes that were specifically expressed in the nectary. Natural variation of identified FN proteins in the ecological model plant N. attenuata suggests that nectar chemistry may have a complex function in plant-pollinator-microbe interactions. PMID:23848992
Bordner, Kelly A.; George, Elizabeth D.; Carlyle, Becky C.; Duque, Alvaro; Kitchen, Robert R.; Lam, TuKiet T.; Colangelo, Christopher M.; Stone, Kathryn L.; Abbott, Thomas B.; Mane, Shrikant M.; Nairn, Angus C.; Simen, Arthur A.
2011-01-01
Early life neglect is an important public health problem which can lead to lasting psychological dysfunction. Good animal models are necessary to understand the mechanisms responsible for the behavioral and anatomical pathology that results. We recently described a novel model of early life neglect, maternal separation with early weaning (MSEW), that produces behavioral changes in the mouse that persist into adulthood. To begin to understand the mechanism by which MSEW leads to these changes we applied cDNA microarray, next-generation RNA-sequencing (RNA-seq), label-free proteomics, multiple reaction monitoring (MRM) proteomics, and methylation analysis to tissue samples obtained from medial prefrontal cortex to determine the molecular changes induced by MSEW that persist into adulthood. The results show that MSEW leads to dysregulation of markers of mature oligodendrocytes and genes involved in protein translation and other categories, an apparent downward biasing of translation, and methylation changes in the promoter regions of selected dysregulated genes. These findings are likely to prove useful in understanding the mechanism by which early life neglect affects brain structure, cognition, and behavior. PMID:21629843
Säll, Anna; Walle, Maria; Wingren, Christer; Müller, Susanne; Nyman, Tomas; Vala, Andrea; Ohlin, Mats; Borrebaeck, Carl A K; Persson, Helena
2016-10-01
Antibody-based proteomics offers distinct advantages in the analysis of complex samples for discovery and validation of biomarkers associated with disease. However, its large-scale implementation requires tools and technologies that allow development of suitable antibodies or antibody fragments in a high-throughput manner. To address this we designed and constructed two human synthetic antibody fragment (scFv) libraries denoted HelL-11 and HelL-13. By the use of phage display technology, in total 466 unique scFv antibodies specific for 114 different antigens were generated. The specificities of these antibodies were analyzed in a variety of immunochemical assays and a subset was further evaluated for functionality in protein microarray applications. This high-throughput approach demonstrates the ability to rapidly generate a wealth of reagents not only for proteome research, but potentially also for diagnostics and therapeutics. In addition, this work provides a great example of how a synthetic approach can be used to optimize library designs. By having precise control of the diversity introduced into the antigen-binding sites, synthetic libraries offer increased understanding of how different diversity contributes to antibody binding reactivity and stability, thereby providing the key to future library optimization. © The Author 2016. Published by Oxford University Press. All rights reserved.
Luo, Yanzhang; Mok, Tin Seak; Lin, Xiuxian; Zhang, Wanling; Cui, Yizhi; Guo, Jiahui; Chen, Xing; Zhang, Tao; Wang, Tong
2017-01-01
Nasopharyngeal carcinoma (NPC) is a serious threat to public health, and biomarker discovery is urgently needed. Sequential window acquisition of all theoretical fragment-ion spectra (SWATH) mass spectrometry (MS), based on data-independent acquisition (DIA), has proved to be precise in protein quantitation and efficient for cancer biomarker research. In this study, we performed the first SWATH-MS analysis comparing NPC and normal tissues. Spike-in stable isotope labeling by amino acids in cell culture (super-SILAC) MS was used as a shotgun reference. We identified and quantified 1414 proteins across all SWATH-MS analyses. We found that SWATH-MS had a unique tendency to preferentially detect proteins with smaller molecular weights than either super-SILAC MS or the human proteome background. With SWATH-MS, 29 significant differentially expressed proteins (DEPs) were identified. Among them, carbonic anhydrase 2 (CA2) was selected for further validation based on novelty, MS quality and other supporting rationale. With tissue microarray analysis, we found that CA2 had an AUC of 0.94 in differentiating NPC from normal tissue samples. In conclusion, SWATH-MS has unique features in proteome analysis, and it led to the identification of CA2 as a potentially new diagnostic biomarker for NPC. PMID:28117408
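For readers less familiar with the AUC figure quoted above, the sketch below computes an area under the ROC curve directly from its rank interpretation: the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one. The scores are invented for illustration and are not the study's tissue microarray data, and the function name is hypothetical.

```python
def auc_from_scores(positive_scores, negative_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the
    positive sample has the higher score; ties count as half.
    """
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Hypothetical staining scores (not from the study).
tumor = [0.9, 0.8, 0.4]
normal = [0.5, 0.3, 0.2]
auc = auc_from_scores(tumor, normal)
```

An AUC of 0.5 corresponds to a marker that ranks tumor and normal samples no better than chance, and 1.0 to perfect separation, which gives some context for the reported value of 0.94.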
Patel, Vyomesh; Hood, Brian L; Molinolo, Alfredo A; Lee, Norman H; Conrads, Thomas P; Braisted, John C; Krizman, David B; Veenstra, Timothy D; Gutkind, J Silvio
2008-02-15
Squamous cell carcinoma of the head and neck (HNSCC), the sixth most prevalent cancer among men worldwide, is associated with poor prognosis, which has improved only marginally over the past three decades. A proteomic analysis of HNSCC lesions may help identify novel molecular targets for the early detection, prevention, and treatment of HNSCC. Laser capture microdissection was combined with recently developed techniques for protein extraction from formalin-fixed paraffin-embedded (FFPE) tissues and a novel proteomics platform. Approximately 20,000 cells procured from FFPE tissue sections of normal oral epithelium and well, moderately, and poorly differentiated HNSCC were processed for mass spectrometry and bioinformatic analysis. A large number of proteins expressed in normal oral epithelium and HNSCC, including cytokeratins, intermediate filaments, differentiation markers, and proteins involved in stem cell maintenance, signal transduction, migration, cell cycle regulation, growth and angiogenesis, matrix degradation, and proteins with tumor suppressive and oncogenic potential, were readily detected. Of interest, the relative expression of many of these molecules followed a distinct pattern in normal squamous epithelia and well, moderately, and poorly differentiated HNSCC tumor tissues. Representative proteins were further validated using immunohistochemical studies in HNSCC tissue sections and tissue microarrays. The ability to combine laser capture microdissection and in-depth proteomic analysis of FFPE tissues provided a wealth of information regarding the nature of the proteins expressed in normal squamous epithelium and during HNSCC progression, which may allow the development of novel biomarkers of diagnostic and prognostic value and the identification of novel targets for therapeutic intervention in HNSCC.
Rohban, Rokhsareh; Reinisch, Andreas; Etchart, Nathalie; Schallmoser, Katharina; Hofmann, Nicole A.; Szoke, Krisztina; Brinchmann, Jan E.; Rad, Ehsan Bonyadi; Rohde, Eva; Strunk, Dirk
2013-01-01
Therapeutic neo-vasculogenesis in vivo can be achieved by the co-transplantation of human endothelial colony-forming progenitor cells (ECFCs) with mesenchymal stem/progenitor cells (MSPCs). The underlying mechanism is not completely understood thus hampering the development of novel stem cell therapies. We hypothesized that proteomic profiling could be used to retrieve the in vivo signaling signature during the initial phase of human neo-vasculogenesis. ECFCs and MSPCs were therefore either transplanted alone or co-transplanted subcutaneously into immune deficient mice. Early cell signaling, occurring within the first 24 hours in vivo, was analyzed using antibody microarray proteomic profiling. Vessel formation and persistence were verified in parallel transplants for up to 24 weeks. Proteomic analysis revealed significant alteration of regulatory components including caspases, calcium/calmodulin-dependent protein kinase, DNA protein kinase, human ErbB2 receptor-tyrosine kinase as well as mitogen-activated protein kinases. Caspase-4 was selected from array results as one therapeutic candidate for targeting vascular network formation in vitro as well as modulating therapeutic vasculogenesis in vivo. As a proof-of-principle, caspase-4 and general caspase-blocking led to diminished endothelial network formation in vitro and significantly decreased vasculogenesis in vivo. Proteomic profiling ex vivo thus unraveled a signaling signature which can be used for target selection to modulate neo-vasculogenesis in vivo. PMID:23826172
Bioinformatics/biostatistics: microarray analysis.
Eichler, Gabriel S
2012-01-01
The quantity and complexity of the molecular-level data generated in both research and clinical settings require the use of sophisticated, powerful computational interpretation techniques. It is for this reason that bioinformatic analysis of complex molecular profiling data has become a fundamental technology in the development of personalized medicine. This chapter provides a high-level overview of the field of bioinformatics and outlines several classic bioinformatic approaches. The highlighted approaches can be aptly applied to nearly any sort of high-dimensional genomic, proteomic, or metabolomic experiment. Technologies reviewed in this chapter include traditional clustering analysis, the Gene Expression Dynamics Inspector (GEDI), GoMiner, Gene Set Enrichment Analysis (GSEA), and the Learner of Functional Enrichment (LeFE).
CEBS object model for systems biology data, SysBio-OM.
Xirasagar, Sandhya; Gustafson, Scott; Merrick, B Alex; Tomer, Kenneth B; Stasiewicz, Stanley; Chan, Denny D; Yost, Kenneth J; Yates, John R; Sumner, Susan; Xiao, Nianqing; Waters, Michael D
2004-09-01
To promote a systems biology approach to understanding the biological effects of environmental stressors, the Chemical Effects in Biological Systems (CEBS) knowledge base is being developed to house data from multiple complex data streams in a systems friendly manner that will accommodate extensive querying from users. Unified data representation via a single object model will greatly aid in integrating data storage and management, and facilitate reuse of software to analyze and display data resulting from diverse differential expression or differential profile technologies. Data streams include, but are not limited to, gene expression analysis (transcriptomics), protein expression and protein-protein interaction analysis (proteomics) and changes in low molecular weight metabolite levels (metabolomics). To enable the integration of microarray gene expression, proteomics and metabolomics data in the CEBS system, we designed an object model, Systems Biology Object Model (SysBio-OM). The model is comprehensive and leverages other open source efforts, namely the MicroArray Gene Expression Object Model (MAGE-OM) and the Proteomics Experiment Data Repository (PEDRo) object model. SysBio-OM is designed by extending MAGE-OM to represent protein expression data elements (including those from PEDRo), protein-protein interaction and metabolomics data. SysBio-OM promotes the standardization of data representation and data quality by facilitating the capture of the minimum annotation required for an experiment. Such standardization refines the accuracy of data mining and interpretation. The open source SysBio-OM model, which can be implemented on varied computing platforms, is presented here. A Unified Modeling Language (UML) depiction of the entire SysBio-OM is available at http://cebs.niehs.nih.gov/SysBioOM/. The Rational Rose object model package is distributed under an open source license that permits unrestricted academic and commercial use and is available at http://cebs.niehs.nih.gov/cebsdownloads. The database and interface are being built to implement the model and will be available for public use at http://cebs.niehs.nih.gov.
Ambati, Aditya; Valentini, Davide; Montomoli, Emanuele; Lapini, Guilia; Biuso, Fabrizio; Wenschuh, Holger; Magalhaes, Isabelle; Maeurer, Markus
2015-01-01
A high content peptide microarray containing the entire influenza A virus [A/California/08/2009(H1N1)] proteome and haemagglutinin proteins from 12 other influenza A subtypes, including the haemagglutinin from the [A/South Carolina/1/1918(H1N1)] strain, was used to gauge serum IgG epitope signatures before and after Pandemrix® vaccination or H1N1 infection in a Swedish cohort during the pandemic influenza season 2009. A very narrow pattern of pandemic flu-specific IgG epitope recognition was observed in the serum from individuals who later contracted H1N1 infection. Moreover, the pandemic influenza infection generated IgG reactivity to two adjacent epitopes of the neuraminidase protein. The differential serum IgG recognition was focused on haemagglutinin 1 (H1) and restricted to classical antigenic sites (Cb) in both the vaccinated controls and individuals with flu infections. We further identified a novel epitope VEPGDKITFEATGNL on the Ca antigenic site (251–265) of the pandemic flu haemagglutinin, which was exclusively recognized in serum from individuals with previous vaccinations and never in serum from individuals with H1N1 infection (confirmed by RNA PCR analysis from nasal swabs). This epitope was mapped to the receptor-binding domain of the influenza haemagglutinin and could serve as a correlate of immune protection in the context of pandemic flu. The study shows that unbiased epitope mapping using peptide microarray technology leads to the identification of biologically and clinically relevant target structures. Most significantly an H1N1 infection induced a different footprint of IgG epitope recognition patterns compared with the pandemic H1N1 vaccine. PMID:25639813
Li, Xiao-jun; Yi, Eugene C; Kemp, Christopher J; Zhang, Hui; Aebersold, Ruedi
2005-09-01
There is an increasing interest in the quantitative proteomic measurement of the protein contents of substantially similar biological samples, e.g. for the analysis of cellular response to perturbations over time or for the discovery of protein biomarkers from clinical samples. Technical limitations of current proteomic platforms such as limited reproducibility and low throughput make this a challenging task. A new LC-MS-based platform is able to generate complex peptide patterns from the analysis of proteolyzed protein samples at high throughput and represents a promising approach for quantitative proteomics. A crucial component of the LC-MS approach is the accurate evaluation of the abundance of detected peptides over many samples and the identification of peptide features that can stratify samples with respect to their genetic, physiological, or environmental origins. We present here a new software suite, SpecArray, that generates a peptide versus sample array from a set of LC-MS data. A peptide array stores the relative abundance of thousands of peptide features in many samples and is in a format identical to that of a gene expression microarray. A peptide array can be subjected to an unsupervised clustering analysis to stratify samples or to a discriminant analysis to identify discriminatory peptide features. We applied the SpecArray to analyze two sets of LC-MS data: one was from four repeat LC-MS analyses of the same glycopeptide sample, and another was from LC-MS analysis of serum samples of five male and five female mice. We demonstrate through these two study cases that the SpecArray software suite can serve as an effective software platform in the LC-MS approach for quantitative proteomics.
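The peptide-versus-sample array described above can be pictured as a simple pivot of detected features into a matrix. The following is only a minimal sketch, not SpecArray's implementation: the record layout, the feature identifiers, the function name `build_peptide_array` and the choice to fill undetected features with 0.0 are all assumptions made for illustration, and the sketch presumes that features have already been matched across runs.

```python
# Hypothetical LC-MS feature records: (sample, peptide feature id, abundance).
# All names and values are invented for illustration.
features = [
    ("male_1", "pep_001", 120.0),
    ("male_1", "pep_002", 40.0),
    ("female_1", "pep_001", 60.0),
    ("female_1", "pep_003", 15.0),
]


def build_peptide_array(features, missing=0.0):
    """Pivot feature records into a peptide-by-sample matrix.

    Rows are peptide features, columns are samples; a feature that was
    not detected in a sample receives the `missing` fill value (an
    assumption of this sketch).
    """
    samples = sorted({sample for sample, _, _ in features})
    peptides = sorted({peptide for _, peptide, _ in features})
    lookup = {(peptide, sample): abundance for sample, peptide, abundance in features}
    matrix = [
        [lookup.get((peptide, sample), missing) for sample in samples]
        for peptide in peptides
    ]
    return peptides, samples, matrix


peptides, samples, matrix = build_peptide_array(features)
for peptide, row in zip(peptides, matrix):
    print(peptide, row)
```

A matrix in this layout has the same shape as a gene expression table, which is why clustering or discriminant analysis written for microarray data can be applied to it.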
Next-Generation Technologies for Multiomics Approaches Including Interactome Sequencing
Ohashi, Hiroyuki; Miyamoto-Sato, Etsuko
2015-01-01
The development of high-speed analytical techniques such as next-generation sequencing and microarrays allows high-throughput analysis of biological information at a low cost. These techniques contribute to medical and bioscience advancements and provide new avenues for scientific research. Here, we outline a variety of new innovative techniques and discuss their use in omics research (e.g., genomics, transcriptomics, metabolomics, proteomics, and interactomics). We also discuss the possible applications of these methods, including an interactome sequencing technology that we developed, in future medical and life science research. PMID:25649523
Mokada-Gopal, Lavanya; Boeser, Alexander; Lehmann, Christian H K; Drepper, Friedel; Dudziak, Diana; Warscheid, Bettina; Voehringer, David
2017-05-01
The transcription factor STAT6 plays a key role in mediating signaling downstream of the receptors for IL-4 and IL-13. In B cells, STAT6 is required for class switch recombination to IgE and for germinal center formation during type 2 immune responses directed against allergens or helminths. In this study, we compared the transcriptomes and proteomes of primary mouse B cells from wild-type and STAT6-deficient mice cultured for 4 d in the presence or absence of IL-4. Microarray analysis revealed that 214 mRNAs were upregulated and 149 were downregulated >3-fold by IL-4 in a STAT6-dependent manner. Across all samples, ∼5000 proteins were identified by label-free quantitative liquid chromatography/mass spectrometry. A total of 149 proteins was found to be differentially expressed >3-fold between IL-4-stimulated wild-type and STAT6 -/- B cells (75 upregulated and 74 downregulated). Comparative analysis of the proteome and transcriptome revealed that expression of these proteins was mainly regulated at the transcriptional level, which argues against a major role for posttranscriptional mechanisms that modulate the STAT6-dependent proteome. Nine proteins were selected for confirmation by flow cytometry or Western blot. We show that CD30, CD79b, SLP-76, DEC205, IL-5Rα, STAT5, and Thy1 are induced by IL-4 in a STAT6-dependent manner. In contrast, Syk and Fc receptor-like 1 were downregulated. This dataset provides a framework for further functional analysis of newly identified IL-4-regulated proteins in B cells that may contribute to germinal center formation and IgE switching in type 2 immunity. Copyright © 2017 by The American Association of Immunologists, Inc.
2014-01-01
Background Induced resistance (IR) can be part of a sustainable plant protection strategy against important plant diseases. β-aminobutyric acid (BABA) can induce resistance in a wide range of plants against several types of pathogens, including potato infected with Phytophthora infestans. However, the molecular mechanisms behind this are unclear and seem to be dependent on the system studied. To elucidate the defence responses activated by BABA in potato, a genome-wide transcript microarray analysis in combination with label-free quantitative proteomics analysis of the apoplast secretome were performed two days after treatment of the leaf canopy with BABA at two concentrations, 1 and 10 mM. Results Over 5000 transcripts were differentially expressed and over 90 secretome proteins changed in abundance indicating a massive activation of defence mechanisms with 10 mM BABA, the concentration effective against late blight disease. To aid analysis, we present a more comprehensive functional annotation of the microarray probes and gene models by retrieving information from orthologous gene families across 26 sequenced plant genomes. The new annotation provided GO terms to 8616 previously un-annotated probes. Conclusions BABA at 10 mM affected several processes related to plant hormones and amino acid metabolism. A major accumulation of PR proteins was also evident, and in the mevalonate pathway, genes involved in sterol biosynthesis were down-regulated, whereas several enzymes involved in the sesquiterpene phytoalexin biosynthesis were up-regulated. Interestingly, abscisic acid (ABA) responsive genes were not as clearly regulated by BABA in potato as previously reported in Arabidopsis. Together these findings provide candidates and markers for improved resistance in potato, one of the most important crops in the world. PMID:24773703
Cox, Brian; Sharma, Parveen; Evangelou, Andreas I; Whiteley, Kathie; Ignatchenko, Vladimir; Ignatchenko, Alex; Baczyk, Dora; Czikk, Marie; Kingdom, John; Rossant, Janet; Gramolini, Anthony O; Adamson, S Lee; Kislinger, Thomas
2011-12-01
Preeclampsia (PE) adversely impacts ~5% of pregnancies. Despite extensive research, no consistent biomarkers or cures have emerged, suggesting that different molecular mechanisms may cause clinically similar disease. To address this, we undertook a proteomics study with three main goals: (1) to identify a panel of cell surface markers that distinguish the trophoblast and endothelial cells of the placenta in the mouse; (2) to translate this marker set to human via the Human Protein Atlas database; and (3) to utilize the validated human trophoblast markers to identify subgroups of human preeclampsia. To achieve these goals, plasma membrane proteins at the blood tissue interfaces were extracted from placentas using intravascular silica-bead perfusion, and then identified using shotgun proteomics. We identified 1181 plasma membrane proteins, of which 171 were enriched at the maternal blood-trophoblast interface and 192 at the fetal endothelial interface with a 70% conservation of expression in humans. Three distinct molecular subgroups of human preeclampsia were identified in existing human microarray data by using expression patterns of trophoblast-enriched proteins. Analysis of all misexpressed genes revealed divergent dysfunctions including angiogenesis (subgroup 1), MAPK signaling (subgroup 2), and hormone biosynthesis and metabolism (subgroup 3). Subgroup 2 lacked expected changes in known preeclampsia markers (sFLT1, sENG) and uniquely overexpressed GNA12. In an independent set of 40 banked placental specimens, GNA12 was overexpressed during preeclampsia when co-incident with chronic hypertension. In the current study we used a novel translational analysis to integrate mouse and human trophoblast protein expression with human microarray data. This strategy identified distinct molecular pathologies in human preeclampsia. 
We conclude that clinically similar preeclampsia patients exhibit divergent placental gene expression profiles thus implicating divergent molecular mechanisms in the origins of this disease.
Hueber, Wolfgang; Tomooka, Beren H; Zhao, Xiaoyan; Kidd, Brian A; Drijfhout, Jan W; Fries, James F; van Venrooij, Walther J; Metzger, Allan L; Genovese, Mark C; Robinson, William H
2007-01-01
Objectives To identify peripheral blood autoantibody and cytokine profiles that characterise clinically relevant subgroups of patients with early rheumatoid arthritis using arthritis antigen microarrays and a multiplex cytokine assay. Methods Serum samples from 56 patients with a diagnosis of rheumatoid arthritis of <6 months' duration were tested. Cytokine profiles were also determined in samples from patients with psoriatic arthritis (PsA) and ankylosing spondylitis (n = 21), and from healthy individuals (n = 19). Data were analysed using Kruskal–Wallis test with Dunn's adjustment for multiple comparisons, linear correlation tests, significance analysis of microarrays (SAM) and hierarchical clustering software. Results Distinct antibody profiles were associated with subgroups of patients who exhibited high serum levels of tumour necrosis factor (TNF)α, interleukin (IL)1β, IL6, IL13, IL15 and granulocyte macrophage colony‐stimulating factor. Significantly increased autoantibody reactivity against citrullinated epitopes was observed in patients within the cytokine “high” subgroup. Increased levels of TNFα, IL1α, IL12p40 and IL13, and the chemokines eotaxin/CCL11, monocyte chemoattractant protein‐1 and interferon‐inducible protein 10, were present in early rheumatoid arthritis as compared with controls (p<0.001). Chemokines showed some of the most impressive differences. Only IL8/CXCL8 concentrations were higher in patients with PsA/ankylosing spondylitis (p = 0.02). Conclusions Increased blood levels of proinflammatory cytokines are associated with autoantibody targeting of citrullinated antigens and surrogate markers of disease activity in patients with early rheumatoid arthritis. Proteomic analysis of serum autoantibodies, cytokines and chemokines enables stratification of patients with early rheumatoid arthritis into molecular subgroups. PMID:16901957
Identifier mapping performance for integrating transcriptomics and proteomics experimental results
2011-01-01
Background Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. Results We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. Conclusions The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging. PMID:21619611
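The comparison of retrieval sets between mapping resources can be illustrated with a set-overlap score. This is a minimal sketch under stated assumptions: the abstract does not say which discrepancy measure was used, so Jaccard similarity is a stand-in chosen here, and the accessions, probeset IDs and function names are hypothetical.

```python
from itertools import combinations

# Hypothetical retrieval sets: for each resource, accession -> probeset IDs.
# Accessions and probeset IDs are invented for illustration.
mappings = {
    "DAVID": {"P00001": {"1001_at", "1002_at"}, "P00002": {"2001_at"}},
    "EnVision": {"P00001": {"1001_at"}, "P00002": {"2001_at", "2002_at"}},
    "NetAffx": {"P00001": {"1001_at", "1002_at"}, "P00002": set()},
}


def jaccard(a, b):
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def pairwise_agreement(mappings):
    """Mean Jaccard similarity over all accessions for each pair of resources."""
    result = {}
    for first, second in combinations(sorted(mappings), 2):
        accessions = set(mappings[first]) | set(mappings[second])
        scores = [
            jaccard(mappings[first].get(acc, set()), mappings[second].get(acc, set()))
            for acc in accessions
        ]
        result[(first, second)] = sum(scores) / len(scores)
    return result


agreement = pairwise_agreement(mappings)
for pair, score in agreement.items():
    print(pair, round(score, 3))
```

A low score for a pair of resources flags accessions whose probeset matches deserve a closer look, for example by checking the proteome-transcriptome correlation of each mapped pair as the abstract describes.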
Deciphering the Function of New Gonococcal Vaccine Antigens Using Phenotypic Microarrays
Baarda, Benjamin I.; Emerson, Sarah; Proteau, Philip J.
2017-01-01
ABSTRACT The function and extracellular location of cell envelope proteins make them attractive candidates for developing vaccines against bacterial diseases, including challenging drug-resistant pathogens, such as Neisseria gonorrhoeae. A proteomics-driven reverse vaccinology approach has delivered multiple gonorrhea vaccine candidates; however, the biological functions of many of them remain to be elucidated. Herein, the functions of six gonorrhea vaccine candidates—NGO2121, NGO1985, NGO2054, NGO2111, NGO1205, and NGO1344—in cell envelope homeostasis were probed using phenotype microarrays under 1,056 conditions and a ΔbamE mutant (Δngo1780) as a reference of perturbed outer membrane integrity. Optimal growth conditions for an N. gonorrhoeae phenotype microarray assay in defined liquid medium were developed, which can be useful in other applications, including rapid and thorough antimicrobial susceptibility assessment. Our studies revealed 91 conditions having uniquely positive or negative effects on one of the examined mutants. A cluster analysis of 37 and 57 commonly beneficial and detrimental compounds, respectively, revealed three separate phenotype groups: NGO2121 and NGO1985; NGO1344 and BamE; and the trio of NGO1205, NGO2111, and NGO2054, with the last protein forming an independent branch of this cluster. Similar phenotypes were associated with loss of these vaccine candidates in the highly antibiotic-resistant WHO X strain. Based on their extensive sensitivity phenomes, NGO1985 and NGO2121 appear to be the most promising vaccine candidates. This study establishes the principle that phenotype microarrays can be successfully applied to a fastidious bacterial organism, such as N. gonorrhoeae. IMPORTANCE Innovative approaches are required to develop vaccines against prevalent and neglected sexually transmitted infections, such as gonorrhea. 
Herein, we have utilized phenotype microarrays in the first such investigation into Neisseria gonorrhoeae to probe the function of proteome-derived vaccine candidates in cell envelope homeostasis. Information gained from this screening can feed the vaccine candidate decision tree by providing insights into the roles these proteins play in membrane permeability, integrity, and overall N. gonorrhoeae physiology. The optimized screening protocol can be applied in investigations into the function of other hypothetical proteins of N. gonorrhoeae discovered in the expanding number of whole-genome sequences, in addition to revealing phenotypic differences between clinical and laboratory strains. PMID:28630127
Microintaglio Printing for Soft Lithography-Based in Situ Microarrays
Biyani, Manish; Ichiki, Takanori
2015-01-01
Advances in lithographic approaches to fabricating bio-microarrays have been extensively explored over the last two decades. However, the need for pattern flexibility, a high density, a high resolution, affordability and on-demand fabrication is promoting the development of unconventional routes for microarray fabrication. This review highlights the development and uses of a new molecular lithography approach, called “microintaglio printing technology”, for large-scale bio-microarray fabrication using a microreactor array (µRA)-based chip consisting of uniformly-arranged, femtoliter-size µRA molds. In this method, a single-molecule-amplified DNA microarray pattern is self-assembled onto a µRA mold and subsequently converted into a messenger RNA or protein microarray pattern by simultaneously producing and transferring (immobilizing) a messenger RNA or a protein from a µRA mold to a glass surface. Microintaglio printing allows the self-assembly and patterning of in situ-synthesized biomolecules into high-density (kilo-giga-density), ordered arrays on a chip surface with µm-order precision. This holistic aim, which is difficult to achieve using conventional printing and microarray approaches, is expected to revolutionize and reshape proteomics. This review is not written comprehensively, but rather substantively, highlighting the versatility of microintaglio printing for developing a prerequisite platform for microarray technology for the postgenomic era. PMID:27600226
Microarray, proteomic, and metabonomic technologies are becoming increasingly accessible as tools for ecotoxicology research. Effective use of these technologies will depend, at least in part, on the ability to apply these techniques within a paradigm of hypothesis driven researc...
Molony, Ryan D.; Rice, James M.; Yuk, Jongseol; Shetty, Vivek; Dey, Dipak; Lawrence, David A.; Lynes, Michael A.
2012-01-01
Biological indicators have numerous and widespread utility in personalized medicine, but the measurement of these indicators also poses many technological and practical challenges. Blood/plasma has typically been used as the sample source with which to measure these indicators, but the invasiveness associated with procurement of samples has led to increased interest in saliva as an attractive alternative. However, there are unique issues associated with the measurement of saliva biomarkers. These issues are compounded by the imperfect correlation between saliva and plasma with respect to biomarker profiles. In this manuscript, we address the technical challenges associated with saliva biomarker quantification and describe a high-content microarray assay that employs both grating-coupled surface plasmon resonance imaging and surface plasmon coupled emission modalities in a highly sensitive assay that has a large dynamic range. This powerful approach provides the tools to map the proteome of saliva, which in turn should greatly enhance the utility of salivary biomarker profiles in personalized medicine. PMID:22896008
Parallel mRNA, proteomics and miRNA expression analysis in cell line models of the intestine.
O'Sullivan, Finbarr; Keenan, Joanne; Aherne, Sinead; O'Neill, Fiona; Clarke, Colin; Henry, Michael; Meleady, Paula; Breen, Laura; Barron, Niall; Clynes, Martin; Horgan, Karina; Doolan, Padraig; Murphy, Richard
2017-11-07
To identify miRNA-regulated proteins differentially expressed between Caco2 and HT-29: two principal cell line models of the intestine. Exponentially growing Caco-2 and HT-29 cells were harvested and prepared for mRNA, miRNA and proteomic profiling. mRNA microarray profiling analysis was carried out using the Affymetrix GeneChip Human Gene 1.0 ST array. miRNA microarray profiling analysis was carried out using the Affymetrix Genechip miRNA 3.0 array. Quantitative Label-free LC-MS/MS proteomic analysis was performed using a Dionex Ultimate 3000 RSLCnano system coupled to a hybrid linear ion trap/Orbitrap mass spectrometer. Peptide identities were validated in Proteome Discoverer 2.1 and were subsequently imported into Progenesis QI software for further analysis. Hierarchical cluster analysis for all three parallel datasets (miRNA, proteomics, mRNA) was conducted in the R software environment using the Euclidean distance measure and Ward's clustering algorithm. The prediction of miRNA and oppositely correlated protein/mRNA interactions was performed using TargetScan 6.1. GO biological process, molecular function and cellular component enrichment analysis was carried out for the DE miRNA, protein and mRNA lists via the Pathway Studio 11.3 Web interface using their Mammalian database. Differential expression (DE) profiling comparing the intestinal cell lines HT-29 and Caco-2 identified 1795 Genes, 168 Proteins and 160 miRNAs as DE between the two cell lines. At the gene level, 1084 genes were upregulated and 711 were downregulated in the Caco-2 cell line relative to the HT-29 cell line. At the protein level, 57 proteins were found to be upregulated and 111 downregulated in the Caco-2 cell line relative to the HT-29 cell line. Finally, at the miRNAs level, 104 were upregulated and 56 downregulated in the Caco-2 cell line relative to the HT-29 cell line. 
Gene ontology (GO) analysis of the DE mRNA identified cell adhesion, migration and ECM organization, cellular lipid and cholesterol metabolic processes, small molecule transport and a range of responses to external stimuli, while similar analysis of the DE protein list identified gene expression/transcription, epigenetic mechanisms, DNA replication, differentiation and translation ontology categories. The DE protein and gene lists were found to share 15 biological processes including for example epithelial cell differentiation [P value ≤ 1.81613E-08 (protein list); P ≤ 0.000434311 (gene list)] and actin filament bundle assembly [P value ≤ 0.001582797 (protein list); P ≤ 0.002733714 (gene list)]. Analysis of the three data streams acquired in parallel, conducted to identify targets undergoing potential miRNA translational repression, identified 34 proteins whose respective mRNAs were detected but showed no change in expression. Of these 34 proteins, 27 were downregulated in the Caco-2 cell line relative to the HT-29 cell line and predicted to be targeted by 19 unique anti-correlated/upregulated microRNAs, and 7 were upregulated in the Caco-2 cell line relative to the HT-29 cell line and predicted to be targeted by 15 unique anti-correlated/downregulated microRNAs. This first study providing "tri-omics" analysis of the principal intestinal cell line models Caco-2 and HT-29 has identified 34 proteins potentially undergoing miRNA translational repression.
Parallel mRNA, proteomics and miRNA expression analysis in cell line models of the intestine
O’Sullivan, Finbarr; Keenan, Joanne; Aherne, Sinead; O’Neill, Fiona; Clarke, Colin; Henry, Michael; Meleady, Paula; Breen, Laura; Barron, Niall; Clynes, Martin; Horgan, Karina; Doolan, Padraig; Murphy, Richard
2017-01-01
AIM To identify miRNA-regulated proteins differentially expressed between Caco2 and HT-29: two principal cell line models of the intestine. METHODS Exponentially growing Caco-2 and HT-29 cells were harvested and prepared for mRNA, miRNA and proteomic profiling. mRNA microarray profiling analysis was carried out using the Affymetrix GeneChip Human Gene 1.0 ST array. miRNA microarray profiling analysis was carried out using the Affymetrix Genechip miRNA 3.0 array. Quantitative Label-free LC-MS/MS proteomic analysis was performed using a Dionex Ultimate 3000 RSLCnano system coupled to a hybrid linear ion trap/Orbitrap mass spectrometer. Peptide identities were validated in Proteome Discoverer 2.1 and were subsequently imported into Progenesis QI software for further analysis. Hierarchical cluster analysis for all three parallel datasets (miRNA, proteomics, mRNA) was conducted in the R software environment using the Euclidean distance measure and Ward’s clustering algorithm. The prediction of miRNA and oppositely correlated protein/mRNA interactions was performed using TargetScan 6.1. GO biological process, molecular function and cellular component enrichment analysis was carried out for the DE miRNA, protein and mRNA lists via the Pathway Studio 11.3 Web interface using their Mammalian database. RESULTS Differential expression (DE) profiling comparing the intestinal cell lines HT-29 and Caco-2 identified 1795 Genes, 168 Proteins and 160 miRNAs as DE between the two cell lines. At the gene level, 1084 genes were upregulated and 711 were downregulated in the Caco-2 cell line relative to the HT-29 cell line. At the protein level, 57 proteins were found to be upregulated and 111 downregulated in the Caco-2 cell line relative to the HT-29 cell line. Finally, at the miRNAs level, 104 were upregulated and 56 downregulated in the Caco-2 cell line relative to the HT-29 cell line. 
Gene ontology (GO) analysis of the DE mRNA identified cell adhesion, migration and ECM organization, cellular lipid and cholesterol metabolic processes, small molecule transport and a range of responses to external stimuli, while similar analysis of the DE protein list identified gene expression/transcription, epigenetic mechanisms, DNA replication, differentiation and translation ontology categories. The DE protein and gene lists were found to share 15 biological processes including for example epithelial cell differentiation [P value ≤ 1.81613E-08 (protein list); P ≤ 0.000434311 (gene list)] and actin filament bundle assembly [P value ≤ 0.001582797 (protein list); P ≤ 0.002733714 (gene list)]. Analysis of the three data streams acquired in parallel, conducted to identify targets undergoing potential miRNA translational repression, identified 34 proteins whose respective mRNAs were detected but showed no change in expression. Of these 34 proteins, 27 were downregulated in the Caco-2 cell line relative to the HT-29 cell line and predicted to be targeted by 19 unique anti-correlated/upregulated microRNAs, and 7 were upregulated in the Caco-2 cell line relative to the HT-29 cell line and predicted to be targeted by 15 unique anti-correlated/downregulated microRNAs. CONCLUSION This first study providing “tri-omics” analysis of the principal intestinal cell line models Caco-2 and HT-29 has identified 34 proteins potentially undergoing miRNA translational repression. PMID:29151691
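The filtering logic behind the 34 candidate proteins (protein changed, mRNA detected but unchanged, predicted miRNA regulated in the opposite direction) can be sketched as below. This is an illustrative reading of the abstract, not the authors' pipeline: the protein and miRNA names, the "up"/"down"/"unchanged" calls, the target table (which in the study came from TargetScan 6.1) and the function name are all hypothetical.

```python
# Hypothetical regulation calls in Caco-2 relative to HT-29.
protein_change = {"PROT_A": "down", "PROT_B": "up", "PROT_C": "down"}
mrna_change = {"PROT_A": "unchanged", "PROT_B": "unchanged", "PROT_C": "down"}
mirna_change = {"miR-x": "up", "miR-y": "down", "miR-z": "up"}
# Hypothetical predicted miRNA -> target table.
predicted_targets = {
    "miR-x": {"PROT_A", "PROT_C"},
    "miR-y": {"PROT_B"},
    "miR-z": {"PROT_B"},
}

OPPOSITE = {"up": "down", "down": "up"}


def repression_candidates(protein_change, mrna_change, mirna_change, predicted_targets):
    """Return proteins that changed while their mRNA was detected but unchanged,
    together with the oppositely regulated miRNAs predicted to target them."""
    candidates = {}
    for protein, direction in protein_change.items():
        if direction not in OPPOSITE:
            continue  # protein itself is not differentially expressed
        if mrna_change.get(protein) != "unchanged":
            continue  # mRNA not detected, or mRNA changed as well
        mirnas = sorted(
            mirna
            for mirna, targets in predicted_targets.items()
            if protein in targets and mirna_change.get(mirna) == OPPOSITE[direction]
        )
        if mirnas:
            candidates[protein] = mirnas
    return candidates


print(repression_candidates(protein_change, mrna_change, mirna_change, predicted_targets))
```

Requiring the mRNA to be present but unchanged is what separates a possible translational effect from ordinary transcriptional regulation, which is why PROT_C is excluded in this toy input.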
Loch, Christian M; Strickler, James E
2012-11-01
Substrate ubiquitylation is a reversible process critical to cellular homeostasis that is often dysregulated in many human pathologies including cancer and neurodegeneration. Elucidating the mechanistic details of this pathway could unlock a large store of information useful to the design of diagnostic and therapeutic interventions. Proteomic approaches to the questions at hand have generally utilized mass spectrometry (MS), which has been successful in identifying both ubiquitylation substrates and profiling pan-cellular chain linkages, but is generally unable to connect the two. Interacting partners of the deubiquitylating enzymes (DUBs) have also been reported by MS, although substrates of catalytically competent DUBs generally cannot be. Where they have been used towards the study of ubiquitylation, protein microarrays have usually functioned as platforms for the identification of substrates for specific E3 ubiquitin ligases. Here, we report on the first use of protein microarrays to identify substrates of DUBs, and in so doing demonstrate the first example of microarray proteomics involving multiple (i.e., distinct, sequential and opposing) enzymatic activities. This technique demonstrates the selectivity of DUBs for both substrate and type (mono- versus poly-) of ubiquitylation. This work shows that the vast majority of DUBs are monoubiquitylated in vitro, and are incapable of removing this modification from themselves. This work also underscores the critical role of utilizing both ubiquitin chains and substrates when attempting to characterize DUBs. This article is part of a Special Issue entitled: Ubiquitin Drug Discovery and Diagnostics. Copyright © 2012 Elsevier B.V. All rights reserved.
Privacy Preserving PCA on Distributed Bioinformatics Datasets
ERIC Educational Resources Information Center
Li, Xin
2011-01-01
In recent years, new bioinformatics technologies, such as gene expression microarray, genome-wide association study, proteomics, and metabolomics, have been widely used to simultaneously identify a huge number of human genomic/genetic biomarkers, generate a tremendously large amount of data, and dramatically increase the knowledge on human…
Multi-omic profiling to assess the effect of iron starvation in Streptococcus pneumoniae TIGR4
Jiménez-Munguía, Irene; Calderón-Santiago, Mónica; Rodríguez-Franco, Antonio; Priego-Capote, Feliciano
2018-01-01
We applied multi-omics approaches (transcriptomics, proteomics and metabolomics) to study the effect of iron starvation on the Gram-positive human pathogen Streptococcus pneumoniae to elucidate global changes in the bacterium in a condition similar to what can be found in the host during an infectious episode. We treated the reference strain TIGR4 with the iron chelator deferoxamine mesylate. DNA microarrays revealed changes in the expression of operons involved in multiple biological processes, with a prevalence of genes coding for ion binding proteins. We also studied the changes in protein abundance by 2-DE followed by MALDI-TOF/TOF analysis of total cell extracts and secretome fractions. The main proteomic changes were found in proteins related to the primary and amino sugar metabolism, especially in enzymes with divalent cations as cofactors. Finally, the metabolomic analysis of intracellular metabolites showed altered levels of amino sugars involved in the cell wall peptidoglycan metabolism. This work shows the utility of multi-perspective studies that can provide complementary results for the comprehension of how a given condition can influence global physiological changes in microorganisms.
Zhang, Bing; Schmoyer, Denise; Kirov, Stefan; Snoddy, Jay
2004-01-01
Background Microarray and other high-throughput technologies are producing large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in the gene sets. Results We have created a web-based tool for data analysis and data visualization for sets of genes called GOTree Machine (GOTM). This tool was originally intended to analyze sets of co-regulated genes identified from microarray analysis but is adaptable for use with other gene sets from other high-throughput analyses. GOTree Machine generates a GOTree, a tree-like structure to navigate the Gene Ontology Directed Acyclic Graph for input gene sets. This system provides user friendly data navigation and visualization. Statistical analysis helps users to identify the most important Gene Ontology categories for the input gene sets and suggests biological areas that warrant further study. GOTree Machine is available online at . Conclusion GOTree Machine has a broad application in functional genomic, proteomic and other high-throughput methods that generate large sets of interesting genes; its primary purpose is to help users sort for interesting patterns in gene sets. PMID:14975175
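The kind of statistical analysis that ranks Gene Ontology categories for a gene set can be illustrated with an over-representation test. The abstract does not state which test GOTree Machine uses, so the hypergeometric tail probability below is an assumption chosen as a common approach, and the function name and the example counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """Probability of seeing at least `hits` annotated genes in a gene set.

    population: number of genes in the reference set
    annotated:  reference genes annotated to the GO category
    sample:     size of the input gene set
    hits:       input genes annotated to the GO category
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Toy numbers: 20 reference genes, 5 in the category, 4 input genes, 2 hits.
p_value = hypergeom_enrichment_p(population=20, annotated=5, sample=4, hits=2)
print(round(p_value, 4))
```

In practice one such p-value is computed per GO category, so some correction for multiple testing would normally follow; that step is left out of this sketch.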
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haab, Brian B.; Geierstanger, Bernhard H.; Michailidis, George
2005-08-01
Four different immunoassay and antibody microarray methods performed at four different sites were used to measure the levels of a broad range of proteins (N = 323 assays; 39, 88, 168, and 28 assays at the respective sites; 237 unique analytes) in the human serum and plasma reference specimens distributed by the Plasma Proteome Project (PPP) of the HUPO. The methods provided a means to (1) assess the level of systematic variation in protein abundances associated with blood preparation methods (serum, citrate-anticoagulated-plasma, EDTA-anticoagulated-plasma, or heparin-anticoagulated-plasma) and (2) evaluate the dependence on concentration of MS-based protein identifications from data sets using the HUPO specimens. Some proteins, particularly cytokines, had highly variable concentrations between the different sample preparations, suggesting specific effects of certain anticoagulants on the stability or availability of these proteins. The linkage of antibody-based measurements from 66 different analytes with the combined MS/MS data from 18 different laboratories showed that protein detection and the quality of MS data increased with analyte concentration. The conclusions from these initial analyses are that the optimal blood preparation method is variable between analytes and that the discovery of blood proteins by MS can be extended to concentrations below the ng/mL range under certain circumstances. Continued developments in antibody-based methods will further advance the scientific goals of the PPP.
Ragno, Silvia; Romano, Maria; Howell, Steven; Pappin, Darryl J C; Jenner, Peter J; Colston, Michael J
2001-01-01
We investigated the changes which occur in gene expression in the human macrophage cell line, THP1, at 1, 6 and 12 hr following infection with Mycobacterium tuberculosis. The analysis was carried out at the transcriptome level, using microarrays consisting of 375 human genes generally thought to be involved in immunoregulation, and at the proteomic level, using two-dimensional gel electrophoresis and mass spectrometry. The analysis of the transcriptome using microarrays revealed that many genes were up-regulated at 6 and 12 hr. Most of these genes encoded proteins involved in cell migration and homing, including the chemokines interleukin (IL)-8, osteopontin, monocyte chemotactic protein-1 (MCP-1), macrophage inflammatory protein-1α (MIP-1α), regulated on activation, normal, T-cell expressed and secreted (RANTES), MIP-1β, MIP-3α, myeloid progenitor inhibitory factor-1 (MPIF-1), pulmonary and activation regulated chemokine (PARC), growth regulated gene-β (GRO-β), GRO-γ, MCP-2, I-309, and the T helper 2 (Th2) and eosinophil-attracting chemokine, eotaxin. Other genes involved in cell migration which were up-regulated included the matrix metalloproteinase MMP-9, vascular endothelial growth factor (VEGF) and its receptor Flk-1, the chemokine receptor CCR3, and the cell adhesion molecules vascular cell adhesion molecule-1 (VCAM-1) and integrin α3. In addition to the chemokine response, genes encoding the proinflammatory cytokines IL-1β (showing a 433-fold induction), IL-2 and tumour necrosis factor-α (TNF-α), were also found to be induced at 6 and/or 12 hr. It was more difficult to detect changes using the proteomic approach. Nevertheless, IL-1β was again shown to be strongly up-regulated. The enzyme manganese superoxide dismutase was also found to be strongly up-regulated; this enzyme was found to be macrophage-, rather than M. tuberculosis, derived. The heat-shock protein hsp27 was found to be down-regulated following infection. 
We also identified a mycobacterial protein, the product of the atpD gene (thought to be involved in the regulation of cytoplasmic pH) in the infected macrophage extracts. PMID:11576227
Adeola, Henry A.; Smith, Muneerah; Kaestner, Lisa; Blackburn, Jonathan M.; Zerbini, Luiz F.
2016-01-01
There is a growing need for high throughput diagnostic tools for early diagnosis and treatment monitoring of prostate cancer (PCa) in Africa. The role of cancer-testis antigens (CTAs) in PCa in men of African descent is poorly researched. Hence, we aimed to elucidate the role of 123 Tumour Associated Antigens (TAAs) using an antigen microarray platform in blood samples (N = 67) from a South African PCa, benign prostatic hyperplasia (BPH) and disease control (DC) cohort. Linear (fold-over-cutoff) and differential expression quantitation of autoantibody signal intensities were performed. Molecular signatures of candidate PCa antigen biomarkers were identified and analyzed for ethnic group variation. Potential cancer diagnostic and immunotherapeutic inferences were drawn. We identified a total of 41 potential diagnostic/therapeutic antigen biomarkers for PCa. By linear quantitation, four antigens, GAGE1, ROPN1, SPANXA1 and PRKCZ, were found to have higher autoantibody titres in PCa serum as compared with BPH, where MAGEB1 and PRKCZ were highly expressed. Also, p53 S15A and p53 S46A were found highly expressed in the disease control group. Statistical analysis by differential expression revealed twenty-four antigens as upregulated in PCa samples, while 11 were downregulated in comparison to BPH and DC (FDR = 0.01). FGFR2, COL6A1 and CALM1 were verifiable biomarkers of PCa analysis using urinary shotgun proteomics. Functional pathway annotation of identified biomarkers revealed similar enrichment both at genomic and proteomic level and ethnic variations were observed. Cancer antigen arrays are emerging as useful tools for diagnostic and immunotherapeutic antigen biomarker discovery. PMID:26885621
NCBI GEO: mining millions of expression profiles--database and tools.
Barrett, Tanya; Suzek, Tugba O; Troup, Dennis B; Wilhite, Stephen E; Ngau, Wing-Chi; Ledoux, Pierre; Rudnev, Dmitry; Lash, Alex E; Fujibuchi, Wataru; Edgar, Ron
2005-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest fully public repository for high-throughput molecular abundance data, primarily gene expression data. The database has a flexible and open design that allows the submission, storage and retrieval of many data types. These data include microarray-based experiments measuring the abundance of mRNA, genomic DNA and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. GEO currently holds over 30,000 submissions representing approximately half a billion individual molecular abundance measurements, for over 100 organisms. Here, we describe recent database developments that facilitate effective mining and visualization of these data. Features are provided to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Kottom, Theodore J; Limper, Andrew H
2011-10-01
Pneumocystis carinii (Pc) undergoes morphological transitions between cysts and trophic forms. We have previously described two Pc serine/threonine kinases, termed PcCbk1 and PcSte20, with PcSte20 belonging to a family of kinases involved in yeast mating, while PcCbk1 is a member of a group of protein kinases involved in regulation of cell cycle, shape, and proliferation. As Pc remains genetically intractable, knowledge on specific substrates phosphorylated by these kinases remains limited. Utilizing the phylogenetic relatedness of Pc to Saccharomyces cerevisiae, we interrogated a yeast proteome microarray containing >4000 purified protein based peptides, leading to the identification of 18 potential PcCbk1 and 15 PcSte20 substrates (Z-score > 3.0). A number of these potential protein substrates are involved in bud site selection, polarized growth, and response to mating α factor and pseudohyphal and invasive growth. Full-length open reading frames suggested by the PcCbk1 and PcSte20 protoarrays were amplified and expressed. These five proteins were used as substrates for PcCbk1 or PcSte20, with each being highly phosphorylated by the respective kinase. Finally, to demonstrate the utility of this method to identify novel PcCbk1 and PcSte20 substrates, we analysed DNA sequence data from the partially complete Pc genome database and detected partial sequence information of potential PcCbk1 kinase substrates PcPxl1 and PcInt1. We additionally identified the potential PcSte20 kinase substrate PcBdf2. Full-length Pc substrates were cloned and expressed in yeast, and shown to be phosphorylated by the respective Pc kinases. In conclusion, the yeast protein microarray represents a novel crossover technique for identifying unique potential Pc kinase substrates. Copyright © 2011 John Wiley & Sons, Ltd.
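The hit-calling step in the abstract above (Z-score > 3.0) can be illustrated with a minimal sketch. The abstract states only the threshold; how the Z-score was computed is not given, so the version below assumes the common convention of standardizing each spot signal against the mean and sample standard deviation of all spots on the array. The function name `z_score_hits` and the example signal values are hypothetical.

```python
from statistics import mean, stdev


def z_score_hits(signals, threshold=3.0):
    """Return spot names whose Z-score exceeds the threshold.

    Assumption (not from the abstract): Z = (signal - array mean) / array SD,
    computed over every spot supplied in `signals`.
    """
    values = list(signals.values())
    mu = mean(values)
    sigma = stdev(values)  # sample standard deviation
    if sigma == 0:
        return []  # flat array: nothing stands out
    return [name for name, v in signals.items() if (v - mu) / sigma > threshold]


# Hypothetical example: 20 background spots and one strongly phosphorylated spot.
example = {f"bg{i}": 100.0 for i in range(20)}
example["candidate"] = 1000.0
hits = z_score_hits(example)
```

A robust variant (median and median absolute deviation) is often preferred when many spots are true positives, because strong hits inflate the standard deviation and can mask each other.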
Transcriptional and proteomic analysis of the Aspergillus fumigatus ΔprtT protease-deficient mutant.
Hagag, Shelly; Kubitschek-Barreira, Paula; Neves, Gabriela W P; Amar, David; Nierman, William; Shalit, Itamar; Shamir, Ron; Lopes-Bezerra, Leila; Osherov, Nir
2012-01-01
Aspergillus fumigatus is the most common opportunistic mold pathogen of humans, infecting immunocompromised patients. The fungus invades the lungs and other organs, causing severe damage. Penetration of the pulmonary epithelium is a key step in the infectious process. A. fumigatus produces extracellular proteases to degrade the host structural barriers. The A. fumigatus transcription factor PrtT controls the expression of multiple secreted proteases. PrtT shows similarity to the fungal Gal4-type Zn(2)-Cys(6) DNA-binding domain of several transcription factors. In this work, we further investigate the function of this transcription factor by performing a transcriptional and a proteomic analysis of the ΔprtT mutant. Unexpectedly, microarray analysis revealed that in addition to the expected decrease in protease expression, expression of genes involved in iron uptake and ergosterol synthesis was dramatically decreased in the ΔprtT mutant. A second finding of interest is that deletion of prtT resulted in the upregulation of four secondary metabolite clusters, including genes for the biosynthesis of toxic pseurotin A. Proteomic analysis identified reduced levels of three secreted proteases (ALP1 protease, TppA, AFUA_2G01250) and increased levels of three secreted polysaccharide-degrading enzymes in the ΔprtT mutant possibly in response to its inability to derive sufficient nourishment from protein breakdown. This report highlights the complexity of gene regulation by PrtT, and suggests a potential novel link between the regulation of protease secretion and the control of iron uptake, ergosterol biosynthesis and secondary metabolite production in A. fumigatus.
Transcriptomic analysis of Arabidopsis developing stems: a close-up on cell wall genes
Minic, Zoran; Jamet, Elisabeth; San-Clemente, Hélène; Pelletier, Sandra; Renou, Jean-Pierre; Rihouey, Christophe; Okinyo, Denis PO; Proux, Caroline; Lerouge, Patrice; Jouanin, Lise
2009-01-01
Background Different strategies (genetics, biochemistry, and proteomics) can be used to study proteins involved in cell biogenesis. The availability of the complete sequences of several plant genomes allowed the development of transcriptomic studies. Although the expression patterns of some Arabidopsis thaliana genes involved in cell wall biogenesis were identified at different physiological stages, detailed microarray analysis of plant cell wall genes has not been performed on any plant tissues. Using transcriptomic and bioinformatic tools, we studied the regulation of cell wall genes in Arabidopsis stems, i.e. genes encoding proteins involved in cell wall biogenesis and genes encoding secreted proteins. Results Transcriptomic analyses of stems were performed at three different developmental stages, i.e., young stems, intermediate stage, and mature stems. Many genes involved in the synthesis of cell wall components such as polysaccharides and monolignols were identified. A total of 345 genes encoding predicted secreted proteins with moderate or high level of transcripts were analyzed in detail. The encoded proteins were distributed into 8 classes, based on the presence of predicted functional domains. Proteins acting on carbohydrates and proteins of unknown function constituted the two most abundant classes. Other proteins were proteases, oxido-reductases, proteins with interacting domains, proteins involved in signalling, and structural proteins. Particularly high levels of expression were established for genes encoding pectin methylesterases, germin-like proteins, arabinogalactan proteins, fasciclin-like arabinogalactan proteins, and structural proteins. Finally, the results of these transcriptomic analyses were compared with those obtained through a cell wall proteomic analysis from the same material. Only a small proportion of genes identified by previous proteomic analyses were identified by transcriptomics. 
Conversely, only a few proteins encoded by genes having moderate or high level of transcripts were identified by proteomics. Conclusion Analysis of the genes predicted to encode cell wall proteins revealed that about 345 genes had moderate or high levels of transcripts. Among them, we identified many new genes possibly involved in cell wall biogenesis. The discrepancies observed between results of this transcriptomic study and a previous proteomic study on the same material revealed post-transcriptional mechanisms of regulation of expression of genes encoding cell wall proteins. PMID:19149885
Brusniak, Mi-Youn; Bodenmiller, Bernd; Campbell, David; Cooke, Kelly; Eddes, James; Garbutt, Andrew; Lau, Hollis; Letarte, Simon; Mueller, Lukas N; Sharma, Vagisha; Vitek, Olga; Zhang, Ning; Aebersold, Ruedi; Watts, Julian D
2008-01-01
Background Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. 
Conclusion The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field. PMID:19087345
High-Throughput Cloning and Expression Library Creation for Functional Proteomics
Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua
2013-01-01
The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047
Characterization of proteomic and metabolomic responses to dietary factors and supplements.
Astle, John; Ferguson, Jonathan T; German, J Bruce; Harrigan, George G; Kelleher, Neil L; Kodadek, Thomas; Parks, Bryan A; Roth, Michael J; Singletary, Keith W; Wenger, Craig D; Mahady, Gail B
2007-12-01
Over the past decade there has been a renewed interest in research and development of both dietary and nutritional supplements. Significant advancements have been made in the scientific assessment of the quality, safety, and efficacy of these products because of the strong interest in and financial support of these projects. As research in both fields continues to advance, opportunities to use new and innovative research technologies and methodologies, such as proteomics and metabolomics, are critical for the future progress of the science. The purpose of the symposium was to begin the process of communicating new innovative proteomic and metabolomic methodologies that may be applied by researchers in both the nutrition and the natural product communities. This symposium highlighted 2 proteomic approaches, protein fingerprinting in complex mixtures with peptoid microarrays and top-down mass spectrometry for annotation of gene products. Likewise, an overview of the methodologies used in metabolomic profiling of natural products was presented, and an illustration of an integrated metabolomics approach in nutrition research was highlighted.
Qendro, Veneta; Bugos, Grace A; Lundgren, Debbie H; Glynn, John; Han, May H; Han, David K
2017-03-01
In order to gain mechanistic insights into multiple sclerosis (MS) pathogenesis, we utilized a multi-dimensional approach to test the hypothesis that mutations in myelin proteins lead to immune activation and central nervous system autoimmunity in MS. Mass spectrometry-based proteomic analysis of human MS brain lesions revealed seven unique mutations of PLP1, a key myelin protein that is known to be destroyed in MS. Surprisingly, in-depth analysis of two MS patients at the genomic DNA and mRNA levels confirmed mutated PLP1 in RNA, but not in the genomic DNA. Quantification of wild type and mutant PLP RNA levels by qPCR further validated the presence of mutant PLP RNA in the MS patients. To seek evidence linking mutations in abundant myelin proteins and immune-mediated destruction of myelin, specific immune response against mutant PLP1 in MS patients was examined. To this end, we designed paired wild type and mutant peptide microarrays, and examined antibody response to multiple mutated PLP1 in sera from MS patients. Consistent with the idea of different patients exhibiting unique mutation profiles, we found that 13 out of 20 MS patients showed antibody responses against specific but not against all the mutant-PLP1 peptides. Interestingly, we found mutant PLP-directed antibody response against specific mutant peptides in the sera of pre-MS controls. The results from integrative proteomic, genomic, and immune analyses reveal a possible mechanism of mutation-driven pathogenesis in human MS. The study also highlights the need for integrative genomic and proteomic analyses for uncovering pathogenic mechanisms of human diseases. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Chao, Jie; Li, Zhenhua; Li, Jing; Peng, Hongzhen; Su, Shao; Li, Qian; Zhu, Changfeng; Zuo, Xiaolei; Song, Shiping; Wang, Lianhui; Wang, Lihua
2016-07-15
Microarrays of biomolecules hold great promise in the fields of genomics, proteomics, and clinical assays on account of their remarkably parallel and high-throughput assay capability. However, the fluorescence detection used in most conventional DNA microarrays is still limited by sensitivity. In this study, we have demonstrated a novel universal and highly sensitive platform for fluorescent detection of sequence-specific DNA at the femtomolar level by combining dextran-coated microarrays with hybridization chain reaction (HCR) signal amplification. A three-dimensional dextran matrix was covalently coated on the glass surface as the scaffold to immobilize DNA recognition probes to increase the surface binding capacity and accessibility. DNA nanowire tentacles were formed on the matrix surface for efficient signal amplification by capturing multiple fluorescent molecules in a highly ordered way. By quantifying microscopic fluorescent signals, the synergetic effects of dextran and HCR greatly improved sensitivity of DNA microarrays, with a detection limit of 10 fM (1×10^5 molecules). This detection assay could recognize a one-base mismatch, with fluorescence signals dropping to ~20%. This cost-effective microarray platform also worked well with samples in serum and thus shows great potential for clinical diagnosis. Copyright © 2016 Elsevier B.V. All rights reserved.
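The two figures quoted for the detection limit in the abstract above (10 fM and 1×10^5 molecules) are linked by the sample volume, which the abstract does not state. The sketch below is only a consistency check using Avogadro's number; the function names are hypothetical, and the volume it yields is an inference from the two quoted numbers, not a value reported by the authors.

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI value)


def molecules_in_sample(concentration_molar, volume_litres):
    """Number of molecules in a sample of the given molar concentration and volume."""
    return concentration_molar * volume_litres * AVOGADRO


def volume_for_molecule_count(concentration_molar, n_molecules):
    """Volume (litres) at which the given concentration contains n_molecules."""
    return n_molecules / (concentration_molar * AVOGADRO)


# 10 fM = 10e-15 mol/L; which volume would hold 1e5 target molecules?
implied_volume_l = volume_for_molecule_count(10e-15, 1e5)
implied_volume_ul = implied_volume_l * 1e6  # litres -> microlitres
```

Under these assumptions the two figures are mutually consistent for a sample volume in the range of a few tens of microlitres, which is a typical scale for microarray hybridization.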
Transcriptomic and Proteomic Analysis of Oenococcus oeni Adaptation to Wine Stress Conditions
Margalef-Català, Mar; Araque, Isabel; Bordons, Albert; Reguant, Cristina; Bautista-Gallego, Joaquín
2016-01-01
Oenococcus oeni, the main lactic acid bacterium responsible for malolactic fermentation (MLF) in wine, has to adapt to stressful conditions, such as low pH and high ethanol content. In this study, the changes in the transcriptome and the proteome of O. oeni PSU-1 during the adaptation period before MLF start have been studied. DNA microarrays were used for the transcriptomic analysis and two complementary proteomic techniques, 2-D DIGE and iTRAQ labeling, were used to analyze the proteomic response. One of the functions most affected in PSU-1 by inoculation into wine-like medium (WLM) was translation, showing the over-expression of certain ribosomal genes and the corresponding proteins. Amino acid metabolism and transport was also altered and several peptidases were up-regulated both at gene and protein level. Certain proteins involved in glutamine and glutamate metabolism showed an increased abundance, revealing the key role of nitrogen uptake under stressful conditions. A strong transcriptional inhibition of carbohydrate metabolism related genes was observed. On the other hand, the transcriptional up-regulation of malate transport and citrate consumption was indicative of the use of L-malate and citrate associated with the stress response and as an alternative energy source to sugar metabolism. Regarding the stress mechanisms, our results support the relevance of the thioredoxin and glutathione systems in the adaptation of O. oeni to wine related stress. Genes and proteins related to the cell wall also showed significant changes, indicating the relevance of the cell envelope as a protective barrier to environmental stress. The differences found between transcriptomic and proteomic data suggested the relevance of post-transcriptional mechanisms and the complexity of the stress response in O. oeni adaptation. Further research should examine more closely the metabolic pathways most altered by wine conditions to elucidate the role of each mechanism in the ability of O. oeni to develop MLF. 
PMID:27746771
Zangar, Richard C.; Varnum, Susan M.; Covington, Chandice Y.; ...
2004-01-01
Identifying useful markers of cancer can be problematic due to limited amounts of sample. Some samples such as nipple aspirate fluid (NAF) or early-stage tumors are inherently small. Other samples such as serum are collected in larger volumes but archives of these samples are very valuable and only small amounts of each sample may be available for a single study. Also, given the diverse nature of cancer and the inherent variability in individual protein levels, it seems likely that the best approach to screen for cancer will be to determine the profile of a battery of proteins. As a result, a major challenge in identifying protein markers of disease is the ability to screen many proteins using very small amounts of sample. In this review, we outline some technological advances in proteomics that greatly advance this capability. Specifically, we propose a strategy for identifying markers of breast cancer in NAF that utilizes mass spectrometry (MS) to simultaneously screen hundreds or thousands of proteins in each sample. The best potential markers identified by the MS analysis can then be extensively characterized using an ELISA microarray assay. Because the microarray analysis is quantitative and large numbers of samples can be efficiently analyzed, this approach offers the ability to rapidly assess a battery of selected proteins in a manner that is directly relevant to traditional clinical assays.
Xia, Qiangwei; Wang, Tiansong; Park, Yoonsuk; Lamont, Richard J.; Hackett, Murray
2009-01-01
Differential analysis of whole cell proteomes by mass spectrometry has largely been applied using various forms of stable isotope labeling. While metabolic stable isotope labeling has been the method of choice, it is often not possible to apply such an approach. Four different label free ways of calculating expression ratios in a classic “two-state” experiment are compared: signal intensity at the peptide level, signal intensity at the protein level, spectral counting at the peptide level, and spectral counting at the protein level. The quantitative data were mined from a dataset of 1245 qualitatively identified proteins, about 56% of the protein encoding open reading frames from Porphyromonas gingivalis, a Gram-negative intracellular pathogen being studied under extracellular and intracellular conditions. Two different control populations were compared against P. gingivalis internalized within a model human target cell line. The q-value statistic, a measure of false discovery rate previously applied to transcription microarrays, was applied to proteomics data. For spectral counting, the most logically consistent estimate of random error came from applying the locally weighted scatter plot smoothing procedure (LOWESS) to the most extreme ratios generated from a control technical replicate, thus setting upper and lower bounds for the region of experimentally observed random error. PMID:19337574
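The abstract above applies the q-value statistic to control the false discovery rate across many protein-level tests. As a rough illustration of the idea, the sketch below computes Benjamini-Hochberg adjusted p-values. This is a deliberately simpler, related procedure swapped in for illustration: it is not the q-value method the authors cite (which additionally estimates the proportion of true null hypotheses), and the function name and example p-values are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values, in the input order.

    For the i-th smallest of n p-values the raw adjustment is p * n / i;
    a running minimum from the largest rank downwards enforces monotonicity.
    """
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])  # indices by ascending p
    adjusted = [0.0] * n
    running_min = 1.0
    for rank in range(n, 0, -1):  # walk from the largest p-value to the smallest
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical per-protein p-values from a two-state comparison.
example_p = [0.01, 0.04, 0.03, 0.005]
example_adj = benjamini_hochberg(example_p)
significant = [i for i, q in enumerate(example_adj) if q <= 0.025]
```

Proteins whose adjusted value falls at or below the chosen cutoff are reported as differentially abundant; the cutoff then bounds the expected fraction of false positives among them.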
Ho, Yu-Hsuan; Shah, Pramod; Chen, Yi-Wen; Chen, Chien-Sheng
2016-01-01
Antimicrobial peptides (AMPs) act either through membrane lysis or by attacking intracellular targets. Intracellular targeting AMPs are a resource for antimicrobial agent development. Several AMPs have been identified as intracellular targeting peptides; however, the intracellular targets of many of these peptides remain unknown. In the present study, we used an Escherichia coli proteome microarray to systematically identify the protein targets of three intracellular targeting AMPs: bactenecin 7 (Bac7), a hybrid of pleurocidin and dermaseptin (P-Der), and proline-arginine-rich peptide (PR-39). In addition, we also included the data of lactoferricin B (LfcinB) from our previous study for a more comprehensive analysis. We analyzed the unique protein hits of each AMP in the Kyoto Encyclopedia of Genes and Genomes. The results indicated that Bac7 targets purine metabolism and histidine kinase, LfcinB attacks the transcription-related activities and several cellular carbohydrate biosynthetic processes, P-Der affects several catabolic processes of small molecules, and PR-39 preferentially recognizes proteins involved in RNA- and folate-metabolism-related cellular processes. Moreover, both Bac7 and LfcinB target purine metabolism, whereas LfcinB and PR-39 target lipopolysaccharide biosynthesis. This suggested that LfcinB and Bac7 as well as LfcinB and PR-39 have a synergistic effect on antimicrobial activity, which was validated through antimicrobial assays. Furthermore, common hits of all four AMPs indicated that all of them target arginine decarboxylase, which is a crucial enzyme for Escherichia coli survival in extremely acidic environments. Thus, these AMPs may display greater inhibition to bacterial growth in extremely acidic environments. We have also confirmed this finding in bacterial growth inhibition assays. 
In conclusion, this comprehensive identification and systematic analysis of intracellular targeting AMPs reveals crucial insights into the intracellular mechanisms of the action of AMPs. PMID:26902206
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf; ...
2014-11-14
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth of IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.
Hsu, Chun-Nan; Lai, Jin-Mei; Liu, Chia-Hung; Tseng, Huei-Hun; Lin, Chih-Yun; Lin, Kuan-Ting; Yeh, Hsu-Hua; Sung, Ting-Yi; Hsu, Wen-Lian; Su, Li-Jen; Lee, Sheng-An; Chen, Chang-Han; Lee, Gen-Cher; Lee, DT; Shiue, Yow-Ling; Yeh, Chang-Wei; Chang, Chao-Hui; Kao, Cheng-Yan; Huang, Chi-Ying F
2007-01-01
Background: The significant advances in microarray and proteomics analyses have resulted in an exponential increase in potential new targets and have promised to shed light on the identification of disease markers and cellular pathways. We aim to collect and decipher the HCC-related genes at the systems level. Results: Here, we build an integrative platform, the Encyclopedia of Hepatocellular Carcinoma genes Online, dubbed EHCO, to systematically collect, organize and compare the pileup of unsorted HCC-related studies by using natural language processing and softbots. Among the eight gene set collections, ranging across PubMed, SAGE, microarray, and proteomics data, there are 2,906 genes in total; however, more than 77% of the genes are included only once, suggesting that tremendous efforts need to be exerted to characterize the relationship between HCC and these genes. Of these HCC inventories, protein binding represents the largest proportion (~25%) in the Gene Ontology analysis. In fact, many differentially expressed gene sets in EHCO could form interaction networks (e.g. an HBV-associated HCC network) by using available human protein-protein interaction datasets. To further highlight the potential new targets in the network inferred from EHCO, we combine comparative genomics and interactomics approaches to analyze 120 evolutionarily conserved and overexpressed genes in HCC. Of the 120 queries, 47 can form a highly interactive network, with 18 queries serving as hubs. Conclusion: This architectural map may represent the first step toward deciphering hepatocarcinogenesis at the systems level. Targeting hubs and/or disrupting network formation might reveal novel strategies for HCC treatment. PMID:17326819
Microscopy Images as Interactive Tools in Cell Modeling and Cell Biology Education
ERIC Educational Resources Information Center
Araujo-Jorge, Tania C.; Cardona, Tania S.; Mendes, Claudia L. S.; Henriques-Pons, Andrea; Meirelles, Rosane M. S.; Coutinho, Claudia M. L. M.; Aguiar, Luiz Edmundo V.; Meirelles, Maria de Nazareth L.; de Castro, Solange L.; Barbosa, Helene S.; Luz, Mauricio R. M. P.
2004-01-01
The advent of genomics, proteomics, and microarray technology has brought much excitement to science, both in teaching and in learning. The public is eager to know about the processes of life. In the present context of the explosive growth of scientific information, a major challenge of modern cell biology is to popularize basic concepts of…
Mechanisms of CCl4-induced liver fibrosis with combined transcriptomic and proteomic analysis.
Dong, Shu; Chen, Qi-Long; Song, Ya-Nan; Sun, Yang; Wei, Bin; Li, Xiao-Yan; Hu, Yi-Yang; Liu, Ping; Su, Shi-Bing
2016-01-01
The classic toxic effect of carbon tetrachloride (CCl4) is the induction of liver lesions and liver fibrosis. Liver fibrosis is a consequence of chronic liver lesions and can progress to liver cirrhosis and even hepatocarcinoma. However, the toxicological mechanisms of CCl4-induced liver fibrosis are not fully understood. We combined transcriptomic and proteomic analysis with biological network technology to predict the toxicological targets and regulatory networks of CCl4 in liver fibrosis. Wistar rats were treated with CCl4 for 9 weeks. Histopathological changes, hydroxyproline (Hyp) content, and serum ALT and AST were significantly greater in the CCl4-treated group than in the untreated group. CCl4-treated and untreated liver tissues were examined by microarray and iTRAQ. The results showed that 3535 genes (fold change ≥ 1.5, P < 0.05) and 1412 proteins (fold change ≥ 1.2, P < 0.05) were differentially expressed. Moreover, the integrative analysis of transcriptomics and proteomics data showed 523 overlapping proteins, enriched in 182 GO terms including oxidation reduction, response to oxidative stress, inflammatory response, extracellular matrix organization, etc. Furthermore, KEGG pathway analysis identified 36 pathways, including retinol metabolism, PPAR signaling pathway, glycolysis/gluconeogenesis, arachidonic acid metabolism, metabolism of xenobiotics by cytochrome P450 and drug metabolism. A protein-protein interaction (PPI) network of the key functions and their related targets was constructed, and the degree of the network was calculated with Cytoscape. The expression of key targets such as CYP4A3, ALDH2 and ALDH7A1 decreased after CCl4 treatment. Therefore, the toxicological mechanisms of CCl4-induced liver fibrosis may involve multiple biological processes, pathways and targets, which may point to potential protective mechanisms for CCl4 detoxification in the liver.
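The thresholding and overlap step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the record layout, the helper name `differentially_expressed`, the symmetric treatment of down-regulation, and all numeric values are assumptions made for the example, and the `Actb` and `Col1a1` entries are hypothetical placeholders.

```python
def differentially_expressed(records, min_fold, max_p=0.05):
    """Return identifiers that pass a fold-change cutoff and a p-value cutoff.

    records: iterable of (identifier, fold_change, p_value), fold_change > 0.
    Assumption: down-regulation is treated symmetrically, so a fold change
    of 0.5 counts as a 2-fold change.
    """
    hits = set()
    for name, fold, p in records:
        magnitude = fold if fold >= 1 else 1 / fold
        if magnitude >= min_fold and p < max_p:
            hits.add(name)
    return hits


# Made-up values for illustration only; they are not data from the study.
transcripts = [("Cyp4a3", 0.40, 0.001), ("Aldh2", 0.60, 0.010), ("Actb", 1.05, 0.700)]
proteins = [("Cyp4a3", 0.70, 0.020), ("Aldh2", 0.95, 0.300), ("Col1a1", 1.80, 0.004)]

# Cutoffs quoted in the abstract: 1.5-fold for genes, 1.2-fold for proteins.
genes_hit = differentially_expressed(transcripts, min_fold=1.5)
proteins_hit = differentially_expressed(proteins, min_fold=1.2)

# Entries that change at both the transcript and the protein level.
overlap = genes_hit & proteins_hit
print(sorted(overlap))
```

In practice the transcript and protein identifiers would first have to be mapped to a common namespace before the set intersection is meaningful.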
Kimura, Yayoi; Yanagimachi, Masakatsu; Ino, Yoko; Aketagawa, Mao; Matsuo, Michie; Okayama, Akiko; Shimizu, Hiroyuki; Oba, Kunihiro; Morioka, Ichiro; Imagawa, Tomoyuki; Kaneko, Tetsuji; Yokota, Shumpei; Hirano, Hisashi; Mori, Masaaki
2017-01-01
Kawasaki disease (KD) is a systemic vasculitis and childhood febrile disease that can lead to cardiovascular complications. The diagnosis of KD depends on its clinical features, and thus it is sometimes difficult to make a definitive diagnosis. In order to identify diagnostic serum biomarkers for KD, we used mass spectrometry (MS) to explore serum KD-related proteins that were differentially expressed during the acute and recovery phases of two patients. We identified a total of 1,879 proteins by MS-based proteomic analysis. The levels of three of these proteins, namely lipopolysaccharide-binding protein (LBP), leucine-rich alpha-2-glycoprotein (LRG1), and angiotensinogen (AGT), were higher in acute phase patients. In contrast, the level of retinol-binding protein 4 (RBP4) was decreased. To confirm the usefulness of these proteins as biomarkers, we analyzed a total of 270 samples, including those collected from 55 patients with acute phase KD, by using western blot analysis and microarray enzyme-linked immunosorbent assays (ELISAs). Over the course of this experiment, we determined that the expression level of these proteins changes specifically in the acute phase of KD, rather than in the recovery phase of KD or in other febrile illnesses. Thus, LRG1 could be used as a biomarker to facilitate KD diagnosis based on clinical features. PMID:28262744
Syed, Nazia; Chavan, Sandip; Sahasrabuddhe, Nandini A; Renuse, Santosh; Sathe, Gajanan; Nanjappa, Vishalakshi; Radhakrishnan, Aneesha; Raja, Remya; Pinto, Sneha M; Srinivasan, Anand; Prasad, T S Keshava; Srikumar, Kotteazeth; Gowda, Harsha; Santosh, Vani; Sidransky, David; Califano, Joseph A; Pandey, Akhilesh; Chatterjee, Aditi
2015-01-01
Dysregulation of protein expression is associated with most diseases including cancer. MS-based proteomic analysis is widely employed as a tool to study protein dysregulation in cancers. Proteins that are differentially expressed in head and neck squamous cell carcinoma (HNSCC) cell lines compared to the normal oral cell line could serve as biomarkers for patient stratification. To understand the proteomic complexity in HNSCC, we carried out iTRAQ-based MS analysis on a panel of HNSCC cell lines in addition to a normal oral keratinocyte cell line. LC-MS/MS analysis of total proteome of the HNSCC cell lines led to the identification of 3263 proteins, of which 185 proteins were overexpressed and 190 proteins were downregulated more than twofold in at least two of the three HNSCC cell lines studied. Among the overexpressed proteins, 23 proteins were related to DNA replication and repair. These included high-mobility group box 2 (HMGB2) protein, which was overexpressed in all three HNSCC lines studied. Overexpression of HMGB2 has been reported in various cancers, yet its role in HNSCC remains unclear. Immunohistochemical labeling of HMGB2 in a panel of HNSCC tumors using tissue microarrays revealed overexpression in 77% (54 of 70) of tumors. The HMGB proteins are known to bind to DNA structure resulting from cisplatin-DNA adducts and affect the chemosensitivity of cells. We observed that siRNA-mediated silencing of HMGB2 increased the sensitivity of the HNSCC cell lines to cisplatin and 5-FU. We hypothesize that targeting HMGB2 could enhance the efficacy of existing chemotherapeutic regimens for treatment of HNSCC. All MS data have been deposited in the ProteomeXchange with identifier PXD000737 (http://proteomecentral.proteomexchange.org/dataset/PXD000737). © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zhong, Qing; Guo, Tiannan; Rechsteiner, Markus; Rüschoff, Jan H.; Rupp, Niels; Fankhauser, Christian; Saba, Karim; Mortezavi, Ashkan; Poyet, Cédric; Hermanns, Thomas; Zhu, Yi; Moch, Holger; Aebersold, Ruedi; Wild, Peter J.
2017-01-01
Microscopy image data of human cancers provide detailed phenotypes of spatially and morphologically intact tissues at single-cell resolution, thus complementing large-scale molecular analyses, e.g., next generation sequencing or proteomic profiling. Here we describe a high-resolution tissue microarray (TMA) image dataset from a cohort of 71 prostate tissue samples, which was hybridized with bright-field dual colour chromogenic and silver in situ hybridization probes for the tumour suppressor gene PTEN. These tissue samples were digitized and supplemented with expert annotations, clinical information, statistical models of PTEN genetic status, and computer source codes. For validation, we constructed an additional TMA dataset for 424 prostate tissues, hybridized with FISH probes for PTEN, and performed survival analysis on a subset of 339 radical prostatectomy specimens with overall, disease-specific and recurrence-free survival (maximum 167 months). For application, we further produced 6,036 image patches derived from two whole slides. Our curated collection of prostate cancer data sets provides reuse potential for both biomedical and computational studies. PMID:28291248
High quality epoxysilane substrate for clinical multiplex serodiagnostic proteomic microarrays
NASA Astrophysics Data System (ADS)
Ewart, Tom; Carmichael, Stuart; Lea, Peter
2005-09-01
Polylysine and aminopropylsilane treated glass comprised the majority of substrates employed in first generation genetic microarrays. Second generation single stranded long oligo libraries with amino termini provided for controlled terminal specific attachment, and rationally designed unique sequence libraries with normalized melting temperatures. These libraries benefit from active covalent coupling surfaces such as Epoxysilane. The latter's oxirane ring shows versatile reactivity with amino-, thiol- and hydroxyl- groups, thus encompassing small molecule, oligo and proteomic microarray applications. Batch-to-batch production uniformity supports entry of the Epoxysilane process into clinical diagnostics. We carried out multiple print runs of 21 clinically relevant bacterial and viral antigens at optimized concentrations, plus human IgG and IgM standards in triplicate on multiple batches of Epoxysilane substrates. A set of 45 patient sera were assayed in a 35 minute protocol using 10 microliters per array in a capillary-fill format (15 minute serum incubation, wash, 15 minute incubation with Cy3-labeled anti-hIgG plus Dy647-labeled anti-hIgM, final wash). The LOD (3 SD above background) was better than 1 microgram/ml for IgG, and standard curves were regular and monotonically increasing over the range 0 to 1000 micrograms/ml. Ninety-five percent of the CVs for the standards were under 10%, and 90% of the CVs for antigen responses were under 10% across all batches of Epoxysilane and print runs. In addition, where SDs are larger than expected, microarray images may be readily reviewed for quality control purposes and pin misprints quickly identified. In order to determine the influence of stirring on sensitivity and speed of the microarray assay, we printed 10 common ToRCH antigens (H. pylori, T. gondii, Rubella, Rubeola, C. trachomatis, Herpes 1 and 2, CMV, C. jejuni, and EBV) in Epoxysilane-activated slide-wells. Anti-IgG-Cy3 direct binding to printed IgG calibration spots could be detected (3 x LOD) above background at 100 pg/ml (0.13 femtomoles sample content) in a 10 minute incubation. The LOD for detection of serum anti-H. pylori antibody level was 9 ng/ml in the same incubation time.
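The two quality metrics quoted in this abstract, a limit of detection set at 3 SD above background and a coefficient of variation (CV) across replicate spots, can be sketched as below. This is a minimal illustration under stated assumptions: the function names are hypothetical, the intensity values are invented, and the use of the sample standard deviation is an assumption, since the abstract does not say how the SD was computed.

```python
from statistics import mean, stdev


def limit_of_detection(background, k=3.0):
    """Signal threshold defined as mean background plus k sample SDs."""
    return mean(background) + k * stdev(background)


def coefficient_of_variation(replicates):
    """CV in percent: sample SD of the replicates divided by their mean."""
    return 100.0 * stdev(replicates) / mean(replicates)


# Invented fluorescence intensities (arbitrary units), for illustration only.
background = [100.0, 104.0, 96.0, 102.0, 98.0]
triplicate_spots = [1500.0, 1450.0, 1550.0]

lod = limit_of_detection(background)
cv = coefficient_of_variation(triplicate_spots)
print(f"LOD threshold: {lod:.1f} a.u., CV: {cv:.2f}%")
```

A threshold expressed in signal units still has to be converted to a concentration through the standard curve before it can be compared with figures such as 1 microgram/ml.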
Exploitation of molecular profiling techniques for GM food safety assessment.
Kuiper, Harry A; Kok, Esther J; Engel, Karl-Heinz
2003-04-01
Several strategies have been developed to identify unintended alterations in the composition of genetically modified (GM) food crops that may occur as a result of the genetic modification process. These include comparative chemical analysis of single compounds in GM food crops and their conventional non-GM counterparts, and profiling methods such as DNA/RNA microarray technologies, proteomics and metabolite profiling. The potential of profiling methods is obvious, but further exploration of specificity, sensitivity and validation is needed. Moreover, the successful application of profiling techniques to the safety evaluation of GM foods will require linked databases to be built that contain information on variations in profiles associated with differences in developmental stages and environmental conditions.
Quantifying protein-protein interactions in high throughput using protein domain microarrays.
Kaushansky, Alexis; Allen, John E; Gordus, Andrew; Stiffler, Michael A; Karp, Ethan S; Chang, Bryan H; MacBeath, Gavin
2010-04-01
Protein microarrays provide an efficient way to identify and quantify protein-protein interactions in high throughput. One drawback of this technique is that proteins show a broad range of physicochemical properties and are often difficult to produce recombinantly. To circumvent these problems, we have focused on families of protein interaction domains. Here we provide protocols for constructing microarrays of protein interaction domains in individual wells of 96-well microtiter plates, and for quantifying domain-peptide interactions in high throughput using fluorescently labeled synthetic peptides. As specific examples, we will describe the construction of microarrays of virtually every human Src homology 2 (SH2) and phosphotyrosine binding (PTB) domain, as well as microarrays of mouse PDZ domains, all produced recombinantly in Escherichia coli. For domains that mediate high-affinity interactions, such as SH2 and PTB domains, equilibrium dissociation constants (K(D)s) for their peptide ligands can be measured directly on arrays by obtaining saturation binding curves. For weaker binding domains, such as PDZ domains, arrays are best used to identify candidate interactions, which are then retested and quantified by fluorescence polarization. Overall, protein domain microarrays provide the ability to rapidly identify and quantify protein-ligand interactions with minimal sample consumption. Because entire domain families can be interrogated simultaneously, they provide a powerful way to assess binding selectivity on a proteome-wide scale and provide an unbiased perspective on the connectivity of protein-protein interaction networks.
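As a rough illustration of how an equilibrium dissociation constant can be read from a saturation binding curve, the sketch below assumes a one-site binding model, F = Fmax·[L] / (K_D + [L]), and recovers K_D from a double-reciprocal straight-line fit. The model, the function name, the concentrations and the signal values are all assumptions for the example; the abstract does not state the fitting procedure, and nonlinear least squares is generally preferred for noisy data because the reciprocal transform amplifies error at low signal.

```python
def fit_kd_double_reciprocal(concentrations, signals):
    """Estimate (K_D, F_max) for a one-site model from 1/F versus 1/[L].

    With F = F_max * L / (K_D + L), it follows that
    1/F = 1/F_max + (K_D / F_max) * (1/L), a straight line in 1/L.
    """
    xs = [1.0 / c for c in concentrations]
    ys = [1.0 / s for s in signals]
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Ordinary least-squares slope and intercept.
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    f_max = 1.0 / intercept
    k_d = slope / intercept
    return k_d, f_max


# Synthetic, noise-free titration (micromolar peptide, arbitrary signal units).
true_kd, true_fmax = 0.5, 1000.0
conc = [0.05, 0.1, 0.25, 0.5, 1.0, 2.5, 5.0]
signal = [true_fmax * c / (true_kd + c) for c in conc]

k_d, f_max = fit_kd_double_reciprocal(conc, signal)
print(f"K_D ~ {k_d:.3f} uM, F_max ~ {f_max:.1f}")
```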
The Effect of Iron Limitation on the Transcriptome and Proteome of Pseudomonas fluorescens Pf-5
Lim, Chee Kent; Hassan, Karl A.; Tetu, Sasha G.; Loper, Joyce E.; Paulsen, Ian T.
2012-01-01
One of the most important micronutrients for bacterial growth is iron, whose bioavailability in soil is limited. Consequently, rhizospheric bacteria such as Pseudomonas fluorescens employ a range of mechanisms to acquire or compete for iron. We investigated the transcriptomic and proteomic effects of iron limitation on P. fluorescens Pf-5 by employing microarray and iTRAQ techniques, respectively. Analysis of these data revealed that genes encoding functions related to iron homeostasis, including pyoverdine and enantio-pyochelin biosynthesis, a number of TonB-dependent receptor systems, as well as some inner-membrane transporters, were significantly up-regulated in response to iron limitation. Transcription of a ribosomal protein L36-encoding gene was also highly up-regulated during iron limitation. Certain genes or proteins involved in biosynthesis of secondary metabolites such as 2,4-diacetylphloroglucinol (DAPG), orfamide A and pyrrolnitrin, as well as a chitinase, were over-expressed under iron-limited conditions. In contrast, we observed that expression of genes involved in hydrogen cyanide production and flagellar biosynthesis was down-regulated in an iron-depleted culture medium. Phenotypic tests revealed that Pf-5 had reduced swarming motility on semi-solid agar in response to iron limitation. Comparison of the transcriptomic data with the proteomic data suggested that iron acquisition is regulated at both the transcriptional and post-transcriptional levels. PMID:22723948
NASA Technical Reports Server (NTRS)
Stein, T. Peter; Wade, Charles E.
2003-01-01
PURPOSE OF REVIEW: In response to decreased usage, skeletal muscle undergoes adaptive reductive remodeling due to the decrease in tension on the weight bearing components of the musculo-skeletal system. This response occurs with uncomplicated disuse (e.g. bed rest, space flight), as a secondary consequence of several widely prevalent chronic diseases in which activity is reduced (e.g. chronic obstructive pulmonary disease and chronic heart failure), and as part of the aging process. The problem is therefore one of considerable clinical importance. RECENT FINDINGS: The impaired function and exercise intolerance are related more to the associated muscle wasting than to the specific organ system primarily impacted by the disease. Progress has continued in describing the use of anabolic drugs and dietary manipulation. The major advances in the field have been (i) the discovery of the atrogin-1 gene and (ii) the application of microarray expression analysis and proteomics with the objective of obtaining a comprehensive understanding of the pathways changed with disuse atrophy. SUMMARY: Disuse atrophy is a common clinical problem. There is a need for therapeutic interventions that do not involve exercise. A better understanding of the changes, particularly at the molecular level, could indicate hitherto unsuspected sites for nutritional and pharmacological intervention.
Clustering and Network Analysis of Reverse Phase Protein Array Data.
Byron, Adam
2017-01-01
Molecular profiling of proteins and phosphoproteins using a reverse phase protein array (RPPA) platform, with a panel of target-specific antibodies, enables the parallel, quantitative proteomic analysis of many biological samples in a microarray format. Hence, RPPA analysis can generate a high volume of multidimensional data that must be effectively interrogated and interpreted. A range of computational techniques for data mining can be applied to detect and explore data structure and to form functional predictions from large datasets. Here, two approaches for the computational analysis of RPPA data are detailed: the identification of similar patterns of protein expression by hierarchical cluster analysis and the modeling of protein interactions and signaling relationships by network analysis. The protocols use freely available, cross-platform software, are easy to implement, and do not require any programming expertise. Serving as data-driven starting points for further in-depth analysis, validation, and biological experimentation, these and related bioinformatic approaches can accelerate the functional interpretation of RPPA data.
LXtoo: an integrated live Linux distribution for the bioinformatics community
Yu, Guangchuang; Wang, Li-Gen; Meng, Xiao-Hua; He, Qing-Yu
2012-07-19
Recent advances in high-throughput technologies dramatically increase biological data generation. However, many research groups lack computing facilities and specialists. This is an obstacle that remains to be addressed. Here, we present a Linux distribution, LXtoo, to provide a flexible computing platform for bioinformatics analysis. Unlike most of the existing live Linux distributions for bioinformatics limiting their usage to sequence analysis and protein structure prediction, LXtoo incorporates a comprehensive collection of bioinformatics software, including data mining tools for microarray and proteomics, protein-protein interaction analysis, and computationally complex tasks like molecular dynamics. Moreover, most of the programs have been configured and optimized for high performance computing. LXtoo aims to provide well-supported computing environment tailored for bioinformatics research, reducing duplication of efforts in building computing infrastructure. LXtoo is distributed as a Live DVD and freely available at http://bioinformatics.jnu.edu.cn/LXtoo. PMID:22813356
Waters, Katrina M.; Liu, Tao; Quesenberry, Ryan D.; Willse, Alan R.; Bandyopadhyay, Somnath; Kathmann, Loel E.; Weber, Thomas J.; Smith, Richard D.; Wiley, H. Steven; Thrall, Brian D.
2012-01-01
To understand how integration of multiple data types can help decipher cellular responses at the systems level, we analyzed the mitogenic response of human mammary epithelial cells to epidermal growth factor (EGF) using whole genome microarrays, mass spectrometry-based proteomics and large-scale western blots with over 1000 antibodies. A time course analysis revealed significant differences in the expression of 3172 genes and 596 proteins, including protein phosphorylation changes measured by western blot. Integration of these disparate data types showed that each contributed qualitatively different components to the observed cell response to EGF and that varying degrees of concordance in gene expression and protein abundance measurements could be linked to specific biological processes. Networks inferred from individual data types were relatively limited, whereas networks derived from the integrated data recapitulated the known major cellular responses to EGF and exhibited more highly connected signaling nodes than networks derived from any individual dataset. While cell cycle regulatory pathways were altered as anticipated, we found the most robust response to mitogenic concentrations of EGF was induction of matrix metalloprotease cascades, highlighting the importance of the EGFR system as a regulator of the extracellular environment. These results demonstrate the value of integrating multiple levels of biological information to more accurately reconstruct networks of cellular response. PMID:22479638
Technological advances and genomics in metazoan parasites.
Knox, D P
2004-02-01
Molecular biology has provided the means to identify parasite proteins, to define their function, patterns of expression and the means to produce them in quantity for subsequent functional analyses. Whole genome and expressed sequence tag programmes, and the parallel development of powerful bioinformatics tools, allow the execution of genome-wide between stage or species comparisons and meaningful gene-expression profiling. The latter can be undertaken with several new technologies such as DNA microarray and serial analysis of gene expression. Proteome analysis has come to the fore in recent years providing a crucial link between the gene and its protein product. RNA interference and ballistic gene transfer are exciting developments which can provide the means to precisely define the function of individual genes and, of importance in devising novel parasite control strategies, the effect that gene knockdown will have on parasite survival.
Peterson, Elena S; McCue, Lee Ann; Schrimpe-Rutledge, Alexandra C; Jensen, Jeffrey L; Walker, Hyunjoo; Kobold, Markus A; Webb, Samantha R; Payne, Samuel H; Ansong, Charles; Adkins, Joshua N; Cannon, William R; Webb-Robertson, Bobbie-Jo M
2012-04-05
Background: The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information; however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA), a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes through the integration of proteomics and transcriptomics data with current genome location coordinates. Results: VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data are interrogated via searches linked to the genome visualizations to find regions with a high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA, or potential coding regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is applied to two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored using either proteomics data alone or in combination with transcriptomic data. Conclusions: VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data are evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php. PMID:22480257
Carmona, Santiago J.; Nielsen, Morten; Schafer-Nielsen, Claus; Mucci, Juan; Altcheh, Jaime; Balouz, Virginia; Tekiel, Valeria; Frasch, Alberto C.; Campetella, Oscar; Buscaglia, Carlos A.; Agüero, Fernán
2015-01-01
Complete characterization of antibody specificities associated to natural infections is expected to provide a rich source of serologic biomarkers with potential applications in molecular diagnosis, follow-up of chemotherapeutic treatments, and prioritization of targets for vaccine development. Here, we developed a highly-multiplexed platform based on next-generation high-density peptide microarrays to map these specificities in Chagas Disease, an exemplar of a human infectious disease caused by the protozoan Trypanosoma cruzi. We designed a high-density peptide microarray containing more than 175,000 overlapping 15mer peptides derived from T. cruzi proteins. Peptides were synthesized in situ on microarray slides, spanning the complete length of 457 parasite proteins with fully overlapped 15mers (1 residue shift). Screening of these slides with antibodies purified from infected patients and healthy donors demonstrated both a high technical reproducibility as well as epitope mapping consistency when compared with earlier low-throughput technologies. Using a conservative signal threshold to classify positive (reactive) peptides we identified 2,031 disease-specific peptides and 97 novel parasite antigens, effectively doubling the number of known antigens and providing a 10-fold increase in the number of fine mapped antigenic determinants for this disease. Finally, further analysis of the chip data showed that optimizing the amount of sequence overlap of displayed peptides can increase the protein space covered in a single chip by at least ∼threefold without sacrificing sensitivity. In conclusion, we show the power of high-density peptide chips for the discovery of pathogen-specific linear B-cell epitopes from clinical samples, thus setting the stage for high-throughput biomarker discovery screenings and proteome-wide studies of immune responses against pathogens. PMID:25922409
Profiling the humoral immune response of acute and chronic Q fever by protein microarray.
Vigil, Adam; Chen, Chen; Jain, Aarti; Nakajima-Sasaki, Rie; Jasinskas, Algimantas; Pablo, Jozelyn; Hendrix, Laura R; Samuel, James E; Felgner, Philip L
2011-10-01
Antigen profiling using comprehensive protein microarrays is a powerful tool for characterizing the humoral immune response to infectious pathogens. Coxiella burnetii is a CDC category B bioterrorist infectious agent with worldwide distribution. In order to assess the antibody repertoire of acute and chronic Q fever patients we have constructed a protein microarray containing 93% of the proteome of Coxiella burnetii, the causative agent of Q fever. Here we report the profile of the IgG and IgM seroreactivity in 25 acute Q fever patients in longitudinal samples. We found that both early and late time points of infection have a very consistent repertoire of IgM and IgG response, with a limited number of proteins undergoing increasing or decreasing seroreactivity. We also probed a large collection of acute and chronic Q fever patient samples and identified serological markers that can differentiate between the two disease states. In this comparative analysis we confirmed the identity of numerous IgG biomarkers of acute infection, identified novel IgG biomarkers for acute and chronic infections, and profiled for the first time the IgM antibody repertoire for both acute and chronic Q fever. Using these results we were able to devise a test that can distinguish acute from chronic Q fever. These results also provide a unique perspective on isotype switch and demonstrate the utility of protein microarrays for simultaneously examining the dynamic humoral immune response against thousands of proteins from a large number of patients. The results presented here identify novel seroreactive antigens for the development of recombinant protein-based diagnostics and subunit vaccines, and provide insight into the development of the antibody response.
Hall, Neil; Karras, Marianna; Raine, J Dale; Carlton, Jane M; Kooij, Taco W A; Berriman, Matthew; Florens, Laurence; Janssen, Christoph S; Pain, Arnab; Christophides, Georges K; James, Keith; Rutherford, Kim; Harris, Barbara; Harris, David; Churcher, Carol; Quail, Michael A; Ormond, Doug; Doggett, Jon; Trueman, Holly E; Mendoza, Jacqui; Bidwell, Shelby L; Rajandream, Marie-Adele; Carucci, Daniel J; Yates, John R; Kafatos, Fotis C; Janse, Chris J; Barrell, Bart; Turner, C Michael R; Waters, Andrew P; Sinden, Robert E
2005-01-07
Plasmodium berghei and Plasmodium chabaudi are widely used model malaria species. Comparison of their genomes, integrated with proteomic and microarray data, with the genomes of Plasmodium falciparum and Plasmodium yoelii revealed a conserved core of 4500 Plasmodium genes in the central regions of the 14 chromosomes and highlighted genes evolving rapidly because of stage-specific selective pressures. Four strategies for gene expression are apparent during the parasites' life cycle: (i) housekeeping; (ii) host-related; (iii) strategy-specific related to invasion, asexual replication, and sexual development; and (iv) stage-specific. We observed posttranscriptional gene silencing through translational repression of messenger RNA during sexual development, and a 47-base 3' untranslated region motif is implicated in this process.
Cell-Cell Interactions during pollen tube guidance
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daphne Preuss
The long-term goal of this research is to identify the signaling molecules that mediate plant cell-cell interactions during pollination. The immediate goals of this project are to perform genetic and molecular analysis of pollen tube guidance. Specifically, we proposed to: 1. Characterize the pistil components that direct pollen tube navigation using the Arabidopsis thaliana in vitro pollen tube guidance system. 2. Identify pistil signals that direct pollen tube guidance by a) using microarrays to profile gene expression in developing pistils, and b) employing proteomics and metabolomics to isolate pollen tube guidance signals. 3. Explore the genetic basis of natural variation in guidance signals, comparing the in vitro interactions between pollen and pistils from A. thaliana and its close relatives.
Pan, I-Chun; Tsai, Huei-Hsuan; Cheng, Ya-Tan; Wen, Tuan-Nan; Buckhout, Thomas J.; Schmidt, Wolfgang
2015-01-01
Acclimation to changing environmental conditions is mediated by proteins, the abundance of which is carefully tuned by an elaborate interplay of DNA-templated and post-transcriptional processes. To dissect the mechanisms that control and mediate cellular iron homeostasis, we conducted quantitative high-resolution iTRAQ proteomics and microarray-based transcriptomic profiling of iron-deficient Arabidopsis thaliana plants. A total of 13,706 and 12,124 proteins were identified with a quadrupole-Orbitrap hybrid mass spectrometer in roots and leaves, respectively. This deep proteomic coverage allowed accurate estimates of post-transcriptional regulation in response to iron deficiency. Similarly regulated transcripts were detected in only 13% (roots) and 11% (leaves) of the 886 proteins that differentially accumulated between iron-sufficient and iron-deficient plants, indicating that the majority of the iron-responsive proteins were post-transcriptionally regulated. Mutants harboring defects in RING DOMAIN LIGASE1 (RGLG1) and RING DOMAIN LIGASE2 (RGLG2) showed a pleiotropic phenotype that resembled iron-deficient plants, with reduced trichome density and the formation of branched root hairs. Proteomic and transcriptomic profiling of rglg1 rglg2 double mutants revealed that the functional RGLG protein is required for the regulation of a large set of iron-responsive proteins, including the coordinated expression of ribosomal proteins. This integrative analysis provides a detailed catalog of post-transcriptionally regulated proteins and allows the concept of a chiefly transcriptionally regulated iron deficiency response to be revisited. Protein data are available via ProteomeXchange with identifier PXD002126. PMID:26253232
Microarrays: Molecular allergology and nanotechnology for personalised medicine (II).
Lucas, J M
2010-01-01
Progress in nanotechnology and DNA recombination techniques has produced tools for the diagnosis and investigation of allergy at the molecular level. The most advanced examples of such progress are the microarray techniques, which have been expanded not only in research in the field of proteomics but also in application to the clinical setting. Microarrays of allergenic components offer results relating to hundreds of allergenic components in a single test, using a small amount of serum which can be obtained from capillary blood. The availability of new molecules will allow the development of panels including new allergenic components and sources, which will require evaluation for clinical use. Their application opens the door to component-based diagnosis, to the holistic perception of sensitisation as represented by molecular allergy, and to patient-centred medical practice by allowing great diagnostic accuracy and the definition of individualised immunotherapy for each patient. The present article reviews the application of allergenic component microarrays to allergology for diagnosis, management in the form of specific immunotherapy, and epidemiological studies. A review is also made of the use of protein and gene microarray techniques in basic research and in allergological diseases. Lastly, an evaluation is made of the challenges we face in introducing such techniques to clinical practice, and of the future perspectives of this new technology. Copyright 2010 SEICAP. Published by Elsevier Espana. All rights reserved.
Malinowski, Douglas P
2007-05-01
In recent years, the application of genomic and proteomic technologies to the problem of breast cancer prognosis and the prediction of therapy response has begun to yield encouraging results. Independent studies employing transcriptional profiling of primary breast cancer specimens using DNA microarrays have identified gene expression profiles that correlate with clinical outcome in primary breast biopsy specimens. Recent advances in microarray technology have demonstrated reproducibility, making clinical applications more achievable. In this regard, one such DNA microarray device based upon a 70-gene expression signature was recently cleared by the US FDA for application to breast cancer prognosis. These DNA microarrays often employ at least 70 gene targets for transcriptional profiling and prognostic assessment in breast cancer. The use of PCR-based methods utilizing a small subset of genes has recently demonstrated the ability to predict the clinical outcome in early-stage breast cancer. Furthermore, protein-based immunohistochemistry methods have progressed from using gene clusters and gene expression profiling to smaller subsets of expressed proteins to predict prognosis in early-stage breast cancer. Beyond prognostic applications, DNA microarray-based transcriptional profiling has demonstrated the ability to predict response to chemotherapy in early-stage breast cancer patients. In this review, recent advances in the use of multiple markers for prognosis of disease recurrence in early-stage breast cancer and the prediction of therapy response will be discussed.
Predicting breast cancer using an expression values weighted clinical classifier.
Thomas, Minta; De Brabanter, Kris; Suykens, Johan A K; De Moor, Bart
2014-12-31
Clinical data, such as patient history, laboratory analysis, and ultrasound parameters, which are the basis of day-to-day clinical decision support, are often used to guide the clinical management of cancer in the presence of microarray data. Several data fusion techniques are available to integrate genomics or proteomics data, but only a few studies have created a single prediction model using both gene expression and clinical data. These studies often remain inconclusive regarding an obtained improvement in prediction performance. To improve clinical management, these data should be fully exploited. This requires efficient algorithms to integrate these data sets and design a final classifier. LS-SVM classifiers and generalized eigenvalue/singular value decompositions are successfully used in many bioinformatics applications for prediction tasks. Building on the benefits of these two techniques, we propose a machine learning approach, a weighted LS-SVM classifier, to integrate two data sources: microarray and clinical parameters. We compared and evaluated the proposed methods on five breast cancer case studies. Compared to LS-SVM classifiers on individual data sets, generalized eigenvalue decomposition (GEVD) and kernel GEVD, the proposed weighted LS-SVM classifier offers good prediction performance, in terms of test area under the ROC curve (AUC), on all breast cancer case studies. Thus, a clinical classifier weighted with a microarray data set results in significantly improved diagnosis, prognosis and prediction of responses to therapy. The proposed model has been shown to be a promising mathematical framework for both data fusion and non-linear classification problems.
Meta-analysis of pathway enrichment: combining independent and dependent omics data sets.
Kaever, Alexander; Landesfeind, Manuel; Feussner, Kirstin; Morgenstern, Burkhard; Feussner, Ivo; Meinicke, Peter
2014-01-01
A major challenge in current systems biology is the combination and integrative analysis of large data sets obtained from different high-throughput omics platforms, such as mass spectrometry based Metabolomics and Proteomics or DNA microarray or RNA-seq-based Transcriptomics. Especially in the case of non-targeted Metabolomics experiments, where it is often impossible to unambiguously map ion features from mass spectrometry analysis to metabolites, the integration of more reliable omics technologies is highly desirable. A popular method for the knowledge-based interpretation of single data sets is the (Gene) Set Enrichment Analysis. In order to combine the results from different analyses, we introduce a methodical framework for the meta-analysis of p-values obtained from Pathway Enrichment Analysis (Set Enrichment Analysis based on pathways) of multiple dependent or independent data sets from different omics platforms. For dependent data sets, e.g. obtained from the same biological samples, the framework utilizes a covariance estimation procedure based on the nonsignificant pathways in single data set enrichment analysis. The framework is evaluated and applied in the joint analysis of Metabolomics mass spectrometry and Transcriptomics DNA microarray data in the context of plant wounding. In extensive studies of simulated data set dependence, the introduced correlation could be fully reconstructed by means of the covariance estimation based on pathway enrichment. By restricting the range of p-values of pathways considered in the estimation, the overestimation of correlation, which is introduced by the significant pathways, could be reduced. When applying the proposed methods to the real data sets, the meta-analysis was shown not only to be a powerful tool to investigate the correlation between different data sets and summarize the results of multiple analyses but also to distinguish experiment-specific key pathways.
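To make the idea of combining pathway p-values from dependent data sets concrete, here is a minimal sketch. The abstract does not name the combination rule, so this uses Stouffer's Z method with a correlation adjustment, a standard textbook choice that is assumed here and may differ from the authors' framework. The function name `stouffer_combined_p` and the correlation matrices are hypothetical; in the paper the covariance is estimated from nonsignificant pathways, which is not reproduced here.

```python
from math import sqrt
from statistics import NormalDist


def stouffer_combined_p(p_values, corr=None):
    """Combine one-sided p-values (each strictly between 0 and 1)
    with Stouffer's Z method.

    `corr` is an optional k x k correlation matrix of the z-scores;
    when omitted, the data sets are treated as independent.
    """
    std_normal = NormalDist()
    k = len(p_values)
    if corr is None:
        corr = [[1.0 if i == j else 0.0 for j in range(k)] for i in range(k)]
    # Convert each p-value to a z-score (small p gives large positive z).
    z_scores = [std_normal.inv_cdf(1.0 - p) for p in p_values]
    # Variance of the sum of z-scores equals the sum of all correlation entries.
    variance = sum(corr[i][j] for i in range(k) for j in range(k))
    z_combined = sum(z_scores) / sqrt(variance)
    return 1.0 - std_normal.cdf(z_combined)


# One pathway tested in two data sets (illustrative p-values).
independent = stouffer_combined_p([0.05, 0.05])
# Assumed correlation of 0.5 between the two data sets (hypothetical value).
dependent = stouffer_combined_p([0.05, 0.05], corr=[[1.0, 0.5], [0.5, 1.0]])
print(independent, dependent)
```

Ignoring a positive correlation makes the combined p-value too small, which is why a dependence estimate matters when the data sets come from the same biological samples.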
Kratochwill, Klaus; Bender, Thorsten O; Lichtenauer, Anton M; Herzog, Rebecca; Tarantino, Silvia; Bialas, Katarzyna; Jörres, Achim; Aufricht, Christoph
2015-01-01
Recent research suggests that cytoprotective responses, such as expression of heat-shock proteins, might be inadequately induced in mesothelial cells by heat-sterilized peritoneal dialysis (PD) fluids. This study compares transcriptome data and multiple protein expression profiles to provide new insight into regulatory mechanisms. Two-dimensional difference gel electrophoresis (2D-DIGE) based proteomics and topic-defined gene expression microarray-based transcriptomics techniques were used to evaluate stress responses in human omental peritoneal mesothelial cells in response to heat- or filter-sterilized PD fluids. Data from selected heat-shock proteins were validated by 2D western-blot analysis. Comparison of proteomics and transcriptomics data discriminated differentially regulated protein abundance into groups depending on correlating or noncorrelating transcripts. Inadequate abundance of several heat-shock proteins following exposure to heat-sterilized PD fluids is not reflected at the mRNA level, indicating interference beyond transcriptional regulation. For the first time, this study describes evidence for posttranscriptional inadequacy of heat-shock protein expression by heat-sterilized PD fluids as a novel cytotoxic property. Cross-omics technologies introduce a novel way of understanding PD fluid (PDF) bioincompatibility and searching for new interventions to reestablish adequate cytoprotective responses.
Protein profiling in serum after traumatic brain injury in rats reveals potential injury markers.
Thelin, Eric Peter; Just, David; Frostell, Arvid; Häggmark-Månberg, Anna; Risling, Mårten; Svensson, Mikael; Nilsson, Peter; Bellander, Bo-Michael
2018-03-15
The serum proteome following traumatic brain injury (TBI) could provide information for outcome prediction and injury monitoring. The aim of this affinity proteomic study was to identify serum proteins over time and between normoxic and hypoxic conditions in focal TBI. Sprague Dawley rats (n=73) received a 3 mm deep controlled cortical impact ("severe injury"). Following injury, the rats inhaled either a normoxic (22% O2) or hypoxic (11% O2) air mixture for 30 min before resuscitation. The rats were sacrificed at day 1, 3, 7, 14 and 28 after trauma. A total of 204 antibodies targeting 143 unique proteins of interest in TBI research were selected. The sample proteome was analyzed in a suspension bead array set-up. Comparative statistics and factor analysis were used to detect differences as well as variance in the data. We found that complement factor 9 (C9), complement factor B (CFB) and aldolase c (ALDOC) were detected at higher levels in the first days after trauma. In contrast, hypoxia inducing factor (HIF)1α, amyloid precursor protein (APP) and WBSCR17 increased over the subsequent weeks. S100A9 levels were higher in hypoxic compared with normoxic rats, together with a majority of the analyzed proteins, although few reached statistical significance. The principal component analysis revealed a variance in the data, highlighting clusters of proteins. Protein profiling of serum following TBI using an antibody-based microarray revealed temporal changes of several proteins over an extended period of up to four weeks. Further studies are warranted to confirm our findings. Copyright © 2016 The Author(s). Published by Elsevier B.V. All rights reserved.
Ozerov, Ivan V; Lezhnina, Ksenia V; Izumchenko, Evgeny; Artemov, Artem V; Medintsev, Sergey; Vanhaelen, Quentin; Aliper, Alexander; Vijg, Jan; Osipov, Andreyan N; Labat, Ivan; West, Michael D; Buzdin, Anton; Cantor, Charles R; Nikolsky, Yuri; Borisov, Nikolay; Irincheeva, Irina; Khokhlovich, Edward; Sidransky, David; Camargo, Miguel Luiz; Zhavoronkov, Alex
2016-11-16
Signalling pathway activation analysis is a powerful approach for extracting biologically relevant features from large-scale transcriptomic and proteomic data. However, modern pathway-based methods often fail to provide stable pathway signatures of a specific phenotype or reliable disease biomarkers. In the present study, we introduce the in silico Pathway Activation Network Decomposition Analysis (iPANDA) as a scalable robust method for biomarker identification using gene expression data. The iPANDA method combines precalculated gene coexpression data with gene importance factors based on the degree of differential gene expression and pathway topology decomposition for obtaining pathway activation scores. Using Microarray Analysis Quality Control (MAQC) data sets and pretreatment data on Taxol-based neoadjuvant breast cancer therapy from multiple sources, we demonstrate that iPANDA provides significant noise reduction in transcriptomic data and identifies highly robust sets of biologically relevant pathway signatures. We successfully apply iPANDA for stratifying breast cancer patients according to their sensitivity to neoadjuvant therapy.
Qi, Yong; Xiong, Xiaolu; Wang, Xile; Duan, Changsong; Jia, Yinjun; Jiao, Jun; Gong, Wenping; Wen, Bohai
2013-01-01
Background: Rickettsia heilongjiangensis, the agent of Far-Eastern spotted fever (FESF), is an obligate intracellular bacterium. The surface-exposed proteins (SEPs) of rickettsiae are involved in rickettsial adherence to and invasion of host cells, intracellular bacterial growth, and/or interaction with immune cells. They are also potential molecular candidates for the development of diagnostic reagents and vaccines against rickettsiosis. Methods: R. heilongjiangensis SEPs were identified by biotin-streptavidin affinity purification and 2D electrophoresis coupled with ESI-MS/MS. Recombinant SEPs were probed with various sera to analyze their serological characteristics using a protein microarray and an enzyme-linked immunosorbent assay (ELISA). Results: Twenty-five SEPs were identified, most of which were predicted to reside on the surface of R. heilongjiangensis cells. Bioinformatics analysis suggests that these proteins could be involved in bacterial pathogenesis. Eleven of the 25 SEPs were recognized as major seroreactive antigens by sera from R. heilongjiangensis-infected mice and FESF patients. Among the major seroreactive SEPs, microarray assays and/or ELISAs revealed that GroEL, OmpA-2, OmpB-3, PrsA, RplY, RpsB, SurA and YbgF had modest sensitivity and specificity for recognizing R. heilongjiangensis infection and/or spotted fever. Conclusions: Many of the SEPs identified herein have potentially important roles in R. heilongjiangensis pathogenicity. Some of them have potential as serodiagnostic antigens or as subunit vaccine antigens against the disease. PMID:23894656
Zhang, Min; Zhang, Lin; Zou, Jinfeng; Yao, Chen; Xiao, Hui; Liu, Qing; Wang, Jing; Wang, Dong; Wang, Chenguang; Guo, Zheng
2009-07-01
According to current consistency metrics such as percentage of overlapping genes (POG), lists of differentially expressed genes (DEGs) detected from different microarray studies for a complex disease are often highly inconsistent. This irreproducibility problem also exists in other high-throughput post-genomic areas such as proteomics and metabolism. A complex disease is often characterized by many coordinated molecular changes, which should be considered when evaluating the reproducibility of discovery lists from different studies. We proposed two metrics, percentage of overlapping genes-related (POGR) and normalized POGR (nPOGR), to evaluate the consistency between two DEG lists for a complex disease, considering correlated molecular changes rather than only counting gene overlaps between the lists. Based on microarray datasets of three diseases, we showed that although the POG scores for DEG lists from different studies for each disease are extremely low, the POGR and nPOGR scores can be rather high, suggesting that the apparently inconsistent DEG lists may be highly reproducible in the sense that they are actually significantly correlated. Assessing different discovery results for a disease with the POGR and nPOGR scores should reduce the uncertainty of microarray studies. The proposed metrics could also be applicable in many other high-throughput post-genomic areas.
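As a point of reference for the baseline metric mentioned in this abstract, the sketch below computes a simple POG score. It assumes one common formulation (shared genes divided by the size of the reference list) and ignores the direction of regulation. The function name `pog` is hypothetical, and the gene symbols are arbitrary illustrative lists, not results from any study. POGR and nPOGR, which additionally credit correlated genes, are not reproduced because the abstract does not define them in enough detail.

```python
def pog(reference, other):
    """Fraction of genes in `reference` that also appear in `other`."""
    reference, other = set(reference), set(other)
    if not reference:
        raise ValueError("reference list is empty")
    return len(reference & other) / len(reference)


# Illustrative DEG lists from two hypothetical studies.
list_a = ["TP53", "BRCA1", "MYC", "EGFR"]
list_b = ["MYC", "KRAS", "TP53", "PTEN", "CDK4"]

# The score is asymmetric because the denominators differ.
print(pog(list_a, list_b), pog(list_b, list_a))
```

Because only exact overlaps count, two lists that capture the same coordinated biological change through different but correlated genes would still score low, which is the limitation the proposed POGR metrics are meant to address.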
Integrated Omic Analysis of a Guinea Pig Model of Heart Failure and Sudden Cardiac Death.
Foster, D Brian; Liu, Ting; Kammers, Kai; O'Meally, Robert; Yang, Ni; Papanicolaou, Kyriakos N; Talbot, C Conover; Cole, Robert N; O'Rourke, Brian
2016-09-02
Here, we examine key regulatory pathways underlying the transition from compensated hypertrophy (HYP) to decompensated heart failure (HF) and sudden cardiac death (SCD) in a guinea pig pressure-overload model by integrated multiome analysis. Relative protein abundances from sham-operated, HYP, and HF hearts were assessed by iTRAQ LC-MS/MS. Metabolites were quantified by LC-MS/MS or GC-MS. Transcriptome profiles were obtained using mRNA microarrays. The guinea pig HF proteome exhibited classic biosignatures of cardiac HYP, left ventricular dysfunction, fibrosis, inflammation, and extravasation. Fatty acid metabolism, mitochondrial transcription/translation factors, antioxidant enzymes, and other mitochondrial processes were downregulated in HF but not HYP. Proteins upregulated in HF implicate extracellular matrix remodeling, cytoskeletal remodeling, and acute phase inflammation markers. Among metabolites, acylcarnitines were downregulated in HYP and fatty acids accumulated in HF. The correlation of transcript and protein changes in HF was weak (R² = 0.23), suggesting post-transcriptional gene regulation in HF. Proteome/metabolome integration indicated metabolic bottlenecks in fatty acyl-CoA processing by carnitine palmitoyl transferase (CPT1B) as well as TCA cycle inhibition. On the basis of these findings, we present a model of cardiac decompensation involving impaired nuclear integration of Ca²⁺ and cyclic nucleotide signals that are coupled to mitochondrial metabolic and antioxidant defects through the CREB/PGC1α transcriptional axis.
Orme, Rowan P; Gates, Monte A; Fricker-Gates, Rosemary A
2010-08-15
Cell transplantation using stem cell-derived neurons is commonly viewed as a candidate therapy for neurodegenerative diseases. However, methods for differentiating stem cells into homogenous populations of neurons suitable for transplant remain elusive. This suggests that there are as yet unknown signalling factors working in vivo to specify neuronal cell fate during development. These factors could be manipulated to better differentiate stem cells into neural populations useful for therapeutic transplantation. Here a quantitative proteomics approach is described for investigating cell signalling in the developing central nervous system (CNS), using the embryonic ventral mesencephalon as a model. Briefly, total protein was extracted from embryonic ventral midbrain tissue before, during and after the birth of dopaminergic neurons, and digested using trypsin. Two-dimensional liquid chromatography, coupled with tandem mass spectrometry, was then used to identify proteins from the tryptic peptides. Isobaric tagging for relative and absolute quantification (iTRAQ) reagents were used to label the tryptic peptides and facilitate relative quantitative analysis. The success of the experiment was confirmed by the identification of proteins known to be expressed in the developing ventral midbrain, as well as by Western blotting, and immunolabelling of embryonic tissue sections. This method of protein discovery improves upon previous attempts to identify novel signalling factors through microarray analysis. Importantly, the methods described here could be applied to virtually any aspect of development. (c) 2010 Elsevier B.V. All rights reserved.
Randles, Michael J.; Woolf, Adrian S.; Huang, Jennifer L.; Byron, Adam; Humphries, Jonathan D.; Price, Karen L.; Kolatsi-Joannou, Maria; Collinson, Sophie; Denny, Thomas; Knight, David; Mironov, Aleksandr; Starborg, Toby; Korstanje, Ron; Humphries, Martin J.; Long, David A.
2015-01-01
Glomerular disease often features altered histologic patterns of extracellular matrix (ECM). Despite this, the potential complexities of the glomerular ECM in both health and disease are poorly understood. To explore whether genetic background and sex determine glomerular ECM composition, we investigated two mouse strains, FVB and B6, using RNA microarrays of isolated glomeruli combined with proteomic glomerular ECM analyses. These studies, undertaken in healthy young adult animals, revealed unique strain- and sex-dependent glomerular ECM signatures, which correlated with variations in levels of albuminuria and known predisposition to progressive nephropathy. Among the variation, we observed changes in netrin 4, fibroblast growth factor 2, tenascin C, collagen 1, meprin 1-α, and meprin 1-β. Differences in protein abundance were validated by quantitative immunohistochemistry and Western blot analysis, and the collective differences were not explained by mutations in known ECM or glomerular disease genes. Within the distinct signatures, we discovered a core set of structural ECM proteins that form multiple protein–protein interactions and are conserved from mouse to man. Furthermore, we found striking ultrastructural changes in glomerular basement membranes in FVB mice. Pathway analysis of merged transcriptomic and proteomic datasets identified potential ECM regulatory pathways involving inhibition of matrix metalloproteases, liver X receptor/retinoid X receptor, nuclear factor erythroid 2-related factor 2, notch, and cyclin-dependent kinase 5. These pathways may therefore alter ECM and confer susceptibility to disease. PMID:25896609
DOE Office of Scientific and Technical Information (OSTI.GOV)
Matheis, Katja A.; Com, Emmanuelle
2011-04-15
The European InnoMed-PredTox project was a collaborative effort between 15 pharmaceutical companies, 2 small and mid-sized enterprises, and 3 universities with the goal of delivering deeper insights into the molecular mechanisms of kidney and liver toxicity and to identify mechanism-linked diagnostic or prognostic safety biomarker candidates by combining conventional toxicological parameters with 'omics' data. Mechanistic toxicity studies with 16 different compounds, 2 dose levels, and 3 time points were performed in male Crl: WI(Han) rats. Three of the 16 investigated compounds, BI-3 (FP007SE), Gentamicin (FP009SF), and IMM125 (FP013NO), induced kidney proximal tubule damage (PTD). In addition to histopathology and clinical chemistry, transcriptomics microarray and proteomics 2D-DIGE analysis were performed. Data from the three PTD studies were combined for a cross-study and cross-omics meta-analysis of the target organ. The mechanistic interpretation of kidney PTD-associated deregulated transcripts revealed, in addition to previously described kidney damage transcript biomarkers such as KIM-1, CLU and TIMP-1, a number of additional deregulated pathways congruent with histopathology observations on a single animal basis, including a specific effect on the complement system. The identification of new, more specific biomarker candidates for PTD was most successful when transcriptomics data were used. Combining transcriptomics data with proteomics data added extra value.
Differential expression of intermediate filaments in the process of developing hepatic steatosis.
Park, Jung-Eun; Kim, Hyun Tae; Lee, Sujin; Lee, Ye-Suk; Choi, Ung-Kyu; Kang, Jeong Han; Choi, Soo Young; Kang, Tae-Cheon; Choi, Myung-Sook; Kwon, Oh-Shin
2011-07-01
Obesity causes changes in fatty acid metabolism that consequently lead to fatty liver. To identify the possible proteins involved in the processes of obesity, we performed a proteomic analysis of obesity-induced mouse liver. Male C57BL/6J mice that were fed a high-fat diet (HFD) for 24 wk developed hepatic steatosis characterized by a considerable increase in free fatty acid (FFA) and triglyceride levels. Body weights were measured weekly, and other measurements were taken at weeks 2, 6, 12, 16, and 24. 2-D-based proteomic analysis revealed that, compared with the normal diet (ND) (n=50), the high-fat diet (n=50) changed the expression of 12 proteins (8 upregulated and 4 downregulated, by a fold change of 1.5 or more, p<0.05). The most pronounced difference was observed in intermediate filament (IF) cytoskeleton proteins. In particular, vimentin (vim) as well as cytokeratins (CK-8 and CK-18) were significantly upregulated in obese animals. Moreover, the level of caspase-generated IF fragment was also positively correlated with the degree of steatosis. The results suggest a significant alteration in IF organization during the development of hepatic steatosis leading to inflammation. The expression profile of selected proteins including vim was validated by Western blot, microarray analysis, and hepatocyte morphology by immunohistochemistry. Our results suggest that vim, like CK-18, may be a useful marker for predicting obesity and liver disease. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Exploiting fluorescence for multiplex immunoassays on protein microarrays
NASA Astrophysics Data System (ADS)
Herbáth, Melinda; Papp, Krisztián; Balogh, Andrea; Matkó, János; Prechl, József
2014-09-01
Protein microarray technology is becoming the method of choice for identifying protein interaction partners, detecting specific proteins, carbohydrates and lipids, or for characterizing protein interactions and serum antibodies in a massively parallel manner. Availability of the well-established instrumentation of DNA arrays and development of new fluorescent detection instruments promoted the spread of this technique. Fluorescent detection has the advantage of high sensitivity, specificity, simplicity and wide dynamic range required by most measurements. Fluorescence through specifically designed probes and an increasing variety of detection modes offers an excellent tool for such microarray platforms. Measuring for example the level of antibodies, their isotypes and/or antigen specificity simultaneously can offer more complex and comprehensive information about the investigated biological phenomenon, especially if we take into consideration that hundreds of samples can be measured in a single assay. Not only body fluids, but also cell lysates, extracted cellular components, and intact living cells can be analyzed on protein arrays for monitoring functional responses to printed samples on the surface. As a rapidly evolving area, protein microarray technology offers a great bulk of information and new depth of knowledge. These are the features that endow protein arrays with wide applicability and robust sample analyzing capability. On the whole, protein arrays are emerging new tools not just in proteomics, but glycomics, lipidomics, and are also important for immunological research. In this review we attempt to summarize the technical aspects of planar fluorescent microarray technology along with the description of its main immunological applications.
Dai, Yilin; Guo, Ling; Li, Meng; Chen, Yi-Bu
2012-06-08
Microarray data analysis presents a significant challenge to researchers who are unable to use the powerful Bioconductor and its numerous tools due to their lack of knowledge of R language. Among the few existing software programs that offer a graphic user interface to Bioconductor packages, none have implemented a comprehensive strategy to address the accuracy and reliability issue of microarray data analysis due to the well known probe design problems associated with many widely used microarray chips. There is also a lack of tools that would expedite the functional analysis of microarray results. We present Microarray Я US, an R-based graphical user interface that implements over a dozen popular Bioconductor packages to offer researchers a streamlined workflow for routine differential microarray expression data analysis without the need to learn R language. In order to enable a more accurate analysis and interpretation of microarray data, we incorporated the latest custom probe re-definition and re-annotation for Affymetrix and Illumina chips. A versatile microarray results output utility tool was also implemented for easy and fast generation of input files for over 20 of the most widely used functional analysis software programs. Coupled with a well-designed user interface, Microarray Я US leverages cutting edge Bioconductor packages for researchers with no knowledge in R language. It also enables a more reliable and accurate microarray data analysis and expedites downstream functional analysis of microarray results.
Augustin, Regina; Lichtenthaler, Stefan F.; Greeff, Michael; Hansen, Jens; Wurst, Wolfgang; Trümbach, Dietrich
2011-01-01
The molecular mechanisms and genetic risk factors underlying Alzheimer's disease (AD) pathogenesis are only partly understood. To identify new factors, which may contribute to AD, different approaches are taken including proteomics, genetics, and functional genomics. Here, we used a bioinformatics approach and found that distinct AD-related genes share modules of transcription factor binding sites, suggesting a transcriptional coregulation. To detect additional coregulated genes, which may potentially contribute to AD, we established a new bioinformatics workflow with known multivariate methods like support vector machines, biclustering, and predicted transcription factor binding site modules by using in silico analysis and over 400 expression arrays from human and mouse. Two significant modules are composed of three transcription factor families: CTCF, SP1F, and EGRF/ZBPF, which are conserved between human and mouse APP promoter sequences. The specific combination of in silico promoter and multivariate analysis can identify regulation mechanisms of genes involved in multifactorial diseases. PMID:21559189
Pathogen profiling for disease management and surveillance.
Sintchenko, Vitali; Iredell, Jonathan R; Gilbert, Gwendolyn L
2007-06-01
The usefulness of rapid pathogen genotyping is widely recognized, but its effective interpretation and application requires integration into clinical and public health decision-making. How can pathogen genotyping data best be translated to inform disease management and surveillance? Pathogen profiling integrates microbial genomics data into communicable disease control by consolidating phenotypic identity-based methods with DNA microarrays, proteomics, metabolomics and sequence-based typing. Sharing data on pathogen profiles should facilitate our understanding of transmission patterns and the dynamics of epidemics.
Differential expression of genes and proteins associated with wool follicle cycling.
Liu, Nan; Li, Hegang; Liu, Kaidong; Yu, Juanjuan; Cheng, Ming; De, Wei; Liu, Jifeng; Shi, Shuyan; He, Yanghua; Zhao, Jinshan
2014-08-01
Sheep are valuable resources for the wool industry. Wool growth of Aohan fine wool sheep cycles with the seasons over the course of 1 year. Therefore, identifying genes that control wool growth cycling might lead to ways of improving the quality and yield of fine wool. In this study, we employed Agilent sheep gene expression microarray and proteomic technology to compare the gene expression patterns of the body side skins at the August and December time points in Aohan fine wool sheep (a Chinese indigenous breed). The microarray study revealed that 2,223 transcripts were differentially expressed, including 1,162 up-regulated and 1,061 down-regulated transcripts, comparing body side skin at the August time point to the December one (A/D) in Aohan fine wool sheep. Seven differentially expressed genes were then selected to validate the reliability of the gene chip data. The majority of the genes possibly related to follicle development and wool growth could be assigned to categories including regulation of receptor binding, extracellular region, protein binding and extracellular space. The proteomic study revealed that 84 protein spots showed significant differences in expression levels. Of the 84, 63 protein spots were upregulated and 21 were downregulated in A/D. Finally, 55 protein spots were determined through MALDI-TOF/MS analyses. Furthermore, the regulation mechanism of hair follicle might resemble that of fetation.
Opposing roles of the aldo-keto reductases AKR1B1 and AKR1B10 in colorectal cancer.
Taskoparan, Betul; Seza, Esin Gulce; Demirkol, Secil; Tuncer, Sinem; Stefek, Milan; Gure, Ali Osmay; Banerjee, Sreeparna
2017-12-01
Aldo-keto reductases (including AKR1B1 and AKR1B10) constitute a family of oxidoreductases that have been implicated in the pathophysiology of diabetes and cancer, including colorectal cancer (CRC). Available data indicate that, despite their similarities in structure and enzymatic functions, their roles in CRC may be divergent. Here, we aimed to determine the expression and functional implications of AKR1B1 and AKR1B10 in CRC. AKR1B1 and AKR1B10 gene expression levels were analyzed using publicly available microarray data and ex vivo CRC-derived cDNA samples. Gene Set Enrichment Analysis (GSEA), The Cancer Genome Atlas (TCGA) RNA-seq data and The Cancer Proteome Atlas (TCPA) proteome data were analyzed to determine the effect of high and low AKR1B1 and AKR1B10 expression levels in CRC patients. Proliferation, cell cycle progression, cellular motility, adhesion and inflammation were determined in CRC-derived cell lines in which these genes were either exogenously overexpressed or silenced. We found that the expression of AKR1B1 was unaltered, whereas that of AKR1B10 was decreased in primary CRCs. GSEA revealed that, while high AKR1B1 expression was associated with increased cell cycle progression, cellular motility and inflammation, high AKR1B10 expression was associated with a weak inflammatory phenotype. Functional studies carried out in CRC-derived cell lines confirmed these data. Microarray data analysis indicated that high expression levels of AKR1B1 and AKR1B10 were significantly associated with shorter and longer disease-free survival rates, respectively. A combined gene expression signature of AKR1B10 (low) and AKR1B1 (high) showed a better prognostic stratification of CRC patients independent of confounding factors. Despite their similarities, the expression levels and functions of AKR1B1 and AKR1B10 are highly divergent in CRC, and they may have prognostic implications.
Nicolau, Monica; Levine, Arnold J; Carlsson, Gunnar
2011-04-26
High-throughput biological data, whether generated as sequencing, transcriptional microarrays, proteomic, or other means, continues to require analytic methods that address its high dimensional aspects. Because the computational part of data analysis ultimately identifies shape characteristics in the organization of data sets, the mathematics of shape recognition in high dimensions continues to be a crucial part of data analysis. This article introduces a method that extracts information from high-throughput microarray data and, by using topology, provides greater depth of information than current analytic techniques. The method, termed Progression Analysis of Disease (PAD), first identifies robust aspects of cluster analysis, then goes deeper to find a multitude of biologically meaningful shape characteristics in these data. Additionally, because PAD incorporates a visualization tool, it provides a simple picture or graph that can be used to further explore these data. Although PAD can be applied to a wide range of high-throughput data types, it is used here as an example to analyze breast cancer transcriptional data. This identified a unique subgroup of Estrogen Receptor-positive (ER(+)) breast cancers that express high levels of c-MYB and low levels of innate inflammatory genes. These patients exhibit 100% survival and no metastasis. No supervised step beyond distinction between tumor and healthy patients was used to identify this subtype. The group has a clear and distinct, statistically significant molecular signature, it highlights coherent biology but is invisible to cluster methods, and does not fit into the accepted classification of Luminal A/B, Normal-like subtypes of ER(+) breast cancers. We denote the group as c-MYB(+) breast cancer.
Overcoming antifungal resistance
Srinivasan, Anand; Lopez-Ribot, Jose L.; Ramasubramanian, Anand K.
2014-01-01
Fungal infections have become one of the major causes of morbidity and mortality in immunocompromised patients. Despite increased awareness and improved treatment strategies, the frequent development of resistance to the antifungal drugs used in clinical settings contributes to the increasing toll of mycoses. Although a natural phenomenon, antifungal drug resistance can compromise advances in the development of effective diagnostic techniques and novel antifungals. In this review, we will discuss the advent of cellular-microarrays, microfluidics, genomics, proteomics and other state-of-the art technologies in conquering antifungal drug resistance. PMID:24847655
2009-07-01
Award Number: W81XWH-08-1-0048 TITLE: Functional...cancer human cell lines, T98G and U2OS, in order to test for cell-line specific effects. After infecting equal number of cells with the viruses carrying... Guava™ FACS analyzer with the ViaCount™ kit (Millipore). As shown in Fig. 7, VCaP cells infected with ERG or ERGa once again failed to proliferate
[Neoadjuvant therapy for esophageal cancer - indication and efficacy].
Kato, Ken; Hamaguchi, Tetsuya; Yamada, Yasuhide; Shirao, Kuniaki; Shimada, Yasuhiro
2007-10-01
Several approaches, such as adjuvant chemotherapy, neoadjuvant chemotherapy and neoadjuvant chemoradiotherapy, have been tried to improve the efficacy of treatment for resectable esophageal cancer patients. The usefulness of neoadjuvant chemotherapy has remained a matter of controversy. However, there is a report from JCOG9907 in Japan that two courses of neoadjuvant 5-FU/CDDP improved the survival of esophageal squamous cell cancer patients. Neoadjuvant chemoradiotherapy has not had a consistent evaluation because of the varying results of each trial. But based on the results of meta-analysis and CALGB9781, the neoadjuvant chemoradiotherapy called "trimodality therapy" has been a standard treatment in the United States. We should evaluate whether there would be similar effectiveness in Japan, where the histology and operative approach are different. Some approaches such as DNA microarray and proteomics, which can predict the treatment effect, are being tried.
Günther, Oliver P; Chen, Virginia; Freue, Gabriela Cohen; Balshaw, Robert F; Tebbutt, Scott J; Hollander, Zsuzsanna; Takhar, Mandeep; McMaster, W Robert; McManus, Bruce M; Keown, Paul A; Ng, Raymond T
2012-12-08
Biomarker panels derived separately from genomic and proteomic data and with a variety of computational methods have demonstrated promising classification performance in various diseases. An open question is how to create effective proteo-genomic panels. The framework of ensemble classifiers has been applied successfully in various analytical domains to combine classifiers so that the performance of the ensemble exceeds the performance of individual classifiers. Using blood-based diagnosis of acute renal allograft rejection as a case study, we address the following question in this paper: Can acute rejection classification performance be improved by combining individual genomic and proteomic classifiers in an ensemble? The first part of the paper presents a computational biomarker development pipeline for genomic and proteomic data. The pipeline begins with data acquisition (e.g., from bio-samples to microarray data), quality control, statistical analysis and mining of the data, and finally various forms of validation. The pipeline ensures that the various classifiers to be combined later in an ensemble are diverse and adequate for clinical use. Five mRNA genomic and five proteomic classifiers were developed independently using single time-point blood samples from 11 acute-rejection and 22 non-rejection renal transplant patients. The second part of the paper examines five ensembles ranging in size from two to 10 individual classifiers. Performance of ensembles is characterized by area under the curve (AUC), sensitivity, and specificity, as derived from the probability of acute rejection for individual classifiers in the ensemble in combination with one of two aggregation methods: (1) Average Probability or (2) Vote Threshold. One ensemble demonstrated superior performance and was able to improve sensitivity and AUC beyond the best values observed for any of the individual classifiers in the ensemble, while staying within the range of observed specificity. The Vote Threshold aggregation method achieved improved sensitivity for all 5 ensembles, but typically at the cost of decreased specificity. Proteo-genomic biomarker ensemble classifiers show promise in the diagnosis of acute renal allograft rejection and can improve classification performance beyond that of individual genomic or proteomic classifiers alone. Validation of our results in an international multicenter study is currently underway. PMID:23216969
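As a minimal sketch of the two aggregation methods named in this abstract, the following Python shows one way Average Probability and Vote Threshold could be computed from per-classifier probabilities of acute rejection. The probabilities, the 0.5 cutoff, the one-vote minimum, and all function names are illustrative assumptions; none of them are taken from the study.

```python
from statistics import mean


def average_probability(probs, cutoff=0.5):
    """Call rejection when the mean classifier probability reaches the cutoff."""
    return mean(probs) >= cutoff


def vote_threshold(probs, cutoff=0.5, min_votes=1):
    """Call rejection when at least `min_votes` classifiers reach the cutoff."""
    votes = sum(1 for p in probs if p >= cutoff)
    return votes >= min_votes


def sensitivity_specificity(predictions, labels):
    """Return (sensitivity, specificity) for boolean predictions and labels."""
    tp = sum(1 for p, y in zip(predictions, labels) if p and y)
    tn = sum(1 for p, y in zip(predictions, labels) if not p and not y)
    positives = sum(1 for y in labels if y)
    negatives = len(labels) - positives
    return tp / positives, tn / negatives


# Hypothetical data: three classifier probabilities per patient, plus the
# true status (True = acute rejection). Invented for illustration only.
patients = [
    ([0.9, 0.4, 0.3], True),
    ([0.6, 0.2, 0.1], True),
    ([0.7, 0.1, 0.1], False),
    ([0.2, 0.3, 0.1], False),
]
labels = [status for _, status in patients]

avg_calls = [average_probability(p) for p, _ in patients]
vote_calls = [vote_threshold(p) for p, _ in patients]

avg_result = sensitivity_specificity(avg_calls, labels)
vote_result = sensitivity_specificity(vote_calls, labels)
print("Average Probability (sensitivity, specificity):", avg_result)
print("Vote Threshold (sensitivity, specificity):", vote_result)
```

On this invented data a single vote is enough to call rejection, so Vote Threshold flags more patients than Average Probability. That raises sensitivity and lowers specificity, which is the kind of trade-off the abstract reports for the Vote Threshold method.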
Aptamer-based multiplexed proteomic technology for biomarker discovery.
Gold, Larry; Ayers, Deborah; Bertino, Jennifer; Bock, Christopher; Bock, Ashley; Brody, Edward N; Carter, Jeff; Dalby, Andrew B; Eaton, Bruce E; Fitzwater, Tim; Flather, Dylan; Forbes, Ashley; Foreman, Trudi; Fowler, Cate; Gawande, Bharat; Goss, Meredith; Gunn, Magda; Gupta, Shashi; Halladay, Dennis; Heil, Jim; Heilig, Joe; Hicke, Brian; Husar, Gregory; Janjic, Nebojsa; Jarvis, Thale; Jennings, Susan; Katilius, Evaldas; Keeney, Tracy R; Kim, Nancy; Koch, Tad H; Kraemer, Stephan; Kroiss, Luke; Le, Ngan; Levine, Daniel; Lindsey, Wes; Lollo, Bridget; Mayfield, Wes; Mehan, Mike; Mehler, Robert; Nelson, Sally K; Nelson, Michele; Nieuwlandt, Dan; Nikrad, Malti; Ochsner, Urs; Ostroff, Rachel M; Otis, Matt; Parker, Thomas; Pietrasiewicz, Steve; Resnicow, Daniel I; Rohloff, John; Sanders, Glenn; Sattin, Sarah; Schneider, Daniel; Singer, Britta; Stanton, Martin; Sterkel, Alana; Stewart, Alex; Stratford, Suzanne; Vaught, Jonathan D; Vrkljan, Mike; Walker, Jeffrey J; Watrobka, Mike; Waugh, Sheela; Weiss, Allison; Wilcox, Sheri K; Wolfson, Alexey; Wolk, Steven K; Zhang, Chi; Zichi, Dom
2010-12-07
The interrogation of proteomes ("proteomics") in a highly multiplexed and efficient manner remains a coveted and challenging goal in biology and medicine. We present a new aptamer-based proteomic technology for biomarker discovery capable of simultaneously measuring thousands of proteins from small sample volumes (15 µL of serum or plasma). Our current assay measures 813 proteins with low limits of detection (1 pM median), 7 logs of overall dynamic range (~100 fM-1 µM), and 5% median coefficient of variation. This technology is enabled by a new generation of aptamers that contain chemically modified nucleotides, which greatly expand the physicochemical diversity of the large randomized nucleic acid libraries from which the aptamers are selected. Proteins in complex matrices such as plasma are measured with a process that transforms a signature of protein concentrations into a corresponding signature of DNA aptamer concentrations, which is quantified on a DNA microarray. Our assay takes advantage of the dual nature of aptamers as both folded protein-binding entities with defined shapes and unique nucleotide sequences recognizable by specific hybridization probes. To demonstrate the utility of our proteomics biomarker discovery technology, we applied it to a clinical study of chronic kidney disease (CKD). We identified two well known CKD biomarkers as well as an additional 58 potential CKD biomarkers. These results demonstrate the potential utility of our technology to rapidly discover unique protein signatures characteristic of various disease states. We describe a versatile and powerful tool that allows large-scale comparison of proteome profiles among discrete populations. This unbiased and highly multiplexed search engine will enable the discovery of novel biomarkers in a manner that is unencumbered by our incomplete knowledge of biology, thereby helping to advance the next generation of evidence-based medicine.
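The dynamic range quoted in this abstract can be checked with simple arithmetic. The sketch below, in Python, uses the approximate endpoints given in the text (~100 fM and 1 µM); the function name is a hypothetical helper, and the endpoints are treated as exact only for the purpose of the calculation.

```python
import math


def dynamic_range_logs(low_molar, high_molar):
    """Return the span between two concentrations in orders of magnitude."""
    return math.log10(high_molar / low_molar)


# Approximate endpoints quoted in the abstract, converted to mol/L.
low = 100e-15   # ~100 fM
high = 1e-6     # 1 µM

span = dynamic_range_logs(low, high)
print(f"Dynamic range: {span:.1f} logs")
```

The ratio of 1 µM to 100 fM is 10^7, which agrees with the stated 7 logs of overall dynamic range.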
SALAD database: a motif-based database of protein annotations for plant comparative genomics
Mihara, Motohiro; Itoh, Takeshi; Izawa, Takeshi
2010-01-01
Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database (http://salad.dna.affrc.go.jp/salad/) from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME software from 209 529 protein-sequence annotation groups selected by BLASTP from the proteome data sets of 10 species: rice, sorghum, Arabidopsis thaliana, grape, a lycophyte, a moss, 3 algae, and yeast. Similarity clustering of each protein group was performed by pairwise scoring of the motif patterns of the sequences. The SALAD database provides a user-friendly graphical viewer that displays a motif pattern diagram linked to the resulting bootstrapped dendrogram for each protein group. Amino-acid-sequence-based and nucleotide-sequence-based phylogenetic trees for motif combination alignment, a logo comparison diagram for each clade in the tree, and a Pfam-domain pattern diagram are also available. We also developed a viewer named ‘SALAD on ARRAYs’ to view arbitrary microarray data sets of paralogous genes linked to the same dendrogram in a window. The SALAD database is a powerful tool for comparing protein sequences and can provide valuable hints for biological analysis. PMID:19854933
Galindo González, Leonardo M; El Kayal, Walid; Ju, Chelsea J-T; Allen, Carmen C G; King-Jones, Susanne; Cooke, Janice E K
2012-04-01
In the autumn, stems of woody perennials such as forest trees undergo a transition from active growth to dormancy. We used microarray transcriptomic profiling in combination with a proteomics analysis to elucidate processes that occur during this growth-to-dormancy transition in a conifer, white spruce (Picea glauca[Moench] Voss). Several differentially expressed genes were likely associated with the developmental transition that occurs during growth cessation in the cambial zone and the concomitant completion of cell maturation in vascular tissues. Genes encoding for cell wall and membrane biosynthetic enzymes showed transcript abundance patterns consistent with completion of cell maturation, and also of cell wall and membrane modifications potentially enabling cells to withstand the harsh conditions of winter. Several differentially expressed genes were identified that encoded putative regulators of cambial activity, cell development and of the photoperiodic pathway. Reconfiguration of carbon allocation figured centrally in the tree's overwintering preparations. For example, genes associated with carbon-based defences such as terpenoids were down-regulated, while many genes associated with protein-based defences and other stress mitigation mechanisms were up-regulated. Several of these correspond to proteins that were accumulated during the growth-to-dormancy transition, emphasizing the importance of stress protection in the tree's adaptive response to overwintering. © 2011 Blackwell Publishing Ltd.
Ranninger, Christina; Rurik, Marc; Limonciel, Alice; Ruzek, Silke; Reischl, Roland; Wilmes, Anja; Jennings, Paul; Hewitt, Philip; Dekant, Wolfgang; Kohlbacher, Oliver; Huber, Christian G.
2015-01-01
Untargeted metabolomics has the potential to improve the predictivity of in vitro toxicity models and therefore may aid the replacement of expensive and laborious animal models. Here we describe a long term repeat dose nephrotoxicity study conducted on the human renal proximal tubular epithelial cell line, RPTEC/TERT1, treated with 10 and 35 μmol·liter−1 of chloroacetaldehyde, a metabolite of the anti-cancer drug ifosfamide. Our study outlines the establishment of an automated and easy to use untargeted metabolomics workflow for HPLC-high resolution mass spectrometry data. Automated data analysis workflows based on open source software (OpenMS, KNIME) enabled a comprehensive and reproducible analysis of the complex and voluminous metabolomics data produced by the profiling approach. Time- and concentration-dependent responses were clearly evident in the metabolomic profiles. To obtain a more comprehensive picture of the mode of action, transcriptomics and proteomics data were also integrated. For toxicity profiling of chloroacetaldehyde, 428 and 317 metabolite features were detectable in positive and negative modes, respectively, after stringent removal of chemical noise and unstable signals. Changes upon treatment were explored using principal component analysis, and statistically significant differences were identified using linear models for microarray assays. The analysis revealed toxic effects only for the treatment with 35 μmol·liter−1 for 3 and 14 days. The most regulated metabolites were glutathione and metabolites related to the oxidative stress response of the cells. These findings are corroborated by proteomics and transcriptomics data, which show, among other things, an activation of the Nrf2 and ATF4 pathways. PMID:26055719
Wu, Wei-Sheng; Jhou, Meng-Jhun
2017-01-13
Missing value imputation is important for microarray data analyses because microarray data with missing values would significantly degrade the performance of the downstream analyses. Although many microarray missing value imputation algorithms have been developed, an objective and comprehensive performance comparison framework is still lacking. To solve this problem, we previously proposed a framework which can perform a comprehensive performance comparison of different existing algorithms. Also the performance of a new algorithm can be evaluated by our performance comparison framework. However, constructing our framework is not an easy task for the interested researchers. To save researchers' time and efforts, here we present an easy-to-use web tool named MVIAeval (Missing Value Imputation Algorithm evaluator) which implements our performance comparison framework. MVIAeval provides a user-friendly interface allowing users to upload the R code of their new algorithm and select (i) the test datasets among 20 benchmark microarray (time series and non-time series) datasets, (ii) the compared algorithms among 12 existing algorithms, (iii) the performance indices from three existing ones, (iv) the comprehensive performance scores from two possible choices, and (v) the number of simulation runs. The comprehensive performance comparison results are then generated and shown as both figures and tables. MVIAeval is a useful tool for researchers to easily conduct a comprehensive and objective performance evaluation of their newly developed missing value imputation algorithm for microarray data or any data which can be represented as a matrix form (e.g. NGS data or proteomics data). Thus, MVIAeval will greatly expedite the progress in the research of missing value imputation algorithms.
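The general idea of evaluating an imputation algorithm on a complete benchmark matrix can be sketched as follows: hide some known entries, impute them, and compare the imputed values with the hidden truth. This Python sketch is not MVIAeval's code. The row-mean imputer is a simple stand-in rather than one of the 12 compared algorithms, root-mean-square error is an assumed performance index, and all names and parameters are hypothetical.

```python
import math
import random


def row_mean_impute(matrix):
    """Fill None entries with the mean of the observed values in the same row."""
    filled = []
    for row in matrix:
        observed = [v for v in row if v is not None]
        fill = sum(observed) / len(observed) if observed else 0.0
        filled.append([fill if v is None else v for v in row])
    return filled


def evaluate_imputation(complete, impute, missing_rate=0.2, seed=0):
    """Hide a random fraction of entries, impute them, and return the RMSE."""
    rng = random.Random(seed)
    cells = [(i, j) for i, row in enumerate(complete) for j in range(len(row))]
    n_hidden = max(1, round(missing_rate * len(cells)))
    hidden = rng.sample(cells, n_hidden)

    # Copy the matrix and mask the selected entries.
    masked = [list(row) for row in complete]
    for i, j in hidden:
        masked[i][j] = None

    imputed = impute(masked)
    squared_errors = [(imputed[i][j] - complete[i][j]) ** 2 for i, j in hidden]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical expression matrix (genes x samples), invented for illustration.
expression = [
    [1.0, 1.2, 0.9, 1.1],
    [2.0, 2.4, 1.8, 2.2],
    [4.0, 3.6, 4.4, 4.1],
]

# Average the error over several simulation runs, each with a different mask.
runs = [evaluate_imputation(expression, row_mean_impute, seed=s) for s in range(5)]
print("Mean RMSE over runs:", sum(runs) / len(runs))
```

Passing the imputer in as a function mirrors the workflow described in the abstract, where a user supplies a new algorithm and it is scored under the same masking procedure as the existing ones.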
MINER: exploratory analysis of gene interaction networks by machine learning from expression data.
Kadupitige, Sidath Randeni; Leung, Kin Chun; Sellmeier, Julia; Sivieng, Jane; Catchpoole, Daniel R; Bain, Michael E; Gaëta, Bruno A
2009-12-03
The reconstruction of gene regulatory networks from high-throughput "omics" data has become a major goal in the modelling of living systems. Numerous approaches have been proposed, most of which attempt only "one-shot" reconstruction of the whole network with no intervention from the user, or offer only simple correlation analysis to infer gene dependencies. We have developed MINER (Microarray Interactive Network Exploration and Representation), an application that combines multivariate non-linear tree learning of individual gene regulatory dependencies, visualisation of these dependencies as both trees and networks, and representation of known biological relationships based on common Gene Ontology annotations. MINER allows biologists to explore the dependencies influencing the expression of individual genes in a gene expression data set in the form of decision, model or regression trees, using their domain knowledge to guide the exploration and formulate hypotheses. Multiple trees can then be summarised in the form of a gene network diagram. MINER is being adopted by several of our collaborators and has already led to the discovery of a new significant regulatory relationship with subsequent experimental validation. Unlike most gene regulatory network inference methods, MINER allows the user to start from genes of interest and build the network gene-by-gene, incorporating domain expertise in the process. This approach has been used successfully with RNA microarray data but is applicable to other quantitative data produced by high-throughput technologies such as proteomics and "next generation" DNA sequencing.
Lässer, Cecilia; Shelke, Ganesh Vilas; Yeri, Ashish; Kim, Dae-Kyum; Crescitelli, Rossella; Raimondo, Stefania; Sjöstrand, Margareta; Gho, Yong Song; Van Keuren Jensen, Kendall; Lötvall, Jan
2017-01-01
Cells secrete extracellular RNA (exRNA) to their surrounding environment and exRNA has been found in many body fluids such as blood, breast milk and cerebrospinal fluid. However, there are conflicting results regarding the nature of exRNA. Here, we have separated 2 distinct exRNA profiles released by mast cells, termed high-density (HD) and low-density (LD) exRNA. The exRNA in both fractions was characterized by microarray and next-generation sequencing. Both exRNA fractions contained mRNA and miRNA, and the mRNAs in the LD exRNA correlated closely with the cellular mRNA, whereas the HD mRNA did not. Furthermore, the HD exRNA was enriched in lincRNA, antisense RNA, vault RNA, snoRNA, and snRNA with little or no evidence of full-length 18S and 28S rRNA. The LD exRNA was enriched in mitochondrial rRNA, mitochondrial tRNA, tRNA, piRNA, Y RNA, and full-length 18S and 28S rRNA. The proteomes of the HD and LD exRNA-containing fractions were determined with LC-MS/MS and analyzed with Gene Ontology term finder, which showed that both proteomes were associated with the term extracellular vesicles. Electron microscopy suggests that at least part of the exRNA is associated with exosome-like extracellular vesicles. Additionally, the proteins in the HD fractions tended to be associated with the nucleus and ribosomes, whereas the LD fraction proteome tended to be associated with the mitochondrion. We show that the 2 exRNA signatures released by a single cell type can be separated by floatation on a density gradient. These results show that cells can release multiple types of exRNA with substantial differences in RNA species content. This is important for any future studies determining the nature and function of exRNA released from different cells under different conditions. PMID:27791479
Analysis of High-Throughput ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Daly, Don S.; Zangar, Richard C.
Our research group develops analytical methods and software for the high-throughput analysis of quantitative enzyme-linked immunosorbent assay (ELISA) microarrays. ELISA microarrays differ from DNA microarrays in several fundamental aspects and most algorithms for analysis of DNA microarray data are not applicable to ELISA microarrays. In this review, we provide an overview of the steps involved in ELISA microarray data analysis and how the statistically sound algorithms we have developed provide an integrated software suite to address the needs of each data-processing step. The algorithms discussed are available in a set of open-source software tools (http://www.pnl.gov/statistics/ProMAT).
Zhu, Yuerong; Zhu, Yuelin; Xu, Wei
2008-01-01
Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from . PMID:18218103
An Efficient Covalent Coating on Glass Slides for Preparation of Optical Oligonucleotide Microarrays
Pourjahed, Atefeh; Rabiee, Mohammad; Tahriri, Mohammadreza
2013-01-01
Objective(s): Microarrays are potential analytical tools for genomics and proteomics research, which need a suitable substrate for coating and hybridization of biomolecules. Materials and Methods: In this research, a thin film of oxidized agarose was prepared on glass slides that had previously been coated with poly-L-lysine (PLL). Some of the aldehyde groups of the activated agarose linked covalently to PLL amine groups and also bound to the amino groups of biomolecules. These linkages were fixed by UV irradiation. The prepared substrates were compared to agarose-only and PLL-only coated slides. Results: Atomic force microscopy (AFM) demonstrated that agarose provided a three-dimensional surface with higher loading and binding capacity for biomolecules than the two-dimensional PLL-coated surface. In addition, the signal-to-noise ratio in hybridization reactions performed on the agarose-PLL coated substrates increased twofold and fourfold compared to agarose and PLL coated substrates, respectively. Conclusion: The agarose-PLL microarrays had the highest signal (2546) and lowest background signal (205) in hybridization, suggesting that the prepared slides are suitable for analyzing a wide concentration range of analytes. PMID:24570832
Nares, Salvador; Moutsopoulos, Niki M.; Angelov, Nikola; Rangel, Zoila G.; Munson, Peter J.; Sinha, Neha; Wahl, Sharon M.
2009-01-01
Long-lived monocytes, macrophages, and dendritic cells (DCs) are Toll-like receptor-expressing, antigen-presenting cells derived from a common myeloid lineage that play key roles in innate and adaptive immune responses. Based on immunohistochemical and molecular analyses of inflamed tissues from patients with chronic destructive periodontal disease, these cells, found in the inflammatory infiltrate, may drive the progressive periodontal pathogenesis. To investigate early transcriptional signatures and subsequent proteomic responses to the periodontal pathogen, Porphyromonas gingivalis, donor-matched human blood monocytes, differentiated DCs, and macrophages were exposed to P. gingivalis lipopolysaccharide (LPS) and gene expression levels were measured by oligonucleotide microarrays. In addition to striking differences in constitutive transcriptional profiles between these myeloid populations, we identify a P. gingivalis LPS-inducible convergent, transcriptional core response of more than 400 annotated genes/ESTs among these populations, reflected by a shared, but quantitatively distinct, proteomic response. Nonetheless, clear differences emerged between the monocytes, DCs, and macrophages. The finding that long-lived myeloid inflammatory cells, particularly DCs, rapidly and aggressively respond to P. gingivalis LPS by generating chemokines, proteases, and cytokines capable of driving T-helper cell lineage polarization without evidence of corresponding immunosuppressive pathways highlights their prominent role in host defense and progressive tissue pathogenesis. The shared, unique, and/or complementary transcriptional and proteomic profiles may frame the context of the host response to P. gingivalis, contributing to the destructive nature of periodontal inflammation. PMID:19264901
Proteomic and genomic characterization of a yeast model for Ogden syndrome
Dörfel, Max J.; Fang, Han; Crain, Jonathan; Klingener, Michael; Weiser, Jake
2016-01-01
Abstract Naa10 is an Nα‐terminal acetyltransferase that, in a complex with its auxiliary subunit Naa15, co‐translationally acetylates the α‐amino group of newly synthesized proteins as they emerge from the ribosome. Roughly 40–50% of the human proteome is acetylated by Naa10, giving this enzyme one of the broadest substrate ranges known. Recently, we reported an X‐linked disorder of infancy, Ogden syndrome, in two families harbouring a c.109 T > C (p.Ser37Pro) variant in NAA10. In the present study we performed in‐depth characterization of a yeast model of Ogden syndrome. Stress tests and proteomic analyses suggest that the S37P mutation disrupts Naa10 function and reduces cellular fitness during heat shock, possibly owing to dysregulation of chaperone expression and accumulation. Microarray and RNA‐seq revealed a pseudo‐diploid gene expression profile in ΔNaa10 cells, probably responsible for a mating defect. In conclusion, the data presented here further support the disruptive nature of the S37P/Ogden mutation and identify affected cellular processes potentially contributing to the severe phenotype seen in Ogden syndrome. Data are available via GEO under identifier GSE86482 or with ProteomeXchange under identifier PXD004923. © 2016 The Authors. Yeast published by John Wiley & Sons, Ltd. PMID:27668839
Randles, Michael J; Woolf, Adrian S; Huang, Jennifer L; Byron, Adam; Humphries, Jonathan D; Price, Karen L; Kolatsi-Joannou, Maria; Collinson, Sophie; Denny, Thomas; Knight, David; Mironov, Aleksandr; Starborg, Toby; Korstanje, Ron; Humphries, Martin J; Long, David A; Lennon, Rachel
2015-12-01
Glomerular disease often features altered histologic patterns of extracellular matrix (ECM). Despite this, the potential complexities of the glomerular ECM in both health and disease are poorly understood. To explore whether genetic background and sex determine glomerular ECM composition, we investigated two mouse strains, FVB and B6, using RNA microarrays of isolated glomeruli combined with proteomic glomerular ECM analyses. These studies, undertaken in healthy young adult animals, revealed unique strain- and sex-dependent glomerular ECM signatures, which correlated with variations in levels of albuminuria and known predisposition to progressive nephropathy. Among the variation, we observed changes in netrin 4, fibroblast growth factor 2, tenascin C, collagen 1, meprin 1-α, and meprin 1-β. Differences in protein abundance were validated by quantitative immunohistochemistry and Western blot analysis, and the collective differences were not explained by mutations in known ECM or glomerular disease genes. Within the distinct signatures, we discovered a core set of structural ECM proteins that form multiple protein-protein interactions and are conserved from mouse to man. Furthermore, we found striking ultrastructural changes in glomerular basement membranes in FVB mice. Pathway analysis of merged transcriptomic and proteomic datasets identified potential ECM regulatory pathways involving inhibition of matrix metalloproteases, liver X receptor/retinoid X receptor, nuclear factor erythroid 2-related factor 2, notch, and cyclin-dependent kinase 5. These pathways may therefore alter ECM and confer susceptibility to disease. Copyright © 2015 by the American Society of Nephrology.
Zhang, Yu; Tang, Yin; Sun, Shuai; Wang, Zhihua; Wu, Wenjun; Zhao, Xiaodong; Czajkowsky, Daniel M; Li, Yan; Tian, Jianhui; Xu, Ling; Wei, Wei; Deng, Yuliang; Shi, Qihui
2015-10-06
The high glucose uptake and activation of oncogenic signaling pathways in cancer cells have long made these features, together with the mutational spectrum, prime diagnostic targets of circulating tumor cells (CTCs). Further, an ability to characterize these properties at single-cell resolution is widely believed to be essential, as the known extensive heterogeneity in CTCs can obscure important correlations in data obtained from cell population-based methods. However, to date, it has not been possible to quantitatively measure metabolic, proteomic, and genetic data from a single CTC. Here we report a microchip-based approach that allows for the codetection of glucose uptake, intracellular functional proteins, and genetic mutations at the single-cell level from rare tumor cells. The microchip contains thousands of nanoliter grooves (nanowells) that isolate individual CTCs and allow for the assessment of their glucose uptake via imaging of a fluorescent glucose analog, quantification of a panel of intracellular signaling proteins using a miniaturized antibody barcode microarray, and retrieval of the individual cell nuclei for subsequent off-chip genome amplification and sequencing. This approach integrates molecular-scale information on the metabolic, proteomic, and genetic status of single cells and permits the inference of associations between genetic signatures, energy consumption, and phosphoprotein oncogenic signaling activities in CTCs isolated from blood samples of patients. Importantly, this microchip-based approach achieves this multidimensional molecular analysis with minimal cell loss (<20%), which is the bottleneck of rare cell analysis.
Umar, Arzu; Kang, Hyuk; Timmermans, Annemieke M; Look, Maxime P; Meijer-van Gelder, Marion E; den Bakker, Michael A; Jaitly, Navdeep; Martens, John W M; Luider, Theo M; Foekens, John A; Pasa-Tolić, Ljiljana
2009-06-01
Tamoxifen resistance is a major cause of death in patients with recurrent breast cancer. Current clinical factors can correctly predict therapy response in only half of the treated patients. Identification of proteins that are associated with tamoxifen resistance is a first step toward better response prediction and tailored treatment of patients. In the present study we intended to identify putative protein biomarkers indicative of tamoxifen therapy resistance in breast cancer using nano-LC coupled with FTICR MS. Comparative proteome analysis was performed on approximately 5,500 pooled tumor cells (corresponding to approximately 550 ng of protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data sets (n = 24 and n = 27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag reference databases. A total of 17,263 unique peptides were identified that corresponded to 2,556 non-redundant proteins identified with ≥ 2 peptides. 1,713 overlapping proteins between the two data sets were used for further analysis. Comparative proteome analysis revealed 100 putatively differentially abundant proteins between tamoxifen-sensitive and tamoxifen-resistant tumors. The presence and relative abundance for 47 differentially abundant proteins were verified by targeted nano-LC-MS/MS in a selection of unpooled, non-microdissected discovery set tumor tissue extracts. ENPP1, EIF3E, and GNB4 were significantly associated with progression-free survival upon tamoxifen treatment for recurrent disease. Differential abundance of our top discriminating protein, extracellular matrix metalloproteinase inducer, was validated by tissue microarray in an independent patient cohort (n = 156).
Extracellular matrix metalloproteinase inducer levels were higher in therapy-resistant tumors and significantly associated with an earlier tumor progression following first line tamoxifen treatment (hazard ratio, 1.87; 95% confidence interval, 1.25-2.80; p = 0.002). In summary, comparative proteomics performed on laser capture microdissection-derived breast tumor cells using nano-LC-FTICR MS technology revealed a set of putative biomarkers associated with tamoxifen therapy resistance in recurrent breast cancer.
From genomes to vaccines: Leishmania as a model.
Almeida, Renata; Norrish, Alan; Levick, Mark; Vetrie, David; Freeman, Tom; Vilo, Jaak; Ivens, Alasdair; Lange, Uta; Stober, Carmel; McCann, Sharon; Blackwell, Jenefer M
2002-01-01
The 35 Mb genome of Leishmania should be sequenced by late 2002. It contains approximately 8500 genes that will probably translate into more than 10 000 proteins. In the laboratory we have been piloting strategies to try to harness the power of the genome-proteome for rapid screening of new vaccine candidates. To this end, microarray analysis of 1094 unique genes identified using an EST analysis of 2091 cDNA clones from spliced leader libraries prepared from different developmental stages of Leishmania has been employed. The plan was to identify amastigote-expressed genes that could be used in high-throughput DNA-vaccine screens to identify potential new vaccine candidates. Despite the lack of transcriptional regulation that polycistronic transcription in Leishmania dictates, the data provide evidence for a high level of post-transcriptional regulation of RNA abundance during the developmental cycle of promastigotes in culture and in lesion-derived amastigotes of Leishmania major. This has provided 147 candidates from the 1094 unique genes that are specifically upregulated in amastigotes and are being used in vaccine studies. Using DNA vaccination, it was demonstrated that pooling strategies can work to identify protective vaccines, but it was found that some potentially protective antigens are masked by other disease-exacerbatory antigens in the pool. A total of 100 new vaccine candidates are currently being tested separately and in pools to extend this analysis, and to facilitate retrospective bioinformatic analysis to develop predictive algorithms for sequences that constitute potentially protective antigens. We are also working with other members of the Leishmania Genome Network to determine whether RNA expression determined by microarray analyses parallels expression at the protein level.
We believe we are making good progress in developing strategies that will allow rapid translation of the sequence of Leishmania into potential interventions for disease control in humans. PMID:11839176
Belciug, Smaranda; Gorunescu, Florin
2018-06-08
Methods based on microarrays (MA), mass spectrometry (MS), and machine learning (ML) algorithms have evolved rapidly in recent years, allowing for early detection of several types of cancer. A pitfall of these approaches, however, is the overfitting of data due to large number of attributes and small number of instances -- a phenomenon known as the 'curse of dimensionality'. A potentially fruitful idea to avoid this drawback is to develop algorithms that combine fast computation with a filtering module for the attributes. The goal of this paper is to propose a statistical strategy to initiate the hidden nodes of a single-hidden layer feedforward neural network (SLFN) by using both the knowledge embedded in data and a filtering mechanism for attribute relevance. In order to attest its feasibility, the proposed model has been tested on five publicly available high-dimensional datasets: breast, lung, colon, and ovarian cancer regarding gene expression and proteomic spectra provided by cDNA arrays, DNA microarray, and MS. The novel algorithm, called adaptive SLFN (aSLFN), has been compared with four major classification algorithms: traditional ELM, radial basis function network (RBF), single-hidden layer feedforward neural network trained by backpropagation algorithm (BP-SLFN), and support vector-machine (SVM). Experimental results showed that the classification performance of aSLFN is competitive with the comparison models. Copyright © 2018. Published by Elsevier Inc.
Spermatogenesis in mammals: proteomic insights.
Chocu, Sophie; Calvel, Pierre; Rolland, Antoine D; Pineau, Charles
2012-08-01
Spermatogenesis is a highly sophisticated process involved in the transmission of genetic heritage. It includes halving ploidy, repackaging of the chromatin for transport, and the equipment of developing spermatids and eventually spermatozoa with the advanced apparatus (e.g., tightly packed mitochondrial sheath in the midpiece, elongation of the tail, reduction of cytoplasmic volume) to elicit motility once they reach the epididymis. Mammalian spermatogenesis is divided into three phases. In the first the primitive germ cells or spermatogonia undergo a series of mitotic divisions. In the second the spermatocytes undergo two consecutive divisions in meiosis to produce haploid spermatids. In the third the spermatids differentiate into spermatozoa in a process called spermiogenesis. Paracrine, autocrine, juxtacrine, and endocrine pathways all contribute to the regulation of the process. The array of structural elements and chemical factors modulating somatic and germ cell activity is such that the network linking the various cellular activities during spermatogenesis is unimaginably complex. Over the past two decades, advances in genomics have greatly improved our knowledge of spermatogenesis, by identifying numerous genes essential for the development of functional male gametes. Large-scale analyses of testicular function have deepened our insight into normal and pathological spermatogenesis. Progress in genome sequencing and microarray technology has been exploited for genome-wide expression studies, leading to the identification of hundreds of genes differentially expressed within the testis. However, although proteomics has now come of age, the proteomics-based investigation of spermatogenesis remains in its infancy. Here, we review the state-of-the-art of large-scale proteomic analyses of spermatogenesis, from germ cell development during sex determination to spermatogenesis in the adult.
Indeed, a few laboratories have undertaken differential protein profiling expression studies and/or systematic analyses of testicular proteomes in entire organs or isolated cells from various species. We consider the pros and cons of proteomics for studying the testicular germ cell gene expression program. Finally, we address the use of protein datasets, through integrative genomics (i.e., combining genomics, transcriptomics, and proteomics), bioinformatics, and modelling.
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
Robinson, Mark D; McCarthy, Davis J; Smyth, Gordon K
2010-01-01
It is expected that emerging digital gene expression (DGE) technologies will overtake microarray technologies in the near future for many functional genomics applications. One of the fundamental data analysis tasks, especially for gene expression studies, involves determining whether there is evidence that counts for a transcript or exon are significantly different across experimental conditions. edgeR is a Bioconductor software package for examining differential expression of replicated count data. An overdispersed Poisson model is used to account for both biological and technical variability. Empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference. The methodology can be used even with the most minimal levels of replication, provided at least one phenotype or experimental condition is replicated. The software may have other applications beyond sequencing data, such as proteome peptide count data. The package is freely available under the LGPL licence from the Bioconductor web site (http://bioconductor.org).
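An overdispersed Poisson model of the kind mentioned in this abstract is commonly written as a negative binomial, in which the variance of a count with mean μ is μ + φμ², where φ is the dispersion (φ = 0 recovers the Poisson). The Python sketch below illustrates the idea only and is not edgeR's estimator: `moment_dispersion` is a simple method-of-moments estimate, and `shrink` is a hypothetical stand-in for moderating per-transcript dispersions toward a common value, with an arbitrary fixed `weight` assumed for illustration.

```python
from statistics import mean, variance


def moment_dispersion(counts):
    """Method-of-moments dispersion phi from var = mu + phi * mu**2.

    Needs at least two replicate counts; returns 0.0 when the data are
    no more variable than a Poisson.
    """
    counts = [float(c) for c in counts]
    mu = mean(counts)
    if mu == 0:
        return 0.0
    s2 = variance(counts)  # unbiased sample variance
    return max(0.0, (s2 - mu) / (mu * mu))


def shrink(dispersions, weight=0.5):
    """Pull each per-transcript dispersion toward the average of all of them.

    `weight` is an illustrative constant, not a value estimated from data.
    """
    common = mean(dispersions)
    return [weight * common + (1.0 - weight) * d for d in dispersions]
```

With very few replicates the per-transcript estimates are noisy, which is why borrowing information across transcripts, as the abstract describes, tends to stabilise inference.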
New technology and resources for cryptococcal research
Zhang, Nannan; Park, Yoon-Dong; Williamson, Peter R.
2014-01-01
Rapid advances in molecular biology and genome sequencing have enabled the generation of new technology and resources for cryptococcal research. RNAi-mediated specific gene knockdown has become routine and more efficient by utilizing modified shRNA plasmids and convergent promoter RNAi constructs. This system was recently applied in a high-throughput screen to identify genes involved in host-pathogen interactions. Gene deletion efficiencies have also been improved by increasing rates of homologous recombination through a number of approaches, including a combination of double-joint PCR with split-marker transformation, the use of dominant selectable markers and the introduction of Cre-loxP systems into Cryptococcus. Moreover, visualization of cryptococcal proteins has become more facile using fusions with codon-optimized fluorescent tags, such as green or red fluorescent proteins or mCherry. Using recent genome-wide analytical tools, new transcriptional factors and regulatory proteins have been identified in novel virulence-related signaling pathways by employing microarray analysis, RNA-sequencing and proteomic analysis. PMID:25460849
Chemiluminescence microarrays in analytical chemistry: a critical review.
Seidel, Michael; Niessner, Reinhard
2014-09-01
Multi-analyte immunoassays on microarrays and on multiplex DNA microarrays have been described for quantitative analysis of small organic molecules (e.g., antibiotics, drugs of abuse, small molecule toxins), proteins (e.g., antibodies or protein toxins), and microorganisms, viruses, and eukaryotic cells. In analytical chemistry, multi-analyte detection by use of analytical microarrays has become an innovative research topic because of the possibility of generating several sets of quantitative data for different analyte classes in a short time. Chemiluminescence (CL) microarrays are powerful tools for rapid multiplex analysis of complex matrices. A wide range of applications for CL microarrays is described in the literature dealing with analytical microarrays. The motivation for this review is to summarize the current state of CL-based analytical microarrays. Combining analysis of different compound classes on CL microarrays reduces analysis time, cost of reagents, and use of laboratory space. Applications are discussed, with examples from food safety, water safety, environmental monitoring, diagnostics, forensics, toxicology, and biosecurity. The potential and limitations of research on multiplex analysis by use of CL microarrays are discussed in this review.
Assessing the impact of transcriptomics, proteomics and metabolomics on fungal phytopathology.
Tan, Kar-Chun; Ipcho, Simon V S; Trengove, Robert D; Oliver, Richard P; Solomon, Peter S
2009-09-01
SUMMARY Peer-reviewed literature is today littered with exciting new tools and techniques that are being used in all areas of biology and medicine. Transcriptomics, proteomics and, more recently, metabolomics are three of these techniques that have impacted on fungal plant pathology. Used individually, each of these techniques can generate a plethora of data that could occupy a laboratory for years. When used in combination, they have the potential to comprehensively dissect a system at the transcriptional and translational level. Transcriptomics, or quantitative gene expression profiling, is arguably the most familiar to researchers in the field of fungal plant pathology. Microarrays have been the primary technique for the last decade, but others are now emerging. Proteomics has also been exploited by the fungal phytopathogen community, but perhaps not to its potential. A lack of genome sequence information has frustrated proteomics researchers and has largely contributed to this technique not fulfilling its potential. The coming of the genome sequencing era has partially alleviated this problem. Metabolomics is the most recent of these techniques to emerge and is concerned with the non-targeted profiling of all metabolites in a given system. Metabolomics studies on fungal plant pathogens are only just beginning to appear, although its potential to dissect many facets of the pathogen and disease will see its popularity increase quickly. This review assesses the impact of transcriptomics, proteomics and metabolomics on fungal plant pathology over the last decade and discusses their futures. Each of the techniques is described briefly with further reading recommended. Key examples highlighting the application of these technologies to fungal plant pathogens are also reviewed.
Completed | Office of Cancer Clinical Proteomics Research
Prior to the current Clinical Proteomic Tumor Analysis Consortium (CPTAC), previously funded initiatives associated with clinical proteomics research included: the Clinical Proteomic Tumor Analysis Consortium (CPTAC 2.0); the Clinical Proteomic Technologies for Cancer Initiative (CPTC); and the Mouse Proteomic Technologies Initiative.
Identification of significant features by the Global Mean Rank test.
Klammer, Martin; Dybowski, J Nikolaj; Hoffmann, Daniel; Schaab, Christoph
2014-01-01
With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data are accounted for without the need for imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature-dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similarly or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use.
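The core statistic named in this abstract, the mean rank of each feature across replicates, can be sketched as follows. This is a minimal Python illustration under stated assumptions and not the published MeanRank test: it omits the null distribution and the false discovery rate estimation entirely, and the function names, the scaling of ranks by the number of observed features, and the tie-breaking by input order are all choices made here for illustration. Missing values (`None`) are simply skipped rather than imputed.

```python
def normalized_ranks(values):
    """Rank the non-missing values in ascending order, scaled to (0, 1].

    Missing entries (None) stay None. Ties are broken by input order.
    """
    observed = sorted((v, i) for i, v in enumerate(values) if v is not None)
    ranks = [None] * len(values)
    for position, (_, index) in enumerate(observed, start=1):
        ranks[index] = position / len(observed)
    return ranks


def mean_ranks(replicates):
    """Per-feature mean of normalized ranks over the replicates where it was observed.

    `replicates` is a list of equal-length lists, one per replicate, holding
    e.g. log-ratios for the same features in the same order.
    """
    per_replicate = [normalized_ranks(rep) for rep in replicates]
    result = []
    for feature_ranks in zip(*per_replicate):
        seen = [r for r in feature_ranks if r is not None]
        result.append(sum(seen) / len(seen) if seen else None)
    return result
```

Features whose mean rank is consistently near 1 or near 0 across replicates are candidates for up- or down-regulation; deciding which of them are significant is the part of the published method that this sketch leaves out.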
Breuer, Eun-Kyoung Yim; Murph, Mandi M.
2011-01-01
Technological and scientific innovations over the last decade have greatly contributed to improved diagnostics, predictive models, and prognosis among cancers affecting women. In fact, an explosion of information in these areas has almost assured future generations that outcomes in cancer will continue to improve. Herein we discuss the current status of breast, cervical, and ovarian cancers as it relates to screening, disease diagnosis, and treatment options. Among the differences in these cancers, it is striking that breast cancer has multiple predictive tests based upon tumor biomarkers and sophisticated, individualized options for prescription therapeutics while ovarian cancer lacks these tools. In addition, cervical cancer leads the way in innovative, cancer-preventative vaccines and multiple screening options to prevent disease progression. For each of these malignancies, emerging proteomic technologies based upon mass spectrometry, stable isotope labeling with amino acids, high-throughput ELISA, tissue or protein microarray techniques, and click chemistry in the pursuit of activity-based profiling can pioneer the next generation of discovery. We will discuss six of the latest techniques to understand proteomics in cancer and highlight research utilizing these techniques with the goal of improvement in the management of women's cancers. PMID:21886869
2009-01-01
Background The maintenance of internal pH in bacterial cells is challenged by natural stress conditions, during host infection or in biotechnological production processes. Comprehensive transcriptomic and proteomic analyses have been conducted in several bacterial model systems, yet questions remain as to the mechanisms of pH homeostasis. Results Here we present the comprehensive analysis of pH homeostasis in C. glutamicum, a bacterium of industrial importance. At pH values between 6 and 9 effective maintenance of the internal pH at 7.5 ± 0.5 pH units was found. By DNA microarray analyses differential mRNA patterns were identified. The expression profiles were validated and extended by 1D-LC-ESI-MS/MS based quantification of soluble and membrane proteins. Regulators involved were identified and thereby participation of numerous signaling modules in pH response was found. The functional analysis revealed for the first time the occurrence of oxidative stress in C. glutamicum cells at neutral and low pH conditions accompanied by activation of the iron starvation response. Intracellular metabolite pool analysis unraveled inhibition of the TCA and other pathways at low pH. Methionine and cysteine synthesis were found to be activated via the McbR regulator, cysteine accumulation was observed and addition of cysteine was shown to be toxic under acidic conditions. Conclusions Novel limitations for C. glutamicum at non-optimal pH values were identified by a comprehensive analysis on the level of the transcriptome, proteome, and metabolome indicating a functional link between pH acclimatization, oxidative stress, iron homeostasis, and metabolic alterations. The results offer new insights into bacterial stress physiology and new starting points for bacterial strain design or pathogen defense. PMID:20025733
Mining biological databases for candidate disease genes
NASA Astrophysics Data System (ADS)
Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.
2001-07-01
The publicly funded effort to determine the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has so far produced more than 93% of the 3 billion nucleotides of the human genome in a preliminary 'draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of these data identified approximately 30,000 - 40,000 human 'genes.' However, the bulk of the effort still remains: to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high-speed cluster of Linux workstations is used to analyze sequences and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedl Syndrome (BBS).
Raschzok, Nathanael; Werner, Wiebke; Sallmon, Hannes; Billecke, Nils; Dame, Christof; Neuhaus, Peter; Sauer, Igor M
2011-06-01
The liver has the unique capacity to regenerate after surgical resection. However, the regulation of liver regeneration is not completely understood. Recent reports indicate an essential role for small noncoding microRNAs (miRNAs) in the regulation of hepatic development, carcinogenesis, and early regeneration. We hypothesized that miRNAs are critically involved in all phases of liver regeneration after partial hepatectomy. We performed miRNA microarray analyses after 70% partial hepatectomy in rats under isoflurane anesthesia at different time points (0 h to 5 days) and after sham laparotomy. Putative targets of differentially expressed miRNAs were determined using a bioinformatic approach. Two-dimensional (2D)-PAGE proteomic analyses and protein identification were performed on specimens at 0 and 24 h after resection. The temporal dynamics of liver regeneration were characterized by 5-bromo-2-deoxyuridine, proliferating cell nuclear antigen, IL-6, and hepatocyte growth factor. We demonstrate that miRNA expression patterns changed during liver regeneration and that these changes were most evident during the peak of DNA replication at 24 h after resection. Expression of 13 miRNAs was significantly reduced 12-48 h after resection (>25% change), out of which downregulation was confirmed in isolated hepatocytes for 6 miRNAs at 24 h, whereas three miRNAs were significantly upregulated. Proteomic analysis revealed 65 upregulated proteins; among them, 23 represent putative targets of the differentially expressed miRNAs. We provide a temporal miRNA expression and proteomic dataset of the regenerating rat liver, which indicates a primary function for miRNA during the peak of DNA replication. These data will assist further functional studies on the role of miRNAs during liver regeneration.
Marimuthu, Arivusudar; Chavan, Sandip; Sathe, Gajanan; Sahasrabuddhe, Nandini A; Srikanth, Srinivas M; Renuse, Santosh; Ahmad, Sartaj; Radhakrishnan, Aneesha; Barbhuiya, Mustafa A; Kumar, Rekha V; Harsha, H C; Sidransky, David; Califano, Joseph; Pandey, Akhilesh; Chatterjee, Aditi
2013-11-01
Protein biomarker discovery for early detection of head and neck squamous cell carcinoma (HNSCC) is a crucial unmet need to improve patient outcomes. Mass spectrometry-based proteomics has emerged as a promising tool for identification of biomarkers in different cancer types. Proteins secreted from cancer cells can serve as potential biomarkers for early diagnosis. In the current study, we have used isobaric tag for relative and absolute quantitation (iTRAQ) labeling methodology coupled with high resolution mass spectrometry to identify and quantitate secreted proteins from a panel of head and neck carcinoma cell lines. In all, we identified 2,472 proteins, of which 225 proteins were secreted at higher or lower abundance in HNSCC-derived cell lines. Of these, 148 were present in higher abundance and 77 were present in lower abundance in the cancer-cell derived secretome. We detected a higher abundance of some previously known markers for HNSCC including insulin like growth factor binding protein 3, IGFBP3 (11-fold) and opioid growth factor receptor, OGFR (10-fold) demonstrating the validity of our approach. We also identified several novel secreted proteins in HNSCC including olfactomedin-4, OLFM4 (12-fold) and hepatocyte growth factor activator, HGFA (5-fold). IHC-based validation was conducted in HNSCC using tissue microarrays which revealed overexpression of IGFBP3 and OLFM4 in 70% and 75% of the tested cases, respectively. Our study illustrates quantitative proteomics of secretome as a robust approach for identification of potential HNSCC biomarkers. This article is part of a Special Issue entitled: An Updated Secretome. Copyright © 2013 Elsevier B.V. All rights reserved.
Colak, Dilek; Alaiya, Ayodele A; Kaya, Namik; Muiya, Nzioka P; AlHarazi, Olfat; Shinwari, Zakia; Andres, Editha; Dzimiri, Nduna
2016-01-01
The disease pathways leading to idiopathic dilated cardiomyopathy (DCM) are still elusive. The present study investigated integrated global transcriptional and translational changes in human DCM for disease biomarker discovery. We used identical myocardial tissues from five DCM hearts compared to five non-failing (NF) donor hearts for both transcriptome profiling using the ABI high-density oligonucleotide microarrays and proteome expression with One-Dimensional Nano Acquity liquid chromatography coupled with tandem mass spectrometry on the Synapt G2 system. We identified 1262 differentially expressed genes (DEGs) and 269 proteins (DEPs) between DCM cases and healthy controls. Among the most significantly upregulated (>5-fold) proteins were GRK5, APOA2, IGHG3, ANXA6, HSP90AA1, and ATP5C1 (p < 0.01). On the other hand, the most significantly downregulated proteins were GSTM5, COX17, CAV1 and ANXA3. At least ten entities were concomitantly upregulated on the two analysis platforms: GOT1, ALDH4A1, PDHB, BDH1, SLC2A11, HSP90AA1, HSP90AB1, H2AFV, HSPA5 and NDUFV1. Gene ontology analyses of DEGs and DEPs revealed significant overlap with enrichment of genes/proteins related to metabolic process, biosynthetic process, cellular component organization, oxidative phosphorylation, alterations in glycolysis and ATP synthesis, Alzheimer's disease, chemokine-mediated inflammation and cytokine signalling pathways. The concomitant use of transcriptome and proteome expression to evaluate global changes in DCM has led to the identification of sixteen commonly altered entities as well as novel genes, proteins and pathways whose cardiac functions have yet to be deciphered. These data should contribute towards better management of the disease.
cDNA microarray analysis of esophageal cancer: discoveries and prospects.
Shimada, Yutaka; Sato, Fumiaki; Shimizu, Kazuharu; Tsujimoto, Gozoh; Tsukada, Kazuhiro
2009-07-01
Recent progress in molecular biology has revealed many genetic and epigenetic alterations that are involved in the development and progression of esophageal cancer. Microarray analysis has also revealed several genetic networks that are involved in esophageal cancer. However, clinical application of microarray techniques and use of microarray data have not yet occurred. In this review, we focus on the recent developments and problems with microarray analysis of esophageal cancer.
Spaceflight Alters Bacterial Gene Expression and Virulence and Reveals Role for Global Regulator Hfq
NASA Technical Reports Server (NTRS)
Wilson, J. W.; Ott, C. M.; zuBentrup, K. Honer; Ramamurthy, R.; Quick, L.; Porwollik, S.; Cheng, P.; McClellan, M.; Tsaprailis, G.; Radabaugh, T.;
2007-01-01
A comprehensive analysis of both the molecular genetic and phenotypic responses of any organism to the spaceflight environment has never been accomplished due to significant technological and logistical hurdles. Moreover, the effects of spaceflight on microbial pathogenicity and associated infectious disease risks have not been studied. The bacterial pathogen Salmonella typhimurium was grown aboard Space Shuttle mission STS-115 and compared to identical ground control cultures. Global microarray and proteomic analyses revealed 167 transcripts and 73 proteins changed expression with the conserved RNA-binding protein Hfq identified as a likely global regulator involved in the response to this environment. Hfq involvement was confirmed with a ground based microgravity culture model. Spaceflight samples exhibited enhanced virulence in a murine infection model and extracellular matrix accumulation consistent with a biofilm. Strategies to target Hfq and related regulators could potentially decrease infectious disease risks during spaceflight missions and provide novel therapeutic options on Earth.
Importing MAGE-ML format microarray data into BioConductor.
Durinck, Steffen; Allemeersch, Joke; Carey, Vincent J; Moreau, Yves; De Moor, Bart
2004-12-12
The microarray gene expression markup language (MAGE-ML) is a widely used XML (eXtensible Markup Language) standard for describing and exchanging information about microarray experiments. It can describe microarray designs, microarray experiment designs, gene expression data and data analysis results. We describe RMAGEML, a new Bioconductor package that provides a link between cDNA microarray data stored in MAGE-ML format and the Bioconductor framework for preprocessing, visualization and analysis of microarray experiments. Availability: http://www.bioconductor.org. Open source.
CRISPR/Cas9: From Genome Engineering to Cancer Drug Discovery
Luo, Ji
2016-01-01
Advances in translational research are often driven by new technologies. The advent of microarrays, next-generation sequencing, proteomics and RNA interference (RNAi) has led to breakthroughs in our understanding of the mechanisms of cancer and the discovery of new cancer drug targets. The discovery of the bacterial clustered regularly interspaced short palindromic repeats (CRISPR) system and its subsequent adaptation as a tool for mammalian genome engineering have opened up new avenues for functional genomics studies. This review will focus on the utility of CRISPR in the context of cancer drug target discovery. PMID:28603775
Killion, Patrick J; Sherlock, Gavin; Iyer, Vishwanath R
2003-01-01
Background The power of microarray analysis can be realized only if data is systematically archived and linked to biological annotations as well as analysis algorithms. Description The Longhorn Array Database (LAD) is a MIAME compliant microarray database that operates on PostgreSQL and Linux. It is a fully open source version of the Stanford Microarray Database (SMD), one of the largest microarray databases. LAD is available at Conclusions Our development of LAD provides a simple, free, open, reliable and proven solution for storage and analysis of two-color microarray data. PMID:12930545
Novel ageing-biomarker discovery using data-intensive technologies.
Griffiths, H R; Augustyniak, E M; Bennett, S J; Debacq-Chainiaux, F; Dunston, C R; Kristensen, P; Melchjorsen, C J; Navarrete, Santos A; Simm, A; Toussaint, O
2015-11-01
Ageing is accompanied by many visible characteristics. Other biological and physiological markers are also well described, e.g. loss of circulating sex hormones and increased inflammatory cytokines. Biomarkers for healthy ageing studies are presently predicated on existing knowledge of ageing traits. The increasing availability of data-intensive methods enables deep analysis of biological samples for novel biomarkers. We have adopted two discrete approaches in MARK-AGE Work Package 7 for biomarker discovery: (1) microarray analyses and/or proteomics in cell systems, e.g. endothelial progenitor cells or T cell ageing including a stress model; and (2) investigation of cellular material and plasma directly from tightly defined proband subsets of different ages using proteomic, transcriptomic and miR array. The first approach provided longitudinal insight into endothelial progenitor and T cell ageing. This review describes the strategy and use of hypothesis-free, data-intensive approaches to explore cellular proteins, miR, mRNA and plasma proteins as healthy ageing biomarkers, using ageing models and directly within samples from adults of different ages. It considers the challenges associated with integrating multiple models and pilot studies as rational biomarkers for a large cohort study. From this approach, a number of high-throughput methods were developed to evaluate novel, putative biomarkers of ageing in the MARK-AGE cohort. Crown Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.
The regulon of the RNA chaperone CspA and its auto-regulation in Staphylococcus aureus.
Caballero, Carlos J; Menendez-Gil, Pilar; Catalan-Moreno, Arancha; Vergara-Irigaray, Marta; García, Begoña; Segura, Víctor; Irurzun, Naiara; Villanueva, Maite; Ruiz de Los Mozos, Igor; Solano, Cristina; Lasa, Iñigo; Toledo-Arana, Alejandro
2018-02-16
RNA-binding proteins (RBPs) are essential to fine-tune gene expression. RBPs containing the cold-shock domain are RNA chaperones that have been extensively studied. However, the RNA targets and specific functions for many of them remain elusive. Here, combining comparative proteomics and RBP-immunoprecipitation-microarray profiling, we have determined the regulon of the RNA chaperone CspA of Staphylococcus aureus. Functional analysis revealed that proteins involved in carbohydrate and ribonucleotide metabolism, stress response and virulence gene expression were affected by cspA deletion. Stress-associated phenotypes such as increased bacterial aggregation and diminished resistance to oxidative-stress stood out. Integration of the proteome and targetome showed that CspA post-transcriptionally modulates both positively and negatively the expression of its targets, denoting additional functions to the previously proposed translation enhancement. One of these repressed targets was its own mRNA, indicating the presence of a negative post-transcriptional feedback loop. CspA bound the 5'UTR of its own mRNA disrupting a hairpin, which was previously described as an RNase III target. Thus, deletion of the cspA 5'UTR abrogated mRNA processing and auto-regulation. We propose that CspA interacts through a U-rich motif, which is located at the RNase III cleavage site, portraying CspA as a putative RNase III-antagonist.
Aptamer-Based Multiplexed Proteomic Technology for Biomarker Discovery
Gold, Larry; Ayers, Deborah; Bertino, Jennifer; Bock, Christopher; Bock, Ashley; Brody, Edward N.; Carter, Jeff; Dalby, Andrew B.; Eaton, Bruce E.; Fitzwater, Tim; Flather, Dylan; Forbes, Ashley; Foreman, Trudi; Fowler, Cate; Gawande, Bharat; Goss, Meredith; Gunn, Magda; Gupta, Shashi; Halladay, Dennis; Heil, Jim; Heilig, Joe; Hicke, Brian; Husar, Gregory; Janjic, Nebojsa; Jarvis, Thale; Jennings, Susan; Katilius, Evaldas; Keeney, Tracy R.; Kim, Nancy; Koch, Tad H.; Kraemer, Stephan; Kroiss, Luke; Le, Ngan; Levine, Daniel; Lindsey, Wes; Lollo, Bridget; Mayfield, Wes; Mehan, Mike; Mehler, Robert; Nelson, Sally K.; Nelson, Michele; Nieuwlandt, Dan; Nikrad, Malti; Ochsner, Urs; Ostroff, Rachel M.; Otis, Matt; Parker, Thomas; Pietrasiewicz, Steve; Resnicow, Daniel I.; Rohloff, John; Sanders, Glenn; Sattin, Sarah; Schneider, Daniel; Singer, Britta; Stanton, Martin; Sterkel, Alana; Stewart, Alex; Stratford, Suzanne; Vaught, Jonathan D.; Vrkljan, Mike; Walker, Jeffrey J.; Watrobka, Mike; Waugh, Sheela; Weiss, Allison; Wilcox, Sheri K.; Wolfson, Alexey; Wolk, Steven K.; Zhang, Chi; Zichi, Dom
2010-01-01
Background The interrogation of proteomes (“proteomics”) in a highly multiplexed and efficient manner remains a coveted and challenging goal in biology and medicine. Methodology/Principal Findings We present a new aptamer-based proteomic technology for biomarker discovery capable of simultaneously measuring thousands of proteins from small sample volumes (15 µL of serum or plasma). Our current assay measures 813 proteins with low limits of detection (1 pM median), 7 logs of overall dynamic range (∼100 fM–1 µM), and 5% median coefficient of variation. This technology is enabled by a new generation of aptamers that contain chemically modified nucleotides, which greatly expand the physicochemical diversity of the large randomized nucleic acid libraries from which the aptamers are selected. Proteins in complex matrices such as plasma are measured with a process that transforms a signature of protein concentrations into a corresponding signature of DNA aptamer concentrations, which is quantified on a DNA microarray. Our assay takes advantage of the dual nature of aptamers as both folded protein-binding entities with defined shapes and unique nucleotide sequences recognizable by specific hybridization probes. To demonstrate the utility of our proteomics biomarker discovery technology, we applied it to a clinical study of chronic kidney disease (CKD). We identified two well known CKD biomarkers as well as an additional 58 potential CKD biomarkers. These results demonstrate the potential utility of our technology to rapidly discover unique protein signatures characteristic of various disease states. Conclusions/Significance We describe a versatile and powerful tool that allows large-scale comparison of proteome profiles among discrete populations. 
This unbiased and highly multiplexed search engine will enable the discovery of novel biomarkers in a manner that is unencumbered by our incomplete knowledge of biology, thereby helping to advance the next generation of evidence-based medicine. PMID:21165148
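As a quick consistency check on the figures quoted in this abstract, the stated range of roughly 100 fM to 1 µM does correspond to the "7 logs" of dynamic range; the worked arithmetic below uses only the two concentrations given in the text.

```latex
\log_{10}\!\left(\frac{1\,\mu\mathrm{M}}{100\,\mathrm{fM}}\right)
= \log_{10}\!\left(\frac{10^{-6}\,\mathrm{M}}{10^{-13}\,\mathrm{M}}\right)
= \log_{10}\!\left(10^{7}\right) = 7
```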
Multiplex array proteomics detects increased MMP-8 in CSF after spinal cord injury.
Light, Matthew; Minor, Kenneth H; DeWitt, Peter; Jasper, Kyle H; Davies, Stephen J A
2012-06-11
A variety of methods have been used to study inflammatory changes in the acutely injured spinal cord. Recently novel multiplex assays have been used in an attempt to overcome limitations in numbers of available targets studied in a single experiment. Other technical challenges in developing pre-clinical rodent models to investigate biomarkers in cerebrospinal fluid (CSF) include relatively small volumes of sample and low concentrations of target proteins. The primary objective of this study was to characterize the inflammatory profile present in CSF at a subacute time point in a clinically relevant rodent model of traumatic spinal cord injury (SCI). Our other aim was to test a microarray proteomics platform specifically for this application. A 34 cytokine sandwich ELISA microarray was used to study inflammatory changes in CSF samples taken 12 days post-cervical SCI in adult rats. The difference between the median foreground signal and the median background signal was measured. Bonferroni and Benjamini-Hochberg multiple testing corrections were applied to limit the False Discovery Rate (FDR), and a linear mixed model was used to account for repeated measures in the array. We report a novel subacute SCI biomarker, elevated levels of matrix metalloproteinase-8 protein in CSF, and discuss application of statistical models designed for multiplex testing. Major advantages of this assay over conventional methods include high-throughput format, good sensitivity, and reduced sample consumption. This method can be useful for creating comprehensive inflammatory profiles, and biomarkers can be used in the clinic to assess injury severity and to objectively grade response to therapy.
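The two multiple-testing corrections named in this abstract can be sketched in a few lines of plain Python. This is a generic illustration of the standard Bonferroni and Benjamini-Hochberg adjustments, not the authors' analysis code; the function names and the example p-values are hypothetical.

```python
def bonferroni(p_values):
    """Bonferroni adjustment: multiply each p-value by the number of tests, cap at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in the input order.

    Each sorted p-value p_(k) is scaled by m / k, then a running minimum is
    taken from the largest p-value downwards so the adjusted values are monotone.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for four analytes on an array.
p = [0.01, 0.04, 0.03, 0.005]
bh = benjamini_hochberg(p)
bf = bonferroni(p)
significant_bh = [q <= 0.05 for q in bh]
```

In this toy example all four analytes pass a 0.05 threshold after the Benjamini-Hochberg adjustment, while only two pass after the stricter Bonferroni adjustment, which illustrates why the two corrections are often reported side by side.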
Expression and Production of SH2 Domain Proteins.
Liu, Bernard A; Ogiue-Ikeda, Mari; Machida, Kazuya
2017-01-01
The Src Homology 2 (SH2) domain lies at the heart of phosphotyrosine signaling, coordinating signaling events downstream of receptor tyrosine kinases (RTKs), adaptors, and scaffolds. Over a hundred SH2 domains are present in mammals, each having a unique specificity which determines its interactions with multiple binding partners. One of the essential tools necessary for studying and determining the role of SH2 domains in phosphotyrosine signaling is a set of soluble recombinant SH2 proteins. Here we describe methods, based on a broad experience with purification of all SH2 domains, for the production of SH2 domain proteins needed for proteomic and biochemical-based studies such as peptide arrays, mass-spectrometry, protein microarrays, reverse-phase microarrays, and high-throughput fluorescence polarization (HTP-FP). We describe stepwise protocols for expression and purification of SH2 domains using GST or poly His-tags, two widely adopted affinity tags. In addition, we address alternative approaches, challenges, and validation studies for assessing protein quality and provide general characteristics of purified human SH2 domains.
Ryan, Michael C; Zeeberg, Barry R; Caplen, Natasha J; Cleland, James A; Kahn, Ari B; Liu, Hongfang; Weinstein, John N
2008-01-01
Background Over 60% of protein-coding genes in vertebrates express mRNAs that undergo alternative splicing. The resulting collection of transcript isoforms poses significant challenges for contemporary biological assays. For example, RT-PCR validation of gene expression microarray results may be unsuccessful if the two technologies target different splice variants. Effective use of sequence-based technologies requires knowledge of the specific splice variant(s) that are targeted. In addition, the critical roles of alternative splice forms in biological function and in disease suggest that assay results may be more informative if analyzed in the context of the targeted splice variant. Results A number of contemporary technologies are used for analyzing transcripts or proteins. To enable investigation of the impact of splice variation on the interpretation of data derived from those technologies, we have developed SpliceCenter. SpliceCenter is a suite of user-friendly, web-based applications that includes programs for analysis of RT-PCR primer/probe sets, effectors of RNAi, microarrays, and protein-targeting technologies. Both interactive and high-throughput implementations of the tools are provided. The interactive versions of SpliceCenter tools provide visualizations of a gene's alternative transcripts and probe target positions, enabling the user to identify which splice variants are or are not targeted. The high-throughput batch versions accept user query files and provide results in tabular form. When, for example, we used SpliceCenter's batch siRNA-Check to process the Cancer Genome Anatomy Project's large-scale shRNA library, we found that only 59% of the 50,766 shRNAs in the library target all known splice variants of the target gene, 32% target some but not all, and 9% do not target any currently annotated transcript. 
Conclusion SpliceCenter provides unique, user-friendly applications for assessing the impact of transcript variation on the design and interpretation of RT-PCR, RNAi, gene expression microarrays, antibody-based detection, and mass spectrometry proteomics. The tools are intended for use by bench biologists as well as bioinformaticists. PMID:18638396
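The three-way breakdown reported for the shRNA library (targets all, some, or none of a gene's known splice variants) amounts to a set comparison per reagent. The sketch below shows that classification under stated assumptions: it is not SpliceCenter code, and the function name, transcript identifiers, and input layout are hypothetical.

```python
from collections import Counter


def classify_coverage(targeted, annotated):
    """Classify a reagent by how many annotated transcripts of its gene it hits.

    targeted  -- transcript IDs the reagent matches
    annotated -- all currently annotated transcript IDs of the target gene
    Returns "all", "some", or "none".
    """
    annotated = set(annotated)
    hit = set(targeted) & annotated
    if not hit:
        return "none"
    if hit == annotated:
        return "all"
    return "some"


# Hypothetical reagents: (transcripts matched, transcripts annotated for the gene).
reagents = {
    "shRNA_A": (["tx1", "tx2"], ["tx1", "tx2"]),
    "shRNA_B": (["tx1"], ["tx1", "tx2", "tx3"]),
    "shRNA_C": ([], ["tx1", "tx2"]),
}
summary = Counter(classify_coverage(t, a) for t, a in reagents.values())
```

Tallying the categories over a whole library gives percentages of the kind quoted in the abstract (59% all, 32% some, 9% none).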
Diagnosis of Zika Virus Infection by Peptide Array and Enzyme-Linked Immunosorbent Assay.
Mishra, Nischay; Caciula, Adrian; Price, Adam; Thakkar, Riddhi; Ng, James; Chauhan, Lokendra V; Jain, Komal; Che, Xiaoyu; Espinosa, Diego A; Montoya Cruz, Magelda; Balmaseda, Angel; Sullivan, Eric H; Patel, Jigar J; Jarman, Richard G; Rakeman, Jennifer L; Egan, Christina T; Reusken, Chantal B E M; Koopmans, Marion P G; Harris, Eva; Tokarz, Rafal; Briese, Thomas; Lipkin, W Ian
2018-03-06
Zika virus (ZIKV) is implicated in fetal stillbirth, microcephaly, intracranial calcifications, and ocular anomalies following vertical transmission from infected mothers. In adults, infection may trigger autoimmune inflammatory polyneuropathy. Transmission most commonly follows the bite of infected Aedes mosquitoes but may also occur through sexual intercourse or receipt of blood products. Definitive diagnosis through detection of viral RNA is possible in serum or plasma within 10 days of disease onset, in whole blood within 3 weeks of onset, and in semen for up to 3 months. Serological diagnosis is nonetheless critical because few patients have access to molecular diagnostics during the acute phase of infection and infection may be associated with only mild or inapparent disease that does not prompt molecular testing. Serological diagnosis is confounded by cross-reactivity of immune sera with other flaviviruses endemic in the areas where ZIKV has recently emerged. Accordingly, we built a high-density microarray comprising nonredundant 12-mer peptides that tile, with one-residue overlap, the proteomes of Zika, dengue, yellow fever, West Nile, Ilheus, Oropouche, and chikungunya viruses. Serological analysis enabled discovery of a ZIKV NS2B 20-residue peptide that had high sensitivity (96.0%) and specificity (95.9%) versus natural infection with or vaccination against dengue, chikungunya, yellow fever, West Nile, tick-borne encephalitis, or Japanese encephalitis virus in a microarray assay and an enzyme-linked immunosorbent assay (ELISA) of early-convalescent-phase sera (2 to 3 weeks after onset of symptomatic infection). IMPORTANCE The emergence of Zika virus (ZIKV) as a teratogen is a profound challenge to global public health. Molecular diagnosis of infection is straightforward during the 3-week period when patients are viremic. However, serological diagnosis thereafter of historical exposure has been confounded by cross-reactivity. 
Using high-density peptide arrays that tile the proteomes of a selection of flaviviruses to identify a ZIKV-specific peptide, we established two assays that enable sensitive and specific diagnosis of exposure to ZIKV. These assays may be useful in guiding clinical management of mothers at risk for potential exposure to ZIKV and enable insights into the epidemiology of ZIKV infections.
Marisch, Karoline; Bayer, Karl; Scharl, Theresa; Mairhofer, Juergen; Krempl, Peter M.; Hummel, Karin; Razzazi-Fazeli, Ebrahim; Striedner, Gerald
2013-01-01
Escherichia coli K-12 and B strains are among the most frequently used bacterial hosts for production of recombinant proteins on an industrial scale. To improve existing processes and to accelerate bioprocess development, we performed a detailed host analysis. We investigated the different behaviors of the E. coli production strains BL21, RV308, and HMS174 in response to high-glucose concentrations. Tightly controlled cultivations were conducted under defined environmental conditions for the in-depth analysis of physiological behavior. In addition to acquisition of standard process parameters, we also used DNA microarray analysis and differential gel electrophoresis (Ettan™ DIGE). Batch cultivations showed different yields of the distinct strains for cell dry mass and growth rate, which were highest for BL21. In addition, production of acetate, triggered by excess glucose supply, was much higher for the K-12 strains compared to the B strain. Analysis of transcriptome data showed significant alteration in 347 of 3882 genes common among all three hosts. These differentially expressed genes included, for example, those involved in transport, iron acquisition, and motility. The investigation of proteome patterns additionally revealed a high number of differentially expressed proteins among the investigated hosts. The subsequently selected 38 spots included proteins involved in transport and motility. The results of this comprehensive analysis delivered a full genomic picture of the three investigated strains. Differentially expressed groups suitable for targeted host modification, such as glucose transport or iron acquisition, were identified, enabling potential optimization of strains to improve yield and process quality. Dissimilar growth profiles of the strains confirm different genotypes. Furthermore, distinct transcriptome patterns support differential regulation at the genome level. 
The identified proteins showed high agreement with the transcriptome data and suggest similar regulation within a host at both levels for the identified groups. Such host attributes need to be considered in future process design and operation. PMID:23950949
The Microarray Revolution: Perspectives from Educators
ERIC Educational Resources Information Center
Brewster, Jay L.; Beason, K. Beth; Eckdahl, Todd T.; Evans, Irene M.
2004-01-01
In recent years, microarray analysis has become a key experimental tool, enabling the analysis of genome-wide patterns of gene expression. This review approaches the microarray revolution with a focus upon four topics: 1) the early development of this technology and its application to cancer diagnostics; 2) a primer of microarray research,…
Epigenetics of prostate cancer.
McKee, Tawnya C; Tricoli, James V
2015-01-01
The introduction of novel technologies that can be applied to the investigation of the molecular underpinnings of human cancer has allowed for new insights into the mechanisms associated with tumor development and progression. They have also advanced the diagnosis, prognosis and treatment of cancer. These technologies include microarray and other analysis methods for the generation of large-scale gene expression data on both mRNA and miRNA, next-generation DNA sequencing technologies utilizing a number of platforms to perform whole genome, whole exome, or targeted DNA sequencing to determine somatic mutational differences and gene rearrangements, and a variety of proteomic analysis platforms including liquid chromatography/mass spectrometry (LC/MS) analysis to survey alterations in protein profiles in tumors. One other important advancement has been our current ability to survey the methylome of human tumors in a comprehensive fashion through the use of sequence-based and array-based methylation analysis (Bock et al., Nat Biotechnol 28:1106-1114, 2010; Harris et al., Nat Biotechnol 28:1097-1105, 2010). The focus of this chapter is to present and discuss the evidence for key genes involved in prostate tumor development, progression, or resistance to therapy that are regulated by methylation-induced silencing.
Critical roles for a genetic code alteration in the evolution of the genus Candida.
Silva, Raquel M; Paredes, João A; Moura, Gabriela R; Manadas, Bruno; Lima-Costa, Tatiana; Rocha, Rita; Miranda, Isabel; Gomes, Ana C; Koerkamp, Marian J G; Perrot, Michel; Holstege, Frank C P; Boucherie, Hélian; Santos, Manuel A S
2007-10-31
During the last 30 years, several alterations to the standard genetic code have been discovered in various bacterial and eukaryotic species. Sense and nonsense codons have been reassigned or reprogrammed to expand the genetic code to selenocysteine and pyrrolysine. These discoveries highlight unexpected flexibility in the genetic code, but do not elucidate how the organisms survived the proteome chaos generated by codon identity redefinition. In order to shed new light on this question, we have reconstructed a Candida genetic code alteration in Saccharomyces cerevisiae and used a combination of DNA microarrays, proteomics and genetics approaches to evaluate its impact on gene expression, adaptation and sexual reproduction. This genetic manipulation blocked mating, locked yeast in a diploid state, remodelled gene expression and created stress cross-protection that generated adaptive advantages under environmental challenging conditions. This study highlights unanticipated roles for codon identity redefinition during the evolution of the genus Candida, and strongly suggests that genetic code alterations create genetic barriers that speed up speciation.
Barton, G; Abbott, J; Chiba, N; Huang, DW; Huang, Y; Krznaric, M; Mack-Smith, J; Saleem, A; Sherman, BT; Tiwari, B; Tomlinson, C; Aitman, T; Darlington, J; Game, L; Sternberg, MJE; Butcher, SA
2008-01-01
Background: Microarray experimentation requires the application of complex analysis methods as well as the use of non-trivial computer technologies to manage the resultant large data sets. This, together with the proliferation of tools and techniques for microarray data analysis, makes it very challenging for a laboratory scientist to keep up-to-date with the latest developments in this field. Our aim was to develop a distributed e-support system for microarray data analysis and management. Results: EMAAS (Extensible MicroArray Analysis System) is a multi-user rich internet application (RIA) providing simple, robust access to up-to-date resources for microarray data storage and analysis, combined with integrated tools to optimise real time user support and training. The system leverages the power of distributed computing to perform microarray analyses, and provides seamless access to resources located at various remote facilities. The EMAAS framework allows users to import microarray data from several sources to an underlying database, to pre-process, quality assess and analyse the data, to perform functional analyses, and to track data analysis steps, all through a single easy-to-use web portal. This interface offers distance support to users both in the form of video tutorials and via live screen feeds using the web conferencing tool EVO. A number of analysis packages, including R-Bioconductor and Affymetrix Power Tools, have been integrated on the server side and are available programmatically through the Postgres-PLR library or on grid compute clusters. Integrated distributed resources include the functional annotation tool DAVID, GeneCards and the microarray data repositories GEO, CELSIUS and MiMiR. EMAAS currently supports analysis of Affymetrix 3' and Exon expression arrays, and the system is extensible to cater for other microarray and transcriptomic platforms.
Conclusion: EMAAS enables users to track and perform microarray data management and analysis tasks through a single easy-to-use web application. The system architecture is flexible and scalable to allow new array types, analysis algorithms and tools to be added with relative ease and to cope with large increases in data volume. PMID:19032776
An Introduction to MAMA (Meta-Analysis of MicroArray data) System.
Zhang, Zhe; Fenstermacher, David
2005-01-01
Analyzing microarray data across multiple experiments has been proven advantageous. To support this kind of analysis, we are developing a software system called MAMA (Meta-Analysis of MicroArray data). MAMA utilizes a client-server architecture with a relational database on the server-side for the storage of microarray datasets collected from various resources. The client-side is an application running on the end user's computer that allows the user to manipulate microarray data and analytical results locally. MAMA implementation will integrate several analytical methods, including meta-analysis within an open-source framework offering other developers the flexibility to plug in additional statistical algorithms.
Li, Dongmei; Le Pape, Marc A; Parikh, Nisha I; Chen, Will X; Dye, Timothy D
2013-01-01
Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the fold change criteria of the Significance Analysis of Microarrays method are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to the fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rate control among the approaches are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed.
The Resampling-based empirical Bayes Methods also offer higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis.
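The abstract above contrasts parametric testing with permutation-based testing and with false discovery rate control. As a rough illustration of those two general ideas only, the following Python sketch computes an exact permutation p-value for one gene and applies the Benjamini-Hochberg adjustment across genes. It is not the Resampling-based empirical Bayes Methods, Smyth's method, or the Significance Analysis of Microarrays; the function names and the toy numbers are assumptions made for illustration.

```python
from itertools import combinations
from statistics import mean


def permutation_pvalue(group_a, group_b):
    """Exact two-sided permutation p-value for a difference in group means.

    Enumerates every way of splitting the pooled values into two groups of
    the original sizes, so it is only practical for small sample sizes.
    """
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    observed = abs(mean(group_a) - mean(group_b))
    extreme = 0
    total = 0
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point ties.
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    # Hypothetical expression values for one gene in two conditions.
    print(permutation_pvalue([1, 2, 3], [4, 5, 6]))
    # Hypothetical raw p-values for four genes.
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.5]))
```

Exact enumeration keeps the example deterministic; with realistic sample sizes one would normally draw random permutations instead.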
Transfection microarray and the applications.
Miyake, Masato; Yoshikawa, Tomohiro; Fujita, Satoshi; Miyake, Jun
2009-05-01
Microarray transfection has been extensively studied for high-throughput functional analysis of mammalian cells. However, control of efficiency and reproducibility are the critical issues for practical use. By using solid-phase transfection accelerators and a nano-scaffold, we provide a highly efficient and reproducible microarray-transfection device, the "transfection microarray". The device could be applied to primary cells and stem cells, which are available only in limited numbers, not only for large-scale functional analysis but also for reporter-based time-lapse analysis of cellular events.
Rigbolt, Kristoffer T G; Vanselow, Jens T; Blagoev, Blagoy
2011-08-01
Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, the analysis of data has become the bottleneck of proteomics experiments. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data, we developed the Graphical Proteomics Data Explorer (GProX). The program requires no special bioinformatics training, as all functions of GProX are accessible within its graphical user-friendly interface, which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests for e.g. GO terms, and pathway analysis tools. A number of plotting options for visualization of quantitative proteomics data are available, and most analysis functions in GProX create customizable high quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis, providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open-source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net.
PMID:21602510
Zhang, Zhaowei; Li, Peiwu; Hu, Xiaofeng; Zhang, Qi; Ding, Xiaoxia; Zhang, Wen
2012-01-01
Chemical contaminants in food have caused serious health issues in both humans and animals. Microarray technology is an advanced technique suitable for the analysis of chemical contaminants. In particular, the immuno-microarray approach is one of the most promising methods for chemical contaminant analysis. The use of microarrays for the analysis of chemical contaminants is the subject of this review. Fabrication strategies and detection methods for chemical contaminants are discussed in detail. Application to the analysis of mycotoxins, biotoxins, pesticide residues, and pharmaceutical residues is also described. Finally, future challenges and opportunities are discussed.
Diagnosis of Zika Virus Infection by Peptide Array and Enzyme-Linked Immunosorbent Assay
Caciula, Adrian; Price, Adam; Thakkar, Riddhi; Ng, James; Chauhan, Lokendra V.; Jain, Komal; Che, Xiaoyu; Espinosa, Diego A.; Montoya Cruz, Magelda; Balmaseda, Angel; Sullivan, Eric H.; Patel, Jigar J.; Jarman, Richard G.; Rakeman, Jennifer L.; Egan, Christina T.; Reusken, Chantal B. E. M.; Koopmans, Marion P. G.; Harris, Eva; Tokarz, Rafal; Briese, Thomas
2018-01-01
ABSTRACT Zika virus (ZIKV) is implicated in fetal stillbirth, microcephaly, intracranial calcifications, and ocular anomalies following vertical transmission from infected mothers. In adults, infection may trigger autoimmune inflammatory polyneuropathy. Transmission most commonly follows the bite of infected Aedes mosquitoes but may also occur through sexual intercourse or receipt of blood products. Definitive diagnosis through detection of viral RNA is possible in serum or plasma within 10 days of disease onset, in whole blood within 3 weeks of onset, and in semen for up to 3 months. Serological diagnosis is nonetheless critical because few patients have access to molecular diagnostics during the acute phase of infection and infection may be associated with only mild or inapparent disease that does not prompt molecular testing. Serological diagnosis is confounded by cross-reactivity of immune sera with other flaviviruses endemic in the areas where ZIKV has recently emerged. Accordingly, we built a high-density microarray comprising nonredundant 12-mer peptides that tile, with one-residue overlap, the proteomes of Zika, dengue, yellow fever, West Nile, Ilheus, Oropouche, and chikungunya viruses. Serological analysis enabled discovery of a ZIKV NS2B 20-residue peptide that had high sensitivity (96.0%) and specificity (95.9%) versus natural infection with or vaccination against dengue, chikungunya, yellow fever, West Nile, tick-borne encephalitis, or Japanese encephalitis virus in a microarray assay and an enzyme-linked immunosorbent assay (ELISA) of early-convalescent-phase sera (2 to 3 weeks after onset of symptomatic infection). PMID:29511073
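The array design described above (12-mer peptides tiling several viral proteomes, with redundant peptides removed) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' pipeline: the function names are hypothetical, the "one-residue overlap" in the abstract is taken literally as one shared residue between consecutive peptides, and the input sequences are placeholders rather than real viral proteins.

```python
def tile_peptides(sequence, length=12, overlap=1):
    """Return peptides of `length` residues covering `sequence`.

    Consecutive peptides share `overlap` residues, so the window advances
    by `length - overlap` positions each step. A trailing stretch shorter
    than one full step is not covered in this simple version.
    """
    if not 0 <= overlap < length:
        raise ValueError("overlap must satisfy 0 <= overlap < length")
    step = length - overlap
    return [
        sequence[start:start + length]
        for start in range(0, len(sequence) - length + 1, step)
    ]


def nonredundant_tiles(proteomes, length=12, overlap=1):
    """Tile every sequence and keep each distinct peptide once.

    `proteomes` maps a (hypothetical) name to an amino-acid string.
    Order of first appearance is preserved.
    """
    seen = set()
    unique = []
    for sequence in proteomes.values():
        for peptide in tile_peptides(sequence, length, overlap):
            if peptide not in seen:
                seen.add(peptide)
                unique.append(peptide)
    return unique


if __name__ == "__main__":
    # Placeholder sequences, not real viral proteins.
    toy = {"virus_a": "ABCDEFGHIJKLMNOPQRSTUVW", "virus_b": "ABCDEFGHIJKLMNOPQRSTUVW"}
    print(nonredundant_tiles(toy))
```

Setting `overlap=11` would instead shift the window by a single residue, which is the other common reading of such tiling descriptions.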
A cross-omics toxicological evaluation of drinking water treated with different processes.
Shi, Peng; Jia, Shuyu; Zhang, Xu-Xiang; Zhao, Fuzheng; Chen, Yajun; Zhou, Qing; Cheng, Shupei; Li, Ai-Min
2014-04-30
Cross-omics profiling and phenotypic analysis were conducted to comprehensively assess the toxicities of source of drinking water (SDW), effluent of conventional treatment (ECT) and effluent of advanced treatment (EAT) in a water treatment plant. SDW feeding increased body weight, and relative liver and kidney weights of mice. Hepatic histopathological damages and serum biochemical alterations were observed in the mice fed with SDW and ECT, but EAT feeding showed no obvious effects. Transcriptomic analysis demonstrated that exposure to water samples caused differential expression of hundreds of genes in livers. Cluster analysis of the differentially expressed genes that were generated by both microarrays and digital gene expression showed similar grouping patterns. Proteomic and metabolomic analyses indicated that drinking SDW, ECT and EAT generated 59, 145 and 41 significantly altered proteins in livers and 8, 2 and 0 altered metabolites in serum, respectively. SDW was found to affect several metabolic pathways including metabolism of xenobiotics by cytochrome P450 and fatty acid metabolism. SDW and ECT might induce molecular toxicities to mice, but the advanced treatment process can reduce the potential health risk by effectively removing toxic chemicals in drinking water. Copyright © 2014 Elsevier B.V. All rights reserved.
Toward a Genome-Wide Systems Biology Analysis of Host-Pathogen Interactions in Group A Streptococcus
Musser, James M.; DeLeo, Frank R.
2005-01-01
Genome-wide analysis of microbial pathogens and molecular pathogenesis processes has become an area of considerable activity in the last 5 years. These studies have been made possible by several advances, including completion of the human genome sequence, publication of genome sequences for many human pathogens, development of microarray technology and high-throughput proteomics, and maturation of bioinformatics. Despite these advances, relatively little effort has been expended in the bacterial pathogenesis arena to develop and use integrated research platforms in a systems biology approach to enhance our understanding of disease processes. This review discusses progress made in exploiting an integrated genome-wide research platform to gain new knowledge about how the human bacterial pathogen group A Streptococcus causes disease. Results of these studies have provided many new avenues for basic pathogenesis research and translational research focused on development of an efficacious human vaccine and novel therapeutics. One goal in summarizing this line of study is to bring exciting new findings to the attention of the investigative pathology community. In addition, we hope the review will stimulate investigators to consider using analogous approaches for analysis of the molecular pathogenesis of other microbes. PMID:16314461
The National Cancer Institute will hold a public pre-application webinar on Friday, December 11 at 12:00 p.m. (EST) for the Funding Opportunity Announcements (FOAs) RFA-CA-15-021 entitled “Proteome Characterization Centers for Clinical Proteomic Tumor Analysis Consortium (U24)”, RFA-CA-15-022 entitled “Proteogenomic Translational Research Centers for Clinical Proteomic Tumor Analysis Consortium (U01)”, and RFA-CA-15-023 entitled “Proteogenomic Data Analysis Centers for Clinical Proteomic Tumor Analysis Consortium (U24)”.
Nakajima, Rie; Escudero, Raquel; Molina, Douglas M; Rodríguez-Vargas, Manuela; Randall, Arlo; Jasinskas, Algis; Pablo, Jozelyn; Felgner, Philip L; AuCoin, David P; Anda, Pedro; Davies, D Huw
2016-07-01
Tularemia in humans is caused mainly by two subspecies of the Gram-negative facultative anaerobe Francisella tularensis: F. tularensis subsp. tularensis (type A) and F. tularensis subsp. holarctica (type B). The current serological test for tularemia is based on agglutination of whole organisms, and the reactive antigens are not well understood. Previously, we profiled the antibody responses in type A and B tularemia cases in the United States using a proteome microarray of 1,741 different proteins derived from the type A strain Schu S4. Fifteen dominant antigens able to detect antibodies to both types of infection were identified, although these were not validated in a different immunoassay format. Since type A and B subspecies are closely related, we hypothesized that Schu S4 antigens would also have utility for diagnosing type B tularemia caused by strains from other geographic locations. To test this, we probed the Schu S4 array with sera from 241 type B tularemia cases in Spain. Despite there being no type A strains in Spain, we confirmed the responses against some of the same potential serodiagnostic antigens reported previously, as well as determined the responses against additional potential serodiagnostic antigens. Five potential serodiagnostic antigens were evaluated on immunostrips, and two of these (FTT1696/GroEL and FTT0975/conserved hypothetical protein) discriminated between the Spanish tularemia cases and healthy controls. We conclude that antigens from the type A strain Schu S4 are suitable for detection of antibodies from patients with type B F. tularensis infections and that these can be used for the diagnosis of tularemia in a deployable format, such as the immunostrip. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
PMID:27098957
Microarray and Proteomic Analysis of Breast Cancer Cell and Osteoblast Co-cultures
Morrison, Charlotte; Mancini, Stephanie; Cipollone, Jane; Kappelhoff, Reinhild; Roskelley, Calvin; Overall, Christopher
2011-01-01
Dynamic reciprocal interactions between a tumor and its microenvironment impact both the establishment and progression of metastases. These interactions are mediated, in part, through proteolytic sculpting of the microenvironment, particularly by the matrix metalloproteinases, with both tumors and stroma contributing to the proteolytic milieu. Because bone is one of the predominant sites of breast cancer metastases, we used a co-culture system in which a subpopulation of the highly invasive human breast cancer cell line MDA-MB-231, with increased propensity to metastasize to bone, was overlaid onto a monolayer of differentiated osteoblast MC3T3-E1 cells in a mineralized osteoid matrix. CLIP-CHIP® microarrays identified changes in the complete protease and inhibitor expression profile of the breast cancer and osteoblast cells that were induced upon co-culture. A large increase in osteoblast-derived MMP-13 mRNA and protein was observed. Affymetrix analysis and validation showed induction of MMP-13 was initiated by soluble factors produced by the breast tumor cells, including oncostatin M and the acute response apolipoprotein SAA3. Significant changes in the osteoblast secretomes upon addition of MMP-13 were identified by degradomics from which six novel MMP-13 substrates with the potential to functionally impact breast cancer metastasis to bone were identified and validated. These included inactivation of the chemokines CCL2 and CCL7, activation of platelet-derived growth factor-C, and cleavage of SAA3, osteoprotegerin, CutA, and antithrombin III. Hence, the influence of breast cancer metastases on the bone microenvironment that is executed via the induction of osteoblast MMP-13 with the potential to enhance metastases growth by generating a microenvironmental amplifying feedback loop is revealed. PMID:21784845
Contributions to Statistical Problems Related to Microarray Data
ERIC Educational Resources Information Center
Hong, Feng
2009-01-01
Microarrays are a high-throughput technology for measuring gene expression. Analysis of microarray data brings many interesting and challenging problems. This thesis consists of three studies related to microarray data. First, we propose a Bayesian model for microarray data and use Bayes Factors to identify differentially expressed genes. Second, we…
Hirayama, Mio; Kobayashi, Daiki; Mizuguchi, Souhei; Morikawa, Takashi; Nagayama, Megumi; Midorikawa, Uichi; Wilson, Masayo M; Nambu, Akiko N; Yoshizawa, Akiyasu C; Kawano, Shin; Araki, Norie
2013-05-01
Neurofibromatosis type 1 (NF1) tumor suppressor gene product, neurofibromin, functions in part as a Ras-GAP, and though its loss is implicated in the neuronal abnormality of NF1 patients, its precise cellular function remains unclear. To study the molecular mechanism of NF1 pathogenesis, we prepared NF1 gene knockdown (KD) PC12 cells, as a NF1 disease model, and analyzed their molecular (gene and protein) expression profiles with a unique integrated proteomics approach, comprising iTRAQ, 2D-DIGE, and DNA microarrays, using an integrated protein and gene expression analysis chart (iPEACH). In NF1-KD PC12 cells showing abnormal neuronal differentiation after NGF treatment, of 3198 molecules quantitatively identified and listed in iPEACH, 97 molecules continuously up- or down-regulated over time were extracted. Pathway and network analysis further revealed overrepresentation of calcium signaling and transcriptional regulation by glucocorticoid receptor (GR) in the up-regulated protein set, whereas nerve system development was overrepresented in the down-regulated protein set. The novel up-regulated network we discovered, "dynein IC2-GR-COX-1 signaling," was then examined in NF1-KD cells. Validation studies confirmed that NF1 knockdown induces altered splicing and phosphorylation patterns of dynein IC2 isomers, up-regulation and accumulation of nuclear GR, and increased COX-1 expression in NGF-treated cells. Moreover, the neurite retraction phenotype observed in NF1-KD cells was significantly recovered by knockdown of the dynein IC2-C isoform and COX-1. In addition, dynein IC2 siRNA significantly inhibited nuclear translocation and accumulation of GR and up-regulation of COX-1 expression. These results suggest that dynein IC2 up-regulates GR nuclear translocation and accumulation, and subsequently causes increased COX-1 expression, in this NF1 disease model. 
Our integrated proteomics strategy, which combines multiple approaches, demonstrates that NF1-related neural abnormalities are, in part, caused by up-regulation of dynein IC2-GR-COX-1 signaling, which may be a novel therapeutic target for NF1.
Stangeland, Biljana; Mughal, Awais A; Grieg, Zanina; Sandberg, Cecilie Jonsgar; Joel, Mrinal; Nygård, Ståle; Meling, Torstein; Murrell, Wayne; Vik Mo, Einar O; Langmoen, Iver A
2015-09-22
Glioblastoma (GBM) is both the most common and the most lethal primary brain tumor. It is thought that GBM stem cells (GSCs) are critically important in resistance to therapy. Therefore, there is a strong rationale to target these cells in order to develop new molecular therapies. To identify molecular targets in GSCs, we compared gene expression in GSCs to that in neural stem cells (NSCs) from the adult human brain, using microarrays. Bioinformatic filtering identified 20 genes (PBK/TOPK, CENPA, KIF15, DEPDC1, CDC6, DLG7/DLGAP5/HURP, KIF18A, EZH2, HMMR/RHAMM/CD168, NOL4, MPP6, MDM1, RAPGEF4, RHBDD1, FNDC3B, FILIP1L, MCC, ATXN7L4/ATXN7L1, P2RY5/LPAR6 and FAM118A) that were consistently expressed in GSC cultures and consistently not expressed in NSC cultures. The expression of these genes was confirmed in clinical samples (TCGA and REMBRANDT). The first nine genes were highly co-expressed in all GBM subtypes and were part of the same protein-protein interaction network. Furthermore, their combined up-regulation correlated negatively with patient survival in the mesenchymal GBM subtype. Using targeted proteomics and the COGNOSCENTE database, we linked these genes to GBM signalling pathways. Nine genes (PBK, CENPA, KIF15, DEPDC1, CDC6, DLG7, KIF18A, EZH2 and HMMR) should be further explored as targets for treatment of GBM.
A Proteomic Study of Brassinosteroid Response in Arabidopsis
Deng, Zhiping; Zhang, Xin; Tang, Wenqiang; Oses-Prieto, Juan A; Suzuki, Nagi; Gendron, Joshua M; Chen, Huanjing; Guan, Shenheng; Chalkley, Robert J.; Peterman, T. Kaye; Burlingame, Alma L.; Wang, Zhi-Yong
2010-01-01
Summary The plant steroid hormones brassinosteroids (BRs) play an important role in a wide range of developmental and physiological processes. How BR signaling regulates diverse processes remains unclear. To understand the molecular details of BR responses, we have performed a proteomic study of BR-regulated proteins in Arabidopsis using two-dimensional difference gel electrophoresis (2-D DIGE) coupled with liquid chromatography-tandem mass spectrometry (LC-MS/MS). We identified 42 BR-regulated proteins, which are predicted to play potential roles in BR regulation of specific cellular processes, such as signaling, cytoskeleton rearrangement, vesicle trafficking, and biosynthesis of hormones and vitamins. Analyses of the BR insensitive mutant bri1-116 and BR hypersensitive mutant bzr1-1D identified 5 proteins (PATL1, PATL2, THI1, AtMDAR3 and NADP-ME2) affected by both BR-treatment and in the mutants, suggesting their importance in BR action. Selected proteins were further studied using insertion knockout mutants or immunoblotting. Interestingly, about 80% of the BR-responsive proteins were not identified in previous microarray studies, and direct comparison between protein- and RNA changes in BR mutants revealed a very weak correlation. RT-PCR analysis of selected genes revealed gene-specific kinetic relationships between RNA and protein responses. Furthermore, BR-regulated posttranslational modification of BiP2 protein was detected as spot shifts in 2-D DIGE. This study provides novel insights into the molecular networks that link BR signaling to specific cellular and physiological responses. PMID:17848588
Microarray platform for omics analysis
NASA Astrophysics Data System (ADS)
Mecklenburg, Michael; Xie, Bin
2001-09-01
Microarray technology has revolutionized genetic analysis. However, limitations in genome analysis have led to renewed interest in establishing 'omic' strategies. As we enter the post-genomic era, new microarray technologies are needed to address these new classes of 'omic' targets, such as proteins, as well as lipids and carbohydrates. We have developed a microarray platform that combines self-assembling monolayers with the biotin-streptavidin system to provide a robust, versatile immobilization scheme. A hydrophobic film is patterned on the surface, creating an array of tension wells that eliminates evaporation effects, thereby reducing the shear stress to which biomolecules are exposed during immobilization. The streptavidin linker layer makes it possible to adapt and/or develop microarray-based assays using virtually any class of biomolecules, including carbohydrates, peptides, antibodies, and receptors, as well as the more traditional DNA-based arrays. Our microarray technology is designed to furnish seamless compatibility across the various 'omic' platforms by providing a common blueprint for fabricating and analyzing arrays. The prototype microarray uses a microscope slide footprint patterned with 2 by 96 flat wells. Data on the microarray platform will be presented.
Wilson, J. W.; Ott, C. M.; zu Bentrup, K. Höner; Ramamurthy, R.; Quick, L.; Porwollik, S.; Cheng, P.; McClelland, M.; Tsaprailis, G.; Radabaugh, T.; Hunt, A.; Fernandez, D.; Richter, E.; Shah, M.; Kilcoyne, M.; Joshi, L.; Nelman-Gonzalez, M.; Hing, S.; Parra, M.; Dumars, P.; Norwood, K.; Bober, R.; Devich, J.; Ruggles, A.; Goulart, C.; Rupert, M.; Stodieck, L.; Stafford, P.; Catella, L.; Schurr, M. J.; Buchanan, K.; Morici, L.; McCracken, J.; Allen, P.; Baker-Coleman, C.; Hammond, T.; Vogel, J.; Nelson, R.; Pierson, D. L.; Stefanyshyn-Piper, H. M.; Nickerson, C. A.
2007-01-01
A comprehensive analysis of both the molecular genetic and phenotypic responses of any organism to the space flight environment has never been accomplished because of significant technological and logistical hurdles. Moreover, the effects of space flight on microbial pathogenicity and associated infectious disease risks have not been studied. The bacterial pathogen Salmonella typhimurium was grown aboard Space Shuttle mission STS-115 and compared with identical ground control cultures. Global microarray and proteomic analyses revealed that 167 transcripts and 73 proteins changed expression with the conserved RNA-binding protein Hfq identified as a likely global regulator involved in the response to this environment. Hfq involvement was confirmed with a ground-based microgravity culture model. Space flight samples exhibited enhanced virulence in a murine infection model and extracellular matrix accumulation consistent with a biofilm. Strategies to target Hfq and related regulators could potentially decrease infectious disease risks during space flight missions and provide novel therapeutic options on Earth. PMID:17901201
Yu, Kebing; Salomon, Arthur R
2009-12-01
Recently, dramatic progress has been achieved in expanding the sensitivity, resolution, mass accuracy, and scan rate of mass spectrometers able to fragment and identify peptides through MS/MS. Unfortunately, this enhanced ability to acquire proteomic data has not been accompanied by a concomitant increase in the availability of flexible tools allowing users to rapidly assimilate, explore, and analyze this data and adapt to various experimental workflows with minimal user intervention. Here we fill this critical gap by providing a flexible relational database called PeptideDepot for organization of expansive proteomic data sets, collation of proteomic data with available protein information resources, and visual comparison of multiple quantitative proteomic experiments. Our software design, built upon the synergistic combination of a MySQL database for safe warehousing of proteomic data with a FileMaker-driven graphical user interface for flexible adaptation to diverse workflows, enables proteomic end-users to directly tailor the presentation of proteomic data to the unique analysis requirements of the individual proteomics lab. PeptideDepot may be deployed as an independent software tool or integrated directly with our high throughput autonomous proteomic pipeline used in the automated acquisition and post-acquisition analysis of proteomic data.
Microarrays in brain research: the good, the bad and the ugly.
Mirnics, K
2001-06-01
Making sense of microarray data is a complex process, in which the interpretation of findings will depend on the overall experimental design and judgement of the investigator performing the analysis. As a result, differences in tissue harvesting, microarray types, sample labelling and data analysis procedures make post hoc sharing of microarray data a great challenge. To ensure rapid and meaningful data exchange, we need to create some order out of the existing chaos. In these ground-breaking microarray standardization and data sharing efforts, NIH agencies should take a leading role.
Narihiro, Takashi; Kanosue, Yuji; Hiraishi, Akira
2016-06-25
This study was undertaken to examine the effects of water activity (aw) on the viability of actinobacterial isolates from a fed-batch composting (FBC) process by comparing culturability and stainability with 5-cyano-2,3-ditolyl tetrazolium chloride (CTC). The FBC reactor as the source of these bacteria was operated with the daily loading of household biowaste for 70 d. During this period of composting, aw in the reactor decreased linearly with time and reached approximately 0.95 at the end of operation. The plate counts of aerobic chemoorganotrophic bacteria were 3.2-fold higher than CTC-positive (CTC+) counts on average at the fully acclimated stage (after 7 weeks of operation), in which Actinobacteria predominated, as shown by lipoquinone profiling and cultivation methods. When the actinobacterial isolates from the FBC process were grown under aw stress, no significant differences were observed in culturability among the cultures, whereas CTC stainability decreased with reductions in aw levels. A cDNA microarray-based transcriptomic analysis of a representative isolate showed that many of the genes involved in cellular metabolism and genetic information processing were down-regulated by aw stress. This result was fully supported by a proteomic analysis. The results of the present study suggest that, in low aw mature compost, the metabolic activity of the community with Actinobacteria predominating is temporarily reduced to a level that hardly reacts with CTC; however, these bacteria are easily recoverable by exposure to a high aw culture medium. This may be a plausible reason why acclimated FBC reactors in which Actinobacteria predominate yield higher plate counts than CTC+ counts.
Jonckheere, Wim; Dermauw, Wannes; Zhurov, Vladimir; Wybouw, Nicky; Van den Bulcke, Jan; Villarroel, Carlos A; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C; Tirry, Luc; Baggerman, Geert; Clark, Richard M; Kant, Merijn R; Vanholme, Bartel; Menschaert, Gerben; Van Leeuwen, Thomas
2016-12-01
The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants (bean, maize, soy, and tomato) was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belong to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and these genes were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Jonckheere, Wim; Zhurov, Vladimir; Villarroel, Carlos A.; Greenhalgh, Robert; Grbić, Mike; Schuurink, Rob C.; Tirry, Luc; Kant, Merijn R.; Vanholme, Bartel
2016-01-01
The two-spotted spider mite Tetranychus urticae is an extremely polyphagous crop pest. Alongside an unparalleled detoxification potential for plant secondary metabolites, it has recently been shown that spider mites can attenuate or even suppress plant defenses. Salivary constituents, notably effectors, have been proposed to play an important role in manipulating plant defenses and might determine the outcome of plant-mite interactions. Here, the proteomic composition of saliva from T. urticae lines adapted to various host plants (bean, maize, soy, and tomato) was analyzed using a custom-developed feeding assay coupled with nano-LC tandem mass spectrometry. About 90 putative T. urticae salivary proteins were identified. Many are of unknown function, and in numerous cases belong to multimembered gene families. RNAseq expression analysis revealed that many genes coding for these salivary proteins were highly expressed in the proterosoma, the mite body region that includes the salivary glands. A subset of genes encoding putative salivary proteins was selected for whole-mount in situ hybridization, and these genes were found to be expressed in the anterior and dorsal podocephalic glands. Strikingly, host plant dependent expression was evident for putative salivary proteins, and was further studied in detail by micro-array based genome-wide expression profiling. This meta-analysis revealed for the first time the salivary protein repertoire of a phytophagous chelicerate. The availability of this salivary proteome will assist in unraveling the molecular interface between phytophagous mites and their host plants, and may ultimately facilitate the development of mite-resistant crops. Furthermore, the technique used in this study is a time- and resource-efficient method to examine the salivary protein composition of other small arthropods for which saliva or salivary glands cannot be isolated easily. PMID:27703040
Dittmer, Neal T; Hiromasa, Yasuaki; Tomich, John M; Lu, Nanyan; Beeman, Richard W; Kramer, Karl J; Kanost, Michael R
2012-01-01
The insect cuticle is a composite biomaterial made up primarily of chitin and proteins. The physical properties of the cuticle can vary greatly from hard and rigid to soft and flexible. Understanding how different cuticle types are assembled can aid in the development of novel biomimetic materials for use in medicine and technology. Toward this goal, we have taken a combined proteomics and transcriptomics approach with the red flour beetle, Tribolium castaneum, to examine the protein and gene expression profiles of the elytra and hindwings, appendages that contain rigid and soft cuticles, respectively. Two-dimensional gel electrophoresis analysis revealed distinct differences in the protein profiles between elytra and hindwings, with four highly abundant proteins dominating the elytral cuticle extract. MALDI/TOF mass spectrometry identified 19 proteins homologous to known or hypothesized cuticular proteins (CPs), including a novel low complexity protein enriched in charged residues. Microarray analysis identified 372 genes with a 10-fold or greater difference in transcript levels between elytra and hindwings. CP genes with higher expression in the elytra belonged to the Rebers and Riddiford family (CPR) type 2, or cuticular proteins of low complexity (CPLC) enriched in glycine or proline. In contrast, a majority of the CP genes with higher expression in hindwings were classified as CPR type 1, cuticular proteins analogous to peritrophins (CPAP), or members of the Tweedle family. This research shows that the elytra and hindwings, representatives of rigid and soft cuticles, have different protein and gene expression profiles for structural proteins that may influence the mechanical properties of these cuticles.
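The 10-fold selection criterion mentioned in the abstract above can be illustrated with a minimal Python sketch. The abstract does not describe how the authors computed or filtered fold differences, so the symmetric log2 comparison, the function name `at_least_n_fold`, and the toy intensity values below are all assumptions made purely for illustration.

```python
import math

def at_least_n_fold(elytra: float, hindwing: float, n: float = 10.0) -> bool:
    """Return True when two positive transcript levels differ by n-fold or more.

    Working on the log2 scale treats up- and down-regulation symmetrically,
    so a 10-fold excess in either tissue passes the filter.
    """
    if elytra <= 0 or hindwing <= 0:
        raise ValueError("transcript levels must be positive")
    return abs(math.log2(elytra / hindwing)) >= math.log2(n)

# Hypothetical (elytra, hindwing) intensities; not data from the study.
toy_levels = {
    "gene_a": (1200.0, 100.0),  # 12-fold higher in elytra
    "gene_b": (50.0, 600.0),    # 12-fold higher in hindwings
    "gene_c": (300.0, 250.0),   # 1.2-fold, below the cutoff
}

selected = sorted(
    name for name, (e, h) in toy_levels.items() if at_least_n_fold(e, h)
)
print(selected)
```

In practice a fold-change cutoff of this kind is usually combined with a statistical test, which this sketch leaves out.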
Avens, Heather J.; Bowman, Christopher N.
2009-01-01
Antibody microarrays are a critical tool for proteomics, requiring broad, highly sensitive detection of numerous low abundance biomarkers. Fluorescent polymerization-based amplification (FPBA) is presented as a novel, non-enzymatic signal amplification method that takes advantage of the chain-reaction nature of radical polymerization to achieve a highly amplified fluorescent response. A streptavidin-eosin conjugate localizes eosin photoinitiators for polymerization on the chip where biotinylated target protein is bound. The chip is contacted with acrylamide as a monomer, N-methyldiethanolamine as a coinitiator and yellow/green fluorescent nanoparticles (NPs) which, upon initiation, combine to form a macroscopically visible and highly fluorescent film. The rapid polymerization kinetics and the presence of cross-linker favor entrapment of the fluorescent NPs in the polymer, enabling highly sensitive fluorescent biodetection. This method is demonstrated as being appropriate for antibody microarrays and is compared to detection approaches which utilize streptavidin-FITC (SA-FITC) and streptavidin-labeled yellow/green NPs (SA-NPs). It is found that FPBA is able to detect 0.16 (± 0.01) biotin-antibody/µm² (or 40 zeptomole surface-bound target molecules), while SA-FITC has a limit of detection of 31 (± 1) biotin-antibody/µm² and SA-NPs fail to achieve any significant signal under the conditions evaluated here. Further, FPBA in conjunction with fluorescent stereomicroscopy yields equal or better sensitivity compared to fluorescent detection of SA-eosin using a much more costly microarray scanner. By facilitating highly sensitive detection, FPBA is expected to enable detection of low abundance antigens and also make possible a transition towards less expensive fluorescence detection instrumentation. PMID:19508906
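The two detection-limit figures quoted above (a surface density and a zeptomole amount) can be related by simple arithmetic. The Python sketch below converts 40 zeptomoles to a molecule count and then derives the surface area that would hold that many molecules at 0.16 per square micrometre. It assumes both figures describe the same detection spot, which the abstract does not state explicitly; the function names are illustrative only.

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI value)

def zeptomoles_to_molecules(zmol: float) -> float:
    """Convert an amount in zeptomoles (1e-21 mol) to a number of molecules."""
    return zmol * 1e-21 * AVOGADRO

def implied_area_um2(molecules: float, density_per_um2: float) -> float:
    """Area in square micrometres that holds `molecules` at the given surface density."""
    return molecules / density_per_um2

molecules = zeptomoles_to_molecules(40)     # roughly 2.4e4 molecules
area = implied_area_um2(molecules, 0.16)    # roughly 1.5e5 square micrometres
print(f"{molecules:.0f} molecules over about {area:.0f} um^2")
```

Under that assumption, 40 zeptomoles corresponds to about 24,000 molecules spread over roughly 0.15 mm², a plausible size for a single microarray spot.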
Villers, Jennifer; Savocco, Jérôme; Szopinska, Aleksandra; Degand, Hervé; Nootens, Sylvain; Morsomme, Pierre
2017-09-01
Yeast cells, to be able to grow on a wide variety of nitrogen sources, regulate the set of nitrogen transporters present at their plasma membrane. Such regulation relies on both transcriptional and post-translational events. Although microarray studies have identified most nitrogen-sensitive genes, nitrogen-induced post-translational regulation has only been studied for very few proteins, among them the general amino acid permease Gap1. Adding a preferred nitrogen source to proline-grown cells triggers Gap1 endocytosis and vacuolar degradation in an Rsp5-Bul1/2-dependent manner. Here, we used a proteomic approach to follow the dynamics of the plasma membrane proteome after addition of a preferred nitrogen source. We identified new targets of the nitrogen regulation and four transporters of poor nitrogen sources (Put4, Opt2, Dal5, and Ptr2) that rapidly decrease in abundance. Although the kinetics is different for each transporter, we found that three of them (Put4, Dal5, and Ptr2) are endocytosed, like Gap1, in an Rsp5-dependent manner and degraded in the vacuole. Finally, we showed that Gap1 stabilization at the plasma membrane, through deletion of Bul proteins, regulates the abundance of Put4, Dal5 and Ptr2. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Escobar-Hoyos, Luisa F; Yang, Jie; Zhu, Jiawen; Cavallo, Julie-Ann; Zhai, Haiyan; Burke, Stephanie; Koller, Antonius; Chen, Emily I; Shroyer, Kenneth R
2014-01-01
Most previously described immunohistochemical markers of cervical high-grade squamous intraepithelial lesion (HSIL) and squamous cell carcinoma may help to improve diagnostic accuracy but have a minimal prognostic value. The goals of the current study were to identify and validate novel candidate biomarkers that could potentially improve diagnostic and prognostic accuracy for cervical HSIL and squamous cell carcinoma. Microdissected tissue sections from formalin-fixed paraffin-embedded normal ectocervical squamous mucosa, low-grade squamous intraepithelial lesion (LSIL), HSIL and squamous cell carcinoma sections were analyzed by mass spectrometry-based shotgun proteomics for biomarker discovery. The diagnostic specificity of candidate biomarkers was subsequently evaluated by immunohistochemical analysis of tissue microarrays. Among 1750 proteins identified by proteomic analyses, keratin 4 (KRT4) and keratin 17 (KRT17) showed reciprocal patterns of expression in the spectrum of cases ranging from normal ectocervical squamous mucosa to squamous cell carcinoma. Immunohistochemical studies confirmed that KRT4 expression was significantly decreased in squamous cell carcinoma compared with the other diagnostic categories. By contrast, KRT17 expression was significantly increased in HSIL and squamous cell carcinoma compared with normal ectocervical squamous mucosa and LSIL. KRT17 was also highly expressed in immature squamous metaplasia and in endocervical reserve cells but was generally not detected in mature squamous metaplasia. Furthermore, high levels of KRT17 expression were significantly associated with poor survival of squamous cell carcinoma patients (Hazard ratio = 14.76, P = 0.01). 
In summary, both KRT4 and KRT17 expressions are related to the histopathology of the cervical squamous mucosa; KRT17 is highly overexpressed in immature squamous metaplasia, in HSIL, and in squamous cell carcinoma and the level of KRT17 in squamous cell carcinoma may help to identify patients who are at greatest risk for cervical cancer mortality. PMID:24051697
DOE Office of Scientific and Technical Information (OSTI.GOV)
Men, Yujie; Feil, Helene; Verberkmoes, Nathan C
2012-01-01
Dehalococcoides ethenogenes strain 195 (DE195) was grown in a sustainable syntrophic association with Desulfovibrio vulgaris Hildenborough (DVH) as a co-culture, as well as with DVH and the hydrogenotrophic methanogen Methanobacterium congolense (MC) as a tri-culture using lactate as the sole energy and carbon source. In the co- and tri-cultures, maximum dechlorination rates of DE195 were enhanced by approximately three times (11.0 ± 0.01 μmol per day for the co-culture and 10.1 ± 0.3 μmol per day for the tri-culture) compared with DE195 grown alone (3.8 ± 0.1 μmol per day). Cell yield of DE195 was enhanced in the co-culture (9.0 ± 0.5 × 10^7 cells per μmol Cl⁻ released, compared with 6.8 ± 0.9 × 10^7 cells per μmol Cl⁻ released for the pure culture), whereas no further enhancement was observed in the tri-culture (7.3 ± 1.8 × 10^7 cells per μmol Cl⁻ released). The transcriptome of DE195 grown in the co-culture was analyzed using a whole-genome microarray targeting DE195, which detected 102 significantly up- or down-regulated genes compared with DE195 grown in isolation, whereas no significant transcriptomic difference was observed between co- and tri-cultures. Proteomic analysis showed that 120 proteins were differentially expressed in the co-culture compared with DE195 grown in isolation. Physiological, transcriptomic and proteomic results indicate that the robust growth of DE195 in co- and tri-cultures is because of the advantages associated with the capabilities of DVH to ferment lactate to provide H2 and acetate for growth, along with potential benefits from proton translocation, cobalamin-salvaging and amino acid biosynthesis, whereas MC in the tri-culture provided no significant additional benefits beyond those of DVH.
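The "approximately three times" enhancement stated in the abstract above can be checked directly from the quoted mean dechlorination rates. The short Python sketch below divides each mixed-culture rate by the pure-culture rate. It uses only the mean values and ignores the reported uncertainties, and the helper name `fold_change` is an illustrative choice rather than anything from the study.

```python
def fold_change(rate: float, baseline: float) -> float:
    """Ratio of a rate to a positive baseline rate."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return rate / baseline

# Mean maximum dechlorination rates per day, as quoted in the abstract.
alone = 3.8
mixed = {"co-culture": 11.0, "tri-culture": 10.1}

folds = {name: fold_change(rate, alone) for name, rate in mixed.items()}
for name, value in folds.items():
    print(f"{name}: {value:.2f}-fold over DE195 alone")
```

Both ratios fall a little under three (about 2.9 and 2.7), consistent with the abstract's rounded statement.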
National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have released a dataset of proteins and phosphopeptides identified through deep proteomic and phosphoproteomic analysis of breast tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA).
National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have just released a comprehensive dataset of the proteomic analysis of high grade serous ovarian tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA). This is one of the largest public datasets covering the proteome, phosphoproteome and glycoproteome with complementary deep genomic sequencing data on the same tumor.
Multiplex infectious disease microarrays: STAT serology on a drop of blood
NASA Astrophysics Data System (ADS)
Ewart, Tom; Tarnopolsky, Mark; Baker, Steve; Raha, Sandeep; Wong, Yuen-Yee; Ciebiera, Kathy
2009-06-01
New and resurgent viral and antibiotic-resistant bacterial diseases are being encountered worldwide. The US CDC now ranks hospital acquired infections among the top 10 leading causes of death in the US, costing $20 billion annually. Such nosocomial infections presently affect 5% - 10% of hospitalized patients leading to 2 million cases and 99,000 deaths annually. Until now, assays available to mount comprehensive surveillance of infectious disease exposure by biosecurity agencies and hospital infection control units have been too slow and too costly. In earlier clinical studies we have reported proteomic microarrays combining 13 autoimmune and 26 viral and bacterial pathogens that revealed correlations between autoimmune diseases and antecedent infections. In this work we have expanded the array to 40 viruses and bacteria and investigated a suspected role of human endogenous retroviruses in autoimmune neuropathies. Using scanning laser imaging, and fluorescence color multiplexing, serum IgG and IgM responses are measured concurrently on the same array, for 14 arrays (patient samples) per microscope slide in 15 minutes. Other advantages include internal calibration, 10 μL sample size, increased laboratory efficiency, and potential factor of 100 cost reduction.
Rezk, Amgad R; Ramesan, Shwathy; Yeo, Leslie Y
2018-01-30
The microarray titre plate remains a fundamental workhorse in genomic, proteomic and cellomic analyses that underpin the drug discovery process. Nevertheless, liquid handling technologies for sample dispensing, processing and transfer have not progressed significantly beyond conventional robotic micropipetting techniques, which are not only at their fundamental sample size limit, but are also prone to mechanical failure and contamination. This is because alternative technologies to date suffer from a number of constraints, mainly their limitation to carry out only a single liquid operation such as dispensing or mixing at a given time, and their inability to address individual wells, particularly at high throughput. Here, we demonstrate the possibility for true sequential or simultaneous single- and multi-well addressability in a 96-well plate using a reconfigurable modular platform from which MHz-order hybrid surface and bulk acoustic waves can be coupled to drive a variety of microfluidic modes including mixing, sample preconcentration and droplet jetting/ejection in individual or multiple wells on demand, thus constituting a highly versatile yet simple setup capable of improving the functionality of existing laboratory protocols and processes.
Trends in imprint lithography for biological applications.
Truskett, Van N; Watts, Michael P C
2006-07-01
Imprint lithography is emerging as an alternative nano-patterning technology to traditional photolithography that permits the fabrication of 2D and 3D structures with <100 nm resolution, patterning and modification of functional materials other than photoresist and is low cost, with operational ease for use in developing bio-devices. Techniques for imprint lithography, categorized as either 'molding and embossing' or 'transfer printing', will be discussed in the context of microarrays for genomics, proteomics and tissue engineering. Specifically, fabrication by nanoimprint lithography (NIL), UV-NIL, step and flash imprint lithography (S-FIL), micromolding by elastomeric stamps and micro- and nano-contact printing will be reviewed.
Halligan, Brian D.; Geiger, Joey F.; Vallejos, Andrew K.; Greene, Andrew S.; Twigger, Simon N.
2009-01-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step by step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center website (http://proteomics.mcw.edu/vipdac). PMID:19358578
Halligan, Brian D; Geiger, Joey F; Vallejos, Andrew K; Greene, Andrew S; Twigger, Simon N
2009-06-01
One of the major difficulties for many laboratories setting up proteomics programs has been obtaining and maintaining the computational infrastructure required for the analysis of the large flow of proteomics data. We describe a system that combines distributed cloud computing and open source software to allow laboratories to set up scalable virtual proteomics analysis clusters without the investment in computational hardware or software licensing fees. Additionally, the pricing structure of distributed computing providers, such as Amazon Web Services, allows laboratories or even individuals to have large-scale computational resources at their disposal at a very low cost per run. We provide detailed step-by-step instructions on how to implement the virtual proteomics analysis clusters as well as a list of current available preconfigured Amazon machine images containing the OMSSA and X!Tandem search algorithms and sequence databases on the Medical College of Wisconsin Proteomics Center Web site ( http://proteomics.mcw.edu/vipdac ).
Experimental Demyelination and Axonal Loss Are Reduced in MicroRNA-146a Deficient Mice.
Martin, Nellie A; Molnar, Viktor; Szilagyi, Gabor T; Elkjaer, Maria L; Nawrocki, Arkadiusz; Okarmus, Justyna; Wlodarczyk, Agnieszka; Thygesen, Eva K; Palkovits, Miklos; Gallyas, Ferenc; Larsen, Martin R; Lassmann, Hans; Benedikz, Eirikur; Owens, Trevor; Svenningsen, Asa F; Illes, Zsolt
2018-01-01
The cuprizone (CPZ) model of multiple sclerosis (MS) was used to identify microRNAs (miRNAs) related to in vivo de- and remyelination. We further investigated the role of miR-146a in miR-146a-deficient (KO) mice: this miRNA is differentially expressed in MS lesions and promotes differentiation of oligodendrocyte precursor cells (OPCs) during remyelination, but its role has not been examined during demyelination. MicroRNAs were examined by Agilent Mouse miRNA Microarray in the corpus callosum during CPZ-induced demyelination and remyelination. Demyelination, axonal loss, and changes in the number of oligodendrocytes, OPCs, and macrophages/microglia were compared by histology/immunohistochemistry between KO and WT mice. Differential expression of target genes and proteins of miR-146a was analyzed in the transcriptome (4 × 44K Agilent Whole Mouse Genome Microarray) and proteome (liquid chromatography tandem mass spectrometry) of CPZ-induced de- and remyelination in WT mice. Levels of proinflammatory molecules in the corpus callosum were compared in WT versus KO mice by Meso Scale Discovery multiplex protein analysis. miR-146a was increasingly upregulated during CPZ-induced de- and remyelination. The absence of miR-146a in KO mice protected against demyelination, axonal loss, body weight loss, and atrophy of thymus and spleen. The number of CNP+ oligodendrocytes was increased during demyelination in the miR-146a KO mice, while there was a trend of increased number of NG2+ OPCs in the WT mice. miR-146a target genes, SNAP25 and SMAD4, were downregulated in the proteome of demyelinating corpus callosum in WT mice. Higher levels of SNAP25 were measured by ELISA in the corpus callosum of miR-146a KO mice, but there was no difference between KO and WT mice during demyelination. Multiplex protein analysis of the corpus callosum lysate revealed upregulated TNF-RI, TNF-RII, and CCL2 in the WT mice in contrast to KO mice.
The number of Mac3+ and Iba1+ macrophages/microglia was reduced in the demyelinating corpus callosum of the KO mice. During demyelination, absence of miR-146a reduced inflammatory responses, demyelination, axonal loss, the number of infiltrating macrophages, and increased the number of myelinating oligodendrocytes. The number of OPCs was slightly higher in the WT mice during remyelination, indicating a complex role of miR-146a during in vivo de- and remyelination.
Yu, Kebing; Salomon, Arthur R.
2010-01-01
Recently, dramatic progress has been achieved in expanding the sensitivity, resolution, mass accuracy, and scan rate of mass spectrometers able to fragment and identify peptides through tandem mass spectrometry (MS/MS). Unfortunately, this enhanced ability to acquire proteomic data has not been accompanied by a concomitant increase in the availability of flexible tools allowing users to rapidly assimilate, explore, and analyze this data and adapt to a variety of experimental workflows with minimal user intervention. Here we fill this critical gap by providing a flexible relational database called PeptideDepot for organization of expansive proteomic data sets, collation of proteomic data with available protein information resources, and visual comparison of multiple quantitative proteomic experiments. Our software design, built upon the synergistic combination of a MySQL database for safe warehousing of proteomic data with a FileMaker-driven graphical user interface for flexible adaptation to diverse workflows, enables proteomic end-users to directly tailor the presentation of proteomic data to the unique analysis requirements of the individual proteomics lab. PeptideDepot may be deployed as an independent software tool or integrated directly with our High Throughput Autonomous Proteomic Pipeline (HTAPP) used in the automated acquisition and post-acquisition analysis of proteomic data. PMID:19834895
The National Cancer Institute is soliciting applications for the reissuance of its Clinical Proteomic Tumor Analysis Consortium (CPTAC) program. CPTAC will support broad efforts focused on several cancer types to explore further the complexities of cancer proteomes and their connections to abnormalities in cancer genomes.
WebArray: an online platform for microarray data analysis
Xia, Xiaoqin; McClelland, Michael; Wang, Yipeng
2005-01-01
Background Many cutting-edge microarray analysis tools and algorithms, including the commonly used limma and affy packages in Bioconductor, require sophisticated knowledge of mathematics and statistics as well as computer skills for implementation. Commercially available software can provide a user-friendly interface at considerable cost. To facilitate the use of these tools for microarray data analysis on an open platform we developed an online microarray data analysis platform, WebArray, for bench biologists to utilize these tools to explore data from single/dual color microarray experiments. Results The currently implemented functions are based on the limma and affy packages from Bioconductor, the spacings LOESS histogram (SPLOSH) method, a PCA-assisted normalization method and a genome mapping method. WebArray incorporates these packages and provides a user-friendly interface for accessing a wide range of key functions of limma and others, such as spot quality weight, background correction, graphical plotting, normalization, linear modeling, empirical Bayes statistical analysis, false discovery rate (FDR) estimation, and chromosomal mapping for genome comparison. Conclusion WebArray offers a convenient platform for bench biologists to access several cutting-edge microarray data analysis tools. The website is freely available. It runs on a Linux server with Apache and MySQL. PMID:16371165
Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping
NASA Technical Reports Server (NTRS)
Royce, Thomas E.; Rozowsky, Joel S.; Bertone, Paul; Samanta, Manoj; Stolc, Viktor; Weissman, Sherman; Snyder, Michael; Gerstein, Mark
2005-01-01
Traditional microarrays use probes complementary to known genes to quantitate the differential gene expression between two or more conditions. Genomic tiling microarray experiments differ in that probes that span a genomic region at regular intervals are used to detect the presence or absence of transcription. This difference means the same sets of biases and the methods for addressing them are unlikely to be relevant to both types of experiment. We introduce the informatics challenges arising in the analysis of tiling microarray experiments as open problems to the scientific community and present initial approaches for the analysis of this nascent technology.
CPTAC | Office of Cancer Clinical Proteomics Research
The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) is a national effort to accelerate the understanding of the molecular basis of cancer through the application of large-scale proteome and genome analysis, or proteogenomics.
Comparative bioinformatics analyses and profiling of lysosome-related organelle proteomes
NASA Astrophysics Data System (ADS)
Hu, Zhang-Zhi; Valencia, Julio C.; Huang, Hongzhan; Chi, An; Shabanowitz, Jeffrey; Hearing, Vincent J.; Appella, Ettore; Wu, Cathy
2007-01-01
Complete and accurate profiling of cellular organelle proteomes, while challenging, is important for the understanding of detailed cellular processes at the organelle level. Mass spectrometry technologies coupled with bioinformatics analysis provide an effective approach for protein identification and functional interpretation of organelle proteomes. In this study, we have compiled human organelle reference datasets from large-scale proteomic studies and protein databases for seven lysosome-related organelles (LROs), as well as the endoplasmic reticulum and mitochondria, for comparative organelle proteome analysis. Heterogeneous sources of human organelle proteins and rodent homologs are mapped to human UniProtKB protein entries based on ID and/or peptide mappings, followed by functional annotation and categorization using the iProXpress proteomic expression analysis system. Cataloging organelle proteomes allows close examination of both shared and unique proteins among various LROs and reveals their functional relevance. The proteomic comparisons show that LROs are a closely related family of organelles. The shared proteins indicate the dynamic and hybrid nature of LROs, while the unique transmembrane proteins may represent additional candidate marker proteins for LROs. This comparative analysis, therefore, provides a basis for hypothesis formulation and experimental validation of organelle proteins and their functional roles.
Guebel, Daniel V; Torres, Néstor V
2016-01-01
Motivation: In the brain of elderly-healthy individuals, the effects of sexual dimorphism and those due to normal aging appear overlapped. Discrimination of these two dimensions would powerfully contribute to a better understanding of the etiology of some neurodegenerative diseases, such as "sporadic" Alzheimer. Methods: Following a system biology approach, top-down and bottom-up strategies were combined. First, public transcriptome data corresponding to the transition from adulthood to the aging stage in normal, human hippocampus were analyzed through an optimized microarray post-processing (Q-GDEMAR method) together with a proper experimental design (full factorial analysis). Second, the identified genes were placed in context by building compatible networks. The subsequent ontology analyses carried out on these networks clarify the main functionalities involved. Results: Noticeably we could identify large sets of genes according to three groups: those that exclusively depend on the sex, those that exclusively depend on the age, and those that depend on the particular combinations of sex and age (interaction). The genes identified were validated against three independent sources (a proteomic study of aging, a senescence database, and a mitochondrial genetic database). We arrived to several new inferences about the biological functions compromised during aging in two ways: by taking into account the sex-independent effects of aging, and considering the interaction between age and sex where pertinent. In particular, we discuss the impact of our findings on the functions of mitochondria, autophagy, mitophagia, and microRNAs. Conclusions: The evidence obtained herein supports the occurrence of significant neurobiological differences in the hippocampus, not only between adult and elderly individuals, but between old-healthy women and old-healthy men. 
Hence, to obtain realistic results in further analysis of the transition from the normal aging to incipient Alzheimer, the features derived from the sexual dimorphism in hippocampus should be explicitly considered. PMID:27761111
Gong, Wei; He, Kun; Covington, Mike; Dinesh-Kumar, S. P.; Snyder, Michael; Harmer, Stacey L.; Zhu, Yu-Xian; Deng, Xing Wang
2009-01-01
We used our collection of Arabidopsis transcription factor (TF) ORFeome clones to construct protein microarrays containing as many as 802 TF proteins. These protein microarrays were used for both protein-DNA and protein-protein interaction analyses. For protein-DNA interaction studies, we examined AP2/ERF family TFs and their cognate cis-elements. By careful comparison of the DNA-binding specificity of 13 TFs on the protein microarray with previous non-microarray data, we showed that protein microarrays provide an efficient and high throughput tool for genome-wide analysis of TF-DNA interactions. This microarray protein-DNA interaction analysis allowed us to derive a comprehensive view of DNA-binding profiles of AP2/ERF family proteins in Arabidopsis. It also revealed four TFs that bound the EE (evening element) and had the expected phased gene expression under clock-regulation, thus providing a basis for further functional analysis of their roles in clock regulation of gene expression. We also developed procedures for detecting protein interactions using this TF protein microarray and discovered four novel partners that interact with HY5, which can be validated by yeast two-hybrid assays. Thus, plant TF protein microarrays offer an attractive high-throughput alternative to traditional techniques for TF functional characterization on a global scale. PMID:19802365
Karyotype versus microarray testing for genetic abnormalities after stillbirth.
Reddy, Uma M; Page, Grier P; Saade, George R; Silver, Robert M; Thorsten, Vanessa R; Parker, Corette B; Pinar, Halit; Willinger, Marian; Stoll, Barbara J; Heim-Hall, Josefine; Varner, Michael W; Goldenberg, Robert L; Bukowski, Radek; Wapner, Ronald J; Drews-Botsch, Carolyn D; O'Brien, Barbara M; Dudley, Donald J; Levy, Brynn
2012-12-06
Genetic abnormalities have been associated with 6 to 13% of stillbirths, but the true prevalence may be higher. Unlike karyotype analysis, microarray analysis does not require live cells, and it detects small deletions and duplications called copy-number variants. The Stillbirth Collaborative Research Network conducted a population-based study of stillbirth in five geographic catchment areas. Standardized postmortem examinations and karyotype analyses were performed. A single-nucleotide polymorphism array was used to detect copy-number variants of at least 500 kb in placental or fetal tissue. Variants that were not identified in any of three databases of apparently unaffected persons were then classified into three groups: probably benign, clinical significance unknown, or pathogenic. We compared the results of karyotype and microarray analyses of samples obtained after delivery. In our analysis of samples from 532 stillbirths, microarray analysis yielded results more often than did karyotype analysis (87.4% vs. 70.5%, P<0.001) and provided better detection of genetic abnormalities (aneuploidy or pathogenic copy-number variants, 8.3% vs. 5.8%; P=0.007). Microarray analysis also identified more genetic abnormalities among 443 antepartum stillbirths (8.8% vs. 6.5%, P=0.02) and 67 stillbirths with congenital anomalies (29.9% vs. 19.4%, P=0.008). As compared with karyotype analysis, microarray analysis provided a relative increase in the diagnosis of genetic abnormalities of 41.9% in all stillbirths, 34.5% in antepartum stillbirths, and 53.8% in stillbirths with anomalies. Microarray analysis is more likely than karyotype analysis to provide a genetic diagnosis, primarily because of its success with nonviable tissue, and is especially valuable in analyses of stillbirths with congenital anomalies or in cases in which karyotype results cannot be obtained. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development.).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gritsenko, Marina A.; Xu, Zhe; Liu, Tao
Comprehensive, quantitative information on abundances of proteins and their post-translational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labelling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification and quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples, and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.
Gritsenko, Marina A; Xu, Zhe; Liu, Tao; Smith, Richard D
2016-01-01
Comprehensive, quantitative information on abundances of proteins and their posttranslational modifications (PTMs) can potentially provide novel biological insights into diseases pathogenesis and therapeutic intervention. Herein, we introduce a quantitative strategy utilizing isobaric stable isotope-labeling techniques combined with two-dimensional liquid chromatography-tandem mass spectrometry (2D-LC-MS/MS) for large-scale, deep quantitative proteome profiling of biological samples or clinical specimens such as tumor tissues. The workflow includes isobaric labeling of tryptic peptides for multiplexed and accurate quantitative analysis, basic reversed-phase LC fractionation and concatenation for reduced sample complexity, and nano-LC coupled to high resolution and high mass accuracy MS analysis for high confidence identification and quantification of proteins. This proteomic analysis strategy has been successfully applied for in-depth quantitative proteomic analysis of tumor samples and can also be used for integrated proteome and PTM characterization, as well as comprehensive quantitative proteomic analysis across samples from large clinical cohorts.
Emerging Use of Gene Expression Microarrays in Plant Physiology
Wullschleger, Stan D.; Difazio, Stephen P.
2003-01-01
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide high-throughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.
Human body fluid proteome analysis
Hu, Shen; Loo, Joseph A.; Wong, David T.
2010-01-01
The focus of this article is to review the recent advances in proteome analysis of human body fluids, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, and amniotic fluid, as well as its applications to human disease biomarker discovery. We aim to summarize the proteomics technologies currently used for global identification and quantification of body fluid proteins, and elaborate the putative biomarkers discovered for a variety of human diseases through human body fluid proteome (HBFP) analysis. Some critical concerns and perspectives in this emerging field are also discussed. With the advances made in proteomics technologies, the impact of HBFP analysis in the search for clinically relevant disease biomarkers would be realized in the future. PMID:17083142
Human body fluid proteome analysis.
Hu, Shen; Loo, Joseph A; Wong, David T
2006-12-01
The focus of this article is to review the recent advances in proteome analysis of human body fluids, including plasma/serum, urine, cerebrospinal fluid, saliva, bronchoalveolar lavage fluid, synovial fluid, nipple aspirate fluid, tear fluid, and amniotic fluid, as well as its applications to human disease biomarker discovery. We aim to summarize the proteomics technologies currently used for global identification and quantification of body fluid proteins, and elaborate the putative biomarkers discovered for a variety of human diseases through human body fluid proteome (HBFP) analysis. Some critical concerns and perspectives in this emerging field are also discussed. With the advances made in proteomics technologies, the impact of HBFP analysis in the search for clinically relevant disease biomarkers would be realized in the future.
A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In recent years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that includes 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, such as general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. 
Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
Bensaddek, Dalila; Narayan, Vikram; Nicolas, Armel; Murillo, Alejandro Brenes; Gartner, Anton; Kenyon, Cynthia J; Lamond, Angus I
2016-02-01
Proteomics studies typically analyze proteins at a population level, using extracts prepared from tens of thousands to millions of cells. The resulting measurements correspond to average values across the cell population and can mask considerable variation in protein expression and function between individual cells or organisms. Here, we report the development of micro-proteomics for the analysis of Caenorhabditis elegans, a eukaryote composed of 959 somatic cells and ∼1500 germ cells, measuring the worm proteome at a single organism level to a depth of ∼3000 proteins. This includes detection of proteins across a wide dynamic range of expression levels (>6 orders of magnitude), including many chromatin-associated factors involved in chromosome structure and gene regulation. We apply the micro-proteomics workflow to measure the global proteome response to heat-shock in individual nematodes. This shows variation between individual animals in the magnitude of proteome response following heat-shock, including variable induction of heat-shock proteins. The micro-proteomics pipeline thus facilitates the investigation of stochastic variation in protein expression between individuals within an isogenic population of C. elegans. All data described in this study are available online via the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd), an open access, searchable database resource.
A Java-based tool for the design of classification microarrays.
Meng, Da; Broschat, Shira L; Call, Douglas R
2008-08-04
Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays (and mixed-plasmid microarrays in particular), it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm), several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text), and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff). 
Weights generated using stepwise discriminant analysis can be stored for analysis of subsequent experimental data. Additionally, PLASMID can be used to construct virtual microarrays with genomes from public databases, which can then be used to identify an optimal set of probes.
Tra, Yolande V; Evans, Irene M
2010-01-01
BIO2010 put forth the goal of improving the mathematical educational background of biology students. The analysis and interpretation of microarray high-dimensional data can be very challenging and is best done by a statistician and a biologist working and teaching in a collaborative manner. We set up such a collaboration and designed a course on microarray data analysis. We started using Genome Consortium for Active Teaching (GCAT) materials and Microarray Genome and Clustering Tool software and added R statistical software along with Bioconductor packages. In response to student feedback, one microarray data set was fully analyzed in class, starting from preprocessing to gene discovery to pathway analysis using the latter software. A class project was to conduct a similar analysis where students analyzed their own data or data from a published journal paper. This exercise showed the impact that filtering, preprocessing, and different normalization methods had on gene inclusion in the final data set. We conclude that this course achieved its goals to equip students with skills to analyze data from a microarray experiment. We offer our insight about collaborative teaching as well as how other faculty might design and implement a similar interdisciplinary course. PMID:20810954
Chromosomal Microarray versus Karyotyping for Prenatal Diagnosis
Wapner, Ronald J.; Martin, Christa Lese; Levy, Brynn; Ballif, Blake C.; Eng, Christine M.; Zachary, Julia M.; Savage, Melissa; Platt, Lawrence D.; Saltzman, Daniel; Grobman, William A.; Klugman, Susan; Scholl, Thomas; Simpson, Joe Leigh; McCall, Kimberly; Aggarwal, Vimla S.; Bunke, Brian; Nahum, Odelia; Patel, Ankita; Lamb, Allen N.; Thom, Elizabeth A.; Beaudet, Arthur L.; Ledbetter, David H.; Shaffer, Lisa G.; Jackson, Laird
2013-01-01
Background Chromosomal microarray analysis has emerged as a primary diagnostic tool for the evaluation of developmental delay and structural malformations in children. We aimed to evaluate the accuracy, efficacy, and incremental yield of chromosomal microarray analysis as compared with karyotyping for routine prenatal diagnosis. Methods Samples from women undergoing prenatal diagnosis at 29 centers were sent to a central karyotyping laboratory. Each sample was split in two; standard karyotyping was performed on one portion and the other was sent to one of four laboratories for chromosomal microarray. Results We enrolled a total of 4406 women. Indications for prenatal diagnosis were advanced maternal age (46.6%), abnormal result on Down’s syndrome screening (18.8%), structural anomalies on ultrasonography (25.2%), and other indications (9.4%). In 4340 (98.8%) of the fetal samples, microarray analysis was successful; 87.9% of samples could be used without tissue culture. Microarray analysis of the 4282 nonmosaic samples identified all the aneuploidies and unbalanced rearrangements identified on karyotyping but did not identify balanced translocations and fetal triploidy. In samples with a normal karyotype, microarray analysis revealed clinically relevant deletions or duplications in 6.0% with a structural anomaly and in 1.7% of those whose indications were advanced maternal age or positive screening results. Conclusions In the context of prenatal diagnostic testing, chromosomal microarray analysis identified additional, clinically significant cytogenetic information as compared with karyotyping and was equally efficacious in identifying aneuploidies and unbalanced rearrangements but did not identify balanced translocations and triploidies. (Funded by the Eunice Kennedy Shriver National Institute of Child Health and Human Development and others; ClinicalTrials.gov number, NCT01279733.) PMID:23215555
MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data
Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko
2007-01-01
Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation; 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents a significant advance and could be of great interest to the proteomics community. PMID:17567892
Kumar, Mukesh; Rath, Nitish Kumar; Rath, Santanu Kumar
2016-04-01
Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generate an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as it keeps changing with time. Therefore, the analysis of microarray datasets in a small amount of time is essential. They often contain a large amount of expression data, but only a fraction of it comprises genes that are significantly expressed. The precise identification of genes of interest that are responsible for causing cancer is imperative in microarray data analysis. Most existing schemes employ a two-phase process such as feature selection/extraction followed by classification. In this paper, various statistical methods (tests) based on MapReduce are proposed for selecting relevant features. After feature selection, a MapReduce-based K-nearest neighbor (mrKNN) classifier is also employed to classify microarray data. These algorithms are successfully implemented in a Hadoop framework. A comparative analysis is done on these MapReduce-based models using microarray datasets of various dimensions. From the obtained results, it is observed that these models consume much less execution time than conventional models in processing big data. Copyright © 2016 Elsevier Inc. All rights reserved.
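The abstract above does not give the details of mrKNN, so the following is only a minimal single-machine sketch of how a K-nearest-neighbor classifier can be split into a map step and a reduce step. The function names, the partition layout and the toy expression values are assumptions made for illustration; this is not the authors' Hadoop implementation.

```python
from collections import Counter
from functools import reduce
import heapq
import math


def map_partition(partition, query, k):
    """Map step: return the k nearest (distance, label) pairs in one partition."""
    scored = [(math.dist(features, query), label) for features, label in partition]
    return heapq.nsmallest(k, scored)


def reduce_neighbors(left, right, k):
    """Reduce step: merge two local candidate lists, keeping the k nearest overall."""
    return heapq.nsmallest(k, left + right)


def mr_knn_predict(partitions, query, k=3):
    """Classify `query` by majority vote over the k globally nearest samples."""
    local_candidates = [map_partition(p, query, k) for p in partitions]
    nearest = reduce(lambda a, b: reduce_neighbors(a, b, k), local_candidates)
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical two-gene expression profiles, split across two partitions.
partitions = [
    [((0.0, 0.0), "normal"), ((0.1, 0.2), "normal")],
    [((5.0, 5.0), "tumor"), ((5.1, 4.9), "tumor"), ((0.2, 0.1), "normal")],
]
print(mr_knn_predict(partitions, (5.0, 5.0), k=3))  # prints "tumor"
```

Each partition only has to emit its k best candidates, so the reduce step handles at most k entries per partition regardless of how many samples the partition holds. That property is what makes this style of classifier suitable for distribution.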
Szcześniak, Katarzyna A; Ciecierska, Anna; Ostaszewski, Piotr; Sadkowski, Tomasz
2016-10-01
β-Hydroxy-β-methylbutyrate (HMB) is a popular ergogenic aid used by human athletes and as a supplement to sport horses, because of its ability to aid muscle recovery, improve performance and body composition. Recent findings suggest that HMB may stimulate satellite cells and affect expressions of genes regulating skeletal muscle cell growth. Despite the scientific data showing benefits of HMB supplementation in horses, no previous study has explained the mechanism of action of HMB in this species. The aim of this study was to reveal the molecular background of HMB action on equine skeletal muscle by investigating the transcriptomic profile changes induced by HMB in equine satellite cells in vitro. Upon isolation from the semitendinosus muscle, equine satellite cells were cultured until the 2nd day of differentiation. Differentiating cells were incubated with HMB for 24 h. Total cellular RNA was isolated, amplified, labelled and hybridised to microarray slides. Microarray data validation was performed with real-time quantitative PCR. HMB induced differential expressions of 361 genes. Functional analysis revealed that the main biological processes influenced by HMB in equine satellite cells were related to muscle organ development, protein metabolism, energy homoeostasis and lipid metabolism. In conclusion, this study demonstrated for the first time that HMB has the potential to influence equine satellite cells by controlling global gene expression. Genes and biological processes targeted by HMB in equine satellite cells may support HMB utility in improving growth and regeneration of equine skeletal muscle; however, the overall role of HMB in horses remains equivocal and requires further proteomic, biochemical and pharmacokinetic studies.
Smoot, L M; Smoot, J C; Graham, M R; Somerville, G A; Sturdevant, D E; Migliaccio, C A; Sylva, G L; Musser, J M
2001-08-28
Pathogens are exposed to different temperatures during an infection cycle and must regulate gene expression accordingly. However, the extent to which virulent bacteria alter gene expression in response to temperatures encountered in the host is unknown. Group A Streptococcus (GAS) is a human-specific pathogen that is responsible for illnesses ranging from superficial skin infections and pharyngitis to severe invasive infections such as necrotizing fasciitis and streptococcal toxic shock syndrome. GAS survives and multiplies at different temperatures during human infection. DNA microarray analysis was used to investigate the influence of temperature on global gene expression in a serotype M1 strain grown to exponential phase at 29 degrees C and 37 degrees C. Approximately 9% of genes were differentially expressed by at least 1.5-fold at 29 degrees C relative to 37 degrees C, including genes encoding transporter proteins, proteins involved in iron homeostasis, transcriptional regulators, phage-associated proteins, and proteins with no known homologue. Relatively few known virulence genes were differentially expressed at this threshold. However, transcription of 28 genes encoding proteins with predicted secretion signal sequences was altered, indicating that growth temperature substantially influences the extracellular proteome. TaqMan real-time reverse transcription-PCR assays confirmed the microarray data. We also discovered that transcription of genes encoding hemolysins, and proteins with inferred roles in iron regulation, transport, and homeostasis, was influenced by growth at 40 degrees C. Thus, GAS profoundly alters gene expression in response to temperature. The data delineate the spectrum of temperature-regulated gene expression in an important human pathogen and provide many unforeseen lines of pathogenesis investigation.
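The 1.5-fold criterion mentioned above can be illustrated with a short sketch. It assumes normalized, strictly positive expression values per gene and applies the threshold symmetrically to up- and down-regulation. The function name, gene names and values are hypothetical, and the study's own normalization and statistics are not reproduced here.

```python
import math


def differentially_expressed(expr_a, expr_b, threshold=1.5):
    """Return {gene: log2 ratio} for genes changed by at least `threshold`-fold.

    expr_a and expr_b map gene names to positive expression values
    measured under two conditions (for example 29 and 37 degrees C).
    """
    cutoff = math.log2(threshold)
    hits = {}
    for gene, value_a in expr_a.items():
        log_ratio = math.log2(value_a / expr_b[gene])
        if abs(log_ratio) >= cutoff:  # symmetric: catches increases and decreases
            hits[gene] = log_ratio
    return hits


# Hypothetical expression values for three genes.
expr_29 = {"geneA": 300.0, "geneB": 100.0, "geneC": 50.0}
expr_37 = {"geneA": 100.0, "geneB": 100.0, "geneC": 100.0}
hits = differentially_expressed(expr_29, expr_37)
print(sorted(hits))  # prints ['geneA', 'geneC']
```

Working on the log2 scale makes a 1.5-fold increase and a 1.5-fold decrease equidistant from zero, so a single absolute-value comparison covers both directions.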
Integrated Proteomic Approaches for Understanding Toxicity of Environmental Chemicals
To apply quantitative proteomic analysis to the evaluation of toxicity of environmental chemicals, we have developed an integrated proteomic technology platform. This platform has been applied to the analysis of the toxic effects and pathways of many important environmental chemi...
The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) announces the release of the cancer proteome confirmatory colon study data. The goal of the study is to analyze the proteomes of approximately 100 confirmatory colon tumor patients, which includes tumor and adjacent normal samples, with liquid chromatography-tandem mass spectrometry (LC-MS/MS) global proteomic and phosphoproteomic profiling.
CPTAC Proteomics Data on UCSC Genome Browser | Office of Cancer Clinical Proteomics Research
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium scientists are working together with the University of California, Santa Cruz (UCSC) Genomics Institute to provide public access to cancer proteomics data via the UCSC Genome Browser. This effort extends accessibility of the CPTAC data to more researchers and provides an additional level of analysis to assist the cancer biology community.
A molecular characterization of the choroid plexus and stress-induced gene regulation
Sathyanesan, M; Girgenti, M J; Banasr, M; Stone, K; Bruce, C; Guilchicek, E; Wilczak-Havill, K; Nairn, A; Williams, K; Sass, S; Duman, J G; Newton, S S
2012-01-01
The role of the choroid plexus (CP) in brain homeostasis is being increasingly recognized and recent studies suggest that the CP has a more important role in physiological and pathological brain functions than currently appreciated. To obtain additional insight on the CP function, we performed a proteomics and transcriptomics characterization employing a combination of high resolution tandem mass spectrometry and gene expression analyses in normal rodent brain. Using multiple protein fractionation approaches, we identified 1400 CP proteins in adult CP. Microarray-based comparison of CP gene expression with the kidney, cortex and hippocampus showed significant overlap between the CP and the kidney. CP gene profiles were validated by in situ hybridization analysis of several target genes including klotho, CLIC 6, OATP 14 and Ezrin. Immunohistochemical analyses were performed for CP and ependyma detection of several target proteins including cytokeratin, Rab7, klotho, tissue inhibitor of metalloprotease 1 (TIMP1), MMP9 and glial fibrillary acidic protein (GFAP). The molecular functions associated with various proteins of the CP proteome indicate that it is a blood–cerebrospinal fluid (CSF) barrier that exhibits high levels of metabolic activity. We also analyzed the gene expression changes induced by stress, an exacerbating factor for many illnesses, particularly mood disorders. Chronic stress altered the expression of several genes, downregulating 5HT2C, glucocorticoid receptor and the cilia genes IFT88 and smoothened while upregulating 5HT2A, BDNF, TNFα and IL-1b. The data presented here attach additional significance to the emerging importance of CP function in brain health and CNS disease states. PMID:22781172
Haonon, Ornuma; Rucksaken, Rucksak; Pinlaor, Porntip; Pairojkul, Chawalit; Chamgramol, Yaovalux; Intuyod, Kitti; Onsurathum, Sudarat; Khuntikeo, Narong; Pinlaor, Somchai
2016-03-01
To discover protein markers in chronic/advanced opisthorchiasis for the early detection of Opisthorchis viverrini (OV)-associated cholangiocarcinoma (CCA). Liver tissues derived from normal hamsters and those with chronic/advanced opisthorchiasis (n = 5 per group) were subjected to 2DE and LC-MS/MS. Candidate protein expression was confirmed in hamster models and human CCA tissue microarray (TMA) using immunohistochemistry and Western blot. Proteomics analysis detected 14-3-3 eta only in infected hamsters, not in uninfected controls. Immunohistochemistry and Western blot analysis confirmed low expression of 14-3-3 eta in normal hamster livers and demonstrated increased expression through time in infected livers. This protein was also observed in parasite organs, especially during the chronic phase of opisthorchiasis. Moreover, increased expression of 14-3-3 eta, relative to normal hamster livers, was observed during the early stage of CCA induced by OV infection and administration of N-nitrosodimethylamine. Immunohistochemical analysis of human TMA revealed that 14-3-3 eta was highly expressed in CCA (84.23%, 187/222 cases) but was not found in hepatocellular carcinoma or healthy liver tissues. 14-3-3 eta protein has potential as a screening and early diagnostic marker for CCA. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
EuPathDB: the eukaryotic pathogen genomics database resource
Aurrecoechea, Cristina; Barreto, Ana; Basenko, Evelina Y.; Brestelli, John; Brunk, Brian P.; Cade, Shon; Crouch, Kathryn; Doherty, Ryan; Falke, Dave; Fischer, Steve; Gajria, Bindu; Harb, Omar S.; Heiges, Mark; Hertz-Fowler, Christiane; Hu, Sufen; Iodice, John; Kissinger, Jessica C.; Lawrence, Cris; Li, Wei; Pinney, Deborah F.; Pulman, Jane A.; Roos, David S.; Shanmugasundram, Achchuthan; Silva-Franco, Fatima; Steinbiss, Sascha; Stoeckert, Christian J.; Spruill, Drew; Wang, Haiming; Warrenfeltz, Susanne; Zheng, Jie
2017-01-01
The Eukaryotic Pathogen Genomics Database Resource (EuPathDB, http://eupathdb.org) is a collection of databases covering 170+ eukaryotic pathogens (protists & fungi), along with relevant free-living and non-pathogenic species, and select pathogen hosts. To facilitate the discovery of meaningful biological relationships, the databases couple preconfigured searches with visualization and analysis tools for comprehensive data mining via intuitive graphical interfaces and APIs. All data are analyzed with the same workflows, including creation of gene orthology profiles, so data are easily compared across data sets, data types and organisms. EuPathDB is updated with numerous new analysis tools, features, data sets and data types. New tools include GO, metabolic pathway and word enrichment analyses plus an online workspace for analysis of personal, non-public, large-scale data. Expanded data content is mostly genomic and functional genomic data while new data types include protein microarray, metabolic pathways, compounds, quantitative proteomics, copy number variation, and polysomal transcriptomics. New features include consistent categorization of searches, data sets and genome browser tracks; redesigned gene pages; effective integration of alternative transcripts; and a EuPathDB Galaxy instance for private analyses of a user's data. Forthcoming upgrades include user workspaces for private integration of data with existing EuPathDB data and improved integration and presentation of host–pathogen interactions. PMID:27903906
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Daly, Don S.; Willse, Alan R.
The Automated Microarray Image Analysis (AMIA) Toolbox for MATLAB is a flexible, open-source microarray image analysis tool that allows the user to customize analysis of sets of microarray images. This tool provides several methods of identifying and quantifying spot statistics, as well as extensive diagnostic statistics and images to identify poor data quality or processing. The open nature of this software allows researchers to understand the algorithms used to provide intensity estimates and to modify them easily if desired.
ZBIT Bioinformatics Toolbox: A Web-Platform for Systems Biology and Expression Data Analysis
Römer, Michael; Eichner, Johannes; Dräger, Andreas; Wrzodek, Clemens; Wrzodek, Finja; Zell, Andreas
2016-01-01
Bioinformatics analysis has become an integral part of research in biology. However, installation and use of scientific software can be difficult and often requires technical expert knowledge. Reasons are dependencies on certain operating systems or required third-party libraries, missing graphical user interfaces and documentation, or nonstandard input and output formats. In order to make bioinformatics software easily accessible to researchers, we here present a web-based platform. The Center for Bioinformatics Tuebingen (ZBIT) Bioinformatics Toolbox provides web-based access to a collection of bioinformatics tools developed for systems biology, protein sequence annotation, and expression data analysis. Currently, the collection encompasses software for conversion and processing of community standards SBML and BioPAX, transcription factor analysis, and analysis of microarray data from transcriptomics and proteomics studies. All tools are hosted on a customized Galaxy instance and run on a dedicated computation cluster. Users only need a web browser and an active internet connection in order to benefit from this service. The web platform is designed to facilitate the usage of the bioinformatics tools for researchers without advanced technical background. Users can combine tools for complex analyses or use predefined, customizable workflows. All results are stored persistently and are reproducible. For each tool, we provide documentation, tutorials, and example data to maximize usability. The ZBIT Bioinformatics Toolbox is freely available at https://webservices.cs.uni-tuebingen.de/. PMID:26882475
Plant proteome analysis: a 2006 update.
Jorrín, Jesús V; Maldonado, Ana M; Castillejo, Ma Angeles
2007-08-01
This 2006 'Plant Proteomics Update' is a continuation of the two previously published in 'Proteomics' in 2004 (Canovas et al., Proteomics 2004, 4, 285-298) and 2006 (Rossignol et al., Proteomics 2006, 6, 5529-5548) and it aims to bring up-to-date the contribution of proteomics to plant biology on the basis of the original research papers published throughout 2006, with references to those appearing last year. According to the published papers and topics addressed, we can conclude that, as observed for the three previous years, there has been a quantitative, but not qualitative leap in plant proteomics. The full potential of proteomics is far from being exploited in plant biology research, especially if compared to other organisms, mainly yeast and humans, and a number of challenges, mainly technological, remain to be tackled. The original papers published last year numbered nearly 100 and deal with the proteome of at least 26 plant species, with a high percentage for Arabidopsis thaliana (28) and rice (11). Scientific objectives ranged from proteomic analysis of organs/tissues/cell suspensions (57) or subcellular fractions (29), to the study of plant development (12), the effect of hormones and signalling molecules (8) and response to symbionts (4) and stresses (27). A small number of contributions have covered PTMs (8) and protein interactions (4). 2-DE (specifically IEF-SDS-PAGE) coupled to MS still constitutes the almost unique platform utilized in plant proteome analysis. The application of gel-free protein separation methods and 'second generation' proteomic techniques such as multidimensional protein identification technology (MudPIT), and those for quantitative proteomics including DIGE, isotope-coded affinity tags (ICAT), iTRAQ and stable isotope labelling by amino acids in cell culture (SILAC) still remains anecdotal.
This review is divided into seven sections: Introduction, Methodology, Subcellular proteomes, Development, Responses to biotic and abiotic stresses, PTMs and Protein interactions. Section 8 summarizes the major pitfalls and challenges of plant proteomics.
ELISA-BASE: An Integrated Bioinformatics Tool for Analyzing and Tracking ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Collett, James L.; Seurynck-Servoss, Shannon L.
ELISA-BASE is an open-source database for capturing, organizing and analyzing protein enzyme-linked immunosorbent assay (ELISA) microarray data. ELISA-BASE is an extension of the BioArray Software Environment (BASE) database system, which was developed for DNA microarrays. In order to make BASE suitable for protein microarray experiments, we developed several plugins for importing and analyzing quantitative ELISA microarray data. Most notably, our Protein Microarray Analysis Tool (ProMAT) for processing quantitative ELISA data is now available as a plugin to the database.
Investigators from the National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) who comprehensively analyzed 95 human colorectal tumor samples, have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, provides a more comprehensive view of the biological features that drive cancer than genomic analysis alone and may help identify the most important targets for cancer detection and intervention.
Executioner Caspase-3 and 7 Deficiency Reduces Myocyte Number in the Developing Mouse Heart
Cardona, Maria; López, Juan Antonio; Serafín, Anna; Rongvaux, Anthony; Inserte, Javier; García-Dorado, David; Flavell, Richard; Llovera, Marta; Cañas, Xavier; Vázquez, Jesús; Sanchis, Daniel
2015-01-01
Executioner caspase-3 and -7 are proteases promoting cell death but non-apoptotic roles are being discovered. The heart expresses caspases only during development, suggesting they contribute to the organ maturation process. Therefore, we aimed at identifying novel functions of caspases in heart development. We induced simultaneous deletion of executioner caspase-3 and -7 in the mouse myocardium and studied its effects. Caspase knockout hearts are hypoplastic at birth, reaching normal weight progressively through myocyte hypertrophy. To identify the molecular pathways involved in these effects, we used microarray-based transcriptomics and multiplexed quantitative proteomics to compare wild type and executioner caspase-deficient myocardium at different developmental stages. Transcriptomics showed reduced expression of genes promoting DNA replication and cell cycle progression in the neonatal caspase-deficient heart suggesting reduced myocyte proliferation, and expression of non-cardiac isoforms of structural proteins in the adult null myocardium. Proteomics showed reduced abundance of proteins involved in oxidative phosphorylation accompanied by increased abundance of glycolytic enzymes underscoring retarded metabolic maturation of the caspase-null myocardium. Correlation between mRNA expression and protein abundance of relevant genes was confirmed, but transcriptomics and proteomics identified complementary molecular pathways influenced by caspases in the developing heart. Forced expression of wild type or proteolytically inactive caspases in cultured cardiomyocytes induced expression of genes promoting cell division. The results reveal that executioner caspases can modulate heart’s cellularity and maturation during development, contributing novel information about caspase biology and heart development. PMID:26121671
A Proteomics View of the Molecular Mechanisms and Biomarkers of Glaucomatous Neurodegeneration
Tezel, Gülgün
2013-01-01
Despite improving understanding of glaucoma, key molecular players of neurodegeneration that can be targeted for treatment of glaucoma, or molecular biomarkers that can be useful for clinical testing, remain unclear. Proteomics technology offers a powerful toolbox to accomplish these important goals of the glaucoma research and is increasingly being applied to identify molecular mechanisms and biomarkers of glaucoma. Recent studies of glaucoma using proteomics analysis techniques have resulted in the lists of differentially expressed proteins in human glaucoma and animal models. The global analysis of protein expression in glaucoma has been followed by cell-specific proteome analysis of retinal ganglion cells and astrocytes. The proteomics data have also guided targeted studies to identify post-translational modifications and protein-protein interactions during glaucomatous neurodegeneration. In addition, recent applications of proteomics have provided a number of potential biomarker candidates. Proteomics technology holds great promise to move glaucoma research forward toward new treatment strategies and biomarker discovery. By reviewing the major proteomics approaches and their applications in the field of glaucoma, this article highlights the power of proteomics in translational and clinical research related to glaucoma and also provides a framework for future research to functionally test the importance of specific molecular pathways and validate candidate biomarkers. PMID:23396249
2010-01-01
Background Recent developments in high-throughput methods of analyzing transcriptomic profiles are promising for many areas of biology, including ecophysiology. However, although commercial microarrays are available for most common laboratory models, transcriptome analysis in non-traditional model species still remains a challenge. Indeed, the signal resulting from heterologous hybridization is low and difficult to interpret because of the weak complementarity between probe and target sequences, especially when no microarray dedicated to a genetically close species is available. Results We show here that transcriptome analysis in a species genetically distant from laboratory models is made possible by using MAXRS, a new method of analyzing heterologous hybridization on microarrays. This method takes advantage of the design of several commercial microarrays, with different probes targeting the same transcript. To illustrate and test this method, we analyzed the transcriptome of king penguin pectoralis muscle hybridized to Affymetrix chicken microarrays, two organisms separated by an evolutionary distance of approximately 100 million years. The differential gene expression observed between different physiological situations computed by MAXRS was confirmed by real-time PCR on 10 genes out of 11 tested. Conclusions MAXRS appears to be an appropriate method for gene expression analysis under heterologous hybridization conditions. PMID:20509979
A Human Lectin Microarray for Sperm Surface Glycosylation Analysis *
Sun, Yangyang; Cheng, Li; Gu, Yihua; Xin, Aijie; Wu, Bin; Zhou, Shumin; Guo, Shujuan; Liu, Yin; Diao, Hua; Shi, Huijuan; Wang, Guangyu; Tao, Sheng-ce
2016-01-01
Glycosylation is one of the most abundant and functionally important protein post-translational modifications. As such, technology for efficient glycosylation analysis is in high demand. Lectin microarrays are a powerful tool for such investigations and have been successfully applied for a variety of glycobiological studies. However, most of the current lectin microarrays are primarily constructed from plant lectins, which are not well suited for studies of human glycosylation because of the extreme complexity of human glycans. Herein, we constructed a human lectin microarray with 60 human lectin and lectin-like proteins. All of the lectins and lectin-like proteins were purified from yeast, and most showed binding to human glycans. To demonstrate the applicability of the human lectin microarray, human sperm were probed on the microarray and strong bindings were observed for several lectins, including galectin-1, 7, 8, GalNAc-T6, and ERGIC-53 (LMAN1). These bindings were validated by flow cytometry and fluorescence immunostaining. Further, mass spectrometry analysis showed that galectin-1 binds several membrane-associated proteins including heat shock protein 90. Finally, functional assays showed that binding of galectin-8 could significantly enhance the acrosome reaction within human sperms. To our knowledge, this is the first construction of a human lectin microarray, and we anticipate it will find wide use for a range of human or mammalian studies, alone or in combination with plant lectin microarrays. PMID:27364157
FunRich proteomics software analysis, let the fun begin!
Benito-Martin, Alberto; Peinado, Héctor
2015-08-01
Protein MS analysis is the preferred method for unbiased protein identification. It is normally applied to a large number of both small-scale and high-throughput studies. However, user-friendly computational tools for protein analysis are still needed. In this issue, Mathivanan and colleagues (Proteomics 2015, 15, 2597-2601) report the development of FunRich software, an open-access software that facilitates the analysis of proteomics data, providing tools for functional enrichment and interaction network analysis of genes and proteins. FunRich is a reinterpretation of proteomic software, a standalone tool combining ease of use with customizable databases, free access, and graphical representations. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Top-down proteomics for the analysis of proteolytic events - Methods, applications and perspectives.
Tholey, Andreas; Becker, Alexander
2017-11-01
Mass spectrometry based proteomics is an indispensable tool for almost all research areas relevant for the understanding of proteolytic processing, ranging from the identification of substrates, products and cleavage sites up to the analysis of structural features influencing protease activity. The majority of methods for these studies are based on bottom-up proteomics performing analysis at peptide level. As this approach is characterized by a number of pitfalls, e.g. loss of molecular information, there is an ongoing effort to establish top-down proteomics, performing separation and MS analysis both at intact protein level. We briefly introduce major approaches of bottom-up proteomics used in the field of protease research and highlight the shortcomings of these methods. We then discuss the present state-of-the-art of top-down proteomics. Together with the discussion of known challenges we show the potential of this approach and present a number of successful applications of top-down proteomics in protease research. This article is part of a Special Issue entitled: Proteolysis as a Regulatory Event in Pathophysiology edited by Stefan Rose-John. Copyright © 2017 Elsevier B.V. All rights reserved.
Shotgun proteomics of plant plasma membrane and microdomain proteins using nano-LC-MS/MS.
Takahashi, Daisuke; Li, Bin; Nakayama, Takato; Kawamura, Yukio; Uemura, Matsuo
2014-01-01
Shotgun proteomics allows the comprehensive analysis of proteins extracted from plant cells, subcellular organelles, and membranes. Previously, two-dimensional gel electrophoresis-based proteomics was used for mass spectrometric analysis of plasma membrane proteins. In order to get comprehensive proteome profiles of the plasma membrane including highly hydrophobic proteins with a number of transmembrane domains, a mass spectrometry-based shotgun proteomics method using nano-LC-MS/MS for proteins from the plasma membrane and plasma membrane microdomain fractions is described. The results obtained are easily applicable to label-free protein semiquantification.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gentry, T.; Schadt, C.; Zhou, J.
Microarray technology has the unparalleled potential to simultaneously determine the dynamics and/or activities of most, if not all, of the microbial populations in complex environments such as soils and sediments. Researchers have developed several types of arrays that characterize the microbial populations in these samples based on their phylogenetic relatedness or functional genomic content. Several recent studies have used these microarrays to investigate ecological issues; however, most have only analyzed a limited number of samples with relatively few experiments utilizing the full high-throughput potential of microarray analysis. This is due in part to the unique analytical challenges that these samples present with regard to sensitivity, specificity, quantitation, and data analysis. This review discusses specific applications of microarrays to microbial ecology research along with some of the latest studies addressing the difficulties encountered during analysis of complex microbial communities within environmental samples. With continued development, microarray technology may ultimately achieve its potential for comprehensive, high-throughput characterization of microbial populations in near real-time.
Fully automated analysis of multi-resolution four-channel micro-array genotyping data
NASA Astrophysics Data System (ADS)
Abbaspour, Mohsen; Abugharbieh, Rafeef; Podder, Mohua; Tebbutt, Scott J.
2006-03-01
We present a fully-automated and robust microarray image analysis system for handling multi-resolution images (down to 3-micron with sizes up to 80 MBs per channel). The system is developed to provide rapid and accurate data extraction for our recently developed microarray analysis and quality control tool (SNP Chart). Currently available commercial microarray image analysis applications are inefficient, due to the considerable user interaction typically required. Four-channel DNA microarray technology is a robust and accurate tool for determining genotypes of multiple genetic markers in individuals. It plays an important role in the state of the art trend where traditional medical treatments are to be replaced by personalized genetic medicine, i.e. individualized therapy based on the patient's genetic heritage. However, fast, robust, and precise image processing tools are required for the prospective practical use of microarray-based genetic testing for predicting disease susceptibilities and drug effects in clinical practice, which require a turn-around timeline compatible with clinical decision-making. In this paper we have developed a fully-automated image analysis platform for the rapid investigation of hundreds of genetic variations across multiple genes. Validation tests indicate very high accuracy levels for genotyping results. Our method achieves a significant reduction in analysis time, from several hours to just a few minutes, and is completely automated requiring no manual interaction or guidance.
Goeminne, Ludger J E; Gevaert, Kris; Clement, Lieven
2018-01-16
Label-free shotgun proteomics is routinely used to assess proteomes. However, extracting relevant information from the massive amounts of generated data remains difficult. This tutorial provides a strong foundation on analysis of quantitative proteomics data. We provide key statistical concepts that help researchers to design proteomics experiments and we showcase how to analyze quantitative proteomics data using our recent free and open-source R package MSqRob, which was developed to implement the peptide-level robust ridge regression method for relative protein quantification described by Goeminne et al. MSqRob can handle virtually any experimental proteomics design and outputs proteins ordered by statistical significance. Moreover, its graphical user interface and interactive diagnostic plots provide easy inspection and also detection of anomalies in the data and flaws in the data analysis, allowing deeper assessment of the validity of results and a critical review of the experimental design. Our tutorial discusses interactive preprocessing, data analysis and visualization of label-free MS-based quantitative proteomics experiments with simple and more complex designs. We provide well-documented scripts to run analyses in bash mode on GitHub, enabling the integration of MSqRob in automated pipelines on cluster environments (https://github.com/statOmics/MSqRob). The concepts outlined in this tutorial aid in designing better experiments and analyzing the resulting data more appropriately. The two case studies using the MSqRob graphical user interface will contribute to a wider adaptation of advanced peptide-based models, resulting in higher quality data analysis workflows and more reproducible results in the proteomics community. We also provide well-documented scripts for experienced users that aim at automating MSqRob on cluster environments. Copyright © 2017 Elsevier B.V. All rights reserved.
Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus
2014-01-01
The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Albeit quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868
ProteoSign: an end-user online differential proteomics statistical analysis platform.
Efstathiou, Georgios; Antonakis, Andreas N; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Divanach, Peter; Trudgian, David C; Thomas, Benjamin; Papanikolaou, Nikolas; Aivaliotis, Michalis; Acuto, Oreste; Iliopoulos, Ioannis
2017-07-03
Profiling of proteome dynamics is crucial for understanding cellular behavior in response to intrinsic and extrinsic stimuli and maintenance of homeostasis. Over the last 20 years, mass spectrometry (MS) has emerged as the most powerful tool for large-scale identification and characterization of proteins. Bottom-up proteomics, the most common MS-based proteomics approach, has always been challenging in terms of data management, processing, analysis and visualization, with modern instruments capable of producing several gigabytes of data out of a single experiment. Here, we present ProteoSign, a freely available web application dedicated to allowing users to perform proteomics differential expression/abundance analysis in a user-friendly and self-explanatory way. Although several non-commercial standalone tools have been developed for post-quantification statistical analysis of proteomics data, most of them do not appeal to end users, as they often require very stringent installation of programming environments, third-party software packages and sometimes further scripting or computer programming. To avoid this bottleneck, we have developed a user-friendly software platform accessible via a web interface in order to enable proteomics laboratories and core facilities to statistically analyse quantitative proteomics data sets in a resource-efficient manner. ProteoSign is available at http://bioinformatics.med.uoc.gr/ProteoSign and the source code at https://github.com/yorgodillo/ProteoSign. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
The Escherichia coli Proteome: Past, Present, and Future Prospects†
Han, Mee-Jung; Lee, Sang Yup
2006-01-01
Proteomics has emerged as an indispensable methodology for large-scale protein analysis in functional genomics. The Escherichia coli proteome has been extensively studied and is well defined in terms of biochemical, biological, and biotechnological data. Even before the entire E. coli proteome was fully elucidated, the largest available data set had been integrated to decipher regulatory circuits and metabolic pathways, providing valuable insights into global cellular physiology and the development of metabolic and cellular engineering strategies. With the recent advent of advanced proteomic technologies, the E. coli proteome has been used for the validation of new technologies and methodologies such as sample prefractionation, protein enrichment, two-dimensional gel electrophoresis, protein detection, mass spectrometry (MS), combinatorial assays with n-dimensional chromatographies and MS, and image analysis software. These important technologies will not only provide a great amount of additional information on the E. coli proteome but also synergistically contribute to other proteomic studies. Here, we review the past development and current status of E. coli proteome research in terms of its biological, biotechnological, and methodological significance and suggest future prospects. PMID:16760308
Derivative component analysis for mass spectral serum proteomic profiles.
Han, Henry
2014-01-01
As a promising way to transform medicine, mass spectrometry based proteomics technologies have seen great progress in identifying disease biomarkers for clinical diagnosis and prognosis. However, there is a lack of effective feature selection methods that are able to capture essential data behaviors to achieve clinical level disease diagnosis. Moreover, the field faces a data reproducibility challenge: no two independent studies have been found to produce the same proteomic patterns. This reproducibility issue causes the identified biomarker patterns to lose repeatability and prevents their real clinical use. In this work, we propose a novel machine-learning algorithm: derivative component analysis (DCA) for high-dimensional mass spectral proteomic profiles. As an implicit feature selection algorithm, derivative component analysis examines input proteomics data in a multi-resolution approach by seeking its derivatives to capture latent data characteristics and conduct de-noising. We further demonstrate DCA's advantages in disease diagnosis by viewing input proteomics data as a profile biomarker via integrating it with support vector machines to tackle the reproducibility issue, besides comparing it with state-of-the-art peers. Our results show that high-dimensional proteomics data are actually linearly separable under the proposed derivative component analysis (DCA). As a novel multi-resolution feature selection algorithm, DCA not only overcomes the weakness of the traditional methods in subtle data behavior discovery, but also suggests an effective resolution to overcoming proteomics data's reproducibility problem and provides new techniques and insights in translational bioinformatics and machine learning. The DCA-based profile biomarker diagnosis makes clinical level diagnostic performances reproducible across different proteomic data, which is more robust and systematic than the existing biomarker discovery based diagnosis. 
Our findings demonstrate the feasibility and power of the proposed DCA-based profile biomarker diagnosis in achieving high sensitivity and conquering the data reproducibility issue in serum proteomics. Furthermore, our proposed derivative component analysis suggests the subtle data characteristics gleaning and de-noising are essential in separating true signals from red herrings for high-dimensional proteomic profiles, which can be more important than the conventional feature selection or dimension reduction. In particular, our profile biomarker diagnosis can be generalized to other omics data for derivative component analysis (DCA)'s nature of generic data analysis.
Tumor Cold Ischemia | Office of Cancer Clinical Proteomics Research
In a recently published manuscript in the journal Molecular and Cellular Proteomics, researchers from the National Cancer Institute's (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigated the effect of cold ischemia on the proteome of fresh frozen tumors.
Proteomics wants cRacker: automated standardized data analysis of LC-MS derived proteomic data.
Zauber, Henrik; Schulze, Waltraud X
2012-11-02
The large-scale analysis of thousands of proteins under various experimental conditions or in mutant lines has gained more and more importance in hypothesis-driven scientific research and systems biology in the past years. Quantitative analysis by large scale proteomics using modern mass spectrometry usually results in long lists of peptide ion intensities. The main interest for most researchers, however, is to draw conclusions on the protein level. Postprocessing and combining peptide intensities of a proteomic data set requires expert knowledge, and the often repetitive and standardized manual calculations can be time-consuming. The analysis of complex samples can result in very large data sets (lists with several 1000s to 100,000 entries of different peptides) that cannot easily be analyzed using standard spreadsheet programs. To improve speed and consistency of the data analysis of LC-MS derived proteomic data, we developed cRacker. cRacker is an R-based program for automated downstream proteomic data analysis including data normalization strategies for metabolic labeling and label free quantitation. In addition, cRacker includes basic statistical analysis, such as clustering of data, or ANOVA and t tests for comparison between treatments. Results are presented in editable graphic formats and in list files.
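The roll-up from peptide ion intensities to protein-level values described in this abstract can be sketched in a few lines. This is only an illustration and not cRacker's implementation (cRacker is R-based). The scaling of each sample by its median intensity and the median summary per protein are assumptions chosen for simplicity, and all function names and data below are hypothetical.

```python
from collections import defaultdict
from statistics import median


def normalize_by_sample_median(rows):
    """Scale each sample so that its median peptide intensity equals 1.0."""
    by_sample = defaultdict(list)
    for r in rows:
        by_sample[r["sample"]].append(r["intensity"])
    sample_median = {s: median(v) for s, v in by_sample.items()}
    return [dict(r, intensity=r["intensity"] / sample_median[r["sample"]]) for r in rows]


def protein_level(rows):
    """Summarize peptide intensities per (protein, sample) with the median."""
    groups = defaultdict(list)
    for r in rows:
        groups[(r["protein"], r["sample"])].append(r["intensity"])
    return {key: median(values) for key, values in groups.items()}


# Hypothetical peptide list: two proteins measured in two samples.
peptides = [
    {"protein": "P1", "peptide": "AAK", "sample": "ctrl", "intensity": 100.0},
    {"protein": "P1", "peptide": "CCR", "sample": "ctrl", "intensity": 300.0},
    {"protein": "P2", "peptide": "DDK", "sample": "ctrl", "intensity": 200.0},
    {"protein": "P1", "peptide": "AAK", "sample": "treated", "intensity": 400.0},
    {"protein": "P1", "peptide": "CCR", "sample": "treated", "intensity": 800.0},
    {"protein": "P2", "peptide": "DDK", "sample": "treated", "intensity": 400.0},
]

proteins = protein_level(normalize_by_sample_median(peptides))
for key in sorted(proteins):
    print(key, proteins[key])
```

A median summary is used here because it is less sensitive to a single outlying peptide than a mean; other summaries (sum, mean of the top peptides) are equally plausible choices.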
PRACTICAL STRATEGIES FOR PROCESSING AND ANALYZING SPOTTED OLIGONUCLEOTIDE MICROARRAY DATA
Thoughtful data analysis is as important as experimental design, biological sample quality, and appropriate experimental procedures for making microarrays a useful supplement to traditional toxicology. In the present study, spotted oligonucleotide microarrays were used to profile...
Individualised cancer therapeutics: dream or reality? Therapeutics construction.
Shen, Yuqiao; Senzer, Neil; Nemunaitis, John
2005-11-01
The analysis of DNA microarray and proteomic data, and the subsequent integration into functional expression sets, provides a circuit map of the hierarchical cellular networks responsible for sustaining the viability and environmental competitiveness of cancer cells, that is, their robust systematics. These technologies can be used to 'snapshot' the unique patterns of molecular derangements and modified interactions in cancer, and allow for strategic selection of therapeutics that best match the individual profile of the tumour. This review highlights technology that can be used to selectively disrupt critical molecular targets and describes possible vehicles to deliver the synthesised molecular therapeutics to the relevant cellular compartments of the malignant cells. RNA interference (RNAi) involves a group of evolutionarily conserved gene silencing mechanisms in which small sequences of double-stranded RNA or intrinsic antisense RNA trigger mRNA cleavage or translational repression, respectively. Although RNAi molecules can be synthesised to 'silence' virtually any gene, even if upregulated, a mechanism for selective delivery of RNAi effectors to sites of malignant disease remains challenging. The authors will discuss gene-modified conditionally replicating viruses as candidate vehicles for the delivery of RNAi.
The Ser/Thr Protein Kinase Protein-Protein Interaction Map of M. tuberculosis.
Wu, Fan-Lin; Liu, Yin; Jiang, He-Wei; Luan, Yi-Zhao; Zhang, Hai-Nan; He, Xiang; Xu, Zhao-Wei; Hou, Jing-Li; Ji, Li-Yun; Xie, Zhi; Czajkowsky, Daniel M; Yan, Wei; Deng, Jiao-Yu; Bi, Li-Jun; Zhang, Xian-En; Tao, Sheng-Ce
2017-08-01
Mycobacterium tuberculosis (Mtb) is the causative agent of tuberculosis, the leading cause of death among all infectious diseases. There are 11 eukaryotic-like serine/threonine protein kinases (STPKs) in Mtb, which are thought to play pivotal roles in cell growth, signal transduction and pathogenesis. However, their underlying mechanisms of action remain largely uncharacterized. In this study, using a Mtb proteome microarray, we have globally identified the binding proteins in Mtb for all of the STPKs, and constructed the first STPK protein interaction (KPI) map that includes 492 binding proteins and 1,027 interactions. Bioinformatics analysis showed that the interacting proteins reflect diverse functions, including roles in two-component system, transcription, protein degradation, and cell wall integrity. Functional investigations confirmed that PknG regulates cell wall integrity through key components of peptidoglycan (PG) biosynthesis, e.g. MurC. The global STPK-KPIs network constructed here is expected to serve as a rich resource for understanding the key signaling pathways in Mtb, thus facilitating drug development and effective control of Mtb. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Identification of a Conserved Glycan Signature for Microvesicles
Batista, Bianca S.; Eng, William S.; Pilobello, Kanoelani T.; Hendricks-Muñoz, Karen D.; Mahal, Lara K.
2011-01-01
Microvesicles (exosomes) are important mediators of intercellular communication, playing a role in immune regulation, cancer progression and the spread of infectious agents. The biological functions of these small vesicles are dependent upon their composition, which is regulated by mechanisms that are not well understood. Although numerous proteomic studies of these particles exist, little is known about their glycosylation. Carbohydrates are involved in protein trafficking and cellular recognition. Glycomic analysis may thus provide valuable insights into microvesicle biology. In this study, we analyzed glycosylation patterns of microvesicles derived from a variety of biological sources using lectin microarray technology. Comparison of the microvesicle glycomes with their parent cell membranes revealed both enrichment and depletion of specific glycan epitopes in these particles. These include enrichment in high mannose, polylactosamine, α-2,6 sialic acid, and complex N-linked glycans and exclusion of terminal blood group A and B antigens. The polylactosamine signature derives from distinct glycoprotein cohorts in microvesicles of different origins. Taken together our data point to the emergence of microvesicles from a specific membrane microdomain, implying a role for glycosylation in microvesicle protein sorting. PMID:21859146
Yeap, Swee Keong; Abu, Nadiah; Akthar, Nadeem; Ho, Wan Yong; Ky, Huynh; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Kamarul, Tunku
2016-01-01
Flavokawain B (FKB) is known to possess promising anticancer abilities. This is demonstrated in various cancer cell lines including HeLa cells. Cervical cancer is among the most widely diagnosed cancers in women today. Though FKB has been shown to be effective in treating cancer cells, the exact molecular mechanism is still unknown. This study is aimed at understanding the effects of FKB on HeLa cells using a microarray-based mRNA expression profiling and proteome profiling of stress-related proteins. The results of this study suggest that FKB induced cell death through p21-mediated cell cycle arrest and activation of p38. However, concurrent activation of antioxidant-related pathways and iron sequestration pathway followed by activation of ER-resident stress proteins clearly indicate that FKB failed to induce apoptosis in HeLa cells via oxidative stress. This effect implies that the protection of HeLa cells by FKB from H2O2–induced cell death is via neutralization of reactive oxygen species. PMID:27458249
Building ProteomeTools based on a complete synthetic human proteome
Zolg, Daniel P.; Wilhelm, Mathias; Schnatbaum, Karsten; Zerweck, Johannes; Knaute, Tobias; Delanghe, Bernard; Bailey, Derek J.; Gessulat, Siegfried; Ehrlich, Hans-Christian; Weininger, Maximilian; Yu, Peng; Schlegl, Judith; Kramer, Karl; Schmidt, Tobias; Kusebauch, Ulrike; Deutsch, Eric W.; Aebersold, Ruedi; Moritz, Robert L.; Wenschuh, Holger; Moehring, Thomas; Aiche, Stephan; Huhmer, Andreas; Reimer, Ulf; Kuster, Bernhard
2018-01-01
The ProteomeTools project builds molecular and digital tools from the human proteome to facilitate biomedical and life science research. Here, we report the generation and multimodal LC-MS/MS analysis of >330,000 synthetic tryptic peptides representing essentially all canonical human gene products and exemplify the utility of this data. The resource will be extended to >1 million peptides and all data will be shared with the community via ProteomicsDB and proteomeXchange. PMID:28135259
Comparative shotgun proteomics using spectral count data and quasi-likelihood modeling.
Li, Ming; Gray, William; Zhang, Haixia; Chung, Christine H; Billheimer, Dean; Yarbrough, Wendell G; Liebler, Daniel C; Shyr, Yu; Slebos, Robbert J C
2010-08-06
Shotgun proteomics provides the most powerful analytical platform for global inventory of complex proteomes using liquid chromatography-tandem mass spectrometry (LC-MS/MS) and allows a global analysis of protein changes. Nevertheless, sampling of complex proteomes by current shotgun proteomics platforms is incomplete, and this contributes to variability in assessment of peptide and protein inventories by spectral counting approaches. Thus, shotgun proteomics data pose challenges in comparing proteomes from different biological states. We developed an analysis strategy using quasi-likelihood Generalized Linear Modeling (GLM), included in a graphical interface software package (QuasiTel) that reads standard output from protein assemblies created by IDPicker, an HTML-based user interface to query shotgun proteomic data sets. This approach was compared to four other statistical analysis strategies: Student t test, Wilcoxon rank test, Fisher's Exact test, and Poisson-based GLM. We analyzed the performance of these tests to identify differences in protein levels based on spectral counts in a shotgun data set in which equimolar amounts of 48 human proteins were spiked at different levels into whole yeast lysates. Both GLM approaches and the Fisher Exact test performed adequately, each with their unique limitations. We subsequently compared the proteomes of normal tonsil epithelium and HNSCC using this approach and identified 86 proteins with differential spectral counts between normal tonsil epithelium and HNSCC. We selected 18 proteins from this comparison for verification of protein levels between the individual normal and tumor tissues using liquid chromatography-multiple reaction monitoring mass spectrometry (LC-MRM-MS). This analysis confirmed the magnitude and direction of the protein expression differences in all 6 proteins for which reliable data could be obtained. 
Our analysis demonstrates that shotgun proteomic data sets from different tissue phenotypes are sufficiently rich in quantitative information and that statistically significant differences in proteins spectral counts reflect the underlying biology of the samples. PMID:20586475
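One of the baseline tests compared in this abstract, Fisher's exact test, can be applied to spectral counts as sketched below. This is not the quasi-likelihood GLM implemented in QuasiTel. It is a plain two-sided Fisher test on a 2x2 table of (spectra for the protein, all other spectra) in two conditions, and the counts in the example are invented for illustration.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    Rows are conditions; columns are (spectra assigned to the protein,
    all remaining spectra). The p-value sums the hypergeometric
    probabilities of all tables that are no more likely than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Probability of x protein spectra in row 1 given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance so that ties with the observed table are included.
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Hypothetical counts: 12 of 1000 spectra in normal tissue, 30 of 1000 in tumor.
p_value = fisher_exact_two_sided(12, 988, 30, 970)
print(f"p = {p_value:.4g}")
```

The test conditions on the total number of spectra per sample, so it ignores variability between replicates; that limitation is one reason the abstract also evaluates model-based approaches.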
Analysis of high accuracy, quantitative proteomics data in the MaxQB database.
Schaab, Christoph; Geiger, Tamar; Stoehr, Gabriele; Cox, Juergen; Mann, Matthias
2012-03-01
MS-based proteomics generates rapidly increasing amounts of precise and quantitative information. Analysis of individual proteomic experiments has made great strides, but the crucial ability to compare and store information across different proteome measurements still presents many challenges. For example, it has been difficult to avoid contamination of databases with low quality peptide identifications, to control for the inflation in false positive identifications when combining data sets, and to integrate quantitative data. Although, for example, the contamination with low quality identifications has been addressed by joint analysis of deposited raw data in some public repositories, we reasoned that there should be a role for a database specifically designed for high resolution and quantitative data. Here we describe a novel database termed MaxQB that stores and displays collections of large proteomics projects and allows joint analysis and comparison. We demonstrate the analysis tools of MaxQB using proteome data of 11 different human cell lines and 28 mouse tissues. The database-wide false discovery rate is controlled by adjusting the project specific cutoff scores for the combined data sets. The 11 cell line proteomes together identify proteins expressed from more than half of all human genes. For each protein of interest, expression levels estimated by label-free quantification can be visualized across the cell lines. Similarly, the expression rank order and estimated amount of each protein within each proteome are plotted. We used MaxQB to calculate the signal reproducibility of the detected peptides for the same proteins across different proteomes. Spearman rank correlation between peptide intensity and detection probability of identified proteins was greater than 0.8 for 64% of the proteome, whereas a minority of proteins have negative correlation. 
This information can be used to pinpoint false protein identifications, independently of peptide database scores. The information contained in MaxQB, including high resolution fragment spectra, is accessible to the community via a user-friendly web interface at http://www.biochem.mpg.de/maxqb.
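The Spearman rank correlation mentioned in this abstract is the Pearson correlation of the ranks of two variables. The sketch below shows that calculation using only the standard library. It is not MaxQB code, and the intensity and detection-probability values are made up for illustration.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


# Hypothetical peptides of one protein: intensity vs. detection probability.
intensity = [1e5, 5e6, 2e4, 8e5]
detection = [0.4, 0.9, 0.1, 0.7]
rho = spearman(intensity, detection)
print(rho)
```

Because only ranks enter the calculation, the statistic is unaffected by the several orders of magnitude that peptide intensities typically span.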
Characterization of individual mouse cerebrospinal fluid proteomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, Jeffrey S.; Angel, Thomas E.; Chavkin, Charles
2014-03-20
Analysis of cerebrospinal fluid (CSF) offers key insight into the status of the central nervous system. Characterization of murine CSF proteomes can provide a valuable resource for studying central nervous system injury and disease in animal models. However, the small volume of CSF in mice has thus far limited individual mouse proteome characterization. Through non-terminal CSF extractions in C57Bl/6 mice and high-resolution liquid chromatography-mass spectrometry analysis of individual murine samples, we report the most comprehensive proteome characterization of individual murine CSF to date. Utilizing stringent protein inclusion criteria that required the identification of at least two unique peptides (1% false discovery rate at the peptide level) we identified a total of 566 unique proteins, including 128 proteins from three individual CSF samples that have been previously identified in brain tissue. Our methods and analysis provide a mechanism for individual murine CSF proteome analysis.
Identification of functional modules using network topology and high-throughput data.
Ulitsky, Igor; Shamir, Ron
2007-01-26
With the advent of systems biology, biological knowledge is often represented today by networks. These include regulatory and metabolic networks, protein-protein interaction networks, and many others. At the same time, high-throughput genomics and proteomics techniques generate very large data sets, which require sophisticated computational analysis. Usually, separate and different analysis methodologies are applied to each of the two data types. An integrated investigation of network and high-throughput information together can improve the quality of the analysis by accounting simultaneously for topological network properties alongside intrinsic features of the high-throughput data. We describe a novel algorithmic framework for this challenge. We first transform the high-throughput data into similarity values, (e.g., by computing pairwise similarity of gene expression patterns from microarray data). Then, given a network of genes or proteins and similarity values between some of them, we seek connected sub-networks (or modules) that manifest high similarity. We develop algorithms for this problem and evaluate their performance on the osmotic shock response network in S. cerevisiae and on the human cell cycle network. We demonstrate that focused, biologically meaningful and relevant functional modules are obtained. In comparison with extant algorithms, our approach has higher sensitivity and higher specificity. We have demonstrated that our method can accurately identify functional modules. Hence, it carries the promise to be highly useful in analysis of high throughput data.
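The two steps outlined in this abstract, turning high-throughput data into pairwise similarities and then looking for connected sub-networks with high similarity, can be illustrated with a deliberately simplified stand-in. The sketch below is not the authors' algorithm. It assumes Pearson correlation as the similarity measure, a fixed threshold of 0.8, and plain connected components as "modules"; the gene names, profiles and edges are hypothetical.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (non-constant inputs)."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def similar_modules(expression, edges, threshold=0.8):
    """Connected components of the network restricted to co-expressed edges."""
    adjacency = {gene: set() for gene in expression}
    for u, v in edges:
        # Keep a network edge only if its endpoints have similar profiles.
        if pearson(expression[u], expression[v]) >= threshold:
            adjacency[u].add(v)
            adjacency[v].add(u)
    seen, modules = set(), []
    for start in adjacency:
        if start in seen or not adjacency[start]:
            continue  # already assigned, or no retained edges
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        modules.append(component)
    return modules


# Hypothetical expression profiles and interaction edges.
expression = {
    "A": [1, 2, 3, 4],
    "B": [2, 4, 6, 8.5],
    "C": [4, 3, 2, 1],
    "D": [1, 3, 2, 5],
    "E": [1.1, 2.9, 2.2, 5.1],
}
edges = [("A", "B"), ("B", "C"), ("C", "D"), ("D", "E")]
modules = similar_modules(expression, edges)
print(sorted(sorted(m) for m in modules))
```

A hard threshold discards weak but consistent signal, which is one motivation for scoring whole sub-networks jointly as the abstract describes.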
Fu, Wenjiang J.; Stromberg, Arnold J.; Viele, Kert; Carroll, Raymond J.; Wu, Guoyao
2009-01-01
Over the past two decades, there have been revolutionary developments in life science technologies characterized by high throughput, high efficiency, and rapid computation. Nutritionists now have the advanced methodologies for the analysis of DNA, RNA, protein, low-molecular-weight metabolites, as well as access to bioinformatics databases. Statistics, which can be defined as the process of making scientific inferences from data that contain variability, has historically played an integral role in advancing nutritional sciences. Currently, in the era of systems biology, statistics has become an increasingly important tool to quantitatively analyze information about biological macromolecules. This article describes general terms used in statistical analysis of large, complex experimental data. These terms include experimental design, power analysis, sample size calculation, and experimental errors (type I and II errors) for nutritional studies at population, tissue, cellular, and molecular levels. In addition, we highlighted various sources of experimental variations in studies involving microarray gene expression, real-time polymerase chain reaction, proteomics, and other bioinformatics technologies. Moreover, we provided guidelines for nutritionists and other biomedical scientists to plan and conduct studies and to analyze the complex data. Appropriate statistical analyses are expected to make an important contribution to solving major nutrition-associated problems in humans and animals (including obesity, diabetes, cardiovascular disease, cancer, ageing, and intrauterine fetal retardation). PMID:20233650
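As an illustration of the sample size calculation and type I/II error terms listed in this abstract, the sketch below uses the standard normal-approximation formula for comparing two group means, n = 2 (z_{1-alpha/2} + z_{1-beta})^2 (sigma / delta)^2 per group. The formula is a textbook approximation and is not taken from the article; the function name and the example effect sizes are hypothetical.

```python
from math import ceil
from statistics import NormalDist


def sample_size_per_group(delta, sigma, alpha=0.05, power=0.80):
    """Approximate n per group for a two-sided, two-sample comparison of means.

    delta: smallest difference in means worth detecting
    sigma: common standard deviation of the measurement
    alpha: type I error rate; power = 1 - type II error rate
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    n = 2 * (z_alpha + z_beta) ** 2 * (sigma / delta) ** 2
    return ceil(n)  # round up to a whole number of subjects


# Detecting a difference of one standard deviation, and of half of one.
n_large_effect = sample_size_per_group(delta=1.0, sigma=1.0)
n_small_effect = sample_size_per_group(delta=0.5, sigma=1.0)
print(n_large_effect, n_small_effect)
```

Halving the detectable difference roughly quadruples the required sample size, which is why the effect size should be fixed before the experiment is planned. A t-distribution-based calculation gives slightly larger numbers for small groups.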
[Methods of quantitative proteomics].
Kopylov, A T; Zgoda, V G
2007-01-01
In modern science, proteomic analysis is inseparable from other fields of systems biology. Drawing on extensive resources, quantitative proteomics handles a vast amount of information on the molecular mechanisms of life. Advances in proteomics help researchers solve complex problems in cell signaling, posttranslational modification, structural and functional homology of proteins, molecular diagnostics, etc. More than 40 different methods have been developed in proteomics for the quantitative analysis of proteins. Although each method is unique and has certain advantages and disadvantages, all of them use various isotope labels (tags). In this review we consider the most popular and effective methods, employing chemical modification of proteins as well as metabolic and enzymatic isotope labeling.
Xu, Huilei; Baroukh, Caroline; Dannenfelser, Ruth; Chen, Edward Y; Tan, Christopher M; Kou, Yan; Kim, Yujin E; Lemischka, Ihor R; Ma'ayan, Avi
2013-01-01
High content studies that profile mouse and human embryonic stem cells (m/hESCs) using various genome-wide technologies such as transcriptomics and proteomics are constantly being published. However, efforts to integrate such data to obtain a global view of the molecular circuitry in m/hESCs are lagging behind. Here, we present an m/hESC-centered database called Embryonic Stem Cell Atlas from Pluripotency Evidence integrating data from many recent diverse high-throughput studies including chromatin immunoprecipitation followed by deep sequencing, genome-wide inhibitory RNA screens, gene expression microarrays or RNA-seq after knockdown (KD) or overexpression of critical factors, immunoprecipitation followed by mass spectrometry proteomics and phosphoproteomics. The database provides web-based interactive search and visualization tools that can be used to build subnetworks and to identify known and novel regulatory interactions across various regulatory layers. The web-interface also includes tools to predict the effects of combinatorial KDs by additive effects controlled by sliders, or through simulation software implemented in MATLAB. Overall, the Embryonic Stem Cell Atlas from Pluripotency Evidence database is a comprehensive resource for the stem cell systems biology community. Database URL: http://www.maayanlab.net/ESCAPE
Phosphoproteomic biomarkers predicting histologic nonalcoholic steatohepatitis and fibrosis.
Younossi, Zobair M; Baranova, Ancha; Stepanova, Maria; Page, Sandra; Calvert, Valerie S; Afendy, Arian; Goodman, Zachary; Chandhoke, Vikas; Liotta, Lance; Petricoin, Emanuel
2010-06-04
The progression of nonalcoholic fatty liver disease (NAFLD) has been linked to deregulated exchange of the endocrine signaling between adipose and liver tissue. Proteomic assays for the phosphorylation events that characterize the activated or deactivated state of the kinase-driven signaling cascades in visceral adipose tissue (VAT) could shed light on the pathogenesis of nonalcoholic steatohepatitis (NASH) and related fibrosis. Reverse-phase protein microarrays (RPMA) were used to develop biomarkers for NASH and fibrosis using VAT collected from 167 NAFLD patients (training cohort, N = 117; testing cohort, N = 50). Three types of models were developed for NASH and advanced fibrosis: clinical models, proteomics models, and combination models. NASH was predicted by a model that included measurements of two components of the insulin signaling pathway: AKT kinase and insulin receptor substrate 1 (IRS1). The models for fibrosis were less reliable when predictions were based on phosphoproteomic, clinical, or the combination data. The best performing model relied on levels of the phosphorylation of GSK3 as well as on two subunits of cyclic AMP regulated protein kinase A (PKA). Phosphoproteomics technology could potentially be used to provide pathogenic information about NASH and NASH-related fibrosis. This information can lead to a clinically relevant diagnostic/prognostic biomarker for NASH.
Cui, Jian; Liu, Jinghua; Li, Yuhua; Shi, Tieliu
2011-01-01
Mitochondria are major players in the production of energy and host several key reactions involved in basic metabolism and the biosynthesis of essential molecules. Currently, the majority of nucleus-encoded mitochondrial proteins are unknown, even for the model plant Arabidopsis. We report a computational framework for predicting Arabidopsis mitochondrial proteins based on a probabilistic model, called a Naive Bayesian Network, which integrates disparate genomic data generated from eight bioinformatics tools, multiple orthologous mappings, protein domain properties and co-expression patterns using 1,027 microarray profiles. Through this approach, we predicted 2,311 candidate mitochondrial proteins with 84.67% accuracy and a 2.53% false positive rate (FPR). Together with the experimentally confirmed proteins, 2,585 mitochondrial proteins (named CoreMitoP) were identified. We explored the proteins with unknown functions based on a protein-protein interaction network (PIN) and annotated novel functions for 26.65% of CoreMitoP proteins. Moreover, we found newly predicted mitochondrial proteins embedded in particular subnetworks of the PIN, mainly functioning in response to diverse environmental stresses such as salt, drought, cold, and wounding. Candidate mitochondrial proteins involved in those physiological activities provide useful targets for further investigation. The assigned functions also provide comprehensive information on the Arabidopsis mitochondrial proteome. PMID:21297957
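The way a naive Bayesian model combines independent lines of evidence can be sketched as follows. This is a generic illustration, not the authors' implementation: the function names are hypothetical, and the likelihood ratios in the usage note are made-up numbers rather than values from the study.

```python
def naive_bayes_posterior(prior, likelihood_ratios):
    """Combine independent evidence sources into a posterior probability.

    prior: prior probability that a protein is mitochondrial
    likelihood_ratios: one ratio P(evidence | mito) / P(evidence | not mito)
        per evidence source, assumed conditionally independent (the "naive" part)
    """
    odds = prior / (1 - prior)
    for ratio in likelihood_ratios:
        odds *= ratio
    return odds / (1 + odds)


def false_positive_rate(false_positives, true_negatives):
    """FPR = FP / (FP + TN), the quantity reported alongside accuracy."""
    return false_positives / (false_positives + true_negatives)
```

For example, with a prior of 0.5 and two hypothetical evidence sources with likelihood ratios 2 and 3, the posterior odds are 6, giving a probability of 6/7.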
GeneXplorer: an interactive web application for microarray data visualization and analysis.
Rees, Christian A; Demeter, Janos; Matese, John C; Botstein, David; Sherlock, Gavin
2004-10-01
When publishing large-scale microarray datasets, it is of great value to create supplemental websites where either the full data, or selected subsets corresponding to figures within the paper, can be browsed. We set out to create a CGI application containing many of the features of some of the existing standalone software for the visualization of clustered microarray data. We present GeneXplorer, a web application for interactive microarray data visualization and analysis in a web environment. GeneXplorer allows users to browse a microarray dataset in an intuitive fashion. It provides simple access to microarray data over the Internet and uses only HTML and JavaScript to display graphic and annotation information. It provides radar and zoom views of the data, allows display of the nearest neighbors to a gene expression vector based on their Pearson correlations and provides the ability to search gene annotation fields. The software is released under the permissive MIT Open Source license, and the complete documentation and the entire source code are freely available for download from CPAN http://search.cpan.org/dist/Microarray-GeneXplorer/.
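The nearest-neighbor display based on Pearson correlation can be illustrated with a short Python sketch. GeneXplorer itself is distributed as a Perl module on CPAN; the code below is an independent illustration, and the function names and expression profiles are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression vectors.

    Assumes neither vector is constant (otherwise the denominator is zero).
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def nearest_neighbors(query, profiles, k=2):
    """Return the k genes whose profiles correlate best with `query`."""
    scores = [
        (pearson(profiles[query], vector), gene)
        for gene, vector in profiles.items()
        if gene != query
    ]
    scores.sort(reverse=True)  # highest correlation first
    return [gene for _, gene in scores[:k]]


# Hypothetical expression profiles across four arrays.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.1],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [1.0, 3.0, 2.0, 4.0],
}
```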
Reddy, Panga Jaipal; Sinha, Sneha; Ray, Sandipan; Sathe, Gajanan J.; Chatterjee, Aditi; Prasad, T. S. Keshava; Dhali, Snigdha; Srikanth, Rapole; Panda, Dulal; Srivastava, Sanjeeva
2015-01-01
Curcumin is a natural dietary compound with antimicrobial activity against various Gram-positive and Gram-negative bacteria. This study aims to investigate the proteome-level alterations in Bacillus subtilis due to curcumin treatment and to identify its molecular/cellular targets in order to understand the mechanism of action. We have performed a comprehensive proteomic analysis of the B. subtilis AH75 strain at different time intervals of curcumin treatment (20, 60 and 120 min after the drug exposure, three replicates) to compare the protein expression profiles using two complementary quantitative proteomic techniques, 2D-DIGE and iTRAQ. To the best of our knowledge, this is the first comprehensive longitudinal investigation describing the effect of curcumin treatment on the B. subtilis proteome. The proteomics analysis revealed several interesting targets such as UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1, putative septation protein SpoVG and ATP-dependent Clp protease proteolytic subunit. Further, in silico pathway analysis using DAVID and KOBAS revealed modulation of pathways related to fatty acid metabolism and cell wall synthesis, which are crucial for cell viability. Our findings revealed that curcumin treatment led to inhibition of cell wall and fatty acid synthesis in addition to differential expression of many crucial proteins involved in modulation of bacterial metabolism. Findings obtained from the proteomics analysis were further validated using a 5-cyano-2,3-ditolyl tetrazolium chloride (CTC) assay for respiratory activity, a resazurin assay for metabolic activity and a membrane integrity assay based on potassium and inorganic phosphate leakage measurement. The gene expression analysis of selected cell wall biosynthesis enzymes strengthened the proteomics findings and indicated a major effect of curcumin on cell division. PMID:25874956
Nanotechnology: moving from microarrays toward nanoarrays.
Chen, Hua; Li, Jun
2007-01-01
Microarrays are important tools for high-throughput analysis of biomolecules. The use of microarrays for parallel screening of nucleic acid and protein profiles has become an industry standard. Limitations of microarrays include the requirement for relatively large sample volumes, long incubation times, and the limit of detection. In addition, traditional microarrays rely on bulky instrumentation for detection, and sample amplification and labeling are quite laborious, which increases analysis cost and delays results. These problems keep microarray techniques from point-of-care and field applications. One strategy for overcoming these problems is to develop nanoarrays, particularly electronics-based nanoarrays. With further miniaturization, higher sensitivity, and simplified sample preparation, nanoarrays could potentially be employed for biomolecular analysis in personal healthcare and monitoring of trace pathogens. This chapter introduces the concept and advantages of nanotechnology and then describes current methods and protocols for novel nanoarrays in three areas: (1) label-free nucleic acid analysis using nanoarrays, (2) nanoarrays for protein detection by conventional optical fluorescence microscopy as well as by novel label-free methods such as atomic force microscopy, and (3) nanoarrays for enzyme-based assays. These nanoarrays will have significant applications in drug discovery, medical diagnosis, genetic testing, environmental monitoring, and food safety inspection.
Analyzing large-scale proteomics projects with latent semantic indexing.
Klie, Sebastian; Martens, Lennart; Vizcaíno, Juan Antonio; Côté, Richard; Jones, Phil; Apweiler, Rolf; Hinneburg, Alexander; Hermjakob, Henning
2008-01-01
Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.
Decentralized Data Sharing of Tissue Microarrays for Investigative Research in Oncology
Chen, Wenjin; Schmidt, Cristina; Parashar, Manish; Reiss, Michael; Foran, David J.
2007-01-01
Tissue microarray technology (TMA) is a relatively new approach for efficiently and economically assessing protein and gene expression across large ensembles of tissue specimens. Tissue microarray technology holds great potential for reducing the time and cost associated with conducting research in tissue banking, proteomics, and outcome studies. However, the sheer volume of images and other data generated from even limited studies involving tissue microarrays quickly approaches the processing capacity and resources of a division or department. This challenge is compounded by the fact that large-scale projects in several areas of modern research rely upon multi-institutional efforts in which investigators and resources are spread out over multiple campuses, cities, and states. To address some of the data management issues several leading institutions have begun to develop their own “in-house” systems, independently, but such data will be only minimally useful if it isn’t accessible to others in the scientific community. Investigators at different institutions studying the same or related disorders might benefit from the synergy of sharing results. To facilitate sharing of TMA data across different database implementations, the Technical Standards Committee of the Association for Pathology Informatics organized workshops in efforts to establish a standardized TMA data exchange specification. The focus of our research does not relate to the establishment of standards for exchange, but rather builds on these efforts and concentrates on the design, development and deployment of a decentralized collaboratory for the unsupervised characterization, and seamless and secure discovery and sharing of TMA data. Specifically, we present a self-organizing, peer-to-peer indexing and discovery infrastructure for quantitatively assessing digitized TMAs. The system utilizes a novel, optimized decentralized search engine that supports flexible querying, while guaranteeing that once information has been stored in the system, it will be found with bounded costs. PMID:19081778
A meta-data based method for DNA microarray imputation.
Jörnsten, Rebecka; Ouyang, Ming; Wang, Hui-Yu
2007-03-29
DNA microarray experiments are conducted in logical sets, such as time course profiling after a treatment is applied to the samples, or comparisons of the samples under two or more conditions. Due to cost and design constraints of spotted cDNA microarray experiments, each logical set commonly includes only a small number of replicates per condition. Despite the vast improvement of the microarray technology in recent years, missing values are prevalent. Intuitively, imputation of missing values is best done using many replicates within the same logical set. In practice, there are few replicates and thus reliable imputation within logical sets is difficult. However, it is in the case of few replicates that the presence of missing values, and how they are imputed, can have the most profound impact on the outcome of downstream analyses (e.g. significance analysis and clustering). This study explores the feasibility of imputation across logical sets, using the vast amount of publicly available microarray data to improve imputation reliability in the small sample size setting. We download all cDNA microarray data of Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans from the Stanford Microarray Database. Through cross-validation and simulation, we find that, for all three species, our proposed imputation using data from public databases is far superior to imputation within a logical set, sometimes to an astonishing degree. Furthermore, the imputation root mean square error for significant genes is generally a lot less than that of non-significant ones. Since downstream analysis of significant genes, such as clustering and network analysis, can be very sensitive to small perturbations of estimated gene effects, it is highly recommended that researchers apply reliable data imputation prior to further analysis. Our method can also be applied to cDNA microarray experiments from other species, provided good reference data are available.
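The root mean square error used to compare imputation strategies can be sketched as below. The row-mean imputer is only a simple within-set baseline chosen for illustration; it is not the meta-data based method proposed in the paper, and all names and values are hypothetical.

```python
from math import sqrt


def rmse(true_values, imputed_values):
    """Root mean square error between held-out true values and imputed values."""
    squared = [(t - i) ** 2 for t, i in zip(true_values, imputed_values)]
    return sqrt(sum(squared) / len(squared))


def row_mean_impute(row):
    """Baseline imputation: replace missing entries (None) with the row mean.

    Assumes the row has at least one observed value.
    """
    observed = [v for v in row if v is not None]
    mean = sum(observed) / len(observed)
    return [mean if v is None else v for v in row]
```

In a cross-validation setting, known values are masked, imputed, and then compared with the originals via `rmse`; a lower value indicates more reliable imputation.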
NCI's Office of Cancer Clinical Proteomics Research authored a review of the current state of clinical proteomics in the peer-reviewed Journal of Proteome Research. The review highlights outcomes from the CPTC program and also provides a thorough overview of the different technologies that have pushed the field forward. Additionally, the review provides a vision for moving the field forward through linking advances in genomic and proteomic analysis to develop new, molecularly targeted interventions.
2010-01-01
Background: The development of DNA microarrays has facilitated the generation of hundreds of thousands of transcriptomic datasets. The use of a common reference microarray design allows existing transcriptomic data to be readily compared and re-analysed in the light of new data, and the combination of this design with large datasets is ideal for 'systems'-level analyses. One issue is that these datasets are typically collected over many years and may be heterogeneous in nature, containing different microarray file formats and gene array layouts, dye-swaps, and showing varying scales of log2-ratios of expression between microarrays. Excellent software exists for the normalisation and analysis of microarray data but many data have yet to be analysed as existing methods struggle with heterogeneous datasets; options include normalising microarrays on an individual or experimental group basis. Our solution was to develop the Batch Anti-Banana Algorithm in R (BABAR) algorithm and software package which uses cyclic loess to normalise across the complete dataset. We have already used BABAR to analyse the function of Salmonella genes involved in the process of infection of mammalian cells. Results: The only input required by BABAR is unprocessed GenePix or BlueFuse microarray data files. BABAR provides a combination of 'within' and 'between' microarray normalisation steps and diagnostic boxplots. When applied to a real heterogeneous dataset, BABAR normalised the dataset to produce a comparable scaling between the microarrays, with the microarray data in excellent agreement with RT-PCR analysis. When applied to a real non-heterogeneous dataset and a simulated dataset, BABAR's performance in identifying differentially expressed genes showed some benefits over standard techniques. Conclusions: BABAR is an easy-to-use software tool, simplifying the simultaneous normalisation of heterogeneous two-colour common reference design cDNA microarray-based transcriptomic datasets. We show BABAR transforms real and simulated datasets to allow for the correct interpretation of these data, and is the ideal tool to facilitate the identification of differentially expressed genes or network inference analysis from transcriptomic datasets. PMID:20128918
Arntzen, Magnus Ø.; Thiede, Bernd
2012-01-01
Apoptosis is the most commonly described form of programmed cell death, and dysfunction is implicated in a large number of human diseases. Many quantitative proteome analyses of apoptosis have been performed to gain insight in proteins involved in the process. This resulted in large and complex data sets that are difficult to evaluate. Therefore, we developed the ApoptoProteomics database for storage, browsing, and analysis of the outcome of large scale proteome analyses of apoptosis derived from human, mouse, and rat. The proteomics data of 52 publications were integrated and unified with protein annotations from UniProt-KB, the caspase substrate database homepage (CASBAH), and gene ontology. Currently, more than 2300 records of more than 1500 unique proteins were included, covering a large proportion of the core signaling pathways of apoptosis. Analysis of the data set revealed a high level of agreement between the reported changes in directionality reported in proteomics studies and expected apoptosis-related function and may disclose proteins without a current recognized involvement in apoptosis based on gene ontology. Comparison between induction of apoptosis by the intrinsic and the extrinsic apoptotic signaling pathway revealed slight differences. Furthermore, proteomics has significantly contributed to the field of apoptosis in identifying hundreds of caspase substrates. The database is available at http://apoptoproteomics.uio.no. PMID:22067098
Clinical proteomic analysis of scrub typhus infection.
Park, Edmond Changkyun; Lee, Sang-Yeop; Yun, Sung Ho; Choi, Chi-Won; Lee, Hayoung; Song, Hyun Seok; Jun, Sangmi; Kim, Gun-Hwa; Lee, Chang-Seop; Kim, Seung Il
2018-01-01
Scrub typhus is an acute, febrile infectious disease caused by the Gram-negative α-proteobacterium Orientia tsutsugamushi from the family Rickettsiaceae that is widely distributed in Northern, Southern and Eastern Asia. In the present study, we analysed the serum proteome of scrub typhus patients to investigate specific clinical protein patterns in an attempt to explain pathophysiology and discover potential biomarkers of infection. Serum samples were collected from three patients (before and after treatment with antibiotics) and three healthy subjects. One-dimensional sodium dodecyl sulphate-polyacrylamide gel electrophoresis followed by liquid chromatography-tandem mass spectrometry was performed to identify differentially abundant proteins using quantitative proteomic approaches. Bioinformatic analysis was then performed using Ingenuity Pathway Analysis. Proteomic analysis identified 236 serum proteins, of which 32 were differentially expressed in normal subjects, naive scrub typhus patients and patients treated with antibiotics. Comparative bioinformatic analysis of the identified proteins revealed up-regulation of proteins involved in immune responses, especially the complement system, following infection with O. tsutsugamushi, and normal expression was largely rescued by antibiotic treatment. This is the first proteomic study of clinical serum samples from scrub typhus patients. Proteomic analysis identified changes in protein expression upon infection with O. tsutsugamushi and following antibiotic treatment. Our results provide valuable information for further investigation of scrub typhus therapy and diagnosis.
Röst, Hannes L; Liu, Yansheng; D'Agostino, Giuseppe; Zanella, Matteo; Navarro, Pedro; Rosenberger, George; Collins, Ben C; Gillet, Ludovic; Testa, Giuseppe; Malmström, Lars; Aebersold, Ruedi
2016-09-01
Next-generation mass spectrometric (MS) techniques such as SWATH-MS have substantially increased the throughput and reproducibility of proteomic analysis, but ensuring consistent quantification of thousands of peptide analytes across multiple liquid chromatography-tandem MS (LC-MS/MS) runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we developed TRIC (http://proteomics.ethz.ch/tric/), a software tool that utilizes fragment-ion data to perform cross-run alignment, consistent peak-picking and quantification for high-throughput targeted proteomics. TRIC reduced the identification error compared to a state-of-the-art SWATH-MS analysis without alignment by more than threefold at constant recall while correcting for highly nonlinear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups. Thus, TRIC fills a gap in the pipeline for automated analysis of massively parallel targeted proteomics data sets.
Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes
Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen
2016-01-01
Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose functions indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7) not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. PMID:27215607
Anderson, Karen S.; Ramachandran, Niroshan; Wong, Jessica; Raphael, Jacob V.; Hainsworth, Eugenie; Demirkan, Gokhan; Cramer, Daniel; Aronzon, Diana; Hodi, F. Stephen; Harris, Lyndsay; Logvinenko, Tanya; LaBaer, Joshua
2012-01-01
There is strong preclinical evidence that cancer, including breast cancer, undergoes immune surveillance. This continual monitoring, by both the innate and the adaptive immune systems, recognizes changes in protein expression, mutation, folding, glycosylation, and degradation. Local immune responses to tumor antigens are amplified in draining lymph nodes, and then enter the systemic circulation. Antibody responses to tumor antigens, such as p53 protein, are robust, stable, and easily detected in serum; they may exist in greater concentrations than their cognate antigens and are potentially highly specific biomarkers for cancer. However, antibodies have limited sensitivities as single analytes, and differences in protein purification and assay characteristics have limited their clinical application. For example, p53 autoantibodies in the sera are highly specific for cancer patients, but are only detected in the sera of 10-20% of patients with breast cancer. Detection of p53 autoantibodies depends on tumor burden and p53 mutation and decreases rapidly with effective therapy, but is relatively independent of breast cancer subtype. Although antibodies to hundreds of other tumor antigens have been identified in the sera of breast cancer patients, very little is known about the specificity and clinical impact of the antibody immune repertoire to breast cancer. Recent advances in proteomic technologies have the potential for rapid identification of immune response signatures for breast cancer diagnosis and monitoring. We have adapted programmable protein microarrays for the specific detection of autoantibodies in breast cancer. Here, we present the first demonstration of the application of programmable protein microarray ELISAs for the rapid identification of breast cancer autoantibodies. PMID:18311903
van Huet, Ramon A. C.; Pierrache, Laurence H.M.; Meester-Smoor, Magda A.; Klaver, Caroline C.W.; van den Born, L. Ingeborgh; Hoyng, Carel B.; de Wijs, Ilse J.; Collin, Rob W. J.; Hoefsloot, Lies H.
2015-01-01
Purpose: To determine the efficacy of multiple versions of a commercially available arrayed primer extension (APEX) microarray chip for autosomal recessive retinitis pigmentosa (arRP). Methods: We included 250 probands suspected of arRP who were genetically analyzed with the APEX microarray between January 2008 and November 2013. The mode of inheritance had to be autosomal recessive according to the pedigree (including isolated cases). If the microarray identified a heterozygous mutation, we performed Sanger sequencing of exons and exon–intron boundaries of that specific gene. The efficacy of this microarray chip with the additional Sanger sequencing approach was determined by the percentage of patients that received a molecular diagnosis. We also collected data from genetic tests other than the APEX analysis for arRP to provide a detailed description of the molecular diagnoses in our study cohort. Results: The APEX microarray chip for arRP identified the molecular diagnosis in 21 (8.5%) of the patients in our cohort. Additional Sanger sequencing yielded a second mutation in 17 patients (6.8%), thereby establishing the molecular diagnosis. In total, 38 patients (15.2%) received a molecular diagnosis after analysis using the microarray and additional Sanger sequencing approach. Further genetic analyses after a negative result of the arRP microarray (n = 107) resulted in a molecular diagnosis of arRP (n = 23), autosomal dominant RP (n = 5), X-linked RP (n = 2), and choroideremia (n = 1). Conclusions: The efficacy of the commercially available APEX microarray chips for arRP appears to be low, most likely caused by the limitations of this technique and the genetic and allelic heterogeneity of RP. Diagnostic yields up to 40% have been reported for next-generation sequencing (NGS) techniques that, as expected, thereby outperform targeted APEX analysis. PMID:25999674
Marine proteomics: a critical assessment of an emerging technology.
Slattery, Marc; Ankisetty, Sridevi; Corrales, Jone; Marsh-Hunkin, K Erica; Gochfeld, Deborah J; Willett, Kristine L; Rimoldi, John M
2012-10-26
The application of proteomics to marine sciences has increased in recent years because the proteome represents the interface between genotypic and phenotypic variability and, thus, corresponds to the broadest possible biomarker for eco-physiological responses and adaptations. Likewise, proteomics can provide important functional information regarding biosynthetic pathways, as well as insights into mechanism of action, of novel marine natural products. The goal of this review is to (1) explore the application of proteomics methodologies to marine systems, (2) assess the technical approaches that have been used, and (3) evaluate the pros and cons of this proteomic research, with the intent of providing a critical analysis of its future roles in marine sciences. To date, proteomics techniques have been utilized to investigate marine microbe, plant, invertebrate, and vertebrate physiology, developmental biology, seafood safety, susceptibility to disease, and responses to environmental change. However, marine proteomics studies often suffer from poor experimental design, sample processing/optimization difficulties, and data analysis/interpretation issues. Moreover, a major limitation is the lack of available annotated genomes and proteomes for most marine organisms, including several "model species". Even with these challenges in mind, there is no doubt that marine proteomics is a rapidly expanding and powerful integrative molecular research tool from which our knowledge of the marine environment, and the natural products from this resource, will be significantly expanded.
Preprocessing and Analysis of LC-MS-Based Proteomic Data
Tsai, Tsung-Heng; Wang, Minkun; Ressom, Habtom W.
2016-01-01
Liquid chromatography coupled with mass spectrometry (LC-MS) has been widely used for profiling protein expression levels. This chapter is focused on LC-MS data preprocessing, which is a crucial step in the analysis of LC-MS-based proteomics. We provide a high-level overview, highlight associated challenges, and present a step-by-step example for analysis of data from an LC-MS-based untargeted proteomic study. Furthermore, key procedures and relevant issues with the subsequent analysis by multiple reaction monitoring (MRM) are discussed. PMID:26519169
Principles of gene microarray data analysis.
Mocellin, Simone; Rossi, Carlo Riccardo
2007-01-01
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients with tumors. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo
2009-04-01
For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo-Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmium-contaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.
Support vector machine and principal component analysis for microarray data classification
NASA Astrophysics Data System (ADS)
Astuti, Widi; Adiwijaya
2018-03-01
Cancer is a leading cause of death worldwide, although a significant proportion of cases can be cured if detected early. In recent decades, microarray technology has taken on an important role in the diagnosis of cancer. Using data mining techniques, microarray data classification can be performed to improve the accuracy of cancer diagnosis compared to traditional techniques. Microarray data are characterized by small sample sizes but very high dimensionality. This poses a challenge for researchers to provide solutions for microarray data classification with high performance in both accuracy and running time. This research proposes the use of Principal Component Analysis (PCA) as a dimension reduction method along with a Support Vector Machine (SVM) optimized by kernel functions as a classifier for microarray data classification. The proposed scheme was applied to seven data sets using 5-fold cross validation, and evaluation and analysis were then conducted in terms of both accuracy and running time. The results showed that the scheme obtained 100% accuracy for the Ovarian and Lung Cancer data when Linear and Cubic kernel functions were used. In terms of running time, PCA greatly reduced the running time for every data set.
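The pipeline described in this abstract (dimension reduction by PCA, then an SVM classifier, scored by 5-fold cross validation) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a single principal component found by power iteration, a linear kernel only (the abstract also mentions a cubic kernel), training by plain hinge-loss subgradient descent, and a small synthetic data set. All function names and parameter values are hypothetical.

```python
import random

def center(X, mu=None):
    """Subtract per-feature means (computed from X unless mu is given)."""
    d = len(X[0])
    if mu is None:
        mu = [sum(r[j] for r in X) / len(X) for j in range(d)]
    return [[r[j] - mu[j] for j in range(d)] for r in X], mu

def first_pc(Xc, iters=100):
    """Leading principal axis of centered data via power iteration on X^T X."""
    d = len(Xc[0])
    v = [1.0] * d
    for _ in range(iters):
        scores = [sum(r[j] * v[j] for j in range(d)) for r in Xc]
        w = [sum(s * r[j] for s, r in zip(scores, Xc)) for j in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return v

def project(Xc, v):
    """Project each centered sample onto the axis v (one score per sample)."""
    return [sum(a * b for a, b in zip(r, v)) for r in Xc]

def train_linear_svm(z, y, lr=0.1, lam=0.01, epochs=200):
    """Hinge-loss subgradient descent on 1-D inputs; labels are +1 / -1."""
    w, b, n = 0.0, 0.0, len(z)
    for _ in range(epochs):
        # Samples inside the margin are the only ones that drive the update.
        viol = [(zi, yi) for zi, yi in zip(z, y) if yi * (w * zi + b) < 1]
        w -= lr * (lam * w - sum(yi * zi for zi, yi in viol) / n)
        b += lr * sum(yi for zi, yi in viol) / n
    return w, b

def cross_validate(X, y, k=5):
    """k-fold cross validation; PCA and SVM are fit on the training folds only."""
    correct = 0
    for fold in range(k):
        test_idx = [i for i in range(len(X)) if i % k == fold]
        train_idx = [i for i in range(len(X)) if i % k != fold]
        Xtr, mu = center([X[i] for i in train_idx])
        v = first_pc(Xtr)
        w, b = train_linear_svm(project(Xtr, v), [y[i] for i in train_idx])
        Xte, _ = center([X[i] for i in test_idx], mu)
        for zi, i in zip(project(Xte, v), test_idx):
            pred = 1 if w * zi + b >= 0 else -1
            correct += pred == y[i]
    return correct / len(X)

# Synthetic stand-in for expression data: 20 samples, 6 features,
# with class +1 shifted upward in the first three features.
rng = random.Random(0)
labels = [1 if i % 2 else -1 for i in range(20)]
samples = [
    [(4.0 if lab == 1 and j < 3 else 0.0) + rng.uniform(-0.5, 0.5) for j in range(6)]
    for lab in labels
]
accuracy = cross_validate(samples, labels)
print(f"5-fold accuracy: {accuracy:.2f}")
```

Fitting the PCA axis and the feature means inside each training fold, rather than on the full data set, keeps information from the held-out samples out of the model being evaluated.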
Experimental Approaches to Microarray Analysis of Tumor Samples
ERIC Educational Resources Information Center
Furge, Laura Lowe; Winter, Michael B.; Meyers, Jacob I.; Furge, Kyle A.
2008-01-01
Comprehensive measurement of gene expression using high-density nucleic acid arrays (i.e. microarrays) has become an important tool for investigating the molecular differences in clinical and research samples. Consequently, inclusion of discussion in biochemistry, molecular biology, or other appropriate courses of microarray technologies has…
Multiplex cDNA quantification method that facilitates the standardization of gene expression data
Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira
2011-01-01
Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because the microarray method can provide only the relative amounts of gene expression levels. Therefore, valid comparisons of the microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing the unfavorable limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008
Spot detection and image segmentation in DNA microarray data.
Qin, Li; Rueda, Luis; Ali, Adnan; Ngom, Alioune
2005-01-01
Following the invention of microarrays in 1994, the development and applications of this technology have grown exponentially. The numerous applications of microarray technology include clinical diagnosis and treatment, drug design and discovery, tumour detection, and environmental health research. One of the key issues in the experimental approaches utilising microarrays is to extract quantitative information from the spots, which represent genes in a given experiment. For this process, the initial stages are important and they influence future steps in the analysis. Identifying the spots and separating the background from the foreground is a fundamental problem in DNA microarray data analysis. In this review, we present an overview of state-of-the-art methods for microarray image segmentation. We discuss the foundations of the circle-shaped approach, adaptive shape segmentation, histogram-based methods and the recently introduced clustering-based techniques. We analytically show that clustering-based techniques are equivalent to the one-dimensional, standard k-means clustering algorithm that utilises the Euclidean distance.
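The abstract above relates clustering-based segmentation to one-dimensional k-means with Euclidean distance. A minimal sketch of that idea follows. It assumes pixel intensities are given as a flat list and that two clusters (background and foreground) are wanted. The function name, the initialisation scheme, and the intensity values are hypothetical and are not taken from the review.

```python
def kmeans_1d(values, k=2, iters=100):
    """Standard k-means on scalar values (k >= 2) using squared Euclidean distance."""
    ordered = sorted(values)
    # Initialise centroids evenly across the sorted values (minimum ... maximum).
    centroids = [ordered[(len(ordered) - 1) * i // (k - 1)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        # Assignment step: each value goes to its nearest centroid.
        labels = [min(range(k), key=lambda c: (v - centroids[c]) ** 2) for v in values]
        # Update step: each centroid becomes the mean of its members.
        updated = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return centroids, labels

# Hypothetical pixel intensities from one spot region:
# low values are background, high values are spot foreground.
pixels = [12, 15, 11, 14, 200, 210, 190, 13, 205, 16]
centroids, labels = kmeans_1d(pixels)
foreground = [p for p, lab in zip(pixels, labels) if lab == 1]
print(centroids, foreground)
```

Because the centroids are initialised in ascending order, cluster 0 starts at the darkest pixel and cluster 1 at the brightest, so in this two-cluster setting label 1 can be read as foreground.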
Split-plot microarray experiments: issues of design, power and sample size.
Tsai, Pi-Wen; Lee, Mei-Ling Ting
2005-01-01
This article focuses on microarray experiments with two or more factors in which treatment combinations of the factors corresponding to the samples paired together onto arrays are not completely random. A main effect of one (or more) factor(s) is confounded with arrays (the experimental blocks). This is called a split-plot microarray experiment. We utilise an analysis of variance (ANOVA) model to assess differentially expressed genes for between-array and within-array comparisons that are generic under a split-plot microarray experiment. Instead of standard t- or F-test statistics that rely on mean square errors of the ANOVA model, we use a robust method, referred to as 'a pooled percentile estimator', to identify genes that are differentially expressed across different treatment conditions. We illustrate the design and analysis of split-plot microarray experiments based on a case application described by Jin et al. A brief discussion of power and sample size for split-plot microarray experiments is also presented.
Szcześniak, K A; Ciecierska, A; Ostaszewski, P; Sadkowski, T
2016-01-01
Adult skeletal muscle myogenesis depends on the activation of satellite cells that have the potential to differentiate into new fibers. Gamma-oryzanol (GO), a commercially available nutriactive phytochemical, has gained global interest on account of its muscle-building and regenerating effects. Here, we investigated GO for its potential influence on myogenesis, using an equine satellite cell culture model, since the horse is a unique animal, bred and exercised for competitive sport. To our knowledge, this is the first report where the global gene expression in cultured equine satellite cells has been described. Equine satellite cells were isolated from semitendinosus muscle and cultured until the second day of differentiation. Differentiating cells were incubated with GO for the next 24 h. Subsequently, total RNA from GO-treated and control cells was isolated, amplified, labeled, and hybridized to two-color Horse Gene Expression Microarray slides. Quantitative PCR was used for the validation of microarray data. Our results revealed 58 genes with changed expression in GO-treated vs. control cells. Analysis of expression changes suggests that various processes are reinforced by GO in differentiating equine satellite cells, including inhibition of myoblast differentiation, increased proliferation and differentiation, stress response, and increased myogenic lineage commitment. The present study may confirm putative muscle-enhancing abilities of GO; however, the collective role of GO in skeletal myogenesis remains equivocal. The diversity of these changes is likely due to the heterogeneous growth rate of cells in primary culture. Genes identified in our study, modulated by the presence of GO, may become potential targets of future research investigating the impact of this supplement in skeletal muscle at the proteomic and biochemical levels.
Tilton, Susan C.; Menachery, Vineet D.; Gralinski, Lisa E.; Schäfer, Alexandra; Matzke, Melissa M.; Webb-Robertson, Bobbie-Jo M.; Chang, Jean; Luna, Maria L.; Long, Casey E.; Shukla, Anil K.; Bankhead, Armand R.; Burkett, Susan E.; Zornetzer, Gregory; Tseng, Chien-Te Kent; Metz, Thomas O.; Pickles, Raymond; McWeeney, Shannon; Smith, Richard D.; Katze, Michael G.; Waters, Katrina M.; Baric, Ralph S.
2013-01-01
The severe acute respiratory syndrome coronavirus accessory protein ORF6 antagonizes interferon signaling by blocking karyopherin-mediated nuclear import processes. Viral nuclear import antagonists, expressed by several highly pathogenic RNA viruses, likely mediate pleiotropic effects on host gene expression, presumably interfering with transcription factors, cytokines, hormones, and/or signaling cascades that occur in response to infection. By bioinformatic and systems biology approaches, we evaluated the impact of nuclear import antagonism on host expression networks by using human lung epithelial cells infected with either wild-type virus or a mutant that does not express ORF6 protein. Microarray analysis revealed significant changes in differential gene expression, with approximately twice as many upregulated genes in the mutant virus samples by 48 h postinfection, despite identical viral titers. Our data demonstrated that ORF6 protein expression attenuates the activity of numerous karyopherin-dependent host transcription factors (VDR, CREB1, SMAD4, p53, EpasI, and Oct3/4) that are critical for establishing antiviral responses and regulating key host responses during virus infection. Results were confirmed by proteomic and chromatin immunoprecipitation assay analyses and in parallel microarray studies using infected primary human airway epithelial cell cultures. The data strongly support the hypothesis that viral antagonists of nuclear import actively manipulate host responses in specific hierarchical patterns, contributing to the viral pathogenic potential in vivo. Importantly, these studies and modeling approaches not only provide templates for evaluating virus antagonism of nuclear import processes but also can reveal candidate cellular genes and pathways that may significantly influence disease outcomes following severe acute respiratory syndrome coronavirus infection in vivo. PMID:23365422
Exposure to Cobalt Causes Transcriptomic and Proteomic Changes in Two Rat Liver Derived Cell Lines
Permenter, Matthew G.; Dennis, William E.; Sutto, Thomas E.; Jackson, David A.; Lewis, John A.; Stallings, Jonathan D.
2013-01-01
Cobalt is a transition group metal present in trace amounts in the human diet, but in larger doses it can be acutely toxic or cause adverse health effects in chronic exposures. Its use in many industrial processes and alloys worldwide presents opportunities for occupational exposures, including military personnel. While the toxic effects of cobalt have been widely studied, the exact mechanisms of toxicity remain unclear. In order to further elucidate these mechanisms and identify potential biomarkers of exposure or effect, we exposed two rat liver-derived cell lines, H4-II-E-C3 and MH1C1, to two concentrations of cobalt chloride. We examined changes in gene expression using DNA microarrays in both cell lines and examined changes in cytoplasmic protein abundance in MH1C1 cells using mass spectrometry. We chose to closely examine differentially expressed genes and proteins changing in abundance in both cell lines in order to remove cell line specific effects. We identified enriched pathways, networks, and biological functions using commercial bioinformatic tools and manual annotation. Many of the genes, proteins, and pathways modulated by exposure to cobalt appear to be due to an induction of a hypoxic-like response and oxidative stress. Genes that may be differentially expressed due to a hypoxic-like response are involved in Hif-1α signaling, glycolysis, gluconeogenesis, and other energy metabolism related processes. Gene expression changes linked to oxidative stress are also known to be involved in the NRF2-mediated response, protein degradation, and glutathione production. Using microarray and mass spectrometry analysis, we were able to identify modulated genes and proteins, further elucidate the mechanisms of toxicity of cobalt, and identify biomarkers of exposure and effect in vitro, thus providing targets for focused in vivo studies. PMID:24386269
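The filtering step described above, keeping only the changes seen in both cell lines so that cell-line-specific effects drop out, amounts to a set intersection. The sketch below is a minimal illustration under stated assumptions: each cell line is represented as a mapping from gene to log fold change, and a shared gene is kept only if it changes in the same direction in both lines. The concordance requirement, the function name, the gene names, and the values are all hypothetical and are not taken from the study.

```python
def shared_changes(line_a, line_b):
    """Return genes changed in both cell lines with the same direction of change."""
    shared = {}
    for gene in line_a.keys() & line_b.keys():
        fold_a, fold_b = line_a[gene], line_b[gene]
        # Assumption: require concordant direction (both up or both down).
        if (fold_a > 0) == (fold_b > 0):
            shared[gene] = (fold_a, fold_b)
    return shared

# Hypothetical log2 fold changes for genes already called differentially expressed.
h4_changes = {"geneA": 1.8, "geneB": -1.2, "geneC": 2.1}
mh1c1_changes = {"geneA": 2.2, "geneC": -1.5, "geneD": 1.9}

common = shared_changes(h4_changes, mh1c1_changes)
print(common)
```

Here geneB and geneD are dropped because each appears in only one cell line, and geneC is dropped because its direction differs between the two.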
Gust, Kurt A; Nanduri, Bindu; Rawat, Arun; Wilbanks, Mitchell S; Ang, Choo Yaw; Johnson, David R; Pendarvis, Ken; Chen, Xianfeng; Quinn, Michael J; Johnson, Mark S; Burgess, Shane C; Perkins, Edward J
2015-08-07
A systems toxicology investigation comparing and integrating transcriptomic and proteomic results was conducted to develop holistic effects characterizations for the wildlife bird model, Northern bobwhite (Colinus virginianus) dosed with the explosives degradation product 2-amino-4,6-dinitrotoluene (2A-DNT). A subchronic 60 d toxicology bioassay was leveraged where both sexes were dosed via daily gavage with 0, 3, 14, or 30 mg/kg-d 2A-DNT. Effects on global transcript expression were investigated in liver and kidney tissue using custom microarrays for C. virginianus in both sexes at all doses, while effects on proteome expression were investigated in liver for both sexes and kidney in males, at 30 mg/kg-d. As expected, transcript expression was not directly indicative of protein expression in response to 2A-DNT. However, a high degree of correspondence was observed among gene and protein expression when investigating higher-order functional responses including statistically enriched gene networks and canonical pathways, especially when connected to toxicological outcomes of 2A-DNT exposure. Analysis of networks statistically enriched for both transcripts and proteins demonstrated common responses including inhibition of programmed cell death and arrest of cell cycle in liver tissues at 2A-DNT doses that caused liver necrosis and death in females. Additionally, both transcript and protein expression in liver tissue was indicative of induced phase I and II xenobiotic metabolism potentially as a mechanism to detoxify and excrete 2A-DNT. Nuclear signaling assays, transcript expression and protein expression each implicated peroxisome proliferator-activated receptor (PPAR) nuclear signaling as a primary molecular target in the 2A-DNT exposure with significant downstream enrichment of PPAR-regulated pathways including lipid metabolic pathways and gluconeogenesis suggesting impaired bioenergetic potential. Although the differential expression of transcripts and proteins was largely unique, the consensus of functional pathways and gene networks enriched among transcriptomic and proteomic datasets provided the identification of many critical metabolic functions underlying 2A-DNT toxicity as well as impaired PPAR signaling, a key molecular initiating event known to be affected in di- and trinitrotoluene exposures.
NCI's Proteome Characterization Centers Announced | Office of Cancer Clinical Proteomics Research
The National Cancer Institute (NCI), part of the National Institutes of Health, announces the launch of a Clinical Proteomic Tumor Analysis Consortium (CPTAC). CPTAC is a comprehensive, coordinated team effort to accelerate the understanding of the molecular basis of cancer through the application of robust, quantitative, proteomic technologies and workflows.
Proteomics in medical microbiology.
Cash, P
2000-04-01
The techniques of proteomics (high resolution two-dimensional electrophoresis and protein characterisation) are widely used for microbiological research to analyse global protein synthesis as an indicator of gene expression. The rapid progress in microbial proteomics has been achieved through the wide availability of whole genome sequences for a number of bacterial groups. Beyond providing a basic understanding of microbial gene expression, proteomics has also played a role in medical areas of microbiology. Progress has been made in the use of the techniques for investigating the epidemiology and taxonomy of human microbial pathogens, the identification of novel pathogenic mechanisms and the analysis of drug resistance. In each of these areas, proteomics has provided new insights that complement genomic-based investigations. This review describes the current progress in these research fields and highlights some of the technical challenges existing for the application of proteomics in medical microbiology. The latter concern the analysis of genetically heterogeneous bacterial populations and the integration of the proteomic and genomic data for these bacteria. The characterisation of the proteomes of bacterial pathogens growing in their natural hosts remains a future challenge.
Proteomic analysis of ligamentum flavum from patients with lumbar spinal stenosis.
Kamita, Masahiro; Mori, Taiki; Sakai, Yoshihito; Ito, Sadayuki; Gomi, Masahiro; Miyamoto, Yuko; Harada, Atsushi; Niida, Shumpei; Yamada, Tesshi; Watanabe, Ken; Ono, Masaya
2015-05-01
Lumbar spinal stenosis (LSS) is a syndromic degenerative spinal disease and is characterized by spinal canal narrowing with subsequent neural compression causing gait disturbances. Although LSS is a major age-related musculoskeletal disease that causes large decreases in the daily living activities of the elderly, its molecular pathology has not been investigated using proteomics. Thus, we used several proteomic technologies to analyze the ligamentum flavum (LF) of individuals with LSS. Using comprehensive proteomics with strong cation exchange fractionation, we detected 1288 proteins in these LF samples. A GO analysis of the comprehensive proteome revealed that more than 30% of the identified proteins were extracellular. Next, we used 2D image converted analysis of LC/MS to compare LF obtained from individuals with LSS to that obtained from individuals with disc herniation (nondegenerative control). We detected 64 781 MS peaks and identified 1675 differentially expressed peptides derived from 286 proteins. We verified four differentially expressed proteins (fibronectin, serine protease HTRA1, tenascin, and asporin) by quantitative proteomics using SRM/MRM. The present proteomic study is the first to identify proteins from degenerated and hypertrophied LF in LSS, which will help in studying LSS. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Silva, Wanderson M; Carvalho, Rodrigo D; Soares, Siomar C; Bastos, Isabela Fs; Folador, Edson L; Souza, Gustavo Hmf; Le Loir, Yves; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco
2014-12-04
Corynebacterium pseudotuberculosis biovar ovis is a facultative intracellular pathogen, and the etiological agent of caseous lymphadenitis in small ruminants. During the infection process, the bacterium is subjected to several stress conditions, including nitrosative stress, which is caused by nitric oxide (NO). In silico analysis of the genome of C. pseudotuberculosis ovis 1002 predicted several genes that could influence the resistance of this pathogen to nitrosative stress. Here, we applied high-throughput proteomics using high definition mass spectrometry to characterize the functional genome of C. pseudotuberculosis ovis 1002 in the presence of NO-donor Diethylenetriamine/nitric oxide adduct (DETA/NO), with the aim of identifying proteins involved in nitrosative stress resistance. We characterized 835 proteins, representing approximately 41% of the predicted proteome of C. pseudotuberculosis ovis 1002, following exposure to nitrosative stress. In total, 102 proteins were exclusive to the proteome of DETA/NO-induced cells, and a further 58 proteins were differentially regulated between the DETA/NO and control conditions. An interactomic analysis of the differential proteome of C. pseudotuberculosis in response to nitrosative stress was also performed. Our proteomic data set suggested the activation of both a general stress response and a specific nitrosative stress response, as well as changes in proteins involved in cellular metabolism, detoxification, transcriptional regulation, and DNA synthesis and repair. Our proteomic analysis validated previously-determined in silico data for C. pseudotuberculosis ovis 1002. In addition, proteomic screening performed in the presence of NO enabled the identification of a set of factors that can influence the resistance and survival of C. pseudotuberculosis during exposure to nitrosative stress.
A new funding opportunity in support of the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) seeks to prospectively procure tumor samples, collected for proteomics investigation.
Proteomics: a new approach to the study of disease.
Chambers, G; Lawrie, L; Cash, P; Murray, G I
2000-11-01
The global analysis of cellular proteins has recently been termed proteomics and is a key area of research that is developing in the post-genome era. Proteomics uses a combination of sophisticated techniques including two-dimensional (2D) gel electrophoresis, image analysis, mass spectrometry, amino acid sequencing, and bio-informatics to resolve comprehensively, to quantify, and to characterize proteins. The application of proteomics provides major opportunities to elucidate disease mechanisms and to identify new diagnostic markers and therapeutic targets. This review aims to explain briefly the background to proteomics and then to outline proteomic techniques. Applications to the study of human disease conditions ranging from cancer to infectious diseases are reviewed. Finally, possible future advances are briefly considered, especially those which may lead to faster sample throughput and increased sensitivity for the detection of individual proteins. Copyright 2000 John Wiley & Sons, Ltd.
Picotti, Paola; Clement-Ziza, Mathieu; Lam, Henry; Campbell, David S.; Schmidt, Alexander; Deutsch, Eric W.; Röst, Hannes; Sun, Zhi; Rinner, Oliver; Reiter, Lukas; Shen, Qin; Michaelson, Jacob J.; Frei, Andreas; Alberti, Simon; Kusebauch, Ulrike; Wollscheid, Bernd; Moritz, Robert; Beyer, Andreas; Aebersold, Ruedi
2013-01-01
Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome have so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map, one supporting discovery-driven (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways. PMID:23334424
Birth of plant proteomics in India: a new horizon.
Narula, Kanika; Pandey, Aarti; Gayali, Saurabh; Chakraborty, Niranjan; Chakraborty, Subhra
2015-09-08
In the post-genomic era, proteomics is acknowledged as the next frontier for biological research. Although India has a long and distinguished tradition in protein research, the initiation of proteomics studies was a new horizon. Protein research witnessed enormous progress in protein separation, high-resolution refinements, biochemical identification of the proteins, protein-protein interaction, and structure-function analysis. Plant proteomics research in India began its journey with investigations of proteome profiling, complexity analysis, protein trafficking, and biochemical modeling. The research article by Bhushan et al. in 2006 marked the birth of plant proteomics research in India. Since then, plant proteomics studies have expanded progressively and are now being carried out in various institutions spread across the country. The compilation presented here seeks to trace the history of development in the area during the past decade based on publications to date. In this review, we emphasize outcomes of the field, providing prospects on proteomic pathway analyses. Finally, we discuss the connotation of strategies and the potential that would provide the framework of plant proteome research. The past decades have seen a rapidly growing number of sequenced plant genomes and associated genomic resources. To keep pace with this increasing body of data, India is in the provisional phase of proteomics research to develop a comparative hub for plant proteomes and protein families, but it requires a strong impetus from intellectuals, entrepreneurs, and government agencies. Here, we aim to provide an overview of the past, present and future of Indian plant proteomics, which would serve as an evaluation platform for those seeking to incorporate proteomics into their research programs. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.
Porwollik, Steffen; Mottaz-Brewer, Heather; Petritis, Brianne O.; Jaitly, Navdeep; Adkins, Joshua N.; McClelland, Michael; Heffron, Fred; Smith, Richard D.
2009-01-01
Using sample-matched transcriptomics and proteomics measurements it is now possible to begin to understand the impact of post-transcriptional regulatory programs in Enterobacteria. In bacteria, post-transcriptional regulation is mediated by relatively few identified RNA-binding protein factors including CsrA, Hfq and SmpB. A Salmonella strain carrying a mutation in any one of these three genes (csrA, hfq, or smpB) is attenuated for mouse virulence and unable to survive in macrophages. CsrA has a clearly defined specificity based on binding to a specific mRNA sequence to inhibit translation. However, the proteins regulated by Hfq and SmpB are not as clearly defined. Previous work identified proteins regulated by hfq using purification of the RNA-protein complex with direct sequencing of the bound RNAs and found binding to a surprisingly large number of transcripts. In this report we have used global proteomics to directly identify proteins regulated by Hfq or SmpB by comparing protein abundance in the parent and isogenic hfq or smpB mutant. From these same samples we also prepared RNA for microarray analysis to determine if alteration of protein expression was mediated post-transcriptionally. Samples were analyzed from bacteria grown under four different conditions: two laboratory conditions and two that are thought to mimic the intracellular environment. We show that mutants of hfq and smpB directly or indirectly modulate at least 20% and 4% of all possible Salmonella proteins, respectively, with limited correlation between transcription and protein expression. These proteins represent a broad spectrum of Salmonella proteins required for many biological processes including host cell invasion, motility, central metabolism, LPS biosynthesis, two-component regulatory systems, and fatty acid metabolism. Our results represent one of the first global analyses of post-transcriptional regulons in any organism and suggest that regulation at the translational level is widespread and plays an important role in virulence regulation and environmental adaptation for Salmonella. PMID:19277208
Mansfeldt, Cresten B.; Rowe, Annette R.; Heavner, Gretchen L. W.; Zinder, Stephen H.
2014-01-01
A cDNA-microarray was designed and used to monitor the transcriptomic profile of Dehalococcoides mccartyi strain 195 (in a mixed community) respiring various chlorinated organics, including chloroethenes and 2,3-dichlorophenol. The cultures were continuously fed in order to establish steady-state respiration rates and substrate levels. The organization of array data into a clustered heat map revealed two major experimental partitions. This partitioning in the data set was further explored through principal component analysis. The first two principal components separated the experiments into those with slow (1.6 ± 0.6 μM Cl−/h)- and fast (22.9 ± 9.6 μM Cl−/h)-respiring cultures. Additionally, the transcripts with the highest loadings in these principal components were identified, suggesting that those transcripts were responsible for the partitioning of the experiments. By analyzing the transcriptomes (n = 53) across experiments, relationships among transcripts were identified, and hypotheses about the relationships between electron transport chain members were proposed. One hypothesis, that the hydrogenases Hup and Hym and the formate dehydrogenase-like oxidoreductase (DET0186-DET0187) form a complex (as displayed by their tight clustering in the heat map analysis), was explored using a nondenaturing protein separation technique combined with proteomic sequencing. Although these proteins did not migrate as a single complex, DET0112 (an FdhB-like protein encoded in the Hup operon) was found to comigrate with DET0187 rather than with the catalytic Hup subunit DET0110. On closer inspection of the genome annotations of all Dehalococcoides strains, the DET0185-to-DET0187 operon was found to lack a key subunit, an FdhB-like protein. Therefore, on the basis of the transcriptomic, genomic, and proteomic evidence, the place of the missing subunit in the DET0185-to-DET0187 operon is likely filled by recruiting a subunit expressed from the Hup operon (DET0112). PMID:25063656
Bruno, D L; Ganesamoorthy, D; Schoumans, J; Bankier, A; Coman, D; Delatycki, M; Gardner, R J M; Hunter, M; James, P A; Kannu, P; McGillivray, G; Pachter, N; Peters, H; Rieubland, C; Savarirayan, R; Scheffer, I E; Sheffield, L; Tan, T; White, S M; Yeung, A; Bowman, Z; Ngo, C; Choy, K W; Cacheux, V; Wong, L; Amor, D J; Slater, H R
2009-02-01
Microarray genome analysis is realising its promise for improving detection of genetic abnormalities in individuals with mental retardation and congenital abnormality. Copy number variations (CNVs) are now readily detectable using a variety of platforms and a major challenge is the distinction of pathogenic from ubiquitous, benign polymorphic CNVs. The aim of this study was to investigate replacement of time consuming, locus specific testing for specific microdeletion and microduplication syndromes with microarray analysis, which theoretically should detect all known syndromes with CNV aetiologies as well as new ones. Genome wide copy number analysis was performed on 117 patients using Affymetrix 250K microarrays. 434 CNVs (195 losses and 239 gains) were found, including 18 pathogenic CNVs and 9 identified as "potentially pathogenic". Almost all pathogenic CNVs were larger than 500 kb, significantly larger than the median size of all CNVs detected. Segmental regions of loss of heterozygosity larger than 5 Mb were found in 5 patients. Genome microarray analysis has improved diagnostic success in this group of patients. Several examples of recently discovered "new syndromes" were found suggesting they are more common than previously suspected and collectively are likely to be a major cause of mental retardation. The findings have several implications for clinical practice. The study revealed the potential to make genetic diagnoses that were not evident in the clinical presentation, with implications for pretest counselling and the consent process. The importance of contributing novel CNVs to high quality databases for genotype-phenotype analysis and review of guidelines for selection of individuals for microarray analysis is emphasised.
NCI Launches Proteomics Assay Portal | Office of Cancer Clinical Proteomics Research
In a paper recently published by the journal Nature Methods, investigators from the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (NCI-CPTAC) announced the launch of a proteomics Assay Portal for multiple reaction monitoring-mass spectrometry (MRM-MS) assays. This community web-based repository for well-characterized quantitative proteomic assays currently consists of 456 unique peptide assays to 282 unique proteins and ser
On September 4, 2013, NCI’s Clinical Proteomics Tumor Analysis Consortium (CPTAC) publicly released proteomic data produced from colorectal tumor samples previously analyzed by The Cancer Genome Atlas (TCGA). This is the initial release of proteomic tumor data designed to complement genomic data on the same tumors. The data is publicly available at the CPTAC data portal.
USDA-ARS's Scientific Manuscript database
2-DE analysis of complex plant proteomes has limited dynamic resolution because only abundant proteins can be detected. Proteomic assessment of the low abundance proteins within leaf tissue is difficult when it is comprised of 30 – 50% of the CO2 fixation enzyme Rubisco. Resolution can be improved t...
Heterogeneity in Neutrophil Microparticles Reveals Distinct Proteome and Functional Properties
Dalli, Jesmond; Montero-Melendez, Trinidad; Norling, Lucy V; Yin, Xiaoke; Hinds, Charles; Haskard, Dorian; Mayr, Manuel; Perretti, Mauro
2013-01-01
Altered plasma neutrophil microparticle levels have recently been implicated in a number of vascular and inflammatory diseases, yet our understanding of their actions is very limited. Herein, we investigate the proteome of neutrophil microparticles in order to shed light on their biological actions. Stimulation of human neutrophils, either in suspension or adherent to an endothelial monolayer, led to the production of microparticles containing >400 distinct proteins with only 223 being shared by the two subsets. For instance, postadherent microparticles were enriched in alpha-2 macroglobulin and ceruloplasmin, whereas microparticles produced by neutrophils in suspension were abundant in heat shock 70 kDa protein 1. Annexin A1 and lactotransferrin were expressed in both microparticle subsets. We next determined relative abundance of these proteins in three types of human microparticle samples: healthy volunteer plasma, plasma of septic patients and skin blister exudates finding that these proteins were differentially expressed on neutrophil microparticles from these samples reflecting in part the expression profiles we found in vitro. Functional assessment of the neutrophil microparticles subsets demonstrated that in response to direct stimulation neutrophil microparticles produced reactive oxygen species and leukotriene B4 as well as locomoted toward a chemotactic gradient. Finally, we investigated the actions of the two neutrophil microparticles subsets described herein on target cell responses. Microarray analysis with human primary endothelial cells incubated with either microparticle subset revealed a discrete modulation of endothelial cell gene expression profile. These findings demonstrate that neutrophil microparticles are heterogenous and can deliver packaged information propagating the activation status of the parent cell, potentially exerting novel and fundamental roles both under homeostatic and disease conditions. PMID:23660474
Carrieri, Damian; Lombardi, Thomas; Paddock, Troy; ...
2016-11-17
Molecular mechanisms that regulate carbon flux are poorly understood in algae. The ΔglgC mutant of the cyanobacterium Synechocystis sp. PCC 6803 is incapable of glycogen storage and displays an array of physiological responses under nitrogen starvation that are different from wild-type (WT). These include non-bleaching phenotype and the redirection of photosynthetically fixed carbon towards excreted organic acids (overflow metabolism) without biomass growth. To understand the role of gene/protein expression in these responses, we followed the time course of transcripts by genome-scale microarrays and proteins by shotgun proteomics in ΔglgC and WT cells upon nitrogen starvation. Compared to WT, the degradation of phycobilisome rod proteins was delayed and attenuated in the mutant, and the core proteins were less degraded; both contributed to the non-bleaching appearance despite the induction of nblA genes, suggesting the presence of a break in regulation of the phycobilisome degradation pathway downstream of nblA induction. The mutant displayed NtcA-mediated transcriptional response to nitrogen starvation, indicating that it is able to sense nitrogen status. Furthermore, some responses to nitrogen starvation appear to be stronger in the mutant, as shown by the increases in transcripts for the transcriptional regulator, rre37, which regulates central carbon metabolism. Accordingly, multiple proteins involved in photosynthesis, central carbon metabolism, and carbon storage and utilization showed lower abundance in the mutant. Furthermore, these results indicate that the transition in the central carbon metabolism from growth to overflow metabolism in ΔglgC does not require increases in expression of the overflow pathway enzymes; the transition and non-bleaching phenotype are likely regulated instead at the metabolite level.
Wang, Ruo-Chiau; Huang, Chien-Yu; Pan, Tai-Long; Chen, Wei-Yu; Ho, Chun-Te; Liu, Tsan-Zon; Chang, Yu-Jia
2015-01-01
To search for reliable biomarkers and drug targets for management of hepatocellular carcinoma (HCC), we performed a global proteomic analysis of a pair of HCC cell lines with distinct differentiation statuses using 2-DE coupled with MALDI-TOF MS. In total, 106 and 55 proteins were successfully identified from the total cell lysate and the cytosolic, nuclear and membrane fractions in well-differentiated (HepG2) and poorly differentiated (SK-Hep-1) HCC clonal variants, respectively. Among these proteins, nine spots corresponding to proteins differentially expressed between HCC cell types were selected and confirmed by immunofluorescence staining and western blotting. Notably, Annexin 1 (ANX1), ANX-2, vimentin and stress-associated proteins, such as GRP78, HSP75, HSC-70, protein disulfide isomerase (PDI), and heat shock protein-27 (HSP27), were exclusively up-regulated in SK-Hep-1 cells. Elevated levels of ANX-4 and antioxidant/metabolic enzymes, such as MnSOD, peroxiredoxin, NADP-dependent isocitrate dehydrogenase, α-enolase and UDP-glucose dehydrogenase, were observed in HepG2 cells. We functionally demonstrated that ANX1 and HSP27 were abundantly overexpressed only in highly invasive types of HCC cells, such as Mahlavu and SK-Hep-1. Knockdown of ANX1 or HSP27 in HCC cells resulted in a severe reduction in cell migration. The in vitro observations of ANX1 and HSP27 expression were confirmed in HCC samples by immunohistochemical staining of HCC tissue microarrays. Poorly differentiated HCC tended to have stronger ANX1 and HSP27 expression than well-differentiated or moderately differentiated HCC. Collectively, our findings suggest that ANX1 and HSP27 are two novel biomarkers for predicting invasive HCC phenotypes and could serve as potential treatment targets.
Wang, Xiaoliang; Shojaie, Ali; Zhang, Yuzheng; Shelley, David; Lampe, Paul D; Levy, Lisa; Peters, Ulrike; Potter, John D; White, Emily; Lampe, Johanna W
2017-01-01
Long-term use of aspirin is associated with lower risk of colorectal cancer and other cancers; however, the mechanism of chemopreventive effect of aspirin is not fully understood. Animal studies suggest that COX-2, NFκB signaling and Wnt/β-catenin pathways may play a role, but no clinical trials have systematically evaluated the biological response to aspirin in healthy humans. Using a high-density antibody array, we assessed the difference in plasma protein levels after 60 days of regular dose aspirin (325 mg/day) compared to placebo in a randomized double-blinded crossover trial of 44 healthy non-smoking men and women, aged 21-45 years. The plasma proteome was analyzed on an antibody microarray with ~3,300 full-length antibodies, printed in triplicate. Moderated paired t-tests were performed on individual antibodies, and gene-set analyses were performed based on KEGG and GO pathways. Among the 3,000 antibodies analyzed, statistically significant differences in plasma protein levels were observed for nine antibodies after adjusting for false discoveries (FDR adjusted p-value<0.1). The most significant protein was succinate dehydrogenase subunit C (SDHC), a key enzyme complex of the mitochondrial tricarboxylic acid (TCA) cycle. The other statistically significant proteins (NR2F1, MSI1, MYH1, FOXO1, KHDRBS3, NFKBIE, LYZ and IKZF1) are involved in multiple pathways, including DNA base-pair repair, inflammation and oncogenic pathways. None of the 258 KEGG and 1,139 GO pathways was found to be statistically significant after FDR adjustment. This study suggests several chemopreventive mechanisms of aspirin in humans, which have previously been reported to play a role in anti- or pro-carcinogenesis in cell systems; however, larger, confirmatory studies are needed.
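The false-discovery adjustment mentioned in this abstract can be illustrated with a small sketch. The abstract does not say which FDR procedure was used; the code below assumes the Benjamini-Hochberg step-up method, and the p-values are made-up illustration values, not data from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical raw p-values for five antibodies (illustration only).
example_p = [0.0001, 0.004, 0.03, 0.2, 0.6]
adjusted_p = benjamini_hochberg(example_p)

# Keep the antibodies whose adjusted p-value is below the 0.1 cut-off
# quoted in the abstract.
significant = [i for i, q in enumerate(adjusted_p) if q < 0.1]
print(adjusted_p, significant)
```

With thousands of antibodies tested, the adjustment is what keeps the expected share of false positives among the reported hits near the chosen threshold.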
Genomes2Drugs: Identifies Target Proteins and Lead Drugs from Proteome Data
Toomey, David; Hoppe, Heinrich C.; Brennan, Marian P.; Nolan, Kevin B.; Chubb, Anthony J.
2009-01-01
Background: Genome sequencing and bioinformatics have provided the full hypothetical proteome of many pathogenic organisms. Advances in microarray and mass spectrometry have also yielded large output datasets of possible target proteins/genes. However, the challenge remains to identify new targets for drug discovery from this wealth of information. Further analysis includes bioinformatics and/or molecular biology tools to validate the findings. This is time consuming and expensive, and could fail to yield novel drugs if protein purification and crystallography is impossible. To pre-empt this, a researcher may want to rapidly filter the output datasets for proteins that show good homology to proteins that have already been structurally characterised or proteins that are already targets for known drugs. Critically, those researchers developing novel antibiotics need to select out the proteins that show close homology to any human proteins, as future inhibitors are likely to cross-react with the host protein, causing off-target toxicity effects later in clinical trials. Methodology/Principal Findings: To solve many of these issues, we have developed a free online resource called Genomes2Drugs, which ranks sequences to identify proteins that are (i) homologous to previously crystallized proteins or (ii) targets of known drugs, but are (iii) not homologous to human proteins. When tested using the Plasmodium falciparum malarial genome the program correctly enriched the ranked list of proteins with known drug target proteins. Conclusions/Significance: Genomes2Drugs rapidly identifies proteins that are likely to succeed in drug discovery pipelines. This free online resource helps in the identification of potential drug targets. Importantly, the program further highlights proteins that are likely to be inhibited by FDA-approved drugs. These drugs can then be rapidly moved into Phase IV clinical studies under ‘change-of-application’ patents. PMID:19593435
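The three ranking criteria described in this abstract can be sketched as a simple filter-and-sort. This is not the Genomes2Drugs implementation: the `Candidate` fields, the percent-identity scores, the `human_cutoff` value, and the scoring rule are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    structure_identity: float    # best % identity to a crystallized protein (assumed input)
    drug_target_identity: float  # best % identity to a known drug target (assumed input)
    human_identity: float        # best % identity to any human protein (assumed input)


def rank_candidates(candidates, human_cutoff=40.0):
    """Drop human-like proteins, then rank the rest by their best supporting hit."""
    kept = [c for c in candidates if c.human_identity < human_cutoff]
    # Criterion (i) OR (ii): a strong match to either database raises the rank.
    kept.sort(
        key=lambda c: max(c.structure_identity, c.drug_target_identity),
        reverse=True,
    )
    return [c.name for c in kept]


# Made-up proteins and scores, for illustration only.
proteome = [
    Candidate("protein_A", 80.0, 10.0, 5.0),
    Candidate("protein_B", 30.0, 90.0, 70.0),  # too similar to a human protein
    Candidate("protein_C", 20.0, 60.0, 15.0),
    Candidate("protein_D", 0.0, 0.0, 0.0),
]
ranking = rank_candidates(proteome)
print(ranking)
```

Treating criterion (iii) as a hard filter rather than a penalty is a design choice of this sketch; a real tool could equally down-weight human-like proteins instead of removing them.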
de Bernonville, Thomas Dugé; Albenne, Cécile; Arlat, Matthieu; Hoffmann, Laurent; Lauber, Emmanuelle; Jamet, Elisabeth
2014-01-01
Proteomic analysis of xylem sap has recently become a major field of interest to understand several biological questions related to plant development and responses to environmental cues. The xylem sap appears as a dynamic fluid undergoing changes in its proteome upon abiotic and biotic stresses. Unlike cell compartments which are amenable to purification in sufficient amount prior to proteomic analysis, the xylem sap has to be collected in particular conditions to avoid contamination by intracellular proteins and to obtain enough material. A model plant like Arabidopsis thaliana is not suitable for such an analysis because efficient harvesting of xylem sap is difficult. The analysis of the xylem sap proteome also requires specific procedures to concentrate proteins and to focus on proteins predicted to be secreted. Indeed, xylem sap proteins appear to be synthesized and secreted in the root stele or to originate from dying differentiated xylem cells. This chapter describes protocols to collect xylem sap from Brassica species and to prepare total and N-glycoprotein extracts for identification of proteins by mass spectrometry analyses and bioinformatics.
Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild
2009-07-01
Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.
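A platform comparison like the one in this abstract is commonly made on log2 fold changes. The sketch below assumes that approach and uses Pearson correlation as the agreement measure; the abstract does not state which statistic the authors used, and the expression ratios are invented for illustration.

```python
import math


def log2_fold_change(treated, control):
    """Log2 ratio of expression under stress versus the control condition."""
    return math.log2(treated / control)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical stressed/control expression ratios for four genes,
# as they might be reported by two different platforms.
platform_a_ratios = [4.0, 2.0, 0.5, 1.0]
platform_b_ratios = [8.0, 3.0, 0.4, 1.1]

platform_a_log2 = [log2_fold_change(r, 1.0) for r in platform_a_ratios]
platform_b_log2 = [log2_fold_change(r, 1.0) for r in platform_b_ratios]

r = pearson(platform_a_log2, platform_b_log2)
print(round(r, 3))
```

In this made-up data the two platforms correlate strongly even though the second reports larger fold changes, the same kind of pattern the abstract describes between the array-based methods and qRT-PCR.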
Microarray analysis of potential genes in the pathogenesis of recurrent oral ulcer.
Han, Jingying; He, Zhiwei; Li, Kun; Hou, Lu
2015-01-01
Recurrent oral ulcer seriously threatens patients' daily life and health. This study investigated potential genes and pathways that participate in the pathogenesis of recurrent oral ulcer by high-throughput bioinformatic analysis. RT-PCR and Western blot were applied to further verify the effect of the screened interleukins. Recurrent oral ulcer related genes were collected from websites and papers, and further identified from Human Genome 280 6.0 microarray data. The pathways of recurrent oral ulcer related genes were obtained through chip hybridization. RT-PCR was applied to test four recurrent oral ulcer related genes to verify the microarray data. Data transformation, scatter plots, clustering analysis, and expression pattern analysis were used to analyze changes in the expression of recurrent oral ulcer related genes. A recurrent oral ulcer gene microarray was successfully established. The microarray showed that 551 genes were involved in recurrent oral ulcer activity and 196 genes were recurrent oral ulcer related genes. Of these, 76 genes were up-regulated, 62 genes were down-regulated, and 58 genes were up-/down-regulated. In total, expression was up-regulated 752 times (60%) and down-regulated 485 times (40%). IL-2 plays an important role in the occurrence, development and recurrence of recurrent oral ulcer at the mRNA and protein levels. Gene microarrays can be used to analyze potential genes and pathways in recurrent oral ulcer. IL-2 may be involved in the pathogenesis of recurrent oral ulcer.
2010-01-01
Background: Analysis of gene expression and gene mutation may add information beyond that of ordinary pathological tissue diagnosis. Since samples obtained endoscopically are very small, more sensitive technology for gene analysis is desirable. We investigated whether gene expression and gene mutation analysis by a newly developed ultra-sensitive three-dimensional (3D) microarray is possible using small samples from endoscopic ultrasound-guided fine-needle aspiration (EUS-FNA) specimens and pancreatic juices. Methods: Small samples from 17 EUS-FNA specimens and 16 pancreatic juices were obtained. After nucleic acid extraction, the samples were amplified with labeling and analyzed by the 3D microarray. Results: The analyzable rate with the microarray was 46% (6/13) in EUS-FNA specimens stored in RNAlater®, and RNA degradation was observed in all the samples stored frozen. In pancreatic juices, the analyzable rate was 67% (4/6) in samples stored frozen and 20% (2/10) in samples stored in RNAlater®. EUS-FNA specimens were classified into cancer and non-cancer by gene expression analysis, and K-ras codon 12 mutations were also detected using the 3D microarray. Conclusions: Gene analysis from small samples obtained endoscopically was possible with the newly developed 3D microarray technology. High quality RNA from EUS-FNA samples was obtained and remained in good condition only when an RNA stabilizer was used. In contrast, high quality RNA from pancreatic juice samples was obtained only in frozen storage without RNA stabilizer. PMID:20416107
MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data
2014-01-01
Background: Mandatory deposit of raw microarray data files for public access, prior to study publication, provides significant opportunities to conduct new bioinformatics analyses within and across multiple datasets. Analysis of raw microarray data files (e.g. Affymetrix CEL files) can be time consuming and complex, and requires fundamental computational and bioinformatics skills. The development of analytical workflows to automate these tasks simplifies the processing of, improves the efficiency of, and serves to standardize multiple and sequential analyses. Once installed, workflows facilitate the tedious steps required to run rapid intra- and inter-dataset comparisons. Results: We developed a workflow to facilitate and standardize Meta-Analysis of Affymetrix Microarray Data analysis (MAAMD) in Kepler. Two freely available stand-alone software tools, R and AltAnalyze, were embedded in MAAMD. The inputs of MAAMD are user-editable csv files, which contain sample information and parameters describing the locations of input files and required tools. MAAMD was tested by analyzing 4 different GEO datasets from mice and drosophila. MAAMD automates data downloading, data organization, data quality control assessment, differential gene expression analysis, clustering analysis, pathway visualization, gene-set enrichment analysis, and cross-species orthologous-gene comparisons. MAAMD was utilized to identify gene orthologues responding to hypoxia or hyperoxia in both mice and drosophila. The entire set of analyses for 4 datasets (34 total microarrays) finished in about one hour. Conclusions: MAAMD saves time, minimizes the required computer skills, and offers a standardized procedure for users to analyze microarray datasets and make new intra- and inter-dataset comparisons. PMID:24621103
Design and analysis issues in quantitative proteomics studies.
Karp, Natasha A; Lilley, Kathryn S
2007-09-01
Quantitative proteomics is the comparison of distinct proteomes which enables the identification of protein species which exhibit changes in expression or post-translational state in response to a given stimulus. Many different quantitative techniques are being utilized and generate large datasets. Independent of the technique used, these large datasets need robust data analysis to ensure valid conclusions are drawn from such studies. Approaches to address the problems that arise with large datasets are discussed to give insight into the types of statistical analyses of data appropriate for the various experimental strategies that can be employed by quantitative proteomic studies. This review also highlights the importance of employing a robust experimental design and highlights various issues surrounding the design of experiments. The concepts and examples discussed within will show how robust design and analysis will lead to confident results that will ensure quantitative proteomics delivers.
Establishing Substantial Equivalence: Proteomics
NASA Astrophysics Data System (ADS)
Lovegrove, Alison; Salt, Louise; Shewry, Peter R.
Wheat is a major crop in world agriculture and is consumed after processing into a range of food products. It is therefore of great importance to determine the consequences (intended and unintended) of transgenesis in wheat and whether genetically modified lines are substantially equivalent to those produced by conventional plant breeding. Proteomic analysis is one of several approaches which can be used to address these questions. Two-dimensional PAGE (2D PAGE) remains the most widely available method for proteomic analysis, but is notoriously difficult to reproduce between laboratories. We therefore describe methods which have been developed as standard operating procedures in our laboratory to ensure the reproducibility of proteomic analyses of wheat using 2D PAGE analysis of grain proteins.
USDA-ARS's Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
Diagnostic Pathology and Laboratory Medicine in the Age of “Omics”
Finn, William G.
2007-01-01
Functional genomics and proteomics involve the simultaneous analysis of hundreds or thousands of expressed genes or proteins and have spawned the modern discipline of computational biology. Novel informatic applications, including sophisticated dimensionality reduction strategies and cancer outlier profile analysis, can distill clinically exploitable biomarkers from enormous experimental datasets. Diagnostic pathologists are now charged with translating the knowledge generated by the “omics” revolution into clinical practice. Food and Drug Administration-approved proprietary testing platforms based on microarray technologies already exist and will expand greatly in the coming years. However, for diagnostic pathology, the greatest promise of the “omics” age resides in the explosion in information technology (IT). IT applications allow for the digitization of histological slides, transforming them into minable data and enabling content-based searching and archiving of histological materials. IT will also allow for the optimization of existing (and often underused) clinical laboratory technologies such as flow cytometry and high-throughput core laboratory functions. The state of pathology practice does not always keep up with the pace of technological advancement. However, to use fully the potential of these emerging technologies for the benefit of patients, pathologists and clinical scientists must embrace the changes and transformational advances that will characterize this new era. PMID:17652635
Takahara, Hiroyuki; Dolf, Andreas; Endl, Elmar; O'Connell, Richard
2009-08-01
Generation of stage-specific cDNA libraries is a powerful approach to identify pathogen genes that are differentially expressed during plant infection. Biotrophic pathogens develop specialized infection structures inside living plant cells, but sampling the transcriptome of these structures is problematic due to the low ratio of fungal to plant RNA, and the lack of efficient methods to isolate them from infected plants. Here we established a method, based on fluorescence-activated cell sorting (FACS), to purify the intracellular biotrophic hyphae of Colletotrichum higginsianum from homogenates of infected Arabidopsis leaves. Specific selection of viable hyphae using a fluorescent vital marker provided intact RNA for cDNA library construction. Pilot-scale sequencing showed that the library was enriched with plant-induced and pathogenicity-related fungal genes, including some encoding small, soluble secreted proteins that represent candidate fungal effectors. The high purity of the hyphae (94%) prevented contamination of the library by sequences derived from host cells or other fungal cell types. RT-PCR confirmed that genes identified in the FACS-purified hyphae were also expressed in planta. The method has wide applicability for isolating the infection structures of other plant pathogens, and will facilitate cell-specific transcriptome analysis via deep sequencing and microarray hybridization, as well as proteomic analyses.
Tan, Thomas C J; Knight, John; Sbarrato, Thomas; Dudek, Kate; Willis, Anne E; Zamoyska, Rose
2017-07-25
Global transcriptomic and proteomic analyses of T cells have been rich sources of unbiased data for understanding T-cell activation. Lack of full concordance of these datasets has illustrated that important facets of T-cell activation are controlled at the level of translation. We undertook translatome analysis of CD8 T-cell activation, combining polysome profiling and microarray analysis. We revealed that altering T-cell receptor stimulation influenced recruitment of mRNAs to heavy polysomes and translation of subsets of genes. A major pathway that was compromised, when TCR signaling was suboptimal, was linked to ribosome biogenesis, a rate-limiting factor in both cell growth and proliferation. Defective TCR signaling affected transcription and processing of ribosomal RNA precursors, as well as the translation of specific ribosomal proteins and translation factors. Mechanistically, IL-2 production was compromised in weakly stimulated T cells, affecting the abundance of Myc protein, a known regulator of ribosome biogenesis. Consequently, weakly activated T cells showed impaired production of ribosomes and a failure to maintain proliferative capacity after stimulation. We demonstrate that primary T cells respond to various environmental cues by regulating ribosome biogenesis and mRNA translation at multiple levels to sustain proliferation and differentiation.
Demir, E; Babur, O; Dogrusoz, U; Gursoy, A; Nisanci, G; Cetin-Atalay, R; Ozturk, M
2002-07-01
The availability of entire genome sequences shifts scientific curiosity towards large-scale identification of genome function, as in genome studies. In the near future, data produced about cellular processes at the molecular level will accumulate at an accelerating rate as a result of proteomics studies. In this regard, it is essential to develop tools for storing, integrating, accessing, and analyzing these data effectively. We define an ontology for a comprehensive representation of cellular events. The ontology presented here enables integration of fragmented or incomplete pathway information and supports manipulation and incorporation of the stored data, as well as multiple levels of abstraction. Based on this ontology, we present the architecture of an integrated environment named Patika (Pathway Analysis Tool for Integration and Knowledge Acquisition). Patika is composed of a server-side, scalable, object-oriented database and client-side editors to provide an integrated, multi-user environment for visualizing and manipulating networks of cellular events. This tool features automated pathway layout, functional computation support, advanced querying and a user-friendly graphical interface. We expect that Patika will be a valuable tool for rapid knowledge acquisition, interpretation of large-scale microarray-generated data, disease gene identification, and drug development. A prototype of Patika is available upon request from the authors.
Integrated Microfluidic Lectin Barcode Platform for High-Performance Focused Glycomic Profiling
NASA Astrophysics Data System (ADS)
Shang, Yuqin; Zeng, Yun; Zeng, Yong
2016-02-01
Protein glycosylation is one of the key processes that play essential roles in biological functions and dysfunctions. However, progress in glycomics has considerably lagged behind genomics and proteomics, due in part to the enormous challenges in analysis of glycans. Here we present a new integrated and automated microfluidic lectin barcode platform to substantially improve the performance of lectin array for focused glycomic profiling. The chip design and flow control were optimized to promote the lectin-glycan binding kinetics and speed of lectin microarray. Moreover, we established an on-chip lectin assay which employs a very simple blocking method to effectively suppress the undesired background due to lectin binding of antibodies. Using this technology, we demonstrated focused differential profiling of tissue-specific glycosylation changes of a biomarker, CA125 protein purified from ovarian cancer cell line and different tissues from ovarian cancer patients in a fast, reproducible, and high-throughput fashion. Highly sensitive CA125 detection was also demonstrated with a detection limit much lower than the clinical cutoff value for cancer diagnosis. This microfluidic platform holds the potential to integrate with sample preparation functions to construct a fully integrated “sample-to-answer” microsystem for focused differential glycomic analysis. Thus, our technology should present a powerful tool in support of rapid advance in glycobiology and glyco-biomarker development.
Integrated Microfluidic Lectin Barcode Platform for High-Performance Focused Glycomic Profiling
Shang, Yuqin; Zeng, Yun; Zeng, Yong
2016-01-01
Protein glycosylation is one of the key processes that play essential roles in biological functions and dysfunctions. However, progress in glycomics has considerably lagged behind genomics and proteomics, due in part to the enormous challenges in analysis of glycans. Here we present a new integrated and automated microfluidic lectin barcode platform to substantially improve the performance of lectin array for focused glycomic profiling. The chip design and flow control were optimized to promote the lectin-glycan binding kinetics and speed of lectin microarray. Moreover, we established an on-chip lectin assay which employs a very simple blocking method to effectively suppress the undesired background due to lectin binding of antibodies. Using this technology, we demonstrated focused differential profiling of tissue-specific glycosylation changes of a biomarker, CA125 protein purified from ovarian cancer cell line and different tissues from ovarian cancer patients in a fast, reproducible, and high-throughput fashion. Highly sensitive CA125 detection was also demonstrated with a detection limit much lower than the clinical cutoff value for cancer diagnosis. This microfluidic platform holds the potential to integrate with sample preparation functions to construct a fully integrated “sample-to-answer” microsystem for focused differential glycomic analysis. Thus, our technology should present a powerful tool in support of rapid advance in glycobiology and glyco-biomarker development. PMID:26831207
Chang, Ting-Yu; Wu, Yu-Hsuan; Cheng, Cheng-Chung; Wang, Hsei-Wei
2011-09-01
Alternative RNA splicing greatly increases proteome diversity, and the possibility of studying genome-wide alternative splicing (AS) events becomes available with the advent of high-throughput genomics tools devoted to this issue. Kaposi's sarcoma associated herpesvirus (KSHV) is the etiological agent of KS, a tumor of lymphatic endothelial cell (LEC) lineage, but little is known about the AS variations induced by KSHV. We analyzed KSHV-controlled AS using high-density microarrays capable of detecting all exons in the human genome. Splicing variants and altered exon-intron usage in infected LEC were found, and these correlated with protein domain modification. The different 3'-UTR used in new transcripts also help isoforms to escape microRNA-mediated surveillance. Exome-level analysis further revealed information that cannot be disclosed using classical gene-level profiling: a significant exon usage difference existed between LEC and CD34(+) precursor cells, and KSHV infection resulted in LEC-to-precursor, dedifferentiation-like exon level reprogramming. Our results demonstrate the application of exon arrays in systems biology research, and suggest the regulatory effects of AS in endothelial cells are far more complex than previously observed. This extra layer of molecular diversity helps to account for various aspects of endothelial biology, KSHV life cycle and disease pathogenesis that until now have been unexplored.
Integrated Analysis of Transcriptomic and Proteomic Data
Haider, Saad; Pal, Ranadip
2013-01-01
Until recently, understanding the regulatory behavior of cells has been pursued through independent analysis of the transcriptome or the proteome. Based on the central dogma, it was generally assumed that there exists a direct correspondence between mRNA transcripts and the expression of the proteins generated from them. However, recent studies have shown that the correlation between mRNA and protein expression can be low due to various factors such as different half-lives and post-transcriptional machinery. Thus, a joint analysis of transcriptomic and proteomic data can provide useful insights that may not be deciphered from individual analysis of mRNA or protein expression. This article reviews the existing major approaches for joint analysis of transcriptomic and proteomic data. We categorize the different approaches into eight main categories based on the initial algorithm and final analysis goal. We further present analogies with other domains and discuss the existing research problems in this area. PMID:24082820
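The point about low mRNA-protein correlation can be made concrete with a rank correlation computed over matched genes. The following is a minimal sketch, not a method taken from the review: the function names and the five-gene toy values are hypothetical, and Spearman correlation is simply one common choice for this kind of comparison.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical abundances for five genes measured at both levels.
mrna = [1.0, 2.0, 3.0, 4.0, 5.0]
protein = [2.0, 1.0, 4.0, 3.0, 5.0]
rho = spearman(mrna, protein)
```

A rank-based measure is used here because mRNA and protein abundances are typically on different scales; a value of `rho` well below 1 would indicate the kind of discordance the abstract describes.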
Welker, F
2018-02-20
The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from "cross-species proteomic effects": the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. 
Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts.
Deng, Ning; Li, Zhenye; Pan, Chao; Duan, Huilong
2015-01-01
The study of complex proteomes places higher demands on mass spectrometry-based quantification methods. In this paper, we present a mass spectrometry label-free quantification tool for complex proteomes, called freeQuant, which effectively integrates quantification with functional analysis. freeQuant consists of two well-integrated modules: label-free quantification and functional analysis with biomedical knowledge. freeQuant supports label-free quantitative analysis that makes full use of tandem mass spectrometry (MS/MS) spectral count, protein sequence length, shared peptides, and ion intensity. It adopts spectral count for quantitative analysis and builds a new method for shared peptides to accurately evaluate the abundance of isoforms. For proteins with low abundance, MS/MS total ion count coupled with spectral count is included to ensure accurate protein quantification. Furthermore, freeQuant supports large-scale functional annotation of complex proteomes. Mitochondrial proteomes from the mouse heart, the mouse liver, and the human heart were used to evaluate the usability and performance of freeQuant. The evaluation showed that the quantitative algorithms implemented in freeQuant can improve the accuracy of quantification with a better dynamic range.
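As an illustration of how spectral count and protein sequence length can be combined, the sketch below computes a normalized spectral abundance factor (NSAF), a widely used length-normalized spectral-count measure. This is an assumption made for illustration only: the abstract does not state that freeQuant uses NSAF, and the function name and toy values are hypothetical. Shared peptides and ion intensity are not handled here.

```python
def nsaf(spectral_counts, lengths):
    """Normalized spectral abundance factor per protein.

    SAF = spectral count / sequence length; NSAF = SAF / sum of all SAFs.
    """
    saf = {p: spectral_counts[p] / lengths[p] for p in spectral_counts}
    total = sum(saf.values())
    if total == 0:
        raise ValueError("no spectra were counted for any protein")
    return {p: v / total for p, v in saf.items()}


# Hypothetical proteins: P1 has three times the spectra of P2 but is also
# three times longer, so the two receive equal normalized abundance.
counts = {"P1": 30, "P2": 10}
lengths = {"P1": 300, "P2": 100}
result = nsaf(counts, lengths)
```

Dividing by length corrects for the fact that longer proteins yield more peptides, and therefore more spectra, at the same molar abundance.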
Genetic Analysis of Mice Skin Exposed by Hyper-Gravity
NASA Astrophysics Data System (ADS)
Takahashi, Rika; Terada, Masahiro; Seki, Masaya; Higashibata, Akira; Majima, Hideyuki J.; Ohira, Yoshinobu; Mukai, Chiaki; Ishioka, Noriaki
2013-02-01
In the space environment, physiological alterations, such as low bone density, muscle weakness and decreased immunity, are caused by microgravity and cosmic radiation. On the other hand, it is known that leg muscles hypertrophy under 2G gravity. Understanding the effects on the human body across the range from microgravity to hypergravity is therefore very important. Recently, the Japan Aerospace Exploration Agency (JAXA) started a project to detect changes in gene expression and mineral metabolism caused by microgravity by analyzing the hair of astronauts who stay on the International Space Station (ISS) for long periods. The results of this human hair research should clarify the genetic effects of microgravity on human hair roots. However, it is unclear how gene expression in hair roots is affected by hypergravity. Therefore, in this experiment, we analyzed the effects on mouse skin containing hair roots by comparing mice exposed to microgravity or hypergravity. The purpose of this experiment is to evaluate the genetic effects of microgravity or 2G gravity on mouse skin. The samples were taken from mice exposed to space flight (FL) or a hypergravity environment (2G) for 3 months, respectively, together with the respective controls. RNA extracted and amplified from the skin of these mice was used for DNA microarray analysis. By DNA microarray analysis, we found 98 genes whose expression changed in both FL and 2G. The functions and pathways of these 98 genes were identified by Gene Ontology (GO) analysis and Ingenuity Pathways Analysis (IPA) software. Next, we focused on one of the identified pathways and compared the effects of the different environments, FL and 2G, on each molecule in this pathway. As a result, we detected some interesting molecules that might depend on the gravity level. 
In addition, to investigate the relationship between gene and protein expression, proteome analysis was performed. Two-dimensional electrophoresis revealed several spots that differed between FL and 2G. Identification of these spots by MALDI-TOF-MS/MS is now in progress. These results suggest that many genes or proteins in mouse skin might be affected by different gravity levels.
NASA Astrophysics Data System (ADS)
Bogdanov, Valery L.; Boyce-Jacino, Michael
1999-05-01
Confined arrays of biochemical probes deposited on a solid support surface (analytical microarray or 'chip') provide an opportunity to analyze multiple reactions simultaneously. Microarrays are increasingly used in genetics, medicine and environmental scanning as research and analytical instruments. The power of microarray technology comes from its parallelism, which grows with array miniaturization, minimization of reagent volume per reaction site and reaction multiplexing. An optical detector of microarray signals should combine high sensitivity with spatial and spectral resolution. Additionally, low cost and a high processing rate are needed to transfer microarray technology into biomedical practice. We designed an imager that provides confocal and complete spectrum detection of an entire fluorescently labeled microarray in parallel. The imager uses a microlens array, a non-slit spectral decomposer, and a high-sensitivity detector (cooled CCD). Two imaging channels provide simultaneous detection of localization, integrated and spectral intensities for each reaction site in the microarray. Dimensional matching between the microarray and the imager's optics eliminates all moving parts in the instrumentation, enabling highly informative, fast and low-cost microarray detection. We report the theory of confocal hyperspectral imaging with a microlens array and experimental data on the use of the developed imager to detect fluorescently labeled microarrays with a density of approximately 10^3 sites per cm^2.
Systems Proteomics for Translational Network Medicine
Arrell, D. Kent; Terzic, Andre
2012-01-01
Universal principles underlying network science, and their ever-increasing applications in biomedicine, underscore the unprecedented capacity of systems biology based strategies to synthesize and resolve massive high throughput generated datasets. Enabling previously unattainable comprehension of biological complexity, systems approaches have accelerated progress in elucidating disease prediction, progression, and outcome. Applied to the spectrum of states spanning health and disease, network proteomics establishes a collation, integration, and prioritization algorithm to guide mapping and decoding of proteome landscapes from large-scale raw data. Providing unparalleled deconvolution of protein lists into global interactomes, integrative systems proteomics enables objective, multi-modal interpretation at molecular, pathway, and network scales, merging individual molecular components, their plurality of interactions, and functional contributions for systems comprehension. As such, network systems approaches are increasingly exploited for objective interpretation of cardiovascular proteomics studies. Here, we highlight network systems proteomic analysis pipelines for integration and biological interpretation through protein cartography, ontological categorization, pathway and functional enrichment and complex network analysis. PMID:22896016
HTAPP: High-Throughput Autonomous Proteomic Pipeline
Yu, Kebing; Salomon, Arthur R.
2011-01-01
Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic datasets is critically important. The high-throughput autonomous proteomic pipeline (HTAPP) described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is comprised of software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of HTAPP focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples. PMID:20336676
NASA Astrophysics Data System (ADS)
Brazhnik, Kristina; Sokolova, Zinaida; Baryshnikova, Maria; Bilan, Regina; Nabiev, Igor; Sukhanova, Alyona
Multiplexed analysis of cancer markers is crucial for early tumor diagnosis and screening. We have designed lab-on-a-bead microarray for quantitative detection of three breast cancer markers in human serum. Quantum dots were used as bead-bound fluorescent tags for identifying each marker by means of flow cytometry. Antigen-specific beads reliably detected CA 15-3, CEA, and CA 125 in serum samples, providing clear discrimination between the samples with respect to the antigen levels. The novel microarray is advantageous over the routine single-analyte ones due to the simultaneous detection of various markers. Therefore the developed microarray is a promising tool for serum tumor marker profiling.
Achievements and perspectives of top-down proteomics.
Armirotti, Andrea; Damonte, Gianluca
2010-10-01
Over the last few years, top-down (TD) MS has gained a remarkable space in proteomics, rapidly crossing the line between a promising approach and a solid, established technique. Several research groups worldwide have implemented TD analysis in their routine work on proteomics, deriving structural information on proteins with a level of accuracy that is impossible to achieve with classical bottom-up approaches. Complete maps of PTMs and assessment of single amino acid polymorphisms are only a few of the results that can be obtained with this technique. Despite some existing technical and economic limitations, TD analysis is at present the most powerful instrument for MS-based proteomics, and its implementation in routine workflows is a rapidly approaching turning point in proteomics. In this review article, the state of the art of the TD approach is described along with its major advantages and drawbacks, and the most recent trends in TD analysis are discussed. References for all the covered topics are reported in the text, with the aim of supporting both newcomers and mass spectrometrists already introduced to TD proteomics.
Alberio, Tiziana; Pieroni, Luisa; Ronci, Maurizio; Banfi, Cristina; Bongarzone, Italia; Bottoni, Patrizia; Brioschi, Maura; Caterino, Marianna; Chinello, Clizia; Cormio, Antonella; Cozzolino, Flora; Cunsolo, Vincenzo; Fontana, Simona; Garavaglia, Barbara; Giusti, Laura; Greco, Viviana; Lucacchini, Antonio; Maffioli, Elisa; Magni, Fulvio; Monteleone, Francesca; Monti, Maria; Monti, Valentina; Musicco, Clara; Petrosillo, Giuseppe; Porcelli, Vito; Saletti, Rosaria; Scatena, Roberto; Soggiu, Alessio; Tedeschi, Gabriella; Zilocchi, Mara; Roncada, Paola; Urbani, Andrea; Fasano, Mauro
2017-12-01
The Mitochondrial Human Proteome Project aims at understanding the function of the mitochondrial proteome and its crosstalk with the proteome of other organelles. Being able to choose a suitable and validated enrichment protocol of functional mitochondria, based on the specific needs of the downstream proteomics analysis, would greatly help the researchers in the field. Mitochondrial fractions from ten model cell lines were prepared using three enrichment protocols and analyzed on seven different LC-MS/MS platforms. All data were processed using neXtProt as reference database. The data are available for the Human Proteome Project purposes through the ProteomeXchange Consortium with the identifier PXD007053. The processed data sets were analyzed using a suite of R routines to perform a statistical analysis and to retrieve subcellular and submitochondrial localizations. Although the overall number of identified total and mitochondrial proteins was not significantly dependent on the enrichment protocol, specific line to line differences were observed. Moreover, the protein lists were mapped to a network representing the functional mitochondrial proteome, encompassing mitochondrial proteins and their first interactors. More than 80% of the identified proteins resulted in nodes of this network but with a different ability in coisolating mitochondria-associated structures for each enrichment protocol/cell line pair.
Construction of a cDNA microarray derived from the ascidian Ciona intestinalis.
Azumi, Kaoru; Takahashi, Hiroki; Miki, Yasufumi; Fujie, Manabu; Usami, Takeshi; Ishikawa, Hisayoshi; Kitayama, Atsusi; Satou, Yutaka; Ueno, Naoto; Satoh, Nori
2003-10-01
A cDNA microarray was constructed from a basal chordate, the ascidian Ciona intestinalis. The draft genome of Ciona has been read and inferred to contain approximately 16,000 protein-coding genes, and cDNAs for transcripts of 13,464 genes have been characterized and compiled as the "Ciona intestinalis Gene Collection Release I". In the present study, we constructed a cDNA microarray of these 13,464 Ciona genes. A preliminary experiment with Cy3- and Cy5-labeled probes showed extensive differential gene expression between fertilized eggs and larvae. In addition, there was a good correlation between results obtained by the present microarray analysis and those from previous EST analyses. This first microarray of a large collection of Ciona intestinalis cDNA clones should facilitate the analysis of global gene expression and gene networks during the embryogenesis of basal chordates.
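Two-color experiments such as the Cy3/Cy5 comparison above are commonly summarized per spot as a log2 intensity ratio. The sketch below shows that calculation under stated assumptions: the spot names and intensities are hypothetical, the abstract does not say which dye labeled which sample, and the intensity floor is an illustrative guard against taking the logarithm of zero rather than anything described by the authors.

```python
import math


def log2_ratio(cy5, cy3, floor=1.0):
    """log2(Cy5 / Cy3) for one spot, with both intensities floored."""
    return math.log2(max(cy5, floor) / max(cy3, floor))


# Hypothetical background-corrected intensities: (Cy5, Cy3) per spot.
spots = {
    "geneA": (4000.0, 1000.0),
    "geneB": (500.0, 2000.0),
    "geneC": (800.0, 800.0),
}
ratios = {gene: log2_ratio(cy5, cy3) for gene, (cy5, cy3) in spots.items()}
```

On the log2 scale a fourfold increase and a fourfold decrease are symmetric around zero, which is why this transform is usually preferred over raw ratios when ranking differentially expressed genes.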
Experimental Systems-Biology Approaches for Clostridia-Based Bioenergy Production
DOE Office of Scientific and Technical Information (OSTI.GOV)
Papoutsakis, Elefterios
This is the final project report for project "Experimental Systems-Biology Approaches for Clostridia-Based Bioenergy Production" for the funding period of 9/1/12 to 2/28/2015 (three years with a 6-month no-cost extension). OVERVIEW AND PROJECT GOALS: The bottleneck of achieving higher rates and titers of toxic metabolites (such as solvents and carboxylic acids that can be used as biofuels or biofuel precursors) can be overcome by engineering the stress response system. Thus, understanding and modeling the response of cells to toxic metabolites is a problem of great fundamental and practical significance. In this project, our goal is to dissect at the molecular systems level and build models (conceptual and quantitative) for the stress response of C. acetobutylicum (Cac) to its two toxic metabolites: butanol (BuOH) and butyrate (BA). Transcriptional (RNAseq and microarray based), proteomic and fluxomic data and their analysis are key requirements for this goal. Transcriptional data from mid-exponential cultures of Cac under 4 different levels of BuOH and BA stress were obtained using both microarrays (Papoutsakis group) and deep sequencing (RNAseq; Meyers and Papoutsakis groups). These two sets of data not only serve to validate each other, but are also used for identification of stress-induced changes in transcript levels, small regulatory RNAs, and transcriptional start sites. Quantitative proteomic data (Lee group), collected using the iTRAQ technology, are essential for understanding protein levels and turnover under stress and the various protein-protein interactions that orchestrate the stress response. Metabolic flux changes (Antoniewicz group) of core pathways, which provide important information on the re-allocation of energy and carbon resources under metabolite stress, were examined using 13C-labelled chemicals. Omics data are integrated at different levels and scales. 
At the metabolic-pathway level, omics data are integrated into a 2nd generation genome-scale model (GSM) (Maranas group). Omics data are also integrated using bioinformatics (Wu and Huang group), whereby regulatory details of gene and protein expression, protein-protein interactions and metabolic flux regulation are incorporated. The PI (Papoutsakis) facilitated project integration through monthly meetings and reports, conference calls, and collaborative manuscript preparation. The five groups collaborated extensively and made a large number of presentations at national and international meetings. The team has also published several papers, with several more in the preparation stage. Several PhD, MS and postdoctoral students were trained as part of this collaborative and interdisciplinary project.
Implementation of GenePattern within the Stanford Microarray Database.
Hubble, Jeremy; Demeter, Janos; Jin, Heng; Mao, Maria; Nitzberg, Michael; Reddy, T B K; Wymore, Farrell; Zachariah, Zachariah K; Sherlock, Gavin; Ball, Catherine A
2009-01-01
Hundreds of researchers across the world use the Stanford Microarray Database (SMD; http://smd.stanford.edu/) to store, annotate, view, analyze and share microarray data. In addition to providing registered users at Stanford access to their own data, SMD also provides access to public data, and tools with which to analyze those data, to any public user anywhere in the world. Previously, the addition of new microarray data analysis tools to SMD has been limited by available engineering resources, and in addition, the existing suite of tools did not provide a simple way to design, execute and share analysis pipelines, or to document such pipelines for the purposes of publication. To address this, we have incorporated the GenePattern software package directly into SMD, providing access to many new analysis tools, as well as a plug-in architecture that allows users to directly integrate and share additional tools through SMD. In this article, we describe our implementation of the GenePattern microarray analysis software package into the SMD code base. This extension is available with the SMD source code that is fully and freely available to others under an Open Source license, enabling other groups to create a local installation of SMD with an enriched data analysis capability.
Multidimensional proteomics for cell biology.
Larance, Mark; Lamond, Angus I
2015-05-01
The proteome is a dynamic system in which each protein has interconnected properties - dimensions - that together contribute to the phenotype of a cell. Measuring these properties has proved challenging owing to their diversity and dynamic nature. Advances in mass spectrometry-based proteomics now enable the measurement of multiple properties for thousands of proteins, including their abundance, isoform expression, turnover rate, subcellular localization, post-translational modifications and interactions. Complementing these experimental developments are new data analysis, integration and visualization tools as well as data-sharing resources. Together, these advances in the multidimensional analysis of the proteome are transforming our understanding of various cellular and physiological processes.
A catalogue of molecular aberrations that cause ovarian cancer is critical for developing and deploying diagnostics and therapies that will improve patients’ lives. Because a comprehensive molecular view of cancer is important for ultimately guiding treatment, the National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) has released the cancer proteome confirmatory ovarian study data sets.
Couto, Narciso; Schooling, Sarah R; Dutcher, John R; Barber, Jill
2015-10-02
In the present work, two different proteomic platforms, gel-based and gel-free, were used to map the matrix and outer membrane vesicle exoproteomes of Pseudomonas aeruginosa PAO1 biofilms. These two proteomic strategies allowed us a confident identification of 207 and 327 proteins from enriched outer membrane vesicles and whole matrix isolated from biofilms. Because of the physicochemical characteristics of these subproteomes, the two strategies showed complementarity, and thus, the most comprehensive analysis of P. aeruginosa exoproteome to date was achieved. Under our conditions, outer membrane vesicles contribute approximately 20% of the whole matrix proteome, demonstrating that membrane vesicles are an important component of the matrix. The proteomic profiles were analyzed in terms of their biological context, namely, a biofilm. Accordingly relevant metabolic processes involved in cellular adaptation to the biofilm lifestyle as well as those related to P. aeruginosa virulence capabilities were a key feature of the analyses. The diversity of the matrix proteome corroborates the idea of high heterogeneity within the biofilm; cells can display different levels of metabolism and can adapt to local microenvironments making this proteomic analysis challenging. In addition to analyzing our own primary data, we extend the analysis to published data by other groups in order to deepen our understanding of the complexity inherent within biofilm populations.
Advances of Proteomic Sciences in Dentistry.
Khurshid, Zohaib; Zohaib, Sana; Najeeb, Shariq; Zafar, Muhammad Sohail; Rehman, Rabia; Rehman, Ihtesham Ur
2016-05-13
Applications of proteomics tools have revolutionized various biomedical disciplines such as genetics, molecular biology, medicine, and dentistry. The aim of this review is to highlight the major milestones in proteomics in dentistry during the last fifteen years. The human oral cavity contains hard and soft tissues and various biofluids including saliva and crevicular fluid. Proteomics has brought a revolution in dentistry by helping in the early diagnosis of various diseases, identified by the detection of numerous biomarkers present in the oral fluids. This paper covers the role of proteomics tools in the analysis of oral tissues. In addition, dental materials proteomics and their future directions are discussed.
Van, Phu T; Schmid, Amy K; King, Nichole L; Kaur, Amardeep; Pan, Min; Whitehead, Kenia; Koide, Tie; Facciotti, Marc T; Goo, Young Ah; Deutsch, Eric W; Reiss, David J; Mallick, Parag; Baliga, Nitin S
2008-09-01
The relatively small numbers of proteins and fewer possible post-translational modifications in microbes provide a unique opportunity to comprehensively characterize their dynamic proteomes. We have constructed a PeptideAtlas (PA) covering 62.7% of the predicted proteome of the extremely halophilic archaeon Halobacterium salinarum NRC-1 by compiling approximately 636 000 tandem mass spectra from 497 mass spectrometry runs in 88 experiments. Analysis of the PA with respect to biophysical properties of constituent peptides, functional properties of parent proteins of detected peptides, and performance of different mass spectrometry approaches has highlighted plausible strategies for improving proteome coverage and selecting signature peptides for targeted proteomics. Notably, discovery of a significant correlation between absolute abundances of mRNAs and proteins has helped identify low abundance of proteins as the major limitation in peptide detection. Furthermore, we have discovered that iTRAQ labeling for quantitative proteomic analysis introduces a significant bias in peptide detection by mass spectrometry. Therefore, despite identifying at least one proteotypic peptide for almost all proteins in the PA, a context-dependent selection of proteotypic peptides appears to be the most effective approach for targeted proteomics.
The journal Molecular & Cellular Proteomics (MCP), in collaboration with the Clinical Proteomic Tumor Analysis Consortium (CPTAC) of the National Cancer Institute (NCI), part of the National Institutes of Health, announce new guidelines and requirements for papers describing the development and application of targeted mass spectrometry measurements of peptides, modified peptides and proteins (Mol Cell Proteomics 2017; PMID: 28183812). NCI’s participation is part of NIH’s overall effort to address the r
An estimated 252,710 new cases of female breast cancer, accounting for 15% of all new cancer cases, occurred in 2017. To better understand proteogenomic abnormalities in breast cancer, the National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) announces the release of the cancer proteome confirmatory breast study data. The goal of the study was to comprehensively characterize the proteome and phosphoproteome on approximately 100 prospectively collected breast tumor and adjacent normal tissues.
Fabrication of Carbohydrate Microarrays by Boronate Formation.
Adak, Avijit K; Lin, Ting-Wei; Li, Ben-Yuan; Lin, Chun-Cheng
2017-01-01
The interactions between soluble carbohydrates and/or surface displayed glycans and protein receptors are essential to many biological processes and cellular recognition events. Carbohydrate microarrays provide opportunities for high-throughput quantitative analysis of carbohydrate-protein interactions. Over the past decade, various techniques have been implemented for immobilizing glycans on solid surfaces in a microarray format. Herein, we describe a detailed protocol for fabricating carbohydrate microarrays that capitalizes on the intrinsic reactivity of boronic acid toward carbohydrates to form stable boronate diesters. A large variety of unprotected carbohydrates ranging in structure from simple disaccharides and trisaccharides to considerably more complex human milk and blood group (oligo)saccharides have been covalently immobilized in a single step on glass slides, which were derivatized with high-affinity boronic acid ligands. The immobilized ligands in these microarrays maintain the receptor-binding activities including those of lectins and antibodies according to the structures of their pendant carbohydrates for rapid analysis of a number of carbohydrate-recognition events within 30 h. This method facilitates the direct construction of otherwise difficult to obtain carbohydrate microarrays from underivatized glycans.
The Glycan Microarray Story from Construction to Applications.
Hyun, Ji Young; Pai, Jaeyoung; Shin, Injae
2017-04-18
Not only are glycan-mediated binding processes in cells and organisms essential for a wide range of physiological processes, but they are also implicated in various pathological processes. As a result, elucidation of glycan-associated biomolecular interactions and their consequences is of great importance in basic biological research and biomedical applications. In 2002, we and others were the first to utilize glycan microarrays in efforts aimed at the rapid analysis of glycan-associated recognition events. Because they contain a number of glycans immobilized in a dense and orderly manner on a solid surface, glycan microarrays enable multiple parallel analyses of glycan-protein binding events while utilizing only small amounts of glycan samples. Therefore, this microarray technology has become a leading edge tool in studies aimed at elucidating roles played by glycans and glycan binding proteins in biological systems. In this Account, we summarize our efforts on the construction of glycan microarrays and their applications in studies of glycan-associated interactions. Immobilization strategies of functionalized and unmodified glycans on derivatized glass surfaces are described. Although others have developed immobilization techniques, our efforts have focused on improving the efficiencies and operational simplicity of microarray construction. The microarray-based technology has been most extensively used for rapid analysis of the glycan binding properties of proteins. In addition, glycan microarrays have been employed to determine glycan-protein interactions quantitatively, detect pathogens, and rapidly assess substrate specificities of carbohydrate-processing enzymes. More recently, the microarrays have been employed to identify functional glycans that elicit cell surface lectin-mediated cellular responses. 
Owing to these efforts, it is now possible to use glycan microarrays to expand the understanding of roles played by glycans and glycan binding proteins in biological systems.
EDGE3: A web-based solution for management and analysis of Agilent two color microarray experiments
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-01-01
Background The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE3 was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. Results EDGE3 has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE3 is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. 
Conclusion Here, we present EDGE3, an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE3 provides a means for managing RNA samples and arrays during the hybridization process. EDGE3 is freely available for download at http://edge.oncology.wisc.edu/. PMID:19732451
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-09-04
The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE(3) was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. EDGE(3) has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE(3) is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. 
Here, we present EDGE(3), an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE(3) provides a means for managing RNA samples and arrays during the hybridization process. EDGE(3) is freely available for download at http://edge.oncology.wisc.edu/.
Proteomic analysis of bovine nucleolus.
Patel, Amrutlal K; Olson, Doug; Tikoo, Suresh K
2010-09-01
The nucleolus is the most prominent subnuclear structure and performs a wide variety of functions in eukaryotic cellular processes. In order to understand the structural and functional role of the nucleoli in bovine cells, we analyzed the proteomic composition of the bovine nucleoli. The nucleoli were isolated from Madin Darby bovine kidney cells and subjected to proteomic analysis by LC-MS/MS after fractionation by SDS-PAGE and strong cation exchange chromatography. Analysis of the data using the Mascot database search and the GPM database search identified 311 proteins in the bovine nucleoli, which contained 22 proteins previously not identified in the proteomic analysis of human nucleoli. Analysis of the identified proteins using the GoMiner software suggested that the bovine nucleoli contained proteins involved in ribosomal biogenesis, cell cycle control, transcriptional, translational and post-translational regulation, transport, and structural organization. Copyright © 2010 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
What is the study? This study is the first to use microarray analysis in the Ames strains of Salmonella. The microarray chips were custom-designed for this study and are not commercially available, and we evaluated the well-studied drinking water mutagen, MX. Because much inform...
MICROARRAY ANALYSIS OF DICHLOROACETIC ACID-INDUCED CHANGES IN GENE EXPRESSION
Dichloroacetic acid (DCA) is a major by-product of water disinfection by chlorination. Several studies have demonstrated the hepatocarcinogenicity of DCA in rodents when administered in dri...
Colangelo, Christopher M.; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L.; Carriero, Nicholas J.; Gulcicek, Erol E.; Lam, TuKiet T.; Wu, Terence; Bjornson, Robert D.; Bruce, Can; Nairn, Angus C.; Rinehart, Jesse; Miller, Perry L.; Williams, Kenneth R.
2015-01-01
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry (LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED’s database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. PMID:25712262
Choi, Hyungwon; Kim, Sinae; Fermin, Damian; Tsou, Chih-Chiang; Nesvizhskii, Alexey I
2015-11-03
We introduce QPROT, a statistical framework and computational tool for differential protein expression analysis using protein intensity data. QPROT is an extension of the QSPEC suite, originally developed for spectral count data, adapted for the analysis using continuously measured protein-level intensity data. QPROT offers a new intensity normalization procedure and model-based differential expression analysis, both of which account for missing data. Determination of differential expression of each protein is based on the standardized Z-statistic based on the posterior distribution of the log fold change parameter, guided by the false discovery rate estimated by a well-known Empirical Bayes method. We evaluated the classification performance of QPROT using the quantification calibration data from the clinical proteomic technology assessment for cancer (CPTAC) study and a recently published Escherichia coli benchmark dataset, with evaluation of FDR accuracy in the latter. QPROT is a statistical framework with computational software tool for comparative quantitative proteomics analysis. It features various extensions of QSPEC method originally built for spectral count data analysis, including probabilistic treatment of missing values in protein intensity data. With the increasing popularity of label-free quantitative proteomics data, the proposed method and accompanying software suite will be immediately useful for many proteomics laboratories. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015 Elsevier B.V. All rights reserved.
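The decision rule described in this abstract (a standardized Z-statistic on the log fold change, thresholded by an estimated false discovery rate) can be illustrated with a minimal sketch. This is not QPROT's code: the function names are hypothetical, the posterior means and standard deviations are assumed to be supplied by some upstream model, and the Benjamini-Hochberg adjustment below is a stand-in for the Empirical Bayes FDR estimate that the abstract mentions but does not detail.

```python
import math


def z_statistics(post_mean, post_sd):
    """Standardize each log fold change by its posterior standard deviation."""
    return [m / s for m, s in zip(post_mean, post_sd)]


def two_sided_p(z):
    """Two-sided tail probability of a standard normal at |z|."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted p-values (stand-in for an Empirical Bayes FDR)."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down so monotonicity can be enforced.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def call_differential(post_mean, post_sd, fdr=0.05):
    """Return indices of proteins whose adjusted p-value falls below the FDR level."""
    z = z_statistics(post_mean, post_sd)
    adj = benjamini_hochberg([two_sided_p(v) for v in z])
    return [i for i, q in enumerate(adj) if q <= fdr]
```

For example, `call_differential([2.5, 0.1, -3.0], [0.5, 0.5, 0.6])` would flag the first and third proteins and leave the second unflagged, since only their standardized changes are large relative to their uncertainty.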
Colangelo, Christopher M; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L; Carriero, Nicholas J; Gulcicek, Erol E; Lam, TuKiet T; Wu, Terence; Bjornson, Robert D; Bruce, Can; Nairn, Angus C; Rinehart, Jesse; Miller, Perry L; Williams, Kenneth R
2015-02-01
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography-tandem mass spectrometry (LC-MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Robust gene selection methods using weighting schemes for microarray data analysis.
Kang, Suyeon; Song, Jongwoo
2017-09-02
A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.
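For orientation, the baseline SAM statistic that the authors modify can be sketched as follows. The abstract does not specify the weighting schemes, so the sketch covers only the standard relative-difference score d = (mean_a - mean_b) / (s + s0), where s is the pooled standard error and s0 is a small stabilizing constant; the function names and the default `s0` are assumptions made for illustration.

```python
import math
from statistics import mean


def sam_statistic(group_a, group_b, s0=0.0):
    """Standard SAM relative difference for one gene across two conditions."""
    n_a, n_b = len(group_a), len(group_b)
    mean_a, mean_b = mean(group_a), mean(group_b)
    # Pooled sum of squared deviations over both groups.
    ss = sum((x - mean_a) ** 2 for x in group_a) + sum(
        (x - mean_b) ** 2 for x in group_b
    )
    pooled_se = math.sqrt((1.0 / n_a + 1.0 / n_b) * ss / (n_a + n_b - 2))
    # s0 damps genes whose tiny variance would otherwise inflate the score.
    return (mean_a - mean_b) / (pooled_se + s0)


def rank_genes(expr_a, expr_b, s0=0.0):
    """Rank gene names by |d|, largest first; inputs map gene name -> replicates."""
    scores = {g: sam_statistic(expr_a[g], expr_b[g], s0) for g in expr_a}
    return sorted(scores, key=lambda g: abs(scores[g]), reverse=True)
```

With few or noisy replicates the pooled standard error is itself unstable, which is the regime in which the abstract reports that plain SAM degrades.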
Halobacterium salinarum NRC-1 PeptideAtlas: strategies for targeted proteomics
Van, Phu T.; Schmid, Amy K.; King, Nichole L.; Kaur, Amardeep; Pan, Min; Whitehead, Kenia; Koide, Tie; Facciotti, Marc T.; Goo, Young-Ah; Deutsch, Eric W.; Reiss, David J.; Mallick, Parag; Baliga, Nitin S.
2009-01-01
The relatively small numbers of proteins and fewer possible posttranslational modifications in microbes provide a unique opportunity to comprehensively characterize their dynamic proteomes. We have constructed a Peptide Atlas (PA) for 62.7% of the predicted proteome of the extremely halophilic archaeon Halobacterium salinarum NRC-1 by compiling approximately 636,000 tandem mass spectra from 497 mass spectrometry runs in 88 experiments. Analysis of the PA with respect to biophysical properties of constituent peptides, functional properties of parent proteins of detected peptides, and performance of different mass spectrometry approaches has helped highlight plausible strategies for improving proteome coverage and selecting signature peptides for targeted proteomics. Notably, discovery of a significant correlation between absolute abundances of mRNAs and proteins has helped identify low abundance of proteins as the major limitation in peptide detection. Furthermore, we have discovered that iTRAQ labeling for quantitative proteomic analysis introduces a significant bias in peptide detection by mass spectrometry. Therefore, despite identifying at least one proteotypic peptide for almost all proteins in the PA, a context-dependent selection of proteotypic peptides appears to be the most effective approach for targeted proteomics. PMID:18652504
[Techniques for rapid production of monoclonal antibodies for use with antibody technology].
Kamada, Haruhiko
2012-01-01
A monoclonal antibody (Mab), due to its specific binding ability to a target protein, can potentially be one of the most useful tools for the functional analysis of proteins in recent proteomics-based research. However, the production of Mabs is a very time-consuming and laborious process (i.e., preparation of recombinant antigens, immunization of animals, preparation of hybridomas), making it the rate-limiting step in using Mabs in high-throughput proteomics research, which relies heavily on comprehensive and rapid methods. Therefore, there is a great demand for new methods to efficiently generate Mabs against a group of proteins identified by proteome analysis. Here, we describe a useful method called the "Antibody proteomic technique" for the rapid generation of Mabs against pharmaceutical targets identified by proteomic analyses of disease samples (e.g., tumor tissue). We also introduce another method, called the "Vascular proteomic technique", for finding profitable targets on the vasculature. Our results suggest that this method for the rapid generation of Mabs to proteins may be very useful in proteomics-based research as well as in clinical applications.
Bergerat, Agnes; Decano, Julius; Wu, Chang-Jiun; Choi, Hyungwon; Nesvizhskii, Alexey I; Moran, Ann Marie; Ruiz-Opazo, Nelson; Steffen, Martin; Herrera, Victoria LM
2011-01-01
Stroke is the third leading cause of death in the United States with high rates of morbidity among survivors. The search to fill the unequivocal need for new therapeutic approaches would benefit from unbiased proteomic analyses of animal models of spontaneous stroke in the prestroke stage. Since brain microvessels play key roles in neurovascular coupling, we investigated prestroke microvascular proteome changes. Proteomic analysis of cerebral cortical microvessels (cMVs) was done by tandem mass spectrometry comparing two prestroke time points. Metaprotein-pathway analyses of proteomic spectral count data were done to identify risk factor–induced changes, followed by QSPEC-analyses of individual protein changes associated with increased stroke susceptibility. We report 26 cMV proteome profiles from male and female stroke-prone and non–stroke-prone rats at 2 months and 4.5 months of age prior to overt stroke events. We identified 1,934 proteins by two or more peptides. Metaprotein pathway analysis detected age-associated changes in energy metabolism and cell-to-microenvironment interactions, as well as sex-specific changes in energy metabolism and endothelial leukocyte transmigration pathways. Stroke susceptibility was associated independently with multiple protein changes associated with ischemia, angiogenesis or involved in blood brain barrier (BBB) integrity. Immunohistochemical analysis confirmed aquaporin-4 and laminin-α1 induction in cMVs, representative of proteomic changes with >65 Bayes factor (BF), associated with stroke susceptibility. Altogether, proteomic analysis demonstrates significant molecular changes in ischemic cerebral microvasculature in the prestroke stage, which could contribute to the observed model phenotype of microhemorrhages and postischemic hemorrhagic transformation. These pathways comprise putative targets for translational research of much needed novel diagnostic and therapeutic approaches for stroke. PMID:21519634
The application of DNA microarrays in gene expression analysis.
van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J
2000-03-31
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.
Implementation of mutual information and bayes theorem for classification microarray data
NASA Astrophysics Data System (ADS)
Dwifebri Purbolaksono, Mahendra; Widiastuti, Kurnia C.; Syahrul Mubarok, Mohamad; Adiwijaya; Aminy Ma’ruf, Firda
2018-03-01
Microarray technology is a technology that can read the structure of genes. Analysis is important for this technology because it determines which attributes are more important than others. Microarray technology can provide cancer-related information by examining a person's genes. Preparing microarray data is a major problem and takes a long time because the data contain a large number of insignificant and irrelevant attributes. A method is therefore needed to reduce the dimensionality of microarray data without eliminating the important information in the attributes. This research uses Mutual Information to reduce the dimensionality. The system is built with a machine learning approach based on Bayes' theorem, which takes a statistical and probabilistic approach. Combining both methods yields a powerful approach to microarray data classification. The experimental results show that the system classifies microarray data well, with the highest F1-scores being 91.06% using a Bayesian Network and 88.85% using Naïve Bayes.
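The dimension-reduction step described here, scoring each attribute by its mutual information with the class label and keeping the top scorers, can be sketched in a few lines. This is an illustrative sketch rather than the authors' system: it assumes expression values have already been discretized (for example into low/high bins), and the function names `mutual_information` and `select_top_k` are hypothetical.

```python
import math
from collections import Counter


def mutual_information(feature, labels):
    """Mutual information (in nats) between a discrete feature and class labels."""
    n = len(labels)
    joint = Counter(zip(feature, labels))
    p_feature = Counter(feature)
    p_label = Counter(labels)
    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # Compare the joint probability with what independence would predict.
        mi += p_xy * math.log(p_xy / ((p_feature[x] / n) * (p_label[y] / n)))
    return mi


def select_top_k(features, labels, k):
    """Keep the k attribute names with the highest mutual information."""
    scores = {name: mutual_information(vals, labels) for name, vals in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

An attribute that tracks the class label perfectly scores log 2 nats on a balanced two-class problem, while an attribute independent of the label scores zero, so the ranking discards uninformative genes before any Bayesian classifier is trained on the reduced set.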
Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB.
Chatziioannou, Aristotelis; Moulos, Panagiotis; Kolisis, Fragiskos N
2009-10-27
The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However, there is an absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided depend heavily on the programming skills of the user, whereas GUI embedded solutions do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods. We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports 2 coloured cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime.
Gene ARMADA provides a highly adaptable, integrative, yet flexible tool which can be used for automated quality control, analysis, annotation and visualization of microarray data, constituting a starting point for further data interpretation and integration with numerous other tools.
Microarray data mining using Bioconductor packages.
Nie, Haisheng; Neerincx, Pieter B T; van der Poel, Jan; Ferrari, Francesco; Bicciato, Silvio; Leunissen, Jack A M; Groenen, Martien A M
2009-07-16
This paper describes the results of a Gene Ontology (GO) term enrichment analysis of chicken microarray data using the Bioconductor packages. By checking the enriched GO terms in three contrasts (MM8-PM8, MM8-MA8, and MM8-MM24) of the microarray data provided during this workshop, this analysis aimed to investigate the host reactions in chickens occurring shortly after a secondary challenge with either a homologous or heterologous species of Eimeria. The results of GO enrichment analysis using GO terms annotated to chicken genes and GO terms annotated to chicken-human orthologous genes were also compared. Furthermore, a locally adaptive statistical procedure (LAP) was performed to test differentially expressed chromosomal regions, rather than individual genes, in the chicken genome after Eimeria challenge. GO enrichment analysis identified significant (raw p-value < 0.05) GO terms for all three contrasts included in the analysis. Some of these GO terms were linked, in general, to primary or secondary immune responses, indicating that GO enrichment analysis is a useful approach to analyzing microarray data. Comparing GO enrichment results obtained with chicken gene information and with chicken-human orthologous gene information showed more refined GO terms related to immune responses in the latter case. This suggests that using chicken-human orthologous gene information has higher power to detect significant GO terms with more refined functionality. Furthermore, three chromosome regions were identified as significantly up-regulated in contrast MM8-PM8 (q-value < 0.01). Overall, this paper describes a practical approach to analyzing microarray data in farm animals where the genome information is still incomplete.
For farm animals, such as chicken, with currently limited gene annotation, borrowing gene annotation information from orthologous genes in well-annotated species, such as human, will help improve the pathway analysis results substantially. Furthermore, the LAP analysis approach is a relatively new and very useful method for microarray analysis.
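As an illustration of how a raw enrichment p-value for a single GO term can be obtained, the following sketch uses a one-sided hypergeometric test, a common choice for term enrichment. The abstract does not state which test the Bioconductor packages applied, so the test itself, the function name and the example counts are assumptions.

```python
from math import comb

def enrichment_p_value(population, annotated, selected, overlap):
    """P(X >= overlap) for X ~ Hypergeometric.

    population: genes on the array (N)
    annotated:  genes carrying the GO term (K)
    selected:   differentially expressed genes (n)
    overlap:    selected genes that carry the GO term (k)
    """
    upper = min(annotated, selected)
    favourable = sum(comb(annotated, i) * comb(population - annotated, selected - i)
                     for i in range(overlap, upper + 1))
    return favourable / comb(population, selected)

# Hypothetical counts: 20 genes, 5 annotated, 4 selected, 3 of them annotated.
p = enrichment_p_value(population=20, annotated=5, selected=4, overlap=3)
is_enriched = p < 0.05   # raw p-value threshold quoted in the abstract
```

In practice one such test is run per GO term, which is why the abstract distinguishes raw p-values from the q-values reported for the chromosomal regions.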
The accurate quantitation of proteins or peptides using Mass Spectrometry (MS) is gaining prominence in the biomedical research community as an alternative method for analyte measurement. The Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigators have been at the forefront in the promotion of reproducible MS techniques, through the development and application of standardized proteomic methods for protein quantitation on biologically relevant samples.
Polyadenylation state microarray (PASTA) analysis.
Beilharz, Traude H; Preiss, Thomas
2011-01-01
Nearly all eukaryotic mRNAs terminate in a poly(A) tail that serves important roles in mRNA utilization. In the cytoplasm, the poly(A) tail promotes both mRNA stability and translation, and these functions are frequently regulated through changes in tail length. To identify the scope of poly(A) tail length control in a transcriptome, we developed the polyadenylation state microarray (PASTA) method. It involves the purification of mRNA based on poly(A) tail length using thermal elution from poly(U) sepharose, followed by microarray analysis of the resulting fractions. In this chapter we detail our PASTA approach and describe some methods for bulk and mRNA-specific poly(A) tail length measurements of use to monitor the procedure and independently verify the microarray data.
The Use of Atomic Force Microscopy for 3D Analysis of Nucleic Acid Hybridization on Microarrays.
Dubrovin, E V; Presnova, G V; Rubtsova, M Yu; Egorov, A M; Grigorenko, V G; Yaminsky, I V
2015-01-01
Oligonucleotide microarrays are considered today to be one of the most efficient methods of gene diagnostics. The capability of atomic force microscopy (AFM) to characterize the three-dimensional morphology of single molecules on a surface allows one to use it as an effective tool for the 3D analysis of a microarray for the detection of nucleic acids. The high resolution of AFM offers ways to decrease the detection threshold of target DNA and increase the signal-to-noise ratio. In this work, we suggest an approach to evaluating the results of hybridization of gold nanoparticle-labeled nucleic acids on silicon microarrays, based on an AFM analysis of the surface both in air and in liquid that takes their three-dimensional structure into account. We suggest a quantitative measure of the hybridization results which is based on the fraction of the surface area occupied by the nanoparticles.
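The proposed measure, the fraction of the surface area occupied by nanoparticles, can be sketched as follows. The abstract does not say how particles are segmented, so the simple height threshold, the function name and the toy height map are assumptions made for illustration only.

```python
def occupied_fraction(height_map, threshold_nm):
    """Fraction of AFM pixels whose height exceeds a threshold.

    height_map is a list of rows of heights in nanometres; pixels above
    threshold_nm are treated as covered by a nanoparticle (an assumption).
    """
    total = sum(len(row) for row in height_map)
    covered = sum(1 for row in height_map for h in row if h > threshold_nm)
    return covered / total

# Hypothetical 3 x 4 scan: three pixels rise well above the flat substrate.
scan = [
    [0.4, 0.5, 9.8, 0.3],
    [0.2, 10.1, 0.6, 0.4],
    [0.5, 0.3, 0.2, 9.5],
]
fraction = occupied_fraction(scan, threshold_nm=5.0)
```

Comparing this fraction between a hybridized spot and a control spot would give a single number per spot, which is the kind of quantitative readout the abstract describes.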
The Utility of Chromosomal Microarray Analysis in Developmental and Behavioral Pediatrics
ERIC Educational Resources Information Center
Beaudet, Arthur L.
2013-01-01
Chromosomal microarray analysis (CMA) has emerged as a powerful new tool to identify genomic abnormalities associated with a wide range of developmental disabilities including congenital malformations, cognitive impairment, and behavioral abnormalities. CMA includes array comparative genomic hybridization (CGH) and single nucleotide polymorphism…
2011-01-01
Background: Cytogenetic evaluation is a key component of the diagnosis and prognosis of chronic lymphocytic leukemia (CLL). We performed oligonucleotide-based comparative genomic hybridization microarray analysis on 34 samples with CLL and known abnormal karyotypes previously determined by cytogenetics and/or fluorescence in situ hybridization (FISH). Results: Using a custom designed microarray that targets >1800 genes involved in hematologic disease and other malignancies, we identified additional cryptic aberrations and novel findings in 59% of cases. These included gains and losses of genes associated with cell cycle regulation, apoptosis and susceptibility loci on 3p21.31, 5q35.2q35.3, 10q23.31q23.33, 11q22.3, and 22q11.23. Conclusions: Our results show that microarray analysis will detect known aberrations, including microscopic and cryptic alterations. In addition, novel genomic changes will be uncovered that may become important prognostic predictors or treatment targets for CLL in the future. PMID:22087757
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clair, Geremy; Piehowski, Paul D.; Nicola, Teodora
Global proteomics approaches allow characterization of whole tissue lysates to an impressive depth. However, it is now increasingly recognized that to better understand the complexity of multicellular organisms, global protein profiling of specific spatially defined regions/substructures of tissues (i.e. spatially-resolved proteomics) is essential. Laser capture microdissection (LCM) enables microscopic isolation of defined regions of tissues preserving crucial spatial information. However, current proteomics workflows entail several manual sample preparation steps and are challenged by the microscopic mass-limited samples generated by LCM, and that impact measurement robustness, quantification, and throughput. Here, we coupled LCM with a fully automated sample preparation workflow that with a single manual step allows: protein extraction, tryptic digestion, peptide cleanup and LC-MS/MS analysis of proteomes from microdissected tissues. Benchmarking against the current state of the art in ultrasensitive global proteomic analysis, our approach demonstrated significant improvements in quantification and throughput. Using our LCM-SNaPP proteomics approach, we characterized to a depth of more than 3,400 proteins, the ontogeny of protein changes during normal lung development in laser capture microdissected alveolar tissue containing ~4,000 cells per sample. Importantly, the data revealed quantitative changes for 350 low abundance transcription factors and signaling molecules, confirming earlier transcript-level observations and defining seven modules of coordinated transcription factor/signaling molecule expression patterns, suggesting that a complex network of temporal regulatory control directs normal lung development with epigenetic regulation fine-tuning pre-natal developmental processes.
Our LCM-proteomics approach facilitates efficient, spatially-resolved, ultrasensitive global proteomics analyses in high-throughput that will be enabling for several clinical and biological applications.
A reference guide for tree analysis and visualization
2010-01-01
The quantities of data obtained by the new high-throughput technologies, such as microarrays or ChIP-Chip arrays, and the large-scale OMICS-approaches, such as genomics, proteomics and transcriptomics, are becoming vast. Sequencing technologies are becoming cheaper and easier to use and, thus, large-scale evolutionary studies towards the origins of life for all species and their evolution are becoming more and more challenging. Databases holding information about how data are related and how they are hierarchically organized expand rapidly. Clustering analysis is becoming more and more difficult to apply to very large amounts of data, since the results of these algorithms cannot be efficiently visualized. Most of the available visualization tools that are able to represent such hierarchies project data in 2D and often lack the necessary user-friendliness and interactivity. For example, the current phylogenetic tree visualization tools are not able to display large-scale trees with more than a few thousand nodes in an easily understood way. In this study, we review tools that are currently available for the visualization and analysis of biological trees, mainly developed during the last decade. We describe the uniform and standard computer readable formats to represent tree hierarchies and we comment on the functionality and the limitations of these tools. We also discuss how these tools can be developed further and should become integrated with various data sources. Here we focus on freely available software that offers users various tree-representation methodologies for biological data analysis. PMID:20175922
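To make the idea of a computer readable tree format concrete, the sketch below parses a string in Newick notation, one widely used standard for tree hierarchies. The abstract does not list the formats it covers, so the choice of Newick is an assumption, and the parser handles only the basic syntax (nested parentheses, node names, optional branch lengths, terminating semicolon).

```python
def parse_newick(text):
    """Parse a basic Newick string into nested (name, children) tuples.

    The string must end with ';'. Branch lengths after ':' are dropped.
    """
    pos = 0

    def parse_node():
        nonlocal pos
        children = []
        if text[pos] == "(":
            pos += 1                          # consume "("
            children.append(parse_node())
            while text[pos] == ",":
                pos += 1                      # consume ","
                children.append(parse_node())
            pos += 1                          # consume ")"
        start = pos
        while text[pos] not in ",();":
            pos += 1                          # read the node label
        name = text[start:pos].split(":")[0]  # strip optional branch length
        return (name, children)

    return parse_node()

def count_leaves(node):
    """Number of leaf nodes below (and including) the given node."""
    _, children = node
    return 1 if not children else sum(count_leaves(c) for c in children)

tree = parse_newick("((A:0.1,B:0.2)AB,(C,D)CD)root;")
leaves = count_leaves(tree)
```

A nested structure like this is what a visualization tool would traverse to lay out nodes, and the recursion depth grows with tree height, which hints at why trees with many thousands of nodes are hard to handle.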
Advances of Proteomic Sciences in Dentistry
Khurshid, Zohaib; Zohaib, Sana; Najeeb, Shariq; Zafar, Muhammad Sohail; Rehman, Rabia; Rehman, Ihtesham Ur
2016-01-01
Applications of proteomics tools revolutionized various biomedical disciplines such as genetics, molecular biology, medicine, and dentistry. The aim of this review is to highlight the major milestones in proteomics in dentistry during the last fifteen years. Human oral cavity contains hard and soft tissues and various biofluids including saliva and crevicular fluid. Proteomics has brought revolution in dentistry by helping in the early diagnosis of various diseases identified by the detection of numerous biomarkers present in the oral fluids. This paper covers the role of proteomics tools for the analysis of oral tissues. In addition, dental materials proteomics and their future directions are discussed. PMID:27187379
NASA Technical Reports Server (NTRS)
Karouia, Fathi; Peyvan, Kia; Danley, David; Ricco, Antonio J.; Santos, Orlando; Pohorille, Andrew
2011-01-01
Human space travelers experience a unique environment that affects homeostasis and physiologic adaptation. The spacecraft environment subjects the traveler to noise, chemical and microbiological contaminants, increased radiation, and variable gravity forces. As humans prepare for long-duration missions to the International Space Station (ISS) and beyond, effective measures must be developed, verified and implemented to ensure mission success. Limited biomedical quantitative capabilities are currently available onboard the ISS. Therefore, the development of versatile instruments to perform space biological analysis and to monitor astronauts' health is needed. We are developing a fully automated, miniaturized system for measuring gene expression on small spacecraft in order to better understand the influence of the space environment on biological systems. This low-cost, low-power, multi-purpose instrument represents a major scientific and technological advancement by providing data on cellular metabolism and regulation. The current system will support growth of microorganisms, extract and purify the RNA, hybridize it to the array, read the expression levels of a large number of genes by microarray analysis, and transmit the measurements to Earth. The system will help discover how bacteria develop resistance to antibiotics and how pathogenic bacteria sometimes increase their virulence in space, facilitating the development of adequate countermeasures to decrease risks associated with human spaceflight. The current stand-alone technology could be used as an integrated platform onboard the ISS to perform similar genetic analyses on any biological systems from the tree of life. Additionally, with some modification the system could be implemented to perform real-time in-situ microbial monitoring of the ISS environment (air, surface and water samples) and the astronaut's microbiome using 16SrRNA microarray technology. 
Furthermore, the current system can be enhanced substantially by combining it with other technologies for automated, miniaturized, high-throughput biological measurements, such as fast sequencing, protein identification (proteomics) and metabolite profiling (metabolomics). Thus, the system can be integrated with other biomedical instruments in order to support and enhance telemedicine capability onboard ISS. NASA's mission includes sustained investment in critical research leading to effective countermeasures to minimize the risks associated with human spaceflight, and the use of appropriate technology to sustain space exploration at reasonable cost. Our integrated microarray technology is expected to fulfill these two critical requirements and to enable the scientific community to better understand and monitor the effects of the space environment on microorganisms and on the astronaut, in the process leveraging current capabilities and overcoming present limitations.
Recent advances in proteomics of cereals.
Bansal, Monika; Sharma, Madhu; Kanwar, Priyanka; Goyal, Aakash
Cereals contribute a major part of human nutrition and are considered as an integral source of energy for human diets. With genomic databases already available in cereals such as rice, wheat, barley, and maize, the focus has now moved to proteome analysis. Proteomics studies involve the development of appropriate databases based on developing suitable separation and purification protocols, identification of protein functions, and can confirm their functional networks based on already available data from other sources. Tremendous progress has been made in the past decade in generating huge data-sets for covering interactions among proteins, protein composition of various organs and organelles, quantitative and qualitative analysis of proteins, and to characterize their modulation during plant development, biotic, and abiotic stresses. Proteomics platforms have been used to identify and improve our understanding of various metabolic pathways. This article gives a brief review of efforts made by different research groups on comparative descriptive and functional analysis of proteomics applications achieved in the cereal science so far.
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V. Lila; Karagouni, Amalia D.; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality. PMID:22267904
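The core idea of domain fusion analysis can be sketched as a search for proteins in proteome A that match two different proteins of proteome B in separate regions. This is not the SAFE implementation; the input format (pre-computed significant similarity hits with coordinates on the A protein), the function name and the overlap rule are all assumptions.

```python
from collections import defaultdict
from itertools import combinations

def find_fusion_candidates(hits, max_overlap=0):
    """Find A proteins hit by two distinct B proteins in separate regions.

    hits: iterable of (a_id, b_id, a_start, a_end) tuples, one per
          statistically significant similarity hit (hypothetical format).
    Returns {a_id: [(b_id_1, b_id_2), ...]}.
    """
    by_a = defaultdict(list)
    for a_id, b_id, start, end in hits:
        by_a[a_id].append((b_id, start, end))

    candidates = {}
    for a_id, regions in by_a.items():
        pairs = set()
        for (b1, s1, e1), (b2, s2, e2) in combinations(regions, 2):
            if b1 == b2:
                continue
            # Number of residues shared by the two aligned regions on A.
            overlap = min(e1, e2) - max(s1, s2) + 1
            if overlap <= max_overlap:
                pairs.add(tuple(sorted((b1, b2))))
        if pairs:
            candidates[a_id] = sorted(pairs)
    return candidates

# Hypothetical hits: A1 looks like a fusion of B1 and B2; A2 does not,
# because B3 and B4 align to the same region of A2.
hits = [
    ("A1", "B1", 1, 200),
    ("A1", "B2", 220, 450),
    ("A2", "B3", 1, 300),
    ("A2", "B4", 50, 280),
]
fusions = find_fusion_candidates(hits)
```

Requiring non-overlapping regions is one way to separate a genuine fusion from two homologous B proteins that both resemble the same part of the A protein; a real pipeline would presumably add further filtering.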
Proteomics Analysis of Bladder Cancer Exosomes
Welton, Joanne L.; Khanna, Sanjay; Giles, Peter J.; Brennan, Paul; Brewis, Ian A.; Staffurth, John; Mason, Malcolm D.; Clayton, Aled
2010-01-01
Exosomes are nanometer-sized vesicles, secreted by various cell types, present in biological fluids that are particularly rich in membrane proteins. Ex vivo analysis of exosomes may provide biomarker discovery platforms and form non-invasive tools for disease diagnosis and monitoring. These vesicles have never before been studied in the context of bladder cancer, a major malignancy of the urological tract. We present the first proteomics analysis of bladder cancer cell exosomes. Using ultracentrifugation on a sucrose cushion, exosomes were highly purified from cultured HT1376 bladder cancer cells and verified as low in contaminants by Western blotting and flow cytometry of exosome-coated beads. Solubilization in a buffer containing SDS and DTT was essential for achieving proteomics analysis using an LC-MALDI-TOF/TOF MS approach. We report 353 high quality identifications with 72 proteins not previously identified by other human exosome proteomics studies. Overrepresentation analysis to compare this data set with previous exosome proteomics studies (using the ExoCarta database) revealed that the proteome was consistent with that of various exosomes with particular overlap with exosomes of carcinoma origin. Interrogating the Gene Ontology database highlighted a strong association of this proteome with carcinoma of bladder and other sites. The data also highlighted how homology among human leukocyte antigen haplotypes may confound MASCOT designation of major histocompatibility complex Class I nomenclature, requiring data from PCR-based human leukocyte antigen haplotyping to clarify anomalous identifications. Validation of 18 MS protein identifications (including basigin, galectin-3, trophoblast glycoprotein (5T4), and others) was performed by a combination of Western blotting, flotation on linear sucrose gradients, and flow cytometry, confirming their exosomal expression. Some were confirmed positive on urinary exosomes from a bladder cancer patient.
In summary, the exosome proteomics data set presented is of unrivaled quality. The data will aid in the development of urine exosome-based clinical tools for monitoring disease and will inform follow-up studies into varied aspects of exosome manufacture and function. PMID:20224111
Ellis, Matthew J; Gillette, Michael; Carr, Steven A; Paulovich, Amanda G; Smith, Richard D; Rodland, Karin K; Townsend, R Reid; Kinsinger, Christopher; Mesri, Mehdi; Rodriguez, Henry; Liebler, Daniel C
2013-10-01
The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium is applying the latest generation of proteomic technologies to genomically annotated tumors from The Cancer Genome Atlas (TCGA) program, a joint initiative of the NCI and the National Human Genome Research Institute. By providing a fully integrated accounting of DNA, RNA, and protein abnormalities in individual tumors, these datasets will illuminate the complex relationship between genomic abnormalities and cancer phenotypes, thus producing biologic insights as well as a wave of novel candidate biomarkers and therapeutic targets amenable to verification using targeted mass spectrometry methods. ©2013 AACR.
Advances in Proteomics Data Analysis and Display Using an Accurate Mass and Time Tag Approach
Zimmer, Jennifer S.D.; Monroe, Matthew E.; Qian, Wei-Jun; Smith, Richard D.
2007-01-01
Proteomics has recently demonstrated utility in understanding cellular processes on the molecular level as a component of systems biology approaches and for identifying potential biomarkers of various disease states. The large amount of data generated by utilizing high efficiency (e.g., chromatographic) separations coupled to high mass accuracy mass spectrometry for high-throughput proteomics analyses presents challenges related to data processing, analysis, and display. This review focuses on recent advances in nanoLC-FTICR-MS-based proteomics approaches and the accompanying data processing tools that have been developed to display and interpret the large volumes of data being produced. PMID:16429408
Gabbard, Joseph L.; Shukla, Maulik; Sobral, Bruno
2010-01-01
Systems biology and infectious disease (host-pathogen-environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This paper describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC, http://patric.vbi.vt.edu/), with a goal of developing bioinformatics resources for the research and countermeasures development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR, http://www.proteomicsresource.org/), with a goal of developing resources based on experiment data such as microarray and proteomics data from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data integration problems drawn from our experiences with PATRIC and RCBPR, as well as open research questions related to long term sustainability, and describe the next steps to meeting these challenges. Novel contributions of this work include (1) an approach for addressing discrepancies between experiment results and interpreted results and (2) expanding the range of data integration techniques to include usability engineering at the presentation level. PMID:20491070
Karaosmanoglu, Kubra; Sayar, Nihat Alpagu; Kurnaz, Isil Aksan; Akbulut, Berna Sariyar
2014-01-01
Postgenomics drug development is undergoing major transformation in the age of multi-omics studies and drug repositioning. Rather than applications solely in personalized medicine, omics science thus additionally offers a better understanding of a broader range of drug targets and drug repositioning. Berberine is an isoquinoline alkaloid found in many medicinal plants. We report here a whole genome microarray study in tandem with proteomics techniques for mining the plethora of targets that are putatively involved in the antimicrobial activity of berberine against Escherichia coli. We found DNA replication/repair and transcription to be triggered by berberine, indicating that nucleic acids, in general, are among its targets. Our combined transcriptomics and proteomics multi-omics findings underscore that, in the presence of berberine, cell wall or cell membrane transport and motility-related functions are also specifically regulated. We further report a general decline in metabolism, as seen by repression of genes in carbohydrate and amino acid metabolism, energy production, and conversion. An involvement of multidrug efflux pumps, as well as reduced membrane permeability for developing resistance against berberine in E. coli was noted. Collectively, these findings offer original and significant leads for omics-guided drug discovery and future repositioning approaches in the postgenomics era, using berberine as a multi-omics case study.
Time-dependent Translational Response of E. coli to Excess Zn(II)
Easton, J. Allen; Thompson, Peter; Crowder, Michael W.
2006-01-01
Zinc homeostasis is not well understood beyond methods of import and export. In order to better understand zinc homeostasis in Escherichia coli by identifying Zn(II)-responsive proteins, a proteomic approach was taken. Through the use of two-dimensional gel electrophoresis, we were able to show that the levels of OmpF, AspC, YcdO, Eno, and CysE increased after 30 min of Zn(II) stress, while the levels of Tig, TufA, SelA, and LeuC decreased relative to non-stressed controls. After 4 h of Zn(II) stress, the levels of three proteins (DnaK, YeaU, and Mdh) were found to be up-regulated, while the levels of seven amino acid importers (HisJ, ArgT, LivJ, DppA, OppA, RbsB, and GinH) were found to be decreased. None of these proteins had been reported to be up- or down-regulated in any previously published cDNA microarray experiments. This result raises questions about the validity of cDNA arrays when they are used to make assumptions concerning protein levels within bacterial cells. These data also suggest that time is a factor when characterizing how the E. coli proteome responds to Zn(II) stress. PMID:17122063
Shipitsin, M; Small, C; Choudhury, S; Giladi, E; Friedlander, S; Nardone, J; Hussain, S; Hurley, A D; Ernst, C; Huang, Y E; Chang, H; Nifong, T P; Rimm, D L; Dunyak, J; Loda, M; Berman, D M; Blume-Jensen, P
2014-09-09
Key challenges of biopsy-based determination of prostate cancer aggressiveness include tumour heterogeneity, biopsy-sampling error, and variations in biopsy interpretation. The resulting uncertainty in risk assessment leads to significant overtreatment, with associated costs and morbidity. We developed a performance-based strategy to identify protein biomarkers predictive of prostate cancer aggressiveness and lethality regardless of biopsy-sampling variation. Prostatectomy samples from a large patient cohort with long follow-up were blindly assessed by expert pathologists who identified the tissue regions with the highest and lowest Gleason grade from each patient. To simulate biopsy-sampling error, a core from a high- and a low-Gleason area from each patient sample was used to generate a 'high' and a 'low' tumour microarray, respectively. Using a quantitative proteomics approach, we identified from 160 candidates 12 biomarkers that predicted prostate cancer aggressiveness (surgical Gleason and TNM stage) and lethal outcome robustly in both high- and low-Gleason areas. Conversely, a previously reported lethal outcome-predictive marker signature for prostatectomy tissue was unable to perform under circumstances of maximal sampling error. Our results have important implications for cancer biomarker discovery in general and development of a sampling error-resistant clinical biopsy test for prediction of prostate cancer aggressiveness.
Rice proteome analysis: a step toward functional analysis of the rice genome.
Komatsu, Setsuko; Tanaka, Naoki
2005-03-01
The technique of proteome analysis using 2-DE has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this review, we describe construction of the rice proteome database, the cataloging of rice proteins, and the functional characterization of some of the proteins identified. Initially, proteins extracted from various tissues and organelles were separated by 2-DE and an image analyzer was used to construct a display or reference map of the proteins. The rice proteome database currently contains 23 reference maps based on 2-DE of proteins from different rice tissues and subcellular compartments. These reference maps comprise 13 129 rice proteins, and the amino acid sequences of 5092 of these proteins are entered in the database. Major proteins involved in growth or stress responses have been identified by using a proteomics approach and some of these proteins have unique functions. Furthermore, initial work has also begun on analyzing the phosphoproteome and protein-protein interactions in rice. The information obtained from the rice proteome database will aid in the molecular cloning of rice genes and in predicting the function of unknown proteins.
Placental Proteomics: A Shortcut to Biological Insight
Robinson, John M.; Vandré, Dale D.; Ackerman, William E.
2012-01-01
Proteomics analysis of biological samples has the potential to identify novel protein expression patterns and/or changes in protein expression patterns in different developmental or disease states. An important component of successful proteomics research, at least in its present form, is to reduce the complexity of the sample if it is derived from cells or tissues. One method to simplify complex tissues is to focus on a specific, highly purified sub-proteome. Using this approach we have developed methods to prepare highly enriched fractions of the apical plasma membrane of the syncytiotrophoblast. Through proteomics analysis of this fraction we have identified over five hundred proteins, several of which were previously not known to reside in the syncytiotrophoblast. Herein, we focus on two of these, dysferlin and myoferlin. These proteins, largely known from studies of skeletal muscle, may not have been found in the human placenta were it not for discovery-based proteomics analysis. This new knowledge, acquired through a discovery-driven approach, can now be applied for the generation of hypothesis-based experimentation. Thus discovery-based and hypothesis-based research are complementary approaches that, when coupled together, can hasten scientific discoveries. PMID:19070895
Zhang, Yixiang; Gao, Peng; Xing, Zhuo; Jin, Shumei; Chen, Zhide; Liu, Lantao; Constantino, Nasie; Wang, Xinwang; Shi, Weibing; Yuan, Joshua S.; Dai, Susie Y.
2013-01-01
High abundance proteins like ribulose-1,5-bisphosphate carboxylase oxygenase (Rubisco) impose a consistent challenge for the whole proteome characterization using shot-gun proteomics. To address this challenge, we developed and evaluated Polyethyleneimine Assisted Rubisco Cleanup (PARC) as a new method by combining both abundant protein removal and fractionation. The new approach was applied to a plant insect interaction study to validate the platform and investigate mechanisms for plant defense against herbivorous insects. Our results indicated that PARC can effectively remove Rubisco, improve the protein identification, and discover almost three times more differentially regulated proteins. The significantly enhanced shot-gun proteomics performance was translated into in-depth proteomic and molecular mechanisms for plant insect interaction, where carbon re-distribution was used to play an essential role. Moreover, the transcriptomic validation also confirmed the reliability of PARC analysis. Finally, functional studies were carried out for two differentially regulated genes as revealed by PARC analysis. Insect resistance was induced by over-expressing either jacalin-like or cupin-like genes in rice. The results further highlighted that PARC can serve as an effective strategy for proteomics analysis and gene discovery. PMID:23943779
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
2012-03-27
The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.
Tan, Qihua; Thomassen, Mads; Burton, Mark; Mose, Kristian Fredløv; Andersen, Klaus Ejner; Hjelmborg, Jacob; Kruse, Torben
2017-06-06
Modeling complex time-course patterns is a challenging issue in microarray studies because gene expression responds to time-course experiments in complex ways. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that, owing to its non-parametric nature, generalized correlation analysis could be a useful and efficient tool for analyzing microarray time-course data and for exploring the complex relationships in omics data when studying their association with disease and health.
Huerta, Mario; Munyi, Marc; Expósito, David; Querol, Enric; Cedano, Juan
2014-06-15
The microarrays performed by scientific teams grow exponentially. These microarray data could be useful for researchers around the world, but unfortunately they are underused. To fully exploit these data, it is necessary (i) to extract these data from a repository of the high-throughput gene expression data like Gene Expression Omnibus (GEO) and (ii) to make the data from different microarrays comparable with tools easy to use for scientists. We have developed these two solutions in our server, implementing a database of microarray marker genes (Marker Genes Data Base). This database contains the marker genes of all GEO microarray datasets and it is updated monthly with the new microarrays from GEO. Thus, researchers can see whether the marker genes of their microarray are marker genes in other microarrays in the database, expanding the analysis of their microarray to the rest of the public microarrays. This solution helps not only to corroborate the conclusions regarding a researcher's microarray but also to identify the phenotype of different subsets of individuals under investigation, to frame the results with microarray experiments from other species, pathologies or tissues, to search for drugs that promote the transition between the studied phenotypes, to detect undesirable side effects of the treatment applied, etc. Thus, the researcher can quickly add relevant information to his/her studies from all of the previous analyses performed in other studies as long as they have been deposited in public repositories. Marker-gene database tool: http://ibb.uab.es/mgdb © The Author 2014. Published by Oxford University Press.
Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.
Dowsey, Andrew W; Dunn, Michael J; Yang, Guang-Zhong
2008-04-01
The quest for high-throughput proteomics has revealed a number of challenges in recent years. Whilst substantial improvements in automated protein separation with liquid chromatography and mass spectrometry (LC/MS), aka 'shotgun' proteomics, have been achieved, large-scale open initiatives such as the Human Proteome Organization (HUPO) Brain Proteome Project have shown that maximal proteome coverage is only possible when LC/MS is complemented by 2D gel electrophoresis (2-DE) studies. Moreover, both separation methods require automated alignment and differential analysis to relieve the bioinformatics bottleneck and so make high-throughput protein biomarker discovery a reality. The purpose of this article is to describe a fully automatic image alignment framework for the integration of 2-DE into a high-throughput differential expression proteomics pipeline. The proposed method is based on robust automated image normalization (RAIN) to circumvent the drawbacks of traditional approaches. These use symbolic representation at the very early stages of the analysis, which introduces persistent errors due to inaccuracies in modelling and alignment. In RAIN, a third-order volume-invariant B-spline model is incorporated into a multi-resolution schema to correct for geometric and expression inhomogeneity at multiple scales. The normalized images can then be compared directly in the image domain for quantitative differential analysis. Through evaluation against an existing state-of-the-art method on real and synthetically warped 2D gels, the proposed analysis framework demonstrates substantial improvements in matching accuracy and differential sensitivity. High-throughput analysis is established through an accelerated GPGPU (general purpose computation on graphics cards) implementation. Supplementary material, software and images used in the validation are available at http://www.proteomegrid.org/rain/.
Kawaura, Kanako; Mochida, Keiichi; Yamazaki, Yukiko; Ogihara, Yasunari
2006-04-01
In this study, we constructed a 22k wheat oligo-DNA microarray. A total of 148,676 expressed sequence tags of common wheat were collected from the database of the Wheat Genomics Consortium of Japan. These were grouped into 34,064 contigs, which were then used to design an oligonucleotide DNA microarray. Following a multistep selection of the sense strand, 21,939 60-mer oligo-DNA probes were selected for attachment on the microarray slide. This 22k oligo-DNA microarray was used to examine the transcriptional response of wheat to salt stress. More than 95% of the probes gave reproducible hybridization signals when targeted with RNAs extracted from salt-treated wheat shoots and roots. With the microarray, we identified 1,811 genes whose expressions changed more than 2-fold in response to salt. These included genes known to mediate response to salt, as well as unknown genes, and they were classified into 12 major groups by hierarchical clustering. These gene expression patterns were also confirmed by real-time reverse transcription-PCR. Many of the genes with unknown function were clustered together with genes known to be involved in response to salt stress. Thus, analysis of gene expression patterns combined with gene ontology should help identify the function of the unknown genes. Also, functional analysis of these wheat genes should provide new insight into the response to salt stress. Finally, these results indicate that the 22k oligo-DNA microarray is a reliable method for monitoring global gene expression patterns in wheat.
Bălăcescu, Loredana; Bălăcescu, O; Crişan, N; Fetica, B; Petruţ, B; Bungărdean, Cătălina; Rus, Meda; Tudoran, Oana; Meurice, G; Irimie, Al; Dragoş, N; Berindan-Neagoe, Ioana
2011-01-01
Prostate cancer is the most common cancer among Western men, with clinical behavior ranging from indolent to metastatic disease. Although many molecules and deregulated pathways are known, the molecular mechanisms involved in the development of prostate cancer are not fully understood. The aim of this study was to explore the molecular variation underlying prostate cancer, based on microarray analysis and bioinformatics approaches. Normal and prostate cancer tissues were collected by macrodissection from prostatectomy specimens. All prostate cancer specimens used in our study were Gleason score 7. A gene expression microarray (Agilent Technologies) was used for Whole Human Genome evaluation. The bioinformatics and functional analyses were based on Limma and Ingenuity software. The microarray analysis identified 1119 genes differentially expressed between prostate cancer and normal prostate, which were up- or down-regulated at least 2-fold. P-values were adjusted for multiple testing using the Benjamini-Hochberg method with a false discovery rate of 0.01. These genes were analyzed with Ingenuity Pathway Analysis software, and 23 genetic networks were established. Our microarray results provide new information regarding the molecular networks in prostate cancer stratified as Gleason 7. These data highlight gene expression profiles for a better understanding of prostate cancer progression.
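The selection rule described in the abstract above (Benjamini-Hochberg adjustment, a false discovery rate of 0.01, and at least a 2-fold change) can be illustrated with a minimal sketch. This is not the authors' Limma pipeline; the function names, the use of log2 fold changes, and the toy input values are assumptions made only for illustration.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_genes(log2_fold_changes, pvalues, fdr=0.01, min_abs_log2fc=1.0):
    """Indices of genes passing both the FDR and the fold-change cutoff.

    min_abs_log2fc=1.0 corresponds to a 2-fold change in either direction
    (an assumption about how the fold change is encoded).
    """
    adjusted = benjamini_hochberg(pvalues)
    return [
        i
        for i, (lfc, q) in enumerate(zip(log2_fold_changes, adjusted))
        if q <= fdr and abs(lfc) >= min_abs_log2fc
    ]


# Hypothetical toy input, not data from the study.
toy_pvalues = [0.001, 0.02, 0.03, 0.5]
toy_log2fc = [1.5, -2.0, 0.2, 3.0]
selected = select_genes(toy_log2fc, toy_pvalues)
```

Applying the fold-change filter after the adjustment, rather than before, keeps the number of tests used in the correction equal to the number of genes actually tested.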
Wu, Qi; Yuan, Huiming; Zhang, Lihua; Zhang, Yukui
2012-06-20
With the acceleration of proteome research, increasing attention has been paid to multidimensional liquid chromatography-mass spectrometry (MDLC-MS) due to its high peak capacity and separation efficiency. Recently, much effort has been devoted to improving MDLC-based strategies, including "top-down" and "bottom-up" approaches, to enable highly sensitive qualitative and quantitative analysis of proteins, as well as to accelerate the whole analytical procedure. Integrated platforms combining sample pretreatment, multidimensional separations and identification have also been developed to achieve high-throughput and sensitive detection of proteomes, facilitating highly accurate and reproducible quantification. This review summarizes the recent advances in such techniques and their applications in qualitative and quantitative analysis of proteomes. Copyright © 2012 Elsevier B.V. All rights reserved.
Proteomic analysis of tissue samples in translational breast cancer research.
Gromov, Pavel; Moreira, José M A; Gromova, Irina
2014-06-01
In the last decade, many proteomic technologies have been applied, with varying success, to the study of tissue samples of breast carcinoma for protein expression profiling in order to discover protein biomarkers/signatures suitable for: characterization and subtyping of tumors; early diagnosis, and both prognosis and prediction of outcome of chemotherapy. The purpose of this review is to critically appraise what has been achieved to date using proteomic technologies and to bring forward novel strategies - based on the analysis of clinically relevant samples - that promise to accelerate the translation of basic discoveries into the daily breast cancer clinical practice. In particular, we address major issues in experimental design by reviewing the strengths and weaknesses of current proteomic strategies in the context of the analysis of human breast tissue specimens.
Weckwerth, Wolfram; Wienkoop, Stefanie; Hoehenwarter, Wolfgang; Egelhofer, Volker; Sun, Xiaoliang
2014-01-01
Genome sequencing and systems biology are revolutionizing life sciences. Proteomics emerged as a fundamental technique of this novel research area as it is the basis for gene function analysis and modeling of dynamic protein networks. Here a complete proteomics platform suited for functional genomics and systems biology is presented. The strategy includes MAPA (mass accuracy precursor alignment; http://www.univie.ac.at/mosys/software.html ) as a rapid exploratory analysis step; MASS WESTERN for targeted proteomics; COVAIN ( http://www.univie.ac.at/mosys/software.html ) for multivariate statistical analysis, data integration, and data mining; and PROMEX ( http://www.univie.ac.at/mosys/databases.html ) as a database module for proteogenomics and proteotypic peptides for targeted analysis. Moreover, the presented platform can also be utilized to integrate metabolomics and transcriptomics data for the analysis of metabolite-protein-transcript correlations and time course analysis using COVAIN. Examples for the integration of MAPA and MASS WESTERN data, proteogenomic and metabolic modeling approaches for functional genomics, phosphoproteomics by integration of MOAC (metal-oxide affinity chromatography) with MAPA, and the integration of metabolomics, transcriptomics, proteomics, and physiological data using this platform are presented. All software and step-by-step tutorials for data processing and data mining can be downloaded from http://www.univie.ac.at/mosys/software.html.
Tojo, Axel; Malm, Johan; Marko-Varga, György; Lilja, Hans; Laurell, Thomas
2014-01-01
Antibody microarrays have become widespread, but their use for quantitative analyses in clinical samples has not yet been established. We investigated an immunoassay based on nanoporous silicon antibody microarrays for quantification of total prostate-specific antigen (PSA) in 80 clinical plasma samples, and provide quantitative data from a duplex microarray assay that simultaneously quantifies free and total PSA in plasma. To further develop the assay, the porous silicon chips were placed into a standard 96-well microtiter plate for higher throughput analysis. The samples analyzed by this quantitative microarray were 80 plasma samples obtained from men undergoing clinical PSA testing (dynamic range: 0.14-44 ng/ml, LOD: 0.14 ng/ml). The second dataset, measuring free PSA (dynamic range: 0.40-74.9 ng/ml, LOD: 0.47 ng/ml) and total PSA (dynamic range: 0.87-295 ng/ml, LOD: 0.76 ng/ml), was also obtained from the clinical routine. The reference for the quantification was a commercially available assay, the ProStatus PSA Free/Total DELFIA. In an analysis of 80 plasma samples the microarray platform performs well across the range of total PSA levels. This assay might have the potential to substitute for the large-scale microtiter plate format in diagnostic applications. The duplex assay paves the way for a future quantitative multiplex assay that analyses several prostate cancer biomarkers simultaneously. PMID:22921878
Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan
2018-04-20
Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids, which makes it challenging to isolate high-quality RNA for genetic profiling. Here, we assessed the feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while review on a single-gene basis showed higher data variation in FFPE than in FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways, confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.
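The DV200 metric mentioned in the abstract above is the percentage of RNA fragments longer than 200 nucleotides. A minimal sketch of that calculation follows; it assumes the fragment size distribution is available as paired lists of fragment sizes and signal intensities, which is a simplification of an instrument electropherogram, and the function name and toy values are hypothetical.

```python
def dv200(fragment_sizes_nt, signal):
    """Percentage of total signal from fragments longer than 200 nt.

    fragment_sizes_nt and signal are parallel lists: one size (in
    nucleotides) and one non-negative signal value per bin. This input
    layout is an assumption made for illustration.
    """
    if len(fragment_sizes_nt) != len(signal):
        raise ValueError("sizes and signal must have the same length")
    total = sum(signal)
    if total <= 0:
        raise ValueError("total signal must be positive")
    above = sum(s for size, s in zip(fragment_sizes_nt, signal) if size > 200)
    return 100.0 * above / total


# Hypothetical toy trace: half of the signal lies above 200 nt.
toy_sizes = [50, 150, 250, 1000]
toy_signal = [1.0, 1.0, 1.0, 1.0]
toy_dv200 = dv200(toy_sizes, toy_signal)
```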
Integrative Exploratory Analysis of Two or More Genomic Datasets.
Meng, Chen; Culhane, Aedin
2016-01-01
Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of a single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on the same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a smaller number of variables, allowing easier identification of the correlated structure within and between multiple high dimensional datasets. Graphical representations can be employed for this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can further be used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data, we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.
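One quantity commonly reported alongside co-inertia analysis is the RV coefficient, a global measure of how similar the sample structure is in two tables measured on the same samples. The sketch below computes it in plain Python for two small tables. It is not the chapter's CIA or MCIA implementation, it uses uniform sample weights and simple column centering as assumptions, and the function names and toy tables are hypothetical.

```python
import math


def center_columns(table):
    """Subtract each column mean from a samples x variables table."""
    n = len(table)
    means = [sum(col) / n for col in zip(*table)]
    return [[v - m for v, m in zip(row, means)] for row in table]


def sample_cross_products(table):
    """Return the samples x samples matrix T * T^T."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in table] for r1 in table]


def rv_coefficient(x, y):
    """RV coefficient between two tables with the same samples in rows.

    Values near 1 indicate very similar sample configurations; values
    near 0 indicate unrelated ones. Each table must contain at least one
    non-constant column, otherwise the denominator is zero.
    """
    if len(x) != len(y):
        raise ValueError("both tables must have the same number of samples")
    wx = sample_cross_products(center_columns(x))
    wy = sample_cross_products(center_columns(y))
    numerator = sum(a * b for rx, ry in zip(wx, wy) for a, b in zip(rx, ry))
    norm_x = sum(a * a for row in wx for a in row)
    norm_y = sum(b * b for row in wy for b in row)
    return numerator / math.sqrt(norm_x * norm_y)


# Hypothetical toy tables: three samples, two variables each.
toy_mrna = [[1.0, 2.0], [2.0, 4.0], [3.0, 7.0]]
toy_protein = [[2.0, 4.0], [4.0, 8.0], [6.0, 14.0]]  # toy_mrna scaled by 2
toy_rv = rv_coefficient(toy_mrna, toy_protein)
```

Because the RV coefficient compares samples x samples matrices, the two tables may have different numbers of variables, which is the usual situation when pairing mRNA and protein measurements.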
Molecular strategies of the Caenorhabditis elegans dauer larva to survive extreme desiccation.
Erkut, Cihan; Vasilj, Andrej; Boland, Sebastian; Habermann, Bianca; Shevchenko, Andrej; Kurzchalia, Teymuras V
2013-01-01
Massive water loss is a serious challenge for terrestrial animals, which usually has fatal consequences. However, some organisms have developed means to survive this stress by entering an ametabolic state called anhydrobiosis. The molecular and cellular mechanisms underlying this phenomenon are poorly understood. We recently showed that Caenorhabditis elegans dauer larva, an arrested stage specialized for survival in adverse conditions, is resistant to severe desiccation. However, this requires a preconditioning step at a mild desiccative environment to prepare the organism for harsher desiccation conditions. A systems approach was used to identify factors that are activated during this preconditioning. Using microarray analysis, proteomics, and bioinformatics, genes, proteins, and biochemical pathways that are upregulated during this process were identified. These pathways were validated via reverse genetics by testing the desiccation tolerances of mutants. These data show that the desiccation response is activated by hygrosensation (sensing the desiccative environment) via head neurons. This leads to elimination of reactive oxygen species and xenobiotics, expression of heat shock and intrinsically disordered proteins, polyamine utilization, and induction of fatty acid desaturation pathway. Remarkably, this response is specific and involves a small number of functional pathways, which represent the generic toolkit for anhydrobiosis in plants and animals.
Genome scale transcriptomics of baculovirus-insect interactions.
Nguyen, Quan; Nielsen, Lars K; Reid, Steven
2013-11-12
Baculovirus-insect cell technologies are applied in the production of complex proteins, veterinary and human vaccines, gene delivery vectors, and biopesticides. Better understanding of how baculoviruses and insect cells interact would facilitate baculovirus-based production. While complete genomic sequences are available for over 58 baculovirus species, little insect genomic information is known. The release of the Bombyx mori and Plutella xylostella genomes, the accumulation of EST sequences for several Lepidopteran species, and especially the availability of two genome-scale analysis tools, namely oligonucleotide microarrays and next generation sequencing (NGS), have facilitated expression studies to generate a rich picture of insect gene responses to baculovirus infections. This review presents current knowledge on the interaction dynamics of the baculovirus-insect system, which is relatively well studied in relation to nucleocapsid transportation, apoptosis, and heat shock responses, but is still poorly understood regarding responses involved in pro-survival pathways, DNA damage pathways, protein degradation, translation, signaling pathways, RNAi pathways, and importantly metabolic pathways for energy, nucleotide and amino acid production. We discuss how the two genome-scale transcriptomic tools can be applied for studying such pathways and suggest that proteomics and metabolomics can produce complementary findings to transcriptomic studies.
Molecular Strategies of the Caenorhabditis elegans Dauer Larva to Survive Extreme Desiccation
Erkut, Cihan; Vasilj, Andrej; Boland, Sebastian; Habermann, Bianca; Shevchenko, Andrej; Kurzchalia, Teymuras V.
2013-01-01
Massive water loss is a serious challenge for terrestrial animals, which usually has fatal consequences. However, some organisms have developed means to survive this stress by entering an ametabolic state called anhydrobiosis. The molecular and cellular mechanisms underlying this phenomenon are poorly understood. We recently showed that Caenorhabditis elegans dauer larva, an arrested stage specialized for survival in adverse conditions, is resistant to severe desiccation. However, this requires a preconditioning step at a mild desiccative environment to prepare the organism for harsher desiccation conditions. A systems approach was used to identify factors that are activated during this preconditioning. Using microarray analysis, proteomics, and bioinformatics, genes, proteins, and biochemical pathways that are upregulated during this process were identified. These pathways were validated via reverse genetics by testing the desiccation tolerances of mutants. These data show that the desiccation response is activated by hygrosensation (sensing the desiccative environment) via head neurons. This leads to elimination of reactive oxygen species and xenobiotics, expression of heat shock and intrinsically disordered proteins, polyamine utilization, and induction of fatty acid desaturation pathway. Remarkably, this response is specific and involves a small number of functional pathways, which represent the generic toolkit for anhydrobiosis in plants and animals. PMID:24324795
Yu, Su Jong; Jang, Eun Sun; Yu, Jiyoung; Cho, Geunhee; Yoon, Jung-Hwan; Kim, Youngsoo
2013-01-01
Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers and is associated with a poor survival rate. Clinically, the level of alpha-fetoprotein (AFP) has been used as a biomarker for the diagnosis of HCC. The discovery of useful biomarkers for HCC, focused solely on the proteome, has been difficult; thus, wide-ranging global data mining of genomic and proteomic databases from previous reports would be valuable in screening biomarker candidates. Further, multiple reaction monitoring (MRM), based on triple quadrupole mass spectrometry, has been effective with regard to high-throughput verification, complementing antibody-based verification pipelines. In this study, global data mining was performed using 5 types of HCC data to screen for candidate biomarker proteins: cDNA microarray, copy number variation, somatic mutation, epigenetic, and quantitative proteomics data. Next, we applied MRM to verify HCC candidate biomarkers in individual serum samples from 3 groups: a healthy control group, patients who have been diagnosed with HCC (Before HCC treatment group), and HCC patients who underwent locoregional therapy (After HCC treatment group). After determining the relative quantities of the candidate proteins by MRM, we compared their expression levels between the 3 groups, identifying 4 potential biomarkers: the actin-binding protein anillin (ANLN), filamin-B (FLNB), complementary C4-A (C4A), and AFP. The combination of 2 markers (ANLN, FLNB) improved the discrimination of the before HCC treatment group from the healthy control group compared with AFP. We conclude that the combination of global data mining and MRM verification enhances the screening and verification of potential HCC biomarkers. This efficacious integrative strategy is applicable to the development of markers for cancer and other diseases. PMID:23717429
Attar-Schneider, Oshrat; Pasmanik-Chor, Metsada; Tartakover-Matalon, Shelly
2015-01-01
Accumulating data indicate that translation plays a role in cancer biology, particularly its rate-limiting stage of initiation. Despite this evolving recognition, the function and importance of specific translation initiation factors is unresolved. The eukaryotic translation initiation complex eIF4F consists of eIF4E and eIF4G at a 1:1 ratio. Although it is expected that they display interdependent functions, several publications suggest independent mechanisms. This study is the first to directly assess the relative contribution of eIF4F components to the expressed cellular proteome, transcription factors, microRNAs, and phenotype in a malignancy known for extensive protein synthesis, multiple myeloma (MM). Previously, we have shown that eIF4E/eIF4GI attenuation (siRNA/Avastin) deleteriously affected MM cells' fate and reduced levels of eIF4E/eIF4GI established targets. Here, we demonstrated that eIF4E/eIF4GI indeed have individual influences on the cell proteome. We used an objective, high throughput assay of mRNA microarrays to examine the significance of eIF4E/eIF4GI silencing to several cellular facets such as transcription factors, microRNAs and phenotype. We showed different imprints for eIF4E and eIF4GI in all assayed aspects. These results promote our understanding of the relative contribution and importance of eIF4E and eIF4GI to the malignant phenotype and shed light on their function in the eIF4F translation initiation complex. PMID:25717031
Ivarsson, Ylva; Arnold, Roland; McLaughlin, Megan; Nim, Satra; Joshi, Rakesh; Ray, Debashish; Liu, Bernard; Teyra, Joan; Pawson, Tony; Moffat, Jason; Li, Shawn Shun-Cheng; Sidhu, Sachdev S; Kim, Philip M
2014-02-18
The human proteome contains a plethora of short linear motifs (SLiMs) that serve as binding interfaces for modular protein domains. Such interactions are crucial for signaling and other cellular processes, but are difficult to detect because of their low to moderate affinities. Here we developed a dedicated approach, proteomic peptide-phage display (ProP-PD), to identify domain-SLiM interactions. Specifically, we generated phage libraries containing all human and viral C-terminal peptides using custom oligonucleotide microarrays. With these libraries we screened the nine PSD-95/Dlg/ZO-1 (PDZ) domains of human Densin-180, Erbin, Scribble, and Disks large homolog 1 for peptide ligands. We identified several known and putative interactions potentially relevant to cellular signaling pathways and confirmed interactions between full-length Scribble and the target proteins β-PIX, plakophilin-4, and guanylate cyclase soluble subunit α-2 using colocalization and coimmunoprecipitation experiments. The affinities of recombinant Scribble PDZ domains and the synthetic peptides representing the C termini of these proteins were in the 1- to 40-μM range. Furthermore, we identified several well-established host-virus protein-protein interactions, and confirmed that PDZ domains of Scribble interact with the C terminus of Tax-1 of human T-cell leukemia virus with micromolar affinity. Previously unknown putative viral protein ligands for the PDZ domains of Scribble and Erbin were also identified. Thus, we demonstrate that our ProP-PD libraries are useful tools for probing PDZ domain interactions. The method can be extended to interrogate all potential eukaryotic, bacterial, and viral SLiMs and we suggest it will be a highly valuable approach for studying cellular and pathogen-host protein-protein interactions.
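The library design step described in the abstract above starts from the C-terminal peptides of a set of proteins. As a rough sketch of that first step only, the function below collects the C-terminal peptide of each protein sequence and groups proteins that share the same peptide. The peptide length is left as a required parameter because the abstract does not state it; the function name, the input format, and the toy sequences are all hypothetical.

```python
def c_terminal_peptides(proteins, length):
    """Map each distinct C-terminal peptide to the proteins that end with it.

    proteins: dict of protein name -> amino acid sequence (one-letter code).
    length:   number of C-terminal residues to keep (an assumption; the
              abstract does not state the peptide length used).
    Sequences shorter than `length` are skipped.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    peptides = {}
    for name, sequence in proteins.items():
        # Drop a trailing stop-codon marker if the sequence carries one.
        sequence = sequence.rstrip("*")
        if len(sequence) < length:
            continue
        peptides.setdefault(sequence[-length:], []).append(name)
    return peptides


# Hypothetical toy sequences, not real proteome entries.
toy_proteins = {
    "protA": "MKTAYIAKQRETSV",
    "protB": "MSSETSV*",
    "protC": "MK",
}
toy_library = c_terminal_peptides(toy_proteins, length=4)
```

Grouping by peptide rather than by protein avoids encoding the same oligonucleotide twice when several proteins share a C terminus.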
Varas, Macarena; Valdivieso, Camilo; Mauriaca, Cecilia; Ortíz-Severín, Javiera; Paradela, Alberto; Poblete-Castro, Ignacio; Cabrera, Ricardo; Chávez, Francisco P
2017-04-01
Polyphosphate (polyP) is a linear biopolymer found in all living cells. In bacteria, mutants lacking polyphosphate kinase 1 (PPK1), the enzyme responsible for synthesis of most polyP, have many structural and functional defects. However, little is known about the causes of these pleiotropic alterations. The link between ppk1 deletion and those numerous phenotypes observed can be the result of complex molecular interactions that can be elucidated via a systems biology approach. By integrating different omics levels (transcriptome, proteome and phenome), we described the functioning of various metabolic pathways among Escherichia coli polyphosphate mutant strains (Δppk1, Δppx, and ΔpolyP). Bioinformatic analyses reveal the complex metabolic and regulatory bases of the phenotypes unique to polyP mutants. Our results suggest that during polyP deficiency (Δppk1 mutant), metabolic pathways needed for energy supply are up-regulated, including fermentation, aerobic and anaerobic respiration. Transcriptomic and q-proteomic contrasting changes between Δppk1 and Δppx mutant strains were observed in those central metabolic pathways and confirmed by using Phenotypic microarrays. In addition, our results suggest a regulatory connection between polyP, second messenger metabolism, alternative Sigma/Anti-Sigma factors and type-II toxin-antitoxin (TA) systems. We suggest a broader role for polyP via regulation of ATP-dependent proteolysis of type II toxin-antitoxin system and alternative Sigma/Anti-Sigma factors, that could explain the multiple structural and functional deficiencies described due to alteration of polyP metabolism. Understanding the interplay of polyP in bacterial metabolism using a systems biology approach can help to improve design of novel antimicrobials toward pathogens. Copyright © 2017 Elsevier B.V. All rights reserved.
Kim, Hyunsoo; Kim, Kyunggon; Yu, Su Jong; Jang, Eun Sun; Yu, Jiyoung; Cho, Geunhee; Yoon, Jung-Hwan; Kim, Youngsoo
2013-01-01
Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers and is associated with a poor survival rate. Clinically, the level of alpha-fetoprotein (AFP) has been used as a biomarker for the diagnosis of HCC. The discovery of useful biomarkers for HCC, focused solely on the proteome, has been difficult; thus, wide-ranging global data mining of genomic and proteomic databases from previous reports would be valuable in screening biomarker candidates. Further, multiple reaction monitoring (MRM), based on triple quadrupole mass spectrometry, has been effective with regard to high-throughput verification, complementing antibody-based verification pipelines. In this study, global data mining was performed using 5 types of HCC data to screen for candidate biomarker proteins: cDNA microarray, copy number variation, somatic mutation, epigenetic, and quantitative proteomics data. Next, we applied MRM to verify HCC candidate biomarkers in individual serum samples from 3 groups: a healthy control group, patients who have been diagnosed with HCC (Before HCC treatment group), and HCC patients who underwent locoregional therapy (After HCC treatment group). After determining the relative quantities of the candidate proteins by MRM, we compared their expression levels among the 3 groups, identifying 4 potential biomarkers: the actin-binding protein anillin (ANLN), filamin-B (FLNB), complement C4-A (C4A), and AFP. The combination of 2 markers (ANLN, FLNB) improved the discrimination of the Before HCC treatment group from the healthy control group compared with AFP. We conclude that the combination of global data mining and MRM verification enhances the screening and verification of potential HCC biomarkers. This efficacious integrative strategy is applicable to the development of markers for cancer and other diseases.
Advancing Cell Biology Through Proteomics in Space and Time (PROSPECTS)
Lamond, Angus I.; Uhlen, Mathias; Horning, Stevan; Makarov, Alexander; Robinson, Carol V.; Serrano, Luis; Hartl, F. Ulrich; Baumeister, Wolfgang; Werenskiold, Anne Katrin; Andersen, Jens S.; Vorm, Ole; Linial, Michal; Aebersold, Ruedi; Mann, Matthias
2012-01-01
The term “proteomics” encompasses the large-scale detection and analysis of proteins and their post-translational modifications. Driven by major improvements in mass spectrometric instrumentation, methodology, and data analysis, the proteomics field has burgeoned in recent years. It now provides a range of sensitive and quantitative approaches for measuring protein structures and dynamics that promise to revolutionize our understanding of cell biology and molecular mechanisms in both human cells and model organisms. The Proteomics Specification in Time and Space (PROSPECTS) Network is a unique EU-funded project that brings together leading European research groups, spanning from instrumentation to biomedicine, in a collaborative five-year initiative to develop new methods and applications for the functional analysis of cellular proteins. This special issue of Molecular and Cellular Proteomics presents 16 research papers reporting major recent progress by the PROSPECTS groups, including improvements to the resolution and sensitivity of the Orbitrap family of mass spectrometers, systematic detection of proteins using highly characterized antibody collections, and new methods for absolute as well as relative quantification of protein levels. Manuscripts in this issue exemplify approaches for performing quantitative measurements of cell proteomes and for studying their dynamic responses to perturbation, both during normal cellular responses and in disease mechanisms. Here we present a perspective on how the proteomics field is moving beyond simply identifying proteins with high sensitivity toward providing a powerful and versatile set of assay systems for characterizing proteome dynamics and thereby creating a new “third generation” proteomics strategy that offers an indispensable tool for cell biology and molecular medicine. PMID:22311636
2012-01-01
Background: Accurate diagnostic and monitoring tools for ulcerative colitis (UC) are missing. Our aim was to describe the proteomic profile of UC and search for markers associated with disease exacerbation. Therefore, we aimed to characterize specific proteins associated with inflamed colon mucosa from patients with acute UC using mass spectrometry-based proteomic analysis. Methods: Biopsies were sampled from rectum, sigmoid colon and left colonic flexure from twenty patients with active proctosigmoiditis and from four healthy controls for proteomics and histology. Proteomic profiles of whole colonic biopsies were characterized using 2D-gel electrophoresis, and peptide mass fingerprinting using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) was applied for identification of differently expressed protein spots. Results: A total of 597 spots were annotated by image analysis and 222 of these had a statistically different protein level between inflamed and non-inflamed tissue in the patient group. Principal component analysis clearly grouped non-inflamed samples separately from the inflamed samples, indicating that the proteomic signature of colon mucosa with acute UC is strong. In total, 43 individual protein spots were identified, including proteins involved in energy metabolism (triosephosphate isomerase, glycerol-3-phosphate-dehydrogenase, alpha enolase and L-lactate dehydrogenase B-chain) and in oxidative stress (superoxide dismutase, thioredoxins and selenium binding protein). Conclusions: A distinct proteomic profile of inflamed tissue in UC patients was found. Specific proteins involved in energy metabolism and oxidative stress were identified as potential candidate markers for UC. PMID:22726388
2016-01-01
Microarray gene expression data sets are jointly analyzed to increase statistical power. They could either be merged together or analyzed by meta-analysis. For a given ensemble of data sets, it cannot be foreseen which of these paradigms, merging or meta-analysis, works better. In this article, three joint analysis methods, Z-score normalization, ComBat and the inverse normal method (meta-analysis) were selected for survival prognosis and risk assessment of breast cancer patients. The methods were applied to eight microarray gene expression data sets, totaling 1324 patients with two clinical endpoints, overall survival and relapse-free survival. The performance derived from the joint analysis methods was evaluated using Cox regression for survival analysis and independent validation used as bias estimation. Overall, Z-score normalization had a better performance than ComBat and meta-analysis. Higher Area Under the Receiver Operating Characteristic curve and hazard ratio were also obtained when independent validation was used as bias estimation. With a lower time and memory complexity, Z-score normalization is a simple method for joint analysis of microarray gene expression data sets. The derived findings suggest further assessment of this method in future survival prediction and cancer classification applications. PMID:26504096
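The merging strategy named in this abstract can be illustrated with a minimal sketch. It assumes that each data set is a samples-by-genes matrix over the same gene list and that Z-score normalization means standardizing every gene within its own data set before the matrices are stacked; the function names and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np

def zscore_per_gene(expr):
    """Standardize each gene (column) within one data set to mean 0 and SD 1."""
    expr = np.asarray(expr, dtype=float)
    mean = expr.mean(axis=0)
    sd = expr.std(axis=0, ddof=1)
    sd[sd == 0] = 1.0  # constant genes: avoid division by zero
    return (expr - mean) / sd

def merge_datasets(datasets):
    """Normalize every data set separately, then stack the samples."""
    return np.vstack([zscore_per_gene(d) for d in datasets])

# Toy data (assumption): two studies measured on very different scales.
rng = np.random.default_rng(0)
study_a = rng.normal(loc=8.0, scale=2.0, size=(20, 5))
study_b = rng.normal(loc=3.0, scale=0.5, size=(30, 5))
merged = merge_datasets([study_a, study_b])
```

After this step the per-study offsets and scales are removed, so the stacked matrix can be passed to a single downstream model such as a Cox regression.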
Oligonucleotide microarrays are a powerful tool for unsupervised analysis of chemical impacts on biological systems. However, the lack of well annotated biological pathways for many aquatic organisms, including fish, and the poor power of microarray-based analyses to detect diffe...
Ogunnaike, Babatunde A; Gelmi, Claudio A; Edwards, Jeremy S
2010-05-21
Gene expression studies generate large quantities of data with the defining characteristic that the number of genes (whose expression profiles are to be determined) exceeds the number of available replicates by several orders of magnitude. Standard spot-by-spot analysis still seeks to extract useful information for each gene on the basis of the number of available replicates, and thus plays to the weakness of microarrays. On the other hand, because of the data volume, treating the entire data set as an ensemble, and developing theoretical distributions for these ensembles, provides a framework that plays instead to the strength of microarrays. We present theoretical results showing that, under reasonable assumptions, the distribution of microarray intensities follows the Gamma model, with the biological interpretations of the model parameters emerging naturally. We subsequently establish that for each microarray data set, the fractional intensities can be represented as a mixture of Beta densities, and develop a procedure for using these results to draw statistical inference regarding differential gene expression. We illustrate the results with experimental data from gene expression studies on Deinococcus radiodurans following DNA damage using cDNA microarrays. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
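The step from Gamma-distributed intensities to Beta-distributed fractional intensities rests on a standard result: if two independent Gamma variables share a scale parameter, their ratio x/(x+y) follows a Beta distribution. The simulation below is a small sketch of that single-component case only. The definition of fractional intensity as red/(red+green), the shape and scale values, and the variable names are assumptions for illustration; the paper's mixture model is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(42)

# Assumed shape parameters for the two channels and a common scale.
a, b, scale = 2.0, 5.0, 100.0
red = rng.gamma(shape=a, scale=scale, size=200_000)
green = rng.gamma(shape=b, scale=scale, size=200_000)

# Fractional intensity of the red channel (assumed definition).
frac = red / (red + green)

# Moments of the Beta(a, b) distribution the fractions should follow.
beta_mean = a / (a + b)
beta_var = a * b / ((a + b) ** 2 * (a + b + 1))

print(f"simulated mean {frac.mean():.4f} vs Beta mean {beta_mean:.4f}")
print(f"simulated var  {frac.var():.4f} vs Beta var  {beta_var:.4f}")
```

Note that the common scale cancels in the ratio, which is why the fractional intensities depend only on the two shape parameters.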
Microarray-based screening of heat shock protein inhibitors.
Schax, Emilia; Walter, Johanna-Gabriela; Märzhäuser, Helene; Stahl, Frank; Scheper, Thomas; Agard, David A; Eichner, Simone; Kirschning, Andreas; Zeilinger, Carsten
2014-06-20
Based on the importance of heat shock proteins (HSPs) in diseases such as cancer, Alzheimer's disease or malaria, inhibitors of these chaperons are needed. Today's state-of-the-art techniques to identify HSP inhibitors are performed in microplate format, requiring large amounts of proteins and potential inhibitors. In contrast, we have developed a miniaturized protein microarray-based assay to identify novel inhibitors, allowing analysis with 300 pmol of protein. The assay is based on competitive binding of fluorescence-labeled ATP and potential inhibitors to the ATP-binding site of HSP. Therefore, the developed microarray enables the parallel analysis of different ATP-binding proteins on a single microarray. We have demonstrated the possibility of multiplexing by immobilizing full-length human HSP90α and HtpG of Helicobacter pylori on microarrays. Fluorescence-labeled ATP was competed by novel geldanamycin/reblastatin derivatives with IC50 values in the range of 0.5 nM to 4 μM and Z(*)-factors between 0.60 and 0.96. Our results demonstrate the potential of a target-oriented multiplexed protein microarray to identify novel inhibitors for different members of the HSP90 family. Copyright © 2014 Elsevier B.V. All rights reserved.
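Assuming the Z-factors quoted above follow the usual screening-window definition, Z' = 1 - 3(sd_pos + sd_neg) / |mean_pos - mean_neg|, assay quality can be computed from control wells as sketched below. The function name and the control readings are hypothetical; the abstract does not give the raw data or state which variant of the statistic was used.

```python
import numpy as np

def z_prime(positive_controls, negative_controls):
    """Screening-window coefficient from positive and negative control signals."""
    pos = np.asarray(positive_controls, dtype=float)
    neg = np.asarray(negative_controls, dtype=float)
    spread = 3.0 * (pos.std(ddof=1) + neg.std(ddof=1))
    window = abs(pos.mean() - neg.mean())
    return 1.0 - spread / window

# Hypothetical fluorescence readings: labeled ATP bound (no inhibitor)
# versus fully competed spots.
bound = [100, 102, 98, 101, 99]
competed = [10, 11, 9, 10, 10]
quality = z_prime(bound, competed)
print(f"Z' = {quality:.3f}")
```

Values close to 1 indicate a wide separation between controls relative to their noise; values between 0.5 and 1 are conventionally regarded as a good assay.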
Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.
2015-01-01
Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
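The core idea of autozygosity mapping, looking for long uninterrupted stretches of homozygous genotypes along a chromosome, can be sketched as follows. This is a simplified illustration and not AgileVCFMapper's algorithm: the two-letter genotype encoding, the `min_len` threshold and the function name are assumptions, and real tools also account for marker spacing, genotyping errors and missing calls.

```python
def homozygous_runs(genotypes, min_len=3):
    """Return (start, end) index pairs of runs of homozygous calls.

    genotypes: calls ordered by chromosomal position, e.g. "AA", "AB", "BB".
    min_len:   minimum number of consecutive homozygous markers to report.
    """
    runs = []
    start = None
    for i, call in enumerate(genotypes):
        is_hom = len(call) == 2 and call[0] == call[1]
        if is_hom and start is None:
            start = i                      # a new run begins
        elif not is_hom and start is not None:
            if i - start >= min_len:       # run ended at i - 1
                runs.append((start, i - 1))
            start = None
    if start is not None and len(genotypes) - start >= min_len:
        runs.append((start, len(genotypes) - 1))
    return runs

calls = ["AB", "AA", "BB", "AA", "AA", "AB", "BB", "BB", "AB", "AA", "AA", "AA"]
regions = homozygous_runs(calls, min_len=3)
print(regions)
```

In an affected individual from a consanguineous family, the reported regions would be the candidate intervals in which to look for deleterious homozygous variants.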
Automatic Identification and Quantification of Extra-Well Fluorescence in Microarray Images.
Rivera, Robert; Wang, Jie; Yu, Xiaobo; Demirkan, Gokhan; Hopper, Marika; Bian, Xiaofang; Tahsin, Tasnia; Magee, D Mitchell; Qiu, Ji; LaBaer, Joshua; Wallstrom, Garrick
2017-11-03
In recent studies involving NAPPA microarrays, extra-well fluorescence is used as a key measure for identifying disease biomarkers because there is evidence to support that it is better correlated with strong antibody responses than statistical analysis involving intraspot intensity. Because this feature is not well quantified by traditional image analysis software, identification and quantification of extra-well fluorescence is performed manually, which is both time-consuming and highly susceptible to variation between raters. A system that could automate this task efficiently and effectively would greatly improve the process of data acquisition in microarray studies, thereby accelerating the discovery of disease biomarkers. In this study, we experimented with different machine learning methods, as well as novel heuristics, for identifying spots exhibiting extra-well fluorescence (rings) in microarray images and assigning each ring a grade of 1-5 based on its intensity and morphology. The sensitivity of our final system for identifying rings was found to be 72% at 99% specificity and 98% at 92% specificity. Our system performs this task significantly faster than a human, while maintaining high performance, and therefore represents a valuable tool for microarray image analysis.
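The two performance figures reported above are computed from a confusion matrix of predicted against manually annotated spots. The sketch below shows the arithmetic on hypothetical labels (1 = ring present, 0 = no ring); the data and the function name are illustrative only and are not taken from the study.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = ring, 0 = no ring)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical annotations for ten spots.
annotated = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(annotated, predicted)
print(f"sensitivity {sens:.2f}, specificity {spec:.2f}")
```

Moving the classifier's decision threshold trades one figure against the other, which is why the abstract quotes two operating points.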
Combining proteomics and metabolite analyses to unravel cadmium stress-response in poplar leaves.
Kieffer, Pol; Planchon, Sébastien; Oufir, Mouhssin; Ziebel, Johanna; Dommes, Jacques; Hoffmann, Lucien; Hausman, Jean-François; Renaut, Jenny
2009-01-01
A proteomic analysis of poplar leaves exposed to cadmium, combined with biochemical analysis of pigments and carbohydrates revealed changes in primary carbon metabolism. Proteomic results suggested that photosynthesis was slightly affected. Together with a growth inhibition, photoassimilates were less needed for developmental processes and could be stored in the form of hexoses or complex sugars, acting also as osmoprotectants. Simultaneously, mitochondrial respiration was upregulated, providing energy needs of cadmium-exposed plants.
Shui, Wenqing; Xiong, Yun; Xiao, Weidi; Qi, Xianni; Zhang, Yong; Lin, Yuping; Guo, Yufeng; Zhang, Zhidan; Wang, Qinhong; Ma, Yanhe
2015-01-01
Saccharomyces cerevisiae has been intensively studied in responses to different environmental stresses such as heat shock through global omic analysis. However, the S. cerevisiae industrial strains with superior thermotolerance have not been explored in any proteomic studies for elucidating the tolerance mechanism. Recently a new diploid strain was obtained through evolutionary engineering of a parental industrial strain, and it exhibited even higher resistance to prolonged thermal stress. Herein, we performed iTRAQ-based quantitative proteomic analysis on both the parental and evolved industrial strains to further understand the mechanism of thermotolerant adaptation. Out of ∼2600 quantifiable proteins from biological quadruplicates, 193 and 204 proteins were differentially regulated in the parental and evolved strains respectively during heat-stressed growth. The proteomic response of the industrial strains cultivated under prolonged thermal stress turned out to be substantially different from that of the laboratory strain exposed to sudden heat shock. Further analysis of transcription factors underlying the proteomic perturbation also indicated the distinct regulatory mechanism of thermotolerance. Finally, a cochaperone Mdj1 and a metabolic enzyme Adh1 were selected to investigate their roles in mediating heat-stressed growth and ethanol production of yeasts. Our proteomic characterization of the industrial strain led to comprehensive understanding of the molecular basis of thermotolerance, which would facilitate future improvement in the industrially important trait of S. cerevisiae by rational engineering. PMID:25926660
A Comprehensive Guide for Performing Sample Preparation and Top-Down Protein Analysis
Padula, Matthew P.; Berry, Iain J.; O’Rourke, Matthew B.; Raymond, Benjamin B.A.; Santos, Jerran; Djordjevic, Steven P.
2017-01-01
Methodologies for the global analysis of proteins in a sample, or proteome analysis, have been available since 1975 when Patrick O’Farrell published the first paper describing two-dimensional gel electrophoresis (2D-PAGE). This technique allowed the resolution of single protein isoforms, or proteoforms, into single ‘spots’ in a polyacrylamide gel, allowing the quantitation of changes in a proteoform’s abundance to ascertain changes in an organism’s phenotype when conditions change. In pursuit of the comprehensive profiling of the proteome, significant advances in technology have made the identification and quantitation of intact proteoforms from complex mixtures of proteins more routine, allowing analysis of the proteome from the ‘Top-Down’. However, the number of proteoforms detected by Top-Down methodologies such as 2D-PAGE or mass spectrometry has not significantly increased since O’Farrell’s paper when compared to Bottom-Up, peptide-centric techniques. This article explores and explains the numerous methodologies and technologies available to analyse the proteome from the Top-Down with a strong emphasis on the necessity to analyse intact proteoforms as a better indicator of changes in biology and phenotype. We arrive at the conclusion that the complete and comprehensive profiling of an organism’s proteome is still, at present, beyond our reach but the continuing evolution of protein fractionation techniques and mass spectrometry brings comprehensive Top-Down proteome profiling closer. PMID:28387712
Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline
Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W.; Moritz, Robert L.
2015-01-01
Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large-scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of the cloud-enabled Trans-Proteomic Pipeline by processing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. PMID:25418363
Translational plant proteomics: a perspective.
Agrawal, Ganesh Kumar; Pedreschi, Romina; Barkla, Bronwyn J; Bindschedler, Laurence Veronique; Cramer, Rainer; Sarkar, Abhijit; Renaut, Jenny; Job, Dominique; Rakwal, Randeep
2012-08-03
Translational proteomics is an emerging sub-discipline of the proteomics field in the biological sciences. Translational plant proteomics aims to integrate knowledge from basic sciences to translate it into field applications to solve issues related but not limited to the recreational and economic values of plants, food security and safety, and energy sustainability. In this review, we highlight the substantial progress reached in plant proteomics during the past decade which has paved the way for translational plant proteomics. Increasing proteomics knowledge in plants is not limited to model and non-model plants, proteogenomics, crop improvement, and food analysis, safety, and nutrition but to many more potential applications. Given the wealth of information generated and to some extent applied, there is the need for more efficient and broader channels to freely disseminate the information to the scientific community. This article is part of a Special Issue entitled: Translational Proteomics. Copyright © 2012 Elsevier B.V. All rights reserved.
Thermodynamically optimal whole-genome tiling microarray design and validation.
Cho, Hyejin; Chou, Hui-Hsien
2016-06-13
The microarray is an efficient tool for interrogating the whole transcriptome of a species. Microarrays can be designed according to annotated gene sets, but the resulting microarrays cannot be used to identify novel transcripts, and this design method is not applicable to unannotated species. Alternatively, a whole-genome tiling microarray can be designed using only genomic sequences without gene annotations, and it can be used to detect novel RNA transcripts as well as known genes. The difficulty with tiling microarray design lies in the tradeoff between probe specificity and coverage of the genome. Sequence comparison methods based on BLAST or similar software are commonly employed in microarray design, but they cannot precisely determine the subtle thermodynamic competition between probe targets and partially matched probe nontargets during hybridizations. Using the whole-genome thermodynamic analysis software PICKY to design tiling microarrays, we can achieve maximum whole-genome coverage allowable under the thermodynamic constraints of each target genome. The resulting tiling microarrays are thermodynamically optimal in the sense that all selected probes share the same melting temperature separation range between their targets and closest nontargets, and no additional probes can be added without violating the specificity of the microarray to the target genome. This new design method was used to create two whole-genome tiling microarrays for Escherichia coli MG1655 and Agrobacterium tumefaciens C58, and the experimental results validated the design.
Usadel, Björn; Nagel, Axel; Steinhauser, Dirk; Gibon, Yves; Bläsing, Oliver E; Redestig, Henning; Sreenivasulu, Nese; Krall, Leonard; Hannah, Matthew A; Poree, Fabien; Fernie, Alisdair R; Stitt, Mark
2006-12-18
Microarray technology has become a widely accepted and standardized tool in biology. The first microarray data analysis programs were developed to support pair-wise comparison. However, as microarray experiments have become more routine, large scale experiments have become more common, which investigate multiple time points or sets of mutants or transgenics. To extract biological information from such high-throughput expression data, it is necessary to develop efficient analytical platforms, which combine manually curated gene ontologies with efficient visualization and navigation tools. Currently, most tools focus on a few limited biological aspects, rather than offering a holistic, integrated analysis. Here we introduce PageMan, a multiplatform, user-friendly, and stand-alone software tool that annotates, investigates, and condenses high-throughput microarray data in the context of functional ontologies. It includes a GUI tool to transform different ontologies into a suitable format, enabling the user to compare and choose between different ontologies. It is equipped with several statistical modules for data analysis, including over-representation analysis and Wilcoxon statistical testing. Results are exported in a graphical format for direct use, or for further editing in graphics programs. PageMan provides a fast overview of single treatments and allows genome-level responses to be compared across several microarray experiments covering, for example, stress responses at multiple time points. This aids in searching for trait-specific changes in pathways using mutants or transgenics, analyzing development time-courses, and comparison between species. In a case study, we analyze the results of publicly available microarrays of multiple cold stress experiments using PageMan, and compare the results to a previously published meta-analysis. PageMan offers a complete user's guide, a web-based over-representation analysis as well as a tutorial, and is freely available at http://mapman.mpimp-golm.mpg.de/pageman/. PageMan allows multiple microarray experiments to be efficiently condensed into a single page graphical display. The flexible interface allows data to be quickly and easily visualized, facilitating comparisons within experiments and to published experiments, thus enabling researchers to gain a rapid overview of the biological responses in the experiments.
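Over-representation analysis, one of the statistical modules mentioned above, asks whether a functional category contains more responsive genes than chance would predict. A common formulation is the one-sided hypergeometric test sketched below. This is an illustration of the general technique and not necessarily the exact test PageMan implements; the function name and the gene counts are hypothetical.

```python
from math import comb

def enrichment_p_value(total_genes, genes_in_category,
                       selected_genes, selected_in_category):
    """One-sided hypergeometric P(X >= selected_in_category).

    total_genes:          genes on the array
    genes_in_category:    genes annotated to the functional category
    selected_genes:       genes called as responsive
    selected_in_category: responsive genes that fall in the category
    """
    upper = min(genes_in_category, selected_genes)
    denominator = comb(total_genes, selected_genes)
    tail = sum(
        comb(genes_in_category, i)
        * comb(total_genes - genes_in_category, selected_genes - i)
        for i in range(selected_in_category, upper + 1)
    )
    return tail / denominator

# Hypothetical counts: 1000 genes on the array, 40 in the category,
# 50 responsive genes, 8 of which belong to the category.
p = enrichment_p_value(1000, 40, 50, 8)
print(f"p = {p:.3g}")
```

When many categories are tested at once, the resulting p-values would normally be corrected for multiple testing before categories are reported as enriched.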
Biochemical and genetic analysis of the yeast proteome with a movable ORF collection
Gelperin, Daniel M.; White, Michael A.; Wilkinson, Martha L.; Kon, Yoshiko; Kung, Li A.; Wise, Kevin J.; Lopez-Hoyo, Nelson; Jiang, Lixia; Piccirillo, Stacy; Yu, Haiyuan; Gerstein, Mark; Dumont, Mark E.; Phizicky, Eric M.; Snyder, Michael; Grayhack, Elizabeth J.
2005-01-01
Functional analysis of the proteome is an essential part of genomic research. To facilitate different proteomic approaches, a MORF (moveable ORF) library of 5854 yeast expression plasmids was constructed, each expressing a sequence-verified ORF as a C-terminal ORF fusion protein, under regulated control. Analysis of 5573 MORFs demonstrates that nearly all verified ORFs are expressed, suggests the authenticity of 48 ORFs characterized as dubious, and implicates specific processes including cytoskeletal organization and transcriptional control in growth inhibition caused by overexpression. Global analysis of glycosylated proteins identifies 109 new confirmed N-linked and 345 candidate glycoproteins, nearly doubling the known yeast glycome. PMID:16322557
Findeisen, Peter; Neumaier, Michael
2009-01-01
Proteomics analysis has been heralded as a novel tool for identifying new and specific biomarkers that may improve diagnosis and monitoring of various disease states. Recent years have brought a number of proteomics profiling technologies. Although proteomics profiling has resulted in the detection of disease-associated differences and modification of proteins, current proteomics technologies display certain limitations that are hampering the introduction of these new technologies into clinical laboratory diagnostics and routine applications. In this review, we summarize current advances in mass spectrometry based biomarker discovery. The promises and challenges of this new technology are discussed with particular emphasis on diagnostic perspectives of mass-spectrometry based proteomics profiling for malignant diseases.
Identification of candidate genes in osteoporosis by integrated microarray analysis.
Li, J J; Wang, B Q; Fei, Q; Yang, Y; Li, D
2016-12-01
To screen for altered gene expression profiles in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be significantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E+00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins included ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Analysis of gene function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may be involved in osteoporosis through osteoblastic differentiation and bone formation. Cite this article: J. J. Li, B. Q. Wang, Q. Fei, Y. Yang, D. Li. Identification of candidate genes in osteoporosis by integrated microarray analysis. Bone Joint Res 2016;5:594-601. DOI: 10.1302/2046-3758.512.BJR-2016-0073.R1. © 2016 Fei et al.
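The "Degree" values quoted for the hub proteins above are node degrees in the PPI network, that is, the number of distinct interaction partners of each protein. The following is a minimal sketch of that calculation, not the authors' pipeline. The function names and the toy edge list (including the partners P1 to P3) are invented for illustration and do not reproduce the reported degrees.

```python
from collections import defaultdict


def node_degrees(edges):
    """Return {protein: number of distinct interaction partners}."""
    neighbours = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-interactions
        # Sets make the count insensitive to duplicated or reversed edges.
        neighbours[a].add(b)
        neighbours[b].add(a)
    return {protein: len(partners) for protein, partners in neighbours.items()}


def top_hubs(edges, k=3):
    """Return the k highest-degree proteins as (protein, degree) pairs."""
    degrees = node_degrees(edges)
    # Sort by descending degree, then by name so ties are deterministic.
    return sorted(degrees.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


# Hypothetical toy network; P1..P3 are placeholder partners.
toy_edges = [
    ("USP9X", "P1"), ("USP9X", "P2"), ("USP9X", "P3"),
    ("USP19", "P1"), ("USP19", "P2"),
    ("UBE2B", "P3"),
    ("P1", "USP9X"),  # reversed duplicate, counted once
]

if __name__ == "__main__":
    for protein, degree in top_hubs(toy_edges, k=3):
        print(protein, degree)
```

In a real analysis the edge list would come from an interaction database, and the degree cut-off that defines a "hub" is a choice the analyst has to justify.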
Nanoliter-Scale Oil-Air-Droplet Chip-Based Single Cell Proteomic Analysis.
Li, Zi-Yi; Huang, Min; Wang, Xiu-Kun; Zhu, Ying; Li, Jin-Song; Wong, Catherine C L; Fang, Qun
2018-04-17
Single cell proteomic analysis provides crucial information on cellular heterogeneity in biological systems. Herein, we describe a nanoliter-scale oil-air-droplet (OAD) chip for achieving multistep complex sample pretreatment and injection for single cell proteomic analysis in the shotgun mode. By using miniaturized stationary droplet microreaction and manipulation techniques, our system allows all sample pretreatment and injection procedures to be performed in a nanoliter-scale droplet with minimum sample loss and a high sample injection efficiency (>99%), thus substantially increasing the analytical sensitivity for single cell samples. We applied the present system in the proteomic analysis of 100 ± 10, 50 ± 5, 10, and 1 HeLa cell(s), and protein IDs of 1360, 612, 192, and 51 were identified, respectively. The OAD chip-based system was further applied in single mouse oocyte analysis, with 355 protein IDs identified at the single oocyte level, which demonstrated its special advantages of high enrichment of sequence coverage, hydrophobic proteins, and enzymatic digestion efficiency over the traditional in-tube system.
Data from quantitative label free proteomics analysis of rat spleen.
Dudekula, Khadar; Le Bihan, Thierry
2016-09-01
The dataset presented in this work has been obtained using a label-free quantitative proteomic analysis of rat spleen. A robust method for extraction of proteins from rat spleen tissue and LC-MS-MS analysis was developed using a urea and SDS-based buffer. Different fractionation methods were compared. A total of 3484 different proteins were identified from the pool of all experiments run in this study (a total of 2460 proteins with at least two peptides). A total of 1822 proteins were identified from nine non-fractionated pulse gels, while 2288 and 2864 proteins were identified by SDS-PAGE fractionation into three and five fractions, respectively. The proteomics data are deposited in the ProteomeXchange Consortium via PRIDE (PXD003520); Progenesis and MaxQuant output are presented in the supporting information. The lists of proteins generated under the different fractionation regimes allow the nature of the identified proteins and the variability in the quantitative analysis associated with each sampling strategy to be assessed, and allow a proper number of replicates to be defined for future quantitative analysis.
Affinity Proteomics in the mountains: Alpbach 2015.
Taussig, Michael J
2016-09-25
The 2015 Alpbach Workshop on Affinity Proteomics, organised by the EU AFFINOMICS consortium, was the 7th workshop in this series. As in previous years, the focus of the event was the current state of affinity methods for proteome analysis, including complementarity with mass spectrometry, progress in recombinant binder production methods, alternatives to classical antibodies as affinity reagents, analysis of proteome targets, industry focus on biomarkers, and diagnostic and clinical applications. The combination of excellent science with Austrian mountain scenery and winter sports engenders an atmosphere that makes this series of workshops exceptional. The articles in this Special Issue represent a cross-section of the presentations at the 2015 meeting. Copyright © 2016 Elsevier B.V. All rights reserved.
Sherlock Holmes and the proteome--a detective story.
Righetti, Pier Giorgio; Boschetti, Egisto
2007-02-01
The performance of a hexapeptide ligand library in capturing the 'hidden proteome' is illustrated and evaluated. This library, insolubilized on an organic polymer and available under the trade name 'Equalizer Bead Technology', acts by capturing all components of a given proteome, by concentrating rare and very rare proteins, and simultaneously diluting the abundant ones. This results in a proteome of 'normalized' relative abundances, amenable to analysis by MS and any other analytical tool. Examples are given of analysis of human urine and serum, as well as cell and tissue lysates, such as Escherichia coli and Saccharomyces cerevisiae extracts. Another important application is impurity tracking and polishing of recombinant DNA products, especially biopharmaceuticals meant for human consumption.
Workflows for microarray data processing in the Kepler environment.
Stropp, Thomas; McPhillips, Timothy; Ludäscher, Bertram; Bieda, Mark
2012-05-17
Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. 
These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or R/BioConductor scripting approaches to pipeline design. Finally, we suggest that microarray data processing task workflows may provide a basis for future example-based comparison of different workflow systems. We provide a set of tools and complete workflows for microarray data analysis in the Kepler environment, which has the advantages of offering graphical, clear display of conceptual steps and parameters and the ability to easily integrate other resources such as remote data and web services.
Wada, Yoshinao; Dell, Anne; Haslam, Stuart M; Tissot, Bérangère; Canis, Kévin; Azadi, Parastoo; Bäckström, Malin; Costello, Catherine E; Hansson, Gunnar C; Hiki, Yoshiyuki; Ishihara, Mayumi; Ito, Hiromi; Kakehi, Kazuaki; Karlsson, Niclas; Hayes, Catherine E; Kato, Koichi; Kawasaki, Nana; Khoo, Kay-Hooi; Kobayashi, Kunihiko; Kolarich, Daniel; Kondo, Akihiro; Lebrilla, Carlito; Nakano, Miyako; Narimatsu, Hisashi; Novak, Jan; Novotny, Milos V; Ohno, Erina; Packer, Nicolle H; Palaima, Elizabeth; Renfrow, Matthew B; Tajiri, Michiko; Thomsson, Kristina A; Yagi, Hirokazu; Yu, Shin-Yi; Taniguchi, Naoyuki
2010-04-01
The Human Proteome Organisation Human Disease Glycomics/Proteome Initiative recently coordinated a multi-institutional study that evaluated methodologies that are widely used for defining the N-glycan content in glycoproteins. The study convincingly endorsed mass spectrometry as the technique of choice for glycomic profiling in the discovery phase of diagnostic research. The present study reports the extension of the Human Disease Glycomics/Proteome Initiative's activities to an assessment of the methodologies currently used for O-glycan analysis. Three samples of IgA1 isolated from the serum of patients with multiple myeloma were distributed to 15 laboratories worldwide for O-glycomics analysis. A variety of mass spectrometric and chromatographic procedures representative of current methodologies were used. Similar to the previous N-glycan study, the results convincingly confirmed the pre-eminent performance of MS for O-glycan profiling. Two general strategies were found to give the most reliable data, namely direct MS analysis of mixtures of permethylated reduced glycans in the positive ion mode and analysis of native reduced glycans in the negative ion mode using LC-MS approaches. In addition, mass spectrometric methodologies to analyze O-glycopeptides were also successful.
Sheng, Yue; Zhao, Wei; Song, Ying; Li, Zhigang; Luo, Majing; Lei, Quan; Cheng, Hanhua; Zhou, Rongjia
2015-05-18
A variety of mechanisms are engaged in sex determination in vertebrates. The teleost fish swamp eel undergoes sex reversal naturally and is an ideal model for vertebrate sexual development. However, the importance of proteome-wide scanning for gonad reversal was not previously determined. We report a 2-D electrophoresis analysis of the proteomes of three gonad types during sex reversal. MS/MS analysis revealed a group of differentially expressed proteins during ovary to ovotestis to testis transformation. Cbx3 is up-regulated during gonad reversal and is likely to have a role in spermatogenesis. Rab37 is down-regulated during the reversal and is mainly associated with oogenesis. Both Cbx3 and Rab37 are linked in a protein network. These datasets in gonadal proteomes provide a new resource for further studies in gonadal development.
Stadlmann, Johannes; Hoi, David M; Taubenschmid, Jasmin; Mechtler, Karl; Penninger, Josef M
2018-05-18
SugarQb (www.imba.oeaw.ac.at/sugarqb) is a freely available collection of computational tools for the automated identification of intact glycopeptides from high-resolution HCD MS/MS data-sets in the Proteome Discoverer environment. We report the migration of SugarQb to the latest and free version of Proteome Discoverer 2.1, and apply it to the analysis of PNGase F-resistant N-glycopeptides from mouse embryonic stem cells. The analysis of intact glycopeptides highlights unexpected technical limitations to PNGase F-dependent glycoproteomic workflows at the proteome level, and warrants a critical re-interpretation of seminal data-sets in the context of N-glycosylation-site prediction. This article is protected by copyright. All rights reserved.
MAPU: Max-Planck Unified database of organellar, cellular, tissue and body fluid proteomes
Zhang, Yanling; Zhang, Yong; Adachi, Jun; Olsen, Jesper V.; Shi, Rong; de Souza, Gustavo; Pasini, Erica; Foster, Leonard J.; Macek, Boris; Zougman, Alexandre; Kumar, Chanchal; Wiśniewski, Jacek R.; Jun, Wang; Mann, Matthias
2007-01-01
Mass spectrometry (MS)-based proteomics has become a powerful technology to map the protein composition of organelles, cell types and tissues. In our department, a large-scale effort to map these proteomes is complemented by the Max-Planck Unified (MAPU) proteome database. MAPU contains several body fluid proteomes, including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities and is freely available at using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools. PMID:17090601
Proteomics of filamentous fungi.
Kim, Yonghyun; Nandakumar, M P; Marten, Mark R
2007-09-01
Proteomic analysis, defined here as the global assessment of cellular proteins expressed in a particular biological state, is a powerful tool that can provide a systematic understanding of events at the molecular level. Proteomic studies of filamentous fungi have only recently begun to appear in the literature, despite the prevalence of these organisms in the biotechnology industry, and their importance as both human and plant pathogens. Here, we review recent publications that have used a proteomic approach to develop a better understanding of filamentous fungi, highlighting sample preparation methods and whole-cell cytoplasmic proteomics, as well as subproteomics of cell envelope, mitochondrial and secreted proteins.
Goodman, Corey W.; Major, Heather J.; Walls, William D.; Sheffield, Val C.; Casavant, Thomas L.; Darbro, Benjamin W.
2016-01-01
Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high-throughput, low-cost analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operating characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques such as CMAs, namely the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. PMID:25595567
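The per-probe ROC calibration described above can be illustrated with a short sketch. This is not the CNV-ROC implementation: it assumes each probe on the lower resolution array has an absolute log2 ratio and a true/false label derived from the higher resolution array, and it picks the threshold by maximising Youden's J (sensitivity minus false positive rate), which is a choice made here for illustration only. All function names and data values are hypothetical.

```python
def roc_points(abs_log2_ratios, truth, thresholds):
    """Return (threshold, true positive rate, false positive rate) per threshold.

    abs_log2_ratios: per-probe |log2 ratio| from the array under test.
    truth: per-probe booleans, True if the reference array places the
           probe inside a CNV (assumed to be given).
    """
    points = []
    for t in thresholds:
        tp = fp = tn = fn = 0
        for value, is_cnv in zip(abs_log2_ratios, truth):
            called = value >= t  # probe is called as CNV at this threshold
            if called and is_cnv:
                tp += 1
            elif called and not is_cnv:
                fp += 1
            elif not called and is_cnv:
                fn += 1
            else:
                tn += 1
        tpr = tp / (tp + fn) if (tp + fn) else 0.0
        fpr = fp / (fp + tn) if (fp + tn) else 0.0
        points.append((t, tpr, fpr))
    return points


def best_threshold(points):
    """Pick the threshold with the largest Youden's J = TPR - FPR."""
    return max(points, key=lambda p: p[1] - p[2])[0]


# Invented toy data: six probes, two of them inside a reference CNV.
toy_ratios = [0.05, 0.10, 0.20, 0.35, 0.50, 0.80]
toy_truth = [False, False, False, True, False, True]
toy_points = roc_points(toy_ratios, toy_truth, thresholds=[0.1, 0.3, 0.6])

if __name__ == "__main__":
    for t, tpr, fpr in toy_points:
        print(f"threshold={t:.1f} TPR={tpr:.2f} FPR={fpr:.2f}")
    print("selected threshold:", best_threshold(toy_points))
```

Because every probe receives a label, true and false negatives are counted directly, which is the property the abstract emphasises.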
Comparative Testis Tissue Proteomics Using 2-Dye Versus 3-Dye DIGE Analysis.
Holland, Ashling
2018-01-01
Comparative tissue proteomics aims to analyze alterations of the proteome in response to a stimulus. Two-dimensional difference gel electrophoresis (2D-DIGE) is a modified and advanced form of 2D gel electrophoresis. DIGE is a powerful biochemical method that compares two or three protein samples on the same analytical gel, and can be used to establish differentially expressed protein levels between healthy normal and diseased pathological tissue sample groups. Minimal DIGE labeling can be used via a 2-dye system with Cy3 and Cy5 or a 3-dye system with Cy2, Cy3, and Cy5 to fluorescently label samples with CyDye fluors pre-electrophoresis. DIGE circumvents gel-to-gel variability by multiplexing samples to a single gel and through the use of a pooled internal standard for normalization. This form of quantitative high-resolution proteomics facilitates the comparative analysis and evaluation of tissue protein compositions. Comparing tissue groups under different conditions is crucially important for advancing the biomedical field by characterization of cellular processes, understanding pathophysiological development and tissue biomarker discovery. This chapter discusses 2D-DIGE as a comparative tissue proteomic technique and describes in detail the experimental steps required for comparative proteomic analysis employing both options of 2-dye and 3-dye DIGE minimal labeling.
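The role of the pooled internal standard in removing gel-to-gel variability can be shown with a small numerical sketch. It assumes, for illustration only, that the pooled standard is run on every gel in one dye channel and that each sample spot is expressed as a log2 ratio to the standard spot on the same gel. The function name and intensity values are hypothetical and are not taken from the chapter.

```python
import math


def standardized_log_abundance(sample_intensity, standard_intensity):
    """log2 ratio of a sample spot to the pooled-standard spot on the same gel."""
    if sample_intensity <= 0 or standard_intensity <= 0:
        raise ValueError("spot intensities must be positive")
    return math.log2(sample_intensity / standard_intensity)


# Invented intensities for one protein spot on two different gels.
# Gel 2 is brighter overall, which the standard on that gel also reflects.
healthy_gel1 = standardized_log_abundance(sample_intensity=2000.0,
                                          standard_intensity=1000.0)
diseased_gel2 = standardized_log_abundance(sample_intensity=6000.0,
                                           standard_intensity=1500.0)

# Difference of standardized values = log2 fold change between conditions.
log2_fold_change = diseased_gel2 - healthy_gel1

if __name__ == "__main__":
    print("log2 fold change (diseased vs healthy):", log2_fold_change)
```

Comparing raw intensities here would suggest a 3-fold difference (6000 vs 2000), whereas the standardized values give a 2-fold difference once the brighter second gel is accounted for.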
Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.
2010-01-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087