Sample records for functional genomics proteomics

  1. Proteomics in the genome engineering era.

    PubMed

    Vandemoortele, Giel; Gevaert, Kris; Eyckerman, Sven

    2016-01-01

    Genome engineering experiments used to be lengthy, inefficient, and often expensive, preventing a widespread adoption of such experiments for the full assessment of endogenous protein functions. With the revolutionary clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 technology, genome engineering became accessible to the broad life sciences community and is now implemented in several research areas. One particular field that can benefit significantly from this evolution is proteomics where a substantial impact on experimental design and general proteome biology can be expected. In this review, we describe the main applications of genome engineering in proteomics, including the use of engineered disease models and endogenous epitope tagging. In addition, we provide an overview on current literature and highlight important considerations when launching genome engineering technologies in proteomics workflows. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Combining genomic and proteomic approaches for epigenetics research

    PubMed Central

    Han, Yumiao; Garcia, Benjamin A

    2014-01-01

    Epigenetics is the study of changes in gene expression or cellular phenotype that do not change the DNA sequence. In this review, current methods, both genomic and proteomic, associated with epigenetics research are discussed. Among them, chromatin immunoprecipitation (ChIP) followed by sequencing and other ChIP-based techniques are powerful techniques for genome-wide profiling of DNA-binding proteins, histone post-translational modifications or nucleosome positions. However, mass spectrometry-based proteomics is increasingly being used in functional biological studies and has proved to be an indispensable tool to characterize histone modifications, as well as DNA–protein and protein–protein interactions. With the development of genomic and proteomic approaches, combination of ChIP and mass spectrometry has the potential to expand our knowledge of epigenetics research to a higher level. PMID:23895656

  3. Exploring the post-genomic world: differing explanatory and manipulatory functions of post-genomic sciences.

    PubMed

    Holmes, Christina; Carlson, Siobhan M; McDonald, Fiona; Jones, Mavis; Graham, Janice

    2016-01-02

    Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbate this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics.

  4. ProGeRF: Proteome and Genome Repeat Finder Utilizing a Fast Parallel Hash Function

    PubMed Central

    Moraes, Walas Jhony Lopes; Rodrigues, Thiago de Souza; Bartholomeu, Daniella Castanheira

    2015-01-01

    Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be an efficient, fast, accurate and, primarily, user-friendly web tool offering many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity. PMID:25811026
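The hash table idea mentioned in this abstract can be illustrated with a minimal sketch. This is not ProGeRF's actual implementation: the function names `index_kmers` and `perfect_tandem_repeats` are hypothetical, the motif length `k` is fixed by the caller, and only perfect (exact) adjacent repeats are handled. Each k-mer is stored once in a dictionary of start positions, so for a fixed `k` the work grows roughly linearly with sequence length.

```python
from collections import defaultdict


def index_kmers(seq, k):
    """Map each k-mer to the list of its start positions (one pass over seq)."""
    table = defaultdict(list)
    for i in range(len(seq) - k + 1):
        table[seq[i:i + k]].append(i)
    return table


def perfect_tandem_repeats(seq, k, min_copies=2):
    """Return sorted (start, motif, copies) for runs of adjacent exact copies.

    A run is a motif of length k found at start, start + k, start + 2k, ...
    Illustrative only: imperfect repeats are not considered.
    """
    results = []
    for motif, positions in index_kmers(seq, k).items():
        starts = set(positions)
        for p in positions:
            if p - k in starts:
                continue  # p continues an earlier run; it is not a run start
            copies = 1
            while p + copies * k in starts:
                copies += 1
            if copies >= min_copies:
                results.append((p, motif, copies))
    return sorted(results)


if __name__ == "__main__":
    for start, motif, copies in perfect_tandem_repeats("GGACACACTT", 2):
        print(start, motif, copies)
```

Note that the same repetitive region can be reported in more than one phase (for example as both `AC` and `CA` runs); a real tool would typically merge such overlapping hits.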

  5. Personalized medicine beyond genomics: alternative futures in big data-proteomics, environtome and the social proteome.

    PubMed

    Özdemir, Vural; Dove, Edward S; Gürsoy, Ulvi K; Şardaş, Semra; Yıldırım, Arif; Yılmaz, Şenay Görücü; Ömer Barlas, I; Güngör, Kıvanç; Mete, Alper; Srivastava, Sanjeeva

    2017-01-01

    No field in science and medicine today remains untouched by Big Data, and psychiatry is no exception. Proteomics is a Big Data technology and a next generation biomarker, supporting novel system diagnostics and therapeutics in psychiatry. Proteomics technology is, in fact, much older than genomics and dates to the 1970s, well before the launch of the international Human Genome Project. While the genome has long been framed as the master or "elite" executive molecule in cell biology, the proteome by contrast is humble. Yet the proteome is critical for life: it ensures the daily functioning of cells and whole organisms. In short, proteins are the blue-collar workers of biology, the down-to-earth molecules that we cannot live without. Since 2010, proteomics has found renewed meaning and international attention with the launch of the Human Proteome Project and the growing interest in Big Data technologies such as proteomics. This article presents an interdisciplinary technology foresight analysis and conceptualizes the terms "environtome" and "social proteome". We define "environtome" as the entire complement of elements external to the human host, from microbiome, ambient temperature and weather conditions to government innovation policies, stock market dynamics, human values, political power and social norms that collectively shape the human host spatially and temporally. The "social proteome" is the subset of the environtome that influences the transition of proteomics technology to innovative applications in society. The social proteome encompasses, for example, new reimbursement schemes and business innovation models for proteomics diagnostics that depart from the "once-a-life-time" genotypic tests and the anticipated hype attendant to context and time sensitive proteomics tests. Building on the "nesting principle" for governance of complex systems as discussed by Elinor Ostrom, we propose here a 3-tiered organizational architecture for Big Data science such as

  6. Exploring the post-genomic world: differing explanatory and manipulatory functions of post-genomic sciences

    PubMed Central

    Holmes, Christina; Carlson, Siobhan M.; McDonald, Fiona; Jones, Mavis; Graham, Janice

    2016-01-01

    Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbate this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics. PMID:27134568

  7. The proteome: structure, function and evolution

    PubMed Central

    Fleming, Keiran; Kelley, Lawrence A; Islam, Suhail A; MacCallum, Robert M; Muller, Arne; Pazos, Florencio; Sternberg, Michael J.E

    2006-01-01

    This paper reports two studies to model the inter-relationships between protein sequence, structure and function. First, an automated pipeline to provide a structural annotation of proteomes in the major genomes is described. The results are stored in a database at Imperial College, London (3D-GENOMICS) that can be accessed at www.sbg.bio.ic.ac.uk. Analysis of the assignments to structural superfamilies provides evolutionary insights. 3D-GENOMICS is being integrated with related proteome annotation data at University College London and the European Bioinformatics Institute in a project known as e-protein (http://www.e-protein.org/). The second topic is motivated by the developments in structural genomics projects in which the structure of a protein is determined prior to knowledge of its function. We have developed a new approach PHUNCTIONER that uses the gene ontology (GO) classification to supervise the extraction of the sequence signal responsible for protein function from a structure-based sequence alignment. Using GO we can obtain profiles for a range of specificities described in the ontology. In the region of low sequence similarity (around 15%), our method is more accurate than assignment from the closest structural homologue. The method is also able to identify the specific residues associated with the function of the protein family. PMID:16524832

  8. DEFINING THE MANDATE OF PROTEOMICS IN THE POST-GENOMIC ERA: WORKSHOP REPORT

    EPA Science Inventory

    Research in proteomics is the next step after genomics in understanding life processes at the molecular level. In the largest sense proteomics encompasses knowledge of the structure, function and expression of all proteins in the biochemical or biological contexts of all organism...

  9. The Functional Network of the Arabidopsis Plastoglobule Proteome Based on Quantitative Proteomics and Genome-Wide Coexpression Analysis

    PubMed Central

    Lundquist, Peter K.; Poliakov, Anton; Bhuiyan, Nazmul H.; Zybailov, Boris; Sun, Qi; van Wijk, Klaas J.

    2012-01-01

    Plastoglobules (PGs) in chloroplasts are thylakoid-associated monolayer lipoprotein particles containing prenyl and neutral lipids and several dozen proteins mostly with unknown functions. An integrated view of the role of the PG is lacking. Here, we better define the PG proteome and provide a conceptual framework for further studies. The PG proteome from Arabidopsis (Arabidopsis thaliana) leaf chloroplasts was determined by mass spectrometry of isolated PGs and quantitative comparison with the proteomes of unfractionated leaves, thylakoids, and stroma. Scanning electron microscopy showed the purity and size distribution of the isolated PGs. Compared with previous PG proteome analyses, we excluded several proteins and identified six new PG proteins, including an M48 metallopeptidase and two Absence of bc1 complex (ABC1) atypical kinases, confirmed by immunoblotting. This refined PG proteome consisted of 30 proteins, including six ABC1 kinases and seven fibrillins together comprising more than 70% of the PG protein mass. Other fibrillins were located predominantly in the stroma or thylakoid and not in PGs; we discovered that this partitioning can be predicted by their isoelectric point and hydrophobicity. A genome-wide coexpression network for the PG genes was then constructed from mRNA expression data. This revealed a modular network with four distinct modules that each contained at least one ABC1K and/or fibrillin gene. Each module showed clear enrichment in specific functions, including chlorophyll degradation/senescence, isoprenoid biosynthesis, plastid proteolysis, and redox regulators and phosphoregulators of electron flow. We propose a new testable model for the PGs, in which sets of genes are associated with specific PG functions. PMID:22274653

  10. Proteomic analysis of Medulloblastoma reveals functional biology with translational potential.

    PubMed

    Rivero-Hinojosa, Samuel; Lau, Ling San; Stampar, Mojca; Staal, Jerome; Zhang, Huizhen; Gordish-Dressman, Heather; Northcott, Paul A; Pfister, Stefan M; Taylor, Michael D; Brown, Kristy J; Rood, Brian R

    2018-06-07

    Genomic characterization has begun to redefine diagnostic classifications of cancers. However, it remains a challenge to infer disease phenotypes from genomic alterations alone. To help realize the promise of genomics, we have performed a quantitative proteomics investigation using Stable Isotope Labeling by Amino Acids in Cell Culture (SILAC) and 41 tissue samples spanning the 4 genomically based subgroups of medulloblastoma and control cerebellum. We have identified and quantitated thousands of proteins across these groups and find that we are able to recapitulate the genomic subgroups based upon subgroup restricted and differentially abundant proteins while also identifying subgroup specific protein isoforms. Integrating our proteomic measurements with genomic data, we calculate a poor correlation between mRNA and protein abundance. Using EPIC 850 k methylation array data on the same tissues, we also investigate the influence of copy number alterations and DNA methylation on the proteome in an attempt to characterize the impact of these genetic features on the proteome. Reciprocally, we are able to use the proteome to identify which genomic alterations result in altered protein abundance and thus are most likely to impact biology. Finally, we are able to assemble protein-based pathways yielding potential avenues for clinical intervention. From these, we validate the EIF4F cap-dependent translation pathway as a novel druggable pathway in medulloblastoma. Thus, quantitative proteomics complements genomic platforms to yield a more complete understanding of functional tumor biology and identify novel therapeutic targets for medulloblastoma.
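As a small illustration of what comparing mRNA and protein abundance can look like in practice, the sketch below computes a Spearman rank correlation between two matched lists of values. The abstract does not say which correlation measure the authors used, so the choice of Spearman is an assumption, and the function names (`average_ranks`, `pearson`, `spearman`) and the sample values are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the block of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    sd_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Made-up abundances for five genes, purely for illustration.
    mrna = [5.1, 2.3, 8.7, 4.4, 6.0]
    protein = [1.2, 3.9, 2.8, 0.7, 3.1]
    print(round(spearman(mrna, protein), 3))
```

A value near 0 would indicate little monotonic relationship between the two measurements, while values near 1 or -1 indicate a strong one.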

  11. The Proteome Folding Project: Proteome-scale prediction of structure and function

    PubMed Central

    Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard

    2011-01-01

    The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995

  12. Fungal proteomics: from identification to function.

    PubMed

    Doyle, Sean

    2011-08-01

    Some fungi cause disease in humans and plants, while others have demonstrable potential for the control of insect pests. In addition, fungi are also a rich reservoir of therapeutic metabolites and industrially useful enzymes. Detailed analysis of fungal biochemistry is now enabled by multiple technologies including protein mass spectrometry, genome and transcriptome sequencing and advances in bioinformatics. Yet, the assignment of function to fungal proteins, encoded either by in silico annotated, or unannotated genes, remains problematic. The purpose of this review is to describe the strategies used by many researchers to reveal protein function in fungi, and more importantly, to consolidate the nomenclature of 'unknown function protein' as opposed to 'hypothetical protein' - once any protein has been identified by protein mass spectrometry. A combination of approaches including comparative proteomics, pathogen-induced protein expression and immunoproteomics are outlined, which, when used in combination with a variety of other techniques (e.g. functional genomics, microarray analysis, immunochemical and infection model systems), appear to yield comprehensive and definitive information on protein function in fungi. The relative advantages of proteomic, as opposed to transcriptomic-only, analyses are also described. In the future, combined high-throughput, quantitative proteomics, allied to transcriptomic sequencing, are set to reveal much about protein function in fungi. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  13. An object model and database for functional genomics.

    PubMed

    Jones, Andrew; Hunt, Ela; Wastling, Jonathan M; Pizarro, Angel; Stoeckert, Christian J

    2004-07-10

    Large-scale functional genomics analysis is now feasible and presents significant challenges in data analysis, storage and querying. Data standards are required to enable the development of public data repositories and to improve data sharing. There is an established data format for microarrays (microarray gene expression markup language, MAGE-ML) and a draft standard for proteomics (PEDRo). We believe that all types of functional genomics experiments should be annotated in a consistent manner, and we hope to open up new ways of comparing multiple datasets used in functional genomics. We have created a functional genomics experiment object model (FGE-OM), developed from the microarray model, MAGE-OM and two models for proteomics, PEDRo and our own model (Gla-PSI-Glasgow Proposal for the Proteomics Standards Initiative). FGE-OM comprises three namespaces representing (i) the parts of the model common to all functional genomics experiments; (ii) microarray-specific components; and (iii) proteomics-specific components. We believe that FGE-OM should initiate discussion about the contents and structure of the next version of MAGE and the future of proteomics standards. A prototype database called RNA And Protein Abundance Database (RAPAD), based on FGE-OM, has been implemented and populated with data from microbial pathogenesis. FGE-OM and the RAPAD schema are available from http://www.gusdb.org/fge.html, along with a set of more detailed diagrams. RAPAD can be accessed by registration at the site.

  14. Genomics and functional genomics in Chlamydomonas reinhardtii

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Blaby, Ian K.; Blaby-Haas, Crysten E.

    The availability of the Chlamydomonas reinhardtii nuclear genome sequence continues to enable researchers to address biological questions relevant to algae, land plants and animals in unprecedented ways. As we continue to characterize and understand biological processes in C. reinhardtii and translate that knowledge to other systems, we are faced with the realization that many genes encode proteins without a defined function. The field of functional genomics aims to close this gap between genome sequence and protein function. Transcriptomes, proteomes and phenomes can each provide layers of gene-specific functional data while supplying a global snapshot of cellular behavior under different conditions. Herein we present a brief history of functional genomics, the present status of the C. reinhardtii genome, how genome-wide experiments can aid in supplying protein function inferences, and provide an outlook for functional genomics in C. reinhardtii.

  15. Genomics and functional genomics in Chlamydomonas reinhardtii

    DOE PAGES

    Blaby, Ian K.; Blaby-Haas, Crysten E.

    2017-03-21

    The availability of the Chlamydomonas reinhardtii nuclear genome sequence continues to enable researchers to address biological questions relevant to algae, land plants and animals in unprecedented ways. As we continue to characterize and understand biological processes in C. reinhardtii and translate that knowledge to other systems, we are faced with the realization that many genes encode proteins without a defined function. The field of functional genomics aims to close this gap between genome sequence and protein function. Transcriptomes, proteomes and phenomes can each provide layers of gene-specific functional data while supplying a global snapshot of cellular behavior under different conditions. Herein we present a brief history of functional genomics, the present status of the C. reinhardtii genome, how genome-wide experiments can aid in supplying protein function inferences, and provide an outlook for functional genomics in C. reinhardtii.

  16. The path to enlightenment: making sense of genomic and proteomic information.

    PubMed

    Maurer, Martin H

    2004-05-01

    Whereas genomics describes the study of the genome, mainly represented by its gene expression on the DNA or RNA level, the term proteomics denotes the study of the proteome, which is the protein complement encoded by the genome. In recent years, the number of proteomic experiments increased tremendously. While all fields of proteomics have made major technological advances, the biggest step was seen in bioinformatics. Biological information management relies on sequence and structure databases and powerful software tools to translate experimental results into meaningful biological hypotheses and answers. In this resource article, I provide a collection of databases and software available on the Internet that are useful to interpret genomic and proteomic data. The article is a toolbox for researchers who have genomic or proteomic datasets and need to put their findings into a biological context.

  17. Principles of proteome allocation are revealed using proteomic data and genome-scale models

    PubMed Central

    Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; Ebrahim, Ali; Saunders, Michael A.; Palsson, Bernhard O.

    2016-01-01

    Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models. PMID:27857205

  18. Principles of proteome allocation are revealed using proteomic data and genome-scale models

    DOE PAGES

    Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; ...

    2016-11-18

    Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.

  19. CPTAC Proteomics Data on UCSC Genome Browser | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium scientists are working together with the University of California, Santa Cruz (UCSC) Genomics Institute to provide public access to cancer proteomics data via the UCSC Genome Browser. This effort extends accessibility of the CPTAC data to more researchers and provides an additional level of analysis to assist the cancer biology community.

  20. PeroxisomeDB: a database for the peroxisomal proteome, functional genomics and disease

    PubMed Central

    Schlüter, Agatha; Fourcade, Stéphane; Domènech-Estévez, Enric; Gabaldón, Toni; Huerta-Cepas, Jaime; Berthommier, Guillaume; Ripp, Raymond; Wanders, Ronald J. A.; Poch, Olivier; Pujol, Aurora

    2007-01-01

    Peroxisomes are essential organelles of eukaryotic origin, ubiquitously distributed in cells and organisms, playing key roles in lipid and antioxidant metabolism. Loss or malfunction of peroxisomes causes more than 20 fatal inherited conditions. We have created a peroxisomal database that includes the complete peroxisomal proteome of Homo sapiens and Saccharomyces cerevisiae, by gathering, updating and integrating the available genetic and functional information on peroxisomal genes. PeroxisomeDB is structured in interrelated sections ‘Genes’, ‘Functions’, ‘Metabolic pathways’ and ‘Diseases’, that include hyperlinks to selected features of NCBI, ENSEMBL and UCSC databases. We have designed graphical depictions of the main peroxisomal metabolic routes and have included updated flow charts for diagnosis. Precomputed BLAST, PSI-BLAST, multiple sequence alignment (MUSCLE) and phylogenetic trees are provided to assist in direct multispecies comparison to study evolutionary conserved functions and pathways. Highlights of the PeroxisomeDB include new tools developed for facilitating (i) identification of novel peroxisomal proteins, by means of identifying proteins carrying peroxisome targeting signal (PTS) motifs, (ii) detection of peroxisomes in silico, particularly useful for screening the deluge of newly sequenced genomes. PeroxisomeDB should contribute to the systematic characterization of the peroxisomal proteome and facilitate system biology approaches on the organelle. PMID:17135190
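The idea of flagging candidate peroxisomal proteins by a targeting-signal motif can be sketched as a simple pattern match. The example below is not PeroxisomeDB's predictor. It assumes the commonly cited simplified consensus for the C-terminal PTS1 tripeptide, (S/A/C)-(K/R/H)-(L/M), of which SKL is the prototype; real predictors also weigh upstream residues and other signals such as PTS2. The names `PTS1_PATTERN` and `has_pts1` and the example sequences are hypothetical.

```python
import re

# Simplified PTS1 consensus at the extreme C-terminus: [SAC][KRH][LM].
# Assumption for illustration; not the rule set used by any specific database.
PTS1_PATTERN = re.compile(r"[SAC][KRH][LM]$")


def has_pts1(protein_seq):
    """Return True if the sequence ends with a PTS1-like tripeptide."""
    # Normalise case and drop a trailing stop-codon marker, if present.
    cleaned = protein_seq.strip().upper().rstrip("*")
    return bool(PTS1_PATTERN.search(cleaned))


if __name__ == "__main__":
    # Toy sequences, not real proteins.
    candidates = {"toy1": "MAAASKL", "toy2": "MSKLAAA", "toy3": "maaaarm*"}
    for name, seq in candidates.items():
        print(name, has_pts1(seq))
```

A match on such a short motif is only a first filter; many non-peroxisomal proteins end in similar tripeptides by chance, so hits would normally be checked against further evidence.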

  1. Comparative analysis of genomics and proteomics in Bacillus thuringiensis 4.0718.

    PubMed

    Rang, Jie; He, Hao; Wang, Ting; Ding, Xuezhi; Zuo, Mingxing; Quan, Meifang; Sun, Yunjun; Yu, Ziquan; Hu, Shengbiao; Xia, Liqiu

    2015-01-01

    Bacillus thuringiensis is a widely used biopesticide that produces various insecticidal active substances during its life cycle. Separation and purification of numerous insecticidal active substances have been difficult because of the relatively short half-life of such substances. On the other hand, substances can be synthesized at different times during development, so samples at different stages have to be studied, further complicating the analysis. A dual genomic and proteomic approach would enhance our ability to identify such substances, particularly using mass spectrometry-based proteomic methods. Comparative analysis of the genomic and proteomic data showed that not all of the products deduced from the annotated genome could be identified among the proteomic data. For instance, genome annotation results showed that 39 coding sequences in the whole genome were related to insect pathogenicity, including five cry genes. However, Cry2Ab, Cry1Ia, Cytotoxin K, Bacteriocin, Exoenzyme C3 and Alveolysin could not be detected in the proteomic data obtained. The sporulation-related proteins were also compared; the results showed that the great majority of sporulation-related proteins can be detected by mass spectrometry. This analysis revealed that Spo0A~P, SigF, SigE(+), SigK(+) and SigG(+), all known to play an important role in the regulatory network of spore formation, were also present in the proteomic data. Through the comparison of the two data sets, it was possible to infer that some genes were silenced or were expressed at very low levels. For instance, cry2Ab seems to lack a functional promoter while cry1Ia may not be expressed due to the presence of transposons. With this comparative study a relatively complete database can be constructed and used to transform hereditary material, thereby prompting the high expression of toxic proteins. A theoretical basis is provided for constructing highly virulent engineered bacteria and for

  2. CPTAC Releases Largest-Ever Ovarian Cancer Proteome Dataset from Previously Genome Characterized Tumors | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have just released a comprehensive dataset of the proteomic analysis of high grade serous ovarian tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA).  This is one of the largest public datasets covering the proteome, phosphoproteome and glycoproteome with complementary deep genomic sequencing data on the same tumor.

  3. Connecting genomic alterations to cancer biology with proteomics: the NCI Clinical Proteomic Tumor Analysis Consortium.

    PubMed

    Ellis, Matthew J; Gillette, Michael; Carr, Steven A; Paulovich, Amanda G; Smith, Richard D; Rodland, Karin K; Townsend, R Reid; Kinsinger, Christopher; Mesri, Mehdi; Rodriguez, Henry; Liebler, Daniel C

    2013-10-01

    The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium is applying the latest generation of proteomic technologies to genomically annotated tumors from The Cancer Genome Atlas (TCGA) program, a joint initiative of the NCI and the National Human Genome Research Institute. By providing a fully integrated accounting of DNA, RNA, and protein abnormalities in individual tumors, these datasets will illuminate the complex relationship between genomic abnormalities and cancer phenotypes, thus producing biologic insights as well as a wave of novel candidate biomarkers and therapeutic targets amenable to verification using targeted mass spectrometry methods. ©2013 AACR.

  4. Proteomics informed by transcriptomics for characterising active transposable elements and genome annotation in Aedes aegypti.

    PubMed

    Maringer, Kevin; Yousuf, Amjad; Heesom, Kate J; Fan, Jun; Lee, David; Fernandez-Sesma, Ana; Bessant, Conrad; Matthews, David A; Davidson, Andrew D

    2017-01-19

    Aedes aegypti is a vector for the (re-)emerging human pathogens dengue, chikungunya, yellow fever and Zika viruses. Almost half of the Ae. aegypti genome is comprised of transposable elements (TEs). Transposons have been linked to diverse cellular processes, including the establishment of viral persistence in insects, an essential step in the transmission of vector-borne viruses. However, up until now it has not been possible to study the overall proteome derived from an organism's mobile genetic elements, partly due to the highly divergent nature of TEs. Furthermore, as for many non-model organisms, incomplete genome annotation has hampered proteomic studies on Ae. aegypti. We analysed the Ae. aegypti proteome using our new proteomics informed by transcriptomics (PIT) technique, which bypasses the need for genome annotation by identifying proteins through matched transcriptomic (rather than genomic) data. Our data vastly increase the number of experimentally confirmed Ae. aegypti proteins. The PIT analysis also identified hotspots of incomplete genome annotation, and showed that poor sequence and assembly quality do not explain all annotation gaps. Finally, in a proof-of-principle study, we developed criteria for the characterisation of proteomically active TEs. Protein expression did not correlate with a TE's genomic abundance at different levels of classification. Most notably, long terminal repeat (LTR) retrotransposons were markedly enriched compared to other elements. PIT was superior to 'conventional' proteomic approaches in both our transposon and genome annotation analyses. We present the first proteomic characterisation of an organism's repertoire of mobile genetic elements, which will open new avenues of research into the function of transposon proteins in health and disease. Furthermore, our study provides a proof-of-concept that PIT can be used to evaluate a genome's annotation to guide annotation efforts which has the potential to improve the

  5. Rice proteome analysis: a step toward functional analysis of the rice genome.

    PubMed

    Komatsu, Setsuko; Tanaka, Naoki

    2005-03-01

    The technique of proteome analysis using 2-DE has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this review, we describe construction of the rice proteome database, the cataloging of rice proteins, and the functional characterization of some of the proteins identified. Initially, proteins extracted from various tissues and organelles were separated by 2-DE and an image analyzer was used to construct a display or reference map of the proteins. The rice proteome database currently contains 23 reference maps based on 2-DE of proteins from different rice tissues and subcellular compartments. These reference maps comprise 13 129 rice proteins, and the amino acid sequences of 5092 of these proteins are entered in the database. Major proteins involved in growth or stress responses have been identified by using a proteomics approach and some of these proteins have unique functions. Furthermore, initial work has also begun on analyzing the phosphoproteome and protein-protein interactions in rice. The information obtained from the rice proteome database will aid in the molecular cloning of rice genes and in predicting the function of unknown proteins.

  6. Rice proteome database: a step toward functional analysis of the rice genome.

    PubMed

    Komatsu, Setsuko

    2005-09-01

    The technique of proteome analysis using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this study, the proteins of rice were cataloged, a rice proteome database was constructed, and a functional characterization of some of the identified proteins was undertaken. Proteins extracted from various tissues and subcellular compartments in rice were separated by 2D-PAGE and an image analyzer was used to construct a display of the proteins. The Rice Proteome Database contains 23 reference maps based on 2D-PAGE of proteins from various rice tissues and subcellular compartments. These reference maps comprise 13129 identified proteins, and the amino acid sequences of 5092 proteins are entered in the database. Major proteins involved in growth or stress responses were identified using the proteome approach. Some of these proteins, including a beta-tubulin, calreticulin, and ribulose-1,5-bisphosphate carboxylase/oxygenase activase in rice, have unexpected functions. The information obtained from the Rice Proteome Database will aid in cloning the genes for and predicting the function of unknown proteins.

  7. CPTAC Releases Largest-Ever Colorectal Cancer Proteome Dataset from Previously Genome Characterized Tumors | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    On September 4, 2013, NCI’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) publicly released proteomic data produced from colorectal tumor samples previously analyzed by The Cancer Genome Atlas (TCGA). This is the initial release of proteomic tumor data designed to complement genomic data on the same tumors. The data is publicly available at the CPTAC data portal.

  8. University of Victoria Genome British Columbia Proteomics Centre Partners with CPTAC | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    University of Victoria Genome British Columbia Proteomics Centre, a leader in proteomic technology development, has partnered with the U.S. National Cancer Institute (NCI) to make targeted proteomic assays accessible to the community through NCI’s CPTAC Assay Portal (https://assays.cancer.gov).

  9. How may targeted proteomics complement genomic data in breast cancer?

    PubMed

    Guerin, Mathilde; Gonçalves, Anthony; Toiron, Yves; Baudelet, Emilie; Audebert, Stéphane; Boyer, Jean-Baptiste; Borg, Jean-Paul; Camoin, Luc

    2017-01-01

    Breast cancer (BC) is the most common female cancer in the world and was recently deconstructed into different molecular entities. Although most of the recent assays to characterize tumors at the molecular level are genomic-based, proteins are the actual executors of cellular functions and represent the vast majority of targets for anticancer drugs. Accumulated data have demonstrated an important level of quantitative and qualitative discrepancies between genomic/transcriptomic alterations and their protein counterparts, mostly related to the large number of post-translational modifications. Areas covered: This review will present novel proteomics technologies such as Reverse Phase Protein Array (RPPA) or mass-spectrometry (MS) based approaches that have emerged and that could progressively replace old-fashioned methods (e.g. immunohistochemistry, ELISA, etc.) to validate proteins as diagnostic, prognostic or predictive biomarkers, and eventually monitor them in routine practice. Expert commentary: These different targeted proteomic approaches, able to complement genomic data in BC and characterize tumors more precisely, will allow a more personalized treatment for each patient and tumor.

  10. Functional proteomics within the genus Lactobacillus.

    PubMed

    De Angelis, Maria; Calasso, Maria; Cavallo, Noemi; Di Cagno, Raffaella; Gobbetti, Marco

    2016-03-01

    Lactobacillus strains are mainly used for the manufacture of fermented dairy, sourdough, meat, and vegetable foods or used as probiotics. Under optimal processing conditions, Lactobacillus strains contribute to food functionality through their enzyme portfolio and the release of metabolites. An extensive genomic diversity analysis was conducted to elucidate the core features of the genus Lactobacillus, and to provide a better comprehension of niche adaptation of the strains. However, proteomics is an indispensable "omics" science to elucidate the proteome diversity, and the mechanisms of regulation and adaptation of Lactobacillus strains. This review focuses on the novel and comprehensive knowledge of functional proteomics and metaproteomics of Lactobacillus species. A large list of proteomic case studies of different Lactobacillus species is provided to illustrate the adaptability of the main metabolic pathways (e.g., carbohydrate transport and metabolism, pyruvate metabolism, proteolytic system, amino acid metabolism, and protein synthesis) to various life conditions. These investigations have highlighted that lactobacilli modulate the level of a complex panel of proteins to grow/survive in different ecological niches. In addition to the general regulation and stress response, specific metabolic pathways can be switched on and off, modifying the behavior of the strains. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. CPTAC Releases Largest-Ever Breast Cancer Proteome Dataset from Previously Genome Characterized Tumors | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium (CPTAC) scientists have released a dataset of proteins and  phosphopeptides identified through deep proteomic and phosphoproteomic analysis of breast tumor samples, previously genomically analyzed by The Cancer Genome Atlas (TCGA).

  12. Microchip-Based Single-Cell Functional Proteomics for Biomedical Applications

    PubMed Central

    Lu, Yao; Yang, Liu; Wei, Wei; Shi, Qihui

    2017-01-01

    Cellular heterogeneity has been widely recognized but only recently have single cell tools become available that allow characterizing heterogeneity at the genomic and proteomic levels. We review the technological advances in microchip-based toolkits for single-cell functional proteomics. Each of these tools has distinct advantages and limitations, and a few have advanced toward being applied to address biological or clinical problems that fail to be addressed by traditional population-based methods. High-throughput single-cell proteomic assays generate high-dimensional data sets that contain new information and thus require developing new analytical frameworks to extract new biology. In this review article, we highlight a few biological and clinical applications in which the microchip-based single-cell proteomic tools provide unique advantages. The examples include resolving functional heterogeneity and dynamics of immune cells, dissecting cell-cell interaction by creating a well-controlled on-chip microenvironment, capturing high-resolution snapshots of immune system functions in patients for better immunotherapy and elucidating phosphoprotein signaling networks in cancer cells for guiding effective molecularly targeted therapies. PMID:28280819

  13. Comparative Bacterial Proteomics: Analysis of the Core Genome Concept

    PubMed Central

    Callister, Stephen J.; McCue, Lee Ann; Turse, Joshua E.; Monroe, Matthew E.; Auberry, Kenneth J.; Smith, Richard D.; Adkins, Joshua N.; Lipton, Mary S.

    2008-01-01

    While comparative bacterial genomic studies commonly predict a set of genes indicative of common ancestry, experimental validation of the existence of this core genome requires extensive measurement and is typically not undertaken. Enabled by an extensive proteome database developed over six years, we have experimentally verified the expression of proteins predicted from genomic ortholog comparisons among 17 environmental and pathogenic bacteria. More exclusive relationships were observed among the expressed protein content of phenotypically related bacteria, which is indicative of the specific lifestyles associated with these organisms. Although genomic studies can establish relative orthologous relationships among a set of bacteria and propose a set of ancestral genes, our proteomics study establishes expressed lifestyle differences among conserved genes and proposes a set of expressed ancestral traits. PMID:18253490
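
    The distinction this abstract draws between a predicted core genome and an experimentally expressed core can be illustrated with plain set operations. The following is a minimal sketch only, not the authors' pipeline: the organism names, ortholog-group IDs, and the helper `core_set` are hypothetical and chosen purely for illustration.

```python
from functools import reduce


def core_set(groups_by_organism):
    """Return the ortholog groups present in every organism (hypothetical helper)."""
    sets = [set(groups) for groups in groups_by_organism.values()]
    if not sets:
        return set()
    return reduce(set.intersection, sets)


# Hypothetical ortholog groups predicted from each genome (illustrative data).
predicted = {
    "organism_A": {"OG1", "OG2", "OG3", "OG4"},
    "organism_B": {"OG1", "OG2", "OG3"},
    "organism_C": {"OG1", "OG2", "OG3", "OG5"},
}

# Hypothetical ortholog groups with a protein observed by proteomics.
observed = {
    "organism_A": {"OG1", "OG2", "OG4"},
    "organism_B": {"OG1", "OG2", "OG3"},
    "organism_C": {"OG1", "OG3"},
}

# Core genome predicted from genomic comparison alone.
predicted_core = core_set(predicted)

# Subset of the predicted core whose expression is observed in every organism.
expressed_core = predicted_core & core_set(observed)

# Predicted core members lacking expression evidence in at least one organism.
unverified = predicted_core - expressed_core
```

    In this toy data the predicted core holds three groups, but only one is observed as expressed in all three organisms; the remainder would need further measurement before being treated as experimentally verified.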

  14. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database.

    PubMed

    Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi

    2017-06-23

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from

  15. Annotation of Protein Domains Reveals Remarkable Conservation in the Functional Make up of Proteomes Across Superkingdoms

    PubMed Central

    Nasir, Arshan; Naeem, Aisha; Khan, Muhammad Jawad; Lopez-Nicora, Horacio D.; Caetano-Anollés, Gustavo

    2011-01-01

    The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins encoded in the genome of an organism. The molecular functions of proteins are the direct consequence of their structure and structure can be inferred from sequence using hidden Markov models of structural recognition. Here we analyze the functional annotation of protein domain structures in almost a thousand sequenced genomes, exploring the functional and structural diversity of proteomes. We find there is a remarkable conservation in the distribution of domains with respect to the molecular functions they perform in the three superkingdoms of life. In general, most of the protein repertoire is spent in functions related to metabolic processes but there are significant differences in the usage of domains for regulatory and extra-cellular processes both within and between superkingdoms. Our results support the hypotheses that the proteomes of superkingdom Eukarya evolved via genome expansion mechanisms that were directed towards innovating new domain architectures for regulatory and extra/intracellular process functions needed for example to maintain the integrity of multicellular structure or to interact with environmental biotic and abiotic factors (e.g., cell signaling and adhesion, immune responses, and toxin production). Proteomes of microbial superkingdoms Archaea and Bacteria retained fewer domains and maintained simple and smaller protein repertoires. Viruses appear to play an important role in the evolution of superkingdoms. We finally identify a few genomic outliers that deviate significantly from the conserved functional design. These include Nanoarchaeum equitans, proteobacterial symbionts of insects with extremely reduced genomes, Tenericutes and Guillardia theta. These organisms spend most of their domains on information functions, including translation and transcription, rather than on metabolism and harbor a domain repertoire characteristic of

  16. Explore, Visualize, and Analyze Functional Cancer Proteomic Data Using the Cancer Proteome Atlas. | Office of Cancer Genomics

    Cancer.gov

    Reverse-phase protein arrays (RPPA) represent a powerful functional proteomic approach to elucidate cancer-related molecular mechanisms and to develop novel cancer therapies. To facilitate community-based investigation of the large-scale protein expression data generated by this platform, we have developed a user-friendly, open-access bioinformatic resource, The Cancer Proteome Atlas (TCPA, http://tcpaportal.org), which contains two separate web applications.

  17. High-throughput cloning and expression library creation for functional proteomics.

    PubMed

    Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua

    2013-05-01

    The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single-gene experiments, creating the need for fast, flexible, and reliable cloning systems. These collections of ORF clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial, we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator(TM) DNA Cloning System) and compare them side-by-side. We also report an example of a high-throughput cloning study and its application in functional proteomics. This tutorial is part of the International Proteomics Tutorial Programme (IPTP12). © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Characterization, design, and function of the mitochondrial proteome: from organs to organisms.

    PubMed

    Lotz, Christopher; Lin, Amanda J; Black, Caitlin M; Zhang, Jun; Lau, Edward; Deng, Ning; Wang, Yueju; Zong, Nobel C; Choi, Jeong H; Xu, Tao; Liem, David A; Korge, Paavo; Weiss, James N; Hermjakob, Henning; Yates, John R; Apweiler, Rolf; Ping, Peipei

    2014-02-07

    Mitochondria are a common energy source for organs and organisms; their diverse functions are specialized according to the unique phenotypes of their hosting environment. Perturbation of mitochondrial homeostasis accompanies significant pathological phenotypes. However, the connections between mitochondrial proteome properties and function remain to be experimentally established on a systematic level. This uncertainty impedes the contextualization and translation of proteomic data to the molecular derivations of mitochondrial diseases. We present a collection of mitochondrial features and functions from four model systems, including two cardiac mitochondrial proteomes from distinct genomes (human and mouse), two unique organ mitochondrial proteomes from identical genetic codons (mouse heart and mouse liver), as well as a relevant metazoan out-group (Drosophila). The data, composed of mitochondrial protein abundance and their biochemical activities, capture the core functionalities of these mitochondria. This investigation allowed us to redefine the core mitochondrial proteome from organs and organisms, as well as the relevant contributions from genetic information and hosting milieu. Our study has identified significant enrichment of disease-associated genes and their products. Furthermore, correlational analyses suggest that mitochondrial proteome design is primarily driven by cellular environment. Taken together, these results connect proteome features with mitochondrial function, providing a prospective resource for mitochondrial pathophysiology and developing novel therapeutic targets in medicine.

  19. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis

    PubMed Central

    Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md. Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj

    2016-01-01

    Current Zika virus (ZIKV) outbreaks that have spread in several areas of Africa, Southeast Asia, and the Pacific islands have been declared a global health emergency by the World Health Organization (WHO). ZIKV causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform, ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to ZIKV genomic, proteomic and therapeutic knowledge. It comprises whole genome sequences and their respective functional information regarding proteins, genes, and structural content. Additionally, it delivers sophisticated analyses such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at the whole genome and proteome level with a user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents, namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates. PMID:27633273

  20. Plant functional genomics

    NASA Astrophysics Data System (ADS)

    Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf

    2002-04-01

    Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly, the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.

  1. High-Throughput Cloning and Expression Library Creation for Functional Proteomics

    PubMed Central

    Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua

    2013-01-01

    The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particularly important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of a high-throughput cloning study and its application in functional proteomics. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047

  2. VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data.

    PubMed

    Peterson, Elena S; McCue, Lee Ann; Schrimpe-Rutledge, Alexandra C; Jensen, Jeffrey L; Walker, Hyunjoo; Kobold, Markus A; Webb, Samantha R; Payne, Samuel H; Ansong, Charles; Adkins, Joshua N; Cannon, William R; Webb-Robertson, Bobbie-Jo M

    2012-04-05

    The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information; however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA), a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes through the integration of proteomics and transcriptomics data with current genome location coordinates. VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA, or potential coding regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to show the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via

  3. GENOMIC AND PROTEOMIC TECHNIQUES APPLIED TO REPRODUCTIVE BIOLOGY

    EPA Science Inventory

    Genomic and proteomic techniques applied to reproductive biology
    John C. Rockett
    Reproductive Toxicology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, United States Environmental Protection Agency, Research Tria...

  4. Integrated proteomic and genomic analysis of colorectal cancer

    Cancer.gov

    Investigators who analyzed 95 human colorectal tumor samples have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, pro

  5. Genomics, transcriptomics and proteomics to elucidate the pathogenesis of rheumatoid arthritis.

    PubMed

    Song, Xinqiang; Lin, Qingsong

    2017-08-01

    Rheumatoid arthritis is an autoimmune disease that affects several organs and tissues, predominantly the synovial joints. The pathogenesis of this disease is not completely understood and may involve genomic variations, gene expression, protein translation and post-translational modifications. These system variations in genomics, transcriptomics and proteomics are dynamic in nature and their crosstalk is overwhelmingly complex, thus analyzing them separately may not be very informative. However, various '-omics' techniques developed in recent years have opened up new possibilities for clarifying disease pathways and thereby facilitating early diagnosis and specific therapies. This review examines how recent advances in the fields of genomics, transcriptomics and proteomics have contributed to our understanding of rheumatoid arthritis.

  6. VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data

    PubMed Central

    2012-01-01

    Background: The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information; however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA), a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes through the integration of proteomics and transcriptomics data with current genome location coordinates. Results: VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA, or potential coding regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to show the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. Conclusions: VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic

  7. Proteomic and genomic studies of non-alcoholic fatty liver disease - clues in the pathogenesis

    PubMed Central

    Lim, Jun Wei; Dillon, John; Miller, Michael

    2014-01-01

    Non-alcoholic fatty liver disease (NAFLD) is a widely prevalent hepatic disorder that covers a wide spectrum of liver pathology. NAFLD is strongly associated with liver inflammation, metabolic hyperlipidaemia and insulin resistance. Frequently, NAFLD has been considered as the hepatic manifestation of metabolic syndrome. The pathophysiology of NAFLD has not been fully elucidated. Some patients can remain in the stage of simple steatosis, which generally is a benign condition, whereas others can develop liver inflammation and progress into non-alcoholic steatohepatitis, fibrosis, cirrhosis and hepatocellular carcinoma. The mechanism behind the progression is still not fully understood. Much ongoing proteomic research has focused on discovering unbiased circulating biochemical markers to allow early detection and treatment of NAFLD. Comprehensive genomic studies have also begun to provide new insights into gene polymorphisms to understand patient-disease variations. Therefore, NAFLD is considered a complex and multifactorial disease phenotype resulting from environmental exposures acting on a susceptible polygenic background. This paper reviews the current status of proteomic and genomic studies that have contributed to the understanding of NAFLD pathogenesis. For the proteomics section, this review highlights functional proteins that are involved in: (1) transportation; (2) metabolic pathway; (3) acute phase reaction; (4) anti-inflammatory; (5) extracellular matrix; and (6) immune system. In the genomic studies, this review discusses genes that are involved in: (1) lipolysis; (2) adipokines; and (3) cytokine production. PMID:25024592

  8. Genomics and proteomics in liver fibrosis and cirrhosis

    PubMed Central

    2012-01-01

    Genomics and proteomics have become increasingly important in biomedical science in the past decade, as they provide an opportunity for hypothesis-free experiments that can yield major insights not previously foreseen when scientific and clinical questions are based only on hypothesis-driven approaches. Use of these tools, therefore, opens new avenues for uncovering physiological and pathological pathways. Liver fibrosis is a complex disease provoked by a range of chronic injuries to the liver, among which are viral hepatitis, (non-) alcoholic steatohepatitis and autoimmune disorders. Some chronic liver patients will never develop fibrosis or cirrhosis, whereas others rapidly progress towards cirrhosis in a few years. This variety can be caused by disease-related factors (for example, viral genotype) or host-factors (genetic/epigenetic). It is vital to establish accurate tools to identify those patients at highest risk for disease severity or progression in order to determine who are in need of immediate therapies. Moreover, there is an urgent imperative to identify non-invasive markers that can accurately distinguish mild and intermediate stages of fibrosis. Ideally, biomarkers can be used to predict disease progression and treatment response, but these studies will take many years due to the requirement for lengthy follow-up periods to assess outcomes. Current genomic and proteomic research provides many candidate biomarkers, but independent validation of these biomarkers is lacking, and reproducibility is still a key concern. Thus, great opportunities and challenges lie ahead in the field of genomics and proteomics, which, if successful, could transform the diagnosis and treatment of chronic fibrosing liver diseases. PMID:22214245

  9. Bacterial membrane proteomics.

    PubMed

    Poetsch, Ansgar; Wolters, Dirk

    2008-10-01

    About one quarter to one third of all bacterial genes encode proteins of the inner or outer bacterial membrane. These proteins perform essential physiological functions, such as the import or export of metabolites, the homeostasis of metal ions, the extrusion of toxic substances or antibiotics, and the generation or conversion of energy. The last years have witnessed completion of a plethora of whole-genome sequences of bacteria important for biotechnology or medicine, which is the foundation for proteome and other functional genome analyses. In this review, we discuss the challenges in membrane proteome analysis, starting from sample preparation and leading to MS-data analysis and quantification. The current state of available proteomics technologies as well as their advantages and disadvantages will be described with a focus on shotgun proteomics. Then, we will briefly introduce the most abundant proteins and protein families present in bacterial membranes before bacterial membrane proteomics studies of the last years will be presented. It will be shown how these works enlarged our knowledge about the physiological adaptations that take place in bacteria during fine chemical production, bioremediation, protein overexpression, and during infections. Furthermore, several examples from literature demonstrate the suitability of membrane proteomics for the identification of antigens and different pathogenic strains, as well as the elucidation of membrane protein structure and function.

  10. CPTAC researchers report first large-scale integrated proteomic and genomic analysis of a human cancer | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    Investigators from the National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC), who comprehensively analyzed 95 human colorectal tumor samples, have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, provides a more comprehensive view of the biological features that drive cancer than genomic analysis alone and may help identify the most important targets for cancer detection and intervention.

  11. Genomes, Proteomes and the Central Dogma

    PubMed Central

    Franklin, Sarah; Vondriska, Thomas M.

    2011-01-01

    Systems biology, with its associated technologies of proteomics, genomics and metabolomics, is driving the evolution of our understanding of cardiovascular physiology. Rather than studying individual molecules or even single reactions, a systems approach allows integration of orthogonal datasets from distinct tiers of biological data, including gene, RNA, protein, metabolite and other component networks. Together these networks give rise to emergent properties of cellular function and it is their reprogramming that causes disease. We present five observations regarding how systems biology is guiding a revisiting of the central dogma: (i) de-emphasizing the unidirectional flow of information from genes to proteins; (ii) revealing the role of modules of molecules as opposed to individual proteins acting in isolation; (iii) enabling discovery of novel emergent properties; (iv) demonstrating the importance of networks in biology; and (v) adding new dimensionality to the study of biological systems. PMID:22010165

  12. Functional analysis of proteins and protein species using shotgun proteomics and linear mathematics.

    PubMed

    Hoehenwarter, Wolfgang; Chen, Yanmei; Recuenco-Munoz, Luis; Wienkoop, Stefanie; Weckwerth, Wolfram

    2011-07-01

    Covalent post-translational modification of proteins is the primary modulator of protein function in the cell. It greatly expands the functional potential of the proteome compared to the genome. In the past few years, shotgun proteomics-based research, where the proteome is digested into peptides prior to mass spectrometric analysis, has been prolific in this area. It has determined the kinetics of tens of thousands of sites of covalent modification on an equally large number of proteins under various biological conditions and uncovered a transiently active regulatory network that extends into diverse branches of cellular physiology. In this review, we discuss this work in light of the concept of protein speciation, which emphasizes the entire post-translationally modified molecule and its interactions, and not just the modification site, as the functional entity. Sometimes, particularly when considering complex multisite modification, all of the modified molecular species involved in the investigated condition, the protein species, must be completely resolved for full understanding. We present a mathematical technique that delivers a good approximation for shotgun proteomics data.

  13. Proteomics and comparative genomics of Nitrososphaera viennensis reveal the core genome and adaptations of archaeal ammonia oxidizers

    PubMed Central

    Kerou, Melina; Offre, Pierre; Valledor, Luis; Abby, Sophie S.; Melcher, Michael; Nagler, Matthias; Weckwerth, Wolfram; Schleper, Christa

    2016-01-01

    Ammonia-oxidizing archaea (AOA) are among the most abundant microorganisms and key players in the global nitrogen and carbon cycles. They share a common energy metabolism but represent a heterogeneous group with respect to their environmental distribution and adaptions, growth requirements, and genome contents. We report here the genome and proteome of Nitrososphaera viennensis EN76, the type species of the archaeal class Nitrososphaeria of the phylum Thaumarchaeota encompassing all known AOA. N. viennensis is a soil organism with a 2.52-Mb genome and 3,123 predicted protein-coding genes. Proteomic analysis revealed that nearly 50% of the predicted genes were translated under standard laboratory growth conditions. Comparison with genomes of closely related species of the predominantly terrestrial Nitrososphaerales as well as the more streamlined marine Nitrosopumilales [Candidatus (Ca.) order] and the acidophile “Ca. Nitrosotalea devanaterra” revealed a core genome of AOA comprising 860 genes, which allowed for the reconstruction of central metabolic pathways common to all known AOA and expressed in the N. viennensis and “Ca. Nitrosopelagicus brevis” proteomes. Concomitantly, we were able to identify candidate proteins for as yet unidentified crucial steps in central metabolisms. In addition to unraveling aspects of core AOA metabolism, we identified specific metabolic innovations associated with the Nitrososphaerales mediating growth and survival in the soil milieu, including the capacity for biofilm formation, cell surface modifications and cell adhesion, and carbohydrate conversions as well as detoxification of aromatic compounds and drugs. PMID:27864514

  14. Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger.

    PubMed

    Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J

    2009-02-04

    Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method.

  15. Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger

    PubMed Central

    Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J

    2009-01-01

    Background Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). Results 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. Conclusion This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method. PMID:19193216
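
    The abstract above mentions reverse database searching to reach an acceptable false discovery rate (FDR). As a rough illustration only, a generic target-decoy estimate can be sketched as below. This is not the authors' Average Peptide Scoring method; the function names, the score scale, and the toy data are all assumptions made for the example.

```python
from typing import List, Optional, Tuple

# Each peptide-spectrum match (PSM) is (score, is_decoy). Decoy hits come from
# searching a reversed copy of the protein database (hypothetical toy data).
PSM = Tuple[float, bool]


def decoy_fdr(psms: List[PSM], threshold: float) -> float:
    """Estimate FDR at a score threshold as (#decoy hits) / (#target hits)."""
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    if targets == 0:
        return 0.0  # no accepted target hits, so nothing to be wrong about
    return decoys / targets


def lowest_threshold_at_fdr(psms: List[PSM], max_fdr: float) -> Optional[float]:
    """Return the lowest observed score whose estimated FDR is <= max_fdr."""
    for threshold in sorted({score for score, _ in psms}):
        if decoy_fdr(psms, threshold) <= max_fdr:
            return threshold
    return None


# Hypothetical scores, highest first; True marks a decoy (reversed-sequence) hit.
psms = [(9.0, False), (8.5, False), (8.0, False), (7.0, True),
        (6.5, False), (6.0, False), (5.0, True), (4.0, False)]

print(decoy_fdr(psms, 6.0))
print(lowest_threshold_at_fdr(psms, 0.05))
```

    In this sketch, tightening the accepted FDR pushes the score cutoff upward, which is the usual trade-off between the number of peptide identifications and confidence in them.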

  16. Global analyses of Ceratocystis cacaofunesta mitochondria: from genome to proteome

    PubMed Central

    2013-01-01

    Background The ascomycete fungus Ceratocystis cacaofunesta is the causal agent of wilt disease in cacao, which results in significant economic losses in the affected producing areas. Despite the economic importance of the Ceratocystis complex of species, no genomic data are available for any of its members. Given that mitochondria play important roles in fungal virulence and the susceptibility/resistance of fungi to fungicides, we performed the first functional analysis of this organelle in Ceratocystis using integrated “omics” approaches. Results The C. cacaofunesta mitochondrial genome (mtDNA) consists of a single, 103,147-bp circular molecule, making this the second largest mtDNA among the Sordariomycetes. Bioinformatics analysis revealed the presence of 15 conserved genes and 37 intronic open reading frames in C. cacaofunesta mtDNA. Here, we predicted the mitochondrial proteome (mtProt) of C. cacaofunesta, which is comprised of 1,124 polypeptides - 52 proteins that are mitochondrially encoded and 1,072 that are nuclearly encoded. Transcriptome analysis revealed 33 probable novel genes. Comparisons among the Gene Ontology results of the predicted mtProt of C. cacaofunesta, Neurospora crassa and Saccharomyces cerevisiae revealed no significant differences. Moreover, C. cacaofunesta mitochondria were isolated, and the mtProt was subjected to mass spectrometric analysis. The experimental proteome validated 27% of the predicted mtProt. Our results confirmed the existence of 110 hypothetical proteins and 7 novel proteins of which 83 and 1, respectively, had putative mitochondrial localization. Conclusions The present study provides the first partial genomic analysis of a species of the Ceratocystis genus and the first predicted mitochondrial protein inventory of a phytopathogenic fungus. In addition to the known mitochondrial role in pathogenicity, our results demonstrated that the global function analysis of this organelle is similar in pathogenic and non

  17. Global analyses of Ceratocystis cacaofunesta mitochondria: from genome to proteome.

    PubMed

    Ambrosio, Alinne Batista; do Nascimento, Leandro Costa; Oliveira, Bruno V; Teixeira, Paulo José P L; Tiburcio, Ricardo A; Toledo Thomazella, Daniela P; Leme, Adriana F P; Carazzolle, Marcelo F; Vidal, Ramon O; Mieczkowski, Piotr; Meinhardt, Lyndel W; Pereira, Gonçalo A G; Cabrera, Odalys G

    2013-02-11

    The ascomycete fungus Ceratocystis cacaofunesta is the causal agent of wilt disease in cacao, which results in significant economic losses in the affected producing areas. Despite the economic importance of the Ceratocystis complex of species, no genomic data are available for any of its members. Given that mitochondria play important roles in fungal virulence and the susceptibility/resistance of fungi to fungicides, we performed the first functional analysis of this organelle in Ceratocystis using integrated "omics" approaches. The C. cacaofunesta mitochondrial genome (mtDNA) consists of a single, 103,147-bp circular molecule, making this the second largest mtDNA among the Sordariomycetes. Bioinformatics analysis revealed the presence of 15 conserved genes and 37 intronic open reading frames in C. cacaofunesta mtDNA. Here, we predicted the mitochondrial proteome (mtProt) of C. cacaofunesta, which is comprised of 1,124 polypeptides - 52 proteins that are mitochondrially encoded and 1,072 that are nuclearly encoded. Transcriptome analysis revealed 33 probable novel genes. Comparisons among the Gene Ontology results of the predicted mtProt of C. cacaofunesta, Neurospora crassa and Saccharomyces cerevisiae revealed no significant differences. Moreover, C. cacaofunesta mitochondria were isolated, and the mtProt was subjected to mass spectrometric analysis. The experimental proteome validated 27% of the predicted mtProt. Our results confirmed the existence of 110 hypothetical proteins and 7 novel proteins of which 83 and 1, respectively, had putative mitochondrial localization. The present study provides the first partial genomic analysis of a species of the Ceratocystis genus and the first predicted mitochondrial protein inventory of a phytopathogenic fungus. In addition to the known mitochondrial role in pathogenicity, our results demonstrated that the global function analysis of this organelle is similar in pathogenic and non-pathogenic fungi, suggesting that its

  18. Evaluation of a genome-scale in silico metabolic model for Geobacter metallireducens by using proteomic data from a field biostimulation experiment.

    PubMed

    Fang, Yilin; Wilkins, Michael J; Yabusaki, Steven B; Lipton, Mary S; Long, Philip E

    2012-12-01

    Accurately predicting the interactions between microbial metabolism and the physical subsurface environment is necessary to enhance subsurface energy development, soil and groundwater cleanup, and carbon management. This study was an initial attempt to confirm the metabolic functional roles within an in silico model using environmental proteomic data collected during field experiments. Shotgun global proteomics data collected during a subsurface biostimulation experiment were used to validate a genome-scale metabolic model of Geobacter metallireducens, specifically the ability of the metabolic model to predict metal reduction, biomass yield, and growth rate under dynamic field conditions. The constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes. Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low abundances of proteins associated with amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment.

  19. Comprehensive genome-wide proteomic analysis of human placental tissue for the Chromosome-Centric Human Proteome Project.

    PubMed

    Lee, Hyoung-Joo; Jeong, Seul-Ki; Na, Keun; Lee, Min Jung; Lee, Sun Hee; Lim, Jong-Sun; Cha, Hyun-Jeong; Cho, Jin-Young; Kwon, Ja-Young; Kim, Hoguen; Song, Si Young; Yoo, Jong Shin; Park, Young Mok; Kim, Hail; Hancock, William S; Paik, Young-Ki

    2013-06-07

    As a starting point of the Chromosome-Centric Human Proteome Project (C-HPP), we established strategies of genome-wide proteomic analysis, including protein identification, quantitation of disease-specific proteins, and assessment of post-translational modifications, using paired human placental tissues from healthy and preeclampsia patients. This analysis resulted in identification of 4239 unique proteins with high confidence (two or more unique peptides with a false discovery rate less than 1%), covering 21% of approximately 20,059 (Ensembl v69, Oct 2012) human proteins, among which 28 were differentially expressed, preeclampsia-specific proteins. When these proteins are assigned to all human chromosomes, the pattern of the newly identified placental protein population is proportional to that of the gene count distribution of each chromosome. We also identified 219 unique N-linked glycopeptides, 592 unique phosphopeptides, and 66 chromosome 13-specific proteins. In particular, protein evidence of 14 genes previously known to be specifically up-regulated in human placenta was verified by mass spectrometry. With respect to the functional implication of these proteins, 38 proteins were found to be involved in regulatory factor biosynthesis or the immune system in the placenta, but the molecular mechanism of these proteins during pregnancy warrants further investigation. As far as we know, this work produced the highest number of proteins identified in the placenta and will be useful for annotating and mapping all proteins encoded in the human genome.

  20. Proteomic approaches in brain research and neuropharmacology.

    PubMed

    Vercauteren, Freya G G; Bergeron, John J M; Vandesande, Frans; Arckens, Lut; Quirion, Rémi

    2004-10-01

    Numerous applications of genomic technologies have enabled the assembly of unprecedented inventories of genes, expressed in cells under specific physiological and pathophysiological conditions. Complementing the valuable information generated through functional genomics with the integrative knowledge of protein expression and function should enable the development of more efficient diagnostic tools and therapeutic agents. Proteomic analyses are particularly suitable to elucidate posttranslational modifications, expression levels and protein-protein interactions of thousands of proteins at a time. In this review, two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) investigations of brain tissues in neurodegenerative diseases such as Alzheimer's disease, Down syndrome and schizophrenia, and the construction of 2D-PAGE proteome maps of the brain are discussed. The role of the Human Proteome Organization (HUPO) as an international coordinating organization for proteomic efforts, as well as challenges for proteomic technologies and data analysis are also addressed. It is expected that the use of proteomic strategies will have significant impact in neuropharmacology over the coming decade.

  1. The Role of Clinical Proteomics, Lipidomics, and Genomics in the Diagnosis of Alzheimer's Disease.

    PubMed

    Martins, Ian James

    2016-03-31

    The early diagnosis of Alzheimer's disease (AD) has become important to the reversal and treatment of neurodegeneration, which may be relevant to premature brain aging that is associated with chronic disease progression. Clinical proteomics allows the detection of various proteins in fluids such as the urine, plasma, and cerebrospinal fluid for the diagnosis of AD. Interest in lipidomics has accelerated with plasma testing for various lipid biomarkers that may with clinical proteomics provide a more reproducible diagnosis for early brain aging that is connected to other chronic diseases. The combination of proteomics with lipidomics may decrease the biological variability between studies and provide reproducible results that detect a community's susceptibility to AD. The diagnosis of chronic disease associated with AD that now involves genomics may provide increased sensitivity to avoid inadvertent errors related to plasma versus cerebrospinal fluid testing by proteomics and lipidomics that identify new disease biomarkers in body fluids, cells, and tissues. The diagnosis of AD by various plasma biomarkers with clinical proteomics may now require the involvement of lipidomics and genomics to provide interpretation of proteomic results from various laboratories around the world.

  2. Highlights of recent articles on data mining in genomics & proteomics

    USDA-ARS's Scientific Manuscript database

    This editorial elaborates on investigations consisting of different “OMICS” technologies and their application to biological sciences. In addition, advantages and recent development of the proteomic, genomic and data mining technologies are discussed. This information will be useful to scientists ...

  3. Genome-wide proteomics analysis on longissimus muscles in Qinchuan beef cattle.

    PubMed

    He, Hua; Chen, Si; Liang, Wei; Liu, Xiaolin

    2017-04-01

    To gain further insight into the molecular mechanism of bovine muscle development, we combined mass spectrometry characterization of proteins with Illumina deep sequencing of RNAs obtained from bovine longissimus muscle (LD) at prenatal and postnatal stages. For the proteomic study, each group of LD proteins was extracted and labeled using isobaric tags for relative and absolute quantitation (iTRAQ) method. Among the 1321 proteins identified from six samples, 390 proteins were differentially expressed in embryos at day 135 post-fertilization (Emb135d) vs. 30-month-old adult cattle (Emb135d vs. 30M) samples. Gene Ontology, Cluster of Orthologous Groups and Kyoto Encyclopedia of Genes and Genomes analyses were further conducted to better understand the different functions. Furthermore, we analyzed the relationship between transcript and protein regulation between samples by direct comparison of expression levels from transcriptomic and iTRAQ-based proteomics. Association results indicated that 1295 of 1321 proteins could be mapped to transcriptome sequencing data. This study provides the most comprehensive, targeted survey of bovine LD proteins to date and has shown the power of combining transcriptomic and proteomic approaches to provide molecular insights for understanding the developmental characteristics in bovine muscle, and even in other mammals. © 2016 Stichting International Foundation for Animal Genetics.

  4. Systems biology definition of the core proteome of metabolism and expression is consistent with high-throughput data.

    PubMed

    Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O

    2015-08-25

    Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.

  5. Encapsulated in silica: genome, proteome and physiology of the thermophilic bacterium Anoxybacillus flavithermus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Saw, Jimmy H; Mountain, Bruce W; Feng, Lu

    Gram-positive bacteria of the genus Anoxybacillus have been found in diverse thermophilic habitats, such as geothermal hot springs and manure, and in processed foods such as gelatin and milk powder. Anoxybacillus flavithermus is a facultatively anaerobic bacterium found in super-saturated silica solutions and in opaline silica sinter. The ability of A. flavithermus to grow in super-saturated silica solutions makes it an ideal subject to study the processes of sinter formation, which might be similar to the biomineralization processes that occurred at the dawn of life. We report here the complete genome sequence of A. flavithermus strain WK1, isolated from the waste water drain at the Wairakei geothermal power station in New Zealand. It consists of a single chromosome of 2,846,746 base pairs and is predicted to encode 2,863 proteins. In silico genome analysis identified several enzymes that could be involved in silica adaptation and biofilm formation, and their predicted functions were experimentally validated in vitro. Proteomic analysis confirmed the regulation of biofilm-related proteins and crucial enzymes for the synthesis of long-chain polyamines as constituents of silica nanospheres. Microbial fossils preserved in silica and silica sinters are excellent objects for studying ancient life, a new paleobiological frontier. An integrated analysis of the A. flavithermus genome and proteome provides the first glimpse of metabolic adaptation during silicification and sinter formation. Comparative genome analysis suggests an extensive gene loss in the Anoxybacillus/Geobacillus branch after its divergence from other bacilli.

  6. Spermatogenesis in mammals: proteomic insights.

    PubMed

    Chocu, Sophie; Calvel, Pierre; Rolland, Antoine D; Pineau, Charles

    2012-08-01

    Spermatogenesis is a highly sophisticated process involved in the transmission of genetic heritage. It includes halving ploidy, repackaging of the chromatin for transport, and the equipment of developing spermatids and eventually spermatozoa with the advanced apparatus (e.g., tightly packed mitochondrial sheath in the mid piece, elongating of the tail, reduction of cytoplasmic volume) to elicit motility once they reach the epididymis. Mammalian spermatogenesis is divided into three phases. In the first the primitive germ cells or spermatogonia undergo a series of mitotic divisions. In the second the spermatocytes undergo two consecutive divisions in meiosis to produce haploid spermatids. In the third the spermatids differentiate into spermatozoa in a process called spermiogenesis. Paracrine, autocrine, juxtacrine, and endocrine pathways all contribute to the regulation of the process. The array of structural elements and chemical factors modulating somatic and germ cell activity is such that the network linking the various cellular activities during spermatogenesis is unimaginably complex. Over the past two decades, advances in genomics have greatly improved our knowledge of spermatogenesis, by identifying numerous genes essential for the development of functional male gametes. Large-scale analyses of testicular function have deepened our insight into normal and pathological spermatogenesis. Progress in genome sequencing and microarray technology has been exploited for genome-wide expression studies, leading to the identification of hundreds of genes differentially expressed within the testis. However, although proteomics has now come of age, the proteomics-based investigation of spermatogenesis remains in its infancy. Here, we review the state-of-the-art of large-scale proteomic analyses of spermatogenesis, from germ cell development during sex determination to spermatogenesis in the adult. Indeed, a few laboratories have undertaken differential protein profiling

  7. Computational functional genomics-based approaches in analgesic drug discovery and repurposing.

    PubMed

    Lippmann, Catharina; Kringel, Dario; Ultsch, Alfred; Lötsch, Jörn

    2018-06-01

    Persistent pain is a major healthcare problem affecting a fifth of adults worldwide with still limited treatment options. The search for new analgesics increasingly includes the novel research area of functional genomics, which combines data derived from various processes related to DNA sequence, gene expression or protein function and uses advanced methods of data mining and knowledge discovery with the goal of understanding the relationship between the genome and the phenotype. Its use in drug discovery and repurposing for analgesic indications has so far been performed using knowledge discovery in gene function and drug target-related databases; next-generation sequencing; and functional proteomics-based approaches. Here, we discuss recent efforts in functional genomics-based approaches to analgesic drug discovery and repurposing and highlight the potential of computational functional genomics in this field including a demonstration of the workflow using a novel R library 'dbtORA'.

  8. Evaluation of a Genome-Scale In Silico Metabolic Model for Geobacter metallireducens Using Proteomic Data from a Field Biostimulation Experiment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fang, Yilin; Wilkins, Michael J.; Yabusaki, Steven B.

    2012-12-12

    Biomass and shotgun global proteomics data that reflected relative protein abundances from samples collected during the 2008 experiment at the U.S. Department of Energy Integrated Field-Scale Subsurface Research Challenge site in Rifle, Colorado, provided an unprecedented opportunity to validate a genome-scale metabolic model of Geobacter metallireducens and assess its performance with respect to prediction of metal reduction, biomass yield, and growth rate under dynamic field conditions. Reconstructed from annotated genomic sequence, biochemical, and physiological data, the constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes. Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low fluxes through amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment.

  9. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant

    DOE PAGES

    Brooks, Brandon; Mueller, R. S.; Young, Jacque C.; ...

    2015-07-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13–21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains of Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential.

  10. Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled

    PubMed Central

    Brbić, Maria; Warnecke, Tobias; Kriško, Anita; Supek, Fran

    2015-01-01

    The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and may confer advantages at the DNA/RNA level. These influences overlap in protein-coding sequences, making it difficult to gauge their relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts. However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little cross-species variability remains unexplained (91% of variance accounted for). Qualitatively similar results were obtained for 49 fungal genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environment-related AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in overall AAC appear to occur almost exclusively through factors shaping the global oligonucleotide content of the genome. We discuss these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that adaptation takes place at the protein level. PMID:25971281
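
    The three composition measures compared in the abstract above (G + C content, oligonucleotide composition, and proteome amino acid composition) are all simple frequency vectors. The sketch below shows one way to compute them, assuming plain sequence strings as input; the function names are hypothetical, and the regression and phylogenetic corrections used in the study are not reproduced here.

```python
from collections import Counter


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def dinucleotide_freqs(seq):
    """Relative frequencies of overlapping dinucleotides."""
    seq = seq.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    return {pair: n / len(pairs) for pair, n in counts.items()}


def amino_acid_composition(proteins):
    """Relative amino acid frequencies pooled over a list of protein sequences."""
    counts = Counter("".join(proteins).upper())
    total = sum(counts.values())
    if total == 0:
        return {}
    return {aa: n / total for aa, n in counts.items()}


# Toy sequences, invented for illustration only.
print(gc_content("ATGC"))
print(dinucleotide_freqs("AAAT"))
print(amino_acid_composition(["MKK", "M"]))
```

    G + C content collapses a sequence to a single number, whereas the dinucleotide vector has up to 16 components, which is one reason it can carry more information about downstream amino acid usage.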

  11. Resources for Functional Genomics Studies in Drosophila melanogaster

    PubMed Central

    Mohr, Stephanie E.; Hu, Yanhui; Kim, Kevin; Housden, Benjamin E.; Perrimon, Norbert

    2014-01-01

    Drosophila melanogaster has become a system of choice for functional genomic studies. Many resources, including online databases and software tools, are now available to support design or identification of relevant fly stocks and reagents or analysis and mining of existing functional genomic, transcriptomic, proteomic, etc. datasets. These include large community collections of fly stocks and plasmid clones, “meta” information sites like FlyBase and FlyMine, and an increasing number of more specialized reagents, databases, and online tools. Here, we introduce key resources useful to plan large-scale functional genomics studies in Drosophila and to analyze, integrate, and mine the results of those studies in ways that facilitate identification of highest-confidence results and generation of new hypotheses. We also discuss ways in which existing resources can be used and might be improved and suggest a few areas of future development that would further support large- and small-scale studies in Drosophila and facilitate use of Drosophila information by the research community more generally. PMID:24653003

  12. Genomics, transcriptomics and proteomics: enabling insights into social evolution and disease challenges for managed and wild bees.

    PubMed

    Trapp, Judith; McAfee, Alison; Foster, Leonard J

    2017-02-01

    Globally, there are over 20 000 bee species (Hymenoptera: Apoidea: Anthophila) with a host of biologically fascinating characteristics. Although they have long been studied as models for social evolution, recent challenges to bee health (mainly diseases and pesticides) have gathered the attention of both public and research communities. Genome sequences of twelve bee species are now complete or in progress, facilitating the application of additional 'omic technologies. Here, we review recent developments in honey bee and native bee research in the genomic era. We discuss the progress in genome sequencing and functional annotation, followed by the enabled comparative genomics, proteomics and transcriptomics applications regarding social evolution and health. Finally, we end with comments on future challenges in the postgenomic era. © 2016 John Wiley & Sons Ltd.

  13. Functional Genomics in the Study of Mind-Body Therapies

    PubMed Central

    Niles, Halsey; Mehta, Darshan H.; Corrigan, Alexandra A.; Bhasin, Manoj K.; Denninger, John W.

    2014-01-01

    Background Mind-body therapies (MBTs) are used throughout the world in treatment, disease prevention, and health promotion. However, the mechanisms by which MBTs exert their positive effects are not well understood. Investigations into MBTs using functional genomics have revolutionized the understanding of MBT mechanisms and their effects on human physiology. Methods We searched the literature for the effects of MBTs on functional genomics determinants using MEDLINE, supplemented by a manual search of additional journals and a reference list review. Results We reviewed 15 trials that measured global or targeted transcriptomic, epigenomic, or proteomic changes in peripheral blood. Sample sizes ranged from small pilot studies (n=2) to large trials (n=500). While the reliability of individual genes from trial to trial was often inconsistent, genes related to inflammatory response, particularly those involved in the nuclear factor-kappa B (NF-κB) pathway, were consistently downregulated across most studies. Conclusion In general, existing trials focusing on gene expression changes brought about by MBTs have revealed intriguing connections to the immune system through the NF-κB cascade, to telomere maintenance, and to apoptotic regulation. However, these findings are limited to a small number of trials and relatively small sample sizes. More rigorous randomized controlled trials of healthy subjects and specific disease states are warranted. Future research should investigate functional genomics areas both upstream and downstream of MBT-related gene expression changes—from epigenomics to proteomics and metabolomics. PMID:25598735

  14. Functional genomics in the study of mind-body therapies.

    PubMed

    Niles, Halsey; Mehta, Darshan H; Corrigan, Alexandra A; Bhasin, Manoj K; Denninger, John W

    2014-01-01

    Mind-body therapies (MBTs) are used throughout the world in treatment, disease prevention, and health promotion. However, the mechanisms by which MBTs exert their positive effects are not well understood. Investigations into MBTs using functional genomics have revolutionized the understanding of MBT mechanisms and their effects on human physiology. We searched the literature for the effects of MBTs on functional genomics determinants using MEDLINE, supplemented by a manual search of additional journals and a reference list review. We reviewed 15 trials that measured global or targeted transcriptomic, epigenomic, or proteomic changes in peripheral blood. Sample sizes ranged from small pilot studies (n=2) to large trials (n=500). While the reliability of individual genes from trial to trial was often inconsistent, genes related to inflammatory response, particularly those involved in the nuclear factor-kappa B (NF-κB) pathway, were consistently downregulated across most studies. In general, existing trials focusing on gene expression changes brought about by MBTs have revealed intriguing connections to the immune system through the NF-κB cascade, to telomere maintenance, and to apoptotic regulation. However, these findings are limited to a small number of trials and relatively small sample sizes. More rigorous randomized controlled trials of healthy subjects and specific disease states are warranted. Future research should investigate functional genomics areas both upstream and downstream of MBT-related gene expression changes, from epigenomics to proteomics and metabolomics.

  15. Proteobionics: biomimetics in proteomics.

    PubMed

    Sommer, Andrei P; Gheorghiu, Eleonora

    2006-03-01

    Proteomics was established 10 years ago by the analysis of microbial genomes via their protein complement or proteome. Bionics is an ancient art, which converts structures optimized by nature into advanced technical products. Previously, we analyzed survival modalities in nanobacteria and converted the interplay between survival-oriented protein functions and nanoscale mineral shells into models for advanced drug delivery. Exploiting protein functions observed in nature to design biomedical products and therapies could be named proteobionics. Here, we present examples for this new branch of nanoproteomics.

  16. Current advances in esophageal cancer proteomics.

    PubMed

    Uemura, Norihisa; Kondo, Tadashi

    2015-06-01

    We review the current status of proteomics for esophageal cancer (EC) from a clinician's viewpoint. The ultimate goal of cancer proteomics is the improvement of clinical outcome. The proteome as a functional translation of the genome is a straightforward representation of genomic mechanisms that trigger carcinogenesis. Cancer proteomics has identified the mechanisms of carcinogenesis and tumor progression, detected biomarker candidates for early diagnosis, and provided novel therapeutic targets for personalized treatments. Our review focuses on three major topics in EC proteomics: diagnostics, treatment, and molecular mechanisms. We discuss the major histological differences between EC types, i.e., esophageal squamous cell carcinoma and adenocarcinoma, and evaluate the clinical significance of published proteomics studies, including promising diagnostic biomarkers and novel therapeutic targets, which should be further validated prior to launching clinical trials. Multi-disciplinary collaborations between basic scientists, clinicians, and pathologists should be established for inter-institutional validation. In conclusion, EC proteomics has provided significant results, which after thorough validation, should lead to the development of novel clinical tools and improvement of the clinical outcome for esophageal cancer patients. This article is part of a Special Issue entitled: Medical Proteomics. Copyright © 2014 Elsevier B.V. All rights reserved.

  17. Effects of Space Environment on Genome, Transcriptome, and Proteome of Klebsiella pneumoniae.

    PubMed

    Guo, Yinghua; Li, Jia; Liu, Jinwen; Wang, Tong; Li, Yinhu; Yuan, Yanting; Zhao, Jiao; Chang, De; Fang, Xiangqun; Li, Tianzhi; Wang, Junfeng; Dai, Wenkui; Fang, Chengxiang; Liu, Changting

    2015-11-01

    The aim of this study was to explore the effects of space flight on Klebsiella pneumoniae. A strain of K. pneumoniae was sent to space for 398 h aboard the ShenZhou VIII spacecraft during November 1, 2011-November 17, 2011. At the same time, a ground simulation with similar temperature conditions during the space flight was performed as a control. After the space mission, the flight and control strains were analyzed using phenotypic, genomic, transcriptomic and proteomic techniques. The flight strain LCT-KP289 exhibited a higher cotrimoxazole resistance level and changes in metabolism relative to the ground control strain LCT-KP214. After the space flight, 73 SNPs and a plasmid copy number variation were identified in the flight strain. Based on the transcriptomic analysis, there are 232 upregulated and 1879 downregulated genes, of which almost all were for metabolism. Proteomic analysis revealed that there were 57 upregulated and 125 downregulated proteins. These differentially expressed proteins had several functions that included energy production and conversion, carbohydrate transport and metabolism, translation, ribosomal structure and biogenesis, posttranslational modification, protein turnover, and chaperone functions. At a systems biology level, the ytfG gene had a synonymous mutation that resulted in significantly downregulated expression at both transcriptomic and proteomic levels. The mutation of the ytfG gene may influence fructose and mannose metabolic processes of K. pneumoniae during space flight, which may be beneficial to the field of space microbiology, providing potential therapeutic strategies to combat or prevent infection in astronauts. Copyright © 2015 IMSS. Published by Elsevier Inc. All rights reserved.

  18. The complete genome sequence and proteomics of Yersinia pestis phage Yep-phi.

    PubMed

    Zhao, Xiangna; Wu, Weili; Qi, Zhizhen; Cui, Yujun; Yan, Yanfeng; Guo, Zhaobiao; Wang, Zuyun; Wang, Hu; Deng, Haijun; Xue, Yan; Chen, Weijun; Wang, Xiaoyi; Yang, Ruifu

    2011-01-01

    Yep-phi, a lytic phage of Yersinia pestis, was isolated in China and is routinely used as a diagnostic phage for the identification of the plague pathogen. Yep-phi has an isometric hexagonal head containing dsDNA and a short non-contractile conical tail. In this study, we sequenced the Yep-phi genome (GenBank accession no. HQ333270) and performed proteomics analysis. The genome consists of 38,616 bp of DNA, including direct terminal repeats of 222 bp, and is predicted to contain 45 ORFs. Most structural proteins were identified by proteomics analysis. Compared with the three available genome sequences of lytic phages for Y. pestis, the phages could be divided into two subgroups. Yep-phi displays marked homology to the bacteriophages Berlin (GenBank accession no. AM183667) and Yepe2 (GenBank accession no. EU734170), and these comprise one subgroup. The other subgroup is represented by bacteriophage ΦA1122 (GenBank accession no. AY247822). Potential recombination was detected among the Yep-phi subgroup.
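
    A direct terminal repeat, as reported for the genome above, means the same sequence appears in the same orientation at both ends of the linear genome. The sketch below illustrates the idea by searching for the longest exact prefix that is also a suffix. It is a simplification under stated assumptions: the function name is hypothetical, only exact matches are considered, and it is not the method used by the authors.

```python
def direct_terminal_repeat_length(genome, max_len=1000):
    """Length of the longest exact prefix of genome that is also its suffix.

    Only non-overlapping repeats are considered (at most half the genome),
    and the search is capped at max_len bases. Returns 0 if none is found.
    """
    genome = genome.upper()
    longest = min(max_len, len(genome) // 2)
    for k in range(longest, 0, -1):
        if genome[:k] == genome[-k:]:
            return k
    return 0


# Toy sequence, invented for illustration: "ACGT" repeated at both ends.
toy_genome = "ACGT" + "TTTTGGGG" + "ACGT"
print(direct_terminal_repeat_length(toy_genome))  # prints 4
```

    On real data, very short matches can occur by chance, so a minimum length threshold and some tolerance for sequencing errors would likely be needed.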

  19. The Use of Functional Genomics in Conjunction with Metabolomics for Mycobacterium tuberculosis Research

    PubMed Central

    Swanepoel, Conrad C.

    2014-01-01

    Tuberculosis (TB), caused by Mycobacterium tuberculosis, is a fatal infectious disease, resulting in 1.4 million deaths globally per annum. Over the past three decades, genomic studies have been conducted in an attempt to elucidate the functionality of the genome of the pathogen. However, many aspects of this complex genome remain largely unexplored, as approaches like genomics, proteomics, and transcriptomics have failed to characterize them successfully. In turn, metabolomics, which is relatively new to the “omics” revolution, has shown great potential for investigating biological systems or their modifications. Furthermore, when these data are interpreted in combination with previously acquired genomics, proteomics and transcriptomics data, using what is termed a systems biology approach, a more holistic understanding of these systems can be achieved. In this review we discuss how metabolomics has contributed so far to characterizing TB, with emphasis on the resulting improved elucidation of M. tuberculosis in terms of (1) metabolism, (2) growth and replication, (3) pathogenicity, and (4) drug resistance, from the perspective of systems biology. PMID:24771957

  20. Impact of nanoscale topography on genomics and proteomics of adherent bacteria.

    PubMed

    Rizzello, Loris; Sorce, Barbara; Sabella, Stefania; Vecchio, Giuseppe; Galeone, Antonio; Brunetti, Virgilio; Cingolani, Roberto; Pompa, Pier Paolo

    2011-03-22

    Bacterial adhesion onto inorganic/nanoengineered surfaces is a key issue in biotechnology and medicine, because it is one of the first necessary steps to determine a general pathogenic event. Understanding the molecular mechanisms of bacteria-surface interaction represents a milestone for planning a new generation of devices with unanimously certified antibacterial characteristics. Here, we show how highly controlled nanostructured substrates impact the bacterial behavior in terms of morphological, genomic, and proteomic response. We observed by atomic force microscopy (AFM) and scanning electron microscopy (SEM) that type-1 fimbriae typically disappear in Escherichia coli adherent onto nanostructured substrates, as opposed to bacteria onto reference glass or flat gold surfaces. A genetic variation of the fimbrial operon regulation was consistently identified by real time qPCR in bacteria interacting with the nanorough substrates. To gain a deeper insight into the molecular basis of the interaction mechanisms, we explored the entire proteomic profile of E. coli by 2D-DIGE, finding significant changes in the bacteria adherent onto the nanorough substrates, such as regulations of proteins involved in stress processes and defense mechanisms. We thus demonstrated that a pure physical stimulus, that is, a nanoscale variation of surface topography, may play per se a significant role in determining the morphological, genetic, and proteomic profile of bacteria. These data suggest that in depth investigations of the molecular processes of microorganisms adhering to surfaces are of great importance for the design of innovative biomaterials with active biological functionalities.

  1. Encapsulated in silica: genome, proteome and physiology of the thermophilic bacterium Anoxybacillus flavithermus WK1

    PubMed Central

    Saw, Jimmy H; Mountain, Bruce W; Feng, Lu; Omelchenko, Marina V; Hou, Shaobin; Saito, Jennifer A; Stott, Matthew B; Li, Dan; Zhao, Guang; Wu, Junli; Galperin, Michael Y; Koonin, Eugene V; Makarova, Kira S; Wolf, Yuri I; Rigden, Daniel J; Dunfield, Peter F; Wang, Lei; Alam, Maqsudul

    2008-01-01

    Background Gram-positive bacteria of the genus Anoxybacillus have been found in diverse thermophilic habitats, such as geothermal hot springs and manure, and in processed foods such as gelatin and milk powder. Anoxybacillus flavithermus is a facultatively anaerobic bacterium found in super-saturated silica solutions and in opaline silica sinter. The ability of A. flavithermus to grow in super-saturated silica solutions makes it an ideal subject to study the processes of sinter formation, which might be similar to the biomineralization processes that occurred at the dawn of life. Results We report here the complete genome sequence of A. flavithermus strain WK1, isolated from the waste water drain at the Wairakei geothermal power station in New Zealand. It consists of a single chromosome of 2,846,746 base pairs and is predicted to encode 2,863 proteins. In silico genome analysis identified several enzymes that could be involved in silica adaptation and biofilm formation, and their predicted functions were experimentally validated in vitro. Proteomic analysis confirmed the regulation of biofilm-related proteins and crucial enzymes for the synthesis of long-chain polyamines as constituents of silica nanospheres. Conclusions Microbial fossils preserved in silica and silica sinters are excellent objects for studying ancient life, a new paleobiological frontier. An integrated analysis of the A. flavithermus genome and proteome provides the first glimpse of metabolic adaptation during silicification and sinter formation. Comparative genome analysis suggests an extensive gene loss in the Anoxybacillus/Geobacillus branch after its divergence from other bacilli. PMID:19014707

  2. Wheat proteomics: proteome modulation and abiotic stress acclimation

    PubMed Central

    Komatsu, Setsuko; Kamal, Abu H. M.; Hossain, Zahed

    2014-01-01

    Cellular mechanisms of stress sensing and signaling represent the initial plant responses to adverse conditions. The development of high-throughput “Omics” techniques has initiated a new era of the study of plant molecular strategies for adapting to environmental changes. However, the elucidation of stress adaptation mechanisms in plants requires the accurate isolation and characterization of stress-responsive proteins. Because the functional part of the genome, namely the proteins and their post-translational modifications, is critical for plant stress responses, proteomic studies provide comprehensive information about the fine-tuning of cellular pathways that are primarily involved in stress mitigation. This review summarizes the major proteomic findings related to alterations in the wheat proteomic profile in response to abiotic stresses. Moreover, the strengths and weaknesses of different sample preparation techniques, including subcellular protein extraction protocols, are discussed in detail. The continued development of proteomic approaches in combination with rapidly evolving bioinformatics tools and interactive databases will facilitate understanding of the plant mechanisms underlying stress tolerance. PMID:25538718

  3. Evolution of complexity in the zebrafish synapse proteome

    PubMed Central

    Bayés, Àlex; Collins, Mark O.; Reig-Viader, Rita; Gou, Gemma; Goulding, David; Izquierdo, Abril; Choudhary, Jyoti S.; Emes, Richard D.; Grant, Seth G. N.

    2017-01-01

    The proteome of human brain synapses is highly complex and is mutated in over 130 diseases. This complexity arose from two whole-genome duplications early in the vertebrate lineage. Zebrafish are used in modelling human diseases; however, their synapse proteome is uncharacterized, and whether the teleost-specific genome duplication (TSGD) influenced complexity is unknown. We report the characterization of the proteomes and ultrastructure of central synapses in zebrafish and analyse the importance of the TSGD. While the TSGD increases overall synapse proteome complexity, the postsynaptic density (PSD) proteome of zebrafish has lower complexity than mammals. A highly conserved set of ∼1,000 proteins is shared across vertebrates. PSD ultrastructural features are also conserved. Lineage-specific proteome differences indicate that vertebrate species evolved distinct synapse types and functions. The data sets are a resource for a wide range of studies and have important implications for the use of zebrafish in modelling human synaptic diseases. PMID:28252024

  4. Microbial genomics, transcriptomics and proteomics: new discoveries in decomposition research using complementary methods.

    PubMed

    Baldrian, Petr; López-Mondéjar, Rubén

    2014-02-01

    Molecular methods for the analysis of biomolecules have undergone rapid technological development in the last decade. The advent of next-generation sequencing methods and improvements in instrumental resolution enabled the analysis of complex transcriptome, proteome and metabolome data, as well as a detailed annotation of microbial genomes. The mechanisms of decomposition by model fungi have been described in unprecedented detail by the combination of genome sequencing, transcriptomics and proteomics. The increasing number of available genomes for fungi and bacteria shows that the genetic potential for decomposition of organic matter is widespread among taxonomically diverse microbial taxa, while expression studies document the importance of the regulation of expression in decomposition efficiency. Importantly, high-throughput methods of nucleic acid analysis used for the analysis of metagenomes and metatranscriptomes indicate the high diversity of decomposer communities in natural habitats and their taxonomic composition. Today, the metaproteomics of natural habitats is of interest. In combination with advanced analytical techniques to explore the products of decomposition and the accumulation of information on the genomes of environmentally relevant microorganisms, advanced methods in microbial ecophysiology should increase our understanding of the complex processes of organic matter transformation.

  5. Salivary biomarker development using genomic, proteomic and metabolomic approaches

    PubMed Central

    2012-01-01

    The use of saliva as a diagnostic sample provides a non-invasive, cost-efficient method of sample collection for disease screening without the need for highly trained professionals. Saliva collection is far more practical and safe compared with invasive methods of sample collection, because of the infection risk from contaminated needles during, for example, blood sampling. Furthermore, the use of saliva could increase the availability of accurate diagnostics for remote and impoverished regions. However, the development of salivary diagnostics has required technical innovation to allow stabilization and detection of analytes in the complex molecular mixture that is saliva. The recent development of cost-effective room temperature analyte stabilization methods, nucleic acid pre-amplification techniques and direct saliva transcriptomic analysis have allowed accurate detection and quantification of transcripts found in saliva. Novel protein stabilization methods have also facilitated improved proteomic analyses. Although candidate biomarkers have been discovered using epigenetic, transcriptomic, proteomic and metabolomic approaches, transcriptomic analyses have so far achieved the most progress in terms of sensitivity and specificity, and progress towards clinical implementation. Here, we review recent developments in salivary diagnostics that have been accomplished using genomic, transcriptomic, proteomic and metabolomic approaches. PMID:23114182

  6. A-to-I RNA Editing Contributes to Proteomic Diversity in Cancer. | Office of Cancer Genomics

    Cancer.gov

    Adenosine (A) to inosine (I) RNA editing introduces many nucleotide changes in cancer transcriptomes. However, due to the complexity of post-transcriptional regulation, the contribution of RNA editing to proteomic diversity in human cancers remains unclear. Here, we performed an integrated analysis of TCGA genomic data and CPTAC proteomic data. Despite limited site diversity, we demonstrate that A-to-I RNA editing contributes to proteomic diversity in breast cancer through changes in amino acid sequences. We validate the presence of editing events at both RNA and protein levels.

  7. Application of proteomics to ecology and population biology.

    PubMed

    Karr, T L

    2008-02-01

    Proteomics is a relatively new scientific discipline that merges protein biochemistry, genome biology and bioinformatics to determine the spatial and temporal expression of proteins in cells, tissues and whole organisms. There has been very little application of proteomics to the fields of behavioral genetics, evolution, ecology and population dynamics, and it has only recently been effectively applied to the closely allied fields of molecular evolution and genetics. However, there exists considerable potential for proteomics to have an impact in areas related to functional ecology; this review will introduce the general concepts and methodologies that define the field of proteomics and compare and contrast the advantages and disadvantages with other methods. Examples of how proteomics can aid, complement and indeed extend the study of functional ecology will be discussed, including the main tool of ecological studies, population genetics, with an emphasis on metapopulation structure analysis. Because proteomic analyses provide a direct measure of gene expression, they obviate some of the limitations associated with other genomic approaches, such as microarray and EST analyses. Likewise, in conjunction with associated bioinformatics and molecular evolutionary tools, proteomics can provide the foundation of a systems-level integration approach that can enhance ecological studies. It can be envisioned that proteomics will provide important new information on issues specific to metapopulation biology and adaptive processes in nature. A specific example of the application of proteomics to sperm ageing is provided to illustrate the potential utility of the approach.

  8. Prediction of vaccine candidates against Pseudomonas aeruginosa: An integrated genomics and proteomics approach.

    PubMed

    Rashid, Muhammad Ibrahim; Naz, Anam; Ali, Amjad; Andleeb, Saadia

    2017-07-01

    Pseudomonas aeruginosa is among the top critical nosocomial infectious agents due to its persistent infections and tendency for acquiring drug resistance mechanisms. To date, there is no vaccine available for this pathogen. We attempted to exploit the genomic and proteomic information of P. aeruginosa through reverse-vaccinology approaches to unveil prospective vaccine candidates. The P. aeruginosa strain PAO1 genome was subjected to a sequential prioritization approach following genomic, proteomic and structural analyses. Among the predicted vaccine candidates, surface components of antibiotic efflux pumps (Q9HY88, PA2837), chaperone-usher pathway components (CupC2, CupB3), penicillin binding protein of bacterial cell wall (PBP1a/mrcA), extracellular component of Type 3 secretory system (PscC) and three uncharacterized secretory proteins (PA0629, PA2822, PA0978) were identified as potential candidates meeting all the set criteria. These proteins were then analyzed for potential immunogenic surface exposed epitopes. These predicted epitopes may provide a basis for development of a reliable subunit vaccine against P. aeruginosa. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

    PubMed Central

    Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.

    2016-01-01

    Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acid repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions, indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility and stability and by acting as linker elements between domains. Comparison of SAAR-containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
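
    As an illustration of what a single amino acid repeat is, the sketch below scans a protein sequence for runs of one residue. It is a minimal example and not the authors' pipeline: the function name `find_saars` and the minimum run length of 5 are assumptions made here, since the abstract does not state the threshold used.

```python
import re
from typing import List, Tuple


def find_saars(protein: str, min_len: int = 5) -> List[Tuple[str, int, int]]:
    """Return (residue, start, length) for every run of a single amino acid
    that is at least min_len residues long.

    min_len = 5 is an illustrative assumption, not a value from the abstract.
    """
    # ([A-Z]) captures one residue; \1{n,} requires at least n further copies,
    # so the whole match is a run of min_len or more identical residues.
    pattern = re.compile(r"([A-Z])\1{%d,}" % (min_len - 1))
    return [
        (m.group(1), m.start(), len(m.group(0)))
        for m in pattern.finditer(protein.upper())
    ]


if __name__ == "__main__":
    # Toy sequence, not taken from any real proteome.
    for residue, start, length in find_saars("MKKKKKKAQQQQQLSS"):
        print(residue, start, length)
```

    Working at the protein level rather than the DNA level matters here: a run such as QQQQQ can be encoded by mixed codons (CAG and CAA), which is the case the abstract describes as a SAAR that does not stem from an SSR.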

  10. Exploring hepsin functional genetic variation association with disease specific protein expression in bipolar disorder: Applications of a proteomic informed genomic approach.

    PubMed

    Nassan, Malik; Jia, Yun-Fang; Jenkins, Greg; Colby, Colin; Feeder, Scott; Choi, Doo-Sup; Veldic, Marin; McElroy, Susan L; Bond, David J; Weinshilboum, Richard; Biernacka, Joanna M; Frye, Mark A

    2017-12-01

    In a prior discovery study, increased levels of serum Growth Differentiation Factor 15 (GDF15), Hepsin (HPN), and Matrix Metalloproteinase-7 (MMP7) were observed in bipolar depressed patients vs controls. This exploratory post-hoc analysis applied a proteomic-informed genomic research strategy to study the potential functional role of these proteins in bipolar disorder (BP). Utilizing the Genotype-Tissue Expression (GTEx) database to identify cis-acting blood expression quantitative trait loci (cis-eQTLs), five eQTL variants from the HPN gene were analyzed for association with BP cases using genotype data of cases from the discovery study (n = 58) versus healthy controls (n = 777). After adjusting for relevant covariates, we analyzed the relationship between these 5 cis-eQTLs and HPN serum level in the BP cases. All 5 cis-eQTL minor alleles were significantly more frequent in BP cases vs controls [(rs62122114, OR = 1.6, p = 0.02), (rs67003112, OR = 1.6, p = 0.02), (rs4997929, OR = 1.7, p = 0.01), (rs12610663, OR = 1.7, p = 0.01), (rs62122148, OR = 1.7, P = 0.01)]. The minor allele (A) in rs62122114 was significantly associated with increased serum HPN level in BP cases (Beta = 0.12, P = 0.049). However, this same minor allele was associated with reduced gene expression in GTEx controls. These exploratory analyses suggest that genetic variation in/near the gene encoding for hepsin protein may influence risk of bipolar disorder. This genetic variation, at least for the rs62122114-A allele, may have functional impact (i.e. differential expression) as evidenced by serum HPN protein expression. Although limited by small sample size, this study highlights the merits of proteomic informed functional genomic studies as a tool to investigate with greater precision the genetic risk of bipolar disorder and secondary relationships to protein expression recognizing, and encouraging in subsequent studies, high likelihood of epigenetic modification of

  11. Ascribing Functions to Genes: Journey Towards Genetic Improvement of Rice Via Functional Genomics

    PubMed Central

    Mustafiz, Ananda; Kumari, Sumita; Karan, Ratna

    2016-01-01

    Rice, one of the most important cereal crops for mankind, feeds more than half the world population. Rice has been heralded as a model cereal owing to its small genome size, amenability to easy transformation, high synteny to other cereal crops and availability of complete genome sequence. Moreover, sequence wealth in rice is getting more refined and precise due to resequencing efforts. This humungous resource of sequence data has confronted research fraternity with a herculean challenge as well as an excellent opportunity to functionally validate expressed as well as regulatory portions of the genome. This will not only help us in understanding the genetic basis of plant architecture and physiology but would also steer us towards developing improved cultivars. No single technique can achieve such a mammoth task. Functional genomics through its diverse tools viz. loss and gain of function mutants, multifarious omics strategies like transcriptomics, proteomics, metabolomics and phenomics provide us with the necessary handle. A paradigm shift in technological advances in functional genomics strategies has been instrumental in generating considerable amount of information w.r.t functionality of rice genome. We now have several databases and online resources for functionally validated genes but despite that we are far from reaching the desired milestone of functionally characterizing each and every rice gene. There is an urgent need for a common platform, for information already available in rice, and collaborative efforts between researchers in a concerted manner as well as healthy public-private partnership, for genetic improvement of rice crop better able to handle the pressures of climate change and exponentially increasing population. PMID:27252584

  12. Pre-fractionation strategies to resolve pea (Pisum sativum) sub-proteomes

    PubMed Central

    Meisrimler, Claudia-Nicole; Menckhoff, Ljiljana; Kukavica, Biljana M.; Lüthje, Sabine

    2015-01-01

    Legumes are important crop plants and pea (Pisum sativum L.) has been investigated as a model with respect to several physiological aspects. The sequencing of the pea genome has not been completed. Therefore, proteomic approaches are currently limited. Nevertheless, the increasing numbers of available EST-databases as well as the high homology of the pea and medicago genome (Medicago truncatula Gaertner) allow the successful identification of proteins. Due to the un-sequenced pea genome, pre-fractionation approaches have been used in pea proteomic surveys in the past. Aside from a number of selective proteome studies on crude extracts and the chloroplast, few studies have targeted other components such as the pea secretome, an important sub-proteome of interest due to its role in abiotic and biotic stress processes. The secretome itself can be further divided into different sub-proteomes (plasma membrane, apoplast, cell wall proteins). Cell fractionation in combination with different gel-electrophoresis, chromatography methods and protein identification by mass spectrometry are important partners to gain insight into pea sub-proteomes, post-translational modifications and protein functions. Overall, pea proteomics needs to link numerous existing physiological and biochemical data to gain further insight into adaptation processes, which play important roles in field applications. Future developments and directions in pea proteomics are discussed. PMID:26539198

  13. Proteome studies of filamentous fungi.

    PubMed

    Baker, Scott E; Panisko, Ellen A

    2011-01-01

    The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide variety of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, nongel-based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of variations on the general methods and technologies for identifying peptides in a given sample. We present a method that can serve as a "baseline" for proteomic studies of fungi.

  14. Genome-wide identification of the subcellular localization of the Escherichia coli B proteome using experimental and computational methods.

    PubMed

    Han, Mee-Jung; Yun, Hongseok; Lee, Jeong Wook; Lee, Yu Hyun; Lee, Sang Yup; Yoo, Jong-Shin; Kim, Jin Young; Kim, Jihyun F; Hur, Cheol-Goo

    2011-04-01

    Escherichia coli K-12 and B strains have most widely been employed for scientific studies as well as industrial applications. Recently, the complete genome sequences of two representative descendants of E. coli B strains, REL606 and BL21(DE3), have been determined. Here, we report the subproteome reference maps of E. coli B REL606 by analyzing cytoplasmic, periplasmic, inner and outer membrane, and extracellular proteomes based on the genome information using experimental and computational approaches. Among the total of 3487 spots, 651 proteins including 410 non-redundant proteins were identified and characterized by 2-DE and LC-MS/MS; they include 440 cytoplasmic, 45 periplasmic, 50 inner membrane, 61 outer membrane, and 55 extracellular proteins. In addition, subcellular localizations of all 4205 ORFs of E. coli B were predicted by combined computational prediction methods. The subcellular localizations of 1812 (43.09%) proteins of currently unknown function were newly assigned. The results of computational prediction were also compared with the experimental results, showing that overall precision and recall were 92.16 and 92.16%, respectively. This work represents the most comprehensive analyses of the subproteomes of E. coli B, and will be useful as a reference for proteome profiling studies under various conditions. The complete proteome data are available online (http://ecolib.kaist.ac.kr). Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
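
    The comparison between predicted and experimentally observed localizations can be pictured with the sketch below, which computes precision and recall per compartment. How the authors aggregated these metrics across compartments is not stated in the abstract, so the per-class treatment, the function name and the toy protein identifiers are all assumptions.

```python
from collections import Counter
from typing import Dict, Tuple


def per_class_precision_recall(
    predicted: Dict[str, str], observed: Dict[str, str]
) -> Dict[str, Tuple[float, float]]:
    """Compute (precision, recall) for each localization class.

    Only proteins present in both mappings are scored.
    """
    true_pos, n_predicted, n_observed = Counter(), Counter(), Counter()
    for protein_id in predicted.keys() & observed.keys():
        pred, obs = predicted[protein_id], observed[protein_id]
        n_predicted[pred] += 1
        n_observed[obs] += 1
        if pred == obs:
            true_pos[pred] += 1

    scores = {}
    for cls in set(n_predicted) | set(n_observed):
        precision = true_pos[cls] / n_predicted[cls] if n_predicted[cls] else 0.0
        recall = true_pos[cls] / n_observed[cls] if n_observed[cls] else 0.0
        scores[cls] = (precision, recall)
    return scores


if __name__ == "__main__":
    # Hypothetical identifiers and labels, for illustration only.
    predicted = {"p1": "cytoplasmic", "p2": "cytoplasmic",
                 "p3": "periplasmic", "p4": "outer membrane"}
    observed = {"p1": "cytoplasmic", "p2": "periplasmic",
                "p3": "periplasmic", "p4": "outer membrane"}
    for cls, (p, r) in sorted(per_class_precision_recall(predicted, observed).items()):
        print(f"{cls}: precision={p:.2f} recall={r:.2f}")
```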

  15. Biogeoscience from a Metallomic and Proteomic Perspective

    NASA Astrophysics Data System (ADS)

    Anbar, A. D.; Shock, E.

    2004-12-01

    In the wake of the genomics revolution, life scientists are expanding their focus from the genome to the "proteome" - the assemblage of all proteins in a cell - and the "metallome" - the distribution of inorganic species in a cell. The proteome and metallome are tightly connected because proteins and protein products are intimately involved in the transport and homeostasis of inorganic elements, and because many enzymes depend on inorganic elements for catalytic activity. Together, they are at the heart of metabolic function. Unlike the relatively static genome, the proteome and metallome are extremely dynamic, changing rapidly in response to environmental cues. They are substantially more complex than the genome; for example, in humans, some 30,000 genes code for approximately 500,000 proteins. Metaphorically, the proteome and metallome constitute the complex, dynamic "language" by which the genome and the environment communicate. Therefore biogeochemists, like life scientists, are moving beyond a strictly genomic perspective. Research guided by proteomic and metallomic perspectives and methodologies should provide new insights into the connections between life and the inorganic Earth in modern environments, and the evolution of these connections through time. For example, biogeochemical research in modern environments, such as Yellowstone hot springs, is hindered by the gap between genomic determinations of metabolic potential in ecosystems and geochemical characterizations of the energetic boundary conditions faced by these ecosystems; genomics tells us "who is there" and geochemistry tells us "what they might be doing", but neither genomics nor geochemistry easily provide quantitative information about which metabolisms are actually active or a framework for understanding why ecosystems do not fully exploit the energy available in their surroundings. 
    Such questions are fundamentally kinetic rather than thermodynamic and therefore demand that we characterize and

  16. USING GENOMICS AND PROTEOMICS TO DIAGNOSE EXPOSURE OF AQUATIC ORGANISMS TO ENVIRONMENTAL CONTAMINANTS

    EPA Science Inventory

    Advances in molecular biology allow the use of cutting-edge genomic and proteomic tools to assess the effects of environmental contaminants on aquatic organisms. Techniques are available to measure changes in expression of single genes (quantitative real-time PCR) or to measure g...

  17. Hands-on workshops as an effective means of learning advanced technologies including genomics, proteomics and bioinformatics.

    PubMed

    Reisdorph, Nichole; Stearman, Robert; Kechris, Katerina; Phang, Tzu Lip; Reisdorph, Richard; Prenni, Jessica; Erle, David J; Coldren, Christopher; Schey, Kevin; Nesvizhskii, Alexey; Geraci, Mark

    2013-12-01

    Genomics and proteomics have emerged as key technologies in biomedical research, resulting in a surge of interest in training by investigators keen to incorporate these technologies into their research. At least two types of training can be envisioned in order to produce meaningful results, quality publications and successful grant applications: (1) immediate short-term training workshops and (2) long-term graduate education or visiting scientist programs. We aimed to fill the former need by providing a comprehensive hands-on training course in genomics, proteomics and informatics in a coherent, experimentally-based framework. This was accomplished through a National Heart, Lung, and Blood Institute (NHLBI)-sponsored 10-day Genomics and Proteomics Hands-on Workshop held at National Jewish Health (NJH) and the University of Colorado School of Medicine (UCD). The course content included comprehensive lectures and laboratories in mass spectrometry and genomics technologies, extensive hands-on experience with instrumentation and software, video demonstrations, optional workshops, online sessions, invited keynote speakers, and local and national guest faculty. Here we describe the detailed curriculum and present the results of short- and long-term evaluations from course attendees. Our educational program consistently received positive reviews from participants and had a substantial impact on grant writing and review, manuscript submissions and publications. Copyright © 2013. Production and hosting by Elsevier Ltd.

  18. Proteome Studies of Filamentous Fungi

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baker, Scott E.; Panisko, Ellen A.

    2011-04-20

    The continued fast pace of fungal genome sequence generation has enabled proteomic analysis of a wide variety of organisms that span the breadth of the Kingdom Fungi. There is some phylogenetic bias to the current catalog of fungi with reasonable DNA sequence databases (genomic or EST) that could be analyzed at a global proteomic level. However, the rapid development of next generation sequencing platforms has lowered the cost of genome sequencing such that in the near future, having a genome sequence will no longer be a time or cost bottleneck for downstream proteomic (and transcriptomic) analyses. High throughput, non-gel based proteomics offers a snapshot of proteins present in a given sample at a single point in time. There are a number of different variations on the general method and technologies for identifying peptides in a given sample. We present a method that can serve as a “baseline” for proteomic studies of fungi.

  19. Recent advances in proteomics of cereals.

    PubMed

    Bansal, Monika; Sharma, Madhu; Kanwar, Priyanka; Goyal, Aakash

    Cereals contribute a major part of human nutrition and are considered as an integral source of energy for human diets. With genomic databases already available in cereals such as rice, wheat, barley, and maize, the focus has now moved to proteome analysis. Proteomics studies involve the development of appropriate databases based on developing suitable separation and purification protocols, identification of protein functions, and can confirm their functional networks based on already available data from other sources. Tremendous progress has been made in the past decade in generating huge data-sets for covering interactions among proteins, protein composition of various organs and organelles, quantitative and qualitative analysis of proteins, and to characterize their modulation during plant development, biotic, and abiotic stresses. Proteomics platforms have been used to identify and improve our understanding of various metabolic pathways. This article gives a brief review of efforts made by different research groups on comparative descriptive and functional analysis of proteomics applications achieved in the cereal science so far.

  20. Proteomics in medical microbiology.

    PubMed

    Cash, P

    2000-04-01

    The techniques of proteomics (high resolution two-dimensional electrophoresis and protein characterisation) are widely used for microbiological research to analyse global protein synthesis as an indicator of gene expression. The rapid progress in microbial proteomics has been achieved through the wide availability of whole genome sequences for a number of bacterial groups. Beyond providing a basic understanding of microbial gene expression, proteomics has also played a role in medical areas of microbiology. Progress has been made in the use of the techniques for investigating the epidemiology and taxonomy of human microbial pathogens, the identification of novel pathogenic mechanisms and the analysis of drug resistance. In each of these areas, proteomics has provided new insights that complement genomic-based investigations. This review describes the current progress in these research fields and highlights some of the technical challenges existing for the application of proteomics in medical microbiology. The latter concern the analysis of genetically heterogeneous bacterial populations and the integration of the proteomic and genomic data for these bacteria. The characterisation of the proteomes of bacterial pathogens growing in their natural hosts remains a future challenge.

  1. Investigation of Yersinia pestis laboratory adaptation through a combined genomics and proteomics approach

    DOE PAGES

    Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.; ...

    2015-11-24

    Here, the bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experiment (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.

  2. Investigation of Yersinia pestis laboratory adaptation through a combined genomics and proteomics approach

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.

    Here, the bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experiment (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.

  3. Birth of plant proteomics in India: a new horizon.

    PubMed

    Narula, Kanika; Pandey, Aarti; Gayali, Saurabh; Chakraborty, Niranjan; Chakraborty, Subhra

    2015-09-08

    In the post-genomic era, proteomics is acknowledged as the next frontier for biological research. Although India has a long and distinguished tradition in protein research, the initiation of proteomics studies was a new horizon. Protein research witnessed enormous progress in protein separation, high-resolution refinements, biochemical identification of the proteins, protein-protein interaction, and structure-function analysis. Plant proteomics research, in India, began its journey on investigation of the proteome profiling, complexity analysis, protein trafficking, and biochemical modeling. The research article by Bhushan et al. in 2006 marked the birth of the plant proteomics research in India. Since then plant proteomics studies expanded progressively and are now being carried out in various institutions spread across the country. The compilation presented here seeks to trace the history of development in the area during the past decade based on publications till date. In this review, we emphasize on outcomes of the field providing prospects on proteomic pathway analyses. Finally, we discuss the connotation of strategies and the potential that would provide the framework of plant proteome research. The past decades have seen rapidly growing number of sequenced plant genomes and associated genomic resources. To keep pace with this increasing body of data, India is in the provisional phase of proteomics research to develop a comparative hub for plant proteomes and protein families, but it requires a strong impetus from intellectuals, entrepreneurs, and government agencies. Here, we aim to provide an overview of past, present and future of Indian plant proteomics, which would serve as an evaluation platform for those seeking to incorporate proteomics into their research programs. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. Proteomics technique opens new frontiers in mobilome research.

    PubMed

    Davidson, Andrew D; Matthews, David A; Maringer, Kevin

    2017-01-01

    A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.

  5. Proteomics technique opens new frontiers in mobilome research

    PubMed Central

    Davidson, Andrew D.; Matthews, David A.

    2017-01-01

    A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the “mobilome,” which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the “domestication” of transposon proteins for cellular functions. Although ’omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called “proteomics informed by transcriptomics” (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease. PMID:28932623

  6. GENOMIC AND PROTEOMIC ANALYSIS OF SURROGATE TISSUES FOR ASSESSING TOXIC EXPOSURES AND DISEASE STATES

    EPA Science Inventory

    Genomic and Proteomic Analysis of Surrogate Tissues for Assessing Toxic Exposures and Disease States
    David J. Dix and John C. Rockett
    Reproductive Toxicology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, USEPA, ...

  7. Search for sarcoidosis candidate genes by integration of data from genomic, transcriptomic and proteomic studies.

    PubMed

    Maver, Ales; Medica, Igor; Peterlin, Borut

    2009-12-01

    The search for gene candidates in multifactorial diseases such as sarcoidosis can be based on the integration of linkage association data, gene expression data, and protein profile data from genomic, transcriptomic and proteomic studies, respectively. In this study we performed a literature-based search for studies reporting such data, followed by integration of collected information. Different databases were examined: Medline, HuGE Navigator, ArrayExpress and Gene Expression Omnibus (GEO). Candidate genes were defined as genes which were reported in at least 2 different types of omics studies. Genes previously investigated in sarcoidosis were excluded from further analyses. We identified 177 genes associated with sarcoidosis as potential new candidate genes. Subsequently, 9 gene candidates identified to overlap in 2 different types of studies (genomic, transcriptomic and/or proteomic) were consistently reported in at least 3 studies: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214. These genes are involved in regulation of immune response, cellular proliferation, apoptosis, inhibition of protease activity, and lipid metabolism. Exact biological functions of HBEGF, LRIG1, PTPN23, DPM2 and NUP214 remain to be completely elucidated. We propose 9 candidate genes: SERPINB1, FABP4, S100A8, HBEGF, IL7R, LRIG1, PTPN23, DPM2 and NUP214, as genes with high potential for association with sarcoidosis.
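
    The selection rule described in this abstract (a gene counts as a candidate when it is reported in at least 2 different types of omics studies, with a stricter subset reported in at least 3 studies) can be sketched as a simple filter. This is only an illustration of the stated rule, not the authors' pipeline: the gene names, study identifiers, and the `candidate_genes` function are hypothetical.

```python
from collections import defaultdict

# Hypothetical literature records: (gene, omics type, study id).
# These values are illustrative only and are not data from the paper.
reports = [
    ("GENE_A", "genomic", "s1"),
    ("GENE_A", "transcriptomic", "s2"),
    ("GENE_A", "proteomic", "s3"),
    ("GENE_B", "transcriptomic", "s2"),
    ("GENE_B", "transcriptomic", "s4"),
    ("GENE_C", "genomic", "s5"),
    ("GENE_C", "proteomic", "s3"),
]

def candidate_genes(reports, min_types=2, min_studies=1, exclude=frozenset()):
    """Return genes seen in >= min_types omics types and >= min_studies studies."""
    types = defaultdict(set)    # gene -> omics types that reported it
    studies = defaultdict(set)  # gene -> studies that reported it
    for gene, omics_type, study in reports:
        types[gene].add(omics_type)
        studies[gene].add(study)
    return sorted(
        gene for gene in types
        if gene not in exclude
        and len(types[gene]) >= min_types
        and len(studies[gene]) >= min_studies
    )

# Genes reported in at least two omics types.
candidates = candidate_genes(reports)
# Stricter subset: also reported in at least three studies.
consistent = candidate_genes(reports, min_studies=3)
```

    The `exclude` argument stands in for the step that drops genes previously investigated in the disease.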

  8. VESPA: Software to Facilitate Genomic Annotation of Prokaryotic Organisms Through Integration of Proteomic and Transcriptomic Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peterson, Elena S.; McCue, Lee Ann; Rutledge, Alexandra C.

    2012-04-25

    Visual Exploration and Statistics to Promote Annotation (VESPA) is an interactive visual analysis software tool that facilitates the discovery of structural mis-annotations in prokaryotic genomes. VESPA integrates high-throughput peptide-centric proteomics data and oligo-centric or RNA-Seq transcriptomics data into a genomic context. The data may be interrogated via visual analysis across multiple levels of genomic resolution, linked searches, exports and interaction with BLAST to rapidly identify location of interest within the genome and evaluate potential mis-annotations.

  9. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics

    PubMed Central

    Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatology, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only in skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution; and 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  10. The most common technologies and tools for functional genome analysis.

    PubMed

    Gasperskaja, Evelina; Kučinskas, Vaidutis

    2017-01-01

    Since the sequence of the human genome is complete, the main issue is how to understand the information written in the DNA sequence. Despite numerous genome-wide studies that have already been performed, the challenge to determine the function of genes, gene products, and also their interaction is still open. As changes in the human genome are highly likely to cause pathological conditions, functional analysis is vitally important for human health. For many years there have been a variety of technologies and tools used in functional genome analysis. However, only in the past decade has there been rapid, revolutionary progress in high-throughput methods, which range from traditional real-time polymerase chain reaction to more complex systems, such as next-generation sequencing or mass spectrometry. Furthermore, not only laboratory investigation, but also accurate bioinformatic analysis is required for reliable scientific results. These methods give an opportunity for accurate and comprehensive functional analysis that involves various fields of study: genomics, epigenomics, proteomics, and interactomics. This is essential for filling the gaps in the knowledge about dynamic biological processes at both the cellular and organismal level. However, each method has both advantages and limitations that should be taken into account before choosing the right method for particular research in order to ensure a successful study. For this reason, the present review paper aims to describe the most frequent and widely-used methods for comprehensive functional analysis.

  11. Functional Genomics Approaches to Studying Symbioses between Legumes and Nitrogen-Fixing Rhizobia.

    PubMed

    Lardi, Martina; Pessi, Gabriella

    2018-05-18

    Biological nitrogen fixation gives legumes a pronounced growth advantage in nitrogen-deprived soils and is of considerable ecological and economic interest. In exchange for reduced atmospheric nitrogen, typically given to the plant in the form of amides or ureides, the legume provides nitrogen-fixing rhizobia with nutrients and highly specialised root structures called nodules. To elucidate the molecular basis underlying physiological adaptations on a genome-wide scale, functional genomics approaches, such as transcriptomics, proteomics, and metabolomics, have been used. This review presents an overview of the different functional genomics approaches that have been performed on rhizobial symbiosis, with a focus on studies investigating the molecular mechanisms used by the bacterial partner to interact with the legume. While rhizobia belonging to the alpha-proteobacterial group (alpha-rhizobia) have been well studied, few studies to date have investigated this process in beta-proteobacteria (beta-rhizobia).

  12. Why proteomics is not the new genomics and the future of mass spectrometry in cell biology.

    PubMed

    Sidoli, Simone; Kulej, Katarzyna; Garcia, Benjamin A

    2017-01-02

    Mass spectrometry (MS) is an essential part of the cell biologist's proteomics toolkit, allowing analyses at molecular and system-wide scales. However, proteomics still lags behind genomics in popularity and ease of use. We discuss key differences between MS-based -omics and other booming -omics technologies and highlight what we view as the future of MS and its role in our increasingly deep understanding of cell biology. © 2017 Sidoli et al.

  13. Trans-Proteomic Pipeline, a standardized data processing pipeline for large-scale reproducible proteomics informatics

    PubMed Central

    Deutsch, Eric W.; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L.

    2015-01-01

    Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include mass spectrometry to define protein sequence, protein:protein interactions, and protein post-translational modifications. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative mass spectrometry proteomics. It supports all major operating systems and instrument vendors via open data formats. Here we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of tandem mass spectrometry datasets, as well as some major upcoming features. PMID:25631240

  14. Trans-Proteomic Pipeline, a standardized data processing pipeline for large-scale reproducible proteomics informatics.

    PubMed

    Deutsch, Eric W; Mendoza, Luis; Shteynberg, David; Slagel, Joseph; Sun, Zhi; Moritz, Robert L

    2015-08-01

    Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include MS to define protein sequence, protein:protein interactions, and protein PTMs. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative MS proteomics. It supports all major operating systems and instrument vendors via open data formats. Here, we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of MS/MS datasets, as well as some major upcoming features. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. Interaction Analysis through Proteomic Phage Display

    PubMed Central

    2014-01-01

    Phage display is a powerful technique for profiling specificities of peptide binding domains. The method is suited for the identification of high-affinity ligands with inhibitor potential when using highly diverse combinatorial peptide phage libraries. Such experiments further provide consensus motifs for genome-wide scanning of ligands of potential biological relevance. A complementary but considerably less explored approach is to display expression products of genomic DNA, cDNA, open reading frames (ORFs), or oligonucleotide libraries designed to encode defined regions of a target proteome on phage particles. One of the main applications of such proteomic libraries has been the elucidation of antibody epitopes. This review is focused on the use of proteomic phage display to uncover protein-protein interactions of potential relevance for cellular function. The method is particularly suited for the discovery of interactions between peptide binding domains and their targets. We discuss the largely unexplored potential of this method in the discovery of domain-motif interactions of potential biological relevance. PMID:25295249

  16. The porcine translational research database: A manually curated, genomics and proteomics-based research resource

    USDA-ARS's Scientific Manuscript database

    The use of swine in biomedical research has increased dramatically in the last decade. Diverse genomic and proteomic databases have been developed to facilitate research using human and rodent models. Current porcine gene databases, however, lack the robust annotation to study pig models that are...

  17. Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu

    PubMed Central

    Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui

    2015-01-01

    Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat. PMID:26132381

  18. Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu.

    PubMed

    Zhang, Yanlin; Luo, Guangbin; Liu, Dongcheng; Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui

    2015-01-01

    Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat.

  19. Single-cell-type Proteomics: Toward a Holistic Understanding of Plant Function

    PubMed Central

    Dai, Shaojun; Chen, Sixue

    2012-01-01

    Multicellular organisms such as plants contain different types of cells with specialized functions. Analyzing the protein characteristics of each type of cell will not only reveal specific cell functions, but also enhance understanding of how an organism works. Most plant proteomics studies have focused on using tissues and organs containing a mixture of different cells. Recent single-cell-type proteomics efforts on pollen grains, guard cells, mesophyll cells, root hairs, and trichomes have shown utility. We expect that high resolution proteomic analyses will reveal novel functions in single cells. This review provides an overview of recent developments in plant single-cell-type proteomics. We discuss application of the approach for understanding important cell functions, and we consider the technical challenges of extending the approach to all plant cell types. Finally, we consider the integration of single-cell-type proteomics with transcriptomics and metabolomics with the goal of providing a holistic understanding of plant function. PMID:22982375

  20. Proteomic and comparative genomic analysis reveals adaptability of Brassica napus to phosphorus-deficient stress.

    PubMed

    Chen, Shuisen; Ding, Guangda; Wang, Zhenhua; Cai, Hongmei; Xu, Fangsen

    2015-03-18

    Given low solubility and immobility in many soils of the world, phosphorus (P) may be the most widely studied macronutrient for plants. In an attempt to gain an insight into the adaptability of Brassica napus to P deficiency, proteome alterations of roots and leaves in two contrasting B. napus genotypes, P-efficient 'Eyou Changjia' and P-inefficient 'B104-2', under long-term low P stress and short-term P-free starvation conditions were investigated, and proteomic combined with comparative genomic analyses were conducted to interpret the interrelation of differential abundance protein species (DAPs) responding to P deficiency with quantitative trait loci (QTLs) for P deficiency tolerance. P-efficient 'Eyou Changjia' had higher dry weight and P content, and showed high tolerance to low P stress compared with P-inefficient 'B104-2'. A total of 146 DAPs were successfully identified by MALDI TOF/TOF MS, which were categorized into several groups including defense and stress response, carbohydrate and energy metabolism, signaling and regulation, amino acid and fatty acid metabolism, protein process, biogenesis and cellular component, and function unknown. 94 of 146 DAPs were mapped to a linkage map constructed by a B. napus population derived from a cross between the two genotypes, and 72 DAPs were located in the confidence intervals of QTLs for P efficiency related traits. We conclude that the identification of these DAPs and the co-location of DAPs with QTLs in the B. napus linkage genetic map provide novel information for understanding the adaptability of B. napus to P deficiency, and help to isolate P-efficient genes in B. napus. Low P seriously limits the production and quality of B. napus. Although proteomics and genetic linkage maps have been widely used to study the adaptive strategies of B. napus in response to P deficiency, studies combining proteomic with comparative genetic analysis to investigate the correlations between DAPs and QTLs are scarce. Thus, we herein investigated
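
    The co-location step mentioned in this abstract (checking whether a mapped protein falls inside a QTL confidence interval) amounts to an interval-membership test per linkage group. A minimal sketch follows, assuming positions in centimorgans; the linkage-group labels, interval bounds, protein names, and the `colocated` function are all hypothetical and are not taken from the study.

```python
# Hypothetical QTL confidence intervals per linkage group, in cM.
# Illustrative values only, not data from the paper.
qtl_intervals = {
    "A01": [(10.0, 25.0), (60.0, 72.5)],
    "C03": [(5.0, 12.0)],
}

# Hypothetical mapped proteins: name -> (linkage group, position in cM).
mapped_daps = {
    "DAP_1": ("A01", 18.2),
    "DAP_2": ("A01", 40.0),
    "DAP_3": ("C03", 12.0),
    "DAP_4": ("C07", 3.3),
}

def colocated(daps, intervals):
    """Return names of proteins whose position lies in any QTL interval
    on the same linkage group (interval bounds are inclusive)."""
    hits = []
    for name, (group, position) in daps.items():
        group_intervals = intervals.get(group, [])
        if any(start <= position <= end for start, end in group_intervals):
            hits.append(name)
    return sorted(hits)

hits = colocated(mapped_daps, qtl_intervals)
```

    Treating the interval bounds as inclusive is a design choice made here for the sketch; the abstract does not say how boundary positions were handled.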

  1. Linkage of exposure and effects using genomics, proteomics and metabolomics in small fish models (presentation)

    EPA Science Inventory

    This research project combines the use of whole organism endpoints, genomic, proteomic and metabolomic approaches, and computational modeling in a systems biology approach to 1) identify molecular indicators of exposure and biomarkers of effect to EDCs representing several modes/...

  2. A DATABASE FOR TRACKING TOXICOGENOMIC SAMPLES AND PROCEDURES WITH GENOMIC, PROTEOMIC AND METABONOMIC COMPONENTS

    EPA Science Inventory

    A Database for Tracking Toxicogenomic Samples and Procedures with Genomic, Proteomic and Metabonomic Components
    Wenjun Bao1, Jennifer Fostel2, Michael D. Waters2, B. Alex Merrick2, Drew Ekman3, Mitchell Kostich4, Judith Schmid1, David Dix1
    Office of Research and Developmen...

  3. Proteomics research in India: an update.

    PubMed

    Reddy, Panga Jaipal; Atak, Apurva; Ghantasala, Saicharan; Kumar, Saurabh; Gupta, Shabarni; Prasad, T S Keshava; Zingde, Surekha M; Srivastava, Sanjeeva

    2015-09-08

    After a successful completion of the Human Genome Project, deciphering the mystery surrounding the human proteome posed a major challenge. Despite not being largely involved in the Human Genome Project, the Indian scientific community contributed towards proteomic research along with the global community. Currently, more than 76 research/academic institutes and nearly 145 research labs are involved in core proteomic research across India. The Indian researchers have been major contributors in drafting the "human proteome map" along with international efforts. In addition to this, virtual proteomics labs, proteomics courses and remote triggered proteomics labs have helped to overcome the limitations of proteomics education posed due to expensive lab infrastructure. The establishment of Proteomics Society, India (PSI) has created a platform for the Indian proteomic researchers to share ideas, research collaborations and conduct annual conferences and workshops. Indian proteomic research is really moving forward with the global proteomics community in a quest to solve the mysteries of proteomics. A draft map of the human proteome enhances the enthusiasm among intellectuals to promote proteomic research in India to the world. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. Post-genomics of microsporidia, with emphasis on a model of minimal eukaryotic proteome: a review.

    PubMed

    Texier, Catherine; Brosson, Damien; El Alaoui, Hicham; Méténier, Guy; Vivarès, Christian P

    2005-05-01

    The genome sequence of the microsporidian parasite Encephalitozoon cuniculi Levaditi, Nicolau et Schoen, 1923 contains about 2,000 genes that are representative of a non-redundant potential proteome composed of 1,909 protein chains. The purpose of this review is to relate some advances in the characterisation of this proteome through bioinformatics and experimental approaches. The reduced diversity of the set of E. cuniculi proteins is perceptible in all the compilations of predicted domains, orthologs, families and superfamilies, available in several public databases. The phyletic patterns of orthologs for seven eukaryotic organisms support an extensive gene loss in the fungal clade, with additional deletions in E. cuniculi. Most microsporidial orthologs are the smallest ones among eukaryotes, justifying an interest in the use of these compacted proteins to better discriminate between essential and non-essential regions. The three components of the E. cuniculi mRNA capping apparatus have been especially well characterized and the three-dimensional structure of the cap methyltransferase has been elucidated following the crystallisation of the microsporidial enzyme Ecm1. So far, our mass spectrometry-based analyses of the E. cuniculi spore proteome have led to the identification of about 170 proteins, one-quarter of these having no clearly predicted function. Immunocytochemical studies are in progress to determine the subcellular localisation of microsporidia-specific proteins. Post-translational modifications such as phosphorylation and glycosylation are expected to be soon explored.

  5. Quantitative proteomics in Giardia duodenalis-Achievements and challenges.

    PubMed

    Emery, Samantha J; Lacey, Ernest; Haynes, Paul A

    2016-08-01

    Giardia duodenalis (syn. G. lamblia and G. intestinalis) is a protozoan parasite of vertebrates and a major contributor to the global burden of diarrheal diseases and gastroenteritis. The publication of multiple genome sequences in the G. duodenalis species complex has provided important insights into parasite biology, and made post-genomic technologies, including proteomics, significantly more accessible. The aims of proteomics are to identify and quantify proteins present in a cell, and assign functions to them within the context of dynamic biological systems. In Giardia, proteomics in the post-genomic era has transitioned from reliance on gel-based systems to utilisation of a diverse array of techniques based on bottom-up LC-MS/MS technologies. Together, these have generated crucial foundations for subcellular proteomes, elucidated intra- and inter-assemblage isolate variation, and identified pathways and markers in differentiation, host-parasite interactions and drug resistance. However, in Giardia, proteomics remains an emerging field, with considerable shortcomings evident from the published research. These include a bias towards assemblage A, a lack of emphasis on quantitative analytical techniques, and limited information on post-translational protein modifications. Additionally, there are multiple areas of research for which proteomic data is not available to add value to published transcriptomic data. The challenge of amalgamating data in the systems biology paradigm necessitates the further generation of large, high-quality quantitative datasets to accurately model parasite biology. This review surveys the current proteomic research available for Giardia and evaluates their technical and quantitative approaches, while contextualising their biological insights into parasite pathology, isolate variation and eukaryotic evolution. Finally, we propose areas of priority for the generation of future proteomic data to explore fundamental questions in Giardia

  6. Computational clustering for viral reference proteomes

    PubMed Central

    Chen, Chuming; Huang, Hongzhan; Mazumder, Raja; Natale, Darren A.; McGarvey, Peter B.; Zhang, Jian; Polson, Shawn W.; Wang, Yuqi; Wu, Cathy H.

    2016-01-01

    Motivation: The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies. Results: We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt’s curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs. Availability and implementation: http://proteininformationresource.org/rps/viruses/ Contact: chenc@udel.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153712

  7. Genome-wide identification, functional and evolutionary analysis of terpene synthases in pineapple.

    PubMed

    Chen, Xiaoe; Yang, Wei; Zhang, Liqin; Wu, Xianmiao; Cheng, Tian; Li, Guanglin

    2017-10-01

    Terpene synthases (TPSs) are vital for the biosynthesis of active terpenoids, which have important physiological, ecological and medicinal value. Although terpenoids have been reported in pineapple (Ananas comosus), genome-wide investigations of the TPS genes responsible for pineapple terpenoid synthesis are still lacking. By integrating pineapple genome and proteome data, twenty-one putative terpene synthase genes were found in pineapple and divided into five subfamilies. Tandem duplication is the cause of TPS gene family duplication. Furthermore, functional differentiation between each TPS subfamily may have occurred for several reasons. Sixty-two key amino acid sites were identified as sites of type-II functional divergence between the TPS-a and TPS-c subfamilies. Finally, coevolution analysis indicated that multiple amino acid residues are involved in coevolutionary processes. In addition, the enzyme activity of two TPSs was tested. This genome-wide identification, functional and evolutionary analysis of pineapple TPS genes provides new insight into the roles of the TPS family and lays the basis for further characterizing the function and evolution of the TPS gene family. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Functional insights from proteome-wide structural modeling of Treponema pallidum subspecies pallidum, the causative agent of syphilis.

    PubMed

    Houston, Simon; Lithgow, Karen Vivien; Osbak, Kara Krista; Kenyon, Chris Richard; Cameron, Caroline E

    2018-05-16

    Syphilis continues to be a major global health threat with 11 million new infections each year, and a global burden of 36 million cases. The causative agent of syphilis, Treponema pallidum subspecies pallidum, is a highly virulent bacterium, however the molecular mechanisms underlying T. pallidum pathogenesis remain to be definitively identified. This is due to the fact that T. pallidum is currently uncultivatable, inherently fragile and thus difficult to work with, and phylogenetically distinct with no conventional virulence factor homologs found in other pathogens. In fact, approximately 30% of its predicted protein-coding genes have no known orthologs or assigned functions. Here we employed a structural bioinformatics approach using Phyre2-based tertiary structure modeling to improve our understanding of T. pallidum protein function on a proteome-wide scale. Phyre2-based tertiary structure modeling generated high-confidence predictions for 80% of the T. pallidum proteome (780/978 predicted proteins). Tertiary structure modeling also inferred the same function as primary structure-based annotations from genome sequencing pipelines for 525/605 proteins (87%), which represents 54% (525/978) of all T. pallidum proteins. Of the 175 T. pallidum proteins modeled with high confidence that were not assigned functions in the previously annotated published proteome, 167 (95%) were able to be assigned predicted functions. Twenty-one of the 175 hypothetical proteins modeled with high confidence were also predicted to exhibit significant structural similarity with proteins experimentally confirmed to be required for virulence in other pathogens. Phyre2-based structural modeling is a powerful bioinformatics tool that has provided insight into the potential structure and function of the majority of T. pallidum proteins and helped validate the primary structure-based annotation of more than 50% of all T. pallidum proteins with high confidence. This work represents the first T
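
    The percentages quoted in this abstract can be checked directly from the counts it reports. The short sketch below recomputes them, rounding to the nearest whole percent; the `pct` helper and the variable names are introduced here for illustration only.

```python
def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to the nearest integer."""
    return round(100 * part / whole)

# Counts as reported in the abstract.
total_proteins = 978        # predicted T. pallidum proteins
high_confidence = 780       # proteins modeled with high confidence
previously_annotated = 605  # proteins compared with primary structure-based annotations
same_function = 525         # proteins whose inferred function matched the annotation
unassigned = 175            # high-confidence models with no previously assigned function
newly_assigned = 167        # of those, proteins given a predicted function

coverage = pct(high_confidence, total_proteins)               # reported as 80%
agreement = pct(same_function, previously_annotated)          # reported as 87%
agreement_of_proteome = pct(same_function, total_proteins)    # reported as 54%
newly_assigned_share = pct(newly_assigned, unassigned)        # reported as 95%
```

    Note that 605 + 175 = 780, so the annotated and previously unassigned groups together account for all high-confidence models.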

  9. Platelet proteomics: from discovery to diagnosis.

    PubMed

    Looße, Christina; Swieringa, Frauke; Heemskerk, Johan W M; Sickmann, Albert; Lorenz, Christin

    2018-05-22

    Platelets are the smallest cells within the circulating blood with key roles in physiological haemostasis and pathological thrombosis regulated by the onset of activating/inhibiting processes via receptor responses and signalling cascades. Areas covered: Proteomics as well as genomic approaches have been fundamental in identifying and quantifying potential targets for future diagnostic strategies in the prevention of bleeding and thrombosis, and uncovering the complexity of platelet functions in health and disease. In this article, we provide a critical overview on current functional tests used in diagnostics and the future perspectives for platelet proteomics in clinical applications. Expert commentary: Proteomics represents a valuable tool for the identification of patients with diverse platelet associated defects. In-depth validation of identified biomarkers, e.g. receptors, signalling proteins, post-translational modifications, in large cohorts is decisive for translation into routine clinical diagnostics.

  10. Analysis of the functional aspects and seminal plasma proteomic profile of sperm from smokers.

    PubMed

    Antoniassi, Mariana Pereira; Intasqui, Paula; Camargo, Mariana; Zylbersztejn, Daniel Suslik; Carvalho, Valdemir Melechco; Cardozo, Karina H M; Bertolla, Ricardo Pimenta

    2016-11-01

    To evaluate the effect of smoking on sperm functional quality and seminal plasma proteomic profile. Sperm functional tests were performed in 20 non-smoking men with normal semen quality, according to the World Health Organization (2010) and in 20 smoking patients. These included: evaluation of DNA fragmentation by alkaline Comet assay; analysis of mitochondrial activity using DAB staining; and acrosomal integrity evaluation by PNA binding. The remaining semen was centrifuged and seminal plasma was used for proteomic analysis (liquid chromatography-tandem mass spectrometry). The quantified proteins were used for Venn diagram construction in Cytoscape 3.2.1 software, using the PINA4MS plug-in. Then, differentially expressed proteins were used for functional enrichment analysis of Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes and Reactome, using Cytoscape software and the ClueGO 2.2.0 plug-in. Smokers had a higher percentage of sperm DNA damage (Comet classes III and IV; P < 0.01), partially and fully inactive mitochondria (DAB classes III and IV; P = 0.001 and P = 0.006, respectively) and non-intact acrosomes (P < 0.01) when compared with the control group. With respect to proteomic analysis, 422 proteins were identified and quantified, of which one protein was absent, 27 proteins were under-represented and six proteins were over-represented in smokers. Functional enrichment analysis showed the enrichment of antigen processing and presentation, positive regulation of prostaglandin secretion involved in immune response, protein kinase A signalling and arachidonic acid secretion, complement activation, regulation of the cytokine-mediated signalling pathway and regulation of acute inflammatory response in the study group (smokers). In conclusion, cigarette smoking was associated with an inflammatory state in the accessory glands and in the testis, as shown by enriched proteomic pathways. This state causes an alteration in sperm functional quality

  11. Comparative Analysis of Predicted Plastid-Targeted Proteomes of Sequenced Higher Plant Genomes

    PubMed Central

    Schaeffer, Scott; Harper, Artemus; Raja, Rajani; Jaiswal, Pankaj; Dhingra, Amit

    2014-01-01

    Plastids are actively involved in numerous plant processes critical to growth, development and adaptation. They play a primary role in photosynthesis, pigment and monoterpene synthesis, gravity sensing, starch and fatty acid synthesis, as well as oil and protein storage. We applied two complementary methods to analyze the recently published apple genome (Malus × domestica) to identify putative plastid-targeted proteins, the first using TargetP and the second using a custom workflow utilizing a set of predictive programs. Apple shares roughly 40% of its 10,492 putative plastid-targeted proteins with the Arabidopsis (Arabidopsis thaliana) plastid-targeted proteome as identified by the Chloroplast 2010 project and ∼57% of its entire proteome with Arabidopsis. This suggests that the plastid-targeted proteomes of apple and Arabidopsis are different, and interestingly alludes to the presence of differential targeting of homologs between the two species. Co-expression analysis of 2,224 genes encoding putative plastid-targeted apple proteins suggests that they play a role in plant developmental and intermediary metabolism. Further, an inter-specific comparison of Arabidopsis, Prunus persica (Peach), Malus × domestica (Apple), Populus trichocarpa (Black cottonwood), Fragaria vesca (Woodland Strawberry), Solanum lycopersicum (Tomato) and Vitis vinifera (Grapevine) also identified a large number of novel species-specific plastid-targeted proteins. This analysis also revealed the presence of alternatively targeted homologs across species. Two separate analyses revealed that a small subset of proteins, one representing 289 protein clusters and the other 737 unique protein sequences, are conserved between seven plastid-targeted angiosperm proteomes. The majority of the novel proteins were annotated to play roles in stress response, transport, catabolic processes, and cellular component organization. Our results suggest that the current state of knowledge regarding

  12. Microbial Interactions in Plants: Perspectives and Applications of Proteomics.

    PubMed

    Imam, Jahangir; Shukla, Pratyoosh; Mandal, Nimai Prasad; Variar, Mukund

    2017-01-01

    The structure and function of proteins involved in plant-microbe interactions are investigated in complex biological samples using large-scale proteomics technology. Because whole-genome sequences are now available for several plant species and microbes, proteomics studies have become easier and more accurate, and large amounts of data can be generated and analyzed during plant-microbe interactions. Proteomics approaches are important and relevant in many studies, which have shown that genomics approaches alone are not sufficient: significant information is lost because proteins, not the genes encoding them, are the final products responsible for the observed phenotype. Novel proteomics approaches are being developed continuously, enabling the study of various aspects of the arrangement and configuration of proteins and their functions. With the advancement of new technologies, proteomics is being applied more frequently to plant-microbe interactions. It is used mainly to characterize the cellular and extracellular virulence and pathogenicity factors produced by pathogens, and it identifies protein-level changes in host plants infected with pathogens or colonized by beneficial partners. This review provides a brief overview of the proteomics technologies currently available, followed by their use in studying plant-microbe interactions. Copyright© Bentham Science Publishers.

  13. Proteomic analysis of bovine nucleolus.

    PubMed

    Patel, Amrutlal K; Olson, Doug; Tikoo, Suresh K

    2010-09-01

    The nucleolus is the most prominent subnuclear structure and performs a wide variety of functions in eukaryotic cellular processes. In order to understand the structural and functional role of the nucleoli in bovine cells, we analyzed the proteomic composition of the bovine nucleoli. The nucleoli were isolated from Madin Darby bovine kidney cells and subjected to proteomic analysis by LC-MS/MS after fractionation by SDS-PAGE and strong cation exchange chromatography. Analysis of the data using the Mascot database search and the GPM database search identified 311 proteins in the bovine nucleoli, which contained 22 proteins previously not identified in the proteomic analysis of human nucleoli. Analysis of the identified proteins using the GoMiner software suggested that the bovine nucleoli contained proteins involved in ribosomal biogenesis, cell cycle control, transcriptional, translational and post-translational regulation, transport, and structural organization. Copyright © 2010 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.

  14. Computer applications making rapid advances in high throughput microbial proteomics (HTMP).

    PubMed

    Anandkumar, Balakrishna; Haga, Steve W; Wu, Hui-Fen

    2014-02-01

    The last few decades have seen the rise of widely available proteomics tools. From new data acquisition devices, such as MALDI-MS and 2DE, to new database-searching software, these new products have paved the way for high throughput microbial proteomics (HTMP). These tools are enabling researchers to gain new insights into microbial metabolism, and are opening up new areas of study, such as protein-protein interactions (interactomics) discovery. Computer software is a key part of these emerging fields. This review considers: 1) software tools for identifying the proteome, such as MASCOT or PDQuest, 2) online databases of proteomes, such as SWISS-PROT, Proteome Web, or the Proteomics Facility of the Pathogen Functional Genomics Resource Center, and 3) software tools for applying proteomic data, such as PSI-BLAST or VESPA. These tools allow for research in network biology, protein identification, functional annotation, target identification/validation, protein expression, protein structural analysis, metabolic pathway engineering and drug discovery.

  15. Proteomic profiling of white muscle from freshwater catfish Rita rita.

    PubMed

    Mohanty, Bimal Prasanna; Mitra, Tandrima; Banerjee, Sudeshna; Bhattacharjee, Soma; Mahanty, Arabinda; Ganguly, Satabdi; Purohit, Gopal Krishna; Karunakaran, Dhanasekar; Mohanty, Sasmita

    2015-06-01

    Muscle tissues contribute 34-48 % of the total body mass in fish. Proteomic analysis enables better understanding of the skeletal muscle physiology and metabolism. A proteome map reflects the general fingerprinting of the fish species and has the potential to identify novel proteins which could serve as biomarkers for many aspects of aquaculture including fish physiology and growth, flesh quality, food safety and aquatic environmental monitoring. The freshwater catfish Rita rita of the family Bagridae inhabiting the tropical rivers and estuaries is an important food fish with high nutritive value and is also considered a species of choice in riverine pollution monitoring. Omics information that could enhance utility of this species in molecular research is meager. Therefore, in the present study, proteomic analysis of Rita rita muscle has been carried out and functional genomics data have been generated. A reference muscle proteome has been developed, and 23 protein spots, representing 18 proteins, have been identified by MALDI-TOF/TOF-MS and LC-MS/MS. Besides, transcript information on a battery of heat shock proteins (Hsps) has been generated. The functional genomics information generated could act as the baseline data for further molecular research on this species.

  16. Plasmodium vivax Biology: Insights Provided by Genomics, Transcriptomics and Proteomics

    PubMed Central

    Bourgard, Catarina; Albrecht, Letusa; Kayano, Ana C. A. V.; Sunnerhagen, Per; Costa, Fabio T. M.

    2018-01-01

    During the last decade, the vast omics field has revolutionized biological research, especially the genomics, transcriptomics and proteomics branches, as technological tools become available to the field researcher and allow difficult question-driven studies to be addressed. Parasitology has greatly benefited from next generation sequencing (NGS) projects, which have resulted in a broadened comprehension of basic parasite molecular biology, ecology and epidemiology. Malariology is one example where application of this technology has greatly contributed to a better understanding of Plasmodium spp. biology and host-parasite interactions. Among the several parasite species that cause human malaria, the neglected Plasmodium vivax presents great research challenges, as in vitro culturing is not yet feasible and functional assays are heavily limited. Therefore, there are gaps in our P. vivax biology knowledge that affect decisions for control policies aiming to eradicate vivax malaria in the near future. In this review, we provide a snapshot of key discoveries already achieved in P. vivax sequencing projects, focusing on developments, hurdles, and limitations currently faced by the research community, as well as perspectives on future vivax malaria research. PMID:29473024

  17. Laser Capture Microdissection in the Genomic and Proteomic Era: Targeting the Genetic Basis of Cancer

    PubMed Central

    Domazet, Barbara; MacLennan, Gregory T.; Lopez-Beltran, Antonio; Montironi, Rodolfo; Cheng, Liang

    2008-01-01

    The advent of new technologies has enabled deeper insight into processes at subcellular levels, which will ultimately improve diagnostic procedures and patient outcome. Thanks to cell enrichment methods, it is now possible to study cells in their native environment. This has greatly contributed to a rapid growth in several areas, such as gene expression analysis, proteomics, and metabolonomics. Laser capture microdissection (LCM) as a method of procuring subpopulations of cells under direct visual inspection is playing an important role in these areas. This review provides an overview of existing LCM technology and its downstream applications in genomics, proteomics, diagnostics and therapy. PMID:18787684

  18. Laser capture microdissection in the genomic and proteomic era: targeting the genetic basis of cancer.

    PubMed

    Domazet, Barbara; Maclennan, Gregory T; Lopez-Beltran, Antonio; Montironi, Rodolfo; Cheng, Liang

    2008-03-15

    The advent of new technologies has enabled deeper insight into processes at subcellular levels, which will ultimately improve diagnostic procedures and patient outcome. Thanks to cell enrichment methods, it is now possible to study cells in their native environment. This has greatly contributed to a rapid growth in several areas, such as gene expression analysis, proteomics, and metabolonomics. Laser capture microdissection (LCM) as a method of procuring subpopulations of cells under direct visual inspection is playing an important role in these areas. This review provides an overview of existing LCM technology and its downstream applications in genomics, proteomics, diagnostics and therapy.

  19. Enriching the annotation of Mycobacterium tuberculosis H37Rv proteome using remote homology detection approaches: insights into structure and function.

    PubMed

    Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy

    2015-01-01

    The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. There are 183 gene products unique to mycobacteria for which no functions could be identified. Of these, 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better

  20. Advances in Proteomics of Mycobacterium leprae.

    PubMed

    Parkash, O; Singh, B P

    2012-04-01

    Although Mycobacterium leprae was the first bacterial pathogen identified as causing human disease, it remains one of the few that is non-cultivable. Understanding the biology of M. leprae is one of the primary challenges in current leprosy research. Genomics has been extremely valuable; nonetheless, functional proteins are ultimately responsible for controlling most aspects of cellular functions, which in turn could facilitate parasitizing the host. Furthermore, bacterial proteins provide targets for most of the vaccines and immunodiagnostic tools. Better understanding of the proteomics of M. leprae could also help in developing new drugs against M. leprae. During the past nearly 15 years, there have been several developments towards the identification of M. leprae proteins employing contemporary proteomics tools. In this review, we discuss the knowledge gained on the biology and pathogenesis of M. leprae from current proteomic studies. © 2012 The Authors. Scandinavian Journal of Immunology © 2012 Blackwell Publishing Ltd.

  1. Draft Map of Human Proteome Published | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    In a recently published article in the journal Nature, researchers have developed a draft map of the human proteome. Striving for the protein equivalent of the Human Genome Project, an international team of researchers has created an initial catalog of the human proteome. In total, using 30 different human tissues, the researchers identified proteins encoded by 17,294 genes, which is approximately 84 percent of all of the genes in the human genome predicted to encode proteins.
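
    The coverage figure above implies an approximate total number of predicted protein-coding genes. As a rough worked example, the Python sketch below back-calculates that total from the two numbers given in the summary. The function name `implied_total` is illustrative only, and because the 84 percent figure is itself rounded, the result is an estimate rather than a count reported by the researchers.

```python
def implied_total(observed: int, fraction: float) -> int:
    """Back-calculate the total implied by an observed count and its stated share."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    return round(observed / fraction)


genes_with_protein_evidence = 17_294  # figure stated in the summary
stated_coverage = 0.84                # "approximately 84 percent" (rounded)

estimated_total = implied_total(genes_with_protein_evidence, stated_coverage)
estimated_missing = estimated_total - genes_with_protein_evidence

print(f"Implied predicted protein-coding genes: about {estimated_total}")
print(f"Implied genes without protein evidence: about {estimated_missing}")
```

    Rounding the percentage to two digits means the implied total can shift by roughly a hundred genes in either direction, so the output is best read as an order-of-magnitude check.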

  2. Label-free proteomic analysis to confirm the predicted proteome of Corynebacterium pseudotuberculosis under nitrosative stress mediated by nitric oxide.

    PubMed

    Silva, Wanderson M; Carvalho, Rodrigo D; Soares, Siomar C; Bastos, Isabela Fs; Folador, Edson L; Souza, Gustavo Hmf; Le Loir, Yves; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco

    2014-12-04

    Corynebacterium pseudotuberculosis biovar ovis is a facultative intracellular pathogen, and the etiological agent of caseous lymphadenitis in small ruminants. During the infection process, the bacterium is subjected to several stress conditions, including nitrosative stress, which is caused by nitric oxide (NO). In silico analysis of the genome of C. pseudotuberculosis ovis 1002 predicted several genes that could influence the resistance of this pathogen to nitrosative stress. Here, we applied high-throughput proteomics using high definition mass spectrometry to characterize the functional genome of C. pseudotuberculosis ovis 1002 in the presence of NO-donor Diethylenetriamine/nitric oxide adduct (DETA/NO), with the aim of identifying proteins involved in nitrosative stress resistance. We characterized 835 proteins, representing approximately 41% of the predicted proteome of C. pseudotuberculosis ovis 1002, following exposure to nitrosative stress. In total, 102 proteins were exclusive to the proteome of DETA/NO-induced cells, and a further 58 proteins were differentially regulated between the DETA/NO and control conditions. An interactomic analysis of the differential proteome of C. pseudotuberculosis in response to nitrosative stress was also performed. Our proteomic data set suggested the activation of both a general stress response and a specific nitrosative stress response, as well as changes in proteins involved in cellular metabolism, detoxification, transcriptional regulation, and DNA synthesis and repair. Our proteomic analysis validated previously-determined in silico data for C. pseudotuberculosis ovis 1002. In addition, proteomic screening performed in the presence of NO enabled the identification of a set of factors that can influence the resistance and survival of C. pseudotuberculosis during exposure to nitrosative stress.
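
    The distinction drawn above between proteins exclusive to DETA/NO-induced cells and proteins detected in both conditions amounts to a set comparison. The following is a minimal Python sketch, assuming each condition is summarized as a set of protein identifiers. The identifiers and the function name `partition_by_condition` are hypothetical and are not data or tooling from the study.

```python
from typing import Dict, Set


def partition_by_condition(control: Set[str], treated: Set[str]) -> Dict[str, Set[str]]:
    """Split identified proteins into condition-exclusive and shared groups."""
    return {
        "control_only": control - treated,   # seen only without the stressor
        "treated_only": treated - control,   # seen only under the stressor
        "shared": control & treated,         # candidates for differential quantification
    }


# Hypothetical identifiers, for illustration only.
control_proteins = {"prot_A", "prot_B", "prot_C", "prot_D"}
treated_proteins = {"prot_C", "prot_D", "prot_E"}

groups = partition_by_condition(control_proteins, treated_proteins)
for name, members in groups.items():
    print(name, sorted(members))
```

    Only the shared group can be compared quantitatively between conditions; the exclusive groups are defined by presence or absence alone, which in practice also depends on the detection limit of the instrument.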

  3. Activity-based protein profiling: from enzyme chemistry to proteomic chemistry.

    PubMed

    Cravatt, Benjamin F; Wright, Aaron T; Kozarich, John W

    2008-01-01

    Genome sequencing projects have provided researchers with a complete inventory of the predicted proteins produced by eukaryotic and prokaryotic organisms. Assignment of functions to these proteins represents one of the principal challenges for the field of proteomics. Activity-based protein profiling (ABPP) has emerged as a powerful chemical proteomic strategy to characterize enzyme function directly in native biological systems on a global scale. Here, we review the basic technology of ABPP, the enzyme classes addressable by this method, and the biological discoveries attributable to its application.

  4. Sma3s: A universal tool for easy functional annotation of proteomes and transcriptomes.

    PubMed

    Casimiro-Soriguer, Carlos S; Muñoz-Mérida, Antonio; Pérez-Pulido, Antonio J

    2017-06-01

    The falling cost of next-generation sequencing has led to an enormous growth in the number of sequenced genomes and transcriptomes, allowing wet labs to obtain the sequences of their organisms of study. To make the most of these data, one of the first things that should be done is the functional annotation of the protein-coding genes. However, this has been a slow and tedious step that can involve the characterization of thousands of sequences. Sma3s is an accurate computational tool for annotating proteins in an unattended way. Now, we have developed a completely new version, which includes functionalities that will be useful for fundamental and applied science. Currently, the results provide functional categories such as biological processes, which are useful both for characterizing particular sequence datasets and for comparing results from different projects. One of the most important innovations is that it now has low computational requirements, and the complete annotation of a simple proteome or transcriptome usually takes around 24 hours on a personal computer. Sma3s has been tested with a large number of complete proteomes and transcriptomes, and it has demonstrated its potential in health science and other specific projects. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. The Changing Face of Scientific Discourse: Analysis of Genomic and Proteomic Database Usage and Acceptance.

    ERIC Educational Resources Information Center

    Brown, Cecelia

    2003-01-01

    Discusses the growth in use and acceptance of Web-based genomic and proteomic databases (GPD) in scholarly communication. Confirms the role of GPD in the scientific literature cycle, suggests GPD are a storage and retrieval mechanism for molecular biology information, and recommends that existing models of scientific communication be updated to…

  6. The chordate proteome history database.

    PubMed

    Levasseur, Anthony; Paganini, Julien; Dainat, Jacques; Thompson, Julie D; Poch, Olivier; Pontarotti, Pierre; Gouret, Philippe

    2012-01-01

    The chordate proteome history database (http://ioda.univ-provence.fr) comprises some 20,000 evolutionary analyses of proteins from chordate species. Our main objective was to characterize and study the evolutionary histories of the chordate proteome, and in particular to detect genomic events and automatic functional searches. Firstly, phylogenetic analyses based on high quality multiple sequence alignments and a robust phylogenetic pipeline were performed for the whole protein and for each individual domain. Novel approaches were developed to identify orthologs/paralogs, and predict gene duplication/gain/loss events and the occurrence of new protein architectures (domain gains, losses and shuffling). These important genetic events were localized on the phylogenetic trees and on the genomic sequence. Secondly, the phylogenetic trees were enhanced by the creation of phylogroups, whereby groups of orthologous sequences created using OrthoMCL were corrected based on the phylogenetic trees; gene family size and gene gain/loss in a given lineage could be deduced from the phylogroups. For each ortholog group obtained from the phylogenetic or the phylogroup analysis, functional information and expression data can be retrieved. Database searches can be performed easily using biological objects (protein identifier, keyword or domain), but can also be based on events; e.g., domain exchange events can be retrieved. To our knowledge, this is the first database that links group clustering, phylogeny and automatic functional searches along with the detection of important events occurring during genome evolution, such as the appearance of a new domain architecture.

  7. Genome and proteome annotation: organization, interpretation and integration

    PubMed Central

    Reeves, Gabrielle A.; Talavera, David; Thornton, Janet M.

    2008-01-01

    Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it pays special attention to novel annotation fields, such as that of phenotypes, and highlights recent efforts focused on integrating annotations. PMID:19019817

  8. Directed shotgun proteomics guided by saturated RNA-seq identifies a complete expressed prokaryotic proteome

    PubMed Central

    Omasits, Ulrich; Quebatte, Maxime; Stekhoven, Daniel J.; Fortes, Claudia; Roschitzki, Bernd; Robinson, Mark D.; Dehio, Christoph; Ahrens, Christian H.

    2013-01-01

    Prokaryotes, due to their moderate complexity, are particularly amenable to the comprehensive identification of the protein repertoire expressed under different conditions. We applied a generic strategy to identify a complete expressed prokaryotic proteome, which is based on the analysis of RNA and proteins extracted from matched samples. Saturated transcriptome profiling by RNA-seq provided an endpoint estimate of the protein-coding genes expressed under two conditions which mimic the interaction of Bartonella henselae with its mammalian host. Directed shotgun proteomics experiments were carried out on four subcellular fractions. By specifically targeting proteins which are short, basic, low abundant, and membrane localized, we could eliminate their initial underrepresentation compared to the estimated endpoint. A total of 1250 proteins were identified with an estimated false discovery rate below 1%. This represents 85% of all distinct annotated proteins and ∼90% of the expressed protein-coding genes. Genes that were detected at the transcript but not protein level, were found to be highly enriched in several genomic islands. Furthermore, genes that lacked an ortholog and a functional annotation were not detected at the protein level; these may represent examples of overprediction in genome annotations. A dramatic membrane proteome reorganization was observed, including differential regulation of autotransporters, adhesins, and hemin binding proteins. Particularly noteworthy was the complete membrane proteome coverage, which included expression of all members of the VirB/D4 type IV secretion system, a key virulence factor. PMID:23878158
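
    The abstract reports an estimated false discovery rate below 1% but does not say how it was estimated. One widely used approach in shotgun proteomics is target-decoy searching, in which spectra are also matched against decoy (for example reversed) sequences and the ratio of decoy to target hits above a score cutoff serves as the FDR estimate. The Python sketch below assumes that approach. The scores, the `(score, is_decoy)` representation, and both function names are hypothetical and do not describe the authors' pipeline.

```python
from typing import List, Tuple

# A peptide-spectrum match (PSM) as (search score, matched a decoy sequence?).
PSM = Tuple[float, bool]


def fdr_at_threshold(psms: List[PSM], threshold: float) -> float:
    """Estimate FDR as decoy hits / target hits among PSMs scoring >= threshold."""
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    if targets == 0:
        # No accepted target hits: treat as unreliable if any decoy passed.
        return 1.0 if decoys else 0.0
    return min(1.0, decoys / targets)


def lowest_passing_threshold(psms: List[PSM], max_fdr: float = 0.01) -> float:
    """Return the lowest observed score whose estimated FDR is <= max_fdr."""
    for threshold in sorted({score for score, _ in psms}):
        if fdr_at_threshold(psms, threshold) <= max_fdr:
            return threshold
    raise ValueError("no score threshold satisfies the requested FDR")


# Hypothetical scores, for illustration only.
example_psms: List[PSM] = [
    (9.1, False), (8.7, False), (8.2, False),
    (7.9, True), (7.5, False), (6.0, True),
]

cutoff = lowest_passing_threshold(example_psms)
accepted = [score for score, is_decoy in example_psms if score >= cutoff and not is_decoy]
print(f"cutoff={cutoff}, accepted target PSMs={len(accepted)}")
```

    Scanning thresholds from the lowest score upward is a simplification: the estimated FDR is not strictly monotonic in the score, and production tools typically convert it to q-values and control the rate separately at the PSM, peptide, and protein levels.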

  9. Directed Shotgun Proteomics Guided by Saturated RNA-seq Identifies a Complete Expressed Prokaryotic Proteome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Omasits, U.; Quebatte, Maxime; Stekhoven, Daniel J.

    2013-11-01

    Prokaryotes, due to their moderate complexity, are particularly amenable to the comprehensive identification of the protein repertoire expressed under different conditions. We applied a generic strategy to identify a complete expressed prokaryotic proteome, which is based on the analysis of RNA and proteins extracted from matched samples. Saturated transcriptome profiling by RNA-seq provided an endpoint estimate of the protein-coding genes expressed under two conditions which mimic the interaction of Bartonella henselae with its mammalian host. Directed shotgun proteomics experiments were carried out on four subcellular fractions. By specifically targeting proteins which are short, basic, low abundant, and membrane localized, we could eliminate their initial underrepresentation compared to the estimated endpoint. A total of 1250 proteins were identified with an estimated false discovery rate below 1%. This represents 85% of all distinct annotated proteins and ~90% of the expressed protein-coding genes. Genes that were detected at the transcript but not protein level, were found to be highly enriched in several genomic islands. Furthermore, genes that lacked an ortholog and a functional annotation were not detected at the protein level; these may represent examples of overprediction in genome annotations. A dramatic membrane proteome reorganization was observed, including differential regulation of autotransporters, adhesins, and hemin binding proteins. Particularly noteworthy was the complete membrane proteome coverage, which included expression of all members of the VirB/D4 type IV secretion system, a key virulence factor.

  10. Novel Phage Group Infecting Lactobacillus delbrueckii subsp. lactis, as Revealed by Genomic and Proteomic Analysis of Bacteriophage Ldl1

    PubMed Central

    Casey, Eoghan; Mahony, Jennifer; Neve, Horst; Noben, Jean-Paul; Dal Bello, Fabio

    2014-01-01

    Ldl1 is a virulent phage infecting the dairy starter Lactobacillus delbrueckii subsp. lactis LdlS. Electron microscopy analysis revealed that this phage exhibits a large head and a long tail and bears little resemblance to other characterized phages infecting Lactobacillus delbrueckii. In vitro propagation of this phage revealed a latent period of 30 to 40 min and a burst size of 59.9 ± 1.9 phage particles. Comparative genomic and proteomic analyses showed remarkable similarity between the genome of Ldl1 and that of Lactobacillus plantarum phage ATCC 8014-B2. The genomic and proteomic characteristics of Ldl1 demonstrate that this phage does not belong to any of the four previously recognized L. delbrueckii phage groups, necessitating the creation of a new group, called group e, thus adding to the knowledge on the diversity of phages targeting strains of this industrially important lactic acid bacterial species. PMID:25501478

  11. Novel phage group infecting Lactobacillus delbrueckii subsp. lactis, as revealed by genomic and proteomic analysis of bacteriophage Ldl1.

    PubMed

    Casey, Eoghan; Mahony, Jennifer; Neve, Horst; Noben, Jean-Paul; Dal Bello, Fabio; van Sinderen, Douwe

    2015-02-01

    Ldl1 is a virulent phage infecting the dairy starter Lactobacillus delbrueckii subsp. lactis LdlS. Electron microscopy analysis revealed that this phage exhibits a large head and a long tail and bears little resemblance to other characterized phages infecting Lactobacillus delbrueckii. In vitro propagation of this phage revealed a latent period of 30 to 40 min and a burst size of 59.9 ± 1.9 phage particles. Comparative genomic and proteomic analyses showed remarkable similarity between the genome of Ldl1 and that of Lactobacillus plantarum phage ATCC 8014-B2. The genomic and proteomic characteristics of Ldl1 demonstrate that this phage does not belong to any of the four previously recognized L. delbrueckii phage groups, necessitating the creation of a new group, called group e, thus adding to the knowledge on the diversity of phages targeting strains of this industrially important lactic acid bacterial species.

  12. Integrated genomics and proteomics of the Torpedo californica electric organ: concordance with the mammalian neuromuscular junction

    PubMed Central

    2011-01-01

    Background During development, the branchial mesoderm of Torpedo californica transdifferentiates into an electric organ capable of generating high voltage discharges to stun fish. The organ contains a high density of cholinergic synapses and has served as a biochemical model for the membrane specialization of myofibers, the neuromuscular junction (NMJ). We studied the genome and proteome of the electric organ to gain insight into its composition, to determine if there is concordance with skeletal muscle and the NMJ, and to identify novel synaptic proteins. Results Of 435 proteins identified, 300 mapped to Torpedo cDNA sequences with ≥2 peptides. We identified 14 uncharacterized proteins in the electric organ that are known to play a role in acetylcholine receptor clustering or signal transduction. In addition, two human open reading frames, C1orf123 and C6orf130, showed high sequence similarity to electric organ proteins. Our profile lists several proteins that are highly expressed in skeletal muscle or are muscle specific. Synaptic proteins such as acetylcholinesterase, acetylcholine receptor subunits, and rapsyn were present in the electric organ proteome but absent in the skeletal muscle proteome. Conclusions Our integrated genomic and proteomic analysis supports research describing a muscle-like profile of the organ. We show that it is a repository of NMJ proteins but we present limitations on its use as a comprehensive model of the NMJ. Finally, we identified several proteins that may become candidates for signaling proteins not previously characterized as components of the NMJ. PMID:21798097

  13. From proteomics to systems biology: MAPA, MASS WESTERN, PROMEX, and COVAIN as a user-oriented platform.

    PubMed

    Weckwerth, Wolfram; Wienkoop, Stefanie; Hoehenwarter, Wolfgang; Egelhofer, Volker; Sun, Xiaoliang

    2014-01-01

    Genome sequencing and systems biology are revolutionizing life sciences. Proteomics emerged as a fundamental technique of this novel research area as it is the basis for gene function analysis and modeling of dynamic protein networks. Here a complete proteomics platform suited for functional genomics and systems biology is presented. The strategy includes MAPA (mass accuracy precursor alignment; http://www.univie.ac.at/mosys/software.html ) as a rapid exploratory analysis step; MASS WESTERN for targeted proteomics; COVAIN ( http://www.univie.ac.at/mosys/software.html ) for multivariate statistical analysis, data integration, and data mining; and PROMEX ( http://www.univie.ac.at/mosys/databases.html ) as a database module for proteogenomics and proteotypic peptides for targeted analysis. Moreover, the presented platform can also be utilized to integrate metabolomics and transcriptomics data for the analysis of metabolite-protein-transcript correlations and time course analysis using COVAIN. Examples for the integration of MAPA and MASS WESTERN data, proteogenomic and metabolic modeling approaches for functional genomics, phosphoproteomics by integration of MOAC (metal-oxide affinity chromatography) with MAPA, and the integration of metabolomics, transcriptomics, proteomics, and physiological data using this platform are presented. All software and step-by-step tutorials for data processing and data mining can be downloaded from http://www.univie.ac.at/mosys/software.html.

  14. Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools.

    PubMed

    Deshmukh, Rupesh K; Sonah, Humira; Bélanger, Richard R

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is

  15. Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools

    PubMed Central

    Deshmukh, Rupesh K.; Sonah, Humira; Bélanger, Richard R.

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is

  16. Scientific Approaches | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    CPTAC employs two complementary scientific approaches, a "Targeting Genome to Proteome" (Targeting G2P) approach and a "Mapping Proteome to Genome" (Mapping P2G) approach, in order to address biological questions from data generated on a sample.

  17. HTAPP: High-Throughput Autonomous Proteomic Pipeline

    PubMed Central

    Yu, Kebing; Salomon, Arthur R.

    2011-01-01

    Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic datasets is critically important. The high-throughput autonomous proteomic pipeline (HTAPP) described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is comprised of software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of HTAPP focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples. PMID:20336676

  18. Proteogenomics | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    Proteogenomics, or the integration of proteomics with genomics and transcriptomics, is an emerging approach that promises to advance basic, translational and clinical research.  By combining genomic and proteomic information, leading scientists are gaining new insights due to a more complete and unified understanding of complex biological processes.

  19. The Multinational Arabidopsis Steering Subcommittee for Proteomics Assembles the Largest Proteome Database Resource for Plant Systems Biology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weckwerth, Wolfram; Baginsky, Sacha; Van Wijk, Klass

    2009-12-01

    In the past 10 years, we have witnessed remarkable advances in the field of plant molecular biology. The rapid development of proteomic technologies and the speed with which these techniques have been applied to the field have altered our perception of how we can analyze proteins in complex systems. At nearly the same time, the availability of the complete genome for the model plant Arabidopsis thaliana was released; this effort provides an unsurpassed resource for the identification of proteins when researchers use MS to analyze plant samples. Recognizing the growth in this area, the Multinational Arabidopsis Steering Committee (MASC) established a subcommittee for A. thaliana proteomics in 2006 with the objective of consolidating databases, technique standards, and experimentally validated candidate genes and functions. Since the establishment of the Multinational Arabidopsis Steering Subcommittee for Proteomics (MASCP), many new approaches and resources have become available. Recently, the subcommittee established a webpage to consolidate this information (www.masc-proteomics.org). It includes links to plant proteomic databases, general information about proteomic techniques, meeting information, a summary of proteomic standards, and other relevant resources. Altogether, this website provides a useful resource for the Arabidopsis proteomics community. In the future, the website will host discussions and investigate the cross-linking of databases. The subcommittee members have extensive experience in Arabidopsis proteomics and collectively have produced some of the most extensive proteomics data sets for this model plant (Table S1 in the Supporting Information has a list of resources). The largest collection of proteomics data from a single study in A. thaliana was assembled into an accessible database (AtProteome; http://fgcz-atproteome.unizh.ch/index.php) and was recently published by the Baginsky lab.1 The database provides links to major Arabidopsis

  20. Automation, parallelism, and robotics for proteomics.

    PubMed

    Alterovitz, Gil; Liu, Jonathan; Chow, Jijun; Ramoni, Marco F

    2006-07-01

    The speed of the human genome project (Lander, E. S., Linton, L. M., Birren, B., Nusbaum, C. et al., Nature 2001, 409, 860-921) was made possible, in part, by developments in automation of sequencing technologies. Before these technologies, sequencing was a laborious, expensive, and personnel-intensive task. Similarly, automation and robotics are changing the field of proteomics today. Proteomics is defined as the effort to understand and characterize proteins in the categories of structure, function and interaction (Englbrecht, C. C., Facius, A., Comb. Chem. High Throughput Screen. 2005, 8, 705-715). As such, this field nicely lends itself to automation technologies since these methods often require large economies of scale in order to achieve cost and time-saving benefits. This article describes some of the technologies and methods being applied in proteomics in order to facilitate automation within the field as well as in linking proteomics-based information with other related research areas.

  1. Evolution of complete proteomes: guanine-cytosine pressure, phylogeny and environmental influences blend the proteomic architecture

    PubMed Central

    2013-01-01

    Background Guanine-cytosine (GC) composition is an important feature of genomes. Likewise, amino acid composition is a distinct, but less valued, feature of proteomes. A major concern is that it is not clear what valuable information can be acquired from amino acid composition data. To address this concern, in-depth analyses of the amino acid composition of the complete proteomes from 63 archaea, 270 bacteria, and 128 eukaryotes were performed. Results Principal component analysis of the amino acid matrices showed that the main contributors to proteomic architecture were genomic GC variation, phylogeny, and environmental influences. GC pressure drove positive selection on Ala, Arg, Gly, Pro, Trp, and Val, and adverse selection on Asn, Lys, Ile, Phe, and Tyr. The physico-chemical framework of the complete proteomes withstood GC pressure by frequency complementation of GC-dependent amino acid pairs with similar physico-chemical properties. Gln, His, Ser, and Val were responsible for phylogeny and their constituted components could differentiate archaea, bacteria, and eukaryotes. Environmental niche was also a significant factor in determining proteomic architecture, especially for archaea for which the main amino acids were Cys, Leu, and Thr. In archaea, hyperthermophiles, acidophiles, mesophiles, psychrophiles, and halophiles gathered successively along the environment-based principal component. Concordance between proteomic architecture and the genetic code was also related closely to genomic GC content, phylogeny, and lifestyles. Conclusions Large-scale analyses of the complete proteomes of a wide range of organisms suggested that amino acid composition retained the trace of GC variation, phylogeny, and environmental influences during evolution. The findings from this study will help in the development of a global understanding of proteome evolution, and even biological evolution. PMID:24088322
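    The analysis described above starts from two per-organism quantities: the amino acid composition of the complete proteome and the GC content of the genome. A minimal sketch of both computations is given here, using only the standard library; the function names and toy sequences are illustrative assumptions, and the principal component analysis itself is not shown.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def amino_acid_composition(proteins):
    """Relative frequency of each standard amino acid, pooled over all
    sequences in `proteins`. Non-standard symbols are ignored."""
    counts = Counter()
    for seq in proteins:
        counts.update(c for c in seq.upper() if c in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no standard amino acids found")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


def gc_content(dna):
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    dna = dna.upper()
    acgt = sum(dna.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("no unambiguous bases found")
    return (dna.count("G") + dna.count("C")) / acgt


# Toy inputs (hypothetical): a two-protein "proteome" and a short genome.
toy_proteome = ["MKVA", "GAGA"]
composition = amino_acid_composition(toy_proteome)
toy_gc = gc_content("GGCCAT")
```

    Each organism then contributes one 20-dimensional composition vector, and stacking these vectors row by row gives the kind of amino acid matrix on which a principal component analysis can be run.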

  2. Using proteomics to study sexual reproduction in angiosperms

    USDA-ARS's Scientific Manuscript database

    While a relative latecomer to the post-genomics era of functional biology, the application of mass spectrometry-based proteomic analysis has increased exponentially over the past 10 years. Some of this increase is the result of transition of chemists, physicists, and mathematicians to the study of ...

  3. The Escherichia coli Proteome: Past, Present, and Future Prospects

    PubMed Central

    Han, Mee-Jung; Lee, Sang Yup

    2006-01-01

    Proteomics has emerged as an indispensable methodology for large-scale protein analysis in functional genomics. The Escherichia coli proteome has been extensively studied and is well defined in terms of biochemical, biological, and biotechnological data. Even before the entire E. coli proteome was fully elucidated, the largest available data set had been integrated to decipher regulatory circuits and metabolic pathways, providing valuable insights into global cellular physiology and the development of metabolic and cellular engineering strategies. With the recent advent of advanced proteomic technologies, the E. coli proteome has been used for the validation of new technologies and methodologies such as sample prefractionation, protein enrichment, two-dimensional gel electrophoresis, protein detection, mass spectrometry (MS), combinatorial assays with n-dimensional chromatographies and MS, and image analysis software. These important technologies will not only provide a great amount of additional information on the E. coli proteome but also synergistically contribute to other proteomic studies. Here, we review the past development and current status of E. coli proteome research in terms of its biological, biotechnological, and methodological significance and suggest future prospects. PMID:16760308

  4. UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

    PubMed

    Meinicke, Peter

    2009-09-02

    Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.

  5. Development of Advanced Technologies for Complete Genomic and Proteomic Characterization of Quantized Human Tumor Cells

    DTIC Science & Technology

    2015-09-01

    glioblastoma. We have successfully established several patient-derived cell lines from glioblastoma tumors and further established a number of...and single-cell technologies. Although the focus of this research is glioblastoma, the proposed tools are generally applicable to all cancer-based...studies. 15. SUBJECT TERMS Human cohorts, Glioblastoma, Genomic, Proteomic, Single-cell technologies, Hypothesis-driven, integrative systems approach

  6. Evolution of Clinical Proteomics and its Role in Medicine | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    NCI's Office of Cancer Clinical Proteomics Research authored a review of the current state of clinical proteomics in the peer-reviewed Journal of Proteome Research. The review highlights outcomes from the CPTC program and also provides a thorough overview of the different technologies that have pushed the field forward. Additionally, the review provides a vision for moving the field forward through linking advances in genomic and proteomic analysis to develop new, molecularly targeted interventions.

  7. Comparative genomic and proteomic analyses of Clostridium acetobutylicum Rh8 and its parent strain DSM 1731 revealed new understandings on butanol tolerance

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bao, Guanhui; University of Chinese Academy of Sciences, Beijing; Dong, Hongjun

    Highlights: • Genomes of a butanol-tolerant strain and its parent strain were deciphered. • Comparative genomic and proteomic analysis was applied to understand butanol tolerance. • None of the differentially expressed proteins have mutations in their corresponding genes. • Mutations in the ribosome might be responsible for the global difference in the proteome. - Abstract: Clostridium acetobutylicum strain Rh8 is a butanol-tolerant mutant which can tolerate up to 19 g/L butanol, 46% higher than that of its parent strain DSM 1731. We previously performed comparative cytoplasm- and membrane-proteomic analyses to understand the mechanism underlying the improved butanol tolerance of strain Rh8. In this work, we further extended this comparison to the genomic level. Compared with the genome of the parent strain DSM 1731, two insertion sites, four deletion sites, and 67 single nucleotide variations (SNVs) are distributed throughout the genome of strain Rh8. Among the 67 SNVs, 16 SNVs are located in the predicted promoters and intergenic regions, while 29 SNVs are located in the coding sequence, affecting a total of 21 proteins involved in transport, cell structure, DNA replication, and protein translation. The remaining 22 SNVs are located in the ribosomal genes, affecting a total of 12 rRNA genes in different operons. Analysis of previous comparative proteomic data indicated that none of the differentially expressed proteins have mutations in their corresponding genes. Rchange Algorithms analysis indicated that the mutations that occurred in the ribosomal genes might change the thermodynamic characteristics of the ribosomal RNA and thus affect the translation strength of these proteins. Taken together, the improved butanol tolerance of C. acetobutylicum strain Rh8 might be acquired through regulating the translational process to achieve different expression strength of genes involved in butanol tolerance.

  8. Functional genomic Landscape of Human Breast Cancer drivers, vulnerabilities, and resistance

    PubMed Central

    Marcotte, Richard; Sayad, Azin; Brown, Kevin R.; Sanchez-Garcia, Felix; Reimand, Jüri; Haider, Maliha; Virtanen, Carl; Bradner, James E.; Bader, Gary D.; Mills, Gordon B.; Pe’er, Dana; Moffat, Jason; Neel, Benjamin G.

    2016-01-01

    Summary Large-scale genomic studies have identified multiple somatic aberrations in breast cancer, including copy number alterations, and point mutations. Still, identifying causal variants and emergent vulnerabilities that arise as a consequence of genetic alterations remain major challenges. We performed whole genome shRNA “dropout screens” on 77 breast cancer cell lines. Using a hierarchical linear regression algorithm to score our screen results and integrate them with accompanying detailed genetic and proteomic information, we identify vulnerabilities in breast cancer, including candidate “drivers,” and reveal general functional genomic properties of cancer cells. Comparisons of gene essentiality with drug sensitivity data suggest potential resistance mechanisms, effects of existing anti-cancer drugs, and opportunities for combination therapy. Finally, we demonstrate the utility of this large dataset by identifying BRD4 as a potential target in luminal breast cancer, and PIK3CA mutations as a resistance determinant for BET-inhibitors. PMID:26771497

  9. Functional Genomic Landscape of Human Breast Cancer Drivers, Vulnerabilities, and Resistance.

    PubMed

    Marcotte, Richard; Sayad, Azin; Brown, Kevin R; Sanchez-Garcia, Felix; Reimand, Jüri; Haider, Maliha; Virtanen, Carl; Bradner, James E; Bader, Gary D; Mills, Gordon B; Pe'er, Dana; Moffat, Jason; Neel, Benjamin G

    2016-01-14

    Large-scale genomic studies have identified multiple somatic aberrations in breast cancer, including copy number alterations and point mutations. Still, identifying causal variants and emergent vulnerabilities that arise as a consequence of genetic alterations remain major challenges. We performed whole-genome small hairpin RNA (shRNA) "dropout screens" on 77 breast cancer cell lines. Using a hierarchical linear regression algorithm to score our screen results and integrate them with accompanying detailed genetic and proteomic information, we identify vulnerabilities in breast cancer, including candidate "drivers," and reveal general functional genomic properties of cancer cells. Comparisons of gene essentiality with drug sensitivity data suggest potential resistance mechanisms, effects of existing anti-cancer drugs, and opportunities for combination therapy. Finally, we demonstrate the utility of this large dataset by identifying BRD4 as a potential target in luminal breast cancer and PIK3CA mutations as a resistance determinant for BET-inhibitors. Copyright © 2016 Elsevier Inc. All rights reserved.

  10. Genomic, proteomic and biochemical analysis of the chitinolytic machinery of Serratia marcescens BJL200.

    PubMed

    Tuveng, Tina R; Hagen, Live Heldal; Mekasha, Sophanit; Frank, Jeremy; Arntzen, Magnus Øverlie; Vaaje-Kolstad, Gustav; Eijsink, Vincent G H

    2017-04-01

    The chitinolytic machinery of Serratia marcescens BJL200 has been studied in detail over the last couple of decades, however, the proteome secreted by this Gram-negative bacterium during growth on chitin has not been studied in depth. In addition, the genome of this most studied chitinolytic Serratia strain has until now, not been sequenced. We report a draft genome sequence for S. marcescens BJL200. Using label-free quantification (LFQ) proteomics and a recently developed plate-method for assessing secretomes during growth on solid substrates, we find that, as expected, the chitin-active enzymes (ChiA, B, C, and CBP21) are produced in high amounts when the bacterium grows on chitin. Other proteins produced in high amounts after bacterial growth on chitin provide interesting targets for further exploration of the proteins involved in degradation of chitin-rich biomasses. The genome encodes a fourth chitinase (ChiD), which is produced in low amounts during growth on chitin. Studies of chitin degradation with mixtures of recombinantly produced chitin-degrading enzymes showed that ChiD does not contribute to the overall efficiency of the process. ChiD is capable of converting N,N'-diacetyl chitobiose to N-acetyl glucosamine, but is less efficient than another enzyme produced for this purpose, the Chitobiase. Thus, the role of ChiD in chitin degradation, if any, remains unclear. Copyright © 2017 Elsevier B.V. All rights reserved.

  11. Prioritization of potential drug targets against P. aeruginosa by core proteomic analysis using computational subtractive genomics and Protein-Protein interaction network.

    PubMed

    Uddin, Reaz; Jamil, Faiza

    2018-06-01

    Pseudomonas aeruginosa is an opportunistic gram-negative bacterium that has the capability to acquire resistance under hostile conditions and become a threat worldwide. It is involved in nosocomial infections. In the current study, potential novel drug targets against P. aeruginosa have been identified using core proteomic analysis and Protein-Protein Interactions (PPIs) studies. The non-redundant reference proteome of 68 strains having complete genome and latest assembly version of P. aeruginosa were downloaded from ftp NCBI RefSeq server in October 2016. The standalone CD-HIT tool was used to cluster ortholog proteins (having >=80% amino acid identity) present in all strains. The pan-proteome was clustered in 12,380 Clusters of Orthologous Proteins (COPs). By using in-house shell scripts, 3252 common COPs were extracted out and designated as clusters of core proteome. The core proteome of PAO1 strain was selected by fetching PAO1's proteome from common COPs. As a result, 1212 proteins were shortlisted that are non-homologous to the human but essential for the survival of the pathogen. Among these 1212 proteins, 321 proteins are conserved hypothetical proteins. Considering their potential as drug target, those 321 hypothetical proteins were selected and their probable functions were characterized. Based on the druggability criteria, 18 proteins were shortlisted. The interacting partners were identified by investigating the PPIs network using STRING v10 database. Subsequently, 8 proteins were shortlisted as 'hub proteins' and proposed as potential novel drug targets against P. aeruginosa. The study is interesting for the scientific community working to identify novel drug targets against MDR pathogens particularly P. aeruginosa. Copyright © 2018 Elsevier Ltd. All rights reserved.
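    The core-proteome step described above (keep only the clusters that are present in every strain, then pull out the reference strain's members) can be sketched as a set operation. The study used CD-HIT and in-house shell scripts; the Python below is only an illustrative stand-in, and the function names, cluster IDs, protein IDs, and the strain labels other than PAO1 are hypothetical.

```python
def core_clusters(clusters, strains):
    """Return the clusters that contain at least one protein from every strain.

    `clusters` maps a cluster ID to a list of (strain, protein_id) pairs,
    e.g. as parsed from a clustering tool's output (parsing not shown).
    """
    all_strains = set(strains)
    return {
        cluster_id: members
        for cluster_id, members in clusters.items()
        if {strain for strain, _ in members} >= all_strains
    }


def reference_core_proteins(core, reference):
    """Collect the reference strain's proteins from the core clusters."""
    return sorted(
        protein_id
        for members in core.values()
        for strain, protein_id in members
        if strain == reference
    )


# Toy input: three strains and three clusters, one of which misses a strain.
toy_strains = ["PAO1", "strain_B", "strain_C"]
toy_clusters = {
    "c1": [("PAO1", "p1"), ("strain_B", "q1"), ("strain_C", "r1")],
    "c2": [("PAO1", "p2"), ("strain_B", "q2")],  # absent from strain_C
    "c3": [("PAO1", "p3"), ("PAO1", "p3b"), ("strain_B", "q3"), ("strain_C", "r3")],
}

toy_core = core_clusters(toy_clusters, toy_strains)
pao1_core = reference_core_proteins(toy_core, "PAO1")
```

    In this toy case cluster c2 drops out because one strain lacks it, while paralogs of the reference strain inside a core cluster (p3 and p3b) are both retained.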

  12. Streptococcus iniae SF1: Complete Genome Sequence, Proteomic Profile, and Immunoprotective Antigens

    PubMed Central

    Zhang, Bao-cun; Zhang, Jian; Sun, Li

    2014-01-01

    Streptococcus iniae is a Gram-positive bacterium that is reckoned one of the most severe aquaculture pathogens. It has a broad host range among farmed marine and freshwater fish and can also cause zoonotic infection in humans. Here we report for the first time the complete genome sequence as well as the host factor-induced proteomic profile of a pathogenic S. iniae strain, SF1, a serotype I isolate from diseased fish. SF1 possesses a single chromosome of 2,149,844 base pairs, which contains 2,125 predicted protein coding sequences (CDS), 12 rRNA genes, and 45 tRNA genes. Among the protein-encoding CDS are genes involved in resource acquisition and utilization, signal sensing and transduction, carbohydrate metabolism, and defense against host immune response. Potential virulence genes include those encoding adhesins, autolysins, toxins, exoenzymes, and proteases. In addition, two putative prophages and a CRISPR-Cas system were found in the genome, the latter containing a CRISPR locus and four cas genes. Proteomic analysis detected 21 secreted proteins whose expressions were induced by host serum. Five of the serum-responsive proteins were subjected to immunoprotective analysis, which revealed that two of the proteins were highly protective against lethal S. iniae challenge when used as purified recombinant subunit vaccines. Taken together, these results provide an important molecular basis for future study of S. iniae in various aspects, in particular those related to pathogenesis and disease control. PMID:24621602

  13. Exploration of panviral proteome: high-throughput cloning and functional implications in virus-host interactions.

    PubMed

    Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M; LaBaer, Joshua

    2014-01-01

    Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.

  14. Proteome Characterization of Leaves in Common Bean

    PubMed Central

    Robison, Faith M.; Heuberger, Adam L.; Brick, Mark A.; Prenni, Jessica E.

    2015-01-01

    Dry edible bean (Phaseolus vulgaris L.) is a globally relevant food crop. The bean genome was recently sequenced and annotated allowing for proteomics investigations aimed at characterization of leaf phenotypes important to agriculture. The objective of this study was to utilize a shotgun proteomics approach to characterize the leaf proteome and to identify protein abundance differences between two bean lines with known variation in their physiological resistance to biotic stresses. Overall, 640 proteins were confidently identified. Among these are proteins known to be involved in a variety of molecular functions including oxidoreductase activity, binding peroxidase activity, and hydrolase activity. Twenty nine proteins were found to significantly vary in abundance (p-value < 0.05) between the two bean lines, including proteins associated with biotic stress. To our knowledge, this work represents the first large scale shotgun proteomic analysis of beans and our results lay the groundwork for future studies designed to investigate the molecular mechanisms involved in pathogen resistance. PMID:28248269

  15. A comprehensive survey of the Plasmodium life cycle by genomic, transcriptomic, and proteomic analyses.

    PubMed

    Hall, Neil; Karras, Marianna; Raine, J Dale; Carlton, Jane M; Kooij, Taco W A; Berriman, Matthew; Florens, Laurence; Janssen, Christoph S; Pain, Arnab; Christophides, Georges K; James, Keith; Rutherford, Kim; Harris, Barbara; Harris, David; Churcher, Carol; Quail, Michael A; Ormond, Doug; Doggett, Jon; Trueman, Holly E; Mendoza, Jacqui; Bidwell, Shelby L; Rajandream, Marie-Adele; Carucci, Daniel J; Yates, John R; Kafatos, Fotis C; Janse, Chris J; Barrell, Bart; Turner, C Michael R; Waters, Andrew P; Sinden, Robert E

    2005-01-07

    Plasmodium berghei and Plasmodium chabaudi are widely used model malaria species. Comparison of their genomes, integrated with proteomic and microarray data, with the genomes of Plasmodium falciparum and Plasmodium yoelii revealed a conserved core of 4500 Plasmodium genes in the central regions of the 14 chromosomes and highlighted genes evolving rapidly because of stage-specific selective pressures. Four strategies for gene expression are apparent during the parasites' life cycle: (i) housekeeping; (ii) host-related; (iii) strategy-specific related to invasion, asexual replication, and sexual development; and (iv) stage-specific. We observed posttranscriptional gene silencing through translational repression of messenger RNA during sexual development, and a 47-base 3' untranslated region motif is implicated in this process.

  16. Statistical Methods for Proteomic Biomarker Discovery based on Feature Extraction or Functional Modeling Approaches.

    PubMed

    Morris, Jeffrey S

    2012-01-01

    In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational aspects of comparative proteomic studies, and summarizes contributions I along with numerous collaborators have made. First, there is an overview of comparative proteomics technologies, followed by a discussion of important experimental design and preprocessing issues that must be considered before statistical analysis can be done. Next, the two key approaches to analyzing proteomics data, feature extraction and functional modeling, are described. Feature extraction involves detection and quantification of discrete features like peaks or spots that theoretically correspond to different proteins in the sample. After an overview of the feature extraction approach, specific methods for mass spectrometry (Cromwell) and 2D gel electrophoresis (Pinnacle) are described. The functional modeling approach involves modeling the proteomic data in their entirety as functions or images. A general discussion of the approach is followed by the presentation of a specific method that can be applied, wavelet-based functional mixed models, and its extensions. All methods are illustrated by application to two example proteomic data sets, one from mass spectrometry and one from 2D gel electrophoresis. While the specific methods

  17. Genomic and Proteomic Analyses of the Fungus Arthrobotrys oligospora Provide Insights into Nematode-Trap Formation

    PubMed Central

    Feng, Yun; Li, Xiaomin; Zou, Chenggang; Xu, Jianping; Ren, Yan; Mi, Qili; Wu, Junli; Liu, Shuqun; Liu, Yu; Huang, Xiaowei; Wang, Haiyan; Niu, Xuemei; Li, Juan; Liang, Lianming; Luo, Yanlu; Ji, Kaifang; Zhou, Wei; Yu, Zefen; Li, Guohong; Liu, Yajun; Li, Lei; Qiao, Min; Feng, Lu; Zhang, Ke-Qin

    2011-01-01

    Nematode-trapping fungi are “carnivorous” and attack their hosts using specialized trapping devices. The morphological development of these traps is the key indicator of their switch from saprophytic to predacious lifestyles. Here, the genome of the nematode-trapping fungus Arthrobotrys oligospora Fres. (ATCC24927) was reported. The genome contains 40.07 Mb assembled sequence with 11,479 predicted genes. Comparative analysis showed that A. oligospora shared many more genes with pathogenic fungi than with non-pathogenic fungi. Specifically, compared to several sequenced ascomycete fungi, the A. oligospora genome has a larger number of pathogenicity-related genes in the subtilisin, cellulase, cellobiohydrolase, and pectinesterase gene families. Searching against the pathogen-host interaction gene database identified 398 homologous genes involved in pathogenicity in other fungi. The analysis of repetitive sequences provided evidence for repeat-induced point mutations in A. oligospora. Proteomic and quantitative PCR (qPCR) analyses revealed that 90 genes were significantly up-regulated at the early stage of trap-formation by nematode extracts and most of these genes were involved in translation, amino acid metabolism, carbohydrate metabolism, cell wall and membrane biogenesis. Based on the combined genomic, proteomic and qPCR data, a model for the formation of nematode trapping device in this fungus was proposed. In this model, multiple fungal signal transduction pathways are activated by its nematode prey to further regulate downstream genes associated with diverse cellular processes such as energy metabolism, biosynthesis of the cell wall and adhesive proteins, cell division, glycerol accumulation and peroxisome biogenesis. This study will facilitate the identification of pathogenicity-related genes and provide a broad foundation for understanding the molecular and evolutionary mechanisms underlying fungi-nematodes interactions. PMID:21909256

  18. Genomic and proteomic analyses of the fungus Arthrobotrys oligospora provide insights into nematode-trap formation.

    PubMed

    Yang, Jinkui; Wang, Lei; Ji, Xinglai; Feng, Yun; Li, Xiaomin; Zou, Chenggang; Xu, Jianping; Ren, Yan; Mi, Qili; Wu, Junli; Liu, Shuqun; Liu, Yu; Huang, Xiaowei; Wang, Haiyan; Niu, Xuemei; Li, Juan; Liang, Lianming; Luo, Yanlu; Ji, Kaifang; Zhou, Wei; Yu, Zefen; Li, Guohong; Liu, Yajun; Li, Lei; Qiao, Min; Feng, Lu; Zhang, Ke-Qin

    2011-09-01

    Nematode-trapping fungi are "carnivorous" and attack their hosts using specialized trapping devices. The morphological development of these traps is the key indicator of their switch from saprophytic to predacious lifestyles. Here, the genome of the nematode-trapping fungus Arthrobotrys oligospora Fres. (ATCC24927) was reported. The genome contains 40.07 Mb assembled sequence with 11,479 predicted genes. Comparative analysis showed that A. oligospora shared many more genes with pathogenic fungi than with non-pathogenic fungi. Specifically, compared to several sequenced ascomycete fungi, the A. oligospora genome has a larger number of pathogenicity-related genes in the subtilisin, cellulase, cellobiohydrolase, and pectinesterase gene families. Searching against the pathogen-host interaction gene database identified 398 homologous genes involved in pathogenicity in other fungi. The analysis of repetitive sequences provided evidence for repeat-induced point mutations in A. oligospora. Proteomic and quantitative PCR (qPCR) analyses revealed that 90 genes were significantly up-regulated at the early stage of trap-formation by nematode extracts and most of these genes were involved in translation, amino acid metabolism, carbohydrate metabolism, cell wall and membrane biogenesis. Based on the combined genomic, proteomic and qPCR data, a model for the formation of nematode trapping device in this fungus was proposed. In this model, multiple fungal signal transduction pathways are activated by its nematode prey to further regulate downstream genes associated with diverse cellular processes such as energy metabolism, biosynthesis of the cell wall and adhesive proteins, cell division, glycerol accumulation and peroxisome biogenesis. This study will facilitate the identification of pathogenicity-related genes and provide a broad foundation for understanding the molecular and evolutionary mechanisms underlying fungi-nematodes interactions.

  19. Stagonospora nodorum: From pathology to genomics and host resistance

    USDA-ARS's Scientific Manuscript database

    Stagonospora nodorum is a major necrotrophic pathogen of wheat that causes the diseases Stagonospora nodorum leaf and glume blotch. A series of tools and resources, including functional genomics, a genome sequence, proteomics and metabolomics, host-mapping populations, and a worldwide collection of ...

  20. A proteome view of structural, functional, and taxonomic characteristics of major protein domain clusters.

    PubMed

    Sun, Chia-Tsen; Chiang, Austin W T; Hwang, Ming-Jing

    2017-10-27

    Proteome-scale bioinformatics research is increasingly conducted as the number of completely sequenced genomes increases, but analysis of protein domains (PDs) usually relies on similarity in their amino acid sequences and/or three-dimensional structures. Here, we present results from a bi-clustering analysis on presence/absence data for 6,580 unique PDs in 2,134 species with a sequenced genome, thus covering a complete set of proteins, for the three superkingdoms of life, Bacteria, Archaea, and Eukarya. Our analysis revealed eight distinctive PD clusters, which, following an analysis of enrichment of Gene Ontology functions and CATH classification of protein structures, were shown to exhibit structural and functional properties that are taxa-characteristic. For example, the largest cluster is ubiquitous in all three superkingdoms, constituting a set of 1,472 persistent domains created early in evolution and retained in living organisms and characterized by basic cellular functions and ancient structural architectures, while an Archaea and Eukarya bi-superkingdom cluster suggests its PDs may have existed in the ancestor of the two superkingdoms, and others are single superkingdom- or taxa (e.g. Fungi)-specific. These results help increase our appreciation of PD diversity and our knowledge of how PDs are used in species, with implications for species evolution.

  1. Challenges for proteomics core facilities.

    PubMed

    Lilley, Kathryn S; Deery, Michael J; Gatto, Laurent

    2011-03-01

    Many analytical techniques have been executed by core facilities established within academic, pharmaceutical and other industrial institutions. The centralization of such facilities ensures a level of expertise and hardware which often cannot be supported by individual laboratories. The establishment of a core facility thus makes the technology available for multiple researchers in the same institution. Often, the services within the core facility are also opened out to researchers from other institutions, frequently with a fee being levied for the service provided. In the 1990s, with the onset of the age of genomics, there was an abundance of DNA analysis facilities, many of which have since disappeared from institutions and are now available through commercial sources. Ten years on, as proteomics was beginning to be utilized by many researchers, this technology found itself an ideal candidate for being placed within a core facility. We discuss what in our view are the daily challenges of proteomics core facilities. We also examine the potential unmet needs of the proteomics core facility that may also be applicable to proteomics laboratories which do not function as core facilities. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. The First Genomic and Proteomic Characterization of a Deep-Sea Sulfate Reducer: Insights into the Piezophilic Lifestyle of Desulfovibrio piezophilus

    PubMed Central

    Pradel, Nathalie; Ji, Boyang; Gimenez, Grégory; Talla, Emmanuel; Lenoble, Patricia; Garel, Marc; Tamburini, Christian; Fourquet, Patrick; Lebrun, Régine; Bertin, Philippe; Denis, Yann; Pophillat, Matthieu; Barbe, Valérie; Ollivier, Bernard; Dolla, Alain

    2013-01-01

    Desulfovibrio piezophilus strain C1TLV30T is a piezophilic anaerobe that was isolated from wood falls in the Mediterranean deep-sea. D. piezophilus represents a unique model for studying the adaptation of sulfate-reducing bacteria to hydrostatic pressure. Here, we report the 3.6 Mbp genome sequence of this piezophilic bacterium. An analysis of the genome revealed the presence of seven genomic islands as well as gene clusters that are most likely linked to life at a high hydrostatic pressure. Comparative genomics and differential proteomics identified the transport of solutes and amino acids as well as amino acid metabolism as major cellular processes for the adaptation of this bacterium to hydrostatic pressure. In addition, the proteome profiles showed that the abundance of key enzymes that are involved in sulfate reduction was dependent on hydrostatic pressure. A comparative analysis of orthologs from the non-piezophilic marine bacterium D. salexigens and D. piezophilus identified aspartic acid, glutamic acid, lysine, asparagine, serine and tyrosine as the amino acids preferentially replaced by arginine, histidine, alanine and threonine in the piezophilic strain. This work reveals the adaptation strategies developed by a sulfate reducer to a deep-sea lifestyle. PMID:23383081

  3. The wheat chloroplastic proteome.

    PubMed

    Kamal, Abu Hena Mostafa; Cho, Kun; Choi, Jong-Soon; Bae, Kwang-Hee; Komatsu, Setsuko; Uozumi, Nobuyuki; Woo, Sun Hee

    2013-11-20

    With the availability of plant genome sequences, mass spectrometry-based analysis of plant proteins has become promising and widely adopted. Determining the proteome of a cell is still a challenging task, complicated by proteome dynamics and complexity. The chloroplast is of particular interest to plant biologists because of its intricate biochemical pathways for indispensable metabolic functions. This review gives an overview of proteomic studies conducted in wheat, with a special focus on subcellular proteomics of the chloroplast and on salt and water stress. In recent years, we and other groups have attempted to understand photosynthesis in wheat and abiotic stress under imposed salt and water deficit during the vegetative stage. Those studies provide interesting results leading to a better understanding of photosynthesis and to the identification of stress-responsive proteins. Indeed, recent studies have aimed at resolving the photosynthesis pathway in wheat. The review focuses on proteomic analyses that combine two complementary approaches, 2-DE and shotgun methods, coupled to high-throughput mass spectrometry (LTQ-FTICR and MALDI-TOF/TOF), in order to better understand the proteins responsible for photosynthesis and abiotic stress (salt and water) responses in the wheat chloroplast. We discuss the identification of the most abundant protein in the wheat chloroplast and of stress-responsive proteins under salt and water stress in chloroplasts of wheat seedlings, thus providing a proteomic view of the events during the development of seedlings under stress conditions. The chloroplast is of particular interest to plant biologists because of its intricate biochemical pathways for indispensable metabolic functions. This is an overview of proteomic studies conducted in wheat, with a special focus on subcellular proteomics of the chloroplast and on salt and water stress. We have attempted to understand photosynthesis in wheat and abiotic stress under imposed salt and water deficit during the seedling stage. Those studies

  4. Genomic and proteomic evidences unravel the UV-resistome of the poly-extremophile Acinetobacter sp. Ver3

    PubMed Central

    Kurth, Daniel; Belfiore, Carolina; Gorriti, Marta F.; Cortez, Néstor; Farias, María E.; Albarracín, Virginia H.

    2015-01-01

    Ultraviolet radiation can damage biomolecules, with detrimental or even lethal effects for life. Even though lower wavelengths are filtered by the ozone layer, a significant amount of harmful UV-B and UV-A radiation reaches Earth’s surface, particularly in high altitude environments. High-altitude Andean lakes (HAALs) are a group of dispersed shallow lakes and salterns, located at the Dry Central Andes region in South America at altitudes above 3,000 m. As it is considered one of the highest UV-exposed environments, HAAL microbes constitute model systems to study UV-resistance mechanisms in environmental bacteria at various complexity levels. Herein, we present the genome sequence of Acinetobacter sp. Ver3, a gammaproteobacterium isolated from Lake Verde (4,400 m), together with further experimental evidence supporting the phenomenological observations regarding this bacterium’s ability to cope with increased UV-induced DNA damage. Comparison with the genomes of other Acinetobacter strains highlighted a number of unique genes, such as a novel cryptochrome. Proteomic profiling of UV-exposed cells identified up-regulated proteins such as a specific cytoplasmic catalase, a putative regulator, and proteins associated with amino acid and protein synthesis. Down-regulated proteins were related to several energy-generating pathways such as glycolysis, beta-oxidation of fatty acids, and electronic respiratory chain. To the best of our knowledge, this is the first report on a genome from a polyextremophilic Acinetobacter strain. From the genomic and proteomic data, a “UV-resistome” was defined, encompassing the genes that would support the outstanding UV-resistance of this strain. PMID:25954258

  5. Genomic and proteomic evidences unravel the UV-resistome of the poly-extremophile Acinetobacter sp. Ver3.

    PubMed

    Kurth, Daniel; Belfiore, Carolina; Gorriti, Marta F; Cortez, Néstor; Farias, María E; Albarracín, Virginia H

    2015-01-01

    Ultraviolet radiation can damage biomolecules, with detrimental or even lethal effects for life. Even though lower wavelengths are filtered by the ozone layer, a significant amount of harmful UV-B and UV-A radiation reaches Earth's surface, particularly in high altitude environments. High-altitude Andean lakes (HAALs) are a group of dispersed shallow lakes and salterns, located at the Dry Central Andes region in South America at altitudes above 3,000 m. As it is considered one of the highest UV-exposed environments, HAAL microbes constitute model systems to study UV-resistance mechanisms in environmental bacteria at various complexity levels. Herein, we present the genome sequence of Acinetobacter sp. Ver3, a gammaproteobacterium isolated from Lake Verde (4,400 m), together with further experimental evidence supporting the phenomenological observations regarding this bacterium's ability to cope with increased UV-induced DNA damage. Comparison with the genomes of other Acinetobacter strains highlighted a number of unique genes, such as a novel cryptochrome. Proteomic profiling of UV-exposed cells identified up-regulated proteins such as a specific cytoplasmic catalase, a putative regulator, and proteins associated with amino acid and protein synthesis. Down-regulated proteins were related to several energy-generating pathways such as glycolysis, beta-oxidation of fatty acids, and electronic respiratory chain. To the best of our knowledge, this is the first report on a genome from a polyextremophilic Acinetobacter strain. From the genomic and proteomic data, a "UV-resistome" was defined, encompassing the genes that would support the outstanding UV-resistance of this strain.

  6. Thermosensitivity of growth is determined by chaperone-mediated proteome reallocation

    PubMed Central

    Chen, Ke; Gao, Ye; Mih, Nathan; O’Brien, Edward J.; Yang, Laurence; Palsson, Bernhard O.

    2017-01-01

    Maintenance of a properly folded proteome is critical for bacterial survival at notably different growth temperatures. Understanding the molecular basis of thermoadaptation has progressed in two main directions, the sequence and structural basis of protein thermostability and the mechanistic principles of protein quality control assisted by chaperones. Yet we do not fully understand how structural integrity of the entire proteome is maintained under stress and how it affects cellular fitness. To address this challenge, we reconstruct a genome-scale protein-folding network for Escherichia coli and formulate a computational model, FoldME, that provides statistical descriptions of multiscale cellular response consistent with many datasets. FoldME simulations show (i) that the chaperones act as a system when they respond to unfolding stress rather than achieving efficient folding of any single component of the proteome, (ii) how the proteome is globally balanced between chaperones for folding and the complex machinery synthesizing the proteins in response to perturbation, (iii) how this balancing determines growth rate dependence on temperature and is achieved through nonspecific regulation, and (iv) how thermal instability of the individual protein affects the overall functional state of the proteome. Overall, these results expand our view of cellular regulation, from targeted specific control mechanisms to global regulation through a web of nonspecific competing interactions that modulate the optimal reallocation of cellular resources. The methodology developed in this study enables genome-scale integration of environment-dependent protein properties and a proteome-wide study of cellular stress responses. PMID:29073085

  7. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  8. Tools to covisualize and coanalyze proteomic data with genomes and transcriptomes: validation of genes and alternative mRNA splicing.

    PubMed

    Pang, Chi Nam Ignatius; Tay, Aidan P; Aya, Carlos; Twine, Natalie A; Harkness, Linda; Hart-Smith, Gene; Chia, Samantha Z; Chen, Zhiliang; Deshpande, Nandan P; Kaakoush, Nadeem O; Mitchell, Hazel M; Kassem, Moustapha; Wilkins, Marc R

    2014-01-03

    Direct links between proteomic and genomic/transcriptomic data are not frequently made, partly because of lack of appropriate bioinformatics tools. To help address this, we have developed the PG Nexus pipeline. The PG Nexus allows users to covisualize peptides in the context of genomes or genomic contigs, along with RNA-seq reads. This is done in the Integrated Genome Viewer (IGV). A Results Analyzer reports the precise base position where LC-MS/MS-derived peptides cover genes or gene isoforms, on the chromosomes or contigs where this occurs. In prokaryotes, the PG Nexus pipeline facilitates the validation of genes, where annotation or gene prediction is available, or the discovery of genes using a "virtual protein"-based unbiased approach. We illustrate this with a comprehensive proteogenomics analysis of two strains of Campylobacter concisus. For higher eukaryotes, the PG Nexus facilitates gene validation and supports the identification of mRNA splice junction boundaries and splice variants that are protein-coding. This is illustrated with an analysis of splice junctions covered by human phosphopeptides, and other examples of relevance to the Chromosome-Centric Human Proteome Project. The PG Nexus is open-source and available from https://github.com/IntersectAustralia/ap11_Samifier. It has been integrated into Galaxy and made available in the Galaxy tool shed.
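
    The idea of reporting the base positions that a peptide covers can be illustrated with simple coordinate arithmetic. The sketch below is not code from the PG Nexus and does not reflect its actual implementation; it is a hypothetical helper, peptide_to_genome, that assumes an unspliced (single-exon, prokaryote-style) coding sequence with 1-based inclusive coordinates and a peptide located by the 0-based index of its first residue in the protein. Spliced genes would need per-exon handling that is not shown.

```python
def peptide_to_genome(cds_start, cds_end, strand, aa_start, aa_len):
    """Map a peptide to 1-based inclusive genomic coordinates.

    cds_start, cds_end: 1-based inclusive coordinates of an unspliced CDS
                        on the contig, with cds_start <= cds_end.
    strand:             "+" or "-".
    aa_start:           0-based index of the peptide's first residue.
    aa_len:             peptide length in residues.
    """
    if aa_start < 0 or aa_len <= 0:
        raise ValueError("aa_start must be >= 0 and aa_len must be > 0")

    nt_offset = aa_start * 3  # three bases per codon
    nt_len = aa_len * 3

    if strand == "+":
        start = cds_start + nt_offset
        end = start + nt_len - 1
    elif strand == "-":
        # On the reverse strand the protein starts at the CDS's right end.
        end = cds_end - nt_offset
        start = end - nt_len + 1
    else:
        raise ValueError("strand must be '+' or '-'")

    if start < cds_start or end > cds_end:
        raise ValueError("peptide extends beyond the CDS")
    return start, end


if __name__ == "__main__":
    # Hypothetical CDS at 101..400 (300 bases, 100 codons).
    print(peptide_to_genome(101, 400, "+", 10, 4))
    print(peptide_to_genome(101, 400, "-", 0, 5))
```

    Intervals of this kind are what a genome browser track would need in order to display a peptide alongside gene models and RNA-seq reads.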

  9. Tackling probiotic and gut microbiota functionality through proteomics.

    PubMed

    Ruiz, Lorena; Hidalgo, Claudio; Blanco-Míguez, Aitor; Lourenço, Anália; Sánchez, Borja; Margolles, Abelardo

    2016-09-16

    Probiotics are live microorganisms which when administered in adequate amounts confer a health benefit on the host. Many strains exert their beneficial effects after transiently colonizing the human gut, where they interact with the rest of the intestinal microorganisms and with the host mucosa. Indeed the human gut harbours a huge number of microorganisms also known as gut microbiota. Imbalances in the relative abundances of the individual components of the gut microbiota may determine the health status of the host and alterations in specific groups have been related to different diseases and metabolic disorders. Proteomics provide a set of high-throughput methodologies for protein identification that are extremely useful for studying probiotic functionality and helping in the assessment of specific health-promoting activities, such as their immunomodulatory activity, the intestinal colonization processes, and the crosstalk mechanisms with the host. Furthermore, proteomics have been used to identify markers of technological performance and stress adaptation, which helps to predict traits such as behaviour into food matrices and ability to survive passage through the gastrointestinal tract. The aim of this review is to compile studies in which proteomics have been used to assess probiotic functionality and to identify molecular players supporting their mechanisms of action. Probiotics are live microorganisms which when administered in adequate amounts confer a health benefit on the host. Molecular basis underlying the functional properties of probiotic bacteria responsible for the health promoting effects have been in the background for many years. Breakthrough of omics technologies in the probiotic and microbiota fields has had a very relevant impact in the elucidation of probiotic mechanisms and in the procedures to select these microorganisms, based on solid scientific evidence. It is unquestionable that, in the near future, the evolution of proteomic techniques

  10. Proteomics in investigation of cancer metastasis: functional and clinical consequences and methodological challenges.

    PubMed

    Maryáš, Josef; Faktor, Jakub; Dvořáková, Monika; Struhárová, Iva; Grell, Peter; Bouchal, Pavel

    2014-03-01

    Metastases are responsible for most deaths in patients with solid tumors. There is thus an urgent clinical need to better understand the exact molecular mechanisms and to find novel therapeutic targets and biomarkers of metastatic disease in various tumors. Metastases are formed in a complicated biological process called the metastatic cascade. Up to now, proteomics has enabled the identification of a number of metastasis-associated proteins and potential biomarkers in cancer tissues, microdissected cells, model systems, and secretomes. Expression profiles and the biological roles of key proteins were confirmed in verification and functional experiments. This communication reviews these observations and analyses the methodological aspects of the proteomics approaches used. Moreover, it reviews the contribution of current proteomics to the functional characterization and interactome analysis of proteins involved in various events in the metastatic cascade. It is evident that ongoing technical progress will further increase proteome coverage and sample capacity of proteomics technologies, giving complex answers to the clinical and functional questions asked. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Functional genomics of root growth and development in Arabidopsis.

    PubMed

    Iyer-Pascuzzi, Anjali; Simpson, June; Herrera-Estrella, Luis; Benfey, Philip N

    2009-04-01

    Roots are vital for the uptake of water and nutrients, and for anchorage in the soil. They are highly plastic, able to adapt developmentally and physiologically to changing environmental conditions. Understanding the molecular mechanisms behind this growth and development requires knowledge of root transcriptomics, proteomics, and metabolomics. Genomics approaches, including the recent publication of a root expression map, root proteome, and environment-specific root expression studies, are uncovering complex transcriptional and post-transcriptional networks underlying root development. The challenge is in further capitalizing on the information in these datasets to understand the fundamental principles of root growth and development. In this review, we highlight progress researchers have made toward this goal.

  12. Comprehensive Proteomic Analysis of Human Milk-derived Extracellular Vesicles Unveils a Novel Functional Proteome Distinct from Other Milk Components*

    PubMed Central

    van Herwijnen, Martijn J.C.; Zonneveld, Marijke I.; Goerdayal, Soenita; Nolte – 't Hoen, Esther N.M.; Garssen, Johan; Stahl, Bernd; Maarten Altelaar, A.F.; Redegeld, Frank A.; Wauben, Marca H.M.

    2016-01-01

    Breast milk contains several macromolecular components with distinctive functions, whereby milk fat globules and casein micelles mainly provide nutrition to the newborn, and whey contains molecules that can stimulate the newborn's developing immune system and gastrointestinal tract. Although extracellular vesicles (EV) have been identified in breast milk, their physiological function and composition has not been addressed in detail. EV are submicron sized vehicles released by cells for intercellular communication via selectively incorporated lipids, nucleic acids, and proteins. Because of the difficulty in separating EV from other milk components, an in-depth analysis of the proteome of human milk-derived EV is lacking. In this study, an extensive LC-MS/MS proteomic analysis was performed of EV that had been purified from breast milk of seven individual donors using a recently established, optimized density-gradient-based EV isolation protocol. A total of 1963 proteins were identified in milk-derived EV, including EV-associated proteins like CD9, Annexin A5, and Flotillin-1, with a remarkable overlap between the different donors. Interestingly, 198 of the identified proteins are not present in the human EV database Vesiclepedia, indicating that milk-derived EV harbor proteins not yet identified in EV of different origin. Similarly, the proteome of milk-derived EV was compared with that of other milk components. For this, data from 38 published milk proteomic studies were combined in order to construct the total milk proteome, which consists of 2698 unique proteins. Remarkably, 633 proteins identified in milk-derived EV have not yet been identified in human milk to date. Interestingly, these novel proteins include proteins involved in regulation of cell growth and controlling inflammatory signaling pathways, suggesting that milk-derived EVs could support the newborn's developing gastrointestinal tract and immune system. 
Overall, this study provides an expansion of

  13. Comprehensive Proteomic Analysis of Human Milk-derived Extracellular Vesicles Unveils a Novel Functional Proteome Distinct from Other Milk Components.

    PubMed

    van Herwijnen, Martijn J C; Zonneveld, Marijke I; Goerdayal, Soenita; Nolte-'t Hoen, Esther N M; Garssen, Johan; Stahl, Bernd; Maarten Altelaar, A F; Redegeld, Frank A; Wauben, Marca H M

    2016-11-01

    Breast milk contains several macromolecular components with distinctive functions, whereby milk fat globules and casein micelles mainly provide nutrition to the newborn, and whey contains molecules that can stimulate the newborn's developing immune system and gastrointestinal tract. Although extracellular vesicles (EV) have been identified in breast milk, their physiological function and composition has not been addressed in detail. EV are submicron sized vehicles released by cells for intercellular communication via selectively incorporated lipids, nucleic acids, and proteins. Because of the difficulty in separating EV from other milk components, an in-depth analysis of the proteome of human milk-derived EV is lacking. In this study, an extensive LC-MS/MS proteomic analysis was performed of EV that had been purified from breast milk of seven individual donors using a recently established, optimized density-gradient-based EV isolation protocol. A total of 1963 proteins were identified in milk-derived EV, including EV-associated proteins like CD9, Annexin A5, and Flotillin-1, with a remarkable overlap between the different donors. Interestingly, 198 of the identified proteins are not present in the human EV database Vesiclepedia, indicating that milk-derived EV harbor proteins not yet identified in EV of different origin. Similarly, the proteome of milk-derived EV was compared with that of other milk components. For this, data from 38 published milk proteomic studies were combined in order to construct the total milk proteome, which consists of 2698 unique proteins. Remarkably, 633 proteins identified in milk-derived EV have not yet been identified in human milk to date. Interestingly, these novel proteins include proteins involved in regulation of cell growth and controlling inflammatory signaling pathways, suggesting that milk-derived EVs could support the newborn's developing gastrointestinal tract and immune system. 
Overall, this study provides an expansion of

  14. Enabling functional genomics with genome engineering.

    PubMed

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. © 2015 Hilton and Gersbach; Published by Cold Spring Harbor Laboratory Press.

  15. Enhancement of Environmental Hazard Degradation in the Presence of Lignin: a Proteomics Study

    DOE PAGES

    Sun, Su; Xie, Shangxian; Cheng, Yanbing; ...

    2017-09-12

    Proteomics studies of fungal systems have progressed dramatically based on the availability of more fungal genome sequences in recent years. Different proteomics strategies have been applied toward characterization of fungal proteome and revealed important gene functions and proteome dynamics. Presented here is the application of shot-gun proteomic technology to study the bio-remediation of environmental hazards by white-rot fungus. Lignin, a naturally abundant component of the plant biomass, is discovered to promote the degradation of Azo dye by white-rot fungus Irpex lacteus CD2 in the lignin/dye/fungus system. Shotgun proteomics technique was used to understand degradation mechanism at the protein level for the lignin/dye/fungus system. Our proteomics study can identify about two thousand proteins (one third of the predicted white-rot fungal proteome) in a single experiment, as one of the most powerful proteomics platforms to study the fungal system to date. The study shows a significant enrichment of oxidoreduction functional category under the dye/lignin combined treatment. An in vitro validation is performed and supports our hypothesis that the synergy of Fenton reaction and manganese peroxidase might play an important role in DR5B dye degradation. The results could guide the development of effective bioremediation strategies and efficient lignocellulosic biomass conversion.

  16. Enhancement of Environmental Hazard Degradation in the Presence of Lignin: a Proteomics Study.

    PubMed

    Sun, Su; Xie, Shangxian; Cheng, Yanbing; Yu, Hongbo; Zhao, Honglu; Li, Muzi; Li, Xiaotong; Zhang, Xiaoyu; Yuan, Joshua S; Dai, Susie Y

    2017-09-12

    Proteomics studies of fungal systems have progressed dramatically based on the availability of more fungal genome sequences in recent years. Different proteomics strategies have been applied toward characterization of fungal proteome and revealed important gene functions and proteome dynamics. Presented here is the application of shot-gun proteomic technology to study the bio-remediation of environmental hazards by white-rot fungus. Lignin, a naturally abundant component of the plant biomass, is discovered to promote the degradation of Azo dye by white-rot fungus Irpex lacteus CD2 in the lignin/dye/fungus system. Shotgun proteomics technique was used to understand degradation mechanism at the protein level for the lignin/dye/fungus system. Our proteomics study can identify about two thousand proteins (one third of the predicted white-rot fungal proteome) in a single experiment, as one of the most powerful proteomics platforms to study the fungal system to date. The study shows a significant enrichment of oxidoreduction functional category under the dye/lignin combined treatment. An in vitro validation is performed and supports our hypothesis that the synergy of Fenton reaction and manganese peroxidase might play an important role in DR5B dye degradation. The results could guide the development of effective bioremediation strategies and efficient lignocellulosic biomass conversion.

  17. Enhancement of Environmental Hazard Degradation in the Presence of Lignin: a Proteomics Study

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sun, Su; Xie, Shangxian; Cheng, Yanbing

    Proteomics studies of fungal systems have progressed dramatically based on the availability of more fungal genome sequences in recent years. Different proteomics strategies have been applied toward characterization of fungal proteome and revealed important gene functions and proteome dynamics. Presented here is the application of shot-gun proteomic technology to study the bio-remediation of environmental hazards by white-rot fungus. Lignin, a naturally abundant component of the plant biomass, is discovered to promote the degradation of Azo dye by white-rot fungus Irpex lacteus CD2 in the lignin/dye/fungus system. Shotgun proteomics technique was used to understand degradation mechanism at the protein level for the lignin/dye/fungus system. Our proteomics study can identify about two thousand proteins (one third of the predicted white-rot fungal proteome) in a single experiment, as one of the most powerful proteomics platforms to study the fungal system to date. The study shows a significant enrichment of oxidoreduction functional category under the dye/lignin combined treatment. An in vitro validation is performed and supports our hypothesis that the synergy of Fenton reaction and manganese peroxidase might play an important role in DR5B dye degradation. The results could guide the development of effective bioremediation strategies and efficient lignocellulosic biomass conversion.

  18. Plant subcellular proteomics: Application for exploring optimal cell function in soybean.

    PubMed

    Wang, Xin; Komatsu, Setsuko

    2016-06-30

    Plants have evolved complicated responses to developmental changes and stressful environmental conditions. Subcellular proteomics has the potential to elucidate localized cellular responses and investigate communications among subcellular compartments during plant development and in response to biotic and abiotic stresses. Soybean, which is a valuable legume crop rich in protein and vegetable oil, can grow in several climatic zones; however, the growth and yield of soybean are markedly decreased under stresses. To date, numerous proteomic studies have been performed in soybean to examine the specific protein profiles of cell wall, plasma membrane, nucleus, mitochondrion, chloroplast, and endoplasmic reticulum. In this review, methods for the purification and purity assessment of subcellular organelles from soybean are summarized. In addition, the findings from subcellular proteomic analyses of soybean during development and under stresses, particularly flooding stress, are presented and the proteins regulated among subcellular compartments are discussed. Continued advances in subcellular proteomics are expected to greatly contribute to the understanding of the responses and interactions that occur within and among subcellular compartments during development and under stressful environmental conditions. Subcellular proteomics has the potential to investigate the cellular events and interactions among subcellular compartments in response to development and stresses in plants. Soybean can grow in several climatic zones; however, the growth and yield of soybean are markedly decreased under stresses. Numerous proteomic analyses of the cell wall, plasma membrane, nucleus, mitochondrion, chloroplast, and endoplasmic reticulum have been carried out to investigate the respective proteins and their functions in soybean during development or under stresses. In this review, methods of subcellular-organelle enrichment and purity assessment are summarized. In addition, previous findings of

  19. Exploration of Panviral Proteome: High-Throughput Cloning and Functional Implications in Virus-host Interactions

    PubMed Central

    Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M.; LaBaer, Joshua

    2014-01-01

    Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies. PMID:24955142

  20. Background | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The term "proteomics" refers to a large-scale comprehensive study of a specific proteome resulting from its genome, including abundances of proteins, their variations and modifications, and interacting partners and networks in order to understand cellular processes involved. Similarly, "cancer proteomics" refers to comprehensive analyses of proteins and their derivatives translated from a specific cancer genome using a human biospecimen or a preclinical model (e.g., cultured cell or animal model).

  1. Comparative Genome and Proteome Analysis of Anopheles gambiae and Drosophila melanogaster

    NASA Astrophysics Data System (ADS)

    Zdobnov, Evgeny M.; von Mering, Christian; Letunic, Ivica; Torrents, David; Suyama, Mikita; Copley, Richard R.; Christophides, George K.; Thomasova, Dana; Holt, Robert A.; Subramanian, G. Mani; Mueller, Hans-Michael; Dimopoulos, George; Law, John H.; Wells, Michael A.; Birney, Ewan; Charlab, Rosane; Halpern, Aaron L.; Kokoza, Elena; Kraft, Cheryl L.; Lai, Zhongwu; Lewis, Suzanna; Louis, Christos; Barillas-Mury, Carolina; Nusskern, Deborah; Rubin, Gerald M.; Salzberg, Steven L.; Sutton, Granger G.; Topalis, Pantelis; Wides, Ron; Wincker, Patrick; Yandell, Mark; Collins, Frank H.; Ribeiro, Jose; Gelbart, William M.; Kafatos, Fotis C.; Bork, Peer

    2002-10-01

    Comparison of the genomes and proteomes of the two diptera Anopheles gambiae and Drosophila melanogaster, which diverged about 250 million years ago, reveals considerable similarities. However, numerous differences are also observed; some of these must reflect the selection and subsequent adaptation associated with different ecologies and life strategies. Almost half of the genes in both genomes are interpreted as orthologs and show an average sequence identity of about 56%, which is slightly lower than that observed between the orthologs of the pufferfish and human (diverged about 450 million years ago). This indicates that these two insects diverged considerably faster than vertebrates. Aligned sequences reveal that orthologous genes have retained only half of their intron/exon structure, indicating that intron gains or losses have occurred at a rate of about one per gene per 125 million years. Chromosomal arms exhibit significant remnants of homology between the two species, although only 34% of the genes colocalize in small "microsyntenic" clusters, and major interarm transfers as well as intra-arm shuffling of gene order are detected.

  2. CPTAC | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) is a national effort to accelerate the understanding of the molecular basis of cancer through the application of large-scale proteome and genome analysis, or proteogenomics.

  3. Comparative Proteomics Reveals a Significant Bias Toward Alternative Protein Isoforms with Conserved Structure and Function

    PubMed Central

    Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.

    2012-01-01

    Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is

  4. Functional genomics of root growth and development in Arabidopsis

    PubMed Central

    Iyer-Pascuzzi, Anjali; Simpson, June; Herrera-Estrella, Luis; Benfey, Philip N.

    2009-01-01

    Roots are vital for the uptake of water and nutrients, and for anchorage in the soil. They are highly plastic, able to adapt developmentally and physiologically to changing environmental conditions. Understanding the molecular mechanisms behind this growth and development requires knowledge of root transcriptomics, proteomics and metabolomics. Genomics approaches, including the recent publication of a root expression map, root proteome, and environment-specific root expression studies, are uncovering complex transcriptional and post-transcriptional networks underlying root development. The challenge is in further capitalizing on the information in these datasets to understand the fundamental principles of root growth and development. In this review, we highlight progress researchers have made toward this goal. PMID:19117793

  5. Marine proteomics: a critical assessment of an emerging technology.

    PubMed

    Slattery, Marc; Ankisetty, Sridevi; Corrales, Jone; Marsh-Hunkin, K Erica; Gochfeld, Deborah J; Willett, Kristine L; Rimoldi, John M

    2012-10-26

    The application of proteomics to marine sciences has increased in recent years because the proteome represents the interface between genotypic and phenotypic variability and, thus, corresponds to the broadest possible biomarker for eco-physiological responses and adaptations. Likewise, proteomics can provide important functional information regarding biosynthetic pathways, as well as insights into mechanism of action, of novel marine natural products. The goal of this review is to (1) explore the application of proteomics methodologies to marine systems, (2) assess the technical approaches that have been used, and (3) evaluate the pros and cons of this proteomic research, with the intent of providing a critical analysis of its future roles in marine sciences. To date, proteomics techniques have been utilized to investigate marine microbe, plant, invertebrate, and vertebrate physiology, developmental biology, seafood safety, susceptibility to disease, and responses to environmental change. However, marine proteomics studies often suffer from poor experimental design, sample processing/optimization difficulties, and data analysis/interpretation issues. Moreover, a major limitation is the lack of available annotated genomes and proteomes for most marine organisms, including several "model species". Even with these challenges in mind, there is no doubt that marine proteomics is a rapidly expanding and powerful integrative molecular research tool from which our knowledge of the marine environment, and the natural products from this resource, will be significantly expanded.

  6. Functional Module Search in Protein Networks based on Semantic Similarity Improves the Analysis of Proteomics Data*

    PubMed Central

    Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus

    2014-01-01

    The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Albeit quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868

  7. [Progress in stable isotope labeled quantitative proteomics methods].

    PubMed

    Zhou, Yuan; Shan, Yichu; Zhang, Lihua; Zhang, Yukui

    2013-06-01

    Quantitative proteomics is an important research field in the post-genomics era. There are two strategies for proteome quantification: label-free methods and stable isotope labeling methods, the latter being the most important strategy for quantitative proteomics at present. In the past few years, a number of quantitative methods have been developed, supporting rapid progress in biological research. In this work, we discuss the progress in stable isotope labeling methods for quantitative proteomics, including relative and absolute quantitative proteomics, and then give our opinions on the outlook for proteome quantification methods.

  8. Functional Genomics Analysis of Singapore Grouper Iridovirus: Complete Sequence Determination and Proteomic Analysis

    PubMed Central

    Song, Wen Jun; Qin, Qi Wei; Qiu, Jin; Huang, Can Hua; Wang, Fan; Hew, Choy Leong

    2004-01-01

    Here we report the complete genome sequence of Singapore grouper iridovirus (SGIV). Sequencing of the random shotgun and restriction endonuclease genomic libraries showed that the entire SGIV genome consists of 140,131 nucleotide bp. One hundred sixty-two open reading frames (ORFs) from the sense and antisense DNA strands, coding for lengths varying from 41 to 1,268 amino acids, were identified. Computer-assisted analyses of the deduced amino acid sequences revealed that 77 of the ORFs exhibited homologies to known virus genes, 23 of which matched functional iridovirus proteins. Forty-two putative conserved domains or signatures were detected in the National Center for Biotechnology Information CD-Search database and PROSITE database. An assortment of enzyme activities involved in DNA replication, transcription, nucleotide metabolism, cell signaling, etc., were identified. Viruses were cultured on a cell line derived from the embryonated egg of the grouper Epinephelus tauvina, isolated, and purified by sucrose gradient ultracentrifugation. The protein extract from the purified virions was analyzed by polyacrylamide gel electrophoresis followed by in-gel digestion of protein bands. Matrix-assisted laser desorption ionization-time of flight mass spectrometry and database searching led to identification of 26 proteins. Twenty of these represented novel or previously unidentified genes, which were further confirmed by reverse transcription-PCR (RT-PCR) and DNA sequencing of their respective RT-PCR products. PMID:15507645

  9. Integrative genomic and proteomic profiling of human neuroblastoma SH-SY5Y cells reveals signatures of endosulfan exposure.

    PubMed

    Gandhi, Deepa; Tarale, Prashant; Naoghare, Pravin K; Bafana, Amit; Kannan, Krishnamurthi; Sivanesan, Saravanadevi

    2016-01-01

    Endosulfan, an organochlorine pesticide, is known to induce multiple disorders/abnormalities including neuro-degenerative disorders in many animal species. However, the molecular mechanism of endosulfan induced neuronal alterations is still not well understood. In the present study, the effect of a sub-lethal concentration of endosulfan (3 μM) on human neuroblastoma cells (SH-SY5Y) was investigated using genomic and proteomic approaches. Microarray and 2D-PAGE followed by MALDI-TOF-MS analysis revealed differential expression of 831 transcripts and 16 proteins in exposed cells. A gene ontology enrichment analysis revealed that the differentially expressed genes and proteins were involved in a variety of cellular events such as neuronal developmental pathway, immune response, cell differentiation, apoptosis, transmission of nerve impulse, axonogenesis, etc. The present study attempted to explore the possible molecular mechanism of endosulfan induced neuronal alterations in SH-SY5Y cells using an integrated genomic and proteomic approach. Based on the gene and protein profiles, possible mechanisms underlying endosulfan neurotoxicity were predicted. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Quantitative trait loci mapping of the mouse plasma proteome (pQTL).

    PubMed

    Holdt, Lesca M; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel

    2013-02-01

    A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F(2) intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins.
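
    As a rough illustration of what a LOD threshold such as the 5.35 quoted above means, the sketch below computes a LOD score for one peptide at one marker by simple single-marker regression, LOD = (n/2) log10(RSS_null / RSS_marker). The abstract does not state which mapping method the authors used, so the additive regression model, the function name `lod_score`, and the toy genotype and intensity values are all assumptions made for illustration only.

```python
import math

# Significance threshold quoted in the abstract (LOD >= 5.35).
LOD_THRESHOLD = 5.35


def lod_score(genotypes, phenotypes):
    """LOD score for a single-marker additive regression (illustrative only).

    genotypes:  allele counts per animal at one marker (0, 1 or 2 in an F2 intercross)
    phenotypes: peptide peak intensities for the same animals (hypothetical values)
    """
    n = len(phenotypes)
    mean_g = sum(genotypes) / n
    mean_y = sum(phenotypes) / n
    # Sums of squares and cross-products for an ordinary least-squares fit.
    sxx = sum((g - mean_g) ** 2 for g in genotypes)
    sxy = sum((g - mean_g) * (y - mean_y) for g, y in zip(genotypes, phenotypes))
    # Residual sum of squares without and with the marker in the model.
    rss_null = sum((y - mean_y) ** 2 for y in phenotypes)
    rss_marker = rss_null - sxy ** 2 / sxx
    return (n / 2) * math.log10(rss_null / rss_marker)


# Toy data: six animals, intensity rising with allele count (made-up numbers).
toy_genotypes = [0, 1, 2, 0, 1, 2]
toy_intensities = [1.0, 2.1, 2.9, 1.1, 1.9, 3.0]
toy_lod = lod_score(toy_genotypes, toy_intensities)
is_significant = toy_lod >= LOD_THRESHOLD
```

    In a genome scan of this kind the same calculation would be repeated for every peptide at every marker, and peptides whose best LOD reaches the threshold (69 of 176 in the abstract) would be reported as pQTLs. The sketch assumes the marker is polymorphic and the fit is not perfect; otherwise the ratio is undefined.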

  11. Clinical veterinary proteomics: Techniques and approaches to decipher the animal plasma proteome.

    PubMed

    Ghodasara, P; Sadowski, P; Satake, N; Kopp, S; Mills, P C

    2017-12-01

    Over the last two decades, technological advancements in the field of proteomics have advanced our understanding of the complex biological systems of living organisms. Techniques based on mass spectrometry (MS) have emerged as powerful tools to contextualise existing genomic information and to create quantitative protein profiles from plasma, tissues or cell lines of various species. Proteomic approaches have been used increasingly in veterinary science to investigate biological processes responsible for growth, reproduction and pathological events. However, the adoption of proteomic approaches by veterinary investigators lags behind that of researchers in the human medical field. Furthermore, in contrast to human proteomics studies, interpretation of veterinary proteomic data is difficult due to the limited protein databases available for many animal species. This review article examines the current use of advanced proteomics techniques for evaluation of animal health and welfare and covers the current status of clinical veterinary proteomics research, including successful protein identification and data interpretation studies. It includes a description of an emerging tool, sequential window acquisition of all theoretical fragment ion mass spectra (SWATH-MS), available on selected mass spectrometry instruments. This newly developed data acquisition technique combines advantages of discovery and targeted proteomics approaches, and thus has the potential to advance the veterinary proteomics field by enhancing identification and reproducibility of proteomics data. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Protannotator: a semiautomated pipeline for chromosome-wise functional annotation of the "missing" human proteome.

    PubMed

    Islam, Mohammad T; Garg, Gagan; Hancock, William S; Risk, Brian A; Baker, Mark S; Ranganathan, Shoba

    2014-01-03

    The chromosome-centric human proteome project (C-HPP) aims to define the complete set of proteins encoded in each human chromosome. The neXtProt database (September 2013) lists 20,128 proteins for the human proteome, of which 3831 human proteins (∼19%) are considered "missing" according to the standard metrics table (released September 27, 2013). In support of the C-HPP initiative, we have extended the annotation strategy developed for human chromosome 7 "missing" proteins into a semiautomated pipeline to functionally annotate the "missing" human proteome. This pipeline integrates a suite of bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology, and biochemical pathways. From sequential BLAST searches, we have primarily identified homologues from reviewed nonhuman mammalian proteins with protein evidence for 1271 (33.2%) "missing" proteins, followed by 703 (18.4%) homologues from reviewed nonhuman mammalian proteins and subsequently 564 (14.7%) homologues from reviewed human proteins. Functional annotations for 1945 (50.8%) "missing" proteins were also determined. To accelerate the identification of "missing" proteins from proteomics studies, we generated proteotypic peptides in silico. Matching these proteotypic peptides to ENCODE proteogenomic data resulted in proteomic evidence for 107 (2.8%) of the 3831 "missing" proteins, while evidence from a recent membrane proteomic study supported the existence of another 15 "missing" proteins. The chromosome-wise functional annotation of all "missing" proteins is freely available to the scientific community through our web server (http://biolinfo.org/protannotator).
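
    The in silico generation of proteotypic peptides mentioned above can be sketched as follows. This is only an illustration under stated assumptions: trypsin-like cleavage (after K or R, but not before P), a peptide length window of 7 to 30 residues, and "proteotypic" reduced to "found in exactly one protein of the input set". None of these choices, nor the function names and toy sequences, come from the abstract, which does not describe the pipeline's actual rules.

```python
import re

def tryptic_digest(sequence):
    """Split a protein sequence after K or R, except when followed by P."""
    # Zero-width split points: preceded by K/R and not followed by P.
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]

def unique_peptides(proteins, min_len=7, max_len=30):
    """Return, per protein, the peptides that occur in no other protein.

    `proteins` maps a protein identifier to its amino acid sequence.
    The length window is an assumed filter, not a value from the source.
    """
    owners = {}
    for name, seq in proteins.items():
        for pep in set(tryptic_digest(seq)):
            if min_len <= len(pep) <= max_len:
                owners.setdefault(pep, set()).add(name)

    result = {name: [] for name in proteins}
    for pep, names in owners.items():
        if len(names) == 1:
            result[next(iter(names))].append(pep)
    return {name: sorted(peps) for name, peps in result.items()}

# Toy sequences (hypothetical), with a short length window for readability.
toy = {"P1": "AAKPGGRCCKDD", "P2": "CCKEEEK"}
print(tryptic_digest(toy["P1"]))
print(unique_peptides(toy, min_len=2))
```

    A production pipeline would also weigh detectability in the mass spectrometer and missed cleavages, which this sketch leaves out.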

  13. The cell envelope proteome of Aggregatibacter actinomycetemcomitans

    PubMed Central

    Smith, Kenneth P.; Fields, Julia G.; Voogt, Richard D.; Deng, Bin; Lam, Ying-Wai; Mintz, Keith P.

    2014-01-01

    The cell envelope of Gram-negative bacteria serves a critical role in maintenance of cellular homeostasis, resistance to external stress, and host-pathogen interactions. Envelope protein composition is influenced by the physiological and environmental demands placed on the bacterium. In this study, we report a comprehensive compilation of cell envelope proteins from the periodontal and systemic pathogen Aggregatibacter actinomycetemcomitans VT1169, an afimbriated serotype b strain. The urea-extracted membrane proteins were identified by mass spectrometry-based shotgun proteomics. The membrane proteome, isolated from actively growing bacteria under normal laboratory conditions, included 648 proteins representing 28% of the predicted ORFs in the genome. Bioinformatic analyses were used to annotate and predict the cellular location and function of the proteins. Surface adhesins, porins, lipoproteins, numerous influx and efflux pumps, multiple sugar, amino acid and iron transporters, and components of the type I, II and V secretion systems were identified. Periplasmic space and cytoplasmic proteins with chaperone function were also identified. 107 proteins with unknown function were associated with the cell envelope. Orthologs of a subset of these uncharacterized proteins are present in other bacterial genomes, while others are found exclusively in A. actinomycetemcomitans. This knowledge will contribute to elucidating the role of cell envelope proteins in bacterial growth and survival in the oral cavity. PMID:25055881

  14. Proteomic Analysis of the Arabidopsis Nucleolus Suggests Novel Nucleolar Functions

    PubMed Central

    Pendle, Alison F.; Clark, Gillian P.; Boon, Reinier; Lewandowska, Dominika; Lam, Yun Wah; Andersen, Jens; Mann, Matthias; Lamond, Angus I.; Brown, John W. S.; Shaw, Peter J.

    2005-01-01

    The eukaryotic nucleolus is involved in ribosome biogenesis and a wide range of other RNA metabolism and cellular functions. An important step in the functional analysis of the nucleolus is to determine the complement of proteins of this nuclear compartment. Here, we describe the first proteomic analysis of plant (Arabidopsis thaliana) nucleoli, in which we have identified 217 proteins. This allows a direct comparison of the proteomes of an important nuclear structure between two widely divergent species: human and Arabidopsis. The comparison identified many common proteins, plant-specific proteins, proteins of unknown function found in both proteomes, and proteins that were nucleolar in plants but nonnucleolar in human. Seventy-two proteins were expressed as GFP fusions and 87% showed nucleolar or nucleolar-associated localization. In a striking and unexpected finding, we have identified six components of the postsplicing exon-junction complex (EJC) involved in mRNA export and nonsense-mediated decay (NMD)/mRNA surveillance. This association was confirmed by GFP-fusion protein localization. These results raise the possibility that in plants, nucleoli may have additional functions in mRNA export or surveillance. PMID:15496452

  15. Contrasting patterns of evolutionary constraint and novelty revealed by comparative sperm proteomic analysis in Lepidoptera.

    PubMed

    Whittington, Emma; Forsythe, Desiree; Borziak, Kirill; Karr, Timothy L; Walters, James R; Dorus, Steve

    2017-12-02

    Rapid evolution is a hallmark of reproductive genetic systems and arises through the combined processes of sequence divergence, gene gain and loss, and changes in gene and protein expression. While studies aiming to disentangle the molecular ramifications of these processes are progressing, we still know little about the genetic basis of evolutionary transitions in reproductive systems. Here we conduct the first comparative analysis of sperm proteomes in Lepidoptera, a group that exhibits dichotomous spermatogenesis, in which males produce a functional fertilization-competent sperm (eupyrene) and an incompetent sperm morph lacking nuclear DNA (apyrene). Through the integrated application of evolutionary proteomics and genomics, we characterize the genomic patterns potentially associated with the origination and evolution of this unique spermatogenic process and assess the importance of genetic novelty in Lepidopteran sperm biology. Comparison of the newly characterized Monarch butterfly (Danaus plexippus) sperm proteome to those of the Carolina sphinx moth (Manduca sexta) and the fruit fly (Drosophila melanogaster) demonstrated conservation at the level of protein abundance and post-translational modification within Lepidoptera. In contrast, comparative genomic analyses across insects reveal significant divergence at two levels that differentiate the genetic architecture of sperm in Lepidoptera from other insects. First, a significant reduction in orthology among Monarch sperm genes relative to the remainder of the genome in non-Lepidopteran insect species was observed. Second, a substantial number of sperm proteins were found to be specific to Lepidoptera, in that they lack detectable homology to the genomes of more distantly related insects. Lastly, the functional importance of Lepidoptera-specific sperm proteins is broadly supported by their increased abundance relative to proteins conserved across insects. Our results identify a burst of genetic novelty

  16. Rice functional genomics research in China.

    PubMed

    Han, Bin; Xue, Yongbiao; Li, Jiayang; Deng, Xing-Wang; Zhang, Qifa

    2007-06-29

    Rice functional genomics is a scientific approach that seeks to identify and define the function of rice genes, and uncover when and how genes work together to produce phenotypic traits. Rapid progress in rice genome sequencing has facilitated research in rice functional genomics in China. The Ministry of Science and Technology of China has funded two major rice functional genomics research programmes for building up the infrastructures of the functional genomics study such as developing rice functional genomics tools and resources. The programmes were also aimed at cloning and functional analyses of a number of genes controlling important agronomic traits from rice. National and international collaborations on rice functional genomics study are accelerating rice gene discovery and application.

  17. Genomics, metagenomics and proteomics in biomining microorganisms.

    PubMed

    Valenzuela, Lissette; Chi, An; Beard, Simon; Orell, Alvaro; Guiliani, Nicolas; Shabanowitz, Jeff; Hunt, Donald F; Jerez, Carlos A

    2006-01-01

    The use of acidophilic, chemolithotrophic microorganisms capable of oxidizing iron and sulfur in industrial processes to recover metals from minerals containing copper, gold and uranium is a well established biotechnology with distinctive advantages over traditional mining. A consortium of different microorganisms participates in the oxidative reactions resulting in the extraction of dissolved metal values from ores. Considerable effort has been spent in the last years to understand the biochemistry of iron and sulfur compounds oxidation, bacteria-mineral interactions (chemotaxis, quorum sensing, adhesion, biofilm formation) and several adaptive responses allowing the microorganisms to survive in a bioleaching environment. All of these are considered key phenomena for understanding the process of biomining. The use of genomics, metagenomics and high throughput proteomics to study the global regulatory responses that the biomining community uses to adapt to their changing environment is just beginning to emerge in the last years. These powerful approaches are reviewed here since they offer the possibility of exciting new findings that will allow analyzing the community as a microbial system, determining the extent to which each of the individual participants contributes to the process, how they evolve in time to keep the conglomerate healthy and therefore efficient during the entire process of bioleaching.

  18. Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.

    PubMed

    Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin

    2014-06-13

    Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858 bp, including 26,413 clusters and 27,794 singletons. A total of 42,515 (78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10^-5, of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741 (43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was performed using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Quantitative Trait Loci Mapping of the Mouse Plasma Proteome (pQTL)

    PubMed Central

    Holdt, Lesca M.; von Delft, Annette; Nicolaou, Alexandros; Baumann, Sven; Kostrzewa, Markus; Thiery, Joachim; Teupser, Daniel

    2013-01-01

    A current challenge in the era of genome-wide studies is to determine the responsible genes and mechanisms underlying newly identified loci. Screening of the plasma proteome by high-throughput mass spectrometry (MALDI-TOF MS) is considered a promising approach for identification of metabolic and disease processes. Therefore, plasma proteome screening might be particularly useful for identifying responsible genes when combined with analysis of variation in the genome. Here, we describe a proteomic quantitative trait locus (pQTL) study of plasma proteome screens in an F2 intercross of 455 mice mapped with 177 genetic markers across the genome. A total of 69 of 176 peptides revealed significant LOD scores (≥5.35) demonstrating strong genetic regulation of distinct components of the plasma proteome. Analyses were confirmed by mechanistic studies and MALDI-TOF/TOF, liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses of the two strongest pQTLs: A pQTL for mass-to-charge ratio (m/z) 3494 (LOD 24.9, D11Mit151) was identified as the N-terminal 35 amino acids of hemoglobin subunit A (Hba) and caused by genetic variation in Hba. Another pQTL for m/z 8713 (LOD 36.4; D1Mit111) was caused by variation in apolipoprotein A2 (Apoa2) and cosegregated with HDL cholesterol. Taken together, we show that genome-wide plasma proteome profiling in combination with genome-wide genetic screening aids in the identification of causal genetic variants affecting abundance of plasma proteins. PMID:23172855

  20. Proteome Characterization Centers - TCGA

    Cancer.gov

    The centers, a component of NCI’s Clinical Proteomic Tumor Analysis Consortium, will analyze a subset of TCGA samples to define proteins translated from cancer genomes and their related biological processes.

  1. Integrative proteomics, genomics, and translational immunology approaches reveal mutated forms of Proteolipid Protein 1 (PLP1) and mutant-specific immune response in multiple sclerosis.

    PubMed

    Qendro, Veneta; Bugos, Grace A; Lundgren, Debbie H; Glynn, John; Han, May H; Han, David K

    2017-03-01

    In order to gain mechanistic insights into multiple sclerosis (MS) pathogenesis, we utilized a multi-dimensional approach to test the hypothesis that mutations in myelin proteins lead to immune activation and central nervous system autoimmunity in MS. Mass spectrometry-based proteomic analysis of human MS brain lesions revealed seven unique mutations of PLP1, a key myelin protein that is known to be destroyed in MS. Surprisingly, in-depth analysis of two MS patients at the genomic DNA and mRNA levels confirmed mutated PLP1 in RNA, but not in the genomic DNA. Quantification of wild type and mutant PLP RNA levels by qPCR further validated the presence of mutant PLP RNA in the MS patients. To seek evidence linking mutations in abundant myelin proteins and immune-mediated destruction of myelin, specific immune response against mutant PLP1 in MS patients was examined. Thus, we have designed paired, wild type and mutant peptide microarrays, and examined antibody response to multiple mutated PLP1 in sera from MS patients. Consistent with the idea of different patients exhibiting unique mutation profiles, we found that 13 out of 20 MS patients showed antibody responses against specific but not against all the mutant-PLP1 peptides. Interestingly, we found mutant PLP-directed antibody response against specific mutant peptides in the sera of pre-MS controls. The results from integrative proteomic, genomic, and immune analyses reveal a possible mechanism of mutation-driven pathogenesis in human MS. The study also highlights the need for integrative genomic and proteomic analyses for uncovering pathogenic mechanisms of human diseases. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Proteomic insights into floral biology.

    PubMed

    Li, Xiaobai; Jackson, Aaron; Xie, Ming; Wu, Dianxing; Tsai, Wen-Chieh; Zhang, Sheng

    2016-08-01

    The flower is the most important biological structure for ensuring angiosperm reproductive success. Not only does the flower contain critical reproductive organs, but the wide variation in morphology, color, and scent has evolved to entice specialized pollinators, and arguably mankind in many cases, to ensure the successful propagation of its species. Recent proteomic approaches have identified protein candidates related to these flower traits, which has shed light on a number of previously unknown mechanisms underlying these traits. This review article provides a comprehensive overview of the latest advances in proteomic research in floral biology according to the order of flower structure, from corolla to male and female reproductive organs. It summarizes mainstream proteomic methods for plant research and recent improvements on two dimensional gel electrophoresis and gel-free workflows for both peptide level and protein level analysis. The recent advances in sequencing technologies provide a new paradigm for the ever-increasing genome and transcriptome information on many organisms. It is now possible to integrate genomic and transcriptomic data with proteomic results for large-scale protein characterization, so that a global understanding of the complex molecular networks in flower biology can be readily achieved. This article is part of a Special Issue entitled: Plant Proteomics--a bridge between fundamental processes and crop production, edited by Dr. Hans-Peter Mock. Copyright © 2016 Elsevier B.V. All rights reserved.

  3. Genome projects and the functional-genomic era.

    PubMed

    Sauer, Sascha; Konthur, Zoltán; Lehrach, Hans

    2005-12-01

    The problems we face today in public health as a result of the -- fortunately -- increasing age of people and the requirements of developing countries create an urgent need for new and innovative approaches in medicine and in agronomics. Genomic and functional genomic approaches have a great potential to at least partially solve these problems in the future. Important progress has been made by procedures to decode genomic information of humans, but also of other key organisms. The basic comprehension of genomic information (and its transfer) should now give us the possibility to pursue the next important step in life science eventually leading to a basic understanding of biological information flow; the elucidation of the function of all genes and correlative products encoded in the genome, as well as the discovery of their interactions in a molecular context and the response to environmental factors. As a result of the sequencing projects, we are now able to ask important questions about sequence variation and can start to comprehensively study the function of expressed genes on different levels such as RNA, protein or the cell in a systematic context including underlying networks. In this article we review and comment on current trends in large-scale systematic biological research. A particular emphasis is put on technology developments that can provide means to accomplish the tasks of future lines of functional genomics.

  4. New Funding Opportunity Announcements (FOAs): Reissuance of Clinical Proteomic Tumor Analysis Consortium (CPTAC) | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The National Cancer Institute is soliciting applications for the reissuance of its Clinical Proteomic Tumor Analysis Consortium (CPTAC) program. CPTAC will support broad efforts focused on several cancer types to explore further the complexities of cancer proteomes and their connections to abnormalities in cancer genomes.

  5. The strategy, organization, and progress of the HUPO Human Proteome Project.

    PubMed

    Omenn, Gilbert S

    2014-04-04

    The Human Proteome Project is a major, comprehensive initiative of the Human Proteome Organization. This global collaborative effort aims to identify and characterize at least one protein product and many PTM, SAP, and splice variant isoforms from the 20,300 human protein-coding genes. The deliverables are an extensive parts list and an array of technology platforms, reagents, spectral libraries, and linked knowledge bases that advance the field and facilitate the use of proteomics by a much wider community of life scientists. Such enablement will help address the Grand Challenge of using proteomics to bridge major gaps between evidence of genomic variation and diverse phenotypes. The HUPO Human Proteome Project (HPP) has made an outstanding launch, including a special issue of the Journal of Proteome Research on the Chromosome-centric HPP with a total of 48 articles. This article is part of a Special Issue: Can Proteomics Fill the Gap Between Genomics and Phenotypes? © 2013.

  6. Making proteomics data accessible and reusable: Current state of proteomics databases and repositories

    PubMed Central

    Perez-Riverol, Yasset; Alpi, Emanuele; Wang, Rui; Hermjakob, Henning; Vizcaíno, Juan Antonio

    2015-01-01

    Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data. PMID:25158685

  7. Molecular stratification and precision medicine in systemic sclerosis from genomic and proteomic data.

    PubMed

    Martyanov, Viktor; Whitfield, Michael L

    2016-01-01

    The goal of this review is to summarize recent advances into the pathogenesis and treatment of systemic sclerosis (SSc) from genomic and proteomic studies. Intrinsic gene expression-driven molecular subtypes of SSc are reproducible across three independent datasets. These subsets are a consistent feature of SSc and are found in multiple end-target tissues, such as skin and esophagus. Intrinsic subsets as well as baseline levels of molecular target pathways are potentially predictive of clinical response to specific therapeutics, based on three recent clinical trials. A gene expression-based biomarker of modified Rodnan skin score, a measure of SSc skin severity, can be used as a surrogate outcome metric and has been validated in a recent trial. Proteome analyses have identified novel biomarkers of SSc that correlate with SSc clinical phenotypes. Integrating intrinsic gene expression subset data, baseline molecular pathway information, and serum biomarkers along with surrogate measures of modified Rodnan skin score provides molecular context in SSc clinical trials. With validation, these approaches could be used to match patients with the therapies from which they are most likely to benefit and thus increase the likelihood of clinical improvement.

  8. Making proteomics data accessible and reusable: current state of proteomics databases and repositories.

    PubMed

    Perez-Riverol, Yasset; Alpi, Emanuele; Wang, Rui; Hermjakob, Henning; Vizcaíno, Juan Antonio

    2015-03-01

    Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data. © 2014 The Authors. PROTEOMICS published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Comparative Analysis of Proteomes and Functionomes Provides Insights into Origins of Cellular Diversification

    PubMed Central

    Caetano-Anollés, Gustavo

    2013-01-01

    Reconstructing the evolutionary history of modern species is a difficult problem complicated by the conceptual and technical limitations of phylogenetic tree building methods. Here, we propose a comparative proteomic and functionomic inferential framework for genome evolution that allows resolving the tripartite division of cells and sketching their history. Evolutionary inferences were derived from the spread of conserved molecular features, such as molecular structures and functions, in the proteomes and functionomes of contemporary organisms. Patterns of use and reuse of these traits yielded significant insights into the origins of cellular diversification. Results uncovered an unprecedented strong evolutionary association between Bacteria and Eukarya while revealing marked evolutionary reductive tendencies in the archaeal genomic repertoires. The effects of nonvertical evolutionary processes (e.g., HGT, convergent evolution) were found to be limited while reductive evolution and molecular innovation appeared to be prevalent during the evolution of cells. Our study revealed a strong vertical trace in the history of proteins and associated molecular functions, which was reliably recovered using the comparative genomics approach. The trace supported the existence of a stem line of descent and the very early appearance of Archaea as a diversified superkingdom, but failed to uncover a hidden canonical pattern in which Bacteria was the first superkingdom to deploy superkingdom-specific structures and functions. PMID:24492748

  10. Brucella proteomes--a review.

    PubMed

    DelVecchio, Vito G; Wagner, Mary Ann; Eschenbrenner, Michel; Horn, Troy A; Kraycer, Jo Ann; Estock, Frank; Elzer, Phil; Mujer, Cesar V

    2002-12-20

    The proteomes of selected Brucella spp. have been extensively analyzed by utilizing current proteomic technology involving 2-DE and MALDI-MS. In Brucella melitensis, more than 500 proteins were identified. The rapid and large-scale identification of proteins in this organism was accomplished by using the annotated B. melitensis genome which is now available in the GenBank. Coupled with new and powerful tools for data analysis, differentially expressed proteins were identified and categorized into several classes. A global overview of protein expression patterns emerged, thereby facilitating the simultaneous analysis of different metabolic pathways in B. melitensis. Such a global characterization would not have been possible by using time consuming and traditional biochemical approaches. The era of post-genomic technology offers new and exciting opportunities to understand the complete biology of different Brucella species.

  11. HelmCoP: An Online Resource for Helminth Functional Genomics and Drug and Vaccine Targets Prioritization

    PubMed Central

    Taylor, Christina M.; Mitreva, Makedonka

    2011-01-01

    A vast majority of the burden from neglected tropical diseases results from helminth infections (nematodes and platyhelminthes). Parasitic helminthes infect over 2 billion people, exerting a high collective burden that rivals high-mortality conditions such as AIDS or malaria, and cause devastation to crops and livestock. The challenges to improve control of parasitic helminth infections are multi-fold and no single category of approaches will meet them all. New information such as helminth genomics, functional genomics and proteomics coupled with innovative bioinformatic approaches provide fundamental molecular information about these parasites, accelerating both basic research as well as development of effective diagnostics, vaccines and new drugs. To facilitate such studies we have developed an online resource, HelmCoP (Helminth Control and Prevention), built by integrating functional, structural and comparative genomic data from plant, animal and human helminthes, to enable researchers to develop strategies for drug, vaccine and pesticide prioritization, while also providing a useful comparative genomics platform. HelmCoP encompasses genomic data from several hosts, including model organisms, along with a comprehensive suite of structural and functional annotations, to assist in comparative analyses and to study host-parasite interactions. The HelmCoP interface, with a sophisticated query engine as a backbone, allows users to search for multi-factorial combinations of properties and serves readily accessible information that will assist in the identification of various genes of interest. HelmCoP is publicly available at: http://www.nematode.net/helmcop.html. PMID:21760913

  12. Directed proteomic analysis of the human nucleolus.

    PubMed

    Andersen, Jens S; Lyon, Carol E; Fox, Archa H; Leung, Anthony K L; Lam, Yun Wah; Steen, Hanno; Mann, Matthias; Lamond, Angus I

    2002-01-08

    The nucleolus is a subnuclear organelle containing the ribosomal RNA gene clusters and ribosome biogenesis factors. Recent studies suggest it may also have roles in RNA transport, RNA modification, and cell cycle regulation. Despite over 150 years of research into nucleoli, many aspects of their structure and function remain uncharacterized. We report a proteomic analysis of human nucleoli. Using a combination of mass spectrometry (MS) and sequence database searches, including online analysis of the draft human genome sequence, 271 proteins were identified. Over 30% of the nucleolar proteins were encoded by novel or uncharacterized genes, while the known proteins included several unexpected factors with no previously known nucleolar functions. MS analysis of nucleoli isolated from HeLa cells in which transcription had been inhibited showed that a subset of proteins was enriched. These data highlight the dynamic nature of the nucleolar proteome and show that proteins can either associate with nucleoli transiently or accumulate only under specific metabolic conditions. This extensive proteomic analysis shows that nucleoli have a surprisingly large protein complexity. The many novel factors and separate classes of proteins identified support the view that the nucleolus may perform additional functions beyond its known role in ribosome subunit biogenesis. The data also show that the protein composition of nucleoli is not static and can alter significantly in response to the metabolic state of the cell.

  13. Toward an Upgraded Honey Bee (Apis mellifera L.) Genome Annotation Using Proteogenomics.

    PubMed

    McAfee, Alison; Harpur, Brock A; Michaud, Sarah; Beavis, Ronald C; Kent, Clement F; Zayed, Amro; Foster, Leonard J

    2016-02-05

    The honey bee is a key pollinator in agricultural operations as well as a model organism for studying the genetics and evolution of social behavior. The Apis mellifera genome has been sequenced and annotated twice over, enabling proteomics and functional genomics methods for probing relevant aspects of their biology. One troubling trend that emerged from proteomic analyses is that honey bee peptide samples consistently result in lower peptide identification rates compared with other organisms. This suggests that the genome annotation can be improved, or atypical biological processes are interfering with the mass spectrometry workflow. First, we tested whether high levels of polymorphisms could explain some of the missed identifications by searching spectra against the reference proteome (OGSv3.2) versus a customized proteome of a single honey bee, but our results indicate that this contribution was minor. Likewise, error-tolerant peptide searches led us to eliminate unexpected post-translational modifications as a major factor in missed identifications. We then used a proteogenomic approach with ~1500 raw files to search for missing genes and new exons, to revive discarded annotations and to identify over 2000 new coding regions. These results will contribute to a more comprehensive genome annotation and facilitate continued research on this important insect.
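    The abstract does not say how the proteogenomic search database was built. One ingredient commonly used in proteogenomics is a six-frame translation of genomic sequence, so that spectra can match peptides from regions that are not yet annotated. The Python sketch below illustrates only that generic step; the function names and the toy sequence are assumptions for illustration, not the authors' pipeline.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons become 'X', stops '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def six_frame_translation(dna):
    """Translate a sequence in three forward and three reverse frames."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


# Hypothetical toy sequence, purely for illustration.
frames = six_frame_translation("ATGGCC")
```

    In a real workflow the translated frames would be split at stop codons and digested in silico before being searched against spectra; those steps are omitted here.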

  14. Integrative Identification of Arabidopsis Mitochondrial Proteome and Its Function Exploitation through Protein Interaction Network

    PubMed Central

    Cui, Jian; Liu, Jinghua; Li, Yuhua; Shi, Tieliu

    2011-01-01

    Mitochondria are major players in the production of energy, and host several key reactions involved in basic metabolism and biosynthesis of essential molecules. Currently, the majority of nucleus-encoded mitochondrial proteins are unknown even for the model plant Arabidopsis. We report a computational framework for predicting Arabidopsis mitochondrial proteins based on a probabilistic model, called Naive Bayesian Network, which integrates disparate genomic data generated from eight bioinformatics tools, multiple orthologous mappings, protein domain properties and co-expression patterns using 1,027 microarray profiles. Through this approach, we predicted 2,311 candidate mitochondrial proteins with 84.67% accuracy and a 2.53% false positive rate (FPR). Together with the experimentally confirmed proteins, 2,585 mitochondrial proteins (named CoreMitoP) were identified. We explored those proteins with unknown functions based on a protein-protein interaction network (PIN) and annotated novel functions for 26.65% of CoreMitoP proteins. Moreover, we found newly predicted mitochondrial proteins embedded in particular subnetworks of the PIN, mainly functioning in response to diverse environmental stresses, such as salt, drought, cold, and wounding. Candidate mitochondrial proteins involved in those physiological activities provide useful targets for further investigation. Assigned functions also provide comprehensive information for the Arabidopsis mitochondrial proteome. PMID:21297957
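    The idea of integrating several weak lines of evidence with a naive Bayes model, and then reporting accuracy and false positive rate, can be sketched as follows. This is a minimal Python illustration under stated assumptions: the three binary features, the eight toy proteins and all function names are hypothetical, and the code is not the authors' framework or data.

```python
import math


def train_naive_bayes(samples, labels, pseudocount=1.0):
    """Estimate P(feature = 1 | class) for binary features, with smoothing."""
    n_features = len(samples[0])
    model = {}
    for cls in (0, 1):
        rows = [s for s, y in zip(samples, labels) if y == cls]
        probs = [
            (sum(r[j] for r in rows) + pseudocount)
            / (len(rows) + 2 * pseudocount)
            for j in range(n_features)
        ]
        model[cls] = {"prior": len(rows) / len(samples), "probs": probs}
    return model


def log_odds(model, features):
    """Log posterior odds of class 1 (mitochondrial) versus class 0."""
    score = math.log(model[1]["prior"] / model[0]["prior"])
    for j, x in enumerate(features):
        p1 = model[1]["probs"][j]
        p0 = model[0]["probs"][j]
        # Each feature contributes independently (the "naive" assumption).
        score += math.log((p1 if x else 1 - p1) / (p0 if x else 1 - p0))
    return score


def accuracy_and_fpr(model, samples, labels):
    """Accuracy = (TP + TN) / N; FPR = FP / (FP + TN)."""
    tp = fp = tn = fn = 0
    for features, y in zip(samples, labels):
        predicted = 1 if log_odds(model, features) > 0 else 0
        if predicted and y:
            tp += 1
        elif predicted and not y:
            fp += 1
        elif not predicted and not y:
            tn += 1
        else:
            fn += 1
    return (tp + tn) / len(labels), fp / (fp + tn)


# Hypothetical features per protein:
# (predicted targeting signal, mitochondrial ortholog, co-expressed with known set)
toy_samples = [(1, 1, 1), (1, 1, 0), (1, 0, 1), (0, 1, 1),
               (0, 0, 0), (0, 0, 1), (0, 1, 0), (1, 0, 0)]
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]

model = train_naive_bayes(toy_samples, toy_labels)
accuracy, fpr = accuracy_and_fpr(model, toy_samples, toy_labels)
```

    Evaluating on the training set, as done here for brevity, overstates performance; a held-out set or cross-validation would be needed for figures comparable to those quoted in the abstract.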

  15. Genomic and Proteomic Profiling Reveals Reduced Mitochondrial Function and Disruption of the Neuromuscular Junction Driving Rat Sarcopenia

    PubMed Central

    Ibebunjo, Chikwendu; Chick, Joel M.; Kendall, Tracee; Eash, John K.; Li, Christine; Zhang, Yunyu; Vickers, Chad; Wu, Zhidan; Clarke, Brian A.; Shi, Jun; Cruz, Joseph; Fournier, Brigitte; Brachat, Sophie; Gutzwiller, Sabine; Ma, QiCheng; Markovits, Judit; Broome, Michelle; Steinkrauss, Michelle; Skuba, Elizabeth; Galarneau, Jean-Rene; Gygi, Steven P.

    2013-01-01

    Molecular mechanisms underlying sarcopenia, the age-related loss of skeletal muscle mass and function, remain unclear. To identify molecular changes that correlated best with sarcopenia and might contribute to its pathogenesis, we determined global gene expression profiles in muscles of rats aged 6, 12, 18, 21, 24, and 27 months. These rats exhibit sarcopenia beginning at 21 months. Correlation of gene expression with changes in muscle mass or age, together with functional annotation analysis, identified gene signatures of sarcopenia distinct from gene signatures of aging. Specifically, mitochondrial energy metabolism (e.g., tricarboxylic acid cycle and oxidative phosphorylation) pathway genes were the most downregulated and most significantly correlated with sarcopenia. Also perturbed were genes/pathways associated with neuromuscular junction patency (providing molecular evidence of sarcopenia-related functional denervation and neuromuscular junction remodeling), protein degradation, and inflammation. Proteomic analysis of samples at 6, 18, and 27 months confirmed the depletion of mitochondrial energy metabolism proteins and neuromuscular junction proteins. Together, these findings suggest that therapeutic approaches that simultaneously stimulate mitochondrogenesis and reduce muscle proteolysis and inflammation have potential for treating sarcopenia. PMID:23109432

  16. Purification and fractionation of membranes for proteomic analyses.

    PubMed

    Marmagne, Anne; Salvi, Daniel; Rolland, Norbert; Ephritikhine, Geneviève; Joyard, Jacques; Barbier-Brygoo, Hélène

    2006-01-01

    Proteomics is a very powerful approach to link the information contained in sequenced genomes, such as Arabidopsis, to the functional knowledge provided by studies of plant cell compartments. However, membrane proteomics remains a challenge. One way to bring into view the complex mixture of proteins present in a membrane is to develop proteomic analyses based on (1) the use of highly purified membrane fractions and (2) fractionation of membrane proteins to retrieve as many proteins as possible (from the most to the least hydrophobic ones). To illustrate such strategies, we chose two types of membranes, the plasma membrane and the chloroplast envelope membranes. Both types of membranes can be prepared to a reasonable degree of purity from different types of tissues: the plasma membrane from cultured cells and the chloroplast envelope membrane from whole plants. This article is restricted to the description of methods for the preparation of highly purified and characterized plant membrane fractions and the subsequent fractionation of these membrane proteins according to simple physicochemical criteria (i.e., chloroform/methanol extraction, alkaline or saline treatments) for further analyses using modern proteomic methodologies.

  17. An insight into cyanobacterial genomics--a perspective.

    PubMed

    Lakshmi, Palaniswamy Thanga Velan

    2007-05-20

    At the turn of the millennium, cyanobacteria deserve a review of their past, present and future. The advent of post-genomic research, which encompasses functional genomics, structural genomics, transcriptomics, pharmacogenomics, proteomics and metabolomics, allows a systematic, system-wide approach to the study of biological systems. By exploiting genomic and associated protein information through computational analyses, the fledgling information generated by biotechnological analyses could be extrapolated to fill the gaps in the scarce information on cyanobacteria. To that end, this paper highlights the available perspectives and encourages researchers to concentrate on the field of cyanobacterial informatics.

  18. Temporal changes in milk proteomes reveal developing milk functions.

    PubMed

    Gao, Xinliu; McMahon, Robert J; Woo, Jessica G; Davidson, Barbara S; Morrow, Ardythe L; Zhang, Qiang

    2012-07-06

    Human milk proteins provide essential nutrition for growth and development, and support a number of vital developmental processes in the neonate. A complete understanding of the possible functions of human milk proteins has been limited by incomplete knowledge of the human milk proteome. In this report, we have analyzed the proteomes of whey from human transitional and mature milk using ion-exchange and SDS-PAGE based protein fractionation methods. With a larger-than-normal sample loading approach, we were able to substantially extend the human milk proteome to 976 proteins. Among them, 152 proteins were found to change significantly between transitional milk and mature milk. We further found that immunoglobulins sIgA and IgM are more abundant in transitional milk, whereas IgG is more abundant in mature milk, suggesting a transformation in defense mechanism from newborns to young infants. Additionally, we report a more comprehensive view of a complement system and associated regulatory apparatus in human milk, demonstrating the presence and function of a system similar to that found in the circulation but dominated by the alternative pathway of complement activation. Proteins involved in various aspects of carbohydrate metabolism are also described, revealing either a transition in milk functionality to accommodate carbohydrate-rich secretions as lactation progresses, or a potentially novel way of looking at the metabolic state of the mammary tissue. Lastly, a number of extracellular matrix (ECM) proteins are found to be in higher abundance in transitional milk and may be relevant to the development of infants' gastrointestinal tract in early life. In contrast, the ECM protein fibronectin and several of the actin cytoskeleton proteins that it regulates are more abundant in mature milk, which may indicate the important functional role for milk in regulating reactive oxygen species.

  19. OncoBinder facilitates interpretation of proteomic interaction data by capturing coactivation pairs in cancer.

    PubMed

    Van Coillie, Samya; Liang, Lunxi; Zhang, Yao; Wang, Huanbin; Fang, Jing-Yuan; Xu, Jie

    2016-04-05

    High-throughput methods such as co-immunoprecipitation-mass spectrometry (coIP-MS) and yeast two-hybrid (Y2H) screening have suggested a broad range of unannotated protein-protein interactions (PPIs), and interpretation of these PPIs remains a challenging task. Advances in cancer genomics research allow for the inference of "coactivation pairs" in cancer, which may facilitate the identification of PPIs involved in cancer. Here we present OncoBinder as a tool for the assessment of proteomic interaction data based on the functional synergy of oncoproteins in cancer. This decision tree-based method combines gene mutation, copy number and mRNA expression information to infer the functional status of protein-coding genes. We applied OncoBinder to evaluate the potential binders of EGFR and ERK2 proteins based on the gastric cancer dataset of The Cancer Genome Atlas (TCGA). As a result, OncoBinder identified high confidence interactions (annotated by Kyoto Encyclopedia of Genes and Genomes (KEGG) or validated by low-throughput assays) more efficiently than a co-expression-based method. Taken together, our results suggest that evaluation of gene functional synergy in cancer may facilitate the interpretation of proteomic interaction data. The OncoBinder toolbox for Matlab is freely accessible online.

  20. Trends in genome dynamics among major orders of insects revealed through variations in protein families.

    PubMed

    Rappoport, Nadav; Linial, Michal

    2015-08-07

    Insects belong to a class that accounts for the majority of animals on earth. With over one million identified species, insects display a huge diversity and occupy extreme environments. At present, there are dozens of fully sequenced insect genomes that cover a range of habitats, social behavior and morphologies. In view of such diverse collection of genomes, revealing evolutionary trends and charting functional relationships of proteins remain challenging. We analyzed the relatedness of 17 complete proteomes representative of proteomes from insects including louse, bee, beetle, ants, flies and mosquitoes, as well as an out-group from the crustaceans. The analyzed proteomes mostly represented the orders of Hymenoptera and Diptera. The 287,405 protein sequences from the 18 proteomes were automatically clustered into 20,933 families, including 799 singletons. A comprehensive analysis based on statistical considerations identified the families that were significantly expanded or reduced in any of the studied organisms. Among all the tested species, ants are characterized by an exceptionally high rate of family gain and loss. By assigning annotations to hundreds of species-specific families, the functional diversity among species and between the major clades (Diptera and Hymenoptera) is revealed. We found that many species-specific families are associated with receptor signaling, stress-related functions and proteases. The highest variability among insects associates with the function of transposition and nucleic acids processes (collectively coined TNAP). Specifically, the wasp and ants have an order of magnitude more TNAP families and proteins relative to species that belong to Diptera (mosquitoes and flies). An unsupervised clustering methodology combined with a comparative functional analysis unveiled proteomic signatures in the major clades of winged insects. 
We propose that the expansion of TNAP families in Hymenoptera potentially contributes to the accelerated

  1. NCI-CPTAC DREAM Proteogenomics Challenge (Registration Now Open) | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    Proteogenomics, integration of proteomics, genomics, and transcriptomics, is an emerging approach that promises to advance basic, translational and clinical research.  By combining genomic and proteomic information, leading scientists are gaining new insights due to a more complete and unified understanding of complex biological processes.

  2. Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes

    PubMed Central

    Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen

    2016-01-01

    Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose functions indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7) not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. PMID:27215607

  3. Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes.

    PubMed

    Virant-Klun, Irma; Leicht, Stefan; Hughes, Christopher; Krijgsveld, Jeroen

    2016-08-01

    Oocytes undergo a range of complex processes via oogenesis, maturation, fertilization, and early embryonic development, eventually giving rise to a fully functioning organism. To understand proteome composition and diversity during maturation of human oocytes, here we have addressed crucial aspects of oocyte collection and proteome analysis, resulting in the first proteome and secretome maps of human oocytes. Starting from 100 oocytes collected via a novel serum-free hanging drop culture system, we identified 2,154 proteins, whose functions indicate that oocytes are largely resting cells with a proteome that is tailored for homeostasis, cellular attachment, and interaction with its environment via secretory factors. In addition, we have identified 158 oocyte-enriched proteins (such as ECAT1, PIWIL3, NLRP7) not observed in high-coverage proteomics studies of other human cell lines or tissues. Exploiting SP3, a novel technology for proteomic sample preparation using magnetic beads, we scaled down proteome analysis to single cells. Despite the low protein content of only ∼100 ng per cell, we consistently identified ∼450 proteins from individual oocytes. When comparing individual oocytes at the germinal vesicle (GV) and metaphase II (MII) stage, we found that the Tudor and KH domain-containing protein (TDRKH) is preferentially expressed in immature oocytes, while Wee2, PCNA, and DNMT1 were enriched in mature cells, collectively indicating that maintenance of genome integrity is crucial during oocyte maturation. This study demonstrates that an innovative proteomics workflow facilitates analysis of single human oocytes to investigate human oocyte biology and preimplantation development. The approach presented here paves the way for quantitative proteomics in other quantity-limited tissues and cell types. Data associated with this study are available via ProteomeXchange with identifier PXD004142. © 2016 by The American Society for Biochemistry and Molecular Biology

  4. Characterisation of the Manduca sexta sperm proteome: Genetic novelty underlying sperm composition in Lepidoptera.

    PubMed

    Whittington, Emma; Zhao, Qian; Borziak, Kirill; Walters, James R; Dorus, Steve

    2015-07-01

    The application of mass spectrometry based proteomics to sperm biology has greatly accelerated progress in understanding the molecular composition and function of spermatozoa. To date, these approaches have been largely restricted to model organisms, all of which produce a single sperm morph capable of oocyte fertilisation. Here we apply high-throughput mass spectrometry proteomic analysis to characterise sperm composition in Manduca sexta, the tobacco hornworm moth, which produces heteromorphic sperm, including one fertilisation competent (eupyrene) and one incompetent (apyrene) sperm type. This resulted in the high confidence identification of 896 proteins from a co-mixed sample of both sperm types, of which 167 are encoded by genes with strict one-to-one orthology in Drosophila melanogaster. Importantly, over half (55.1%) of these orthologous proteins have previously been identified in the D. melanogaster sperm proteome and exhibit significant conservation in quantitative protein abundance in sperm between the two species. Despite the complex nature of gene expression across spermatogenic stages, a significant correlation was also observed between sperm protein abundance and testis gene expression. Lepidopteran-specific sperm proteins (i.e., proteins with no homology to proteins in non-Lepidopteran taxa) were present in significantly greater abundance on average than those with homology outside the Lepidoptera. Given the disproportionate production of apyrene sperm (96% of all mature sperm in Manduca) relative to eupyrene sperm, these evolutionarily novel and highly abundant proteins are candidates for possessing apyrene-specific functions. Lastly, comparative genomic analyses of testis-expressed, ovary-expressed and sperm genes identified a concentration of novel sperm proteins shared amongst Lepidoptera of potential relevance to the evolutionary origin of heteromorphic spermatogenesis. As the first published Lepidopteran sperm proteome, this whole

  5. Mitochondrial proteome disruption in the diabetic heart through targeted epigenetic regulation at the mitochondrial heat shock protein 70 (mtHsp70) nuclear locus.

    PubMed

    Shepherd, Danielle L; Hathaway, Quincy A; Nichols, Cody E; Durr, Andrya J; Pinti, Mark V; Hughes, Kristen M; Kunovac, Amina; Stine, Seth M; Hollander, John M

    2018-06-01

    >99% of the mitochondrial proteome is nuclear-encoded. The mitochondrion relies on a coordinated multi-complex process for nuclear genome-encoded mitochondrial protein import. Mitochondrial heat shock protein 70 (mtHsp70) is a key component of this process and a central constituent of the protein import motor. Type 2 diabetes mellitus (T2DM) disrupts mitochondrial proteomic signature which is associated with decreased protein import efficiency. The goal of this study was to manipulate the mitochondrial protein import process through targeted restoration of mtHsp70, in an effort to restore proteomic signature and mitochondrial function in the T2DM heart. A novel line of cardiac-specific mtHsp70 transgenic mice on the db/db background were generated and cardiac mitochondrial subpopulations were isolated with proteomic evaluation and mitochondrial function assessed. MicroRNA and epigenetic regulation of the mtHsp70 gene during T2DM were also evaluated. MtHsp70 overexpression restored cardiac function and nuclear-encoded mitochondrial protein import, contributing to a beneficial impact on proteome signature and enhanced mitochondrial function during T2DM. Further, transcriptional repression at the mtHsp70 genomic locus through increased localization of H3K27me3 during T2DM insult was observed. Our results suggest that restoration of a key protein import constituent, mtHsp70, provides therapeutic benefit through attenuation of mitochondrial and contractile dysfunction in T2DM. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. Systematic Characterization of the Murine Mitochondrial Proteome Using Functionally Validated Cardiac Mitochondria

    PubMed Central

    Zhang, Jun; Li, Xiaohai; Mueller, Michael; Wang, Yueju; Zong, Chenggong; Deng, Ning; Vondriska, Thomas M.; Liem, David A.; Yang, Jeong-In; Korge, Paavo; Honda, Henry; Weiss, James N.; Apweiler, Rolf; Ping, Peipei

    2009-01-01

    Mitochondria play essential roles in cardiac pathophysiology and the murine model has been extensively used to investigate cardiovascular diseases. In the present study, we characterized murine cardiac mitochondria using an LC/MS/MS approach. We extracted and purified cardiac mitochondria; validated their functionality to ensure the final preparation contains necessary components to sustain their normal function; and subjected these validated organelles to LC/MS/MS-based protein identification. A total of 940 distinct proteins were identified from murine cardiac mitochondria, among which 480 proteins were not previously identified by major proteomic profiling studies. The 940 proteins consist of functional clusters known to support oxidative phosphorylation, metabolism and biogenesis. In addition, there are several other clusters (including proteolysis, protein folding, and reduction/oxidation signaling) which ostensibly represent previously under-appreciated tasks of cardiac mitochondria. Moreover, many identified proteins were found to occupy other subcellular locations, including cytoplasm, ER, and Golgi, in addition to their presence in the mitochondria. These results provide a comprehensive picture of the murine cardiac mitochondrial proteome and underscore tissue- and species-specification. Moreover, the use of functionally intact mitochondria ensures that the proteomic observations in this organelle are relevant to its normal biology and facilitates decoding the interplay between mitochondria and other organelles. PMID:18348319

  7. Current Progress in Tonoplast Proteomics Reveals Insights into the Function of the Large Central Vacuole

    PubMed Central

    Trentmann, Oliver; Haferkamp, Ilka

    2013-01-01

    Vacuoles of plants fulfill various biologically important functions, like turgor generation and maintenance, detoxification, solute sequestration, or protein storage. Different types of plant vacuoles (lytic versus protein storage) are characterized by different functional properties apparently caused by a different composition/abundance and regulation of transport proteins in the surrounding membrane, the tonoplast. Proteome analyses allow the identification of vacuolar proteins and provide an informative basis for assigning observed transport processes to specific carriers or channels. This review summarizes techniques required for vacuolar proteome analyses, e.g., isolation of the large central vacuole or tonoplast membrane purification. Moreover, an overview of diverse published vacuolar proteome studies is provided. It becomes evident that qualitative proteomes from different plant species represent just the tip of the iceberg. During the past few years, mass spectrometry has improved immensely in accuracy, sensitivity, and application. As a consequence, modern tonoplast proteome approaches are suited for detecting alterations in membrane protein abundance in response to changing environmental/physiological conditions and help to clarify the regulation of tonoplast transport processes. PMID:23459586

  8. LIFEdb: a database for functional genomics experiments integrating information from external sources, and serving as a sample tracking system

    PubMed Central

    Bannasch, Detlev; Mehrle, Alexander; Glatting, Karl-Heinz; Pepperkok, Rainer; Poustka, Annemarie; Wiemann, Stefan

    2004-01-01

    We have implemented LIFEdb (http://www.dkfz.de/LIFEdb) to link information regarding novel human full-length cDNAs generated and sequenced by the German cDNA Consortium with functional information on the encoded proteins produced in functional genomics and proteomics approaches. The database also serves as a sample-tracking system to manage the process from cDNA to experimental read-out and data interpretation. A web interface enables the scientific community to explore and visualize features of the annotated cDNAs and ORFs combined with experimental results, and thus helps to unravel new features of proteins with as yet unknown functions. PMID:14681468

  9. Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism

    PubMed Central

    Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

    2016-01-01

    Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB-degradation, yields 4-mercaptobutyric acid (4MB). 4MB could act as building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2-genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not detected in the proteome analyses. Nevertheless, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. As response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced

  10. Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism.

    PubMed

    Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

    2016-01-01

    Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB-degradation, yields 4-mercaptobutyric acid (4MB). 4MB could act as building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2-genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not detected in the proteome analyses. Nevertheless, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. As response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced

  11. Genomic and proteomic characterization of SuMu, a Mu-like bacteriophage infecting Haemophilus parasuis.

    PubMed

    Zehr, Emilie S; Tabatabai, Louisa B; Bayles, Darrell O

    2012-07-23

    Haemophilus parasuis, the causative agent of Glässer's disease, is prevalent in swine herds; clinical signs associated with this disease are meningitis, polyserositis, polyarthritis, and bacterial pneumonia. Six- to eight-week-old pigs in segregated early weaning herds are particularly susceptible to the disease. Insufficient colostral antibody at weaning or the mixing of pigs with heterologous virulent H. parasuis strains from other farm sources in the nursery or grower-finisher stage are considered to be factors for the outbreak of Glässer's disease. Previously, a Mu-like bacteriophage portal gene was detected in a virulent swine isolate of H. parasuis by nested polymerase chain reaction. Mu-like bacteriophages are related phylogenetically to enterobacteriophage Mu and are thought to carry virulence genes or to induce host expression of virulence genes. This study characterizes the Mu-like bacteriophage, named SuMu, isolated from a virulent H. parasuis isolate. Characterization was done by genomic comparison to enterobacteriophage Mu and proteomic identification of various homologs by mass spectrometry. This is the first report of isolation and characterization of this bacteriophage from the Myoviridae family, a double-stranded DNA bacteriophage with a contractile tail, from a virulent field isolate of H. parasuis. The genome size of bacteriophage SuMu was 37,151 bp. DNA sequencing revealed fifty-five open reading frames, including twenty-five homologs to Mu-like bacteriophage proteins: Nlp, phage transposase-C-terminal, COG2842, Gam-like protein, gp16, Mor, peptidoglycan recognition protein, gp29, gp30, gpG, gp32, gp34, gp36, gp37, gpL, phage tail tube protein, DNA circulation protein, gpP, gp45, gp46, gp47, COG3778, tail fiber protein gp37-C terminal, tail fiber assembly protein, and Com. The last open reading frame was homologous to IS1414. The G + C content of bacteriophage SuMu was 41.87% while its H. parasuis host genome's G + C content was

  12. Proteomic and genomic characterization of a yeast model for Ogden syndrome

    PubMed Central

    Dörfel, Max J.; Fang, Han; Crain, Jonathan; Klingener, Michael; Weiser, Jake

    2016-01-01

    Naa10 is an Nα‐terminal acetyltransferase that, in a complex with its auxiliary subunit Naa15, co‐translationally acetylates the α‐amino group of newly synthesized proteins as they emerge from the ribosome. Roughly 40–50% of the human proteome is acetylated by Naa10, giving this enzyme one of the broadest substrate ranges known. Recently, we reported an X‐linked disorder of infancy, Ogden syndrome, in two families harbouring a c.109 T > C (p.Ser37Pro) variant in NAA10. In the present study we performed in‐depth characterization of a yeast model of Ogden syndrome. Stress tests and proteomic analyses suggest that the S37P mutation disrupts Naa10 function and reduces cellular fitness during heat shock, possibly owing to dysregulation of chaperone expression and accumulation. Microarray and RNA‐seq revealed a pseudo‐diploid gene expression profile in ΔNaa10 cells, probably responsible for a mating defect. In conclusion, the data presented here further support the disruptive nature of the S37P/Ogden mutation and identify affected cellular processes potentially contributing to the severe phenotype seen in Ogden syndrome. Data are available via GEO under identifier GSE86482 or with ProteomeXchange under identifier PXD004923. © 2016 The Authors. Yeast published by John Wiley & Sons, Ltd. PMID:27668839

  13. Functional Genomics of Lignocellulose Degradation in the Basidiomycete White Rot Schizophyllum commune

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ohm, Robin A.; Tegelaar, Martin; Henrissat, Bernard

    2013-03-01

    White and brown rot fungi are among the most important wood decayers in nature. Although more than 50 genomes of Basidiomycete white and brown rots have been sequenced by the Joint Genome Institute, there is still a lot to learn about how these fungi degrade the tough polymers present in wood. In particular, very little is known about how these fungi regulate the expression of genes involved in lignocellulose degradation. Here, we used transcriptomics, proteomics, and promoter analysis in an effort to gain insight into the process of lignocellulose degradation.

  14. Proteomics of industrial fungi: trends and insights for biotechnology.

    PubMed

    de Oliveira, José Miguel P Ferreira; de Graaff, Leo H

    2011-01-01

    Filamentous fungi are widely known for their industrial applications, namely, the production of food-processing enzymes and metabolites such as antibiotics and organic acids. In the past decade, the full genome sequencing of filamentous fungi increased the potential to predict encoded proteins enormously, namely, hydrolytic enzymes or proteins involved in the biosynthesis of metabolites of interest. The integration of genome sequence information with possible phenotypes requires, however, the knowledge of all the proteins in the cell in a system-wise manner, given by proteomics. This review summarises the progress of proteomics and its importance for the study of biotechnological processes in filamentous fungi. A major step forward in proteomics was to couple protein separation with high-resolution mass spectrometry, allowing accurate protein quantification. Despite the fact that most fungal proteomic studies have been focused on proteins from mycelial extracts, many proteins are related to processes which are compartmentalised in the fungal cell, e.g. β-lactam antibiotic production in the microbody. For the study of such processes, a targeted approach is required, e.g. by organelle proteomics. Typical workflows for sample preparation in fungal organelle proteomics are discussed, including homogenisation and sub-cellular fractionation. Finally, examples are presented of fungal organelle proteomic studies, which have enlarged the knowledge on areas of interest to biotechnology, such as protein secretion, energy production or antibiotic biosynthesis.

  15. Proteomics of industrial fungi: trends and insights for biotechnology

    PubMed Central

    de Oliveira, José Miguel P. Ferreira

    2010-01-01

    Filamentous fungi are widely known for their industrial applications, namely, the production of food-processing enzymes and metabolites such as antibiotics and organic acids. In the past decade, the full genome sequencing of filamentous fungi increased the potential to predict encoded proteins enormously, namely, hydrolytic enzymes or proteins involved in the biosynthesis of metabolites of interest. The integration of genome sequence information with possible phenotypes requires, however, the knowledge of all the proteins in the cell in a system-wise manner, given by proteomics. This review summarises the progress of proteomics and its importance for the study of biotechnological processes in filamentous fungi. A major step forward in proteomics was to couple protein separation with high-resolution mass spectrometry, allowing accurate protein quantification. Despite the fact that most fungal proteomic studies have been focused on proteins from mycelial extracts, many proteins are related to processes which are compartmentalised in the fungal cell, e.g. β-lactam antibiotic production in the microbody. For the study of such processes, a targeted approach is required, e.g. by organelle proteomics. Typical workflows for sample preparation in fungal organelle proteomics are discussed, including homogenisation and sub-cellular fractionation. Finally, examples are presented of fungal organelle proteomic studies, which have enlarged the knowledge on areas of interest to biotechnology, such as protein secretion, energy production or antibiotic biosynthesis. PMID:20922379

  16. Functional environmental proteomics: elucidating the role of a c-type cytochrome abundant during uranium bioremediation

    PubMed Central

    Yun, Jiae; Malvankar, Nikhil S; Ueki, Toshiyuki; Lovley, Derek R

    2016-01-01

    Studies with pure cultures of dissimilatory metal-reducing microorganisms have demonstrated that outer-surface c-type cytochromes are important electron transfer agents for the reduction of metals, but previous environmental proteomic studies have typically not recovered cytochrome sequences from subsurface environments in which metal reduction is important. Gel-separation, heme-staining and mass spectrometry of proteins in groundwater from in situ uranium bioremediation experiments identified a putative c-type cytochrome, designated Geobacter subsurface c-type cytochrome A (GscA), encoded within the genome of strain M18, a Geobacter isolate previously recovered from the site. Homologs of GscA were identified in the genomes of other Geobacter isolates in the phylogenetic cluster known as subsurface clade 1, which predominates in a diversity of Fe(III)-reducing subsurface environments. Most of the gscA sequences recovered from groundwater genomic DNA clustered in a tight phylogenetic group closely related to strain M18. GscA was most abundant in groundwater samples in which Geobacter sp. predominated. Expression of gscA in a strain of Geobacter sulfurreducens that lacked the gene for the c-type cytochrome OmcS, thought to facilitate electron transfer from conductive pili to Fe(III) oxide, restored the capacity for Fe(III) oxide reduction. Atomic force microscopy provided evidence that GscA was associated with the pili. These results demonstrate that a c-type cytochrome with an apparent function similar to that of OmcS is abundant when Geobacter sp. are abundant in the subsurface, providing insight into the mechanisms for the growth of subsurface Geobacter sp. on Fe(III) oxide and suggesting an approach for functional analysis of other Geobacter proteins found in the subsurface. PMID:26140532

  17. Functional environmental proteomics: elucidating the role of a c-type cytochrome abundant during uranium bioremediation.

    PubMed

    Yun, Jiae; Malvankar, Nikhil S; Ueki, Toshiyuki; Lovley, Derek R

    2016-02-01

    Studies with pure cultures of dissimilatory metal-reducing microorganisms have demonstrated that outer-surface c-type cytochromes are important electron transfer agents for the reduction of metals, but previous environmental proteomic studies have typically not recovered cytochrome sequences from subsurface environments in which metal reduction is important. Gel-separation, heme-staining and mass spectrometry of proteins in groundwater from in situ uranium bioremediation experiments identified a putative c-type cytochrome, designated Geobacter subsurface c-type cytochrome A (GscA), encoded within the genome of strain M18, a Geobacter isolate previously recovered from the site. Homologs of GscA were identified in the genomes of other Geobacter isolates in the phylogenetic cluster known as subsurface clade 1, which predominates in a diversity of Fe(III)-reducing subsurface environments. Most of the gscA sequences recovered from groundwater genomic DNA clustered in a tight phylogenetic group closely related to strain M18. GscA was most abundant in groundwater samples in which Geobacter sp. predominated. Expression of gscA in a strain of Geobacter sulfurreducens that lacked the gene for the c-type cytochrome OmcS, thought to facilitate electron transfer from conductive pili to Fe(III) oxide, restored the capacity for Fe(III) oxide reduction. Atomic force microscopy provided evidence that GscA was associated with the pili. These results demonstrate that a c-type cytochrome with an apparent function similar to that of OmcS is abundant when Geobacter sp. are abundant in the subsurface, providing insight into the mechanisms for the growth of subsurface Geobacter sp. on Fe(III) oxide and suggesting an approach for functional analysis of other Geobacter proteins found in the subsurface.

  18. Jatropha curcas, a biofuel crop: Functional genomics for understanding metabolic pathways and genetic improvement

    PubMed Central

    Maghuly, Fatemeh; Laimer, Margit

    2013-01-01

    Jatropha curcas is currently attracting much attention as an oilseed crop for biofuel, as Jatropha can grow under climate and soil conditions that are unsuitable for food production. However, little is known about Jatropha, and there are a number of challenges to be overcome. In fact, Jatropha has not really been domesticated; most of the Jatropha accessions are toxic, which renders the seedcake unsuitable for use as animal feed. The seeds of Jatropha contain high levels of polyunsaturated fatty acids, which negatively impact the biofuel quality. Fruiting of Jatropha is fairly continuous, thus increasing costs of harvesting. Therefore, before starting any improvement program using conventional or molecular breeding techniques, understanding gene function and the genome scale of Jatropha are prerequisites. This review presents currently available and relevant information on the latest technologies (genomics, transcriptomics, proteomics and metabolomics) to decipher important metabolic pathways within Jatropha, such as oil and toxin synthesis. Further, it discusses future directions for biotechnological approaches in Jatropha breeding and improvement. PMID:24092674

  19. Farm animal genomics and informatics: an update

    PubMed Central

    Fadiel, Ahmed; Anidi, Ifeanyi; Eichenbaum, Kenneth D.

    2005-01-01

    Farm animal genomics is of interest to a wide audience of researchers because of the utility derived from understanding how genomics and proteomics function in various organisms. Applications such as xenotransplantation, increased livestock productivity, and the bioengineering of new materials, products and even fabrics are several reasons for thriving farm animal genome activity. Currently mined in rapidly growing data warehouses, completed genomes of chicken, fish and cows are available but are largely stored in decentralized data repositories. In this paper, we provide an informatics primer on farm animal bioinformatics and genome project resources which draws attention to the most recent advances in the field. We hope to provide individuals in biotechnology and in the farming industry with information on resources and updates concerning farm animal genome projects. PMID:16275782

  20. Impact of a short-term exposure to spaceflight on the phenotype, genome, transcriptome and proteome of Escherichia coli

    NASA Astrophysics Data System (ADS)

    Li, Tianzhi; Chang, De; Xu, Huiwen; Chen, Jiapeng; Su, Longxiang; Guo, Yinghua; Chen, Zhenhong; Wang, Yajuan; Wang, Li; Wang, Junfeng; Fang, Xiangqun; Liu, Changting

    2015-07-01

    Escherichia coli (E. coli) is the most widely applied model organism in current biological science. As a widespread opportunistic pathogen, E. coli can survive not only in symbiosis with humans but also outside the host, which necessitates the evaluation of its response to the space environment. Therefore, to keep humans safe in space, it is necessary to understand how the bacteria respond to this environment. Despite extensive investigations over a few decades, the response of E. coli to the real space environment is still controversial. To better understand how E. coli overcomes harsh environments such as microgravity in space and to investigate whether these factors may induce pathogenic changes in E. coli that are potentially detrimental to astronauts, we conducted detailed genomic, transcriptomic and proteomic studies on E. coli that experienced 17 days of spaceflight. By comparing two flight strains LCT-EC52 and LCT-EC59 to a control strain LCT-EC106 that was cultured under the same temperature conditions on the ground, we identified metabolism changes, polymorphism changes, and differentially expressed genes and proteins in the two flight strains. The flight strains differed from the control in the utilization of more than 30 carbon sources. Two single nucleotide polymorphisms (SNPs) and one deletion were identified in the flight strains. The expression levels of more than 1000 genes were altered in the flight strains. Genes involved in chemotaxis, lipid metabolism and cell motility were differentially expressed. Moreover, the two flight strains also differed extensively from each other in terms of metabolism, transcriptome and proteome, indicating that the impact of the space environment on individual cells is heterogeneous and probably genotype-dependent. This study presents the first systematic profile of E. coli genome, transcriptome and proteome after spaceflight, which helps to elucidate the mechanism that controls the adaptation of microbes to the space

  1. Comparison of theoretical proteomes: identification of COGs with conserved and variable pI within the multimodal pI distribution.

    PubMed

    Nandi, Soumyadeep; Mehra, Nipun; Lynn, Andrew M; Bhattacharya, Alok

    2005-09-09

    Theoretical proteome analysis, generated by plotting theoretical isoelectric points (pI) against molecular masses of all proteins encoded by the genome, shows a multimodal distribution for pI. This multimodal distribution is an effect of allowed combinations of the charged amino acids, and not due to evolutionary causes. The variation in this distribution can be correlated to the organism's ecological niche. Contributions to this variation may be mapped to individual proteins by studying the variation in pI of orthologs across microorganism genomes. The distribution of ortholog pI values showed trimodal distributions for all prokaryotic genomes analyzed, similar to whole proteome plots. Pairwise analysis of pI variation shows that a few COGs are conserved within, but most vary between, the acidic and basic regions of the distribution, while molecular mass is more highly conserved. At the level of functional grouping of orthologs, five groups vary significantly from the population of orthologs, which is attributed to either conservation at the level of sequences or a bias for either positively or negatively charged residues contributing to the function. Individual COGs conserved in both the acidic and basic regions of the trimodal distribution are identified, and orthologs that best represent the variation in levels of the acidic and basic regions are listed. The analysis of pI distribution by using orthologs provides a basis for resolution of theoretical proteome comparison at the level of individual proteins. Orthologs identified that significantly vary between the major acidic and basic regions may be used as representative of the variation of the entire proteome.

  2. Comparison of theoretical proteomes: Identification of COGs with conserved and variable pI within the multimodal pI distribution

    PubMed Central

    Nandi, Soumyadeep; Mehra, Nipun; Lynn, Andrew M; Bhattacharya, Alok

    2005-01-01

    Background: Theoretical proteome analysis, generated by plotting theoretical isoelectric points (pI) against molecular masses of all proteins encoded by the genome, shows a multimodal distribution for pI. This multimodal distribution is an effect of allowed combinations of the charged amino acids, and not due to evolutionary causes. The variation in this distribution can be correlated to the organism's ecological niche. Contributions to this variation may be mapped to individual proteins by studying the variation in pI of orthologs across microorganism genomes. Results: The distribution of ortholog pI values showed trimodal distributions for all prokaryotic genomes analyzed, similar to whole proteome plots. Pairwise analysis of pI variation shows that a few COGs are conserved within, but most vary between, the acidic and basic regions of the distribution, while molecular mass is more highly conserved. At the level of functional grouping of orthologs, five groups vary significantly from the population of orthologs, which is attributed to either conservation at the level of sequences or a bias for either positively or negatively charged residues contributing to the function. Individual COGs conserved in both the acidic and basic regions of the trimodal distribution are identified, and orthologs that best represent the variation in levels of the acidic and basic regions are listed. Conclusion: The analysis of pI distribution by using orthologs provides a basis for resolution of theoretical proteome comparison at the level of individual proteins. Orthologs identified that significantly vary between the major acidic and basic regions may be used as representative of the variation of the entire proteome. PMID:16150155
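
    The theoretical pI discussed in this record is the pH at which a protein's predicted net charge is zero. The following is a minimal sketch of one way to estimate it, not the authors' method: it sums Henderson-Hasselbalch charge terms over the ionizable groups and finds the zero crossing by bisection. The pKa values and the function names `net_charge` and `theoretical_pi` are illustrative assumptions; published pKa scales differ slightly and would shift the result.

```python
# Assumed, illustrative pKa values; published scales differ slightly.
PKA_POS = {"K": 10.5, "R": 12.5, "H": 6.0}            # groups carrying +1 when protonated
PKA_NEG = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}   # groups carrying -1 when deprotonated
PKA_NTERM = 9.0   # free amino terminus (assumed)
PKA_CTERM = 2.0   # free carboxyl terminus (assumed)


def net_charge(seq, ph):
    """Predicted net charge of a one-letter protein sequence at a given pH."""
    positive = 1.0 / (1.0 + 10 ** (ph - PKA_NTERM))
    negative = 1.0 / (1.0 + 10 ** (PKA_CTERM - ph))
    for residue in seq:
        if residue in PKA_POS:
            positive += 1.0 / (1.0 + 10 ** (ph - PKA_POS[residue]))
        elif residue in PKA_NEG:
            negative += 1.0 / (1.0 + 10 ** (PKA_NEG[residue] - ph))
    return positive - negative


def theoretical_pi(seq, lo=0.0, hi=14.0, tol=1e-4):
    """Bisect on pH; net charge falls monotonically as pH rises."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if net_charge(seq, mid) > 0:
            lo = mid   # still positively charged, so pI lies at higher pH
        else:
            hi = mid
    return (lo + hi) / 2.0
```

    Applying `theoretical_pi` to every predicted protein of a genome and plotting the values against molecular mass would give the kind of theoretical proteome plot the abstract describes.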

  3. Global analysis of the rat and human platelet proteome – the molecular blueprint for illustrating multi-functional platelets and cross-species function evolution

    PubMed Central

    Yu, Yanbao; Leng, Taohua; Yun, Dong; Liu, Na; Yao, Jun; Dai, Ying; Yang, Pengyuan; Chen, Xian

    2013-01-01

    Emerging evidence indicates that blood platelets function in multiple biological processes including immune response, bone metastasis and liver regeneration in addition to their known roles in hemostasis and thrombosis. Global elucidation of the platelet proteome will provide the molecular base of these platelet functions. Here, we set up a high-throughput platform for maximum exploration of the rat/human platelet proteome using integrated proteomics technologies, and then applied it to identify the largest number of the proteins expressed in both rat and human platelets. After stringent statistical filtration, a total of 837 unique proteins matched with at least two unique peptides were precisely identified, making it the first comprehensive protein database so far for rat platelets. Meanwhile, quantitative analyses of the thrombin-stimulated platelets offered great insights into the biological functions of platelet proteins and therefore confirmed our global profiling data. A comparative proteomic analysis between rat and human platelets was also conducted, which revealed not only a significant similarity, but also an across-species evolutionary link that the orthologous proteins representing ‘core proteome’, and the ‘evolutionary proteome’ is actually a relatively static proteome. PMID:20443191

  4. Covering complete proteomes with X-ray structures: A current snapshot

    DOE PAGES

    Mizianty, Marcin J.; Fan, Xiao; Yan, Jing; ...

    2014-10-23

    Structural genomics programs have developed and applied structure-determination pipelines to a wide range of protein targets, facilitating the visualization of macromolecular interactions and the understanding of their molecular and biochemical functions. The fundamental question of whether three-dimensional structures of all proteins and all functional annotations can be determined using X-ray crystallography is investigated. A first-of-its-kind large-scale analysis of crystallization propensity for all proteins encoded in 1953 fully sequenced genomes was performed. It is shown that current X-ray crystallographic knowhow combined with homology modeling can provide structures for 25% of modeling families (protein clusters for which structural models can be obtained through homology modeling), with at least one structural model produced for each Gene Ontology functional annotation. The coverage varies between superkingdoms, with 19% for eukaryotes, 35% for bacteria and 49% for archaea, and with those of viruses following the coverage values of their hosts. It is shown that the crystallization propensities of proteomes from the taxonomic superkingdoms are distinct. The use of knowledge-based target selection is shown to substantially increase the ability to produce X-ray structures. It is demonstrated that the human proteome has one of the highest attainable coverage values among eukaryotes, and GPCR membrane proteins suitable for X-ray structure determination were determined.

  5. Genomes2Drugs: Identifies Target Proteins and Lead Drugs from Proteome Data

    PubMed Central

    Toomey, David; Hoppe, Heinrich C.; Brennan, Marian P.; Nolan, Kevin B.; Chubb, Anthony J.

    2009-01-01

    Background Genome sequencing and bioinformatics have provided the full hypothetical proteome of many pathogenic organisms. Advances in microarray and mass spectrometry have also yielded large output datasets of possible target proteins/genes. However, the challenge remains to identify new targets for drug discovery from this wealth of information. Further analysis includes bioinformatics and/or molecular biology tools to validate the findings. This is time consuming and expensive, and could fail to yield novel drugs if protein purification and crystallography is impossible. To pre-empt this, a researcher may want to rapidly filter the output datasets for proteins that show good homology to proteins that have already been structurally characterised or proteins that are already targets for known drugs. Critically, those researchers developing novel antibiotics need to select out the proteins that show close homology to any human proteins, as future inhibitors are likely to cross-react with the host protein, causing off-target toxicity effects later in clinical trials. Methodology/Principal Findings To solve many of these issues, we have developed a free online resource called Genomes2Drugs which ranks sequences to identify proteins that are (i) homologous to previously crystallized proteins or (ii) targets of known drugs, but are (iii) not homologous to human proteins. When tested using the Plasmodium falciparum malarial genome the program correctly enriched the ranked list of proteins with known drug target proteins. Conclusions/Significance Genomes2Drugs rapidly identifies proteins that are likely to succeed in drug discovery pipelines. This free online resource helps in the identification of potential drug targets. Importantly, the program further highlights proteins that are likely to be inhibited by FDA-approved drugs. These drugs can then be rapidly moved into Phase IV clinical studies under ‘change-of-application’ patents. PMID:19593435
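
    The three ranking criteria named in this record (homology to crystallized proteins, homology to known drug targets, no close homology to human proteins) can be illustrated with a small filter-and-sort sketch. This is not the Genomes2Drugs implementation, whose scoring the abstract does not describe. The `Hit` record, its normalized similarity scores in the range 0 to 1, the `human_cutoff` parameter and the use of the larger of the two positive scores as the sort key are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Hypothetical per-protein summary of best similarity scores (0 to 1)."""
    protein_id: str
    pdb_score: float          # best match to a crystallized protein
    drug_target_score: float  # best match to a known drug target
    human_score: float        # best match to any human protein


def rank_candidates(hits, human_cutoff=0.4):
    """Drop proteins that resemble human proteins, then rank the rest."""
    # Criterion (iii): exclude close human homologs (cutoff is an assumption).
    kept = [h for h in hits if h.human_score < human_cutoff]
    # Criteria (i) and (ii): favor strong structural or drug-target matches.
    return sorted(
        kept,
        key=lambda h: max(h.pdb_score, h.drug_target_score),
        reverse=True,
    )
```

    In practice the scores would come from a sequence-similarity search against structure, drug-target and human protein databases; those inputs are outside the scope of this sketch.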

  6. Development of proteome-wide binding reagents for research and diagnostics.

    PubMed

    Taussig, Michael J; Schmidt, Ronny; Cook, Elizabeth A; Stoevesandt, Oda

    2013-12-01

    Alongside MS, antibodies and other specific protein-binding molecules have a special place in proteomics as affinity reagents in a toolbox of applications for determining protein location, quantitative distribution and function (affinity proteomics). The realisation that the range of research antibodies available, while apparently vast is nevertheless still very incomplete and frequently of uncertain quality, has stimulated projects with an objective of raising comprehensive, proteome-wide sets of protein binders. With progress in automation and throughput, a remarkable number of recent publications refer to the practical possibility of selecting binders to every protein encoded in the genome. Here we review the requirements of a pipeline of production of protein binders for the human proteome, including target prioritisation, antigen design, 'next generation' methods, databases and the approaches taken by ongoing projects in Europe and the USA. While the task of generating affinity reagents for all human proteins is complex and demanding, the benefits of well-characterised and quality-controlled pan-proteome binder resources for biomedical research, industry and life sciences in general would be enormous and justify the effort. Given the technical, personnel and financial resources needed to fulfil this aim, expansion of current efforts may best be addressed through large-scale international collaboration. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. Schizophrenia proteomics: biomarkers on the path to laboratory medicine?

    PubMed Central

    Lakhan, Shaheen Emmanuel

    2006-01-01

    Over two million Americans are afflicted with schizophrenia, a debilitating mental health disorder with a unique symptomatic and epidemiological profile. Genomics studies have hinted towards candidate schizophrenia susceptibility chromosomal loci and genes. Modern proteomic tools, particularly mass spectrometry and expression scanning, aim to identify both pathogenic-revealing and diagnostically significant biomarkers. Only a few studies on basic proteomics have been conducted for psychiatric disorders relative to the plethora of cancer specific experiments. One such proteomic utility enables the discovery of proteins and biological marker fingerprinting profiling techniques (SELDI-TOF-MS), and then subjects them to tandem mass spectrometric fragmentation and de novo protein sequencing (MALDI-TOF/TOF-MS) for the accurate identification and characterization of the proteins. Such utilities can explain the pathogenesis of neuro-psychiatric disease, provide more objective testing methods, and further demonstrate a biological basis to mental illness. Although clinical proteomics in schizophrenia have yet to reveal a biomarker with diagnostic specificity, methods that better characterize the disorder using endophenotypes can advance findings. Schizophrenia biomarkers could potentially revolutionize its psychopharmacology, changing it into a more hypothesis and genomic/proteomic-driven science. PMID:16846510

  8. Quantitative Clinical Chemistry Proteomics (qCCP) using mass spectrometry: general characteristics and application.

    PubMed

    Lehmann, Sylvain; Hoofnagle, Andrew; Hochstrasser, Denis; Brede, Cato; Glueckmann, Matthias; Cocho, José A; Ceglarek, Uta; Lenz, Christof; Vialaret, Jérôme; Scherl, Alexander; Hirtz, Christophe

    2013-05-01

    Proteomics studies typically aim to exhaustively detect peptides/proteins in a given biological sample. Over the past decade, the number of publications using proteomics methodologies has exploded. This was made possible due to the availability of high-quality genomic data and many technological advances in the fields of microfluidics and mass spectrometry. Proteomics in biomedical research was initially used in 'functional' studies for the identification of proteins involved in pathophysiological processes, complexes and networks. Improved sensitivity of instrumentation facilitated the analysis of even more complex sample types, including human biological fluids. It is at that point the field of clinical proteomics was born, and its fundamental aim was the discovery and (ideally) validation of biomarkers for the diagnosis, prognosis, or therapeutic monitoring of disease. Eventually, it was recognized that the technologies used in clinical proteomics studies [particularly liquid chromatography-tandem mass spectrometry (LC-MS/MS)] could represent an alternative to classical immunochemical assays. Prior to deploying MS in the measurement of peptides/proteins in the clinical laboratory, it seems likely that traditional proteomics workflows and data management systems will need to adapt to the clinical environment and meet in vitro diagnostic (IVD) regulatory constraints. This defines a new field, as reviewed in this article, that we have termed quantitative Clinical Chemistry Proteomics (qCCP).

  9. A 2-D guinea pig lung proteome map

    USDA-ARS's Scientific Manuscript database

    Guinea pigs represent an important model for a number of infectious and non-infectious pulmonary diseases. The guinea pig genome has recently been sequenced to full coverage, opening up new research avenues using genomics, transcriptomics and proteomics techniques in this species. In order to furth...

  10. Proteome Exploration to Provide a Resource for the Investigation of Ganoderma lucidum

    PubMed Central

    Yu, Guo-Jun; Yin, Ya-Lin; Yu, Wen-Hui; Liu, Wei; Jin, Yan-Xia; Shrestha, Alok; Yang, Qing; Ye, Xiang-Dong; Sun, Hui

    2015-01-01

    Ganoderma lucidum is a basidiomycete white rot fungus that has been used for medicinal purposes worldwide. Although information concerning its genome and transcriptome has recently been reported, relatively little information is available for G. lucidum at the proteomic level. In this study, protein fractions from G. lucidum at three developmental stages (16-day mycelia, and fruiting bodies at 60 and 90 days) were prepared and subjected to LC-MS/MS analysis. A search against the G. lucidum genome database identified 803 proteins. Among these proteins, 61 lignocellulose degrading proteins were detected, most of which (49 proteins) were found in the 90-day fruiting bodies. Fourteen TCA-cycle related proteins, 17 peptidases, two argonaute-like proteins, and two immunomodulatory proteins were also detected. A majority (470) of the 803 proteins had GO annotations and were classified into 36 GO terms, with “binding”, “catalytic activity”, and “hydrolase activity” having high percentages. Additionally, 357 out of the 803 proteins were assigned to at least one COG functional category and grouped into 22 COG classifications. Based on the results from the proteomic and sequence alignment analyses, a potentially new immunomodulatory protein (GL18769) was expressed and shown to have high immunomodulatory activity. In this study, proteomic and biochemical analyses of G. lucidum were performed for the first time, revealing that proteins from this fungus can play significant bioactive roles and providing a new foundation for the further functional investigations that this fungus merits. PMID:25756518

  11. Large-scale label-free quantitative proteomics of the pea aphid-Buchnera symbiosis.

    PubMed

    Poliakov, Anton; Russell, Calum W; Ponnala, Lalit; Hoops, Harold J; Sun, Qi; Douglas, Angela E; van Wijk, Klaas J

    2011-06-01

    Many insects are nutritionally dependent on symbiotic microorganisms that have tiny genomes and are housed in specialized host cells called bacteriocytes. The obligate symbiosis between the pea aphid Acyrthosiphon pisum and the γ-proteobacterium Buchnera aphidicola (only 584 predicted proteins) is particularly amenable to molecular analysis because the genomes of both partners have been sequenced. To better define the symbiotic relationship between this aphid and Buchnera, we used large-scale, high accuracy tandem mass spectrometry (nanoLC-LTQ-Orbitrap) to identify aphid and Buchnera proteins in the whole aphid body, purified bacteriocytes, isolated Buchnera cells and the residual bacteriocyte fraction. More than 1900 aphid and 400 Buchnera proteins were identified. All enzymes in amino acid metabolism annotated in the Buchnera genome were detected, reflecting the high (68%) coverage of the proteome and supporting the core function of Buchnera in the aphid symbiosis. Transporters mediating the transport of predicted metabolites were present in the bacteriocyte. Label-free spectral counting combined with hierarchical clustering allowed us to define the quantitative distribution of a subset of these proteins across both symbiotic partners, yielding no evidence for the selective transfer of protein among the partners in either direction. This is the first quantitative proteome analysis of bacteriocyte symbiosis, providing a wealth of information about the molecular function of both the host cell and the bacterial symbiont.
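
    As a rough illustration of what label-free spectral counting measures, the sketch below normalizes per-protein spectral counts by protein length and by the sample total (the normalized spectral abundance factor, NSAF). NSAF is swapped in here as one common normalization; the abstract does not say which normalization the authors used, and the protein names, counts and lengths are invented for the example.

```python
def nsaf(spectral_counts, lengths):
    """Return the normalized spectral abundance factor for each protein.

    spectral_counts: dict of protein -> number of matched MS/MS spectra
    lengths: dict of protein -> sequence length in residues
    (Both inputs are hypothetical; NSAF is an assumed normalization.)
    """
    # Spectral abundance factor: counts corrected for protein length,
    # since longer proteins yield more peptides and therefore more spectra.
    saf = {p: spectral_counts[p] / lengths[p] for p in spectral_counts}
    total = sum(saf.values())
    if total == 0:
        raise ValueError("no spectra were assigned to any protein")
    # Dividing by the sample total makes values comparable across samples.
    return {p: value / total for p, value in saf.items()}


# Invented example data for a single sample.
counts = {"protA": 30, "protB": 10, "protC": 0}
lengths = {"protA": 300, "protB": 100, "protC": 250}
abundances = nsaf(counts, lengths)
```

    Vectors of such values, one per sample fraction, are the kind of input that hierarchical clustering could then group by similarity.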

  12. Recent developments in structural proteomics for protein structure determination.

    PubMed

    Liu, Hsuan-Liang; Hsu, Jyh-Ping

    2005-05-01

    The major challenges in structural proteomics include identifying all the proteins on the genome-wide scale, determining their structure-function relationships, and outlining the precise three-dimensional structures of the proteins. Protein structures are typically determined by experimental approaches such as X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. However, the knowledge of three-dimensional space by these techniques is still limited. Thus, computational methods such as comparative and de novo approaches and molecular dynamic simulations are intensively used as alternative tools to predict the three-dimensional structures and dynamic behavior of proteins. This review summarizes recent developments in structural proteomics for protein structure determination; including instrumental methods such as X-ray crystallography and NMR spectroscopy, and computational methods such as comparative and de novo structure prediction and molecular dynamics simulations.

  13. The restricted metabolism of the obligate organohalide respiring bacterium Dehalobacter restrictus: lessons from tiered functional genomics

    PubMed Central

    Rupakula, Aamani; Kruse, Thomas; Boeren, Sjef; Holliger, Christof; Smidt, Hauke; Maillard, Julien

    2013-01-01

    Dehalobacter restrictus strain PER-K23 is an obligate organohalide respiring bacterium, which displays extremely narrow metabolic capabilities. It grows only via coupling energy conservation to anaerobic respiration of tetra- and trichloroethene with hydrogen as sole electron donor. Dehalobacter restrictus represents the paradigmatic member of the genus Dehalobacter, which in recent years has turned out to be a major player in the bioremediation of an increasing number of organohalides, both in situ and in laboratory studies. The recent elucidation of the D. restrictus genome revealed a rather elaborate genome with predicted pathways that were not suspected from its restricted metabolism, such as a complete corrinoid biosynthetic pathway, the Wood–Ljungdahl (WL) pathway for CO2 fixation, abundant transcriptional regulators and several types of hydrogenases. However, one important feature of the genome is the presence of 25 reductive dehalogenase genes, from which so far only one, pceA, has been characterized on genetic and biochemical levels. This study describes a multi-level functional genomics approach on D. restrictus across three different growth phases. A global proteomic analysis allowed consideration of general metabolic pathways relevant to organohalide respiration, whereas the dedicated genomic and transcriptomic analysis focused on the diversity, composition and expression of genes associated with reductive dehalogenases. PMID:23479754

  14. Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.

    PubMed

    Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa

    2017-06-15

    Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Tomato functional genomics database (TFGD): a comprehensive collection and analysis package for tomato functional genomics

    USDA-ARS's Scientific Manuscript database

    Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...

  16. Deorphanizing the human transmembrane genome: A landscape of uncharacterized membrane proteins.

    PubMed

    Babcock, Joseph J; Li, Min

    2014-01-01

    The sequencing of the human genome has fueled the last decade of work to functionally characterize genome content. An important subset of genes encodes membrane proteins, which are the targets of many drugs. They reside in lipid bilayers, restricting their endogenous activity to a relatively specialized biochemical environment. Without a reference phenotype, the application of systematic screens to profile candidate membrane proteins is not immediately possible. Bioinformatics has begun to show its effectiveness in focusing the functional characterization of orphan proteins of a particular functional class, such as channels or receptors. Here we discuss integration of experimental and bioinformatics approaches for characterizing the orphan membrane proteome. By analyzing the human genome, a landscape reference for the human transmembrane genome is provided.

  17. The state of proteome profiling in the fungal genus Aspergillus.

    PubMed

    Kim, Yonghyun; Nandakumar, M P; Marten, Mark R

    2008-03-01

    Aspergilli are an important genus of filamentous fungi that contribute to a multibillion dollar industry. Since the genome sequencing of many fungi was recently completed, it would be advantageous to profile their proteomes to better understand the fungal cell factory. Here, we review proteomic data generated for the Aspergilli in recent years. Thus far, a combined total of 28 cell surface, 102 secreted and 139 intracellular proteins have been identified based on 10 different studies on Aspergillus proteomics. A summary proteome map highlighting identified proteins in major metabolic pathways is presented.

  18. Functional Genomics of Allergen Gene Families in Fruits

    PubMed Central

    Maghuly, Fatemeh; Marzban, Gorji; Laimer, Margit

    2009-01-01

    Fruit consumption is encouraged for health reasons; however, fruits may harbour a series of allergenic proteins that may cause discomfort or even represent serious threats to certain individuals. Thus, the identification and characterization of allergens in fruits requires novel approaches involving genomic and proteomic tools. Since avoidance of fruits also negatively affects the quality of patients’ lives, biotechnological interventions are ongoing to produce low allergenic fruits by down regulating specific genes. In this respect, the control of proteins associated with allergenicity could be achieved by fine tuning the spatial and temporal expression of the relevant genes. PMID:22253972

  19. Application of resequencing to rice genomics, functional genomics and evolutionary analysis

    PubMed Central

    2014-01-01

    Rice is a model system used for crop genomics studies. The completion of the rice genome draft sequences in 2002 not only accelerated functional genome studies, but also initiated a new era of resequencing rice genomes. Based on the reference genome in rice, next-generation sequencing (NGS) using the high-throughput sequencing system can efficiently accomplish whole genome resequencing of various genetic populations and diverse germplasm resources. Resequencing technology has been effectively utilized in evolutionary analysis, rice genomics and functional genomics studies. This technique is beneficial for both bridging the knowledge gap between genotype and phenotype and facilitating molecular breeding via gene design in rice. Here, we also discuss the limitation, application and future prospects of rice resequencing. PMID:25006357

  20. Biochemical and genetic analysis of the yeast proteome with a movable ORF collection

    PubMed Central

    Gelperin, Daniel M.; White, Michael A.; Wilkinson, Martha L.; Kon, Yoshiko; Kung, Li A.; Wise, Kevin J.; Lopez-Hoyo, Nelson; Jiang, Lixia; Piccirillo, Stacy; Yu, Haiyuan; Gerstein, Mark; Dumont, Mark E.; Phizicky, Eric M.; Snyder, Michael; Grayhack, Elizabeth J.

    2005-01-01

    Functional analysis of the proteome is an essential part of genomic research. To facilitate different proteomic approaches, a MORF (moveable ORF) library of 5854 yeast expression plasmids was constructed, each expressing a sequence-verified ORF as a C-terminal ORF fusion protein, under regulated control. Analysis of 5573 MORFs demonstrates that nearly all verified ORFs are expressed, suggests the authenticity of 48 ORFs characterized as dubious, and implicates specific processes including cytoskeletal organization and transcriptional control in growth inhibition caused by overexpression. Global analysis of glycosylated proteins identifies 109 new confirmed N-linked and 345 candidate glycoproteins, nearly doubling the known yeast glycome. PMID:16322557

  1. The Human Proteome Organization Chromosome 6 Consortium: integrating chromosome-centric and biology/disease driven strategies.

    PubMed

    Borchers, C H; Kast, J; Foster, L J; Siu, K W M; Overall, C M; Binkowski, T A; Hildebrand, W H; Scherer, A; Mansoor, M; Keown, P A

    2014-04-04

    The Human Proteome Project (HPP) is designed to generate a comprehensive map of the protein-based molecular architecture of the human body, to provide a resource to help elucidate biological and molecular function, and to advance diagnosis and treatment of diseases. Within this framework, the chromosome-based HPP (C-HPP) has allocated responsibility for mapping individual chromosomes by country or region, while the biology/disease HPP (B/D-HPP) coordinates these teams in cross-functional disease-based groups. Chromosome 6 (Ch6) provides an excellent model for integration of these two tasks. This metacentric chromosome has a complement of 1002-1034 genes that code for known, novel or putative proteins. Ch6 is functionally associated with more than 120 major human diseases, many with high population prevalence, devastating clinical impact and profound societal consequences. The unique combination of genomic, proteomic, metabolomic, phenomic and health services data being drawn together within the Ch6 program has enormous potential to advance personalized medicine by promoting robust biomarkers, subunit vaccines and new drug targets. The strong liaison between the clinical and laboratory teams, and the structured framework for technology transfer and health policy decisions within Canada will increase the speed and efficacy of this transition, and the value of this translational research. Canada has been selected to play a leading role in the international Human Proteome Project, the global counterpart of the Human Genome Project designed to understand the structure and function of the human proteome in health and disease. Canada will lead an international team focusing on chromosome 6, which is functionally associated with more than 120 major human diseases, including immune and inflammatory disorders affecting the brain, skeletal system, heart and blood vessels, lungs, kidney, liver, gastrointestinal tract and endocrine system. Many of these chronic and persistent

  2. Toward the Standardization of Mitochondrial Proteomics: The Italian Mitochondrial Human Proteome Project Initiative.

    PubMed

    Alberio, Tiziana; Pieroni, Luisa; Ronci, Maurizio; Banfi, Cristina; Bongarzone, Italia; Bottoni, Patrizia; Brioschi, Maura; Caterino, Marianna; Chinello, Clizia; Cormio, Antonella; Cozzolino, Flora; Cunsolo, Vincenzo; Fontana, Simona; Garavaglia, Barbara; Giusti, Laura; Greco, Viviana; Lucacchini, Antonio; Maffioli, Elisa; Magni, Fulvio; Monteleone, Francesca; Monti, Maria; Monti, Valentina; Musicco, Clara; Petrosillo, Giuseppe; Porcelli, Vito; Saletti, Rosaria; Scatena, Roberto; Soggiu, Alessio; Tedeschi, Gabriella; Zilocchi, Mara; Roncada, Paola; Urbani, Andrea; Fasano, Mauro

    2017-12-01

    The Mitochondrial Human Proteome Project aims at understanding the function of the mitochondrial proteome and its crosstalk with the proteome of other organelles. Being able to choose a suitable and validated enrichment protocol of functional mitochondria, based on the specific needs of the downstream proteomics analysis, would greatly help the researchers in the field. Mitochondrial fractions from ten model cell lines were prepared using three enrichment protocols and analyzed on seven different LC-MS/MS platforms. All data were processed using neXtProt as reference database. The data are available for the Human Proteome Project purposes through the ProteomeXchange Consortium with the identifier PXD007053. The processed data sets were analyzed using a suite of R routines to perform a statistical analysis and to retrieve subcellular and submitochondrial localizations. Although the overall number of identified total and mitochondrial proteins was not significantly dependent on the enrichment protocol, specific line to line differences were observed. Moreover, the protein lists were mapped to a network representing the functional mitochondrial proteome, encompassing mitochondrial proteins and their first interactors. More than 80% of the identified proteins resulted in nodes of this network but with a different ability in coisolating mitochondria-associated structures for each enrichment protocol/cell line pair.
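
    The mapping step described above, in which a protein list is matched against a network of mitochondrial proteins plus their first interactors, can be sketched roughly as follows. This is only an illustrative assumption about how such a coverage figure might be computed; the abstract says the authors used a suite of R routines whose details are not given, and every protein identifier and edge below is made up.

```python
def first_interactors(edges, seeds):
    """Return proteins directly linked to any seed protein (seeds excluded)."""
    neighbors = set()
    for a, b in edges:
        if a in seeds:
            neighbors.add(b)
        if b in seeds:
            neighbors.add(a)
    return neighbors - seeds


def network_coverage(identified, edges, seeds):
    """Fraction of identified proteins that are nodes of the seed-plus-neighbor network."""
    nodes = seeds | first_interactors(edges, seeds)
    return len(identified & nodes) / len(identified)


# Hypothetical data: M* are known mitochondrial proteins, X*/Y* are other proteins.
mito = {"M1", "M2"}
interactions = [("M1", "X1"), ("M2", "X2"), ("X2", "X3")]
identified = {"M1", "X1", "X2", "X3", "Y9"}
coverage = network_coverage(identified, interactions, mito)
```

    Here X3 is two steps from any mitochondrial protein and Y9 is not in the network at all, so neither counts toward the coverage.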

  3. Rare Disease Mechanisms Identified by Genealogical Proteomics of Copper Homeostasis Mutant Pedigrees.

    PubMed

    Zlatic, Stephanie A; Vrailas-Mortimer, Alysia; Gokhale, Avanti; Carey, Lucas J; Scott, Elizabeth; Burch, Reid; McCall, Morgan M; Rudin-Rush, Samantha; Davis, John Bowen; Hartwig, Cortnie; Werner, Erica; Li, Lian; Petris, Michael; Faundez, Victor

    2018-03-28

    Rare neurological diseases shed light onto universal neurobiological processes. However, molecular mechanisms connecting genetic defects to their disease phenotypes are elusive. Here, we obtain mechanistic information by comparing proteomes of cells from individuals with rare disorders with proteomes from their disease-free consanguineous relatives. We use triple-SILAC mass spectrometry to quantify proteomes from human pedigrees affected by mutations in ATP7A, which cause Menkes disease, a rare neurodegenerative and neurodevelopmental disorder stemming from systemic copper depletion. We identified 214 proteins whose expression was altered in ATP7A -/y fibroblasts. Bioinformatic analysis of ATP7A-mutant proteomes identified known phenotypes and processes affected in rare genetic diseases causing copper dyshomeostasis, including altered mitochondrial function. We found connections between copper dyshomeostasis and the UCHL1/PARK5 pathway of Parkinson disease, which we validated with mitochondrial respiration and Drosophila genetics assays. We propose that our genealogical "omics" strategy can be broadly applied to identify mechanisms linking a genomic locus to its phenotypes. Copyright © 2018 Elsevier Inc. All rights reserved.

  4. The Cancer Genome Atlas Clinical Explorer: a web and mobile interface for identifying clinical-genomic driver associations.

    PubMed

    Lee, HoJoon; Palm, Jennifer; Grimes, Susan M; Ji, Hanlee P

    2015-10-27

    The Cancer Genome Atlas (TCGA) project has generated genomic data sets covering over 20 malignancies. These data provide valuable insights into the underlying genetic and genomic basis of cancer. However, exploring the relationship among TCGA genomic results and clinical phenotype remains a challenge, particularly for individuals lacking formal bioinformatics training. Overcoming this hurdle is an important step toward the wider clinical translation of cancer genomic/proteomic data and implementation of precision cancer medicine. Several websites such as the cBio portal or University of California Santa Cruz genome browser make TCGA data accessible but lack interactive features for querying clinically relevant phenotypic associations with cancer drivers. To enable exploration of the clinical-genomic driver associations from TCGA data, we developed the Cancer Genome Atlas Clinical Explorer. The Cancer Genome Atlas Clinical Explorer interface provides a straightforward platform to query TCGA data using one of the following methods: (1) searching for clinically relevant genes, micro RNAs, and proteins by name, cancer types, or clinical parameters; (2) searching for genomic/proteomic profile changes by clinical parameters in a cancer type; or (3) testing two-hit hypotheses. SQL queries run in the background and results are displayed on our portal in an easy-to-navigate interface according to user's input. To derive these associations, we relied on elastic-net estimates of optimal multiple linear regularized regression and clinical parameters in the space of multiple genomic/proteomic features provided by TCGA data. Moreover, we identified and ranked gene/micro RNA/protein predictors of each clinical parameter for each cancer. The robustness of the results was estimated by bootstrapping. Overall, we identify associations of potential clinical relevance among genes/micro RNAs/proteins using our statistical analysis from 25 cancer types and 18 clinical parameters that

  5. Recent insights into plant-virus interactions through proteomic analysis.

    PubMed

    Di Carli, Mariasole; Benvenuto, Eugenio; Donini, Marcello

    2012-10-05

    Plant viruses represent a major threat for a wide range of host species causing severe losses in agricultural practices. The full comprehension of mechanisms underlying events of virus-host plant interaction is crucial to devise novel plant resistance strategies. Until now, functional genomics studies in plant-virus interaction have been limited mainly to transcriptomic analysis. Only recently are proteomic approaches starting to provide important contributions to this area of research. Classical two-dimensional electrophoresis (2-DE) coupled to mass spectrometry (MS) is still the most widely used platform in plant proteome analysis, although in recent years quantitative "second generation" proteomic techniques (such as differential in gel electrophoresis, DIGE, and gel-free protein separation methods) have been emerging as more powerful analytical approaches. Apparently simple, plant-virus interactions reveal a really complex pathophysiological context, in which resistance, defense and susceptibility, and direct virus-induced reactions interplay to trigger expression responses of hundreds of genes. Given that, this review is specifically focused on comparative proteome-based studies on pathogenesis of several viral genera, including some of the most important and widespread plant viruses of the genus Tobamovirus, Sobemovirus, Cucumovirus and Potyvirus. In all, this overview reveals a widespread repression of proteins associated with the photosynthetic apparatus, while energy metabolism/protein synthesis and turnover are typically up-regulated, indicating a major redirection of cell metabolism. Other common features include the modulation of metabolisms concerning sugars, cell wall, and reactive oxygen species as well as pathogenesis-related (PR) proteins. The fine-tuning between plant development and antiviral defense mechanisms determines new patterns of regulation of common metabolic pathways. By offering a 360-degree view of protein modulation

  6. Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world

    PubMed Central

    Wang, Minglei; Yafremava, Liudmila S.; Caetano-Anollés, Derek; Mittenthal, Jay E.; Caetano-Anollés, Gustavo

    2007-01-01

    The repertoire of protein architectures in proteomes is evolutionarily conserved and capable of preserving an accurate record of genomic history. Here we use a census of protein architecture in 185 genomes that have been fully sequenced to generate genome-based phylogenies that describe the evolution of the protein world at fold (F) and fold superfamily (FSF) levels. The patterns of representation of F and FSF architectures over evolutionary history suggest three epochs in the evolution of the protein world: (1) architectural diversification, where members of an architecturally rich ancestral community diversified their protein repertoire; (2) superkingdom specification, where superkingdoms Archaea, Bacteria, and Eukarya were specified; and (3) organismal diversification, where F and FSF specific to relatively small sets of organisms appeared as the result of diversification of organismal lineages. Functional annotation of FSF along these architectural chronologies revealed patterns of discovery of biological function. Most importantly, the analysis identified an early and extensive differential loss of architectures occurring primarily in Archaea that segregates the archaeal lineage from the ancient community of organisms and establishes the first organismal divide. Reconstruction of phylogenomic trees of proteomes reflects the timeline of architectural diversification in the emerging lineages. Thus, Archaea undertook a minimalist strategy using only a small subset of the full architectural repertoire and then crystallized into a diversified superkingdom late in evolution. Our analysis also suggests a communal ancestor to all life that was molecularly complex and adopted genomic strategies currently present in Eukarya. PMID:17908824

  7. The proteomic landscape of triple-negative breast cancer.

    PubMed

    Lawrence, Robert T; Perez, Elizabeth M; Hernández, Daniel; Miller, Chris P; Haas, Kelsey M; Irie, Hanna Y; Lee, Su-In; Blau, C Anthony; Villén, Judit

    2015-04-28

    Triple-negative breast cancer is a heterogeneous disease characterized by poor clinical outcomes and a shortage of targeted treatment options. To discover molecular features of triple-negative breast cancer, we performed quantitative proteomics analysis of twenty human-derived breast cell lines and four primary breast tumors to a depth of more than 12,000 distinct proteins. We used this data to identify breast cancer subtypes at the protein level and demonstrate the precise quantification of biomarkers, signaling proteins, and biological pathways by mass spectrometry. We integrated proteomics data with exome sequence resources to identify genomic aberrations that affect protein expression. We performed a high-throughput drug screen to identify protein markers of drug sensitivity and understand the mechanisms of drug resistance. The genome and proteome provide complementary information that, when combined, yield a powerful engine for therapeutic discovery. This resource is available to the cancer research community to catalyze further analysis and investigation. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  8. Proteomic Approaches and Identification of Novel Therapeutic Targets for Alcoholism

    PubMed Central

    Gorini, Giorgio; Adron Harris, R; Dayne Mayfield, R

    2014-01-01

    Recent studies have shown that gene regulation is far more complex than previously believed and does not completely explain changes at the protein level. Therefore, the direct study of the proteome, considerably different in both complexity and dynamicity to the genome/transcriptome, has provided unique insights to an increasing number of researchers. During the past decade, extraordinary advances in proteomic techniques have changed the way we can analyze the composition, regulation, and function of protein complexes and pathways underlying altered neurobiological conditions. When combined with complementary approaches, these advances provide the contextual information for decoding large data sets into meaningful biologically adaptive processes. Neuroproteomics offers potential breakthroughs in the field of alcohol research by leading to a deeper understanding of how alcohol globally affects protein structure, function, interactions, and networks. The wealth of information gained from these advances can help pinpoint relevant biomarkers for early diagnosis and improved prognosis of alcoholism and identify future pharmacological targets for the treatment of this addiction. PMID:23900301

  9. GENOMICS AND ENVIRONMENTAL RESEARCH

    EPA Science Inventory

    The impact of recently developed and emerging genomics technologies on environmental sciences has significant implications for human and ecological risk assessment issues. The linkage of data generated from genomics, transcriptomics, proteomics, metabolomics, and ecology can be ...

  10. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools in S. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  11. Mining for Microbial Gems: Integrating Proteomics in the Postgenomic Natural Product Discovery Pipeline.

    PubMed

    Du, Chao; van Wezel, Gilles P

    2018-04-30

    Natural products (NPs) are a major source of compounds for medical, agricultural, and biotechnological industries. Many of these compounds are of microbial origin, and, in particular, from Actinobacteria or filamentous fungi. To successfully identify novel compounds that correlate to a bioactivity of interest, or discover new enzymes with desired functions, systematic multiomics approaches have been developed over the years. Bioinformatics tools harness the rapidly expanding wealth of genome sequence information, revealing previously unsuspected biosynthetic diversity. Varying growth conditions or application of elicitors are applied to activate cryptic biosynthetic gene clusters, and metabolomics provide detailed insights into the NPs they specify. Combining these technologies with proteomics-based approaches to profile the biosynthetic enzymes provides scientists with insights into the full biosynthetic potential of microorganisms. The proteomics approaches include enrichment strategies such as employing activity-based probes designed by chemical biology, as well as unbiased (quantitative) proteomics methods. In this review, the opportunities and challenges in microbial NP research are discussed, and, in particular, the application of proteomics to link biosynthetic enzymes to the molecules they produce, and vice versa. © 2018 The Authors. Proteomics Published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Extracellular proteome analysis of Leptospira interrogans serovar Lai.

    PubMed

    Zeng, Lingbing; Zhang, Yunyi; Zhu, Yongzhang; Yin, Haidi; Zhuang, Xuran; Zhu, Weinan; Guo, Xiaokui; Qin, Jinhong

    2013-10-01

    Leptospirosis is one of the most important zoonoses. Leptospira interrogans serovar Lai is a pathogenic spirochete that is responsible for leptospirosis. Extracellular proteins play an important role in the pathogenicity of this bacterium. In this study, L. interrogans serovar Lai was grown in protein-free medium; the supernatant was collected and subsequently analyzed as the extracellular proteome. A total of 66 proteins with more than two unique peptides were detected by MS/MS, and 33 of these were predicted to be extracellular proteins by a combination of bioinformatics analyses, including PSORTb, CELLO, SoSuiGramN and SignalP. Comparisons of the transcriptional levels of these 33 genes between in vivo and in vitro conditions revealed that 15 genes were upregulated and two genes were downregulated in vivo compared to in vitro. A BLAST search for the components of secretion system at the genomic and proteomic levels revealed the presence of the complete type I secretion system and type II secretion system in this strain. Moreover, this strain also exhibits complete Sec translocase and Tat translocase systems. The extracellular proteome analysis of L. interrogans will supplement the previously generated whole proteome data and provide more information for studying the functions of specific proteins in the infection process and for selecting candidate molecules for vaccines or diagnostic tools for leptospirosis.

  13. MitProNet: A Knowledgebase and Analysis Platform of Proteome, Interactome and Diseases for Mammalian Mitochondria

    PubMed Central

    Mao, Song; Chai, Xiaoqiang; Hu, Yuling; Hou, Xugang; Tang, Yiheng; Bi, Cheng; Li, Xiao

    2014-01-01

    Mitochondrion plays a central role in diverse biological processes in most eukaryotes, and its dysfunctions are critically involved in a large number of diseases and the aging process. A systematic identification of mitochondrial proteomes and characterization of functional linkages among mitochondrial proteins are fundamental in understanding the mechanisms underlying biological functions and human diseases associated with mitochondria. Here we present a database MitProNet which provides a comprehensive knowledgebase for mitochondrial proteome, interactome and human diseases. First an inventory of mammalian mitochondrial proteins was compiled by widely collecting proteomic datasets, and the proteins were classified by machine learning to achieve a high-confidence list of mitochondrial proteins. The current version of MitProNet covers 1124 high-confidence proteins, and the remainders were further classified as middle- or low-confidence. An organelle-specific network of functional linkages among mitochondrial proteins was then generated by integrating genomic features encoded by a wide range of datasets including genomic context, gene expression profiles, protein-protein interactions, functional similarity and metabolic pathways. The functional-linkage network should be a valuable resource for the study of biological functions of mitochondrial proteins and human mitochondrial diseases. Furthermore, we utilized the network to predict candidate genes for mitochondrial diseases using prioritization algorithms. All proteins, functional linkages and disease candidate genes in MitProNet were annotated according to the information collected from their original sources including GO, GEO, OMIM, KEGG, MIPS, HPRD and so on. MitProNet features a user-friendly graphic visualization interface to present functional analysis of linkage networks. As an up-to-date database and analysis platform, MitProNet should be particularly helpful in comprehensive studies of complicated

  14. proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes.

    PubMed

    Mende, Daniel R; Letunic, Ivica; Huerta-Cepas, Jaime; Li, Simone S; Forslund, Kristoffer; Sunagawa, Shinichi; Bork, Peer

    2017-01-04

    The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Genomic and proteomic analyses of Mycobacterium bovis BCG Mexico 1931 reveal a diverse immunogenic repertoire against tuberculosis infection

    PubMed Central

    2011-01-01

    Background: Studies of Mycobacterium bovis BCG strains used in different countries and vaccination programs show clear variations in the genomes and immune protective properties of BCG strains. The aim of this study was to characterise the genomic and immune proteomic profile of the BCG 1931 strain used in Mexico. Results: BCG Mexico 1931 has a circular chromosome of 4,350,386 bp with a G+C content and numbers of genes and pseudogenes similar to those of BCG Tokyo and BCG Pasteur. BCG Mexico 1931 lacks Region of Difference 1 (RD1), RD2 and N-RD18 and one copy of IS6110, indicating that BCG Mexico 1931 belongs to DU2 group IV within the BCG vaccine genealogy. In addition, this strain contains three new RDs, which are 53 (RDMex01), 655 (RDMex02) and 2,847 bp (RDMex03) long, and 55 single-nucleotide polymorphisms representing non-synonymous mutations compared to BCG Pasteur and BCG Tokyo. In a comparative proteomic analysis, the BCG Mexico 1931, Danish, Phipps and Tokyo strains showed 812, 794, 791 and 701 protein spots, respectively. The same analysis showed that BCG Mexico 1931 shares 62% of its protein spots with the BCG Danish strain, 61% with the BCG Phipps strain and only 48% with the BCG Tokyo strain. Thirty-nine reactive spots were detected in BCG Mexico 1931 using sera from subjects with active tuberculosis infections and positive tuberculin skin tests. Conclusions: BCG Mexico 1931 has a smaller genome than the BCG Pasteur and BCG Tokyo strains. Two specific deletions in BCG Mexico 1931 are described (RDMex02 and RDMex03). The loss of RDMex02 (fadD23) is associated with enhanced macrophage binding and RDMex03 contains genes that may be involved in regulatory pathways. We also describe new antigenic proteins for the first time. PMID:21981907

  16. Proteome of Caulobacter crescentus cell cycle publicly accessible on SWICZ server.

    PubMed

    Vohradsky, Jiri; Janda, Ivan; Grünenfelder, Björn; Berndt, Peter; Röder, Daniel; Langen, Hanno; Weiser, Jaroslav; Jenal, Urs

    2003-10-01

    Here we present the Swiss-Czech Proteomics Server (SWICZ), which hosts the proteomic database summarizing information about the cell cycle of the aquatic bacterium Caulobacter crescentus. The database provides a searchable tool for easy access of global protein synthesis and protein stability data as examined during the C. crescentus cell cycle. Protein synthesis data collected from five different cell cycle stages were determined for each protein spot as a relative value of the total amount of [(35)S]methionine incorporation. Protein stability of pulse-labeled extracts were measured during a chase period equivalent to one cell cycle unit. Quantitative information for individual proteins together with descriptive data such as protein identities, apparent molecular masses and isoelectric points, were combined with information on protein function, genomic context, and the cell cycle stage, and were then assembled in a relational database with a world wide web interface (http://proteom.biomed.cas.cz), which allows the database records to be searched and displays the recovered information. A total of 1250 protein spots were reproducibly detected on two-dimensional gel electropherograms, 295 of which were identified by mass spectroscopy. The database is accessible either through clickable two-dimensional gel electrophoretic maps or by means of a set of dedicated search engines. Basic characterization of the experimental procedures, data processing, and a comprehensive description of the web site are presented. In its current state, the SWICZ proteome database provides a platform for the incorporation of new data emerging from extended functional studies on the C. crescentus proteome.

  17. Genomic, proteomic, and biochemical analysis of the organohalide respiratory pathway in Desulfitobacterium dehalogenans.

    PubMed

    Kruse, Thomas; van de Pas, Bram A; Atteia, Ariane; Krab, Klaas; Hagen, Wilfred R; Goodwin, Lynne; Chain, Patrick; Boeren, Sjef; Maphosa, Farai; Schraa, Gosse; de Vos, Willem M; van der Oost, John; Smidt, Hauke; Stams, Alfons J M

    2015-03-01

    Desulfitobacterium dehalogenans is able to grow by organohalide respiration using 3-chloro-4-hydroxyphenyl acetate (Cl-OHPA) as an electron acceptor. We used a combination of genome sequencing, biochemical analysis of redox active components, and shotgun proteomics to study elements of the organohalide respiratory electron transport chain. The genome of Desulfitobacterium dehalogenans JW/IU-DC1(T) consists of a single circular chromosome of 4,321,753 bp with a GC content of 44.97%. The genome contains 4,252 genes, including six rRNA operons and six predicted reductive dehalogenases. One of the reductive dehalogenases, CprA, is encoded by a well-characterized cprTKZEBACD gene cluster. Redox active components were identified in concentrated suspensions of cells grown on formate and Cl-OHPA or formate and fumarate, using electron paramagnetic resonance (EPR), visible spectroscopy, and high-performance liquid chromatography (HPLC) analysis of membrane extracts. In cell suspensions, these components were reduced upon addition of formate and oxidized after addition of Cl-OHPA, indicating involvement in organohalide respiration. Genome analysis revealed genes that likely encode the identified components of the electron transport chain from formate to fumarate or Cl-OHPA. Data presented here suggest that the first part of the electron transport chain from formate to fumarate or Cl-OHPA is shared. Electrons are channeled from an outward-facing formate dehydrogenase via menaquinones to a fumarate reductase located at the cytoplasmic face of the membrane. When Cl-OHPA is the terminal electron acceptor, electrons are transferred from menaquinones to outward-facing CprA, via an as-yet-unidentified membrane complex, and potentially an extracellular flavoprotein acting as an electron shuttle between the quinol dehydrogenase membrane complex and CprA. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
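    The GC content reported above (44.97%) is the share of G and C bases among all called bases in the chromosome. A minimal sketch of that calculation follows; the function name and the toy sequence are hypothetical and are not taken from the study, and ambiguous bases such as N are assumed to be excluded from the denominator.

```python
def gc_content(sequence):
    """Return the percentage of G and C among unambiguous bases (A, C, G, T)."""
    # Assumption: ambiguous symbols (e.g. N) are ignored rather than counted.
    called = [base for base in sequence.upper() if base in "ACGT"]
    if not called:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in called if base in "GC")
    return 100.0 * gc / len(called)


# Hypothetical toy sequence, not real genome data.
toy_sequence = "ATGCGCNNAT"
print(f"GC content: {gc_content(toy_sequence):.2f}%")
```

    For a real genome the same function could be applied to the concatenated chromosome sequence read from a FASTA file.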

  18. Genomic, Proteomic, and Biochemical Analysis of the Organohalide Respiratory Pathway in Desulfitobacterium dehalogenans

    PubMed Central

    van de Pas, Bram A.; Atteia, Ariane; Krab, Klaas; Hagen, Wilfred R.; Goodwin, Lynne; Chain, Patrick; Boeren, Sjef; Maphosa, Farai; Schraa, Gosse; de Vos, Willem M.; van der Oost, John; Smidt, Hauke

    2014-01-01

    Desulfitobacterium dehalogenans is able to grow by organohalide respiration using 3-chloro-4-hydroxyphenyl acetate (Cl-OHPA) as an electron acceptor. We used a combination of genome sequencing, biochemical analysis of redox active components, and shotgun proteomics to study elements of the organohalide respiratory electron transport chain. The genome of Desulfitobacterium dehalogenans JW/IU-DC1T consists of a single circular chromosome of 4,321,753 bp with a GC content of 44.97%. The genome contains 4,252 genes, including six rRNA operons and six predicted reductive dehalogenases. One of the reductive dehalogenases, CprA, is encoded by a well-characterized cprTKZEBACD gene cluster. Redox active components were identified in concentrated suspensions of cells grown on formate and Cl-OHPA or formate and fumarate, using electron paramagnetic resonance (EPR), visible spectroscopy, and high-performance liquid chromatography (HPLC) analysis of membrane extracts. In cell suspensions, these components were reduced upon addition of formate and oxidized after addition of Cl-OHPA, indicating involvement in organohalide respiration. Genome analysis revealed genes that likely encode the identified components of the electron transport chain from formate to fumarate or Cl-OHPA. Data presented here suggest that the first part of the electron transport chain from formate to fumarate or Cl-OHPA is shared. Electrons are channeled from an outward-facing formate dehydrogenase via menaquinones to a fumarate reductase located at the cytoplasmic face of the membrane. When Cl-OHPA is the terminal electron acceptor, electrons are transferred from menaquinones to outward-facing CprA, via an as-yet-unidentified membrane complex, and potentially an extracellular flavoprotein acting as an electron shuttle between the quinol dehydrogenase membrane complex and CprA. PMID:25512312

  19. Genome, Proteome and Structure of a T7-Like Bacteriophage of the Kiwifruit Canker Phytopathogen Pseudomonas syringae pv. actinidiae.

    PubMed

    Frampton, Rebekah A; Acedo, Elena Lopez; Young, Vivienne L; Chen, Danni; Tong, Brian; Taylor, Corinda; Easingwood, Richard A; Pitman, Andrew R; Kleffmann, Torsten; Bostina, Mihnea; Fineran, Peter C

    2015-06-24

    Pseudomonas syringae pv. actinidiae is an economically significant pathogen responsible for severe bacterial canker of kiwifruit (Actinidia sp.). Bacteriophages infecting this phytopathogen have potential as biocontrol agents as part of an integrated approach to the management of bacterial canker, and for use as molecular tools to study this bacterium. A variety of bacteriophages were previously isolated that infect P. syringae pv. actinidiae, and their basic properties were characterized to provide a framework for formulation of these phages as biocontrol agents. Here, we have examined in more detail φPsa17, a phage with the capacity to infect a broad range of P. syringae pv. actinidiae strains and the only member of the Podoviridae in this collection. Particle morphology was visualized using cryo-electron microscopy, the genome was sequenced, and its structural proteins were analysed using shotgun proteomics. These studies demonstrated that φPsa17 has a 40,525 bp genome, is a member of the T7likevirus genus and is closely related to the pseudomonad phages φPSA2 and gh-1. Eleven structural proteins (one scaffolding) were detected by proteomics and φPsa17 has a capsid of approximately 60 nm in diameter. No genes indicative of a lysogenic lifecycle were identified, suggesting the phage is obligately lytic. These features indicate that φPsa17 may be suitable for formulation as a biocontrol agent of P. syringae pv. actinidiae.

  20. Proteomic analysis of isolated chlamydomonas centrioles reveals orthologs of ciliary-disease genes.

    PubMed

    Keller, Lani C; Romijn, Edwin P; Zamora, Ivan; Yates, John R; Marshall, Wallace F

    2005-06-21

    The centriole is one of the most enigmatic organelles in the cell. Centrioles are cylindrical, microtubule-based barrels found in the core of the centrosome. Centrioles also act as basal bodies during interphase to nucleate the assembly of cilia and flagella. There are currently only a handful of known centriole proteins. We used mass-spectrometry-based MudPIT (multidimensional protein identification technology) to identify the protein composition of basal bodies (centrioles) isolated from the green alga Chlamydomonas reinhardtii. This analysis detected the majority of known centriole proteins, including centrin, epsilon tubulin, and the cartwheel protein BLD10p. By combining proteomic data with information about gene expression and comparative genomics, we identified 45 cross-validated centriole candidate proteins in two classes. Members of the first class of proteins (BUG1-BUG27) are encoded by genes whose expression correlates with flagellar assembly and which therefore may play a role in ciliogenesis-related functions of basal bodies. Members of the second class (POC1-POC18) are implicated by comparative-genomics and -proteomics studies to be conserved components of the centriole. We confirmed centriolar localization for the human homologs of four candidate proteins. Three of the cross-validated centriole candidate proteins are encoded by orthologs of genes (OFD1, NPHP-4, and PACRG) implicated in mammalian ciliary function and disease, suggesting that oral-facial-digital syndrome and nephronophthisis may involve a dysfunction of centrioles and/or basal bodies. By analyzing isolated Chlamydomonas basal bodies, we have been able to obtain the first reported proteomic analysis of the centriole.

  1. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling.

    PubMed

    Puente-Marin, Sara; Nombela, Iván; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio; Ortega-Villaizan, María Del Mar

    2018-04-09

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome-sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation.

  2. Genomic, Proteomic, and Metabolite Characterization of Gemfibrozil-Degrading Organism Bacillus sp. GeD10.

    PubMed

    Kjeldal, Henrik; Zhou, Nicolette A; Wissenbach, Dirk K; von Bergen, Martin; Gough, Heidi L; Nielsen, Jeppe L

    2016-01-19

    Gemfibrozil is a widely used hypolipidemic and triglyceride-lowering drug. Excess of the drug is excreted and discharged into the environment primarily via wastewater treatment plant effluents. Bacillus sp. GeD10, a gemfibrozil-degrader, was previously isolated from activated sludge. It is the first identified bacterium capable of degrading gemfibrozil. Gemfibrozil degradation by Bacillus sp. GeD10 was here studied through genome sequencing, quantitative proteomics and metabolite analysis. From the bacterial proteome of Bacillus sp. GeD10, 1974 proteins were quantified, of which 284 proteins were found to be overabundant by more than 2-fold (FDR corrected p-value ≤0.032, fold change (log2) ≥ 1) in response to gemfibrozil exposure. Metabolomic analysis identified two hydroxylated intermediates as well as a glucuronidated hydroxyl-metabolite of gemfibrozil. Overall, gemfibrozil exposure in Bacillus sp. GeD10 increased the abundance of several enzymes potentially involved in gemfibrozil degradation and resulted in the production of several gemfibrozil metabolites. The potential catabolic pathway/modification included ring-hydroxylation preparing the substrate for subsequent ring cleavage by a meta-cleaving enzyme. The identified genes may allow for monitoring of potential gemfibrozil-degrading organisms in situ and increase the understanding of microbial processing of trace level contaminants. This study represents the first omics study on a gemfibrozil-degrading bacterium.
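    The overabundance criterion quoted above combines two thresholds: a log2 fold change of at least 1 (a 2-fold increase) and an FDR-corrected p-value of at most 0.032. The sketch below shows how such a filter might be applied; the function name, record layout and abundance values are hypothetical illustrations, not the authors' pipeline or data.

```python
import math


def overabundant(proteins, min_log2_fc=1.0, max_q=0.032):
    """Return names of proteins passing both the fold-change and FDR thresholds."""
    hits = []
    for name, control, treated, q_value in proteins:
        log2_fc = math.log2(treated / control)
        if log2_fc >= min_log2_fc and q_value <= max_q:
            hits.append(name)
    return hits


# Hypothetical records: (name, control abundance, treated abundance,
# FDR-corrected p-value). None of these values come from the study.
example = [
    ("protA", 100.0, 250.0, 0.010),  # 2.5-fold up, significant
    ("protB", 100.0, 150.0, 0.001),  # significant, but below 2-fold
    ("protC", 100.0, 400.0, 0.200),  # 4-fold up, not significant
    ("protD", 50.0, 100.0, 0.032),   # exactly at both thresholds
]
print(overabundant(example))
```

    Requiring both conditions excludes proteins with a large but statistically unsupported change (protC) as well as those with a significant but small change (protB).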

  3. Jatropha curcas, a biofuel crop: functional genomics for understanding metabolic pathways and genetic improvement.

    PubMed

    Maghuly, Fatemeh; Laimer, Margit

    2013-10-01

    Jatropha curcas is currently attracting much attention as an oilseed crop for biofuel, as Jatropha can grow under climate and soil conditions that are unsuitable for food production. However, little is known about Jatropha, and there are a number of challenges to be overcome. In fact, Jatropha has not really been domesticated; most of the Jatropha accessions are toxic, which renders the seedcake unsuitable for use as animal feed. The seeds of Jatropha contain high levels of polyunsaturated fatty acids, which negatively impact the biofuel quality. Fruiting of Jatropha is fairly continuous, thus increasing costs of harvesting. Therefore, before starting any improvement program using conventional or molecular breeding techniques, understanding gene function and the genome scale of Jatropha are prerequisites. This review presents currently available and relevant information on the latest technologies (genomics, transcriptomics, proteomics and metabolomics) to decipher important metabolic pathways within Jatropha, such as oil and toxin synthesis. Further, it discusses future directions for biotechnological approaches in Jatropha breeding and improvement. © 2013 The Authors. Biotechnology Journal published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Plant Comparative and Functional Genomics

    DOE PAGES

    Yang, Xiaohan; Leebens-Mack, Jim; Chen, Feng; ...

    2015-01-01

    Plants form the foundation for our global ecosystem and are essential for environmental and human health. With an increasing number of available plant genomes and tractable experimental systems, comparative and functional plant genomics research is greatly expanding our knowledge of the molecular basis of economically and nutritionally important traits in crop plants. Inferences drawn from comparative genomics are motivating experimental investigations of gene function and gene interactions. This special issue aims to highlight recent advances made in comparative and functional genomics research in plants. Nine original research articles in this special issue cover five important topics: (1) transcription factor gene families relevant to abiotic stress tolerance; (2) plant secondary metabolism; (3) transcriptome-based markers for quantitative trait locus; (4) epigenetic modifications in plant-microbe interactions; and (5) computational prediction of protein-protein interactions. Finally, the plant species studied in these articles include model species as well as nonmodel plant species of economic importance (e.g., food crops and medicinal plants).

  5. Plant Comparative and Functional Genomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Xiaohan; Leebens-Mack, Jim; Chen, Feng

    Plants form the foundation for our global ecosystem and are essential for environmental and human health. With an increasing number of available plant genomes and tractable experimental systems, comparative and functional plant genomics research is greatly expanding our knowledge of the molecular basis of economically and nutritionally important traits in crop plants. Inferences drawn from comparative genomics are motivating experimental investigations of gene function and gene interactions. This special issue aims to highlight recent advances made in comparative and functional genomics research in plants. Nine original research articles in this special issue cover five important topics: (1) transcription factor gene families relevant to abiotic stress tolerance; (2) plant secondary metabolism; (3) transcriptome-based markers for quantitative trait locus; (4) epigenetic modifications in plant-microbe interactions; and (5) computational prediction of protein-protein interactions. Finally, the plant species studied in these articles include model species as well as nonmodel plant species of economic importance (e.g., food crops and medicinal plants).

  6. Teaching Expression Proteomics: From the Wet-Lab to the Laptop

    ERIC Educational Resources Information Center

    Teixeira, Miguel C.; Santos, Pedro M.; Rodrigues, Catarina; Sa-Correia, Isabel

    2009-01-01

    Expression proteomics has become, in recent years, a key genome-wide expression approach in fundamental and applied life sciences. This postgenomic technology aims at the quantitative analysis of all the proteins or protein forms (the so-called proteome) of a given organism in a given environmental and genetic context. It is a challenge to provide…

  7. The Schistosoma mansoni phylome: using evolutionary genomics to gain insight into a parasite's biology.

    PubMed

    Silva, Larissa Lopes; Marcet-Houben, Marina; Nahum, Laila Alves; Zerlotini, Adhemar; Gabaldón, Toni; Oliveira, Guilherme

    2012-11-13

    Schistosoma mansoni is one of the causative agents of schistosomiasis, a neglected tropical disease that affects about 237 million people worldwide. Despite recent efforts, we still lack a general understanding of the relevant host-parasite interactions, and the possible treatments are limited by the emergence of resistant strains and the absence of a vaccine. The S. mansoni genome was completely sequenced and is still under continuous annotation. Nevertheless, more than 45% of the encoded proteins remain without experimental characterization or even functional prediction. To improve our knowledge regarding the biology of this parasite, we conducted a proteome-wide evolutionary analysis to provide a broad view of the evolution of the S. mansoni proteome and to improve its functional annotation. Using a phylogenomic approach, we reconstructed the S. mansoni phylome, which comprises the evolutionary histories of all parasite proteins and their homologs across 12 other organisms. The analysis of a total of 7,964 phylogenies allowed a deeper understanding of genomic complexity and evolutionary adaptations to a parasitic lifestyle. In particular, the identification of lineage-specific gene duplications pointed to the diversification of several protein families that are relevant for host-parasite interaction, including proteases, tetraspanins, fucosyltransferases, venom allergen-like proteins, and tegumental-allergen-like proteins. In addition to the evolutionary knowledge, the phylome data enabled us to automatically re-annotate 3,451 proteins through a phylogenetic-based approach rather than solely sequence similarity searches. To allow further exploitation of this valuable data, all information has been made available at PhylomeDB (http://www.phylomedb.org). In this study, we used an evolutionary approach to assess S. mansoni parasite biology, improve genome/proteome functional annotation, and provide insights into host-parasite interactions. Taking advantage of a proteome

  8. Common bean proteomics: Present status and future strategies.

    PubMed

    Zargar, Sajad Majeed; Mahajan, Reetika; Nazir, Muslima; Nagar, Preeti; Kim, Sun Tae; Rai, Vandna; Masi, Antonio; Ahmad, Syed Mudasir; Shah, Riaz Ahmad; Ganai, Nazir Ahmad; Agrawal, Ganesh K; Rakwal, Randeep

    2017-10-03

    Common bean (Phaseolus vulgaris L.) is a legume of appreciable importance and usefulness worldwide to the human population providing food and feed. It is rich in high-quality protein, energy, fiber and micronutrients especially iron, zinc, and pro-vitamin A; and possesses potentially disease-preventing and health-promoting compounds. The recently published genome sequence of common bean is an important landmark in common bean research, opening new avenues for understanding its genetics in depth. This legume crop is affected by diverse biotic and abiotic stresses severely limiting its productivity. Looking at the trend of increasing world population and the need for food crops best suited to the health of humankind, the legumes will be in great demand, including the common bean mostly for its nutritive values. Hence the need for new research in understanding the biology of this crop brings us to utilize and apply high-throughput omics approaches. In this mini-review our focus will be on the need for proteomics studies in common bean, potential of proteomics for understanding genetic regulation under abiotic and biotic stresses and how proteogenomics will lead to nutritional improvement. We will also discuss future proteomics-based strategies that must be adopted to mine new genomic resources by identifying molecular switches regulating various biological processes. Common bean is regarded as "grain of hope" for the poor, being rich in high-quality protein, energy, fiber and micronutrients (iron, zinc, pro-vitamin A); and possesses potentially disease-preventing and health-promoting compounds. Increasing world population and the need for food crops best suited to the health of humankind, puts legumes into great demand, which includes the common bean mostly. An important landmark in common bean research was the recent publication of its genome sequence, opening new avenues for understanding its genetics in depth. This legume crop is affected by diverse biotic and

  9. The genome, transcriptome, and proteome of the nematode Steinernema carpocapsae: evolutionary signatures of a pathogenic lifestyle

    PubMed Central

    Rougon-Cardoso, Alejandra; Flores-Ponce, Mitzi; Ramos-Aboites, Hilda Eréndira; Martínez-Guerrero, Christian Eduardo; Hao, You-Jin; Cunha, Luis; Rodríguez-Martínez, Jonathan Alejandro; Ovando-Vázquez, Cesaré; Bermúdez-Barrientos, José Roberto; Abreu-Goodger, Cei; Chavarría-Hernández, Norberto; Simões, Nelson; Montiel, Rafael

    2016-01-01

    The entomopathogenic nematode Steinernema carpocapsae has been widely used for the biological control of insect pests. It shares a symbiotic relationship with the bacterium Xenorhabdus nematophila, and is emerging as a genetic model to study symbiosis and pathogenesis. We obtained a high-quality draft of the nematode’s genome comprising 84,613,633 bp in 347 scaffolds, with an N50 of 1.24 Mb. To improve annotation, we sequenced both short and long RNA and conducted shotgun proteomic analyses. S. carpocapsae shares orthologous genes with other parasitic nematodes that are absent in the free-living nematode C. elegans, it has ncRNA families that are enriched in parasites, and expresses proteins putatively associated with parasitism and pathogenesis, suggesting an active role for the nematode during the pathogenic process. Host and parasites might engage in a co-evolutionary arms-race dynamic with genes participating in their interaction showing signatures of positive selection. Our analyses indicate that the consequence of this arms race is better characterized by positive selection altering specific functions instead of just increasing the number of positively selected genes, adding a new perspective to these co-evolutionary theories. We identified a protein, ATAD-3, that suggests a relevant role for mitochondrial function in the evolution and mechanisms of nematode parasitism. PMID:27876851
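    The N50 cited above (1.24 Mb) is a standard assembly contiguity statistic: the scaffold length at which scaffolds of that length or longer together cover at least half of the total assembly. A minimal sketch of the calculation follows; the function name and the scaffold lengths are hypothetical and do not come from the S. carpocapsae assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down, stopping once half the
    # assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no scaffold lengths supplied")


# Hypothetical scaffold lengths (arbitrary units), not real assembly data.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))
```

    In this toy set the total is 300, and the two longest scaffolds (80 + 70 = 150) reach half of it, so the N50 is 70.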

  10. Genomic and proteomic characterization of SuMu, a Mu-like bacteriophage infecting Haemophilus parasuis

    PubMed Central

    2012-01-01

    Background Haemophilus parasuis, the causative agent of Glässer’s disease, is prevalent in swine herds and clinical signs associated with this disease are meningitis, polyserositis, polyarthritis, and bacterial pneumonia. Six- to eight-week-old pigs in segregated early weaning herds are particularly susceptible to the disease. Insufficient colostral antibody at weaning or the mixing of pigs with heterologous virulent H. parasuis strains from other farm sources in the nursery or grower-finisher stage are considered to be factors for the outbreak of Glässer’s disease. Previously, a Mu-like bacteriophage portal gene was detected in a virulent swine isolate of H. parasuis by nested polymerase chain reaction. Mu-like bacteriophages are related phylogenetically to enterobacteriophage Mu and are thought to carry virulence genes or to induce host expression of virulence genes. This study characterizes the Mu-like bacteriophage, named SuMu, isolated from a virulent H. parasuis isolate. Results Characterization was done by genomic comparison to enterobacteriophage Mu and proteomic identification of various homologs by mass spectrometry. This is the first report of isolation and characterization of this bacteriophage from the Myoviridae family, a double-stranded DNA bacteriophage with a contractile tail, from a virulent field isolate of H. parasuis. The genome size of bacteriophage SuMu was 37,151 bp. DNA sequencing revealed fifty-five open reading frames, including twenty-five homologs to Mu-like bacteriophage proteins: Nlp, phage transposase-C-terminal, COG2842, Gam-like protein, gp16, Mor, peptidoglycan recognition protein, gp29, gp30, gpG, gp32, gp34, gp36, gp37, gpL, phage tail tube protein, DNA circulation protein, gpP, gp45, gp46, gp47, COG3778, tail fiber protein gp37-C terminal, tail fiber assembly protein, and Com. The last open reading frame was homologous to IS1414. The G + C content of bacteriophage SuMu was 41.87% while its H. parasuis host genome

  11. Proteomics reveals novel components of the Anopheles gambiae eggshell

    PubMed Central

    Amenya, Dolphine A.; Chou, Wayne; Li, Jianyong; Yan, Guiyun; Gershon, Paul D.; James, Anthony A.; Marinotti, Osvaldo

    2010-01-01

    While genome and transcriptome sequencing has revealed a large number and diversity of Anopheles gambiae predicted proteins, identifying their functions and biosynthetic pathways remains challenging. Mass spectrometry-based proteomics, in conjunction with mosquito genome and transcriptome databases, was used to identify 44 proteins as putative components of the eggshell. Among the identified molecules are two vitelline membrane proteins and a group of seven putative chorion proteins. Enzymes with peroxidase, laccase and phenoloxidase activities, likely involved in cross-linking reactions that stabilize the eggshell structure, were also identified. Seven odorant binding proteins were found in association with the mosquito eggshell, although their role has yet to be demonstrated. This analysis fills a considerable gap in knowledge about the proteins that build the eggshell of anopheline mosquitoes. PMID:20433845

  12. Sex-Specific Biology of the Human Malaria Parasite Revealed from the Proteomes of Mature Male and Female Gametocytes

    PubMed Central

    Miao, Jun; Chen, Zhao; Wang, Zenglei; Shrestha, Sony; Li, Xiaolian; Li, Runze; Cui, Liwang

    2017-01-01

    The gametocytes of the malaria parasites are obligate for perpetuating the parasite's life cycle through mosquitoes, but the sex-specific biology of gametocytes is poorly understood. We generated a transgenic line in the human malaria parasite Plasmodium falciparum, which allowed us to accurately separate male and female gametocytes by flow cytometry. In-depth analysis of the proteomes by liquid chromatography-tandem mass spectrometry identified 1244 and 1387 proteins in mature male and female gametocytes, respectively. GFP-tagging of nine selected proteins confirmed their sex-partitions to be agreeable with the results from the proteomic analysis. The sex-specific proteomes showed significant differences that are consistent with the divergent functions of the two sexes. Although the male-specific proteome (119 proteins) is enriched in proteins associated with the flagella and genome replication, the female-specific proteome (262 proteins) is more abundant in proteins involved in metabolism, translation and organellar functions. Compared with the Plasmodium berghei sex-specific proteomes, this study revealed both extensive conservation and considerable divergence between these two species, which reflect the disparities between the two species in proteins involved in cytoskeleton, lipid metabolism and protein degradation. Comparison with three sex-specific proteomes allowed us to obtain high-confidence lists of 73 and 89 core male- and female-specific/biased proteins conserved in Plasmodium. The identification of sex-specific/biased proteomes in Plasmodium lays a solid foundation for understanding the molecular mechanisms underlying the unique sex-specific biology in this early-branching eukaryote. PMID:28126901
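
    The counts in this abstract can be checked against one another with simple set arithmetic. The sketch below assumes, purely for illustration, that "sex-specific" means detected in one sex only; the study may instead use abundance-based criteria (the abstract also speaks of "specific/biased" proteins), so this is a consistency check and not the authors' method. The function name `partition` and the protein identifiers are hypothetical.

```python
def partition(male, female):
    """Split two protein ID collections into male-only, shared, and female-only sets."""
    male, female = set(male), set(female)
    return male - female, male & female, female - male


# Hypothetical identifiers, for illustration only.
male_ids = {"P1", "P2", "P3", "P4"}
female_ids = {"P3", "P4", "P5"}
male_only, shared, female_only = partition(male_ids, female_ids)
print(sorted(male_only), sorted(shared), sorted(female_only))

# Counts quoted in the abstract.
MALE_TOTAL, FEMALE_TOTAL = 1244, 1387
MALE_SPECIFIC, FEMALE_SPECIFIC = 119, 262

# Under the presence/absence assumption, each total minus its sex-specific
# count should give the same number of shared proteins.
shared_from_male = MALE_TOTAL - MALE_SPECIFIC
shared_from_female = FEMALE_TOTAL - FEMALE_SPECIFIC
print(shared_from_male, shared_from_female)
```

    Both subtractions give 1125, so the reported totals and sex-specific counts are at least mutually consistent with a presence/absence reading, although that agreement does not by itself establish how the study defined specificity.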

  13. Differential proteome analysis of diabetes mellitus type 2 and its pathophysiological complications.

    PubMed

    Sohail, Waleed; Majeed, Fatimah; Afroz, Amber

    2018-06-11

    The prevalence of Diabetes Mellitus Type 2 (DM 2) is increasing every year owing to global changes in lifestyle. The exact mechanisms underlying the progression of this disease are not yet known. However, recent advances in combined omics, particularly proteomics and genomics, have opened a gateway towards understanding the predetermined genetic factors, progression, complications and treatment of this disease. Here we review recent advances in proteomics that have led to earlier and better diagnostic approaches for controlling DM 2, most importantly the comparison of structural and functional protein biomarkers that are modified in the diseased state. By applying these advanced and promising proteomic strategies together with bioinformatics applications and biostatistical tools, the prevalence of DM 2 and its associated disorders, i.e., nephropathy and retinopathy, is expected to be controlled. Copyright © 2018 Diabetes India. Published by Elsevier Ltd. All rights reserved.

  14. Progress on the HUPO Draft Human Proteome: 2017 Metrics of the Human Proteome Project.

    PubMed

    Omenn, Gilbert S; Lane, Lydie; Lundberg, Emma K; Overall, Christopher M; Deutsch, Eric W

    2017-12-01

    The Human Proteome Organization (HUPO) Human Proteome Project (HPP) continues to make progress on its two overall goals: (1) completing the protein parts list, with an annual update of the HUPO draft human proteome, and (2) making proteomics an integrated complement to genomics and transcriptomics throughout biomedical and life sciences research. neXtProt version 2017-01-23 has 17 008 confident protein identifications (Protein Existence [PE] level 1) that are compliant with the HPP Guidelines v2.1 ( https://hupo.org/Guidelines ), up from 13 664 in 2012-12 and 16 518 in 2016-04. Remaining to be found by mass spectrometry and other methods are 2579 "missing proteins" (PE2+3+4), down from 2949 in 2016. PeptideAtlas 2017-01 has 15 173 canonical proteins, accounting for nearly all of the 15 290 PE1 proteins based on MS data. These resources have extensive data on PTMs, single amino acid variants, and splice isoforms. The Human Protein Atlas v16 has 10 492 highly curated protein entries with tissue and subcellular spatial localization of proteins and transcript expression. Organ-specific popular protein lists have been generated for broad use in quantitative targeted proteomics using SRM-MS or DIA-SWATH-MS studies of biology and disease.

  15. The UniProtKB guide to the human proteome

    PubMed Central

    Breuza, Lionel; Poux, Sylvain; Estreicher, Anne; Famiglietti, Maria Livia; Magrane, Michele; Tognolli, Michael; Bridge, Alan; Baratin, Delphine; Redaschi, Nicole

    2016-01-01

    Advances in high-throughput and advanced technologies allow researchers to routinely perform whole genome and proteome analysis. For this purpose, they need high-quality resources providing comprehensive gene and protein sets for their organisms of interest. Using the example of the human proteome, we will describe the content of a complete proteome in the UniProt Knowledgebase (UniProtKB). We will show how manual expert curation of UniProtKB/Swiss-Prot is complemented by expert-driven automatic annotation to build a comprehensive, high-quality and traceable resource. We will also illustrate how the complexity of the human proteome is captured and structured in UniProtKB. Database URL: www.uniprot.org PMID:26896845

  16. Anopheles gambiae genome reannotation through synthesis of ab initio and comparative gene prediction algorithms

    PubMed Central

    Li, Jun; Riehle, Michelle M; Zhang, Yan; Xu, Jiannong; Oduol, Frederick; Gomez, Shawn M; Eiglmeier, Karin; Ueberheide, Beatrix M; Shabanowitz, Jeffrey; Hunt, Donald F; Ribeiro, José MC; Vernick, Kenneth D

    2006-01-01

    Background Complete genome annotation is a necessary tool as Anopheles gambiae researchers probe the biology of this potent malaria vector. Results We reannotate the A. gambiae genome by synthesizing comparative and ab initio sets of predicted coding sequences (CDSs) into a single set using an exon-gene-union algorithm followed by an open-reading-frame-selection algorithm. The reannotation predicts 20,970 CDSs supported by at least two lines of evidence, and it lowers the proportion of CDSs lacking start and/or stop codons to only approximately 4%. The reannotated CDS set includes a set of 4,681 novel CDSs not represented in the Ensembl annotation but with EST support, and another set of 4,031 Ensembl-supported genes that undergo major structural and, therefore, probably functional changes in the reannotated set. The quality and accuracy of the reannotation was assessed by comparison with end sequences from 20,249 full-length cDNA clones, and evaluation of mass spectrometry peptide hit rates from an A. gambiae shotgun proteomic dataset confirms that the reannotated CDSs offer a high quality protein database for proteomics. We provide a functional proteomics annotation, ReAnoXcel, obtained by analysis of the new CDSs through the AnoXcel pipeline, which allows functional comparisons of the CDS sets within the same bioinformatic platform. CDS data are available for download. Conclusion Comprehensive A. gambiae genome reannotation is achieved through a combination of comparative and ab initio gene prediction algorithms. PMID:16569258

  17. HMMerThread: detecting remote, functional conserved domains in entire genomes by combining relaxed sequence-database searches with fold recognition.

    PubMed

    Bradshaw, Charles Richard; Surendranath, Vineeth; Henschel, Robert; Mueller, Matthias Stefan; Habermann, Bianca Hermine

    2011-03-10

    Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak

  18. HMMerThread: Detecting Remote, Functional Conserved Domains in Entire Genomes by Combining Relaxed Sequence-Database Searches with Fold Recognition

    PubMed Central

    Bradshaw, Charles Richard; Surendranath, Vineeth; Henschel, Robert; Mueller, Matthias Stefan; Habermann, Bianca Hermine

    2011-01-01

    Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak

  19. Application of an Improved Proteomics Method for Abundant Protein Cleanup: Molecular and Genomic Mechanisms Study in Plant Defense

    PubMed Central

    Zhang, Yixiang; Gao, Peng; Xing, Zhuo; Jin, Shumei; Chen, Zhide; Liu, Lantao; Constantino, Nasie; Wang, Xinwang; Shi, Weibing; Yuan, Joshua S.; Dai, Susie Y.

    2013-01-01

    High-abundance proteins like ribulose-1,5-bisphosphate carboxylase oxygenase (Rubisco) pose a consistent challenge for whole-proteome characterization using shotgun proteomics. To address this challenge, we developed and evaluated Polyethyleneimine Assisted Rubisco Cleanup (PARC) as a new method combining abundant protein removal and fractionation. The new approach was applied to a plant-insect interaction study to validate the platform and investigate mechanisms of plant defense against herbivorous insects. Our results indicated that PARC can effectively remove Rubisco, improve protein identification, and discover almost three times more differentially regulated proteins. The significantly enhanced shotgun proteomics performance was translated into in-depth proteomic and molecular mechanisms for plant-insect interaction, where carbon redistribution was found to play an essential role. Moreover, transcriptomic validation confirmed the reliability of the PARC analysis. Finally, functional studies were carried out for two differentially regulated genes revealed by PARC analysis. Insect resistance was induced by over-expressing either jacalin-like or cupin-like genes in rice. The results further highlighted that PARC can serve as an effective strategy for proteomics analysis and gene discovery. PMID:23943779

  20. The Present and Future of Biomarkers in Prostate Cancer: Proteomics, Genomics, and Immunology Advancements

    PubMed Central

    Gaudreau, Pierre-Olivier; Stagg, John; Soulières, Denis; Saad, Fred

    2016-01-01

    Prostate cancer (PC) is the second most common form of cancer in men worldwide. Biomarkers have emerged as essential tools for treatment and assessment since the variability of disease behavior, the cost and diversity of treatments, and the related impairment of quality of life have given rise to a need for a personalized approach. High-throughput technology platforms in proteomics and genomics have accelerated the development of biomarkers. Furthermore, recent successes of several new agents in PC, including immunotherapy, have stimulated the search for predictors of response and resistance and have improved the understanding of the biological mechanisms at work. This review provides an overview of currently established biomarkers in PC, as well as a selection of the most promising biomarkers within these particular fields of development. PMID:27168728

  1. A Resource of Quantitative Functional Annotation for Homo sapiens Genes.

    PubMed

    Taşan, Murat; Drabkin, Harold J; Beaver, John E; Chua, Hon Nian; Dunham, Julie; Tian, Weidong; Blake, Judith A; Roth, Frederick P

    2012-02-01

    The body of human genomic and proteomic evidence continues to grow at ever-increasing rates, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and make quantitative associations of each to 4333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, prospective validation, and by manual evaluation with the biological literature. Functional-linkage networks were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented, alongside existing validated annotations, in a publicly accessible and searchable web interface.

  2. By their genes ye shall know them: genomic signatures of predatory bacteria

    PubMed Central

    Pasternak, Zohar; Pietrokovski, Shmuel; Rotem, Or; Gophna, Uri; Lurie-Weinberger, Mor N; Jurkevitch, Edouard

    2013-01-01

    Predatory bacteria are taxonomically disparate, exhibit diverse predatory strategies and are widely distributed in varied environments. To date, their predatory phenotypes cannot be discerned in genome sequence data, thereby limiting our understanding of bacterial predation and of its impact in nature. Here, we define the ‘predatome’, that is, sets of protein families that reflect the phenotypes of predatory bacteria. The proteomes of all 11 sequenced predatory bacteria, including two de novo sequenced genomes, and 19 non-predatory bacteria from across the phylogenetic and ecological landscapes were compared. Protein families discriminating between the two groups were identified and quantified, demonstrating that differences in the proteomes of predatory and non-predatory bacteria are large and significant. This analysis allows predictions to be made, as we show by confirming from genome data an overlooked bacterial predator. The predatome exhibits deficiencies in riboflavin and amino acid biosynthesis, suggesting that predators obtain them from their prey. In contrast, these genomes are highly enriched in adhesins, proteases and particular metabolic proteins, used for binding to, processing and consuming prey, respectively. Strikingly, predators and non-predators differ in isoprenoid biosynthesis: predators use the mevalonate pathway, whereas non-predators, like almost all bacteria, use the DOXP pathway. By defining predatory signatures in bacterial genomes, the predatory potential they encode can be uncovered, filling an essential gap for measuring bacterial predation in nature. Moreover, we suggest that full-genome proteomic comparisons are applicable to other ecological interactions between microbes, and provide a convenient and rational tool for the functional classification of bacteria. PMID:23190728

  3. Functional genomics of corrinoid starvation in the organohalide-respiring bacterium Dehalobacter restrictus strain PER-K23

    PubMed Central

    Rupakula, Aamani; Lu, Yue; Kruse, Thomas; Boeren, Sjef; Holliger, Christof; Smidt, Hauke; Maillard, Julien

    2015-01-01

    De novo corrinoid biosynthesis represents one of the most complicated metabolic pathways in nature. Organohalide-respiring bacteria (OHRB) have developed different strategies to deal with their need of corrinoid, as it is an essential cofactor of reductive dehalogenases, the key enzymes in OHR metabolism. In contrast to Dehalococcoides mccartyi, the genome of Dehalobacter restrictus strain PER-K23 contains a complete set of corrinoid biosynthetic genes, of which cbiH appears to be truncated and therefore non-functional, possibly explaining the corrinoid auxotrophy of this obligate OHRB. Comparative genomics within Dehalobacter spp. revealed that one (operon-2) of the five distinct corrinoid biosynthesis associated operons present in the genome of D. restrictus appeared to be present only in that particular strain, which encodes multiple members of corrinoid transporters and salvaging enzymes. Operon-2 was highly up-regulated upon corrinoid starvation both at the transcriptional (346-fold) and proteomic level (46-fold on average), in line with the presence of an upstream cobalamin riboswitch. Together, these data highlight the importance of this operon in corrinoid homeostasis in D. restrictus and the augmented salvaging strategy this bacterium adopted to cope with the need for this essential cofactor. PMID:25610435

  4. Functional Proteomic Analysis of Human Nucleolus

    PubMed Central

    Scherl, Alexander; Couté, Yohann; Déon, Catherine; Callé, Aleth; Kindbeiter, Karine; Sanchez, Jean-Charles; Greco, Anna; Hochstrasser, Denis; Diaz, Jean-Jacques

    2002-01-01

    The notion of a “plurifunctional” nucleolus is now well established. However, molecular mechanisms underlying the biological processes occurring within this nuclear domain remain only partially understood. As a first step in elucidating these mechanisms we have carried out a proteomic analysis to draw up a list of proteins present within nucleoli of HeLa cells. This analysis allowed the identification of 213 different nucleolar proteins. This catalog complements that of the 271 proteins obtained recently by others, giving a total of ∼350 different nucleolar proteins. Functional classification of these proteins allowed outlining several biological processes taking place within nucleoli. Bioinformatic analyses permitted the assignment of hypothetical functions for 43 proteins for which no functional information is available. Notably, a role in ribosome biogenesis was proposed for 31 proteins. More generally, this functional classification reinforces the plurifunctional nature of nucleoli and provides convincing evidence that nucleoli may play a central role in the control of gene expression. Finally, this analysis supports the recent demonstration of a coupling of transcription and translation in higher eukaryotes. PMID:12429849

  5. Proteomic Assessment of Poultry Spermatozoa

    USDA-ARS's Scientific Manuscript database

    Fully characterizing the protein composition of spermatozoa is the first step in utilizing proteomics to delineate the function of sperm proteins. To date, sperm proteome maps have been partially developed for the human, mouse, rat, bull and several invertebrates. Here we report the first proteomic...

  6. Using the underlying biological organization of the Mycobacterium tuberculosis functional network for protein function prediction.

    PubMed

    Mazandu, Gaston K; Mulder, Nicola J

    2012-07-01

    Despite ever-increasing amounts of sequence and functional genomics data, there is still a deficiency of functional annotation for many newly sequenced proteins. For Mycobacterium tuberculosis (MTB), more than half of its genome is still uncharacterized, which hampers the search for new drug targets within the bacterial pathogen and limits our understanding of its pathogenicity. As for many other genomes, the annotations of proteins in the MTB proteome were generally inferred from sequence homology, which is effective but limited in its applicability. We have carried out large-scale biological data integration to produce an MTB protein functional interaction network. Protein functional relationships were extracted from the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database, and additional functional interactions from microarray, sequence and protein signature data. The confidence level of protein relationships in the additional functional interaction data was evaluated using a dynamic data-driven scoring system. This functional network has been used to predict functions of uncharacterized proteins using Gene Ontology (GO) terms, and the semantic similarity between these terms measured using a state-of-the-art GO similarity metric. To achieve a better trade-off between improvement of quality, genomic coverage and scalability, this prediction is done by observing the key principles driving the biological organization of the functional network. This study yields a new functionally characterized MTB strain CDC1551 proteome, consisting of 3804 and 3698 proteins out of 4195 with annotations in terms of the biological process and molecular function ontologies, respectively. These data can contribute to research into the development of effective anti-tubercular drugs with novel biological mechanisms of action. Copyright © 2011 Elsevier B.V. All rights reserved.

  7. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling

    PubMed Central

    Puente-Marin, Sara; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio

    2018-01-01

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome-sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation. PMID:29642539

  8. Proteomic and comparative genomic analysis of two Brassica napus lines differing in oil content.

    PubMed

    Gan, Lu; Zhang, Chun-yu; Wang, Xiao-dong; Wang, Hao; Long, Yan; Yin, Yong-tai; Li, Dian-rong; Tian, Jian-Hua; Li, Zai-yun; Lin, Zhi-wei; Yu, Long-Jiang; Li, Mao-Teng

    2013-11-01

    Ultrastructural observations, combined with proteomic and comparative genomic analyses, were applied to interpret the differences in protein composition and oil-body characteristics of mature seed of two Brassica napus lines with high and low oil contents of 55.19% and 36.49%, respectively. The results showed that oil bodies were arranged much closer together in the high than in the low oil content line, and differences in cell size and thickness of cell walls were also observed. There were 119 and 32 differentially expressed proteins (DEPs) identified among total and oil-body proteins, respectively. The 119 DEPs of total protein were mainly oil-related, dehydration-related, storage and defense/disease proteins, and some of these may be related to oil formation. Dehydration-related DEPs were detected in both total and oil-body proteins for the high and low oil lines and may be correlated with the number and size of oil bodies in the different lines. Some genes that corresponded to DEPs were confirmed by quantitative trait loci (QTL) mapping analysis for oil content. The results revealed that some candidate genes deduced from DEPs were located in the confidence intervals of QTL for oil content. Finally, the function of one gene encoding a storage protein was verified by using a collection of Arabidopsis lines that can conditionally express the full-length cDNA from developing seeds of B. napus.

  9. Proteomics: a new approach to the study of disease.

    PubMed

    Chambers, G; Lawrie, L; Cash, P; Murray, G I

    2000-11-01

    The global analysis of cellular proteins has recently been termed proteomics and is a key area of research that is developing in the post-genome era. Proteomics uses a combination of sophisticated techniques including two-dimensional (2D) gel electrophoresis, image analysis, mass spectrometry, amino acid sequencing, and bio-informatics to resolve comprehensively, to quantify, and to characterize proteins. The application of proteomics provides major opportunities to elucidate disease mechanisms and to identify new diagnostic markers and therapeutic targets. This review aims to explain briefly the background to proteomics and then to outline proteomic techniques. Applications to the study of human disease conditions ranging from cancer to infectious diseases are reviewed. Finally, possible future advances are briefly considered, especially those which may lead to faster sample throughput and increased sensitivity for the detection of individual proteins. Copyright 2000 John Wiley & Sons, Ltd.

  10. Evolution, language and analogy in functional genomics.

    PubMed

    Benner, S A; Gaucher, E A

    2001-07-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  11. Evolution, language and analogy in functional genomics

    NASA Technical Reports Server (NTRS)

    Benner, S. A.; Gaucher, E. A.

    2001-01-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  12. Proteome complexity and the forces that drive proteome imbalance.

    PubMed

    Harper, J Wade; Bennett, Eric J

    2016-09-15

    The cellular proteome is a complex microcosm of structural and regulatory networks that requires continuous surveillance and modification to meet the dynamic needs of the cell. It is therefore crucial that the protein flux of the cell remains in balance to ensure proper cell function. Genetic alterations that range from chromosome imbalance to oncogene activation can affect the speed, fidelity and capacity of protein biogenesis and degradation systems, which often results in proteome imbalance. An improved understanding of the causes and consequences of proteome imbalance is helping to reveal how these systems can be targeted to treat diseases such as cancer.

  13. Mass spectrometry-based proteomics: from cancer biology to protein biomarkers, drug targets, and clinical applications.

    PubMed

    Jimenez, Connie R; Verheul, Henk M W

    2014-01-01

    Proteomics is optimally suited to bridge the gap between genomic information on the one hand and biologic functions and disease phenotypes on the other, since it studies the expression and/or post-translational modification (especially phosphorylation) of proteins--the major cellular players bringing about cellular functions--at a global level in biologic specimens. Mass spectrometry technology and (bio)informatic tools have matured to the extent that they can provide high-throughput, comprehensive, and quantitative protein inventories of cells, tissues, and biofluids in clinical samples at low level. In this article, we focus on next-generation proteomics employing nanoliquid chromatography coupled to high-resolution tandem mass spectrometry for in-depth (phospho)protein profiling of tumor tissues and (proximal) biofluids, with a focus on studies employing clinical material. In addition, we highlight emerging proteogenomic approaches for the identification of tumor-specific protein variants, and targeted multiplex mass spectrometry strategies for large-scale biomarker validation. Below we provide a discussion of recent progress, some research highlights, and challenges that remain for clinical translation of proteomic discoveries.

  14. Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence-Function Space and Genome Context to Discover Novel Functions.

    PubMed

    Gerlt, John A

    2017-08-22

    The exponentially increasing number of protein and nucleic acid sequences provides opportunities to discover novel enzymes, metabolic pathways, and metabolites/natural products, thereby adding to our knowledge of biochemistry and biology. The challenge has evolved from generating sequence information to mining the databases to integrating and leveraging the available information, i.e., the availability of "genomic enzymology" web tools. Web tools that allow identification of biosynthetic gene clusters are widely used by the natural products/synthetic biology community, thereby facilitating the discovery of novel natural products and the enzymes responsible for their biosynthesis. However, many novel enzymes with interesting mechanisms participate in uncharacterized small-molecule metabolic pathways; their discovery and functional characterization also can be accomplished by leveraging information in protein and nucleic acid databases. This Perspective focuses on two genomic enzymology web tools that assist the discovery of novel metabolic pathways: (1) Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST) for generating sequence similarity networks to visualize and analyze sequence-function space in protein families and (2) Enzyme Function Initiative-Genome Neighborhood Tool (EFI-GNT) for generating genome neighborhood networks to visualize and analyze the genome context in microbial and fungal genomes. Both tools have been adapted to other applications to facilitate target selection for enzyme discovery and functional characterization. As the natural products community has demonstrated, the enzymology community needs to embrace the essential role of web tools that allow the protein and genome sequence databases to be leveraged for novel insights into enzymological problems.
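
    The general idea behind a sequence similarity network is that sequences are nodes and an edge joins any pair whose pairwise similarity meets a chosen threshold; clusters then appear as connected components. The sketch below illustrates that idea only and is not how EFI-EST is implemented. The k-mer Jaccard score is a toy stand-in for an alignment score, and the sequences and the 0.5 threshold are made-up assumptions.

```python
from itertools import combinations


def kmers(seq: str, k: int = 3) -> set[str]:
    """All overlapping substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of k-mer sets (a toy stand-in for an alignment score)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def build_network(seqs: dict[str, str], threshold: float) -> set[frozenset[str]]:
    """Edges join every pair of sequences whose similarity meets the threshold."""
    return {
        frozenset((x, y))
        for x, y in combinations(seqs, 2)
        if similarity(seqs[x], seqs[y]) >= threshold
    }


def clusters(nodes, edges) -> list[list[str]]:
    """Connected components of the network, found with a simple union-find."""
    parent = {n: n for n in nodes}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    for edge in edges:
        a, b = tuple(edge)
        parent[find(a)] = find(b)

    groups = {}
    for n in nodes:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())


# Made-up sequences, for illustration only.
toy_seqs = {
    "p1": "MKTAYIAKQR",
    "p2": "MKTAYIAKQK",
    "p3": "GGSWLPHHDE",
    "p4": "GGSWLPHHDQ",
}

if __name__ == "__main__":
    edges = build_network(toy_seqs, threshold=0.5)
    print(clusters(toy_seqs, edges))
```

    Raising the threshold splits clusters into smaller groups and lowering it merges them, which is the main knob a user of such a network would explore.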

  15. PROTEOMICS OF THE AMNIOTIC FLUID IN ASSESSMENT OF THE PLACENTA – RELEVANCE FOR PRETERM BIRTH

    PubMed Central

    Buhimschi, Irina A.; Buhimschi, Catalin S.

    2008-01-01

    Proteomics is the study of expressed proteins and has emerged as a complement to genomic research. The major advantage of proteomics over DNA-RNA based technologies is that it more closely relates to phenotype and not the source code. Proteomics thus holds the promise of providing direct insight into the true mechanisms of human disease. Historically, examination of the placenta was the first modality to subclassify pathogenetical entities responsible for preterm birth. Because placenta is a key pathophysiological participant in several major obstetrical syndromes (preterm birth, preeclampsia, intrauterine growth restriction) identification of relevant biomarkers of placental function can profoundly impact on the prediction of fetal outcome and treatment efficacy. Proteomics is a young science and studies that associate proteomic patterns with long-term outcome require follow-up of children up to school age. In the interim, placental pathological footprints of cellular injury can be useful as intermediate outcomes. Furthermore, knowledge of the identity of the dys-regulated proteins may provide the necessary insight into novel pathophysiological pathways and unravel possible targets for therapeutic intervention that could not have been envisioned through hypothesis-driven approaches. PMID:18191197

  16. Sex-Specific Biology of the Human Malaria Parasite Revealed from the Proteomes of Mature Male and Female Gametocytes.

    PubMed

    Miao, Jun; Chen, Zhao; Wang, Zenglei; Shrestha, Sony; Li, Xiaolian; Li, Runze; Cui, Liwang

    2017-04-01

    The gametocytes of the malaria parasites are obligate for perpetuating the parasite's life cycle through mosquitoes, but the sex-specific biology of gametocytes is poorly understood. We generated a transgenic line in the human malaria parasite Plasmodium falciparum, which allowed us to accurately separate male and female gametocytes by flow cytometry. In-depth analysis of the proteomes by liquid chromatography-tandem mass spectrometry identified 1244 and 1387 proteins in mature male and female gametocytes, respectively. GFP-tagging of nine selected proteins confirmed their sex-partitions to be agreeable with the results from the proteomic analysis. The sex-specific proteomes showed significant differences that are consistent with the divergent functions of the two sexes. Although the male-specific proteome (119 proteins) is enriched in proteins associated with the flagella and genome replication, the female-specific proteome (262 proteins) is more abundant in proteins involved in metabolism, translation and organellar functions. Compared with the Plasmodium berghei sex-specific proteomes, this study revealed both extensive conservation and considerable divergence between these two species, which reflect the disparities between the two species in proteins involved in cytoskeleton, lipid metabolism and protein degradation. Comparison with three sex-specific proteomes allowed us to obtain high-confidence lists of 73 and 89 core male- and female-specific/biased proteins conserved in Plasmodium. The identification of sex-specific/biased proteomes in Plasmodium lays a solid foundation for understanding the molecular mechanisms underlying the unique sex-specific biology in this early-branching eukaryote. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  17. Quantitative Proteomics of the Infectious and Replicative Forms of Chlamydia trachomatis

    PubMed Central

    Skipp, Paul J. S.; Hughes, Chris; McKenna, Thérèse; Edwards, Richard; Langridge, James; Thomson, Nicholas R.; Clarke, Ian N.

    2016-01-01

    The obligate intracellular developmental cycle of Chlamydia trachomatis presents significant challenges in defining its proteome. In this study we have applied quantitative proteomics to both the intracellular reticulate body (RB) and the extracellular elementary body (EB) from C. trachomatis. We used C. trachomatis L2 as a model chlamydial isolate for our study since it has a high infectivity:particle ratio and there is an excellent quality genome sequence. EBs and RBs (>99% pure) were quantified by chromosomal and plasmid copy number using PCR, from which the concentrations of chlamydial proteins per bacterial cell/genome were determined. RBs harvested at 15h post infection (PI) were purified by three successive rounds of gradient centrifugation. This is the earliest possible time to obtain purified RBs, free from host cell components in quantity, within the constraints of the technology. EBs were purified at 48h PI. We then used two-dimensional reverse phase UPLC to fractionate RB or EB peptides before mass spectroscopic analysis, providing absolute amount estimates of chlamydial proteins. The ability to express the data as molecules per cell gave ranking in both abundance and energy requirements for synthesis, allowing meaningful identification of rate-limiting components. The study assigned 562 proteins with high confidence and provided absolute estimates of protein concentration for 489 proteins. Interestingly, the data showed an increase in TTS capacity at 15h PI. Most of the enzymes involved in peptidoglycan biosynthesis were detected along with high levels of muramidase (in EBs) suggesting breakdown of peptidoglycan occurs in the non-dividing form of the microorganism. All the genome-encoded enzymes for glycolysis, pentose phosphate pathway and tricarboxylic acid cycle were identified and quantified; these data supported the observation that the EB is metabolically active. The availability of detailed, accurate quantitative proteomic data will be

  18. New Markers for Predicting Fertility of the Male Gametes in the Post Genomic Age.

    PubMed

    Dipresa, Savina; De Toni, Luca; Foresta, Carlo; Garolla, Andrea

    2018-04-18

    A number of tests have been proposed to assess male fertility potential, ranging from routine light-microscopic evaluation of semen samples to screening tests for DNA integrity that look for sperm chromatin abnormalities. Spermatozoa are extremely differentiated cells that have critical functions in embryo development and heredity, in addition to delivering a haploid paternal genome to the oocyte. Towards this goal, certain requirements must always be met. The ability of spermatozoa to perform their reproductive function is established during spermatogenesis, a highly specialized process that depends on multiple factors affecting male fertility. In the past 30 years, large-scale analyses of transcriptomic and genome expression in mammals have generated a large amount of information on the numerous biomolecules involved in spermatogenesis and male germ cell reproductive function. The sperm proteome represents the protein content that spermatozoa need to survive and work correctly, and modifications of the sperm proteome play a role in determining functional changes that lead to a decrease in the reproductive competence of affected spermatozoa. The post-genomic approach combines different methodologies: testicular transcriptome studies, protein compositional analysis, and metabolomic analysis of human spermatozoa. Copyright © Bentham Science Publishers.

  19. Mapping genomic features to functional traits through microbial whole genome sequences.

    PubMed

    Zhang, Wei; Zeng, Erliang; Liu, Dan; Jones, Stuart E; Emrich, Scott

    2014-01-01

    Recently, the utility of trait-based approaches for microbial communities has been identified. The increasing availability of whole genome sequences provides the opportunity to explore the genetic foundations of a variety of functional traits. We proposed a machine learning framework to quantitatively link genomic features with functional traits. Genes from bacterial genomes belonging to different functional traits were grouped into Clusters of Orthologous Groups (COGs), which were used as features. Then, the TF-IDF technique from the text mining domain was applied to transform the data to accommodate the abundance and importance of each COG. After TF-IDF processing, COGs were ranked using feature selection methods to identify their relevance to the functional trait of interest. Extensive experimental results demonstrated that functional trait related genes can be detected using our method. Further, the method has the potential to provide novel biological insights.
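
    The two steps described above, TF-IDF weighting of COG counts followed by ranking, can be sketched as follows. The abstract does not say which TF-IDF variant or which feature selection methods were used, so this is only a minimal illustration under stated assumptions: it uses the plain form (term frequency times log of N over document frequency), and the ranking score, the difference in mean weight between genomes with and without the trait, is a simple stand-in. The COG identifiers, counts, and labels are hypothetical.

```python
import math


def tfidf(counts: list[dict[str, int]]) -> list[dict[str, float]]:
    """Weight each COG in each genome by term frequency * log(N / document frequency)."""
    n_genomes = len(counts)
    doc_freq: dict[str, int] = {}
    for genome in counts:
        for cog, c in genome.items():
            if c > 0:
                doc_freq[cog] = doc_freq.get(cog, 0) + 1

    weighted = []
    for genome in counts:
        total = sum(genome.values())
        weighted.append({
            cog: (c / total) * math.log(n_genomes / doc_freq[cog])
            for cog, c in genome.items() if c > 0
        })
    return weighted


def rank_cogs(weighted: list[dict[str, float]], has_trait: list[bool]) -> list[str]:
    """Rank COGs by mean weight in trait genomes minus mean weight in the rest.

    This score is an assumed stand-in for a real feature selection method.
    """
    pos = [w for w, y in zip(weighted, has_trait) if y]
    neg = [w for w, y in zip(weighted, has_trait) if not y]
    cogs = {c for w in weighted for c in w}

    def mean(group, cog):
        return sum(w.get(cog, 0.0) for w in group) / len(group) if group else 0.0

    scores = {c: mean(pos, c) - mean(neg, c) for c in cogs}
    return sorted(scores, key=lambda c: (-scores[c], c))


# Hypothetical COG counts for four genomes; the first two carry the trait.
toy_counts = [
    {"COG_A": 3, "COG_B": 1, "COG_C": 1},
    {"COG_A": 2, "COG_B": 1},
    {"COG_B": 2, "COG_C": 1},
    {"COG_B": 1, "COG_C": 2},
]
toy_labels = [True, True, False, False]

if __name__ == "__main__":
    print(rank_cogs(tfidf(toy_counts), toy_labels))
```

    In this toy data COG_B occurs in every genome, so its inverse document frequency is zero and it carries no weight, while COG_A, found only in the trait genomes, ranks first.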

  20. Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence–Function Space and Genome Context to Discover Novel Functions

    PubMed Central

    2017-01-01

    The exponentially increasing number of protein and nucleic acid sequences provides opportunities to discover novel enzymes, metabolic pathways, and metabolites/natural products, thereby adding to our knowledge of biochemistry and biology. The challenge has evolved from generating sequence information to mining the databases to integrating and leveraging the available information, i.e., the availability of “genomic enzymology” web tools. Web tools that allow identification of biosynthetic gene clusters are widely used by the natural products/synthetic biology community, thereby facilitating the discovery of novel natural products and the enzymes responsible for their biosynthesis. However, many novel enzymes with interesting mechanisms participate in uncharacterized small-molecule metabolic pathways; their discovery and functional characterization also can be accomplished by leveraging information in protein and nucleic acid databases. This Perspective focuses on two genomic enzymology web tools that assist the discovery of novel metabolic pathways: (1) Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST) for generating sequence similarity networks to visualize and analyze sequence–function space in protein families and (2) Enzyme Function Initiative-Genome Neighborhood Tool (EFI-GNT) for generating genome neighborhood networks to visualize and analyze the genome context in microbial and fungal genomes. Both tools have been adapted to other applications to facilitate target selection for enzyme discovery and functional characterization. As the natural products community has demonstrated, the enzymology community needs to embrace the essential role of web tools that allow the protein and genome sequence databases to be leveraged for novel insights into enzymological problems. PMID:28826221

  1. Constructing Proteome Reference Map of the Porcine Jejunal Cell Line (IPEC-J2) by Label-Free Mass Spectrometry.

    PubMed

    Kim, Sang Hoon; Pajarillo, Edward Alain B; Balolong, Marilen P; Lee, Ji Yoon; Kang, Dae-Kyung

    2016-06-28

    In this study, the global proteome of the IPEC-J2 cell line was evaluated using ultra-high performance liquid chromatography coupled to a quadrupole Q Exactive™ Orbitrap mass spectrometer. Proteins were isolated from highly confluent IPEC-J2 cells in biological replicates and analyzed by label-free mass spectrometry prior to matching against a porcine genomic dataset. The results identified 1,517 proteins, accounting for 7.35% of all genes in the porcine genome. The highly abundant proteins detected, such as actin, annexin A2, and AHNAK nucleoprotein, are involved in structural integrity, signaling mechanisms, and cellular homeostasis. The high abundance of heat shock proteins indicated their significance in cellular defenses, barrier function, and gut homeostasis. Pathway analysis and annotation using the Kyoto Encyclopedia of Genes and Genomes database resulted in a putative protein network map of the regulation of immunological responses and structural integrity in the cell line. The comprehensive proteome analysis of IPEC-J2 cells provides fundamental insights into overall protein expression and pathway dynamics that might be useful in cell adhesion studies and immunological applications.

  2. Shaping biological knowledge: applications in proteomics.

    PubMed

    Lisacek, F; Chichester, C; Gonnet, P; Jaillet, O; Kappus, S; Nikitin, F; Roland, P; Rossier, G; Truong, L; Appel, R

    2004-01-01

    The central dogma of molecular biology has provided a meaningful principle for data integration in the field of genomics. In this context, integration reflects the known transitions from a chromosome to a protein sequence: transcription, intron splicing, exon assembly and translation. There is no such clear principle for integrating proteomics data, since the laws governing protein folding and interactivity are not quite understood. In our effort to bring together independent pieces of information relative to proteins in a biologically meaningful way, we assess the bias of bioinformatics resources and consequent approximations in the framework of small-scale studies. We analyse proteomics data while following both a data-driven (focus on proteins smaller than 10 kDa) and a hypothesis-driven (focus on whole bacterial proteomes) approach. These applications are potentially the source of specialized complements to classical biological ontologies.

  3. The complete genome and proteome of Laribacter hongkongensis reveal potential mechanisms for adaptations to different temperatures and habitats.

    PubMed

    Woo, Patrick C Y; Lau, Susanna K P; Tse, Herman; Teng, Jade L L; Curreem, Shirly O T; Tsang, Alan K L; Fan, Rachel Y Y; Wong, Gilman K M; Huang, Yi; Loman, Nicholas J; Snyder, Lori A S; Cai, James J; Huang, Jian-Dong; Mak, William; Pallen, Mark J; Lok, Si; Yuen, Kwok-Yung

    2009-03-01

    Laribacter hongkongensis is a newly discovered Gram-negative bacillus of the Neisseriaceae family associated with freshwater fish-borne gastroenteritis and traveler's diarrhea. The complete genome sequence of L. hongkongensis HLHK9, recovered from an immunocompetent patient with severe gastroenteritis, consists of a 3,169-kb chromosome with G+C content of 62.35%. Genome analysis reveals different mechanisms potentially important for its adaptation to diverse habitats of human and freshwater fish intestines and freshwater environments. The gene contents support its phenotypic properties and suggest that amino acids and fatty acids can be used as carbon sources. The extensive variety of transporters, including multidrug efflux and heavy metal transporters as well as genes involved in chemotaxis, may enable L. hongkongensis to survive in different environmental niches. Genes encoding urease, bile salts efflux pump, adhesin, catalase, superoxide dismutase, and other putative virulence factors (such as hemolysins, RTX toxins, patatin-like proteins, phospholipase A1, and collagenases) are present. Proteomes of L. hongkongensis HLHK9 cultured at 37 °C (human body temperature) and 20 °C (freshwater habitat temperature) showed differential gene expression, including two homologous copies of argB, argB-20 and argB-37, which encode two isoenzymes of N-acetyl-L-glutamate kinase (NAGK), NAGK-20 and NAGK-37, in the arginine biosynthesis pathway. NAGK-20 showed higher expression at 20 °C, whereas NAGK-37 showed higher expression at 37 °C. NAGK-20 also had a lower optimal temperature for enzymatic activities and was inhibited by arginine, probably as negative-feedback control. Similar duplicated copies of argB are also observed in bacteria from hot springs such as Thermus thermophilus, Deinococcus geothermalis, Deinococcus radiodurans, and Roseiflexus castenholzii, suggesting that similar mechanisms for temperature adaptation may be employed by other

  4. Proteome complexity and the forces that drive proteome imbalance

    PubMed Central

    Harper, J. Wade; Bennett, Eric J.

    2016-01-01

    Summary: The cellular proteome is a complex microcosm of structural and regulatory networks that requires continuous surveillance and modification to meet the dynamic needs of the cell. It is therefore crucial that the protein flux of the cell remains in balance to ensure proper cell function. Genetic alterations that range from chromosome imbalance to oncogene activation can affect the speed, fidelity and capacity of protein biogenesis and degradation systems, which often results in proteome imbalance. An improved understanding of the causes and consequences of proteome imbalance is helping to reveal how these systems can be targeted to treat diseases such as cancer. PMID:27629639

  5. Neural Stem Cells (NSCs) and Proteomics*

    PubMed Central

    Shoemaker, Lorelei D.; Kornblum, Harley I.

    2016-01-01

    Neural stem cells (NSCs) can self-renew and give rise to the major cell types of the CNS. Studies of NSCs include the investigation of primary, CNS-derived cells as well as animal and human embryonic stem cell (ESC)-derived and induced pluripotent stem cell (iPSC)-derived sources. NSCs provide a means with which to study normal neural development, neurodegeneration, and neurological disease and are clinically relevant sources for cellular repair to the damaged and diseased CNS. Proteomics studies of NSCs have the potential to delineate molecules and pathways critical for NSC biology and the means by which NSCs can participate in neural repair. In this review, we provide a background to NSC biology, including the means to obtain them and the caveats to these processes. We then focus on advances in the proteomic interrogation of NSCs. This includes the analysis of posttranslational modifications (PTMs); approaches to analyzing different proteomic compartments, such as the secretome; as well as approaches to analyzing temporal differences in the proteome to elucidate mechanisms of differentiation. We also discuss some of the methods that will undoubtedly be useful in the investigation of NSCs but which have not yet been applied to the field. While many proteomics studies of NSCs have largely catalogued the proteome or posttranslational modifications of specific cellular states, without delving into specific functions, some have led to understandings of functional processes or identified markers that could not have been identified via other means. Many challenges remain in the field, including the precise identification and standardization of NSCs used for proteomic analyses, as well as how to translate fundamental proteomics studies to functional biology. The next level of investigation will require interdisciplinary approaches, combining the skills of those interested in the biochemistry of proteomics with those interested in modulating NSC function. PMID:26494823

  6. Neural Stem Cells (NSCs) and Proteomics.

    PubMed

    Shoemaker, Lorelei D; Kornblum, Harley I

    2016-02-01

    Neural stem cells (NSCs) can self-renew and give rise to the major cell types of the CNS. Studies of NSCs include the investigation of primary, CNS-derived cells as well as animal and human embryonic stem cell (ESC)-derived and induced pluripotent stem cell (iPSC)-derived sources. NSCs provide a means with which to study normal neural development, neurodegeneration, and neurological disease and are clinically relevant sources for cellular repair to the damaged and diseased CNS. Proteomics studies of NSCs have the potential to delineate molecules and pathways critical for NSC biology and the means by which NSCs can participate in neural repair. In this review, we provide a background to NSC biology, including the means to obtain them and the caveats to these processes. We then focus on advances in the proteomic interrogation of NSCs. This includes the analysis of posttranslational modifications (PTMs); approaches to analyzing different proteomic compartments, such as the secretome; as well as approaches to analyzing temporal differences in the proteome to elucidate mechanisms of differentiation. We also discuss some of the methods that will undoubtedly be useful in the investigation of NSCs but which have not yet been applied to the field. While many proteomics studies of NSCs have largely catalogued the proteome or posttranslational modifications of specific cellular states, without delving into specific functions, some have led to understandings of functional processes or identified markers that could not have been identified via other means. Many challenges remain in the field, including the precise identification and standardization of NSCs used for proteomic analyses, as well as how to translate fundamental proteomics studies to functional biology. The next level of investigation will require interdisciplinary approaches, combining the skills of those interested in the biochemistry of proteomics with those interested in modulating NSC function. © 2016 by The

  7. Shotgun proteomics approach to characterizing the embryonic proteome of the silkworm, Bombyx mori, at labrum appearance stage.

    PubMed

    Li, J-Y; Chen, X; Hosseini Moghaddam, S H; Chen, M; Wei, H; Zhong, B-X

    2009-10-01

    The shotgun approach has gained considerable acknowledgement in recent years as a dominant strategy in proteomics. We observed a dramatic increase of specific protein spots in two-dimensional electrophoresis (2-DE) gels of the silkworm (Bombyx mori) embryo at labrum appearance, a characteristic stage of silkworm embryonic development that involves a temperature increase by the silkworm raiser. We employed shotgun liquid chromatography tandem mass spectrometry (LC-MS/MS) technology to analyse the proteome of B. mori embryos at this stage. A total of 2168 proteins were identified with an in-house database. Approximately 47% of them had isoelectric point (pI) values distributed theoretically in the range pI 5-7 and approximately 60% of them had molecular weights of 15-45 kDa. Furthermore, 111 proteins had a pI greater than 10 and were difficult to separate by 2-DE. Many important functional proteins related to embryonic development, stress response, DNA transcription/translation, cell growth, proliferation and differentiation, organogenesis and reproduction were identified. Among them, proteins related to nervous system development were noticeable. All known heat shock proteins (HSPs) were detected in this developmental stage of the B. mori embryo. In addition, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis showed energetic metabolism at this stage. These results were expected to provide more information for proteomic monitoring of the insect embryo and better understanding of the spatiotemporal expression of genes during embryonic developmental processes.

  8. Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size

    PubMed Central

    Zhang, Jingshan; Maslov, Sergei; Shakhnovich, Eugene I

    2008-01-01

    Crowded intracellular environments present a challenge for proteins to form functional specific complexes while reducing non-functional interactions with promiscuous non-functional partners. Here we show how the need to minimize the waste of resources to non-functional interactions limits the proteome diversity and the average concentration of co-expressed and co-localized proteins. Using the results of high-throughput Yeast 2-Hybrid experiments, we estimate the characteristic strength of non-functional protein–protein interactions. By combining these data with the strengths of specific interactions, we assess the fraction of time proteins spend tied up in non-functional interactions as a function of their overall concentration. This allows us to sketch the phase diagram for baker's yeast cells using the experimentally measured concentrations and subcellular localization of their proteins. The positions of yeast compartments on the phase diagram are consistent with our hypothesis that the yeast proteome has evolved to operate close to the upper limit of its size, while keeping individual protein concentrations sufficiently low to reduce non-functional interactions. These findings have implications for conceptual understanding of intracellular compartmentalization, multicellularity and differentiation. PMID:18682700
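
    To give a feel for how the fraction of time spent in non-functional interactions can grow with overall concentration, the sketch below uses textbook two-state binding with partners in excess, where the bound fraction is C / (C + K). This is a deliberately simplified stand-in and not the model developed in the paper; the function name and the numerical values are assumptions chosen only for illustration.

```python
def fraction_nonfunctional(total_conc: float, kd_nonspecific: float) -> float:
    """Bound fraction under simple two-state binding with partners in excess.

    total_conc: overall concentration of potential non-specific partners.
    kd_nonspecific: characteristic dissociation constant of non-functional
        interactions, in the same units as total_conc.
    """
    if total_conc < 0 or kd_nonspecific <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return total_conc / (total_conc + kd_nonspecific)


if __name__ == "__main__":
    kd = 1.0  # arbitrary units, hypothetical value
    for conc in (0.1, 1.0, 10.0):
        print(conc, round(fraction_nonfunctional(conc, kd), 3))
```

    Even in this crude picture the bound fraction rises monotonically with concentration and reaches one half when the concentration equals the dissociation constant, which conveys why keeping concentrations low limits wasteful binding.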

  9. The proteomic complexity and rise of the primordial ancestor of diversified life

    PubMed Central

    2011-01-01

    Background: The last universal common ancestor represents the primordial cellular organism from which diversified life was derived. This urancestor accumulated genetic information before the rise of organismal lineages and is considered to be either a simple 'progenote' organism with a rudimentary translational apparatus or a more complex 'cenancestor' with almost all essential biological processes. Recent comparative genomic studies support the latter model and propose that the urancestor was similar to modern organisms in terms of gene content. However, most of these studies were based on molecular sequences, which are fast evolving and of limited value for deep evolutionary explorations. Results: Here we engage in a phylogenomic study of protein domain structure in the proteomes of 420 free-living fully sequenced organisms. Domains were defined at the highly conserved fold superfamily (FSF) level of structural classification and an iterative phylogenomic approach was used to reconstruct max_set and min_set FSF repertoires as upper and lower bounds of the urancestral proteome. While the functional make up of the urancestral sets was complex, they represent only 5-11% of the 1,420 FSFs of extant proteomes and their make up and reuse was at least 5 and 3 times smaller than proteomes of free-living organisms, respectively. Trees of proteomes reconstructed directly from FSFs or from molecular functions, which included the max_set and min_set as artificial taxa, showed that urancestors were always placed at their base and rooted the tree of life in Archaea. Finally, a molecular clock of FSFs suggests the min_set reflects urancestral genetic make up more reliably and confirms diversified life emerged about 2.9 billion years ago during the start of planet oxygenation. Conclusions: The minimum urancestral FSF set reveals the urancestor had advanced metabolic capabilities, was especially rich in nucleotide metabolism enzymes, had pathways for the biosynthesis of membrane sn1

  10. Proteogenomics Dashboard for the Human Proteome Project.

    PubMed

    Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto

    2015-09-04

    dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analogous genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell line experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modification information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.

  11. Proteomics of eukaryotic microorganisms: The medically and biotechnologically important fungal genus Aspergillus.

    PubMed

    Kniemeyer, Olaf

    2011-08-01

    Fungal species of the genus Aspergillus play significant roles as model organisms in basic research, as "cell factories" for the production of organic acids, pharmaceuticals or industrially important enzymes and as pathogens causing superficial and invasive infections in animals and humans. The release of the genome sequences of several Aspergillus sp. has paved the way for global analyses of protein expression in Aspergilli, including the characterisation of proteins that have no assigned function. With the application of proteomic methods, particularly 2-D gel and LC-MS/MS-based methods, first insights into the composition of the proteome of Aspergilli under different growth and stress conditions could be gained. Putative targets of global regulators led to the improvement of industrially relevant Aspergillus strains, and previously undescribed Aspergillus antigens have already been discovered. Here, I review the recent proteome data generated for the species Aspergillus nidulans, Aspergillus fumigatus, Aspergillus niger, Aspergillus terreus, Aspergillus flavus and Aspergillus oryzae. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Exploring the Arabidopsis proteome: influence of protein solubilization buffers on proteome coverage.

    PubMed

    Marondedze, Claudius; Wong, Aloysius; Groen, Arnoud; Serrano, Natalia; Jankovic, Boris; Lilley, Kathryn; Gehring, Christoph; Thomas, Ludivine

    2014-12-31

    The study of proteomes provides new insights into stimulus-specific responses of protein synthesis and turnover, and the role of post-translational modifications at the systems level. Due to the diverse chemical nature of proteins and shortcomings in the analytical techniques used in their study, only a partial display of the proteome is achieved in any study, and this holds particularly true for plant proteomes. Here we show that different solubilization and separation methods have profound effects on the resulting proteome. In particular, we observed that the type of detergents employed in the solubilization buffer preferentially enriches proteins in different functional categories. These include proteins with a role in signaling, transport, response to temperature stimuli and metabolism. This data may offer a functional bias on comparative analysis studies. In order to obtain a broader coverage, we propose a two-step solubilization protocol with first a detergent-free buffer and then a second step utilizing a combination of two detergents to solubilize proteins.

  13. Exploring the Arabidopsis Proteome: Influence of Protein Solubilization Buffers on Proteome Coverage

    PubMed Central

    Marondedze, Claudius; Wong, Aloysius; Groen, Arnoud; Serrano, Natalia; Jankovic, Boris; Lilley, Kathryn; Gehring, Christoph; Thomas, Ludivine

    2014-01-01

    The study of proteomes provides new insights into stimulus-specific responses of protein synthesis and turnover, and the role of post-translational modifications at the systems level. Due to the diverse chemical nature of proteins and shortcomings in the analytical techniques used in their study, only a partial display of the proteome is achieved in any study, and this holds particularly true for plant proteomes. Here we show that different solubilization and separation methods have profound effects on the resulting proteome. In particular, we observed that the type of detergents employed in the solubilization buffer preferentially enriches proteins in different functional categories. These include proteins with a role in signaling, transport, response to temperature stimuli and metabolism. This data may offer a functional bias on comparative analysis studies. In order to obtain a broader coverage, we propose a two-step solubilization protocol with first a detergent-free buffer and then a second step utilizing a combination of two detergents to solubilize proteins. PMID:25561235

  14. Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.

    PubMed

    Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali

    2018-04-01

    Indian sandalwood (Santalum album) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.
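
    To put the figures in this abstract in proportion, the short Python sketch below works out what share of the predicted genes received peptide-level support. The two counts come from the abstract; the variable names are illustrative only, and the calculation says nothing about how the authors mapped peptides to genes.

```python
# Counts quoted in the abstract above.
predicted_genes = 38_119   # protein-coding genes predicted by genome annotation
confirmed_genes = 10_076   # predicted genes confirmed by peptide evidence

# Share of predicted genes with proteomic support, and the remainder without it.
confirmed_pct = 100 * confirmed_genes / predicted_genes
unconfirmed_genes = predicted_genes - confirmed_genes

print(f"{confirmed_pct:.1f}% of predicted genes confirmed")
print(f"{unconfirmed_genes} predicted genes without peptide evidence")
```

    Roughly a quarter of the predicted gene models were therefore supported at the protein level in this data set.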

  15. Curated protein information in the Saccharomyces genome database.

    PubMed

    Hellerstedt, Sage T; Nash, Robert S; Weng, Shuai; Paskov, Kelley M; Wong, Edith D; Karra, Kalpana; Engel, Stacia R; Cherry, J Michael

    2017-01-01

    Due to recent advancements in the production of experimental proteomic data, the Saccharomyces genome database (SGD; www.yeastgenome.org) has been expanding our protein curation activities to make new data types available to our users. Because of broad interest in post-translational modifications (PTM) and their importance to protein function and regulation, we have recently started incorporating expertly curated PTM information on individual protein pages. Here we also present the inclusion of new abundance and protein half-life data obtained from high-throughput proteome studies. These new data types have been included with the aim to facilitate cellular biology research. Database URL: www.yeastgenome.org. © The Author(s) 2017. Published by Oxford University Press.

  16. Draft Genome Sequences of Human Pathogenic Fungus Geomyces pannorum Sensu Lato and Bat White Nose Syndrome Pathogen Geomyces (Pseudogymnoascus) destructans.

    PubMed

    Chibucos, Marcus C; Crabtree, Jonathan; Nagaraj, Sushma; Chaturvedi, Sudha; Chaturvedi, Vishnu

    2013-12-19

    We report the draft genome sequences of Geomyces pannorum sensu lato and Geomyces (Pseudogymnoascus) destructans. G. pannorum has a larger proteome than G. destructans, containing more proteins with ascribed enzymatic functions. This dichotomy in the genomes of related psychrophilic fungi is a valuable target for defining their distinct saprobic and pathogenic attributes.

  17. Technological advances and genomics in metazoan parasites.

    PubMed

    Knox, D P

    2004-02-01

    Molecular biology has provided the means to identify parasite proteins, to define their function, patterns of expression and the means to produce them in quantity for subsequent functional analyses. Whole genome and expressed sequence tag programmes, and the parallel development of powerful bioinformatics tools, allow the execution of genome-wide between stage or species comparisons and meaningful gene-expression profiling. The latter can be undertaken with several new technologies such as DNA microarray and serial analysis of gene expression. Proteome analysis has come to the fore in recent years providing a crucial link between the gene and its protein product. RNA interference and ballistic gene transfer are exciting developments which can provide the means to precisely define the function of individual genes and, of importance in devising novel parasite control strategies, the effect that gene knockdown will have on parasite survival.

  18. Genic insights from integrated human proteomics in GeneCards.

    PubMed

    Fishilevich, Simon; Zimmerman, Shahar; Kohn, Asher; Iny Stein, Tsippi; Olender, Tsviya; Kolker, Eugene; Safran, Marilyn; Lancet, Doron

    2016-01-01

    Cards sections and help promote and organize the genome-wide structural and functional knowledge of the human proteome. Database URL: http://www.genecards.org/. © The Author(s) 2016. Published by Oxford University Press.

  19. DOGMA: domain-based transcriptome and proteome quality assessment.

    PubMed

    Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten

    2016-09-01

    Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations is a crucial aspect of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/. CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
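
    The idea of a domain-based completeness score can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not DOGMA's actual algorithm or interface: the function name `domain_completeness`, the core set, and the domain identifiers are all hypothetical, and the score is simply the percentage of an expected set of conserved domains that appears in an annotation.

```python
from typing import Iterable, Set


def domain_completeness(found_domains: Iterable[str], expected_domains: Set[str]) -> float:
    """Return the percentage of expected conserved domains present in an annotation."""
    if not expected_domains:
        raise ValueError("expected_domains must not be empty")
    # Duplicates and domains outside the expected set do not affect the score.
    found = set(found_domains) & expected_domains
    return 100.0 * len(found) / len(expected_domains)


# Hypothetical core set and annotation; the identifiers are placeholders.
core_set = {"DOM_A", "DOM_B", "DOM_C", "DOM_D"}
annotated = ["DOM_A", "DOM_C", "DOM_C", "DOM_X"]

score = domain_completeness(annotated, core_set)
print(f"{score:.1f}% of core domains found")
```

    A set intersection keeps the sketch independent of how often a domain occurs; a real tool would presumably also weigh domain arrangements and copy numbers, which this example does not attempt.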

  20. A proteome-scale map of the human interactome network

    PubMed Central

    Rolland, Thomas; Taşan, Murat; Charloteaux, Benoit; Pevzner, Samuel J.; Zhong, Quan; Sahni, Nidhi; Yi, Song; Lemmens, Irma; Fontanillo, Celia; Mosca, Roberto; Kamburov, Atanas; Ghiassian, Susan D.; Yang, Xinping; Ghamsari, Lila; Balcha, Dawit; Begg, Bridget E.; Braun, Pascal; Brehme, Marc; Broly, Martin P.; Carvunis, Anne-Ruxandra; Convery-Zupan, Dan; Corominas, Roser; Coulombe-Huntington, Jasmin; Dann, Elizabeth; Dreze, Matija; Dricot, Amélie; Fan, Changyu; Franzosa, Eric; Gebreab, Fana; Gutierrez, Bryan J.; Hardy, Madeleine F.; Jin, Mike; Kang, Shuli; Kiros, Ruth; Lin, Guan Ning; Luck, Katja; MacWilliams, Andrew; Menche, Jörg; Murray, Ryan R.; Palagi, Alexandre; Poulin, Matthew M.; Rambout, Xavier; Rasla, John; Reichert, Patrick; Romero, Viviana; Ruyssinck, Elien; Sahalie, Julie M.; Scholz, Annemarie; Shah, Akash A.; Sharma, Amitabh; Shen, Yun; Spirohn, Kerstin; Tam, Stanley; Tejeda, Alexander O.; Trigg, Shelly A.; Twizere, Jean-Claude; Vega, Kerwin; Walsh, Jennifer; Cusick, Michael E.; Xia, Yu; Barabási, Albert-László; Iakoucheva, Lilia M.; Aloy, Patrick; De Las Rivas, Javier; Tavernier, Jan; Calderwood, Michael A.; Hill, David E.; Hao, Tong; Roth, Frederick P.; Vidal, Marc

    2014-01-01

    SUMMARY Just as reference genome sequences revolutionized human genetics, reference maps of interactome networks will be critical to fully understand genotype-phenotype relationships. Here, we describe a systematic map of ~14,000 high-quality human binary protein-protein interactions. At equal quality, this map is ~30% larger than what is available from small-scale studies published in the literature in the last few decades. While currently available information is highly biased and only covers a relatively small portion of the proteome, our systematic map appears strikingly more homogeneous, revealing a “broader” human interactome network than currently appreciated. The map also uncovers significant inter-connectivity between known and candidate cancer gene products, providing unbiased evidence for an expanded functional cancer landscape, while demonstrating how high quality interactome models will help “connect the dots” of the genomic revolution. PMID:25416956

  1. Identification of factors that function in Drosophila salivary gland cell death during development using proteomics

    PubMed Central

    McPhee, C K; Balgley, B M; Nelson, C; Hill, J H; Batlevi, Y; Fang, X; Lee, C S; Baehrecke, E H

    2013-01-01

    Proteasome inhibitors induce cell death and are used in cancer therapy, but little is known about the relationship between proteasome impairment and cell death under normal physiological conditions. Here, we investigate the relationship between proteasome function and larval salivary gland cell death during development in Drosophila. Drosophila larval salivary gland cells undergo synchronized programmed cell death requiring both caspases and autophagy (Atg) genes during development. Here, we show that ubiquitin proteasome system (UPS) function is reduced during normal salivary gland cell death, and that ectopic proteasome impairment in salivary gland cells leads to early DNA fragmentation and salivary gland condensation in vivo. Shotgun proteomic analyses of purified dying salivary glands identified the UPS as the top category of proteins enriched, suggesting a possible compensatory induction of these factors to maintain proteolysis during cell death. We compared the proteome following ectopic proteasome impairment to the proteome during developmental cell death in salivary gland cells. Proteins that were enriched in both populations of cells were screened for their function in salivary gland degradation using RNAi knockdown. We identified several factors, including trol, a novel gene CG11880, and the cop9 signalosome component cop9 signalosome 6, as required for Drosophila larval salivary gland degradation. PMID:22935612

  2. KEGG orthology-based annotation of the predicted proteome of Acropora digitifera: ZoophyteBase - an open access and searchable database of a coral genome

    PubMed Central

    2013-01-01

    Background Contemporary coral reef research has firmly established that a genomic approach is urgently needed to better understand the effects of anthropogenic environmental stress and global climate change on coral holobiont interactions. Here we present KEGG orthology-based annotation of the complete genome sequence of the scleractinian coral Acropora digitifera and provide the first comprehensive view of the genome of a reef-building coral by applying advanced bioinformatics. Description Sequences from the KEGG database of protein function were used to construct hidden Markov models. These models were used to search the predicted proteome of A. digitifera to establish complete genomic annotation. The annotated dataset is published in ZoophyteBase, an open access format with different options for searching the data. A particularly useful feature is the ability to use a Google-like search engine that links query words to protein attributes. We present features of the annotation that underpin the molecular structure of key processes of coral physiology that include (1) regulatory proteins of symbiosis, (2) planula and early developmental proteins, (3) neural messengers, receptors and sensory proteins, (4) calcification and Ca2+-signalling proteins, (5) plant-derived proteins, (6) proteins of nitrogen metabolism, (7) DNA repair proteins, (8) stress response proteins, (9) antioxidant and redox-protective proteins, (10) proteins of cellular apoptosis, (11) microbial symbioses and pathogenicity proteins, (12) proteins of viral pathogenicity, (13) toxins and venom, (14) proteins of the chemical defensome and (15) coral epigenetics. Conclusions We advocate that providing annotation in an open-access searchable database available to the public domain will give an unprecedented foundation to interrogate the fundamental molecular structure and interactions of coral symbiosis and allow critical questions to be addressed at the genomic level based on combined aspects of

  3. CRISPR/Cas9: From Genome Engineering to Cancer Drug Discovery

    PubMed Central

    Luo, Ji

    2016-01-01

    Advances in translational research are often driven by new technologies. The advent of microarrays, next-generation sequencing, proteomics and RNA interference (RNAi) has led to breakthroughs in our understanding of the mechanisms of cancer and the discovery of new cancer drug targets. The discovery of the bacterial clustered regularly interspaced palindromic repeat (CRISPR) system and its subsequent adaptation as a tool for mammalian genome engineering has opened up new avenues for functional genomics studies. This review will focus on the utility of CRISPR in the context of cancer drug target discovery. PMID:28603775

  4. A large scale Plasmodium vivax-Saimiri boliviensis trophozoite-schizont transition proteome

    PubMed Central

    Lapp, Stacey A.; Barnwell, John W.; Galinski, Mary R.

    2017-01-01

    Plasmodium vivax is a complex protozoan parasite with over 6,500 genes and stage-specific differential expression. Much of the unique biology of this pathogen remains unknown, including how it modifies and restructures the host reticulocyte. Using a recently published P. vivax reference genome, we report the proteome from two biological replicates of infected Saimiri boliviensis host reticulocytes undergoing transition from the late trophozoite to early schizont stages. Using five database search engines, we identified a total of 2000 P. vivax and 3487 S. boliviensis proteins, making this the most comprehensive P. vivax proteome to date. PlasmoDB GO-term enrichment analysis of proteins identified at least twice by a search engine highlighted core metabolic processes and molecular functions such as glycolysis, translation and protein folding, cell components such as ribosomes, proteasomes and the Golgi apparatus, and a number of vesicle and trafficking related clusters. Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.8 enriched functional annotation clusters of S. boliviensis proteins highlighted vesicle and trafficking-related clusters, elements of the cytoskeleton, oxidative processes and response to oxidative stress, macromolecular complexes such as the proteasome and ribosome, metabolism, translation, and cell death. Host and parasite proteins potentially involved in cell adhesion were also identified. Over 25% of the P. vivax proteins have no functional annotation; this group includes 45 VIR members of the large PIR family. A number of host and pathogen proteins contained highly oxidized or nitrated residues, extending prior trophozoite-enriched stage observations from S. boliviensis infections, and supporting the possibility of oxidative stress in relation to the disease. This proteome significantly expands the size and complexity of the known P. vivax and Saimiri host iRBC proteomes, and provides in-depth data that will be valuable

  5. The Divided Bacterial Genome: Structure, Function, and Evolution.

    PubMed

    diCenzo, George C; Finan, Turlough M

    2017-09-01

    Approximately 10% of bacterial genomes are split between two or more large DNA fragments, a genome architecture referred to as a multipartite genome. This multipartite organization is found in many important organisms, including plant symbionts, such as the nitrogen-fixing rhizobia, and plant, animal, and human pathogens, including the genera Brucella, Vibrio, and Burkholderia. The availability of many complete bacterial genome sequences means that we can now examine on a broad scale the characteristics of the different types of DNA molecules in a genome. Recent work has begun to shed light on the unique properties of each class of replicon, the unique functional role of chromosomal and nonchromosomal DNA molecules, and how the exploitation of novel niches may have driven the evolution of the multipartite genome. The aims of this review are to (i) outline the literature regarding bacterial genomes that are divided into multiple fragments, (ii) provide a meta-analysis of completed bacterial genomes from 1,708 species as a way of reviewing the abundant information present in these genome sequences, and (iii) provide an encompassing model to explain the evolution and function of the multipartite genome structure. This review covers, among other topics, salient genome terminology; mechanisms of multipartite genome formation; the phylogenetic distribution of multipartite genomes; how each part of a genome differs with respect to genomic signatures, genetic variability, and gene functional annotation; how each DNA molecule may interact; as well as the costs and benefits of this genome structure. Copyright © 2017 American Society for Microbiology.

  6. [Methods of quantitative proteomics].

    PubMed

    Kopylov, A T; Zgoda, V G

    2007-01-01

    In modern science, proteomic analysis is inseparable from other fields of systems biology. Drawing on extensive resources, quantitative proteomics handles a vast amount of information on the molecular mechanisms of life. Advances in proteomics help researchers to solve complex problems of cell signaling, posttranslational modification, structural and functional homology of proteins, molecular diagnostics, etc. More than 40 different methods have been developed in proteomics for the quantitative analysis of proteins. Although each method is unique and has certain advantages and disadvantages, all of them use various isotope labels (tags). In this review we consider the most popular and effective methods, employing chemical modification of proteins as well as metabolic and enzymatic methods of isotope labeling.

  7. Functional proteomics outlines the complexity of breast cancer molecular subtypes.

    PubMed

    Gámez-Pozo, Angelo; Trilla-Fuertes, Lucía; Berges-Soria, Julia; Selevsek, Nathalie; López-Vacas, Rocío; Díaz-Almirón, Mariana; Nanni, Paolo; Arevalillo, Jorge M; Navarro, Hilario; Grossmann, Jonas; Gayá Moreno, Francisco; Gómez Rioja, Rubén; Prado-Vázquez, Guillermo; Zapater-Moros, Andrea; Main, Paloma; Feliú, Jaime; Martínez Del Prado, Purificación; Zamora, Pilar; Ciruelos, Eva; Espinosa, Enrique; Fresno Vara, Juan Ángel

    2017-08-30

    Breast cancer is a heterogeneous disease comprising a variety of entities with various genetic backgrounds. Estrogen receptor-positive, human epidermal growth factor receptor 2-negative tumors typically have a favorable outcome; however, some patients eventually relapse, which suggests some heterogeneity within this category. In the present study, we used proteomics and miRNA profiling techniques to characterize a set of 102 either estrogen receptor-positive (ER+)/progesterone receptor-positive (PR+) or triple-negative formalin-fixed, paraffin-embedded breast tumors. Protein expression-based probabilistic graphical models and flux balance analyses revealed that some ER+/PR+ samples had a protein expression profile similar to that of triple-negative samples and had a clinical outcome similar to those with triple-negative disease. This probabilistic graphical model-based classification had prognostic value in patients with luminal A breast cancer. This prognostic information was independent of that provided by standard genomic tests for breast cancer, such as MammaPrint, OncoType Dx and the 8-gene Score.

  8. An Evolutionary Classification of Genomic Function

    PubMed Central

    Graur, Dan; Zheng, Yichen; Azevedo, Ricardo B.R.

    2015-01-01

    The pronouncements of the ENCODE Project Consortium regarding “junk DNA” exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into “functional DNA,” that is, DNA sequences that have a selected-effect function, and “rubbish DNA,” that is, sequences that do not. Functional DNA is further subdivided into “literal DNA” and “indifferent DNA.” In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into “junk DNA” and “garbage DNA.” Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA. PMID:25635041
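
    The two-level scheme described in this abstract maps naturally onto a small decision function. The Python sketch below is an illustrative reading of the abstract only: the names `Category`, `classify`, and `is_functional`, and the three boolean inputs, are assumptions introduced here and do not come from the paper.

```python
from enum import Enum


class Category(Enum):
    LITERAL = "literal DNA"          # functional; nucleotide order under selection
    INDIFFERENT = "indifferent DNA"  # functional; only presence/absence under selection
    JUNK = "junk DNA"                # rubbish; selectively neutral
    GARBAGE = "garbage DNA"          # rubbish; decreases fitness of its carriers


def classify(selected_effect: bool,
             order_under_selection: bool = False,
             deleterious: bool = False) -> Category:
    """Assign a sequence to one of the four categories from its present status."""
    if selected_effect:
        return Category.LITERAL if order_under_selection else Category.INDIFFERENT
    return Category.GARBAGE if deleterious else Category.JUNK


def is_functional(category: Category) -> bool:
    """Functional DNA comprises literal and indifferent DNA; the rest is rubbish DNA."""
    return category in {Category.LITERAL, Category.INDIFFERENT}


print(classify(selected_effect=True, order_under_selection=True).value)
print(classify(selected_effect=False, deleterious=True).value)
```

    The function takes only the present status of a sequence as input, in line with the abstract's point that classification should not depend on a sequence's potential to change in the future.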

  9. Draft Genome Sequences of Human Pathogenic Fungus Geomyces pannorum Sensu Lato and Bat White Nose Syndrome Pathogen Geomyces (Pseudogymnoascus) destructans

    PubMed Central

    Crabtree, Jonathan; Nagaraj, Sushma; Chaturvedi, Sudha

    2013-01-01

    We report the draft genome sequences of Geomyces pannorum sensu lato and Geomyces (Pseudogymnoascus) destructans. G. pannorum has a larger proteome than G. destructans, containing more proteins with ascribed enzymatic functions. This dichotomy in the genomes of related psychrophilic fungi is a valuable target for defining their distinct saprobic and pathogenic attributes. PMID:24356829

  10. Genomic and Proteomic Dissection of the Ubiquitous Plant Pathogen, Armillaria mellea: Toward a New Infection Model System

    PubMed Central

    2013-01-01

    Armillaria mellea is a major plant pathogen. Yet, no large-scale “-omics” data are available to enable new studies, and limited experimental models are available to investigate basidiomycete pathogenicity. Here we reveal that the A. mellea genome comprises 58.35 Mb, contains 14473 gene models, of average length 1575 bp (4.72 introns/gene). Tandem mass spectrometry identified 921 mycelial (n = 629 unique) and secreted (n = 183 unique) proteins. Almost 100 mycelial proteins were either species-specific or previously unidentified at the protein level. A number of proteins (n = 111) was detected in both mycelia and culture supernatant extracts. Signal sequence occurrence was 4-fold greater for secreted (50.2%) compared to mycelial (12%) proteins. Analyses revealed a rich reservoir of carbohydrate degrading enzymes, laccases, and lignin peroxidases in the A. mellea proteome, reminiscent of both basidiomycete and ascomycete glycodegradative arsenals. We discovered that A. mellea exhibits a specific killing effect against Candida albicans during coculture. Proteomic investigation of this interaction revealed the unique expression of defensive and potentially offensive A. mellea proteins (n = 30). Overall, our data reveal new insights into the origin of basidiomycete virulence and we present a new model system for further studies aimed at deciphering fungal pathogenic mechanisms. PMID:23656496

  11. Acute toxicity of functionalized single wall carbon nanotubes: A biochemical, histopathologic and proteomics approach.

    PubMed

    Ahmadi, Homa; Ramezani, Mohammad; Yazdian-Robati, Rezvan; Behnam, Behzad; Razavi Azarkhiavi, Kamal; Hashem Nia, Azadeh; Mokhtarzadeh, Ahad; Matbou Riahi, Maryam; Razavi, Bibi Marjan; Abnous, Khalil

    2017-09-25

    Recently, carbon nanotubes (CNTs) have shown promising potential in different biomedical applications, but their safe use in humans and probable toxicities are still challenging. The aim of this study was to determine the acute toxicity of functionalized single walled carbon nanotubes (SWCNTs). In this project, PEGylated and Tween functionalized SWCNTs were prepared. BALB/c mice were randomly divided into nine groups, including PEGylated SWCNTs (75, 150 μg/mouse) and PEG, Tween 80 suspended SWCNTs, Tween 80 and a control group (intact mice). One or 7 days after intravenous injection, the mice were killed and serum and livers were collected. The oxidative stress markers, biochemical and histopathological changes were studied. Subsequently, a proteomics approach was used to investigate the alterations of protein expression profiles in the liver. Results showed that there were no significant differences in malondialdehyde (MDA) and glutathione (GSH) levels or biochemical enzymes (ALT and AST) between groups, while the histopathological observations of livers showed some injuries. The results of proteomics analysis revealed that indolethylamine N-methyltransferase (INMT), glycine N-methyltransferase (GNMT), selenium binding protein (Selenbp), thioredoxin peroxidase (TPx), TNF receptor associated protein 1 (Trap1), peroxiredoxin-6 (Prdx6), electron transport flavoprotein (Etf-α), regucalcin (Rgn) and ATP5b proteins were differentially expressed in functionalized SWCNTs groups. Western blot analyses confirmed that the changes in Prdx6 were consistent with 2-DE gel analysis. In summary, the acute toxicological study on two functionalized SWCNTs did not show any significant toxicity at selected doses. Proteomics analysis also showed that following exposure to functionalized SWCNTs, the expression of some proteins with antioxidant activity and detoxifying properties was increased in liver tissue. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. The Redox Proteome*

    PubMed Central

    Go, Young-Mi; Jones, Dean P.

    2013-01-01

    The redox proteome consists of reversible and irreversible covalent modifications that link redox metabolism to biologic structure and function. These modifications, especially of Cys, function at the molecular level in protein folding and maturation, catalytic activity, signaling, and macromolecular interactions and at the macroscopic level in control of secretion and cell shape. Interaction of the redox proteome with redox-active chemicals is central to macromolecular structure, regulation, and signaling during the life cycle and has a central role in the tolerance and adaptability to diet and environmental challenges. PMID:23861437

  13. Genomics, proteomics, MEMS and SAIF: which role for diagnostic imaging?

    PubMed

    Grassi, R; Lagalla, R; Rotondo, A

    2008-09-01

    In these three words--genomics, proteomics and nanotechnologies--is the future of medicine of the third millennium, which will be characterised by more careful attention to disease prevention, diagnosis and treatment. Molecular imaging appears to satisfy this requirement. It is emerging as a new science that brings together molecular biology and in vivo imaging and represents the key for the application of personalized medicine. Micro-PET (positron emission tomography), micro-SPECT (single photon emission computed tomography), micro-CT (computed tomography), micro-MR (magnetic resonance), micro-US (ultrasound) and optical imaging are all molecular imaging techniques, several of which are applied only in preclinical settings on animal models. Others, however, are applied routinely in both clinical and preclinical setting. Research on small animals allows investigation of the genesis and development of diseases, as well as drug efficacy and the development of personalized therapies, through the study of biological processes that precede the expression of common symptoms of a pathology. Advances in molecular imaging were made possible only by collaboration among scientists in the fields of radiology, chemistry, molecular and cell biology, physics, mathematics, pharmacology, gene therapy and oncology. Although until now researchers have traditionally limited their interactions, it is only by increasing these connections that the current gaps in terminology, methods and approaches that inhibit scientific progress can be eliminated.

  14. Proteomics data repositories

    PubMed Central

    Riffle, Michael; Eng, Jimmy K.

    2010-01-01

    The field of proteomics, particularly the application of mass spectrometry analysis to protein samples, is well-established and growing rapidly. Proteomics studies generate large volumes of raw experimental data and inferred biological results. To facilitate the dissemination of these data, centralized data repositories have been developed that make the data and results accessible to proteomics researchers and biologists alike. This review of proteomics data repositories focuses exclusively on freely-available, centralized data resources that disseminate or store experimental mass spectrometry data and results. The resources chosen reflect a current “snapshot” of the state of resources available with an emphasis placed on resources that may be of particular interest to yeast researchers. Resources are described in terms of their intended purpose and the features and functionality provided to users. PMID:19795424

  15. A predicted physicochemically distinct sub-proteome associated with the intracellular organelle of the anammox bacterium Kuenenia stuttgartiensis.

    PubMed

    Medema, Marnix H; Zhou, Miaomiao; van Hijum, Sacha A F T; Gloerich, Jolein; Wessels, Hans J C T; Siezen, Roland J; Strous, Marc

    2010-05-12

    Anaerobic ammonium-oxidizing (anammox) bacteria perform a key step in global nitrogen cycling. These bacteria make use of an organelle to oxidize ammonia anaerobically to nitrogen (N2) and so contribute approximately 50% of the nitrogen in the atmosphere. It is currently unknown which proteins constitute the organellar proteome and how anammox bacteria are able to specifically target organellar and cell-envelope proteins to their correct final destinations. Experimental approaches are complicated by the absence of pure cultures and genetic accessibility. However, the genome of the anammox bacterium Candidatus "Kuenenia stuttgartiensis" has recently been sequenced. Here, we make use of these genome data to predict the organellar sub-proteome and address the molecular basis of protein sorting in anammox bacteria. Two training sets representing organellar (30 proteins) and cell envelope (59 proteins) proteins were constructed based on previous experimental evidence and comparative genomics. Random forest (RF) classifiers trained on these two sets could differentiate between organellar and cell envelope proteins with ~89% accuracy using 400 features consisting of frequencies of two adjacent amino acid combinations. A physicochemically distinct organellar sub-proteome containing 562 proteins was predicted with the best RF classifier. This set included almost all catabolic and respiratory factors encoded in the genome. Apparently, the cytoplasmic membrane performs no catabolic functions. We predict that the Tat-translocation system is located exclusively in the organellar membrane, whereas the Sec-translocation system is located on both the organellar and cytoplasmic membranes. Canonical signal peptides were predicted and validated experimentally, but a specific (N- or C-terminal) signal that could be used for protein targeting to the organelle remained elusive. 
A physicochemically distinct organellar sub-proteome was predicted from the genome of the anammox bacterium K
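
    The 400 features mentioned in this abstract correspond to the 20 x 20 possible pairs of adjacent amino acids. The following is a minimal sketch of how such a feature vector might be computed; the abstract does not give the exact procedure, so the normalization by the number of valid pairs, the handling of non-standard residues, and the helper name `dipeptide_frequencies` are assumptions made here for illustration only.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# All ordered pairs of adjacent residues: 20 * 20 = 400 features.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_frequencies(sequence):
    """Return a 400-element list of adjacent-residue pair frequencies.

    Assumption: pairs containing a non-standard residue (e.g. 'X') are
    skipped, and counts are divided by the number of valid pairs.
    """
    seq = sequence.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


if __name__ == "__main__":
    features = dipeptide_frequencies("MKKL")
    print(len(features))                     # number of features
    print(features[DIPEPTIDES.index("KK")])  # frequency of the pair KK
```

    Vectors of this kind, one per protein, could then be passed to any random forest implementation (for example scikit-learn's `RandomForestClassifier`) together with the organellar or cell-envelope labels; which implementation the authors used is not stated here.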

  16. Defining functional DNA elements in the human genome

    PubMed Central

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

    2014-01-01

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

  17. The impact of proteomics on the understanding of functions and biogenesis of fungal extracellular vesicles.

    PubMed

    Rodrigues, Marcio L; Nakayasu, Ernesto S; Almeida, Igor C; Nimrichter, Leonardo

    2014-01-31

    Several microbial molecules are released to the extracellular space in vesicle-like structures. In pathogenic fungi, these molecules include pigments, polysaccharides, lipids, and proteins, which traverse the cell wall in vesicles that accumulate in the extracellular space. The diverse composition of fungal extracellular vesicles (EV) is indicative of multiple mechanisms of cellular biogenesis, a hypothesis that was supported by EV proteomic studies in a set of Saccharomyces cerevisiae strains with defects in both conventional and unconventional secretory pathways. In the human pathogens Cryptococcus neoformans, Histoplasma capsulatum, and Paracoccidioides brasiliensis, extracellular vesicle proteomics revealed the presence of proteins with both immunological and pathogenic activities. In fact, fungal EV have been demonstrated to interfere with the activity of immune effector cells and to increase fungal pathogenesis. In this review, we discuss the impact of proteomics on the understanding of functions and biogenesis of fungal EV, as well as the potential role of these structures in fungal pathogenesis. This article is part of a Special Issue entitled: Trends in Microbial Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. Design and Initial Characterization of the SC-200 Proteomics Standard Mixture

    PubMed Central

    Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald

    2011-01-01

    High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels. PMID:21250827

  19. Design and initial characterization of the SC-200 proteomics standard mixture.

    PubMed

    Bauman, Andrew; Higdon, Roger; Rapson, Sean; Loiue, Brenton; Hogan, Jason; Stacy, Robin; Napuli, Alberto; Guo, Wenjin; van Voorhis, Wesley; Roach, Jared; Lu, Vincent; Landorf, Elizabeth; Stewart, Elizabeth; Kolker, Natali; Collart, Frank; Myler, Peter; van Belle, Gerald; Kolker, Eugene

    2011-01-01

    High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels.
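
    As a worked illustration of the tier design described in this abstract, the sketch below computes 10 concentration tiers spanning a 1,000-fold range. The abstract does not say how the tiers are spaced, so even spacing on a logarithmic scale and a lowest concentration of 1 (arbitrary units) are assumptions made purely for illustration, and `concentration_tiers` is a hypothetical helper, not part of the SC-200 specification.

```python
def concentration_tiers(n_tiers=10, fold_range=1000.0, lowest=1.0):
    """Return n_tiers relative concentrations from lowest to lowest * fold_range.

    Assumption (not stated in the abstract): tiers are evenly spaced on a
    log scale, so each tier is a constant factor above the previous one.
    """
    if n_tiers < 2:
        raise ValueError("need at least two tiers to span a range")
    step = fold_range ** (1.0 / (n_tiers - 1))  # constant ratio between tiers
    return [lowest * step ** i for i in range(n_tiers)]


if __name__ == "__main__":
    for index, value in enumerate(concentration_tiers(), start=1):
        print(f"tier {index:2d}: {value:8.2f}")
```

    Under these assumptions adjacent tiers differ by a factor of 1000^(1/9), roughly 2.15; a different spacing would change the individual values but not the overall 1,000-fold span.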

  20. Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

    PubMed

    Gibbons, John G; Rokas, Antonis

    2009-03-01

    Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared to be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.
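
    The definition given at the start of this abstract (consecutive repeats of a unit of three or more nucleotides) can be illustrated with a short regular-expression scan. This is only a sketch: the abstract does not describe the detection software or thresholds used, so the exact-match requirement, the default of two copies, and the helper name `find_tandem_repeats` are assumptions for illustration.

```python
import re


def find_tandem_repeats(dna, min_unit=3, min_copies=2):
    """Find perfect tandem repeats in a DNA string.

    Returns a list of (start, unit, copies) tuples, where start is the
    0-based position of the first copy. Assumptions: repeats must be exact,
    and the shortest repeating unit at each position is reported.
    """
    # Lazy group finds the shortest unit; the backreference requires at
    # least (min_copies - 1) further identical copies right after it.
    pattern = re.compile(r"([ACGT]{%d,}?)\1{%d,}" % (min_unit, min_copies - 1))
    repeats = []
    for match in pattern.finditer(dna.upper()):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        repeats.append((match.start(), unit, copies))
    return repeats


if __name__ == "__main__":
    # A toy coding sequence with three consecutive CAG codons.
    print(find_tandem_repeats("ATGTTTCAGCAGCAGTAA"))
```

    A real analysis would typically also allow imperfect copies and restrict the search to annotated coding regions; both refinements are omitted here.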

  1. Quantitative proteomic analysis in breast cancer.

    PubMed

    Tabchy, A; Hennessy, B T; Gonzalez-Angulo, A M; Bernstam, F M; Lu, Y; Mills, G B

    2011-02-01

    Much progress has recently been made in the genomic and transcriptional characterization of tumors. However, historically the characterization of cells at the protein level has suffered limitations in reproducibility, scalability and robustness. Recent technological advances have made it possible to accurately and reproducibly portray the global levels and active states of cellular proteins. Protein microarrays examine the native post-translational conformations of proteins including activated phosphorylated states, in a comprehensive high-throughput mode, and can map activated pathways and networks of proteins inside the cells. The reverse-phase protein microarray (RPPA) offers a unique opportunity to study signal transduction networks in small biological samples such as human biopsy material and can provide critical information for therapeutic decision-making and the monitoring of patients for targeted molecular medicine. By providing the key missing link to the story generated from genomic and gene expression characterization efforts, functional proteomics offer the promise of a comprehensive understanding of cancer. Several initial successes in breast cancer are showing that such information is clinically relevant. Copyright 2011 Prous Science, S.A.U. or its licensors. All rights reserved.

  2. Proteomic analysis of the venom from the scorpion Mesobuthus martensii.

    PubMed

    Xu, Xiaobo; Duan, Zhigui; Di, Zhiyong; He, Yawen; Li, Jianglin; Li, Zhongjie; Xie, Chunliang; Zeng, Xiongzhi; Cao, Zhijian; Wu, Yingliang; Liang, Songping; Li, Wenxin

    2014-06-25

    The scorpion Mesobuthus martensii is the most populous species in eastern Asian countries, and several toxic components have been identified from its venom. Nevertheless, a complete proteomic profile of the venom of M. martensii is still not available. In this study, the venom of M. martensii was analyzed by comprehensive proteomic approaches. A total of 153 fractions were isolated from the M. martensii venom by 2-DE, SDS-PAGE and RP-HPLC. The ESI-Q-TOF MS results of all fractions were used to search the scorpion genomic and transcriptomic databases. In total, 227 non-redundant protein sequences were unambiguously identified, comprising 134 previously known and 93 previously unknown proteins. Among the 134 previously known proteins, 115 were confirmed for the first time from the M. martensii crude venom and 19 toxins were confirmed once again; together they comprised 43 typical toxins, 7 atypical toxins, 12 venom enzymes and 72 cell-associated proteins. Among the typical toxins, 7 novel toxin sequences were identified, including 3 Na(+)-channel toxins, 3 K(+)-channel toxins and 1 toxin without annotation. These results increased the number of known venom components by 230% (115/50) compared with previous studies of the M. martensii venom, and the number of typical toxins by 50% (24/48). Additionally, a mass fingerprint obtained by MALDI-TOF MS indicated that the scorpion venom contained more than 200 different molecular mass components. This work is the first systematic investigation of the M. martensii venom by a combined proteomics strategy coupled with genomics and transcriptomics. A large number of protein components were unambiguously identified from the venom of M. martensii, most of which were confirmed for the first time. We also contributed 7 novel toxin sequences and 93 protein sequences previously unknown to be part of the venom, for which we assigned potential biological functions. In addition, we obtained a mass fingerprint of the M. martensii venom.
Together, our study not only provides the most comprehensive catalog of the

  3. N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana *

    PubMed Central

    Ndah, Elvis; Jonckheere, Veronique

    2017-01-01

    Proteogenomics is an emerging research field yet lacking a uniform method of analysis. Proteogenomic studies in which N-terminal proteomics and ribosome profiling are combined, suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After a stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and computational translations of sequences stored in public repositories. Most interestingly, complementary evidence by ribosome profiling was found for 23 protein N termini. Finally, by analyzing protein N-terminal peptides, an in silico analysis demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes. PMID:28432195

  4. N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana.

    PubMed

    Willems, Patrick; Ndah, Elvis; Jonckheere, Veronique; Stael, Simon; Sticker, Adriaan; Martens, Lennart; Van Breusegem, Frank; Gevaert, Kris; Van Damme, Petra

    2017-06-01

    Proteogenomics is an emerging research field yet lacking a uniform method of analysis. Proteogenomic studies in which N-terminal proteomics and ribosome profiling are combined, suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After a stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and computational translations of sequences stored in public repositories. Most interestingly, complementary evidence by ribosome profiling was found for 23 protein N termini. Finally, by analyzing protein N-terminal peptides, an in silico analysis demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
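
    The filter on "N-terminal methionine excision specificity" mentioned in this abstract can be sketched with the commonly cited rule that the initiator methionine is removed when the second residue has a small side chain (Ala, Cys, Gly, Pro, Ser, Thr or Val). The sketch below is a simplified heuristic under that assumption; the function names are hypothetical and the authors' actual pipeline and criteria are not described here.

```python
# Residues after which the initiator Met is typically removed
# (simplified rule; assumed here for illustration).
SMALL_RESIDUES = set("ACGPSTV")


def mature_n_terminus(protein):
    """Return the protein sequence after predicted initiator Met excision."""
    if len(protein) >= 2 and protein[0] == "M" and protein[1] in SMALL_RESIDUES:
        return protein[1:]
    return protein


def is_nme_compliant(peptide, preceding_residue):
    """Check whether an observed N-terminal peptide fits the excision rule.

    peptide: observed peptide sequence, N-terminus first.
    preceding_residue: residue encoded immediately upstream of the peptide
    in the translated sequence ('' if the peptide starts at the first codon).
    """
    if not peptide:
        return False
    if peptide[0] == "M":
        # Retained initiator Met: expected when the next residue is not small.
        return len(peptide) > 1 and peptide[1] not in SMALL_RESIDUES
    # Met removed: upstream residue must be Met and the new N-terminus small.
    return preceding_residue == "M" and peptide[0] in SMALL_RESIDUES


if __name__ == "__main__":
    print(mature_n_terminus("MASK"))     # Met followed by Ala
    print(is_nme_compliant("ASK", "M"))  # peptide starting right after a Met
```

    Real data are less clean than this rule suggests (for instance, partial processing occurs), so a check like this is best treated as one filter among several rather than as proof of translation initiation.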

  5. The International Proteomics Tutorial Programme (IPTP): a teaching tool box for the proteomics community.

    PubMed

    James, Peter

    2011-09-01

    The most critical functions of the various proteomics organisations are the training of young scientists and the dissemination of information to the general scientific community. The education committees of the Human Proteome Organisation (HUPO) and the European Proteomics Association (EuPA) together with their national counterparts are therefore launching the International Proteomics Tutorial Programme to meet these needs. The programme is being led by Peter James (Sweden), Thierry Rabilloud (France) and Kazuyuki Nakamura (Japan). It involves collaboration between the leading proteomics journals: Journal of Proteome Research, Journal of Proteomics, Molecular and Cellular Proteomics, and Proteomics. The overall level is aimed at Masters/PhD level students who are starting out their research and who would benefit from a solid grounding in the techniques used in modern protein-based research. The tutorial program will cover core techniques and basics as an introduction to scientists new to the field. At a later stage the programme may be expanded with a series of more advanced topics focussing on the application of proteomics techniques to biological problem solving. The entire series of articles and slides will be made freely available for teaching use at the Journals and Organisations homepages and at a special website, www.proteomicstutorials.org. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. Proteomic contributions to our understanding of vaccine and immune responses

    PubMed Central

    Galassie, Allison C.; Link, Andrew J.

    2015-01-01

    Vaccines are one of the greatest public health successes; yet, due to the empirical nature of vaccine design, we have an incomplete understanding of how the genes and proteins induced by vaccines contribute to the development of both protective innate and adaptive immune responses. While the advent of genomics has enabled new vaccine development and facilitated understanding of the immune response, proteomics identifies potentially new vaccine antigens with increasing speed and sensitivity. In addition, as proteomics is complementary to transcriptomic approaches, a combination of both approaches provides a more comprehensive view of the immune response after vaccination via systems vaccinology. This review details the advances that proteomic strategies have made in vaccine development and reviews how proteomics contributes to the development of a more complete understanding of human vaccines and immune responses. PMID:26172619

  7. Tetrazine ligation for chemical proteomics.

    PubMed

    Kang, Kyungtae; Park, Jongmin; Kim, Eunha

    2016-01-01

    Determining the interactions between small molecules and their target proteins is essential for chemical proteomics. One of the most important keys to exploring biological systems in the chemical proteomics field is finding first-class molecular tools. Chemical probes can provide great spatiotemporal control for elucidating the biological functions of proteins as well as for interrogating biological pathways. The invention of bioorthogonal chemistry has revolutionized the field of chemical biology by providing superior chemical tools, and it has been widely used for investigating the dynamics and function of biomolecules under live conditions. Among 20 different bioorthogonal reactions, tetrazine ligation has been spotlighted as the most advanced bioorthogonal chemistry because of its extremely fast kinetics and higher specificity compared with the others. Therefore, tetrazine ligation has tremendous potential to enhance proteomic research. This review highlights the current status of the tetrazine ligation reaction as a molecular tool for chemical proteomics.

  8. An Extremely Halophilic Proteobacterium Combines a Highly Acidic Proteome with a Low Cytoplasmic Potassium Content*

    PubMed Central

    Deole, Ratnakar; Challacombe, Jean; Raiford, Douglas W.; Hoff, Wouter D.

    2013-01-01

    Halophilic archaea accumulate molar concentrations of KCl in their cytoplasm as an osmoprotectant and have evolved highly acidic proteomes that function only at high salinity. We examined osmoprotection in the photosynthetic Proteobacteria Halorhodospira halophila and Halorhodospira halochloris. Genome sequencing and isoelectric focusing gel electrophoresis showed that the proteome of H. halophila is acidic. In line with this finding, H. halophila accumulated molar concentrations of KCl when grown in high salt medium as detected by x-ray microanalysis and plasma emission spectrometry. This result extends the taxonomic range of organisms using KCl as a main osmoprotectant to the Proteobacteria. The closely related organism H. halochloris does not exhibit an acidic proteome, matching its inability to accumulate K+. This observation indicates recent evolutionary changes in the osmoprotection strategy of these organisms. Upon growth of H. halophila in low salt medium, its cytoplasmic K+ content matches that of Escherichia coli, revealing an acidic proteome that can function in the absence of high cytoplasmic salt concentrations. These findings necessitate a reassessment of two central aspects of theories for understanding extreme halophiles. First, we conclude that proteome acidity is not driven by stabilizing interactions between K+ ions and acidic side chains but by the need for maintaining sufficient solvation and hydration of the protein surface at high salinity through strongly hydrated carboxylates. Second, we propose that obligate protein halophilicity is a non-adaptive property resulting from genetic drift in which constructive neutral evolution progressively incorporates weakly stabilizing K+-binding sites on an increasingly acidic protein surface. PMID:23144460
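
    Proteome acidity of the kind discussed in this abstract is often summarized from sequence alone by comparing the numbers of acidic and basic residues. The sketch below computes such a crude per-sequence score; it is not the method used in the study (which relied on genome sequencing and isoelectric focusing), and the choice to count only Asp/Glu against Lys/Arg, ignoring His, as well as the name `acidic_excess`, are assumptions made for illustration.

```python
def acidic_excess(sequence):
    """Return (acidic - basic) residue count divided by sequence length.

    Acidic residues: Asp (D), Glu (E). Basic residues: Lys (K), Arg (R).
    Positive values indicate an excess of acidic residues. Histidine is
    ignored in this simplified score (an assumption).
    """
    seq = sequence.upper()
    if not seq:
        return 0.0
    acidic = sum(seq.count(residue) for residue in "DE")
    basic = sum(seq.count(residue) for residue in "KR")
    return (acidic - basic) / len(seq)


def mean_acidic_excess(proteome):
    """Average the score over an iterable of protein sequences."""
    scores = [acidic_excess(seq) for seq in proteome]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    toy_proteome = ["MDDEEKR", "MKKRLA"]  # toy sequences, not real proteins
    print(mean_acidic_excess(toy_proteome))
```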

  9. Significant expansion of exon-bordering protein domains during animal proteome evolution

    PubMed Central

    Liu, Mingyi; Walch, Heiko; Wu, Shaoping; Grigoriev, Andrei

    2005-01-01

    We present evidence of remarkable genome-wide mobility and evolutionary expansion for a class of protein domains whose borders locate close to the borders of their encoding exons. These exon-bordering domains are more numerous and widely distributed in the human genome than other domains. They also co-occur with more diverse domains to form a larger variety of domain architectures in human proteins. A systematic comparison of nine animal genomes from nematodes to mammals revealed that exon-bordering domains expanded faster than other protein domains in both abundance and distribution, as well as the diversity of co-occurring domains and the domain architectures of harboring proteins. Furthermore, exon-bordering domains exhibited a particularly strong preference for class 1-1 intron phase. Our findings suggest that exon-bordering domains were amplified and interchanged within a genome more often and/or more successfully than other domains during evolution, probably the result of extensive exon shuffling and gene duplication events. The diverse biological functions of these domains underscore the important role they play in the expansion and diversification of animal proteomes. PMID:15640447

  10. FGWAS: Functional genome wide association analysis.

    PubMed

    Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu

    2017-10-01

    Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Green systems biology - From single genomes, proteomes and metabolomes to ecosystems research and biotechnology.

    PubMed

    Weckwerth, Wolfram

    2011-12-10

    biochemical networks up to whole species populations. This process relies on the development of new technologies for the analysis of molecular data, especially genomics, metabolomics and proteomics data. The ambitious aim of these non-targeted 'omic' technologies is to extend our understanding beyond the analysis of separated parts of the system, in contrast to traditional reductionistic hypothesis-driven approaches. The consequent integration of genotyping, pheno/morphotyping and the analysis of the molecular phenotype using metabolomics, proteomics and transcriptomics will reveal a novel understanding of plant metabolism and its interaction with the environment. The analysis of single model systems - plants, fungi, animals and bacteria - will finally emerge in the analysis of populations of plants and other organisms and their adaptation to the ecological niche. In parallel, this novel understanding of ecophysiology will translate into knowledge-based approaches in crop plant biotechnology and marker- or genome-assisted breeding approaches. In this review the foundations of green systems biology are described and applications in ecosystems research are presented. Knowledge exchange of ecosystems research and green biotechnology merging into green systems biology is anticipated based on the principles of natural variation, biodiversity and the genotype-phenotype environment relationship as the fundamental drivers of ecology and evolution. Copyright © 2011 Elsevier B.V. All rights reserved.

  12. Proteomics of ovarian cancer: functional insights and clinical applications

    DOE PAGES

    Elzek, Mohamed A.; Rodland, Karin D.

    2015-03-04

    In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicability of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification of aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. In conclusion, we propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs.

  13. Phylogenomics databases for facilitating functional genomics in rice.

    PubMed

    Jung, Ki-Hong; Cao, Peijian; Sharma, Rita; Jain, Rashmi; Ronald, Pamela C

    2015-12-01

    The completion of the whole genome sequence of rice (Oryza sativa) has significantly accelerated functional genomics studies. Prior to the release of the sequence, only a few genes were assigned a function each year. Since sequencing was completed in 2005, the rate has exponentially increased. As of 2014, 1,021 genes have been described and added to the collection at The Overview of functionally characterized Genes in Rice online database (OGRO). Despite this progress, that number is still very low compared with the total number of genes estimated in the rice genome. One limitation to progress is the presence of functional redundancy among members of the same rice gene family, which covers 51.6% of all non-transposable element-encoding genes. There remains a significant portion of rice genes that are not functionally redundant, as reflected in the recovery of loss-of-function mutants. To more accurately analyze functional redundancy in the rice genome, we have developed phylogenomics databases for six large gene families in rice, including those for glycosyltransferases, glycoside hydrolases, kinases, transcription factors, transporters, and cytochrome P450 monooxygenases. In this review, we introduce key features and applications of these databases. We expect that they will serve as a very useful guide in the post-genomics era of research.

  14. The speciation of the proteome

    PubMed Central

    Jungblut, Peter R; Holzhütter, Hermann G; Apweiler, Rolf; Schlüter, Hartmut

    2008-01-01

    Introduction: A paradoxical situation has developed in proteomics in recent years. On the one hand, it is basic knowledge that proteins are post-translationally modified and occur in different isoforms. On the other hand, the protein expression concept disregards post-translational modifications by connecting protein names directly with function. Discussion: Optimal proteome coverage is today reached by bottom-up liquid chromatography/mass spectrometry. However, quantification at the peptide level in shotgun or bottom-up approaches by liquid chromatography and mass spectrometry completely ignores that a given peptide may exist in an unmodified form and in several modified forms. The acceptance of the protein species concept is a basic prerequisite for meaningful quantitative analyses in functional proteomics. In discovery approaches, only top-down analyses, which separate the protein species before digestion, identification and quantification by two-dimensional gel electrophoresis or protein liquid chromatography, allow the correlation between changes of a biological situation and function. Conclusion: To obtain biologically relevant information, kinetics and systems biology have to be performed at the protein species level, which is the major challenge in proteomics today. PMID:18638390

  15. Structure, proteome and genome of Sinorhizobium meliloti phage ΦM5: A virus with LUZ24-like morphology and a highly mosaic genome.

    PubMed

    Johnson, Matthew C; Sena-Velez, Marta; Washburn, Brian K; Platt, Georgia N; Lu, Stephen; Brewer, Tess E; Lynn, Jason S; Stroupe, M Elizabeth; Jones, Kathryn M

    2017-12-01

    Bacteriophages of nitrogen-fixing rhizobial bacteria are revealing a wealth of novel structures, diverse enzyme combinations and genomic features. Here we report the cryo-EM structure of the phage capsid at 4.9-5.7Å-resolution, the phage particle proteome, and the genome of the Sinorhizobium meliloti-infecting Podovirus ΦM5. This is the first structure of a phage with a capsid and capsid-associated structural proteins related to those of the LUZ24-like viruses that infect Pseudomonas aeruginosa. Like many other Podoviruses, ΦM5 is a T=7 icosahedron with a smooth capsid and short, relatively featureless tail. Nonetheless, this group is phylogenetically quite distinct from Podoviruses of the well-characterized T7, P22, and epsilon 15 supergroups. Structurally, a distinct bridge of density that appears unique to ΦM5 reaches down the body of the coat protein to the extended loop that interacts with the next monomer in a hexamer, perhaps stabilizing the mature capsid. Further, the predicted tail fibers of ΦM5 are quite different from those of enteric bacteria phages, but have domains in common with other rhizophages. Genomically, ΦM5 is highly mosaic. The ΦM5 genome is 44,005bp with 357bp direct terminal repeats (DTRs) and 58 unique ORFs. Surprisingly, the capsid structural module, the tail module, the DNA-packaging terminase, the DNA replication module and the integrase each appear to be from a different lineage. One of the most unusual features of ΦM5 is its terminase whose large subunit is quite different from previously-described short-DTR-generating packaging machines and does not fit into any of the established phylogenetic groups. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  16. Functional proteomic and interactome analysis of proteins associated with beef tenderness in angus cattle

    USDA-ARS's Scientific Manuscript database

    Beef is a source of high quality protein for the human population, and beef tenderness has significant influence on beef palatability, consumer expectation and industry profitability. To further elucidate the factors affecting beef tenderness, functional proteomics and bioinformatics interactome ana...

  17. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences.

    PubMed

    Traini, Alessandra; Iorizzo, Massimo; Mann, Harpartap; Bradeen, James M; Carputo, Domenico; Frusciante, Luigi; Chiusano, Maria Luisa

    2013-01-01

    Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT) markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  18. Frequently Asked Questions about Genetic and Genomic Science

    MedlinePlus

    ... of the new genetic and genomic techniques and technologies? Proteomics The suffix "-ome" comes from the Greek ... pharmacogenomics is one of the large-scale "omic" technologies, it can examine the entirety of the genome, ...

  19. Proteomic analysis of the Theileria annulata schizont

    PubMed Central

    Witschi, M.; Xia, D.; Sanderson, S.; Baumgartner, M.; Wastling, J.M.; Dobbelaere, D.A.E.

    2013-01-01

    The apicomplexan parasite, Theileria annulata, is the causative agent of tropical theileriosis, a devastating lymphoproliferative disease of cattle. The schizont stage transforms bovine leukocytes and provides an intriguing model to study host/pathogen interactions. The genome of T. annulata has been sequenced and transcriptomic data are rapidly accumulating. In contrast, little is known about the proteome of the schizont, the pathogenic, transforming life cycle stage of the parasite. Using one-dimensional (1-D) gel LC-MS/MS, a proteomic analysis of purified T. annulata schizonts was carried out. In whole parasite lysates, 645 proteins were identified. Proteins with transmembrane domains (TMDs) were under-represented and no proteins with more than four TMDs could be detected. To tackle this problem, Triton X-114 treatment was applied, which facilitates the extraction of membrane proteins, followed by 1-D gel LC-MS/MS. This resulted in the identification of an additional 153 proteins. Half of those had one or more TMD and 30 proteins with more than four TMDs were identified. This demonstrates that Triton X-114 treatment can provide a valuable additional tool for the identification of new membrane proteins in proteomic studies. With two exceptions, all proteins involved in glycolysis and the citric acid cycle were identified. For at least 29% of identified proteins, the corresponding transcripts were not present in the existing expressed sequence tag databases. The proteomics data were integrated into the publicly accessible database resource at EuPathDB (www.eupathdb.org) so that mass spectrometry-based protein expression evidence for T. annulata can be queried alongside transcriptional and other genomics data available for these parasites. PMID:23178997

  20. The function and evolution of the Aspergillus genome

    PubMed Central

    Gibbons, John G.; Rokas, Antonis

    2012-01-01

    Species in the filamentous fungal genus Aspergillus display a wide diversity of lifestyles and are of great importance to humans. The decoding of genome sequences from a dozen species that vary widely in their degree of evolutionary affinity has galvanized studies of the function and evolution of the Aspergillus genome in clinical, industrial, and agricultural environments. Here, we synthesize recent key findings that shed light on the architecture of the Aspergillus genome, on the molecular foundations of the genus’ astounding dexterity and diversity in secondary metabolism, and on the genetic underpinnings of virulence in Aspergillus fumigatus, one of the most lethal fungal pathogens. Many of these insights dramatically expand our knowledge of fungal and microbial eukaryote genome evolution and function and argue that Aspergillus constitutes a superb model clade for the study of functional and comparative genomics. PMID:23084572

  1. Plant Abiotic Stress Proteomics: The Major Factors Determining Alterations in Cellular Proteome

    PubMed Central

    Kosová, Klára; Vítámvás, Pavel; Urban, Milan O.; Prášil, Ilja T.; Renaut, Jenny

    2018-01-01

    HIGHLIGHTS: Major environmental and genetic factors determining stress-related protein abundance are discussed. Major aspects of protein biological function including protein isoforms and PTMs, cellular localization and protein interactions are discussed. Functional diversity of protein isoforms and PTMs is discussed. Abiotic stresses have profound impacts on plant proteomes including alterations in protein relative abundance, cellular localization, post-transcriptional and post-translational modifications (PTMs), protein interactions with other protein partners, and, finally, protein biological functions. The main aim of the present review is to discuss the major factors determining stress-related protein accumulation and their final biological functions. The dynamics of the stress response, including stress acclimation to altered ambient conditions and recovery after the stress treatment, are discussed. The results of proteomic studies aimed at a comparison of stress response in plant genotypes differing in stress adaptability reveal constitutively enhanced levels of several stress-related proteins (protective proteins, chaperones, ROS scavenging- and detoxification-related enzymes) in the tolerant genotypes with respect to the susceptible ones. Tolerant genotypes can efficiently adjust energy metabolism to enhanced needs during stress acclimation. Stress tolerance vs. stress susceptibility are relative terms which can reflect different stress-coping strategies depending on the given stress treatment. The role of differential protein isoforms and PTMs with respect to their biological functions in different physiological constraints (cellular compartments and interacting partners) is discussed. The importance of protein functional studies following high-throughput proteome analyses is presented in a broader context of plant biology. In summary, the manuscript tries to provide an overview of the major factors which have to be considered when interpreting data from proteomic...

  2. Global response of Acidithiobacillus ferrooxidans ATCC 53993 to high concentrations of copper: A quantitative proteomics approach.

    PubMed

    Martínez-Bussenius, Cristóbal; Navarro, Claudio A; Orellana, Luis; Paradela, Alberto; Jerez, Carlos A

    2016-08-11

    Acidithiobacillus ferrooxidans is used in industrial bioleaching of minerals to extract valuable metals. A. ferrooxidans strain ATCC 53993 is much more resistant to copper than other strains of this microorganism and it has been proposed that genes present in an exclusive genomic island (GI) of this strain would contribute to its extreme copper tolerance. ICPL (isotope-coded protein labeling) quantitative proteomics was used to study in detail the response of this bacterium to copper. Strong overexpression of RND efflux systems and CusF copper chaperones, both present in the genome and the GI of strain ATCC 53993, was found. Also, changes in the levels of the respiratory system proteins such as AcoP and Rus copper binding proteins and several proteins with other predicted functions suggest that numerous metabolic changes are apparently involved in controlling the effects of the toxic metal on this acidophile. Using quantitative proteomics, we provide an overview of the adaptation mechanisms that biomining acidophiles use to withstand their harsh environment. The overexpression of several genes present in an exclusive genomic island strongly suggests the importance of the proteins coded in this DNA region in the high tolerance of A. ferrooxidans ATCC 53993 to metals. Copyright © 2016 Elsevier B.V. All rights reserved.

  3. Transcriptome-Assisted Label-Free Quantitative Proteomics Analysis Reveals Novel Insights into Piper nigrum—Phytophthora capsici Phytopathosystem

    PubMed Central

    Mahadevan, Chidambareswaren; Krishnan, Anu; Saraswathy, Gayathri G.; Surendran, Arun; Jaleel, Abdul; Sakuntala, Manjula

    2016-01-01

    Black pepper (Piper nigrum L.), a tropical spice crop of global acclaim, is susceptible to Phytophthora capsici, an oomycete pathogen which causes the highly destructive foot rot disease. A systematic understanding of this phytopathosystem has not been possible owing to lack of genome or proteome information. In this study, we describe an integrated transcriptome-assisted label-free quantitative proteomics pipeline to study the basal immune components of black pepper when challenged with P. capsici. We report a global identification of 532 novel leaf proteins from black pepper, of which 518 proteins were functionally annotated using the BLAST2GO tool. A label-free quantitation of the protein datasets revealed 194 proteins common to diseased and control protein datasets, of which 22 proteins showed significant up-regulation and 134 showed significant down-regulation. Ninety-three proteins were identified exclusively in P. capsici-infected leaf tissues and 245 were expressed only in mock-infected (control) samples. In-depth analysis of our data gives novel insights into the regulatory pathways of black pepper which are compromised during the infection. Differential down-regulation was observed in a number of critical pathways like carbon fixation in photosynthetic organisms, cyano-amino acid metabolism, fructose and mannose metabolism, glutathione metabolism, and phenylpropanoid biosynthesis. The proteomics results were validated with real-time qRT-PCR analysis. We were also able to identify the complete coding sequences for all the proteins, of which a few selected genes were cloned and sequence-characterized for further confirmation. Our study is the first report of a quantitative proteomics dataset in black pepper which provides convincing evidence on the effectiveness of a transcriptome-based label-free proteomics approach for elucidating the host response to biotic stress in a non-model spice crop like P. nigrum, for which genome information is unavailable. Our dataset...
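
    The protein counts quoted in this abstract can be cross-checked with simple arithmetic. The sketch below assumes, as the numbers suggest but the abstract does not state outright, that the 532 identified proteins split into three disjoint groups (common, infected-only, control-only); the function name and dictionary keys are hypothetical, and the "unchanged" figure is an inference rather than a reported result.

```python
def summarize_identifications(common, only_infected, only_control, up, down):
    """Combine per-group protein counts into totals (assumes the groups are disjoint)."""
    return {
        "total": common + only_infected + only_control,
        "in_infected": common + only_infected,
        "in_control": common + only_control,
        # Common proteins that were neither significantly up- nor down-regulated (inferred).
        "unchanged_common": common - up - down,
    }


# Counts taken from the abstract: 194 common, 93 infected-only, 245 control-only,
# with 22 up-regulated and 134 down-regulated among the common proteins.
summary = summarize_identifications(194, 93, 245, up=22, down=134)
print(summary)
```

    Under that assumption the three groups sum to the 532 proteins reported, and 38 of the 194 shared proteins would show no significant change.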

  4. Transcriptome-Assisted Label-Free Quantitative Proteomics Analysis Reveals Novel Insights into Piper nigrum-Phytophthora capsici Phytopathosystem.

    PubMed

    Mahadevan, Chidambareswaren; Krishnan, Anu; Saraswathy, Gayathri G; Surendran, Arun; Jaleel, Abdul; Sakuntala, Manjula

    2016-01-01

    Black pepper (Piper nigrum L.), a tropical spice crop of global acclaim, is susceptible to Phytophthora capsici, an oomycete pathogen which causes the highly destructive foot rot disease. A systematic understanding of this phytopathosystem has not been possible owing to lack of genome or proteome information. In this study, we describe an integrated transcriptome-assisted label-free quantitative proteomics pipeline to study the basal immune components of black pepper when challenged with P. capsici. We report a global identification of 532 novel leaf proteins from black pepper, of which 518 proteins were functionally annotated using the BLAST2GO tool. A label-free quantitation of the protein datasets revealed 194 proteins common to diseased and control protein datasets, of which 22 proteins showed significant up-regulation and 134 showed significant down-regulation. Ninety-three proteins were identified exclusively in P. capsici-infected leaf tissues and 245 were expressed only in mock-infected (control) samples. In-depth analysis of our data gives novel insights into the regulatory pathways of black pepper which are compromised during the infection. Differential down-regulation was observed in a number of critical pathways like carbon fixation in photosynthetic organisms, cyano-amino acid metabolism, fructose and mannose metabolism, glutathione metabolism, and phenylpropanoid biosynthesis. The proteomics results were validated with real-time qRT-PCR analysis. We were also able to identify the complete coding sequences for all the proteins, of which a few selected genes were cloned and sequence-characterized for further confirmation. Our study is the first report of a quantitative proteomics dataset in black pepper which provides convincing evidence on the effectiveness of a transcriptome-based label-free proteomics approach for elucidating the host response to biotic stress in a non-model spice crop like P. nigrum, for which genome information is unavailable. Our dataset...

  5. Integrative Analysis of Subcellular Quantitative Proteomics Studies Reveals Functional Cytoskeleton Membrane-Lipid Raft Interactions in Cancer.

    PubMed

    Shah, Anup D; Inder, Kerry L; Shah, Alok K; Cristino, Alexandre S; McKie, Arthur B; Gabra, Hani; Davis, Melissa J; Hill, Michelle M

    2016-10-07

    Lipid rafts are dynamic membrane microdomains that orchestrate molecular interactions and are implicated in cancer development. To understand the functions of lipid rafts in cancer, we performed an integrated analysis of quantitative lipid raft proteomics data sets modeling progression in breast cancer, melanoma, and renal cell carcinoma. This analysis revealed that cancer development is associated with increased membrane raft-cytoskeleton interactions, with ∼40% of elevated lipid raft proteins being cytoskeletal components. Previous studies suggest a potential functional role for the raft-cytoskeleton in the action of the putative tumor suppressors PTRF/Cavin-1 and Merlin. To extend the observation, we examined lipid raft proteome modulation by an unrelated tumor suppressor opioid binding protein cell-adhesion molecule (OPCML) in ovarian cancer SKOV3 cells. In agreement with the other model systems, quantitative proteomics revealed that 39% of OPCML-depleted lipid raft proteins are cytoskeletal components, with microfilaments and intermediate filaments specifically down-regulated. Furthermore, protein-protein interaction network and simulation analysis showed significantly higher interactions among cancer raft proteins compared with general human raft proteins. Collectively, these results suggest increased cytoskeleton-mediated stabilization of lipid raft domains with greater molecular interactions as a common, functional, and reversible feature of cancer cells.

  6. Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rutledge, Alexandra C.; Jones, Marcus B.; Chauhan, Sadhana

    2012-03-27

    Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) were used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent...

  7. Systems Proteomics for Translational Network Medicine

    PubMed Central

    Arrell, D. Kent; Terzic, Andre

    2012-01-01

    Universal principles underlying network science, and their ever-increasing applications in biomedicine, underscore the unprecedented capacity of systems biology based strategies to synthesize and resolve massive high throughput generated datasets. Enabling previously unattainable comprehension of biological complexity, systems approaches have accelerated progress in elucidating disease prediction, progression, and outcome. Applied to the spectrum of states spanning health and disease, network proteomics establishes a collation, integration, and prioritization algorithm to guide mapping and decoding of proteome landscapes from large-scale raw data. Providing unparalleled deconvolution of protein lists into global interactomes, integrative systems proteomics enables objective, multi-modal interpretation at molecular, pathway, and network scales, merging individual molecular components, their plurality of interactions, and functional contributions for systems comprehension. As such, network systems approaches are increasingly exploited for objective interpretation of cardiovascular proteomics studies. Here, we highlight network systems proteomic analysis pipelines for integration and biological interpretation through protein cartography, ontological categorization, pathway and functional enrichment and complex network analysis. PMID:22896016

  8. Comparative proteomic study on Brassica hexaploid and its parents provides new insights into the effects of polyploidization.

    PubMed

    Shen, Yanyue; Zhang, Yu; Zou, Jun; Meng, Jinling; Wang, Jianbo

    2015-01-01

    Polyploidy has played an important role in promoting plant evolution through genomic merging and doubling. Although genomic and transcriptomic changes have been observed in polyploids, the effects of polyploidization on proteomic divergence are poorly understood. In this study, we report a quantitative analysis of proteomic changes in leaves of Brassica hexaploid and its parents using isobaric tags for relative and absolute quantitation (iTRAQ) coupled with mass spectrometry. A total of 2044 reproducible proteins were quantified by at least two unique peptides. We detected 452 proteins differentially expressed between Brassica hexaploid and its parents, and 100 proteins were non-additively expressed in Brassica hexaploid, which suggested a trend of non-additive protein regulation following genomic merger and doubling. Functional categories of cellular component biogenesis, immune system process, and response to stimulus were significantly enriched in non-additive proteins, probably providing a driving force for variation and adaptation in allopolyploids. In particular, the majority of the total 452 differentially expressed proteins showed expression level dominance of one parental expression, and there was an expression level dominance bias toward the tetraploid progenitor. In addition, the percentage of differentially expressed proteins that matched previously reported differentially expressed genes was relatively low. This study aimed to get new insights into the effects of polyploidization on proteomic divergence. Using iTRAQ LC-MS/MS technology, we identified 452 differentially expressed proteins between allopolyploid and its parents which are involved in response to stimulus, multi-organism process, and immune system process, many more than previous studies using 2-DE coupled with mass spectrometry technology. Therefore, our manuscript represents the most comprehensive analysis of protein profiles in allopolyploid and its parents, which will lead to a better understanding of...

  9. Proteomics and Metabolomics: Two Emerging Areas for Legume Improvement

    PubMed Central

    Ramalingam, Abirami; Kudapa, Himabindu; Pazhamala, Lekha T.; Weckwerth, Wolfram; Varshney, Rajeev K.

    2015-01-01

    Crop legumes such as chickpea, common bean, cowpea, peanut, pigeonpea, soybean, etc. are important sources of nutrition and contribute to a significant amount of biological nitrogen fixation (>20 million tons of fixed nitrogen) in agriculture. However, the production of legumes is constrained due to abiotic and biotic stresses. It is therefore imperative to understand the molecular mechanisms of plant response to different stresses and identify key candidate genes regulating tolerance which can be deployed in breeding programs. The information obtained from transcriptomics has facilitated the identification of candidate genes for the given trait of interest and their use in crop breeding programs to improve stress tolerance. However, the mechanisms of stress tolerance are complex due to the influence of multiple genes and post-transcriptional regulation. Furthermore, stress conditions greatly affect gene expression, which in turn causes modifications in the composition of plant proteomes and metabolomes. Therefore, functional genomics involving various proteomics and metabolomics approaches has been obligatory for understanding plant stress tolerance. These approaches have also been found useful to unravel different pathways related to plant and seed development as well as symbiosis. Proteome and metabolome profiling using high-throughput based systems has been extensively applied in the model legume species, Medicago truncatula and Lotus japonicus, as well as in the model crop legume, soybean, to examine stress signaling pathways, cellular and developmental processes and nodule symbiosis. Moreover, the availability of protein reference maps as well as proteomics and metabolomics databases greatly supports research and understanding of various biological processes in legumes. Protein-protein interaction techniques, particularly the yeast two-hybrid system, have been advantageous for studying symbiosis and stress signaling in legumes. In this review, several...

  10. Anthelmintic metabolism in parasitic helminths: proteomic insights.

    PubMed

    Brophy, Peter M; MacKintosh, Neil; Morphew, Russell M

    2012-08-01

    Anthelmintics are the cornerstone of parasitic helminth control. Surprisingly, understanding of the biochemical pathways used by parasitic helminths to detoxify anthelmintics is fragmented, despite the increasing global threat of anthelmintic resistance within the ruminant and equine industries. Reductionist biochemistry has likely over-estimated the enzymatic role of glutathione transferases in anthelmintic metabolism and neglected the potential role of the cytochrome P-450 superfamily (CYPs). Proteomic technologies offer the opportunity to support genomics, reverse genetics and pharmacokinetics, and provide an integrated insight into both the cellular mechanisms underpinning response to anthelmintics and also the identification of biomarker panels for monitoring the development of anthelmintic resistance. To date, there have been limited attempts to include proteomics in anthelmintic metabolism studies. Optimisations of membrane, post-translational modification and interaction proteomic technologies in helminths are especially needed to study Phase I CYPs and Phase III ABC transporter pumps for anthelmintics and their metabolites.

  11. CPTAC Announces New PTRCs, PCCs, and PGDACs | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    This week, the Office of Cancer Clinical Proteomics Research (OCCPR) at the National Cancer Institute (NCI), part of the National Institutes of Health, announced its aim to further the convergence of proteomics with genomics – “proteogenomics,” to better understand the molecular basis of cancer and accelerate research in these areas by disseminating research resources to the scientific community.

  12. Gain-of-function mutagenesis approaches in rice for functional genomics and improvement of crop productivity.

    PubMed

    Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Dutta, Mouboni; Kirti, P B

    2017-07-01

    The epitome of any genome research is to identify all the existing genes in a genome and investigate their roles. Various techniques have been applied to unveil the functions either by silencing or over-expressing the genes by targeted expression or random mutagenesis. Rice is the most appropriate model crop for generating a mutant resource for functional genomic studies because of the availability of high-quality genome sequence and relatively smaller genome size. Rice has syntenic relationships with members of other cereals. Hence, characterization of functionally unknown genes in rice will possibly provide key genetic insights and can lead to comparative genomics involving other cereals. The current review attempts to discuss the available gain-of-function mutagenesis techniques for functional genomics, emphasizing the contemporary approach, activation tagging and alterations to this method for the enhancement of yield and productivity of rice. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  13. In-depth analysis of the thylakoid membrane proteome of Arabidopsis thaliana chloroplasts: new proteins, new functions, and a plastid proteome database.

    PubMed

    Friso, Giulia; Giacomelli, Lisa; Ytterberg, A Jimmy; Peltier, Jean-Benoit; Rudella, Andrea; Sun, Qi; Wijk, Klaas J van

    2004-02-01

    An extensive analysis of the Arabidopsis thaliana peripheral and integral thylakoid membrane proteome was performed by sequential extractions with salt, detergent, and organic solvents, followed by multidimensional protein separation steps (reverse-phase HPLC and one- and two-dimensional electrophoresis gels), different enzymatic and nonenzymatic protein cleavage techniques, mass spectrometry, and bioinformatics. Altogether, 154 proteins were identified, of which 76 (49%) were alpha-helical integral membrane proteins. Twenty-seven new proteins without known function but with predicted chloroplast transit peptides were identified, of which 17 (63%) are integral membrane proteins. These new proteins, likely important in thylakoid biogenesis, include two rubredoxins, a potential metallochaperone, and a new DnaJ-like protein. The data were integrated with our analysis of the lumenal-enriched proteome. We identified 83 out of 100 known proteins of the thylakoid localized photosynthetic apparatus, including several new paralogues and some 20 proteins involved in protein insertion, assembly, folding, or proteolysis. An additional 16 proteins are involved in translation, demonstrating that the thylakoid membrane surface is an important site for protein synthesis. The high coverage of the photosynthetic apparatus and the identification of known hydrophobic proteins with low expression levels, such as cpSecE, Ohp1, and Ohp2, indicate an excellent dynamic resolution of the analysis. The sequential extraction process proved very helpful to validate transmembrane prediction. Our data also were cross-correlated to chloroplast subproteome analyses by other laboratories. All data are deposited in a new curated plastid proteome database (PPDB) with multiple search functions (http://cbsusrv01.tc.cornell.edu/users/ppdb/). This PPDB will serve as an expandable resource for the plant community.

  14. Genomic and Proteomic Biomarkers for Cancer: A Multitude of Opportunities

    PubMed Central

    Tainsky, Michael A.

    2009-01-01

    Biomarkers are molecular indicators of a biological status, and as biochemical species can be assayed to evaluate the presence of cancer and therapeutic interventions. Through a variety of mechanisms cancer cells provide the biomarker material for their own detection. Biomarkers may be detectable in the blood, other body fluids, or tissues. The expectation is that the level of an informative biomarker is related to the specific type of disease present in the body. Biomarkers have potential both as diagnostic indicators and monitors of the effectiveness of clinical interventions. Biomarkers are also able to stratify cancer patients to the most appropriate treatment. Effective biomarkers for the early detection of cancer should provide a patient with a better outcome which in turn will translate into more efficient delivery of healthcare. Technologies for the early detection of cancer have resulted in reductions in disease-associated mortalities from cancers that are otherwise deadly if allowed to progress. Such screening technologies have proven that early detection will decrease the morbidity and mortality from cancer. An emerging theme in biomarker research is the expectation that panels of biomarker analytes rather than single markers will be needed to have sufficient sensitivity and specificity for the presymptomatic detection of cancer. Biomarkers may provide prognostic information of disease enabling interventions using targeted therapeutic agents as well as course-corrections in cancer treatment. Novel genomic, proteomic and metabolomic technologies are being used to discover and validate tumor biomarkers individually and in panels. PMID:19406210

  15. Identification of the pI 4.6 extensin peroxidase from Lycopersicon esculentum using proteomics and reverse-genomics.

    PubMed

    Dong, Wen; Kieliszewski, Marcia; Held, Michael A

    2015-04-01

    The regulation of plant cell growth and early defense response involves the insolubilization of hydroxyproline-rich glycoproteins (HRGPs), such as extensin, in the primary cell wall. In tomato (Lycopersicon esculentum), insolubilization occurs by the formation of tyrosyl-crosslinks catalyzed specifically by the pI 4.6 extensin peroxidase (EP). To date, neither the gene encoding EP nor the protein itself has been identified. Here, we have identified tomato EP candidates using both proteomic and bioinformatic approaches. Bioinformatic screening of the tomato genome yielded eight EP candidates, which contained a putative signal sequence and a predicted pI near 4.6. Biochemical fractionation of tomato culture media followed by proteomic detection further refined our list of EP candidates to three, with the lead candidate designated (CG5). To test for EP crosslinking activity, we cloned into a bacterial expression vector the CG5 open-reading frame from tomato cDNA. The CG5 was expressed in Escherichia coli, fractionated from inclusion bodies, and folded in vitro. The peroxidase activity of CG5 was assayed and quantified by ABTS (2,2'-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)) assay. Subsequent extensin crosslinking assays showed that CG5 can covalently crosslink authentic tomato P1 extensin and P3-type extensin analogs in vitro supporting our hypothesis that CG5 encodes a tomato EP. Copyright © 2014 Elsevier Ltd. All rights reserved.
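    The screening step in the abstract above filters candidates by predicted pI. As a minimal sketch of how such a filter could work (not the authors' tool, which the abstract does not name), the code below estimates pI as the pH at which the Henderson-Hasselbalch net charge of a sequence crosses zero, found by bisection. The pKa values are an assumed approximate set (published scales differ slightly), the function names are hypothetical, and the 0.5 pH window is an arbitrary illustration of "near 4.6".

```python
from __future__ import annotations

# Assumed, approximate side-chain and terminal pKa values (illustrative only).
POSITIVE_PKA = {"K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}
N_TERM_PKA = 9.0
C_TERM_PKA = 2.0


def net_charge(seq: str, ph: float) -> float:
    """Net charge of a protein sequence at a given pH (Henderson-Hasselbalch)."""
    # Basic groups carry +1 when protonated: fraction = 1 / (1 + 10^(pH - pKa)).
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERM_PKA))
    # Acidic groups carry -1 when deprotonated: fraction = 1 / (1 + 10^(pKa - pH)).
    charge -= 1.0 / (1.0 + 10 ** (C_TERM_PKA - ph))
    for aa in seq:
        if aa in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[aa]))
        elif aa in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE_PKA[aa] - ph))
    return charge


def estimate_pi(seq: str, tol: float = 1e-4) -> float:
    """Bisection on pH in [0, 14]; net charge decreases monotonically with pH."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if net_charge(seq, mid) > 0:
            lo = mid  # still positively charged, so the pI lies at higher pH
        else:
            hi = mid
    return (lo + hi) / 2


def screen_by_pi(candidates: dict[str, str], target: float = 4.6,
                 window: float = 0.5) -> list[str]:
    """Names of candidate sequences whose estimated pI is within the window."""
    return [name for name, seq in candidates.items()
            if abs(estimate_pi(seq) - target) <= window]
```

    A real screen would also require a predicted signal sequence, as the abstract states; that step is omitted here because it depends on a dedicated predictor.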

  16. Identification of the pI 4.6 extensin peroxidase from Lycopersicon esculentum using proteomics and reverse-genomics

    PubMed Central

    Dong, Wen; Kieliszewski, Marcia; Held, Michael A.

    2014-01-01

    The regulation of plant cell growth and early defense response involves the insolubilization of hydroxyproline-rich glycoproteins (HRGPs), such as extensin, in the primary cell wall. In tomato (Lycopersicon esculentum), insolubilization occurs by the formation of tyrosyl-crosslinks catalyzed specifically by the pI 4.6 extensin peroxidase (EP). To date, neither the gene encoding EP nor the protein itself has been identified. Here, we’ve identified tomato EP candidates using both proteomic and bioinformatic approaches. Bioinformatic screening of the tomato genome yielded eight EP candidates, which contained a putative signal sequence and a predicted pI near 4.6. Biochemical fractionation of tomato culture media followed by proteomic detection further refined our list of EP candidates to three, with the lead candidate designated (CG5). To test for EP crosslinking activity, we cloned into a bacterial expression vector the CG5 open-reading frame from tomato cDNA. The CG5 was expressed in E. coli, fractionated from inclusion bodies, and folded in vitro. The peroxidase activity of CG5 was assayed and quantified by ABTS (2,2′-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)) assay. Subsequent extensin crosslinking assays showed that CG5 can covalently crosslink authentic tomato P1 extensin and P3-type extensin analogs in vitro supporting our hypothesis that CG5 encodes a tomato EP. PMID:25446231

  17. Identifying the missing proteins in human proteome by biological language model.

    PubMed

    Dong, Qiwen; Wang, Kai; Liu, Xuan

    2016-12-23

    With the rapid development of high-throughput sequencing technology, proteomics research has become a trendy field in the post-genomics era. It is necessary to identify all the native-encoding protein sequences for further function and pathway analysis. Toward that end, the Human Proteome Organization launched the Human Proteome Project in 2011. However, many proteins are hard to detect by experimental methods, which has become one of the bottlenecks in the Human Proteome Project. In consideration of the complicatedness of detecting these missing proteins by using wet-experiment approaches, here we use a bioinformatics method to pre-filter the missing proteins. Since there is an analogy between biological sequences and natural language, n-gram models from the Natural Language Processing field have been used to filter the missing proteins. The dataset used in this study contains 616 missing proteins from the "uncertain" category of the neXtProt database. There are 102 proteins deduced by the n-gram model, which have a high probability of being native human proteins. We perform a detailed analysis of the predicted structure and function of these missing proteins and also compare the high-probability proteins with other mass spectrum datasets. The evaluation shows that the results reported here are in good agreement with those obtained by other well-established databases. The analysis shows that 102 proteins may be native gene-coding proteins and some of the missing proteins are membrane or natively disordered proteins which are hard to detect by experimental methods.
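    The abstract above does not give the order of the n-gram model, its smoothing, or its decision threshold. Purely as an illustration of the general idea, the sketch below trains an add-one-smoothed trigram model on known protein sequences and scores a candidate by its mean log-probability per trigram. The function names, the choice n = 3 and the smoothing scheme are all assumptions.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (the model vocabulary)


def train_ngram(seqs, n=3):
    """Count n-grams and their (n-1)-residue contexts in the training sequences."""
    ngrams, contexts = Counter(), Counter()
    for seq in seqs:
        for i in range(len(seq) - n + 1):
            gram = seq[i:i + n]
            ngrams[gram] += 1
            contexts[gram[:-1]] += 1
    return ngrams, contexts


def avg_log_prob(seq, ngrams, contexts, n=3):
    """Mean add-one-smoothed log P(residue | previous n-1 residues) over the sequence."""
    vocab = len(AMINO_ACIDS)
    total, count = 0.0, 0
    for i in range(len(seq) - n + 1):
        gram = seq[i:i + n]
        # Add-one (Laplace) smoothing keeps unseen n-grams from scoring -infinity.
        p = (ngrams[gram] + 1) / (contexts[gram[:-1]] + vocab)
        total += math.log(p)
        count += 1
    return total / count if count else float("-inf")
```

    In use, candidates whose score exceeds a threshold calibrated on held-out known proteins would be kept as likely native proteins; the threshold itself would have to be chosen empirically.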

  18. A comprehensive proteomics and genomics analysis reveals novel transmembrane proteins in human platelets and mouse megakaryocytes including G6b-B, a novel ITIM protein

    PubMed Central

    Senis, Yotis A.; Tomlinson, Michael G.; García, Ángel; Dumon, Stephanie; Heath, Victoria L.; Herbert, John; Cobbold, Stephen P.; Spalton, Jennifer C.; Ayman, Sinem; Antrobus, Robin; Zitzmann, Nicole; Bicknell, Roy; Frampton, Jon; Authi, Kalwant; Martin, Ashley; Wakelam, Michael J.O.; Watson, Stephen P.

    2007-01-01

    Summary The platelet surface is poorly characterized due to the low abundance of many membrane proteins and the lack of specialist tools for their investigation. In this study we have identified novel human platelet and mouse megakaryocyte membrane proteins using specialist proteomic and genomic approaches. Three separate methods were used to enrich platelet surface proteins prior to identification by liquid chromatography and tandem mass spectrometry: lectin affinity chromatography; biotin/NeutrAvidin affinity chromatography; and free flow electrophoresis. Many known, abundant platelet surface transmembrane proteins and several novel proteins were identified using each receptor enrichment strategy. In total, two or more unique peptides were identified for 46, 68 and 22 surface membrane, intracellular membrane and membrane proteins of unknown sub-cellular localization, respectively. The majority of these were single transmembrane proteins. To complement the proteomic studies, we analysed the transcriptome of a highly purified preparation of mature primary mouse megakaryocytes using serial analysis of gene expression in view of the increasing importance of mutant mouse models in establishing protein function in platelets. This approach identified all of the major classes of platelet transmembrane receptors, including multi-transmembrane proteins. Strikingly, 17 of the 25 most megakaryocyte-specific genes (relative to 30 other SAGE libraries) were transmembrane proteins, illustrating the unique nature of the megakaryocyte/platelet surface. The list of novel plasma membrane proteins identified using proteomics includes the immunoglobulin superfamily member G6b, which undergoes extensive alternate splicing. Specific antibodies were used to demonstrate expression of the G6b-B isoform, which contains an immunoreceptor tyrosine-based inhibition motif. G6b-B undergoes tyrosine phosphorylation and association with the SH2-containing phosphatase, SHP-1, in stimulated

  19. Determining protein function and interaction from genome analysis

    DOEpatents

    Eisenberg, David; Marcotte, Edward M.; Thompson, Michael J.; Pellegrini, Matteo; Yeates, Todd O.

    2004-08-03

    A computational method, system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B'. Another method compares the genomic sequences of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
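    The two methods summarized above can be sketched in a few lines. This is a toy illustration under stated assumptions, not the patented implementation: homology search results are assumed to be precomputed and supplied as plain dictionaries, and all function names and input layouts are hypothetical.

```python
from itertools import combinations


def phylogenetic_profiles(presence):
    """presence: {protein: set of genomes containing a homolog}.
    Returns {protein: tuple of 0/1 flags, one per genome in sorted order}."""
    genomes = sorted(set().union(*presence.values()))
    return {p: tuple(int(g in gs) for g in genomes) for p, gs in presence.items()}


def linked_by_profile(profiles, max_mismatches=0):
    """Pairs of proteins whose profiles differ in at most max_mismatches genomes."""
    links = []
    for a, b in combinations(sorted(profiles), 2):
        mismatches = sum(x != y for x, y in zip(profiles[a], profiles[b]))
        if mismatches <= max_mismatches:
            links.append((a, b))
    return links


def rosetta_stone_links(hits):
    """hits: {protein: {fused_protein: (start, end)}}, the region of a protein in
    another genome that each query matches. Two queries are linked when they match
    non-overlapping regions of the same fused ("Rosetta Stone") protein."""
    links = []
    for a, b in combinations(sorted(hits), 2):
        for fused in sorted(hits[a].keys() & hits[b].keys()):
            (s1, e1), (s2, e2) = hits[a][fused], hits[b][fused]
            if e1 < s2 or e2 < s1:  # the matched regions do not overlap
                links.append((a, b, fused))
    return links
```

    Profiles that are present in every genome (or in none) carry no information and would link everything to everything, so a practical version would filter them out before comparing.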

  20. Announcing the Launch of CPTAC’s Proteogenomics DREAM Challenge | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    This week, we are excited to announce the launch of the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) Proteogenomics Computational DREAM Challenge.  The aim of this Challenge is to encourage the generation of computational methods for extracting information from the cancer proteome and for linking those data to genomic and transcriptomic information.  The specific goals are to predict proteomic and phosphoproteomic data from other multiple data types including transcriptomics and genetics.

  1. Proteome Comparisons between Hemolymph of Two Honeybee Strains (Apis mellifera ligustica) Reveal Divergent Molecular Basis in Driving Hemolymph Function and High Royal Jelly Secretion.

    PubMed

    Ararso, Zewdu; Ma, Chuan; Qi, Yuping; Feng, Mao; Han, Bin; Hu, Han; Meng, Lifeng; Li, Jianke

    2018-01-05

    Hemolymph is vital for the immunity of honeybees and offers a way to investigate their physiological status. To gain novel insight into the functionality and molecular details of the hemolymph in driving increased Royal Jelly (RJ) production, we characterized and compared hemolymph proteomes across the larval and adult ages of Italian bees (ITbs) and Royal Jelly bees (RJbs), a stock selected from ITbs for increasing RJ output. Unprecedented in-depth proteome was attained with the identification of 3394 hemolymph proteins in both bee lines. The changes in proteome support the general function of hemolymph to drive development and immunity across different ages. However, age-specific proteome settings have adapted to prime the distinct physiology for larvae and adult bees. In larvae, the proteome is thought to drive temporal immunity, rapid organogenesis, and reorganization of larval structures. In adults, the proteome plays key roles in prompting tissue development and immune defense in newly emerged bees, in gland maturity in nurse bees, and in carbohydrate energy production in forager bees. Between larval and adult samples of the same age, RJbs and ITbs have tailored distinct hemolymph proteome programs to drive their physiology. In particular, in day 4 larvae and nurse bees, a large number of highly abundant proteins are enriched in protein synthesis and energy metabolism in RJbs. This implies that they have adapted their proteome to initiate different developmental trajectories and high RJ secretion in response to selection for enhanced RJ production. Our hitherto unexplored in-depth proteome coverage provides novel insight into molecular details that drive hemolymph function and high RJ production by RJbs.

  2. Plant fluid proteomics: Delving into the xylem sap, phloem sap and apoplastic fluid proteomes.

    PubMed

    Rodríguez-Celma, Jorge; Ceballos-Laita, Laura; Grusak, Michael A; Abadía, Javier; López-Millán, Ana-Flor

    2016-08-01

    The phloem sap, xylem sap and apoplastic fluid play key roles in long and short distance transport of signals and nutrients, and act as a barrier against local and systemic pathogen infection. Among other components, these plant fluids contain proteins which are likely to be important players in their functionalities. However, detailed information about their proteomes is only starting to arise due to the difficulties inherent to the collection methods. This review compiles the proteomic information available to date in these three plant fluids, and compares the proteomes obtained in different plant species in order to shed light into conserved functions in each plant fluid. Inter-species comparisons indicate that all these fluids contain the protein machinery for self-maintenance and defense, including proteins related to cell wall metabolism, pathogen defense, proteolysis, and redox response. These analyses also revealed that proteins may play more relevant roles in signaling in the phloem sap and apoplastic fluid than in the xylem sap. A comparison of the proteomes of the three fluids indicates that although functional categories are somewhat similar, proteins involved are likely to be fluid-specific, except for a small group of proteins present in the three fluids, which may have a universal role, especially in cell wall maintenance and defense. This article is part of a Special Issue entitled: Plant Proteomics--a bridge between fundamental processes and crop production, edited by Dr. Hans-Peter Mock. Copyright © 2016 Elsevier B.V. All rights reserved.

  3. Proteomic analysis of hyperadhesive Candida glabrata clinical isolates reveals a core wall proteome and differential incorporation of adhesins.

    PubMed

    Gómez-Molero, Emilia; de Boer, Albert D; Dekker, Henk L; Moreno-Martínez, Ana; Kraneveld, Eef A; Ichsan; Chauhan, Neeraj; Weig, Michael; de Soet, Johannes J; de Koster, Chris G; Bader, Oliver; de Groot, Piet W J

    2015-12-01

    Attachment to human host tissues or abiotic medical devices is a key step in the development of infections by Candida glabrata. The genome of this pathogenic yeast codes for a large number of adhesins, but proteomic work using reference strains has shown incorporation of only few adhesins in the cell wall. By making inventories of the wall proteomes of hyperadhesive clinical isolates and reference strain CBS138 using mass spectrometry, we describe the cell wall proteome of C. glabrata and tested the hypothesis that hyperadhesive isolates display differential incorporation of adhesins. Two clinical strains (PEU382 and PEU427) were selected, which both were hyperadhesive to polystyrene and showed high surface hydrophobicity. Cell wall proteome analysis under biofilm-forming conditions identified a core proteome of about 20 proteins present in all C. glabrata strains. In addition, 12 adhesin-like wall proteins were identified in the hyperadherent strains, including six novel adhesins (Awp8-13) of which only Awp12 was also present in CBS138. We conclude that the hyperadhesive capacity of these two clinical C. glabrata isolates is correlated with increased and differential incorporation of cell wall adhesins. Future studies should elucidate the role of the identified proteins in the establishment of C. glabrata infections. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  4. Interactions of photosynthesis with genome size and function.

    PubMed

    Raven, John A; Beardall, John; Larkum, Anthony W D; Sánchez-Baracaldo, Patricia

    2013-07-19

    Photolithotrophs are divided between those that use water as their electron donor (Cyanobacteria and the photosynthetic eukaryotes) and those that use a different electron donor (the anoxygenic photolithotrophs, all of them Bacteria). Photolithotrophs with the most reduced genomes have more genes than do the corresponding chemoorganotrophs, and the fastest-growing photolithotrophs have significantly lower specific growth rates than the fastest-growing chemoorganotrophs. Slower growth results from diversion of resources into the photosynthetic apparatus, which accounts for about half of the cell protein. There are inherent dangers in (especially oxygenic) photosynthesis, including the formation of reactive oxygen species (ROS) and blue light sensitivity of the water-splitting apparatus. The extent to which photolithotrophs incur greater DNA damage and repair, and faster protein turnover with increased rRNA requirement, needs further investigation. A related source of environmental damage is ultraviolet B (UVB) radiation (280-320 nm), whose flux at the Earth's surface decreased as oxygen (and ozone) increased in the atmosphere. This oxygenation led to the requirements of defence against ROS, and decreasing availability to organisms of combined (non-dinitrogen) nitrogen and ferrous iron, and (indirectly) phosphorus, in the oxygenated biosphere. Differential codon usage in the genome and, especially, the proteome can lead to economies in the use of potentially growth-limiting elements.

  5. Parasites, proteomes and systems: has Descartes' clock run out of time?

    PubMed

    Wastling, J M; Armstrong, S D; Krishna, R; Xia, D

    2012-08-01

    Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types.

  6. Parasites, proteomes and systems: has Descartes’ clock run out of time?

    PubMed Central

    WASTLING, J. M.; ARMSTRONG, S. D.; KRISHNA, R.; XIA, D.

    2012-01-01

    SUMMARY Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types. PMID:22828391

  7. Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function

    PubMed Central

    Grobei, Monica A.; Qeli, Ermir; Brunner, Erich; Rehrauer, Hubert; Zhang, Runxuan; Roschitzki, Bernd; Basler, Konrad; Ahrens, Christian H.; Grossniklaus, Ueli

    2009-01-01

    Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch requires relevant proteins to be stored in the mature pollen, where they have to retain functionality in a desiccated environment. Using a shotgun proteomics approach, we unambiguously identified ∼3500 proteins in Arabidopsis pollen, including 537 proteins that were not identified in genetic or transcriptomic studies. To generate this comprehensive reference data set, which extends the previously reported pollen proteome by a factor of 13, we developed a novel deterministic peptide classification scheme for protein inference. This generally applicable approach considers the gene model–protein sequence–protein accession relationships. It allowed us to classify and eliminate ambiguities inherently associated with any shotgun proteomics data set, to report a conservative list of protein identifications, and to seamlessly integrate data from previous transcriptomics studies. Manual validation of proteins unambiguously identified by a single, information-rich peptide enabled us to significantly reduce the false discovery rate, while keeping valuable identifications of shorter and lower abundant proteins. Bioinformatic analyses revealed a higher stability of pollen proteins compared to those of other tissues and implied a protein family of previously unknown function in vesicle trafficking. Interestingly, the pollen proteome is most similar to that of seeds, indicating physiological similarities between these developmentally distinct tissues. PMID:19546170
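    The abstract above describes a deterministic peptide classification that uses the relationships between gene models, protein sequences and protein accessions, but it does not list the classes. The sketch below shows one simple way such a scheme could look; the three class labels and both function names are hypothetical and are not the authors' published categories.

```python
def classify_peptide(peptide, proteins, gene_of):
    """proteins: {accession: sequence}; gene_of: {accession: gene model id}.
    Returns (label, matching accessions) based on what the peptide can imply."""
    matches = [acc for acc, seq in proteins.items() if peptide in seq]
    genes = {gene_of[acc] for acc in matches}
    if not matches:
        return "no_match", matches
    if len(matches) == 1:
        return "protein_specific", matches  # pins down a single accession
    if len(genes) == 1:
        return "gene_specific", matches     # shared by isoforms of one gene model
    return "ambiguous", matches             # shared across different gene models


def conservative_identifications(peptides, proteins, gene_of):
    """Report only the accessions or gene models supported by unambiguous peptides."""
    identified = set()
    for pep in peptides:
        label, matches = classify_peptide(pep, proteins, gene_of)
        if label == "protein_specific":
            identified.add(matches[0])
        elif label == "gene_specific":
            identified.add(gene_of[matches[0]])
    return identified
```

    Because every peptide falls into exactly one class by rule, the same input always yields the same protein list, which is the sense in which such a scheme is deterministic rather than probabilistic.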

  8. New Funding Opportunity - Illuminating the Druggable Genome | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The National Institutes of Health Common Fund announces two new Funding Opportunity Announcements with a focus on the Illuminating the Druggable Genome (IDG). These funding opportunities are designed to foster the development of technologies and information management to facilitate the unveiling of the functions of the poorly characterized and/or un-annotated members in four protein classes of the Druggable Genome. The IDG project is predicated on the need to fully explore the underlying biology and role in disease of genes linked to already drugged genes within the Druggable Genome.

  9. Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM.

    PubMed

    Tuncbag, Nurcan; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem

    2011-08-11

    Prediction of protein-protein interactions at the structural level on the proteome scale is important because it allows prediction of protein function, helps drug discovery and takes steps toward genome-wide structural systems biology. We provide a protocol (termed PRISM, protein interactions by structural matching) for large-scale prediction of protein-protein interactions and assembly of protein complex structures. The method consists of two components: rigid-body structural comparisons of target proteins to known template protein-protein interfaces and flexible refinement using a docking energy function. The PRISM rationale follows our observation that globally different protein structures can interact via similar architectural motifs. PRISM predicts binding residues by using structural similarity and evolutionary conservation of putative binding residue 'hot spots'. Ultimately, PRISM could help to construct cellular pathways and functional, proteome-scale annotation. PRISM is implemented in Python and runs in a UNIX environment. The program accepts Protein Data Bank-formatted protein structures and is available at http://prism.ccbb.ku.edu.tr/prism_protocol/.

  10. Genetic resources offer efficient tools for rice functional genomics research.

    PubMed

    Lo, Shuen-Fang; Fan, Ming-Jen; Hsing, Yue-Ie; Chen, Liang-Jwu; Chen, Shu; Wen, Ien-Chie; Liu, Yi-Lun; Chen, Ku-Ting; Jiang, Mirng-Jier; Lin, Ming-Kuang; Rao, Meng-Yen; Yu, Lin-Chih; Ho, Tuan-Hua David; Yu, Su-May

    2016-05-01

    Rice is an important crop and major model plant for monocot functional genomics studies. With the establishment of various genetic resources for rice genomics, the next challenge is to systematically assign functions to predicted genes in the rice genome. Compared with the robustness of genome sequencing and bioinformatics techniques, progress in understanding the function of rice genes has lagged, hampering the utilization of rice genes for cereal crop improvement. The use of transfer DNA (T-DNA) insertional mutagenesis offers the advantage of uniform distribution throughout the rice genome, but preferentially in gene-rich regions, resulting in direct gene knockout or activation of genes within 20-30 kb up- and downstream of the T-DNA insertion site and high gene tagging efficiency. Here, we summarize the recent progress in functional genomics using the T-DNA-tagged rice mutant population. We also discuss important features of T-DNA activation- and knockout-tagging and promoter-trapping of the rice genome in relation to mutant and candidate gene characterizations and how to more efficiently utilize rice mutant populations and datasets for high-throughput functional genomics and phenomics studies by forward and reverse genetics approaches. These studies may facilitate the translation of rice functional genomics research to improvements of rice and other cereal crops. © 2015 John Wiley & Sons Ltd.

  11. Hydroponics on a chip: analysis of the Fe deficient Arabidopsis thylakoid membrane proteome.

    PubMed

    Laganowsky, Arthur; Gómez, Stephen M; Whitelegge, Julian P; Nishio, John N

    2009-04-13

    The model plant Arabidopsis thaliana was used to evaluate the thylakoid membrane proteome under Fe-deficient conditions. Plants were cultivated using a novel hydroponic system, called "hydroponics on a chip", which yields highly reproducible plant tissue samples for physiological analyses, and can be easily used for in vivo stable isotope labeling. The thylakoid membrane proteome, from intact chloroplasts isolated from Fe-sufficient and Fe-deficient plants grown with hydroponics on a chip, was analyzed using liquid chromatography coupled to mass spectrometry. Intact masses of thylakoid membrane proteins were measured, many for the first time, and several proteins were identified with post-translational modifications that were altered by Fe deficiency; for example, the doubly phosphorylated form of the photosystem II oxygen evolving complex, PSBH, increased under Fe-deficiency. Increased levels of photosystem II protein subunit PSBS were detected in the Fe-deficient samples. Antioxidant enzymes, including ascorbate peroxidase and peroxiredoxin Q, were only detected in the Fe-deficient samples. We present the first biochemical evidence that the two major LHC IIb proteins (LHCB1 and LHCB2) may have significantly different functions in the thylakoid membrane. The study illustrates the utility of intact mass proteomics as an indispensable tool for functional genomics. "Hydroponics on a chip" provides the ability to grow A. thaliana under defined conditions that will be useful for systems biology.

  12. Combining Capillary Electrophoresis with Mass Spectrometry for Applications in Proteomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Simpson, David C.; Smith, Richard D.

    2005-04-01

    Throughout the field of global proteomics, ranging from simple organism studies to human medical applications, the high sample complexity creates demands for improved separations and analysis techniques. Furthermore, with increased organism complexity, the correlation between proteome and genome becomes less certain due to extensive mRNA processing prior to translation. In this way, the same DNA sequence can potentially code for regions in a number of distinct proteins; quantitative differences in expression (or abundance) between these often-related species are of significant interest. Well-established proteomics techniques, which use genomic information to identify peptides that originate from protease digestion, often cannot easily distinguish between such gene products; intact protein-level analyses are required to complete the picture, particularly for identifying post-translational modifications. While chromatographic techniques are currently better suited to peptide analysis, capillary electrophoresis (CE) in combination with mass spectrometry (MS) may become important for intact protein analysis. This review focuses on CE/MS instrumentation and techniques showing promise for such applications, highlighting those with greatest potential. Reference will also be made to developments relevant to peptide-level analyses for use in time- or sample-limited situations.

  13. Retroelements and their impact on genome evolution and functioning.

    PubMed

    Gogvadze, Elena; Buzdin, Anton

    2009-12-01

    Retroelements comprise a considerable fraction of eukaryotic genomes. Since their initial discovery by Barbara McClintock in maize DNA, retroelements have been found in genomes of almost all organisms. First considered as a "junk DNA" or genomic parasites, they were shown to influence genome functioning and to promote genetic innovations. For this reason, they were suggested as an important creative force in the genome evolution and adaptation of an organism to altered environmental conditions. In this review, we summarize the up-to-date knowledge of different ways of retroelement involvement in structural and functional evolution of genes and genomes, as well as the mechanisms generated by cells to control their retrotransposition.

  14. Characterizing genomic alterations in cancer by complementary functional associations.

    PubMed

    Kim, Jong Wook; Botvinnik, Olga B; Abudayyeh, Omar; Birger, Chet; Rosenbluh, Joseph; Shrestha, Yashaswi; Abazeed, Mohamed E; Hammerman, Peter S; DiCara, Daniel; Konieczkowski, David J; Johannessen, Cory M; Liberzon, Arthur; Alizad-Rahvar, Amir Reza; Alexe, Gabriela; Aguirre, Andrew; Ghandi, Mahmoud; Greulich, Heidi; Vazquez, Francisca; Weir, Barbara A; Van Allen, Eliezer M; Tsherniak, Aviad; Shao, Diane D; Zack, Travis I; Noble, Michael; Getz, Gad; Beroukhim, Rameen; Garraway, Levi A; Ardakani, Masoud; Romualdi, Chiara; Sales, Gabriele; Barbie, David A; Boehm, Jesse S; Hahn, William C; Mesirov, Jill P; Tamayo, Pablo

    2016-05-01

    Systematic efforts to sequence the cancer genome have identified large numbers of mutations and copy number alterations in human cancers. However, elucidating the functional consequences of these variants, and their interactions to drive or maintain oncogenic states, remains a challenge in cancer research. We developed REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene dependency of oncogenic pathways or sensitivity to a drug treatment. We used REVEALER to uncover complementary genomic alterations associated with the transcriptional activation of β-catenin and NRF2, MEK-inhibitor sensitivity, and KRAS dependency. REVEALER successfully identified both known and new associations, demonstrating the power of combining functional profiles with extensive characterization of genomic alterations in cancer genomes.

  15. Development of data representation standards by the human proteome organization proteomics standards initiative

    PubMed Central

    Albar, Juan Pablo; Binz, Pierre-Alain; Eisenacher, Martin; Jones, Andrew R; Mayer, Gerhard; Omenn, Gilbert S; Orchard, Sandra; Vizcaíno, Juan Antonio; Hermjakob, Henning

    2015-01-01

    Objective: To describe the goals of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization, the methods that the PSI has employed to create data standards, the resulting output of the PSI, lessons learned from the PSI’s evolution, and future directions and synergies for the group. Materials and Methods: The PSI has 5 categories of deliverables that have guided the group. These are minimum information guidelines, data formats, controlled vocabularies, resources and software tools, and dissemination activities. These deliverables are produced via the leadership and working group organization of the initiative, driven by frequent workshops and ongoing communication within the working groups. Official standards are subjected to a rigorous document process that includes several levels of peer review prior to release. Results: We have produced and published minimum information guidelines describing what information should be provided when making data public, either via public repositories or other means. The PSI has produced a series of standard formats covering mass spectrometer input, mass spectrometer output, results of informatics analysis (both qualitative and quantitative analyses), reports of molecular interaction data, and gel electrophoresis analyses. We have produced controlled vocabularies that ensure that concepts are uniformly annotated in the formats and engaged in extensive software development and dissemination efforts so that the standards can efficiently be used by the community. Conclusion: In its first dozen years of operation, the PSI has produced many standards that have accelerated the field of proteomics by facilitating data exchange and deposition to data repositories. We look to the future to continue developing standards for new proteomics technologies and workflows and mechanisms for integration with other omics data types. Our products facilitate the translation of genomics and proteomics findings to clinical and

  16. Consolidation of proteomics data in the Cancer Proteomics database.

    PubMed

    Arntzen, Magnus Ø; Boddie, Paul; Frick, Rahel; Koehler, Christian J; Thiede, Bernd

    2015-11-01

    Cancer is a class of diseases characterized by abnormal cell growth and is one of the major causes of human death. Proteins are involved in the molecular mechanisms leading to cancer; furthermore, they are affected by anti-cancer drugs, and protein biomarkers can be used to diagnose certain cancer types. Therefore, it is important to explore the proteomics background of cancer. In this report, we developed the Cancer Proteomics database to re-interrogate published proteome studies investigating cancer. The database is divided into three sections related to cancer processes, cancer types, and anti-cancer drugs. Currently, the Cancer Proteomics database contains 9778 entries of 4118 proteins extracted from 143 scientific articles covering all three sections: cell death (cancer process), prostate cancer (cancer type) and platinum-based anti-cancer drugs including carboplatin, cisplatin, and oxaliplatin (anti-cancer drugs). The detailed information extracted from the literature includes basic information about the articles (e.g., PubMed ID, authors, journal name, publication year), information about the samples (type, study/reference, prognosis factor), and the proteomics workflow (subcellular fractionation, protein and peptide separation, mass spectrometry, quantification). Useful annotations such as hyperlinks to UniProt and PubMed were included. In addition, many filtering options were established as well as export functions. The database is freely available at http://cancerproteomics.uio.no. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Functional proteomic analysis of corticosteroid pharmacodynamics in rat liver: Relationship to hepatic stress, signaling, energy regulation, and drug metabolism.

    PubMed

    Ayyar, Vivaswath S; Almon, Richard R; DuBois, Debra C; Sukumaran, Siddharth; Qu, Jun; Jusko, William J

    2017-05-08

    Corticosteroids (CS) are anti-inflammatory agents that cause extensive pharmacogenomic and proteomic changes in multiple tissues. An understanding of the proteome-wide effects of CS in liver and its relationships to altered hepatic and systemic physiology remains incomplete. Here, we report the application of a functional pharmacoproteomic approach to gain integrated insight into the complex nature of CS responses in liver in vivo. An in-depth functional analysis was performed using rich pharmacodynamic (temporal-based) proteomic data measured over 66h in rat liver following a single dose of methylprednisolone (MPL). Data mining identified 451 differentially regulated proteins. These proteins were analyzed on the basis of temporal regulation, cellular localization, and literature-mined functional information. Of the 451 proteins, 378 were clustered into six functional groups based on major clinically-relevant effects of CS in liver. MPL-responsive proteins were highly localized in the mitochondria (20%) and cytosol (24%). Interestingly, several proteins were related to hepatic stress and signaling processes, which appear to be involved in secondary signaling cascades and in protecting the liver from CS-induced oxidative damage. Consistent with known adverse metabolic effects of CS, several rate-controlling enzymes involved in amino acid metabolism, gluconeogenesis, and fatty-acid metabolism were altered by MPL. In addition, proteins involved in the metabolism of endogenous compounds, xenobiotics, and therapeutic drugs including cytochrome P450 and Phase-II enzymes were differentially regulated. Proteins related to the inflammatory acute-phase response were up-regulated in response to MPL. Functionally-similar proteins showed large diversity in their temporal profiles, indicating complex mechanisms of regulation by CS. Clinical use of corticosteroid (CS) therapy is frequent and chronic. However, current knowledge on the proteome-level effects of CS in liver and

  18. A Proteogenomic Approach to Understanding MYC Function in Metastatic Medulloblastoma Tumors.

    PubMed

    Staal, Jerome A; Pei, Yanxin; Rood, Brian R

    2016-10-19

    Brain tumors are the leading cause of cancer-related deaths in children, and medulloblastoma is the most prevalent malignant childhood/pediatric brain tumor. Providing effective treatment for these cancers, with minimal damage to the still-developing brain, remains one of the greatest challenges faced by clinicians. Understanding the diverse events driving tumor formation, maintenance, progression, and recurrence is necessary for identifying novel targeted therapeutics and improving survival of patients with this disease. Genomic copy number alteration data, together with clinical studies, identify c-MYC amplification as an important risk factor associated with the most aggressive forms of medulloblastoma with marked metastatic potential. Yet despite this, very little is known regarding the impact of such genomic abnormalities upon the functional biology of the tumor cell. We discuss here how recent advances in quantitative proteomic techniques are now providing new insights into the functional biology of these aggressive tumors, as illustrated by the use of proteomics to bridge the gap between the genotype and phenotype in the case of c-MYC-amplified/associated medulloblastoma. These integrated proteogenomic approaches now provide a new platform for understanding cancer biology by providing a functional context to frame genomic abnormalities.

  19. Molecular Diagnosis and Biomarker Identification on SELDI proteomics data by ADTBoost method.

    PubMed

    Wang, Lu-Yong; Chakraborty, Amit; Comaniciu, Dorin

    2005-01-01

    Clinical proteomics is an emerging field that will have great impact on molecular diagnosis, identification of disease biomarkers, drug discovery and clinical trials in the post-genomic era. Protein profiling in tissues and fluids in disease and pathological control and other proteomics techniques will play an important role in molecular diagnosis with therapeutics and personalized healthcare. We introduced a new robust diagnostic method based on the ADTboost algorithm, an algorithm new to proteomics data analysis, to improve classification accuracy. It generates classification rules, which are often smaller and easier to interpret. This method often gives the most discriminative features, which can be utilized as biomarkers for diagnostic purposes. It also provides a measure of prediction confidence. We applied this method to amyotrophic lateral sclerosis (ALS) disease data acquired by surface enhanced laser-desorption/ionization-time-of-flight mass spectrometry (SELDI-TOF MS) experiments. Our method is shown to have outstanding prediction capacity through the cross-validation, ROC analysis results and comparative study. Our molecular diagnosis method provides an efficient way to distinguish ALS disease from neurological controls. The results are expressed in a simple and straightforward alternating decision tree format or conditional format. We identified the most discriminative peaks in proteomic data, which can be utilized as biomarkers for diagnosis. It will have broad application in molecular diagnosis through proteomics data analysis and personalized medicine in this post-genomic era.

  20. Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis

    PubMed Central

    Tellgren-Roth, Christian; Baudo, Charles D.; Kennell, John C.; Sun, Sheng; Billmyre, R. Blake; Schröder, Markus S.; Andersson, Anna; Holm, Tina; Sigurgeirsson, Benjamin; Wu, Guangxi; Sankaranarayanan, Sundar Ram; Siddharthan, Rahul; Sanyal, Kaustuv; Lundeberg, Joakim; Nystedt, Björn; Boekhout, Teun; Dawson, Thomas L.; Heitman, Joseph

    2017-01-01

    Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies. PMID:28100699
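
    The percentage gains quoted in the record above can be re-derived from the gene counts it reports. The short sketch below only repeats that arithmetic; the function and variable names are illustrative and do not come from the paper.

```python
def percent_increase(before: int, after: int) -> float:
    """Relative increase from `before` to `after`, in percent."""
    return 100.0 * (after - before) / before


# Gene counts as quoted in the abstract.
transcriptomics_only = 3612   # annotation from transcriptomics evidence alone
with_proteomics = 4113        # after integrating proteomics data
after_curation = 4493         # after manual curation

step1 = percent_increase(transcriptomics_only, with_proteomics)
step2 = percent_increase(with_proteomics, after_curation)

# Both values round to the percentages stated in the abstract (14% and 9%).
print(f"proteomics integration: +{step1:.1f}%")
print(f"manual curation:        +{step2:.1f}%")
```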

  1. A Bioinformatics Workflow for Variant Peptide Detection in Shotgun Proteomics*

    PubMed Central

    Li, Jing; Su, Zengliu; Ma, Ze-Qiang; Slebos, Robbert J. C.; Halvey, Patrick; Tabb, David L.; Liebler, Daniel C.; Pao, William; Zhang, Bing

    2011-01-01

    Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from being identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow on data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics. PMID:21389108
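
    The idea of adding known coding variants to a search database can be illustrated with a toy example. The sketch below is not the authors' workflow: it applies a single amino acid substitution to a made-up protein sequence, performs a simple in-silico tryptic digest (cleavage after K or R, not before P, no missed cleavages), and reports the peptides that exist only in the variant form. The sequence, the substitution, and all function names are hypothetical.

```python
import re


def tryptic_digest(protein: str) -> list[str]:
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", protein) if p]


def apply_substitution(protein: str, position: int, ref: str, alt: str) -> str:
    """Return the protein carrying one substitution (1-based position)."""
    if protein[position - 1] != ref:
        raise ValueError(f"expected {ref} at position {position}")
    return protein[: position - 1] + alt + protein[position:]


def variant_peptides(protein: str, position: int, ref: str, alt: str) -> list[str]:
    """Peptides produced by the variant protein but absent from the reference."""
    reference = set(tryptic_digest(protein))
    variant = tryptic_digest(apply_substitution(protein, position, ref, alt))
    return [p for p in variant if p not in reference]


# Hypothetical toy protein and substitution (Y5C), for illustration only.
toy_protein = "MKTAYIAKQRQISFVK"
print(tryptic_digest(toy_protein))
print(variant_peptides(toy_protein, 5, "Y", "C"))
```

    Only the variant-specific peptides need to be appended to the search database, since the remaining peptides are already covered by the reference entry.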

  2. Yeast Genomics for Bread, Beer, Biology, Bucks and Breath

    NASA Astrophysics Data System (ADS)

    Sakharkar, Kishore R.; Sakharkar, Meena K.

    The rapid advances and scale up of projects in DNA sequencing during the past two decades have produced complete genome sequences of several eukaryotic species. The versatile genetic malleability of the yeast, and the high degree of conservation between its cellular processes and those of human cells have made it a model of choice for pioneering research in molecular and cell biology. The complete sequence of yeast genome has proven to be extremely useful as a reference towards the sequences of human and for providing systems to explore key gene functions. Yeast has been a ‘legendary model’ for new technologies and gaining new biological insights into basic biological sciences and biotechnology. This chapter describes the awesome power of yeast genetics, genomics and proteomics in understanding of biological function. The applications of yeast as a screening tool to the field of drug discovery and development are highlighted and the traditional importance of yeast for bakers and brewers is discussed.

  3. Comparison of Normal and Breast Cancer Cell lines using Proteome, Genome and Interactome data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Patwardhan, Anil J.; Strittmatter, Eric F.; Camp, David G.

    2005-12-01

    Normal and cancer cell line proteomes were profiled using high throughput mass spectrometry techniques. Application of both protein-level and peptide-level sample fractionation combined with LC-MS/MS analysis enabled the confident identification of 2,235 unmodified proteins representing a broad range of functional and compartmental classes. An iterative multi-step search strategy was used to identify post-translational modifications and detected several proteins that are preferentially modified in cancer cells. Information regarding both unmodified and modified protein forms was combined with publicly available gene expression and protein-protein interaction data. The resulting integrated dataset revealed several functionally related proteins that are differentially regulated between normal and cancer cell lines.

  4. Proteomic Analysis of Pigeonpea (Cajanus cajan) Seeds Reveals the Accumulation of Numerous Stress-Related Proteins.

    PubMed

    Krishnan, Hari B; Natarajan, Savithiry S; Oehrle, Nathan W; Garrett, Wesley M; Darwish, Omar

    2017-06-14

    Pigeonpea is one of the major sources of dietary protein for more than a billion people living in South Asia. This hardy legume is often grown in low-input and risk-prone marginal environments. Considerable research effort has been devoted by a global research consortium to develop genomic resources for the improvement of this legume crop. These efforts have resulted in the elucidation of the complete genome sequence of pigeonpea. Despite these developments, little is known about the seed proteome of this important crop. Here, we report the proteome of pigeonpea seed. To enable the isolation of maximum number of seed proteins, including those that are present in very low amounts, three different protein fractions were obtained by employing different extraction media. High-resolution two-dimensional (2-D) electrophoresis followed by MALDI-TOF-TOF-MS/MS analysis of these protein fractions resulted in the identification of 373 pigeonpea seed proteins. Consistent with the reported high degree of synteny between the pigeonpea and soybean genomes, a large number of pigeonpea seed proteins exhibited significant amino acid homology with soybean seed proteins. Our proteomic analysis identified a large number of stress-related proteins, presumably due to its adaptation to drought-prone environments. The availability of a pigeonpea seed proteome reference map should shed light on the roles of these identified proteins in various biological processes and facilitate the improvement of seed composition.

  5. Insight from Mitochondrial Functions and Proteomics to Understand Cardiometabolic Disorders in Survivors of Acute Lymphoblastic Leukemia.

    PubMed

    Leahy, Jade; Spahis, Schohraya; Bonneil, Eric; Garofalo, Carole; Grimard, Guy; Morel, Sophia; Laverdière, Caroline; Krajinovic, Maja; Drouin, Simon; Delvin, Edgard; Sinnett, Daniel; Marcil, Valérie; Levy, Emile

    2018-03-18

    Childhood acute lymphoblastic leukemia (cALL) is the most prevalent form of cancer in children. Due to advances in treatment and therapy, young cALL subjects now achieve a 90% survival rate. However, this tremendous advance does not come without consequence since ~2/3 of cALL survivors are affected by long-term and late, severe complications. Although the metabolic syndrome is a very serious sequel of cALL, the mechanisms remain undefined. It is also surprising to note that the mitochondrion, a central organelle in metabolic functions and the main cellular energy generator, has not yet been explored. We aimed to determine whether cALL survivors exhibit impairments in their mitochondrial functions and proteomic profiles in relation to metabolic disorders, compared to healthy controls. Anthropometric measures, metabolic characteristics and lipid profiles were assessed, mitochondria were isolated from peripheral blood mononuclear cells, and their proteomes were analyzed. Our data demonstrated that metabolically Unhealthy survivors exhibited several metabolic syndrome components (e.g. overweight, insulin resistance, dyslipidemia, inflammation) whereas Healthy cALL survivors resemble the Controls. In line with these abnormalities, functional experiments in these subjects revealed a significant decrease in the protein expression of mitochondrial antioxidant superoxide dismutase, PGC1-α transcription factor (a key modulator of mitochondrion biogenesis), and an increase in pro-apoptotic cytochrome c. Proteomic analysis of mitochondria by mass spectrometry revealed changes in the regulation of proteins related to inflammation, apoptosis, energy production, redox and antioxidant activity, fatty acid β-oxidation, protein transport and metabolism, and signalling pathways between groups. Through the use of proteomic analysis, our work demonstrated a number of significant alterations in protein expression in mitochondria of cALL survivors, especially the metabolically

  6. Rhodopseudomonas palustris CGA010 Proteome Implicates Extracytoplasmic Function Sigma Factor in Stress Response

    DOE PAGES

    Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.; ...

    2015-04-08

    Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σ RPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensus sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σ RPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.
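
    A consensus such as GGAAC-18N-TT lends itself to a simple pattern search. The sketch below assumes the notation means GGAAC, then 18 arbitrary nucleotides, then TT; that reading, the toy sequence, and the function name are assumptions for illustration and are not taken from the paper. A lookahead is used so that overlapping occurrences are also reported.

```python
import re

# Assumed reading of the consensus: GGAAC + any 18 nucleotides + TT.
MOTIF = re.compile(r"(?=(GGAAC[ACGT]{18}TT))")


def find_motif(sequence: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched sequence) for every motif occurrence."""
    return [(m.start(), m.group(1)) for m in MOTIF.finditer(sequence.upper())]


# Made-up promoter fragment with one embedded motif, for illustration only.
toy_promoter = "ATCG" + "GGAAC" + "A" * 18 + "TT" + "GCA"
for start, hit in find_motif(toy_promoter):
    print(start, hit)
```

    A real promoter scan would also need to search the reverse strand and tolerate mismatches, which this sketch leaves out.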

  7. Rhodopseudomonas palustris CGA010 Proteome Implicates Extracytoplasmic Function Sigma Factor in Stress Response

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.

    Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σ RPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensus sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σ RPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.

  8. Comparative bioinformatics analyses and profiling of lysosome-related organelle proteomes

    NASA Astrophysics Data System (ADS)

    Hu, Zhang-Zhi; Valencia, Julio C.; Huang, Hongzhan; Chi, An; Shabanowitz, Jeffrey; Hearing, Vincent J.; Appella, Ettore; Wu, Cathy

    2007-01-01

    Complete and accurate profiling of cellular organelle proteomes, while challenging, is important for the understanding of detailed cellular processes at the organelle level. Mass spectrometry technologies coupled with bioinformatics analysis provide an effective approach for protein identification and functional interpretation of organelle proteomes. In this study, we have compiled human organelle reference datasets from large-scale proteomic studies and protein databases for seven lysosome-related organelles (LROs), as well as the endoplasmic reticulum and mitochondria, for comparative organelle proteome analysis. Heterogeneous sources of human organelle proteins and rodent homologs are mapped to human UniProtKB protein entries based on ID and/or peptide mappings, followed by functional annotation and categorization using the iProXpress proteomic expression analysis system. Cataloging organelle proteomes allows close examination of both shared and unique proteins among various LROs and reveals their functional relevance. The proteomic comparisons show that LROs are a closely related family of organelles. The shared proteins indicate the dynamic and hybrid nature of LROs, while the unique transmembrane proteins may represent additional candidate marker proteins for LROs. This comparative analysis, therefore, provides a basis for hypothesis formulation and experimental validation of organelle proteins and their functional roles.

  9. The developmental proteome of Drosophila melanogaster

    PubMed Central

    Casas-Vila, Nuria; Bluhm, Alina; Sayols, Sergi; Dinges, Nadja; Dejung, Mario; Altenhein, Tina; Kappei, Dennis; Altenhein, Benjamin; Roignant, Jean-Yves; Butter, Falk

    2017-01-01

    Drosophila melanogaster is a widely used genetic model organism in developmental biology. While this model organism has been intensively studied at the RNA level, a comprehensive proteomic study covering the complete life cycle is still missing. Here, we apply label-free quantitative proteomics to explore proteome remodeling across Drosophila’s life cycle, resulting in 7952 proteins, and provide a high temporal-resolved embryogenesis proteome of 5458 proteins. Our proteome data enabled us to monitor isoform-specific expression of 34 genes during development, to identify the pseudogene Cyp9f3Ψ as a protein-coding gene, and to obtain evidence of 268 small proteins. Moreover, the comparison with available transcriptomic data uncovered examples of poor correlation between mRNA and protein, underscoring the importance of proteomics to study developmental progression. Data integration of our embryogenesis proteome with tissue-specific data revealed spatial and temporal information for further functional studies of yet uncharacterized proteins. Overall, our high resolution proteomes provide a powerful resource and can be explored in detail in our interactive web interface. PMID:28381612

  10. Detailed tail proteomic analysis of axolotl (Ambystoma mexicanum) using an mRNA-seq reference database.

    PubMed

    Demircan, Turan; Keskin, Ilknur; Dumlu, Seda Nilgün; Aytürk, Nilüfer; Avşaroğlu, Mahmut Erhan; Akgün, Emel; Öztürk, Gürkan; Baykal, Ahmet Tarık

    2017-01-01

    Salamander axolotl has been emerging as an important model for stem cell research due to its powerful regenerative capacity. Several advantages, such as the high capability of advanced tissue, organ, and appendages regeneration, promote axolotl as an ideal model system to extend our current understanding on the mechanisms of regeneration. Acknowledging the common molecular pathways between amphibians and mammals, there is a great potential to translate the messages from axolotl research to mammalian studies. However, the utilization of axolotl is hindered due to the lack of reference databases of genomic, transcriptomic, and proteomic data. Here, we introduce the proteome analysis of the axolotl tail section searched against an mRNA-seq database. We translated axolotl mRNA sequences to protein sequences and annotated these to process the LC-MS/MS data and identified 1001 nonredundant proteins. Functional classification of identified proteins was performed by gene ontology searches. The presence of some of the identified proteins was validated by in situ antibody labeling. Furthermore, we have analyzed the proteome expressional changes postamputation at three time points to evaluate the underlying mechanisms of the regeneration process. Taken together, this work expands the proteomics data of axolotl to contribute to its establishment as a fully utilized model. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation

    NASA Astrophysics Data System (ADS)

    Sheynkman, Gloria M.; Shortreed, Michael R.; Cesnik, Anthony J.; Smith, Lloyd M.

    2016-06-01

    Mass spectrometry-based proteomics has emerged as the leading method for detection, quantification, and characterization of proteins. Nearly all proteomic workflows rely on proteomic databases to identify peptides and proteins, but these databases typically contain a generic set of proteins that lack variations unique to a given sample, precluding their detection. Fortunately, proteogenomics enables the detection of such proteomic variations and can be defined, broadly, as the use of nucleotide sequences to generate candidate protein sequences for mass spectrometry database searching. Proteogenomics is experiencing heightened significance due to two developments: (a) advances in DNA sequencing technologies that have made complete sequencing of human genomes and transcriptomes routine, and (b) the unveiling of the tremendous complexity of the human proteome as expressed at the levels of genes, cells, tissues, individuals, and populations. We review here the field of human proteogenomics, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications.

  12. Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation

    PubMed Central

    Sheynkman, Gloria M.; Shortreed, Michael R.; Cesnik, Anthony J.; Smith, Lloyd M.

    2016-01-01

    Mass spectrometry–based proteomics has emerged as the leading method for detection, quantification, and characterization of proteins. Nearly all proteomic workflows rely on proteomic databases to identify peptides and proteins, but these databases typically contain a generic set of proteins that lack variations unique to a given sample, precluding their detection. Fortunately, proteogenomics enables the detection of such proteomic variations and can be defined, broadly, as the use of nucleotide sequences to generate candidate protein sequences for mass spectrometry database searching. Proteogenomics is experiencing heightened significance due to two developments: (a) advances in DNA sequencing technologies that have made complete sequencing of human genomes and transcriptomes routine, and (b) the unveiling of the tremendous complexity of the human proteome as expressed at the levels of genes, cells, tissues, individuals, and populations. We review here the field of human proteogenomics, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications. PMID:27049631
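    The definition of proteogenomics given in this record (using nucleotide sequences to generate candidate protein sequences for database searching) can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes the standard genetic code, forward reading frames only, and a hypothetical minimum-length filter; the function names are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, frame=0):
    """Translate a nucleotide sequence in one forward frame ('*' marks a stop)."""
    seq = seq.upper().replace("U", "T")  # accept RNA as well as DNA input
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def candidate_proteins(seq, min_len=7):
    """Collect stop-free stretches from the three forward frames.

    min_len is an illustrative cutoff (an assumption), not a published value.
    """
    candidates = []
    for frame in range(3):
        for segment in translate(seq, frame).split("*"):
            if len(segment) >= min_len:
                candidates.append(segment)
    return candidates
```

    A real workflow would also handle the reverse strand, splice variants, and sample-specific variants before handing the candidates to a search engine.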

  13. Proteomic Profiling in the Brain of CLN1 Disease Model Reveals Affected Functional Modules.

    PubMed

    Tikka, Saara; Monogioudi, Evanthia; Gotsopoulos, Athanasios; Soliymani, Rabah; Pezzini, Francesco; Scifo, Enzo; Uusi-Rauva, Kristiina; Tyynelä, Jaana; Baumann, Marc; Jalanko, Anu; Simonati, Alessandro; Lalowski, Maciej

    2016-03-01

    Neuronal ceroid lipofuscinoses (NCL) are the most commonly inherited progressive encephalopathies of childhood. Pathologically, they are characterized by endolysosomal storage with different ultrastructural features and biochemical compositions. The molecular mechanisms causing progressive neurodegeneration and common molecular pathways linking expression of different NCL genes are largely unknown. We analyzed proteome alterations in the brains of a mouse model of human infantile CLN1 disease-palmitoyl-protein thioesterase 1 (Ppt1) gene knockout and its wild-type age-matched counterpart at different stages: pre-symptomatic, symptomatic and advanced. For this purpose, we utilized a combination of laser capture microdissection-based quantitative liquid chromatography tandem mass spectrometry (MS) and matrix-assisted laser desorption/ionization time-of-flight MS imaging to quantify/visualize the changes in protein expression in disease-affected brain thalamus and cerebral cortex tissue slices, respectively. Proteomic profiling of the pre-symptomatic stage thalamus revealed alterations mostly in metabolic processes and inhibition of various neuronal functions, i.e., neuritogenesis. Down-regulation in dynamics associated with growth of plasma projections and cellular protrusions was further corroborated by findings from RNA sequencing of CLN1 patients' fibroblasts. Changes detected at the symptomatic stage included: mitochondrial functions, synaptic vesicle transport, myelin proteome and signaling cascades, such as RhoA signaling. Considerable dysregulation of processes related to mitochondrial cell death, RhoA/Huntington's disease signaling and myelin sheath breakdown were observed at the advanced stage of the disease. The identified changes in protein levels were further substantiated by bioinformatics and network approaches, immunohistochemistry on brain tissues and literature knowledge, thus identifying various functional modules affected in the CLN1 childhood

  14. Chemical Proteomic Approaches Targeting Cancer Stem Cells: A Review of Current Literature.

    PubMed

    Jung, Hye Jin

    2017-01-01

    Cancer stem cells (CSCs) have been proposed as central drivers of tumor initiation, progression, recurrence, and therapeutic resistance. Therefore, identifying stem-like cells within cancers and understanding their properties is crucial for the development of effective anticancer therapies. Recently, chemical proteomics has become a powerful tool to efficiently determine protein networks responsible for CSC pathophysiology and comprehensively elucidate molecular mechanisms of drug action against CSCs. This review provides an overview of major methodologies utilized in chemical proteomic approaches. In addition, recent successful chemical proteomic applications targeting CSCs are highlighted. Future directions for CSC research that integrate chemical genomic and proteomic data obtained from a single biological sample of CSCs are also suggested. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  15. RNAi for functional genomics in plants.

    PubMed

    McGinnis, Karen M

    2010-03-01

    RNAi refers to several different types of gene silencing mediated by small, dsRNA molecules. Over the course of 20 years, the scientific understanding of RNAi has developed from the initial observation of unexpected expression patterns to a sophisticated understanding of a multi-faceted, evolutionarily conserved network of mechanisms that regulate gene expression in many organisms. It has also been developed as a genetic tool that can be exploited in a wide range of species. Because transgene-induced RNAi has been effective at silencing one or more genes in a wide range of plants, this technology also bears potential as a powerful functional genomics tool across the plant kingdom. Transgene-induced RNAi has indeed been shown to be an effective mechanism for silencing many genes in many organisms, but the results from multiple projects which attempted to exploit RNAi on a genome-wide scale suggest that there is a great deal of variation in the silencing efficacy between transgenic events, silencing targets and silencing-induced phenotype. The results from these projects indicate several important variables that should be considered in experimental design prior to the initiation of functional genomics efforts based on RNAi silencing. In recent years, alternative strategies have been developed for targeted gene silencing, and a combination of approaches may also enhance the use of targeted gene silencing for functional genomics.

  16. Contemporary Network Proteomics and Its Requirements

    PubMed Central

    Goh, Wilson Wen Bin; Wong, Limsoon; Sng, Judy Chia Ghee

    2013-01-01

    The integration of networks with genomics (network genomics) is a familiar field. Conventional network analysis takes advantage of the larger coverage and relative stability of gene expression measurements. Network proteomics on the other hand has to develop further on two critical factors: (1) expanded data coverage and consistency, and (2) suitable reference network libraries, and data mining from them. Concerning (1) we discuss several contemporary themes that can improve data quality, which in turn will boost the outcome of downstream network analysis. For (2), we focus on network analysis developments, specifically, the need for context-specific networks and essential considerations for localized network analysis. PMID:24833333

  17. Organellar proteomics reveals hundreds of novel nuclear proteins in the malaria parasite Plasmodium falciparum

    PubMed Central

    2012-01-01

    Background The post-genomic era of malaria research provided unprecedented insights into the biology of Plasmodium parasites. Due to the large evolutionary distance to model eukaryotes, however, we lack a profound understanding of many processes in Plasmodium biology. One example is the cell nucleus, which controls the parasite genome in a development- and cell cycle-specific manner through mostly unknown mechanisms. To study this important organelle in detail, we conducted an integrative analysis of the P. falciparum nuclear proteome. Results We combined high accuracy mass spectrometry and bioinformatic approaches to present for the first time an experimentally determined core nuclear proteome for P. falciparum. Besides a large number of factors implicated in known nuclear processes, one-third of all detected proteins carry no functional annotation, including many phylum- or genus-specific factors. Importantly, extensive experimental validation using 30 transgenic cell lines confirmed the high specificity of this inventory, and revealed distinct nuclear localization patterns of hitherto uncharacterized proteins. Further, our detailed analysis identified novel protein domains potentially implicated in gene transcription pathways, and sheds important new light on nuclear compartments and processes including regulatory complexes, the nucleolus, nuclear pores, and nuclear import pathways. Conclusion Our study provides comprehensive new insight into the biology of the Plasmodium nucleus and will serve as an important platform for dissecting general and parasite-specific nuclear processes in malaria parasites. Moreover, as the first nuclear proteome characterized in any protist organism, it will provide an important resource for studying evolutionary aspects of nuclear biology. PMID:23181666

  18. THE MMACHC PROTEOME: HALLMARKS OF FUNCTIONAL COBALAMIN DEFICIENCY IN HUMANS

    PubMed Central

    Hannibal, Luciana; DiBello, Patricia M.; Yu, Michelle; Miller, Abby; Wang, Sihe; Willard, Belinda; Rosenblatt, David S.; Jacobsen, Donald W.

    2011-01-01

    Cobalamin (Cbl, B12) is an essential micronutrient required to fulfill the enzymatic reactions of cytosolic methylcobalamin-dependent methionine synthase and mitochondrial adenosylcobalamin-dependent methylmalonyl-CoA mutase. Mutations in the MMACHC gene (cblC complementation group) disrupt processing of the upper-axial ligand of newly internalized cobalamins, leading to functional deficiency of the vitamin. Patients with cblC disease present with both hyperhomocysteinemia and methylmalonic acidemia, cognitive dysfunction, and megaloblastic anemia. In the present study we show that cultured skin fibroblasts from cblC patients export increased levels of both homocysteine and methylmalonic acid compared to control skin fibroblasts, and that they also have decreased levels of total intracellular folates. This is consistent with the clinical phenotype of functional cobalamin deficiency in vivo. The protein changes that accompany human functional Cbl deficiency are unknown. The proteome of control and cblC fibroblasts was quantitatively examined by two dimensional in-gel electrophoresis (2D-DIGE) and liquid chromatography-electrospray ionization-mass spectrometry (LC/ESI/MS). Major changes were observed in the expression levels of proteins involved in cytoskeleton organization and assembly, the neurological system and cell signaling. Pathway analysis of the differentially expressed proteins demonstrated strong associations with neurological disorders, muscular and skeletal disorders, and cardiovascular diseases in the cblC mutant cell lines. Supplementation of the cell cultures with hydroxocobalamin did not restore the cblC proteome to the patterns of expression observed in control cells. These results concur with the observed phenotype of patients with the cblC disorder and their sometimes poor response to treatment with hydroxocobalamin. Our findings could be valuable for designing alternative therapies to alleviate the clinical manifestation of the cblC disorder, as

  19. Phylogenomics of plant genomes: a methodology for genome-wide searches for orthologs in plants

    PubMed Central

    Conte, Matthieu G; Gaillard, Sylvain; Droc, Gaetan; Perin, Christophe

    2008-01-01

    Background Gene ortholog identification is now a major objective for mining the increasing amount of sequence data generated by complete or partial genome sequencing projects. Comparative and functional genomics urgently need a method for ortholog detection to reduce gene function inference and to aid in the identification of conserved or divergent genetic pathways between several species. As gene functions change during evolution, reconstructing the evolutionary history of genes should be a more accurate way to differentiate orthologs from paralogs. Phylogenomics takes into account phylogenetic information from high-throughput genome annotation and is the most straightforward way to infer orthologs. However, procedures for automatic detection of orthologs are still scarce and suffer from several limitations. Results We developed a procedure for ortholog prediction between Oryza sativa and Arabidopsis thaliana. Firstly, we established an efficient method to cluster A. thaliana and O. sativa full proteomes into gene families. Then, we developed an optimized phylogenomics pipeline for ortholog inference. We validated the full procedure using test sets of orthologs and paralogs to demonstrate that our method outperforms pairwise methods for ortholog predictions. Conclusion Our procedure achieved a high level of accuracy in predicting ortholog and paralog relationships. Phylogenomic predictions for all validated gene families in both species were easily achieved and we can conclude that our methodology outperforms similarly based methods. PMID:18426584

  20. Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut

    PubMed Central

    Armero, Alix; Bocs, Stéphanie; This, Dominique

    2017-01-01

    The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as reference species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of protein products in A. thaliana by 13%, often allowing recovery of 100% of the protein sequence length. In addition, redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/). PMID:28334050

  1. Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.

    PubMed

    Armero, Alix; Baudouin, Luc; Bocs, Stéphanie; This, Dominique

    2017-01-01

    The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as reference species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of protein products in A. thaliana by 13%, often allowing recovery of 100% of the protein sequence length. In addition, redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).

  2. The International Proteomics Tutorial Programme--reaching out to the next generation proteome scientists.

    PubMed

    James, Peter; Marko-Varga, György A

    2011-08-05

    One of the most critical functions of the various Proteomics organizations is the training of young scientists and the dissemination of information to the general scientific community. The education committees of the Human Proteome Organisation (HUPO) and the European Proteomics Association (EuPA) together with the other local proteomics associations are therefore launching a joint Tutorial Program to meet these needs. The level is aimed at Masters/PhD level students with good basic training in biology, biochemistry, mathematics and statistics. The Tutorials will consist of a review/teaching article with an accompanying talk slide presentation for classroom teaching. The Tutorial Program will cover core techniques and basics as an introduction to scientists new to the field. The entire series of articles and slides will be made freely available for teaching use at the Journals and Organizations homepages.

  3. Matrix metalloproteinase proteomics: substrates, targets, and therapy.

    PubMed

    Morrison, Charlotte J; Butler, Georgina S; Rodríguez, David; Overall, Christopher M

    2009-10-01

    Proteomics encompasses powerful techniques termed 'degradomics' for unbiased high-throughput protease substrate discovery screens that have been applied to an important family of extracellular proteases, the matrix metalloproteinases (MMPs). Together with the data generated from genetic deletion and transgenic mouse models and genomic profiling, these screens can uncover the diverse range of MMP functions, reveal which MMPs and MMP-mediated pathways exacerbate pathology, and which are involved in protection and the resolution of disease. This information can be used to identify and validate candidate drug targets and antitargets, and is critical for the development of new inhibitors of MMP function. Such inhibitors may target either the MMP directly in a specific manner or pathways upstream and downstream of MMP activity that are mediating deleterious effects in disease. Since MMPs do not operate alone but are part of the 'protease web', it is necessary to use system-wide approaches to understand MMP proteolysis in vivo, to discover new biological roles and their potential for therapeutic modification.

  4. Quantitative proteomics in teleost fish: insights and challenges for neuroendocrine and neurotoxicology research.

    PubMed

    Martyniuk, Christopher J; Popesku, Jason T; Chown, Brittany; Denslow, Nancy D; Trudeau, Vance L

    2012-05-01

    Neuroendocrine systems integrate both extrinsic and intrinsic signals to regulate virtually all aspects of an animal's physiology. In aquatic toxicology, studies have shown that pollutants are capable of disrupting the neuroendocrine system of teleost fish, and many chemicals found in the environment can also have a neurotoxic mode of action. Omics approaches are now used to better understand cell signaling cascades underlying fish neurophysiology and the control of pituitary hormone release, in addition to identifying adverse effects of pollutants in the teleostean central nervous system. For example, both high throughput genomics and proteomic investigations of molecular signaling cascades for both neurotransmitter and nuclear receptor agonists/antagonists have been reported. This review highlights recent studies that have utilized quantitative proteomics methods such as 2D differential in-gel electrophoresis (DIGE) and isobaric tagging for relative and absolute quantitation (iTRAQ) in neuroendocrine regions and uses these examples to demonstrate the challenges of using proteomics in neuroendocrinology and neurotoxicology research. To begin to characterize the teleost neuroproteome, we functionally annotated 623 unique proteins found in the fish hypothalamus and telencephalon. These proteins have roles in biological processes that include synaptic transmission, ATP production, receptor activity, cell structure and integrity, and stress responses. The biological processes most represented by proteins detected in the teleost neuroendocrine brain included transport (8.4%), metabolic process (5.5%), and glycolysis (4.8%). We provide an example of using sub-network enrichment analysis (SNEA) to identify protein networks in the fish hypothalamus in response to dopamine receptor signaling. Dopamine signaling altered the abundance of proteins that are binding partners of microfilaments, integrins, and intermediate filaments, consistent with data suggesting dopaminergic

  5. Activity-based proteomics of enzyme superfamilies: serine hydrolases as a case study.

    PubMed

    Simon, Gabriel M; Cravatt, Benjamin F

    2010-04-09

    Genome sequencing projects have uncovered thousands of uncharacterized enzymes in eukaryotic and prokaryotic organisms. Deciphering the physiological functions of enzymes requires tools to profile and perturb their activities in native biological systems. Activity-based protein profiling has emerged as a powerful chemoproteomic strategy to achieve these objectives through the use of chemical probes that target large swaths of enzymes that share active-site features. Here, we review activity-based protein profiling and its implementation to annotate the enzymatic proteome, with particular attention given to probes that target serine hydrolases, a diverse superfamily of enzymes replete with many uncharacterized members.

  6. ProCon - PROteomics CONversion tool.

    PubMed

    Mayer, Gerhard; Stephan, Christian; Meyer, Helmut E; Kohl, Michael; Marcus, Katrin; Eisenacher, Martin

    2015-11-03

    With the growing amount of experimental data produced in proteomics experiments and the requirements/recommendations of journals in the proteomics field to make the data described in papers publicly available, a need arises for long-term storage of proteomics data in public repositories. Such an upload requires proteomics data in a standardized format. It is therefore desirable that proprietary vendor software will in the future integrate export functionality using the standard formats for proteomics results defined by the HUPO-PSI group. Currently not all search engines and analysis tools support these standard formats. In the meantime there is a need to provide user-friendly free-to-use conversion tools that can convert the data into such standard formats in order to support wet-lab scientists in creating proteomics data files ready for upload into the public repositories. ProCon is such a conversion tool, written in Java, for conversion of proteomics identification data into the standard formats mzIdentML and PRIDE XML. It allows the conversion of Sequest™/Comet .out files, of search results from the widely used ProteomeDiscoverer® 1.x (x=versions 1.1 to 1.4) software and of search results stored in the LIMS systems ProteinScape® 1.3 and 2.1 into mzIdentML and PRIDE XML. This article is part of a Special Issue entitled: Computational Proteomics. Copyright © 2015. Published by Elsevier B.V.

  7. Proteome Analysis of Borrelia burgdorferi Response to Environmental Change

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Angel, Thomas E.; Luft, Benjamin J.; Yang, Xiaohua

    2010-11-02

    We examined global changes in protein expression in the B31 strain of Borrelia burgdorferi, in response to two environmental cues (pH and temperature) chosen for their reported similarity to those encountered at different stages of the organism’s life cycle. Multidimensional nano-liquid chromatographic separations coupled with tandem mass spectrometry were used to examine the array of proteins (i.e., the proteome) of B. burgdorferi for different pH and temperature culture conditions. Changes in pH and temperature elicited in vitro adaptations of this spirochete known to cause Lyme disease and led to alterations in protein expression that are associated with increased microbial pathogenesis. We identified 1031 proteins that represent 59% of the annotated genome of B. burgdorferi and elucidated a core proteome of 414 proteins that were present in all environmental conditions investigated. Observed changes in protein abundances indicated varied replicon usage, as well as proteome functional distributions between the in vitro cell culture conditions. Surprisingly, the pH and temperature conditions that mimicked B. burgdorferi residing in the gut of a fed tick showed a marked reduction in protein diversity. Additionally, the results provide us with leading candidates for exploring how B. burgdorferi adapts to and is able to survive in a wide variety of environmental conditions and lay a foundation for planned in situ studies of B. burgdorferi isolated from the tick midgut and infected animals.

  8. MitoMiner: a data warehouse for mitochondrial proteomics data

    PubMed Central

    Smith, Anthony C.; Blackshaw, James A.; Robinson, Alan J.

    2012-01-01

    MitoMiner (http://mitominer.mrc-mbu.cam.ac.uk/) is a data warehouse for the storage and analysis of mitochondrial proteomics data gathered from publications of mass spectrometry and green fluorescent protein tagging studies. In MitoMiner, these data are integrated with data from UniProt, Gene Ontology, Online Mendelian Inheritance in Man, HomoloGene, Kyoto Encyclopaedia of Genes and Genomes and PubMed. The latest release of MitoMiner stores proteomics data sets from 46 studies covering 11 different species from eumetazoa, viridiplantae, fungi and protista. MitoMiner is implemented by using the open source InterMine data warehouse system, which provides a user interface allowing users to upload data for analysis, personal accounts to store queries and results and enables queries of any data in the data model. MitoMiner also provides lists of proteins for use in analyses, including the new MitoMiner mitochondrial proteome reference sets that specify proteins with substantial experimental evidence for mitochondrial localization. As further mitochondrial proteomics data sets from normal and diseased tissue are published, MitoMiner can be used to characterize the variability of the mitochondrial proteome between tissues and investigate how changes in the proteome may contribute to mitochondrial dysfunction and mitochondrial-associated diseases such as cancer, neurodegenerative diseases, obesity, diabetes, heart failure and the ageing process. PMID:22121219

  9. Quantitative Analysis of the Human Milk Whey Proteome Reveals Developing Milk and Mammary-Gland Functions across the First Year of Lactation

    PubMed Central

    Zhang, Qiang; Cundiff, Judy K.; Maria, Sarah D.; McMahon, Robert J.; Woo, Jessica G.; Davidson, Barbara S.; Morrow, Ardythe L.

    2013-01-01

    In-depth understanding of the changing functions of human milk (HM) proteins and the corresponding physiological adaptions of the lactating mammary gland has been inhibited by incomplete knowledge of the HM proteome. We analyzed the HM whey proteome (n = 10 women with samples at 1 week and 1, 3, 6, 9 and 12 months) using a quantitative proteomic approach. One thousand three hundred and thirty three proteins were identified with 615 being quantified. Principal component analysis revealed a transition in the HM whey proteome throughout the first year of lactation. Abundance changes in IgG, sIgA and sIgM display distinct features during the first year. Complement components and other acute-phase proteins are generally at higher levels in early lactation. Proteomic analysis further suggests that the sources of milk fatty acids (FA) shift from more direct blood influx to more de novo mammary synthesis over lactation. The abundances of the majority of glycoproteins decline over lactation, which is consistent with increased enzyme expression in glycoprotein degradation and decreased enzyme expression in glycoprotein synthesis. Cellular detoxification machinery may be transformed as well, thereby accommodating increased metabolic activities in late lactation. The multiple developing functions of HM proteins and the corresponding mammary adaption become more apparent from this study. PMID:28250401

  10. Large Scale Proteomic Data and Network-Based Systems Biology Approaches to Explore the Plant World.

    PubMed

    Di Silvestre, Dario; Bergamaschi, Andrea; Bellini, Edoardo; Mauri, PierLuigi

    2018-06-03

    The investigation of plant organisms by means of data-derived systems biology approaches based on network modeling is mainly characterized by genomic data, while the potential of proteomics is largely unexplored. This delay is mainly caused by the paucity of plant genomic/proteomic sequences and annotations which are fundamental to perform mass-spectrometry (MS) data interpretation. However, Next Generation Sequencing (NGS) techniques are contributing to filling this gap and an increasing number of studies are focusing on plant proteome profiling and protein-protein interactions (PPIs) identification. Interesting results were obtained by evaluating the topology of PPI networks in the context of organ-associated biological processes as well as plant-pathogen relationships. These examples foreshadow well the benefits that these approaches may provide to plant research. Thus, in addition to providing an overview of the main -omic technologies recently used on plant organisms, we will focus on studies that rely on concepts of module, hub and shortest path, and how they can contribute to the plant discovery processes. In this scenario, we will also consider gene co-expression networks, and some examples of integration with metabolomic data and genome-wide association studies (GWAS) to select candidate genes will be mentioned.

  11. Genome sequence diversity and clues to the evolution of variola (smallpox) virus.

    PubMed

    Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M

    2006-08-11

    Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.

  12. A glimpse into the proteome of phototrophic bacterium Rhodobacter capsulatus.

    PubMed

    Onder, Ozlem; Aygun-Sunar, Semra; Selamoglu, Nur; Daldal, Fevzi

    2010-01-01

    A first glimpse into the proteome of Rhodobacter capsulatus revealed more than 450 (with over 210 cytoplasmic and 185 extracytoplasmic known as well as 55 unknown) proteins that are identified with high degree of confidence using nLC-MS/MS analyses. The accumulated data provide a solid platform for ongoing efforts to establish the proteome of this species and the cellular locations of its constituents. They also indicate that at least 40 of the identified proteins, which were annotated in genome databases as unknown hypothetical proteins, correspond to predicted translation products that are indeed present in cells under the growth conditions used in this work. In addition, matching the identification labels of the proteins reported between the two available R. capsulatus genome databases (ERGO-light with RRCxxxxx and NT05 with NT05RCxxxx numbers) indicated that 11 such proteins are listed only in the latter database.

  13. Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline*

    PubMed Central

    Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W.; Moritz, Robert L.

    2015-01-01

    Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. PMID:25418363

  14. Development and application of automated systems for plasmid-based functional proteomics to improve synthetic biology of engineered industrial microbes for high level expression of proteases for biofertilizer production

    USDA-ARS's Scientific Manuscript database

    In addition to microarray technology, which provides a robust method to study protein function in a rapid, economical, and proteome-wide fashion, plasmid-based functional proteomics is an important technology for rapidly obtaining large quantities of protein and determining protein function across a...

  15. A Genomic and Proteomic Approach to Identify and Quantify the Expressed Bacillus thuringiensis Proteins in the Supernatant and Parasporal Crystal.

    PubMed

    Gomis-Cebolla, Joaquín; Scaramal Ricietto, Ana Paula; Ferré, Juan

    2018-05-10

    The combined analysis of genomic and proteomic data allowed us to determine which cry and vip genes are present in a Bacillus thuringiensis (Bt) isolate and which ones are being expressed. Nine Bt isolates were selected from Spanish collections of Bt based on their vip1 and vip2 gene content. As a first step, nine isolates were analyzed by PCR to select those Bt isolates that contained genes with the lowest similarity to already described vip1 and vip2 genes (isolates E-SE10.2 and O-V84.2). Two selected isolates were subjected to a combined genomic and proteomic analysis. The results showed that the Bt isolate E-SE10.2 codifies for two new vegetative proteins, Vip2Ac-like_1 and Sip1Aa-like_1, that do not show expression differences at 24 h vs. 48 h and are expressed in a low amount. The Bt isolate O-V84.2 codifies for three new vegetative proteins, Vip4Aa-like_1, Vip4Aa-like_2, and Vip2Ac-like_2, that are marginally expressed. The Vip4Aa-like_1 protein was two-fold more abundant at 24 h vs. 48 h, while the Vip4Aa-like_2 was detected only at 24 h. For Vip2Ac-like_2, no differences in expression were found at 24 h vs. 48 h. Moreover, the parasporal crystal of the E-SE10.2 isolate contains a single type of crystal protein, Cry23Aa-like, while the parasporal crystal from O-V84.2 contains three kinds of crystal proteins: 7.0–9.8% weight of Cry45Aa-like proteins, 35–37% weight of Cry32-like proteins and 2.8–4.3% weight of Cry73-like protein.

  16. Micro-proteomics with iterative data analysis: Proteome analysis in C. elegans at the single worm level.

    PubMed

    Bensaddek, Dalila; Narayan, Vikram; Nicolas, Armel; Murillo, Alejandro Brenes; Gartner, Anton; Kenyon, Cynthia J; Lamond, Angus I

    2016-02-01

    Proteomics studies typically analyze proteins at a population level, using extracts prepared from tens of thousands to millions of cells. The resulting measurements correspond to average values across the cell population and can mask considerable variation in protein expression and function between individual cells or organisms. Here, we report the development of micro-proteomics for the analysis of Caenorhabditis elegans, a eukaryote composed of 959 somatic cells and ∼1500 germ cells, measuring the worm proteome at a single organism level to a depth of ∼3000 proteins. This includes detection of proteins across a wide dynamic range of expression levels (>6 orders of magnitude), including many chromatin-associated factors involved in chromosome structure and gene regulation. We apply the micro-proteomics workflow to measure the global proteome response to heat-shock in individual nematodes. This shows variation between individual animals in the magnitude of proteome response following heat-shock, including variable induction of heat-shock proteins. The micro-proteomics pipeline thus facilitates the investigation of stochastic variation in protein expression between individuals within an isogenic population of C. elegans. All data described in this study are available online via the Encyclopedia of Proteome Dynamics (http://www.peptracker.com/epd), an open access, searchable database resource. © 2015 The Authors. PROTEOMICS Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Dynamic Adaptive Binning: An Improved Quantification Technique for NMR Spectroscopic Data

    DTIC Science & Technology

    2010-01-01

    Reo 2002). Unlike proteomics and genomics that assess intermediate products, metabolomics assesses the end product of cellular function, metabolites...other proteomic, genomic, and metabolomic analyses, NMR spectroscopy is Electronic supplementary material The online version of this article (doi...Changes occurring at the level of genes and proteins (assessed by genomics and proteomics) may or may not influence a variety of cellular functions

  18. GeNemo: a search engine for web-based functional genomic data.

    PubMed

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-07-08

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of ENCODE and mouse ENCODE datasets. Unlike text-based search engines, GeNemo's searches are based on pattern matching of functional genomic regions. This distinguishes GeNemo from text or DNA sequence searches. The user can input any complete or partial functional genomic dataset, for example, a binding intensity file (bigWig) or a peak file. GeNemo reports any genomic regions, ranging from hundred bases to hundred thousand bases, from any of the online ENCODE datasets that share similar functional (binding, modification, accessibility) patterns. This is enabled by a Markov Chain Monte Carlo-based maximization process, executed on up to 24 parallel computing threads. By clicking on a search result, the user can visually compare her/his data with the found datasets and navigate the identified genomic regions. GeNemo is available at www.genemo.org. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
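
    To make the idea of comparing functional genomic regions (rather than text or sequence) concrete, the sketch below scores two peak lists by the fraction of base pairs they share. It is only an illustration: the Jaccard score, the function names and the toy intervals are assumptions introduced here, and this is not the Markov Chain Monte Carlo-based procedure that GeNemo itself uses.

```python
# Illustrative only: a base-pair Jaccard similarity between two peak lists
# on one chromosome. Intervals are half-open (start, end) pairs, as in BED.
# This is a simple stand-in, not GeNemo's MCMC-based matching.

def merge(intervals):
    """Merge overlapping or touching intervals into a sorted, disjoint list."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def jaccard(peaks_a, peaks_b):
    """Shared base pairs divided by base pairs covered by either peak list."""
    a, b = merge(peaks_a), merge(peaks_b)
    covered_a = sum(end - start for start, end in a)
    covered_b = sum(end - start for start, end in b)

    # Two-pointer sweep over the two disjoint, sorted interval lists.
    i = j = shared = 0
    while i < len(a) and j < len(b):
        lo = max(a[i][0], b[j][0])
        hi = min(a[i][1], b[j][1])
        if lo < hi:
            shared += hi - lo
        # Advance whichever interval ends first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1

    union = covered_a + covered_b - shared
    return shared / union if union else 0.0


# Hypothetical peak lists (made-up coordinates).
user_peaks = [(100, 200), (300, 400)]
reference_peaks = [(150, 250), (350, 450)]
similarity = jaccard(user_peaks, reference_peaks)
```

    With these toy intervals the two lists share 100 of the 300 base pairs covered by either list, so the score is one third.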

  19. Confronting the catalytic dark matter encoded by sequenced genomes

    PubMed Central

    Ellens, Kenneth W.; Christian, Nils; Singh, Charandeep; Satagopam, Venkata P.

    2017-01-01

    The post-genomic era has provided researchers with a deluge of protein sequences. However, a significant fraction of the proteins encoded by sequenced genomes remains without an identified function. Here, we aim at determining how many enzymes of uncertain or unknown function are still present in the Saccharomyces cerevisiae and human proteomes. Using information available in the Swiss-Prot, BRENDA and KEGG databases in combination with a Hidden Markov Model-based method, we estimate that >600 yeast and 2000 human proteins (>30% of their proteins of unknown function) are enzymes whose precise function(s) remain(s) to be determined. This illustrates the impressive scale of the ‘unknown enzyme problem’. We extensively review classical biochemical as well as more recent systematic experimental and computational approaches that can be used to support enzyme function discovery research. Finally, we discuss the possible roles of the elusive catalysts in light of recent developments in the fields of enzymology and metabolism as well as the significance of the unknown enzyme problem in the context of metabolic modeling, metabolic engineering and rare disease research. PMID:29059321

  20. Genome-wide screening and identification of antigens for rickettsial vaccine development

    USDA-ARS's Scientific Manuscript database

    The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...

  1. GAPP: A Proteogenomic Software for Genome Annotation and Global Profiling of Post-translational Modifications in Prokaryotes.

    PubMed

    Zhang, Jia; Yang, Ming-Kun; Zeng, Honghui; Ge, Feng

    2016-11-01

    Although the number of sequenced prokaryotic genomes is growing rapidly, experimentally verified annotation of prokaryotic genome remains patchy and challenging. To facilitate genome annotation efforts for prokaryotes, we developed an open source software called GAPP for genome annotation and global profiling of post-translational modifications (PTMs) in prokaryotes. With a single command, it provides a standard workflow to validate and refine predicted genetic models and discover diverse PTM events. We demonstrated the utility of GAPP using proteomic data from Helicobacter pylori, one of the major human pathogens that is responsible for many gastric diseases. Our results confirmed 84.9% of the existing predicted H. pylori proteins, identified 20 novel protein coding genes, and corrected four existing gene models with regard to translation initiation sites. In particular, GAPP revealed a large repertoire of PTMs using the same proteomic data and provided a rich resource that can be used to examine the functions of reversible modifications in this human pathogen. This software is a powerful tool for genome annotation and global discovery of PTMs and is applicable to any sequenced prokaryotic organism; we expect that it will become an integral part of ongoing genome annotation efforts for prokaryotes. GAPP is freely available at https://sourceforge.net/projects/gappproteogenomic/. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  2. A Proteomic Approach to Investigating Gene Cluster Expression and Secondary Metabolite Functionality in Aspergillus fumigatus

    PubMed Central

    Owens, Rebecca A.; Hammel, Stephen; Sheridan, Kevin J.; Jones, Gary W.; Doyle, Sean

    2014-01-01

    A combined proteomics and metabolomics approach was utilised to advance the identification and characterisation of secondary metabolites in Aspergillus fumigatus. Here, implementation of a shotgun proteomic strategy led to the identification of non-redundant mycelial proteins (n = 414) from A. fumigatus including proteins typically under-represented in 2-D proteome maps: proteins with multiple transmembrane regions, hydrophobic proteins and proteins with extremes of molecular mass and pI. Indirect identification of secondary metabolite cluster expression was also achieved, with proteins (n = 18) from LaeA-regulated clusters detected, including GliT encoded within the gliotoxin biosynthetic cluster. Biochemical analysis then revealed that gliotoxin significantly attenuates H2O2-induced oxidative stress in A. fumigatus (p>0.0001), confirming observations from proteomics data. A complementary 2-D/LC-MS/MS approach further elucidated significantly increased abundance (p<0.05) of proliferating cell nuclear antigen (PCNA), NADH-quinone oxidoreductase and the gliotoxin oxidoreductase GliT, along with significantly attenuated abundance (p<0.05) of a heat shock protein, an oxidative stress protein and an autolysis-associated chitinase, when gliotoxin and H2O2 were present, compared to H2O2 alone. Moreover, gliotoxin exposure significantly reduced the abundance of selected proteins (p<0.05) involved in de novo purine biosynthesis. Significantly elevated abundance (p<0.05) of a key enzyme, xanthine-guanine phosphoribosyl transferase Xpt1, utilised in purine salvage, was observed in the presence of H2O2 and gliotoxin. This work provides new insights into the A. fumigatus proteome and experimental strategies, plus mechanistic data pertaining to gliotoxin functionality in the organism. PMID:25198175

  3. Proteomics of the Lysosome

    PubMed Central

    Lübke, Torben; Lobel, Peter; Sleat, David

    2009-01-01

    Defects in lysosomal function have been associated with numerous monogenic human diseases typically classified as lysosomal storage diseases. However, there is increasing evidence that lysosomal proteins are also involved in more widespread human diseases including cancer and Alzheimer disease. Thus, there is a continuing interest in understanding the cellular functions of the lysosome and an emerging approach to this is the identification of its constituent proteins by proteomic analyses. To date, the mammalian lysosome has been shown to contain ~60 soluble luminal proteins and ~25 transmembrane proteins. However, recent proteomic studies based upon affinity purification of soluble components or subcellular fractionation to obtain both soluble and membrane components suggest that there may be many more of both classes of protein resident within this organelle than previously appreciated. Discovery of such proteins has important implications for understanding the function and the dynamics of the lysosome but can also lead the way towards the discovery of the genetic basis for human diseases of hitherto unknown etiology. Here, we describe current approaches to lysosomal proteomics and data interpretation and review the new lysosomal proteins that have recently emerged from such studies. PMID:18977398

  4. Post-genomics nanotechnology is gaining momentum: nanoproteomics and applications in life sciences.

    PubMed

    Kobeissy, Firas H; Gulbakan, Basri; Alawieh, Ali; Karam, Pierre; Zhang, Zhiqun; Guingab-Cagmat, Joy D; Mondello, Stefania; Tan, Weihong; Anagli, John; Wang, Kevin

    2014-02-01

    The post-genomics era has brought about new Omics biotechnologies, such as proteomics and metabolomics, as well as their novel applications to personal genomics and the quantified self. These advances are now also catalyzing other and newer post-genomics innovations, leading to convergences between Omics and nanotechnology. In this work, we systematically contextualize and exemplify an emerging strand of post-genomics life sciences, namely, nanoproteomics and its applications in health and integrative biological systems. Nanotechnology has been utilized as a complementary component to revolutionize proteomics through different kinds of nanotechnology applications, including nanoporous structures, functionalized nanoparticles, quantum dots, and polymeric nanostructures. Those applications, though still in their infancy, have led to several highly sensitive diagnostics and new methods of drug delivery and targeted therapy for clinical use. The present article differs from previous analyses of nanoproteomics in that it offers an in-depth and comparative evaluation of the attendant biotechnology portfolio and their applications as seen through the lens of post-genomics life sciences and biomedicine. These include: (1) immunosensors for inflammatory, pathogenic, and autoimmune markers for infectious and autoimmune diseases, (2) amplified immunoassays for detection of cancer biomarkers, and (3) methods for targeted therapy and automatically adjusted drug delivery such as in experimental stroke and brain injury studies. As nanoproteomics becomes available both to the clinician at the bedside and the citizens who are increasingly interested in access to novel post-genomics diagnostics through initiatives such as the quantified self, we anticipate further breakthroughs in personalized and targeted medicine.

  5. Post-Genomics Nanotechnology Is Gaining Momentum: Nanoproteomics and Applications in Life Sciences

    PubMed Central

    Kobeissy, Firas H.; Gulbakan, Basri; Alawieh, Ali; Karam, Pierre; Zhang, Zhiqun; Guingab-Cagmat, Joy D.; Mondello, Stefania; Tan, Weihong; Anagli, John

    2014-01-01

    The post-genomics era has brought about new Omics biotechnologies, such as proteomics and metabolomics, as well as their novel applications to personal genomics and the quantified self. These advances are now also catalyzing other and newer post-genomics innovations, leading to convergences between Omics and nanotechnology. In this work, we systematically contextualize and exemplify an emerging strand of post-genomics life sciences, namely, nanoproteomics and its applications in health and integrative biological systems. Nanotechnology has been utilized as a complementary component to revolutionize proteomics through different kinds of nanotechnology applications, including nanoporous structures, functionalized nanoparticles, quantum dots, and polymeric nanostructures. Those applications, though still in their infancy, have led to several highly sensitive diagnostics and new methods of drug delivery and targeted therapy for clinical use. The present article differs from previous analyses of nanoproteomics in that it offers an in-depth and comparative evaluation of the attendant biotechnology portfolio and their applications as seen through the lens of post-genomics life sciences and biomedicine. These include: (1) immunosensors for inflammatory, pathogenic, and autoimmune markers for infectious and autoimmune diseases, (2) amplified immunoassays for detection of cancer biomarkers, and (3) methods for targeted therapy and automatically adjusted drug delivery such as in experimental stroke and brain injury studies. As nanoproteomics becomes available both to the clinician at the bedside and the citizens who are increasingly interested in access to novel post-genomics diagnostics through initiatives such as the quantified self, we anticipate further breakthroughs in personalized and targeted medicine. PMID:24410486

  6. Optimizing Algorithm Choice for Metaproteomics: Comparing X!Tandem and Proteome Discoverer for Soil Proteomes

    NASA Astrophysics Data System (ADS)

    Diaz, K. S.; Kim, E. H.; Jones, R. M.; de Leon, K. C.; Woodcroft, B. J.; Tyson, G. W.; Rich, V. I.

    2014-12-01

    The growing field of metaproteomics links microbial communities to their expressed functions by using mass spectrometry methods to characterize community proteins. Comparison of mass spectrometry protein search algorithms and their biases is crucial for maximizing the quality and amount of protein identifications in mass spectral data. Available algorithms employ different approaches when mapping mass spectra to peptides against a database. We compared mass spectra from four microbial proteomes derived from high-organic content soils searched with two search algorithms: 1) Sequest HT as packaged within Proteome Discoverer (v.1.4) and 2) X!Tandem as packaged in TransProteomicPipeline (v.4.7.1). Searches used matched metagenomes, and results were filtered to allow identification of high probability proteins. There was little overlap in proteins identified by both algorithms, on average just ~24% of the total. However, when adjusted for spectral abundance, the overlap improved to ~70%. Proteome Discoverer generally outperformed X!Tandem, identifying an average of 12.5% more proteins than X!Tandem, with X!Tandem identifying more proteins only in the first two proteomes. For spectrally-adjusted results, the algorithms were similar, with X!Tandem marginally outperforming Proteome Discoverer by an average of ~4%. We then assessed differences in heat shock proteins (HSP) identification by the two algorithms by BLASTing identified proteins against the Heat Shock Protein Information Resource, because HSP hits typically account for the majority signal in proteomes, due to extraction protocols. Total HSP identifications for each of the 4 proteomes were ~15%, ~11%, ~17%, and ~19%, with ~14% for total HSPs with redundancies removed. Of the ~15% average of proteins from the 4 proteomes identified as HSPs, ~10% of proteins and spectra were identified by both algorithms. On average, Proteome Discoverer identified ~9% more HSPs than X!Tandem.
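
    The contrast between raw overlap and overlap adjusted for spectral abundance can be illustrated with a small sketch. The two definitions used here (shared proteins over all distinct proteins, and spectra assigned to shared proteins over all assigned spectra) are assumptions chosen for illustration, since the abstract does not spell out its formulas; the protein names and counts are made up.

```python
# Illustrative only: two ways to score agreement between two search engines.
# The formulas are assumptions for this sketch, not the study's definitions.

def protein_overlap(counts_a, counts_b):
    """Shared protein IDs as a fraction of all distinct protein IDs."""
    ids_a, ids_b = set(counts_a), set(counts_b)
    union = ids_a | ids_b
    return len(ids_a & ids_b) / len(union) if union else 0.0


def spectral_overlap(counts_a, counts_b):
    """Spectra assigned to shared proteins as a fraction of all spectra."""
    shared = set(counts_a) & set(counts_b)
    total = sum(counts_a.values()) + sum(counts_b.values())
    if total == 0:
        return 0.0
    shared_spectra = sum(counts_a[p] + counts_b[p] for p in shared)
    return shared_spectra / total


# Hypothetical spectral counts per protein for each search engine.
engine_a = {"P1": 50, "P2": 30, "P3": 1, "P4": 1}
engine_b = {"P1": 45, "P2": 35, "P5": 1, "P6": 2}

raw = protein_overlap(engine_a, engine_b)        # 2 of 6 proteins shared
weighted = spectral_overlap(engine_a, engine_b)  # 160 of 165 spectra shared
```

    In this toy case the engines disagree only on low-abundance proteins, so the raw overlap is low while the spectrally weighted overlap is high, which is the same qualitative pattern the abstract reports.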

  7. A complete mass spectrometric map for the analysis of the yeast proteome and its application to quantitative trait analysis

    PubMed Central

    Picotti, Paola; Clement-Ziza, Mathieu; Lam, Henry; Campbell, David S.; Schmidt, Alexander; Deutsch, Eric W.; Röst, Hannes; Sun, Zhi; Rinner, Oliver; Reiter, Lukas; Shen, Qin; Michaelson, Jacob J.; Frei, Andreas; Alberti, Simon; Kusebauch, Ulrike; Wollscheid, Bernd; Moritz, Robert; Beyer, Andreas; Aebersold, Ruedi

    2013-01-01

    Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map, one supporting discovery-driven (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways. PMID:23334424

  8. Proteomic Assessment of Fluid Shifts and Association with Visual Impairment and Intracranial Pressure in Twin Astronauts

    NASA Technical Reports Server (NTRS)

    Rana, Brinda K.; Stenger, Michael B.; Lee, Stuart M. C.; Macias, Brandon R.; Siamwala, Jamila; Piening, Brian Donald; Hook, Vivian; Ebert, Doug; Patel, Hemal; Smith, Scott

    2016-01-01

    BACKGROUND: Astronauts participating in long duration space missions are at an increased risk of physiological disruptions. The development of visual impairment and intracranial pressure (VIIP) syndrome is one of the leading health concerns for crew members on long-duration space missions; microgravity-induced fluid shifts and chronic elevated cabin CO2 may be contributing factors. By studying physiological and molecular changes in one identical twin during his 1-year ISS mission and his ground-based co-twin, this work extends a current NASA-funded investigation to assess space flight induced "Fluid Shifts" in association with the development of VIIP. This twin study uniquely integrates physiological and -omic signatures to further our understanding of the molecular mechanisms underlying space flight-induced VIIP. We are: (i) conducting longitudinal proteomic assessments of plasma to identify fluid regulation-related molecular pathways altered by long-term space flight; and (ii) integrating physiological and proteomic data with genomic data to understand the genomic mechanism by which these proteomic signatures are regulated. PURPOSE: We are exploring proteomic signatures and genomic mechanisms underlying space flight-induced VIIP symptoms with the future goal of developing early biomarkers to detect and monitor the progression of VIIP. This study is the first to employ a male monozygous twin pair to systematically determine the impact of fluid distribution in microgravity, integrating a comprehensive set of structural and functional measures with proteomic, metabolomic and genomic data. This project has a broader impact on Earth-based clinical areas, such as traumatic brain injury-induced elevations of intracranial pressure, hydrocephalus, and glaucoma. HYPOTHESIS: We predict that the space-flown twin will experience a space flight-induced alteration in proteins and peptides related to fluid balance, fluid control and brain injury as compared to his pre-flight protein

  9. Plant membrane proteomics.

    PubMed

    Ephritikhine, Geneviève; Ferro, Myriam; Rolland, Norbert

    2004-12-01

    Plant membrane proteins are involved in many different functions according to their location in the cell. For instance, the chloroplast has two membrane systems, thylakoids and envelope, with specialized membrane proteins for photosynthesis and metabolite and ion transporters, respectively. Although recent advances in sample preparation and analytical techniques have been achieved for the study of membrane proteins, the characterization of these proteins, especially the hydrophobic ones, is still challenging. The present review highlights recent advances in methodologies for identification of plant membrane proteins from purified subcellular structures. The interest of combining several complementary extraction procedures to take into account specific features of membrane proteins is discussed in the light of recent proteomics data, notably for chloroplast envelope, mitochondrial membranes and plasma membrane from Arabidopsis. These examples also illustrate how, on one hand, proteomics can feed bioinformatics for a better definition of prediction tools and, on the other hand, although prediction tools are not 100% reliable, they can give valuable information for biological investigations. In particular, membrane proteomics brings new insights over plant membrane systems, on both the membrane compartment where proteins are working and their putative cellular function.

  10. Functional Insights from Structural Genomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Forouhar,F.; Kuzin, A.; Seetharaman, J.

    2007-01-01

    Structural genomics efforts have produced structural information, either directly or by modeling, for thousands of proteins over the past few years. While many of these proteins have known functions, a large percentage of them have not been characterized at the functional level. The structural information has provided valuable functional insights on some of these proteins, through careful structural analyses, serendipity, and structure-guided functional screening. Some of the success stories based on structures solved at the Northeast Structural Genomics Consortium (NESG) are reported here. These include a novel methyl salicylate esterase with important role in plant innate immunity, a novel RNA methyltransferase (H. influenzae yggJ (HI0303)), a novel spermidine/spermine N-acetyltransferase (B. subtilis PaiA), a novel methyltransferase or AdoMet binding protein (A. fulgidus AF_0241), an ATP:cob(I)alamin adenosyltransferase (B. subtilis YvqK), a novel carboxysome pore (E. coli EutN), a proline racemase homolog with a disrupted active site (B. melitensis BME11586), an FMN-dependent enzyme (S. pneumoniae SP_1951), and a 12-stranded β-barrel with a novel fold (V. parahaemolyticus VPA1032).

  11. Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

    PubMed

    Zhu, Yafeng; Engström, Pär G; Tellgren-Roth, Christian; Baudo, Charles D; Kennell, John C; Sun, Sheng; Billmyre, R Blake; Schröder, Markus S; Andersson, Anna; Holm, Tina; Sigurgeirsson, Benjamin; Wu, Guangxi; Sankaranarayanan, Sundar Ram; Siddharthan, Rahul; Sanyal, Kaustuv; Lundeberg, Joakim; Nystedt, Björn; Boekhout, Teun; Dawson, Thomas L; Heitman, Joseph; Scheynius, Annika; Lehtiö, Janne

    2017-03-17

    Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
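
    The gene counts quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is not from the paper; the function and variable names are illustrative, and it assumes the 9% curation gain is measured relative to the 4113 genes obtained after adding proteomics evidence rather than relative to the original 3612.

```python
def percent_increase(before: int, after: int) -> float:
    """Relative increase from `before` to `after`, in percent."""
    return 100.0 * (after - before) / before


# Gene counts as reported in the abstract.
transcriptomics_only = 3612   # annotation from transcriptomics evidence alone
with_proteomics = 4113        # after integrating proteomics data
after_curation = 4493         # after manual curation

# Gain from adding proteomics evidence (abstract reports 14%).
proteogenomics_gain = percent_increase(transcriptomics_only, with_proteomics)

# Gain from manual curation, assumed to be relative to 4113 (abstract reports 9%).
curation_gain = percent_increase(with_proteomics, after_curation)

print(f"Proteomics integration: +{proteogenomics_gain:.1f}%")  # about 13.9%
print(f"Manual curation:        +{curation_gain:.1f}%")        # about 9.2%
```

    Both values round to the figures given in the abstract, which supports the assumed baseline for the second percentage.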

  12. Characterizing genomic alterations in cancer by complementary functional associations | Office of Cancer Genomics

    Cancer.gov

    Systematic efforts to sequence the cancer genome have identified large numbers of mutations and copy number alterations in human cancers. However, elucidating the functional consequences of these variants, and their interactions to drive or maintain oncogenic states, remains a challenge in cancer research. We developed REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene dependency of oncogenic pathways or sensitivity to a drug treatment.

  13. The bovine lactation genome: Insights into the evolution of mammalian milk

    USDA-ARS's Scientific Manuscript database

    The newly assembled Bos taurus genome sequence enables the linkage of bovine milk and lactation data with other mammalian genomes. Using publicly available milk proteome data and mammary expressed sequence tags, 197 milk protein genes and over 6,000 mammary genes were identified in the bovine genome...

  14. Integrated proteomics, genomics, metabolomics approaches reveal oxalic acid as pathogenicity factor in Tilletia indica inciting Karnal bunt disease of wheat.

    PubMed

    Pandey, Vishakha; Singh, Manoj; Pandey, Dinesh; Kumar, Anil

    2018-05-18

    Tilletia indica incites Karnal bunt (KB) disease in wheat. To date, no KB-resistant wheat cultivar has been developed, owing to the lack of potential biomarkers related to pathogenicity/virulence for screening resistant wheat genotypes. The present study was carried out to compare the proteomes of high (TiK) and low (TiP) virulence isolates of T. indica. Twenty-one protein spots consistently observed as up-regulated/differential in the TiK proteome were selected for identification by MALDI-TOF/TOF. The identified sequences showed homology with fungal proteins that play essential roles in plant infection and pathogen survival, including stress response, adhesion, fungal penetration, invasion, colonization, degradation of the host cell wall, and signal transduction pathways. These results were integrated with the T. indica genome sequence to identify homologs of candidate pathogenicity/virulence-related proteins. One protein identified in the TiK isolate was malate dehydrogenase, which converts malate to oxaloacetate, a precursor of oxalic acid. Oxalic acid is a key pathogenicity factor in phytopathogenic fungi. These results were validated by GC-MS-based metabolic profiling of T. indica isolates, which detected oxalic acid exclusively in the TiK isolate. Thus, integrated omics approaches lead to the identification of pathogenicity/virulence factor(s) that would provide insights into the pathogenic mechanisms of fungi and aid in devising effective disease management strategies.

  15. Meta-analysis of global metabolomics and proteomics data to link alterations with phenotype

    DOE PAGES

    Patti, Gary J.; Tautenhahn, Ralf; Fonslow, Bryan R.; ...

    2011-01-01

    Global metabolomics has emerged as a powerful tool to interrogate cellular biochemistry at the systems level by tracking alterations in the levels of small molecules. One approach to define cellular dynamics with respect to this dysregulation of small molecules has been to consider metabolic flux as a function of time. While flux measurements have proven effective for model organisms, acquiring multiple time points at appropriate temporal intervals for many sample types (e.g., clinical specimens) is challenging. As an alternative, meta-analysis provides another strategy for delineating metabolic cause and effect perturbations. That is, the combination of untargeted metabolomic data from multiple pairwise comparisons enables the association of specific changes in small molecules with unique phenotypic alterations. We recently developed metabolomic software called metaXCMS to automate these types of higher order comparisons. Here we discuss the potential of metaXCMS for analyzing proteomic datasets and highlight the biological value of combining meta-results from both metabolomic and proteomic analyses. The combined meta-analysis has the potential to facilitate efforts in functional genomics and the identification of metabolic disruptions related to disease pathogenesis.

  16. Computational Prediction of the Global Functional Genomic Landscape: Applications, Methods and Challenges

    PubMed Central

    Zhou, Weiqiang; Sherwood, Ben; Ji, Hongkai

    2017-01-01

    Technological advances have led to an explosive growth of high-throughput functional genomic data. Exploiting the correlation among different data types, it is possible to predict one functional genomic data type from other data types. Prediction tools are valuable in understanding the relationship among different functional genomic signals. They also provide a cost-efficient solution to inferring the unknown functional genomic profiles when experimental data are unavailable due to resource or technological constraints. The predicted data may be used for generating hypotheses, prioritizing targets, interpreting disease variants, facilitating data integration, quality control, and many other purposes. This article reviews various applications of prediction methods in functional genomics, discusses analytical challenges, and highlights some common and effective strategies used to develop prediction methods for functional genomic data. PMID:28076869

  17. Plasmodium vivax trophozoite-stage proteomes

    PubMed Central

    Anderson, D.C.; Lapp, Stacey A.; Akinyi, Sheila; Meyer, Esmeralda V.S.; Barnwell, John W.; Korir-Morrison, Cindy; Galinski, Mary R.

    2015-01-01

    Plasmodium vivax is the causative infectious agent of 80–300 million annual cases of malaria. Many aspects of this parasite’s biology remain unknown. To further elucidate the interaction of P. vivax with its Saimiri boliviensis host, we obtained detailed proteomes of infected red blood cells, representing the trophozoite-enriched stage of development. Data from two of three biological replicate proteomes, emphasized here, were analyzed using five search engines, which enhanced identifications and resulted in the most comprehensive P. vivax proteomes to date, with 1375 P. vivax and 3209 S. boliviensis identified proteins. Ribosome subunit proteins were noted for both P. vivax and S. boliviensis, consistent with P. vivax’s known reticulocyte host–cell specificity. A majority of the host and pathogen proteins identified belong to specific functional categories, and several parasite gene families, while 33% of the P. vivax proteins have no reported function. Hemoglobin was significantly oxidized in both proteomes, and additional protein oxidation and nitration was detected in one of the two proteomes. Detailed analyses of these post-translational modifications are presented. The proteins identified here significantly expand the known P. vivax proteome and complexity of available host protein functionality underlying the host–parasite interactive biology, and reveal unsuspected oxidative modifications that may impact protein function. Biological significance: Plasmodium vivax malaria is a serious neglected disease, causing an estimated 80 to 300 million cases annually in 95 countries. Infection can result in significant morbidity and possible death. P. vivax, unlike the much better-studied Plasmodium falciparum species, cannot be grown in long-term culture, has a dormant form in the liver called the hypnozoite stage, has a reticulocyte host–cell preference in the blood, and creates caveolae vesicle complexes at the surface of the infected reticulocyte

  18. Processing shotgun proteomics data on the Amazon cloud with the trans-proteomic pipeline.

    PubMed

    Slagel, Joseph; Mendoza, Luis; Shteynberg, David; Deutsch, Eric W; Moritz, Robert L

    2015-02-01

    Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large-scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of the cloud-enabled Trans-Proteomic Pipeline by processing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  19. Function-selective domain architecture plasticity potentials in eukaryotic genome evolution

    PubMed Central

    Linkeviciute, Viktorija; Rackham, Owen J.L.; Gough, Julian; Oates, Matt E.; Fang, Hai

    2015-01-01

    To help evaluate how protein function impacts on genome evolution, we introduce a new concept of ‘architecture plasticity potential’ – the capacity to form distinct domain architectures – both for an individual domain, or more generally for a set of domains grouped by shared function. We devise a scoring metric to measure the plasticity potential for these domain sets, and evaluate how function has changed over time for different species. Applying this metric to a phylogenetic tree of eukaryotic genomes, we find that the involvement of each function is not random but highly selective. For certain lineages there is strong bias for evolution to involve domains related to certain functions. In general eukaryotic genomes, particularly animals, expand complex functional activities such as signalling and regulation, but at the cost of reducing metabolic processes. We also observe differential evolution of transcriptional regulation and a unique evolutionary role of channel regulators; crucially this is only observable in terms of the architecture plasticity potential. Our findings provide a new layer of information to understand the significance of function in eukaryotic genome evolution. A web search tool, available at http://supfam.org/Pevo, offers a wide spectrum of options for exploring functional importance in eukaryotic genome evolution. PMID:25980317

  20. Functional proteomic analyses of Bothrops atrox venom reveals phenotypes associated with habitat variation in the Amazon.

    PubMed

    Sousa, Leijiane F; Portes-Junior, José A; Nicolau, Carolina A; Bernardoni, Juliana L; Nishiyama, Milton Y; Amazonas, Diana R; Freitas-de-Sousa, Luciana A; Mourão, Rosa Hv; Chalkidis, Hipócrates M; Valente, Richard H; Moura-da-Silva, Ana M

    2017-04-21

    Venom variability is commonly reported for venomous snakes including Bothrops atrox. Here, we compared the composition of venoms from B. atrox snakes collected at Amazonian conserved habitats (terra-firme upland forest and várzea) and human modified areas (pasture and degraded areas). Venom samples were submitted to shotgun proteomic analysis as a whole or compared after fractionation by reversed-phase chromatography. Whole venom proteomes revealed a similar composition among the venoms with predominance of SVMPs, CTLs, and SVSPs and intermediate amounts of PLA2s and LAAOs. However, when distribution of particular isoforms was analyzed by either method, the venom from várzea snakes showed a decrease in hemorrhagic SVMPs and an increase in SVSPs, and procoagulant SVMPs and PLA2s. These differences were validated by experimental approaches including both enzymatic and in vivo assays, and indicated restrictions with respect to antivenom efficacy against variable components. Thus, proteomic analysis at the isoform level combined with in silico prediction of functional properties may indicate venom biological activity. These results also suggest that the prevalence of functionally distinct isoforms contributes to the variability of the venoms and could reflect the adaptation of B. atrox to distinct prey communities in different Amazon habitats. In this report, we compared isoforms present in venoms from snakes collected at different Amazonian habitats. By means of a species venom gland transcriptome and the in silico functional prediction of each isoform, we were able to predict the principal venom activities in vitro and in animal models. We also showed remarkable differences in the venom pools from snakes collected at the floodplain (várzea habitat) compared to other habitats. Not only was this venom less hemorrhagic and more procoagulant, when compared to the venom pools from the other three habitats studied, but also this enhanced procoagulant activity was not