Science.gov

Sample records for functional genomics proteomics

  1. Genomes to Proteomes

    SciTech Connect

    Panisko, Ellen A.; Grigoriev, Igor; Daly, Don S.; Webb-Robertson, Bobbie-Jo; Baker, Scott E.

    2009-03-01

    Biologists are awash with genomic sequence data. In large part, this is due to the rapid acceleration in the generation of DNA sequence that occurred as public and private research institutes raced to sequence the human genome. In parallel with the large human genome effort, mostly smaller genomes of other important model organisms were sequenced. Projects following on these initial efforts have made use of technological advances and the DNA sequencing infrastructure that was built for the human and other organism genome projects. As a result, the genome sequences of many organisms are available in high quality draft form. While in many ways this is good news, there are limitations to the biological insights that can be gleaned from DNA sequences alone; genome sequences offer only a bird's eye view of the biological processes endemic to an organism or community. Fortunately, the genome sequences now being produced at such a high rate can serve as the foundation for other global experimental platforms such as proteomics. Proteomic methods offer a snapshot of the proteins present at a point in time for a given biological sample. Current global proteomics methods combine enzymatic digestion, separations, mass spectrometry and database searching for peptide identification. One key aspect of proteomics is the prediction of peptide sequences from mass spectrometry data. Global proteomic analysis uses computational matching of experimental mass spectra with predicted spectra based on databases of gene models that are often generated computationally. Thus, the quality of gene models predicted from a genome sequence is crucial in the generation of high quality peptide identifications. Once peptides are identified they can be assigned to their parent protein. Proteins identified as expressed in a given experiment are most useful when compared to other expressed proteins in a larger biological context or biochemical pathway. In this chapter we will discuss the automatic

  2. Functional genomics and proteomics of the cellular osmotic stress response in 'non-model' organisms.

    PubMed

    Kültz, Dietmar; Fiol, Diego; Valkova, Nelly; Gomez-Jimenez, Silvia; Chan, Stephanie Y; Lee, Jinoo

    2007-05-01

    All organisms are adapted to well-defined extracellular salinity ranges. Osmoregulatory mechanisms spanning all levels of biological organization, from molecules to behavior, are central to salinity adaptation. Functional genomics and proteomics approaches represent powerful tools for gaining insight into the molecular basis of salinity adaptation and euryhalinity in animals. In this review, we discuss our experience in applying such tools to so-called 'non-model' species, including euryhaline animals that are well-suited for studies of salinity adaptation. Suppression subtractive hybridization, RACE-PCR and mass spectrometry-driven proteomics can be used to identify genes and proteins involved in salinity adaptation or other environmental stress responses in tilapia, sharks and sponges. For protein identification in non-model species, algorithms based on sequence homology searches such as MSBLASTP2 are most powerful. Subsequent gene ontology and pathway analysis can then utilize sets of identified genes and proteins for modeling molecular mechanisms of environmental adaptation. Current limitations for proteomics in non-model species can be overcome by improving sequence coverage, N- and C-terminal sequencing and analysis of intact proteins. Dependence on information about biochemical pathways and gene ontology databases for model species represents a more severe barrier for work with non-model species. To minimize such dependence, focusing on a single biological process (rather than attempting to describe the system as a whole) is key when applying 'omics' approaches to non-model organisms. PMID:17449824

  3. Revisiting the Identification of Canonical Splice Isoforms through Integration of Functional Genomics and Proteomics Evidence

    PubMed Central

    Li, Hong-Dong; Menon, Rajasree; Omenn, Gilbert S.; Guan, Yuanfang

    2014-01-01

    Canonical isoforms in different databases have been defined as the most prevalent, most conserved, most expressed, longest, or the one with the clearest description of domains or post-translational modifications. In this article, we revisit these definitions of canonical isoforms based on functional genomics and proteomics evidence, focusing on mouse data. We report a novel functional relationship network-based approach for identifying the Highest Connected Isoforms (HCIs). We show that 46% of these HCIs are not the longest transcripts. In addition, this approach revealed many genes that have more than one highly connected isoforms. Averaged across 175 RNA-seq datasets covering diverse tissues and conditions, 65% of the HCIs show higher expression levels than non-highest connected isoforms (NCIs) at the transcript level. At the protein level, these HCIs highly overlap with the expressed splice variants, based on proteomic data from eight different normal tissues. These results suggest that a more confident definition of canonical isoforms can be made through integration of multiple lines of evidence, including highest connected isoforms defined by biological processes and pathways, expression prevalence at the transcript level, and relative or absolute abundance at the protein level. This integrative proteogenomics approach can successfully identify principal isoforms that are responsible for the canonical functions of genes. PMID:25265570

  4. ProGeRF: Proteome and Genome Repeat Finder Utilizing a Fast Parallel Hash Function

    PubMed Central

    Moraes, Walas Jhony Lopes; Rodrigues, Thiago de Souza; Bartholomeu, Daniella Castanheira

    2015-01-01

    Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing-exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be efficient, fast, and accurate and primarily user-friendly web tool allowing many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity. PMID:25811026

  5. Bacillus subtilis functional genomics: global characterization of the stringent response by proteome and transcriptome analysis†

    PubMed Central

    Eymann, Christine; Homuth, Georg; Scharf, Christian; Hecker, Michael

    2002-01-01

    The stringent response in Bacillus subtilis was characterized by using proteome and transcriptome approaches. Comparison of protein synthesis patterns of wild-type and relA mutant cells cultivated under conditions which provoke the stringent response revealed significant differences. According to their altered synthesis patterns in response to dl-norvaline, proteins were assigned to four distinct classes: (i) negative stringent control, i.e., strongly decreased protein synthesis in the wild type but not in the relA mutant (e.g., r-proteins); (ii) positive stringent control, i.e., induction of protein synthesis in the wild type only (e.g., YvyD and LeuD); (iii) proteins that were induced independently of RelA (e.g., YjcI); and (iv) proteins downregulated independently of RelA (e.g., glycolytic enzymes). Transcriptome studies based on DNA macroarray techniques were used to complement the proteome data, resulting in comparable induction and repression patterns of almost all corresponding genes. However, a comparison of both approaches revealed that only a subset of RelA-dependent genes or proteins was detectable by proteomics, demonstrating that the transcriptome approach allows a more comprehensive global gene expression profile analysis. The present study presents the first comprehensive description of the stringent response of a bacterial species and an almost complete map of protein-encoding genes affected by (p)ppGpp. The negative stringent control concerns reactions typical of growth and reproduction (ribosome synthesis, DNA synthesis, cell wall synthesis, etc.). Negatively controlled unknown y-genes may also code for proteins with a specific function during growth and reproduction (e.g., YlaG). On the other hand, many genes are induced in a RelA-dependent manner, including genes coding for already-known and as-yet-unknown proteins. A passive model is preferred to explain this positive control relying on the redistribution of the RNA polymerase under the

  6. Integrated Molecular Signature of Disease: Analysis of Influenza Virus-Infected Macaques through Functional Genomics and Proteomics

    SciTech Connect

    Baas, T.; Baskin, C. R.; Diamond, Deborah L.; Garcia-Sastre, A.; Bielefeldt-Ohmann, H.; Tumpey, T. M.; Thomas, M. J.; Carter, V. S.; Teal, T. H.; Van Hoven, N.; Proll, Sean; Jacobs, Jon M.; Caldwell, Z.; Gritsenko, Marina A.; Hukkanen, R.; Camp, David G.; Smith, Richard D.; Katze, Michael G.

    2006-11-01

    Recent outbreaks of avian influenza in humans have stressed the need for an improved non-human primate model of influenza pathogenesis. In order to develop our macaque model, we expanded our in vivo and functional genomics experiments: We focused on the innate immune response at day 2 post-inoculation and on gene expression in affected lung tissue with viral genetic material present; finally, we sought to identify signature genes for early infection in whole blood. For these purposes, we infected six pigtailed macaques with 107 TCID50 of influenza A/Texas/36/91 virus and three control animals with a sham inoculate. We sacrificed one control and two experimental animals at day 2, 4, and 7 and lung tissue was harvested for pathology, gene expression profiling, and proteomics. Additionally, blood was collected for genomics every other day from each animal until its endpoint. Gross and microscopic pathology, immunohistochemistry, viral gene expression by arrays and/or quantitative real-time RT-PCR confirmed successful yet mild infection in all experimental animals. Genomic experiments were performed using second generation macaque-specific oligonucleotide arrays and high-throughput proteomics revealed host response to infection at the protein level. Our data showed dramatic differences in gene expression within the same influenza-induced lesion based on the presence or absence of viral mRNA. We also identified genes tightly co-regulated in peripheral white blood cells and in lung tissue at day 2 post-inoculation. This latter finding opens the possibility of using gene expression arrays on whole blood to detect infection after exposure but prior to onset of symptoms or shedding.

  7. IL-15Rα deficiency in skeletal muscle alters respiratory function and the proteome of mitochondrial subpopulations independent of changes to the mitochondrial genome.

    PubMed

    O'Connell, Grant C; Nichols, Cody; Guo, Ge; Croston, Tara L; Thapa, Dharendra; Hollander, John M; Pistilli, Emidio E

    2015-11-01

    Interleukin-15 receptor alpha knockout (IL15RαKO) mice exhibit a greater skeletal muscle mitochondrial density with an altered mitochondrial morphology. However, the mechanism and functional impact of these changes have not been determined. In this study, we characterized the functional, proteomic, and genomic alterations in mitochondrial subpopulations isolated from the skeletal muscles of IL15RαKO mice and B6129 background control mice. State 3 respiration was greater in interfibrillar mitochondria and whole muscle ATP levels were greater in IL15RαKO mice supporting the increases in respiration rate. However, the state 3/state 4 ratio was lower, suggesting some degree of respiratory uncoupling. Proteomic analyses identified several markers independently in mitochondrial subpopulations that are associated with these functional alterations. Next Generation Sequencing of mtDNA revealed a high degree of similarity between the mitochondrial genomes of IL15RαKO mice and controls in terms of copy number, consensus coding and the presence of minor alleles, suggesting that the functional and proteomic alterations we observed occurred independent of alterations to the mitochondrial genome. These data provide additional evidence to implicate IL-15Rα as a regulator of skeletal muscle phenotypes through effects on the mitochondrion, and suggest these effects are driven by alterations to the mitochondrial proteome. PMID:26458787

  8. Algal Functional Annotation Tool from the DOE-UCLA Institute for Genomics and Proteomics

    DOE Data Explorer

    Lopez, David

    The Algal Functional Annotation Tool is a bioinformatics resource to visualize pathway maps, identify enriched biological terms, or convert gene identifiers to elucidate biological function in silico. These types of analysis have been catered to support lists of gene identifiers, such as those coming from transcriptome gene expression analysis. By analyzing the functional annotation of an interesting set of genes, common biological motifs may be elucidated and a first-pass analysis can point further research in the right direction. Currently, the following databases have been parsed, processed, and added to the tool: 1( Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathways Database, 2) MetaCyc Encyclopedia of Metabolic Pathways, 3) Panther Pathways Database, 4) Reactome Pathways Database, 5) Gene Ontology, 6) MapMan Ontology, 7) KOG (Eukaryotic Clusters of Orthologous Groups), 5)Pfam, 6) InterPro.

  9. Functional genomics, proteomics, and regulatory DNA analysis in isogenic settings using zinc finger nuclease-driven transgenesis into a safe harbor locus in the human genome

    PubMed Central

    DeKelver, Russell C.; Choi, Vivian M.; Moehle, Erica A.; Paschon, David E.; Hockemeyer, Dirk; Meijsing, Sebastiaan H.; Sancak, Yasemin; Cui, Xiaoxia; Steine, Eveline J.; Miller, Jeffrey C.; Tam, Phillip; Bartsevich, Victor V.; Meng, Xiangdong; Rupniewski, Igor; Gopalan, Sunita M.; Sun, Helena C.; Pitz, Kathleen J.; Rock, Jeremy M.; Zhang, Lei; Davis, Gregory D.; Rebar, Edward J.; Cheeseman, Iain M.; Yamamoto, Keith R.; Sabatini, David M.; Jaenisch, Rudolf; Gregory, Philip D.; Urnov, Fyodor D.

    2010-01-01

    Isogenic settings are routine in model organisms, yet remain elusive for genetic experiments on human cells. We describe the use of designed zinc finger nucleases (ZFNs) for efficient transgenesis without drug selection into the PPP1R12C gene, a “safe harbor” locus known as AAVS1. ZFNs enable targeted transgenesis at a frequency of up to 15% following transient transfection of both transformed and primary human cells, including fibroblasts and hES cells. When added to this locus, transgenes such as expression cassettes for shRNAs, small-molecule-responsive cDNA expression cassettes, and reporter constructs, exhibit consistent expression and sustained function over 50 cell generations. By avoiding random integration and drug selection, this method allows bona fide isogenic settings for high-throughput functional genomics, proteomics, and regulatory DNA analysis in essentially any transformed human cell type and in primary cells. PMID:20508142

  10. Complete genome and proteome of Acholeplasma laidlawii.

    PubMed

    Lazarev, V N; Levitskii, S A; Basovskii, Y I; Chukin, M M; Akopian, T A; Vereshchagin, V V; Kostrjukova, E S; Kovaleva, G Y; Kazanov, M D; Malko, D B; Vitreschak, A G; Sernova, N V; Gelfand, M S; Demina, I A; Serebryakova, M V; Galyamina, M A; Vtyurin, N N; Rogov, S I; Alexeev, D G; Ladygina, V G; Govorun, V M

    2011-09-01

    We present the complete genome sequence and proteogenomic map for Acholeplasma laidlawii PG-8A (class Mollicutes, order Acholeplasmatales, family Acholeplasmataceae). The genome of A. laidlawii is represented by a single 1,496,992-bp circular chromosome with an average G+C content of 31 mol%. This is the longest genome among the Mollicutes with a known nucleotide sequence. It contains genes of polymerase type I, SOS response, and signal transduction systems, as well as RNA regulatory elements, riboswitches, and T boxes. This demonstrates a significant capability for the regulation of gene expression and mutagenic response to stress. Acholeplasma laidlawii and phytoplasmas are the only Mollicutes known to use the universal genetic code, in which UGA is a stop codon. Within the Mollicutes group, only the sterol-nonrequiring Acholeplasma has the capacity to synthesize saturated fatty acids de novo. Proteomic data were used in the primary annotation of the genome, validating expression of many predicted proteins. We also detected posttranslational modifications of A. laidlawii proteins: phosphorylation and acylation. Seventy-four candidate phosphorylated proteins were found: 16 candidates are proteins unique to A. laidlawii, and 11 of them are surface-anchored or integral membrane proteins, which implies the presence of active signaling pathways. Among 20 acylated proteins, 14 contained palmitic chains, and six contained stearic chains. No residue of linoleic or oleic acid was observed. Acylated proteins were components of mainly sugar and inorganic ion transport systems and were surface-anchored proteins with unknown functions. PMID:21784942

  11. Onco-proteogenomics: cancer proteomics joins forces with genomics.

    PubMed

    Alfaro, Javier A; Sinha, Ankit; Kislinger, Thomas; Boutros, Paul C

    2014-11-01

    The complexities of tumor genomes are rapidly being uncovered, but how they are regulated into functional proteomes remains poorly understood. Standard proteomics workflows use databases of known proteins, but these databases do not capture the uniqueness of the cancer transcriptome, with its point mutations, unusual splice variants and gene fusions. Onco-proteogenomics integrates mass spectrometry-generated data with genomic information to identify tumor-specific peptides. Linking tumor-derived DNA, RNA and protein measurements into a central-dogma perspective has the potential to improve our understanding of cancer biology. PMID:25357240

  12. 2004 Structural, Function and Evolutionary Genomics

    SciTech Connect

    Douglas L. Brutlag Nancy Ryan Gray

    2005-03-23

    This Gordon conference will cover the areas of structural, functional and evolutionary genomics. It will take a systematic approach to genomics, examining the evolution of proteins, protein functional sites, protein-protein interactions, regulatory networks, and metabolic networks. Emphasis will be placed on what we can learn from comparative genomics and entire genomes and proteomes.

  13. JDIP Genomics, Antibodies, and Proteomics Core Update

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The JDIP Genomics, Proteomics, and Antibodies Core has developed several resources that are available for use by JDIP researchers. Five tasks have been completed or are in progress: Task 1 – Transposon mutants: Nearly 24,000 gene disruption M. paratuberculosis mutants are now available for JDIP re...

  14. HepatoProteomics: Applying Proteomic Technologies to the Study of Liver Function and Disease

    SciTech Connect

    Diamond, Deborah L.; Proll, Sean; Jacobs, Jon M.; Chan, Eric Y.; Camp, David G.; Smith, Richard D.; Katze, Michael G.

    2006-08-01

    The wealth of human genome sequence information now available, coupled with technological advances in robotics, nanotechnology, mass spectrometry, and information systems, has given rise to a method of scientific inquiry known as functional genomics. By using these technologies to survey gene expression and protein production on a near global scale, the goal of functional genomics is to assign biological function to genes with currently unknown roles in physiology. This approach carries particular appeal in disease research, where it can uncover the function of previously unknown genes and molecular pathways that are directly involved in disease progression. With this knowledge may come improved diagnostic techniques, prognostic capabilities, and novel therapeutic approaches. In this regard, the continuing evolution of proteomic technologies has resulted in an increasingly greater impact of proteome studies in many areas of research and hepatology is no exception. Our laboratory has been extremely active in this area, applying both genomic and proteomic technologies to the analysis of virus-host interactions in several systems, including the study of hepatitis C virus (HCV) infection and HCV-associated liver disease. Since proteomic technologies are foreign to many hepatologists (and to almost everyone else), this article will provide an overview of proteomic methods and technologies and describe how they're being used to study liver function and disease. We use our studies of HCV infection and HCV-associated liver disease to present an operational framework for performing high throughput proteome analysis and extracting biologically meaningful information.

  15. Proteomics Data on UCSC Genome Browser - Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium scientists are working together with the University of California, Santa Cruz (UCSC) Genomics Institute to provide public access to cancer proteomics data.

  16. Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling

    PubMed Central

    Medina, Ignacio; Carbonell, José; Pulido, Luis; Madeira, Sara C.; Goetz, Stefan; Conesa, Ana; Tárraga, Joaquín; Pascual-Montano, Alberto; Nogales-Cadenas, Ruben; Santoyo, Javier; García, Francisco; Marbà, Martina; Montaner, David; Dopazo, Joaquín

    2010-01-01

    Babelomics is a response to the growing necessity of integrating and analyzing different types of genomic data in an environment that allows an easy functional interpretation of the results. Babelomics includes a complete suite of methods for the analysis of gene expression data that include normalization (covering most commercial platforms), pre-processing, differential gene expression (case-controls, multiclass, survival or continuous values), predictors, clustering; large-scale genotyping assays (case controls and TDTs, and allows population stratification analysis and correction). All these genomic data analysis facilities are integrated and connected to multiple options for the functional interpretation of the experiments. Different methods of functional enrichment or gene set enrichment can be used to understand the functional basis of the experiment analyzed. Many sources of biological information, which include functional (GO, KEGG, Biocarta, Reactome, etc.), regulatory (Transfac, Jaspar, ORegAnno, miRNAs, etc.), text-mining or protein–protein interaction modules can be used for this purpose. Finally a tool for the de novo functional annotation of sequences has been included in the system. This provides support for the functional analysis of non-model species. Mirrors of Babelomics or command line execution of their individual components are now possible. Babelomics is available at http://www.babelomics.org. PMID:20478823

  17. Functional proteomics within the genus Lactobacillus.

    PubMed

    De Angelis, Maria; Calasso, Maria; Cavallo, Noemi; Di Cagno, Raffaella; Gobbetti, Marco

    2016-03-01

    Lactobacillus are mainly used for the manufacture of fermented dairy, sourdough, meat, and vegetable foods or used as probiotics. Under optimal processing conditions, Lactobacillus strains contribute to food functionality through their enzyme portfolio and the release of metabolites. An extensive genomic diversity analysis was conducted to elucidate the core features of the genus Lactobacillus, and to provide a better comprehension of niche adaptation of the strains. However, proteomics is an indispensable "omics" science to elucidate the proteome diversity, and the mechanisms of regulation and adaptation of Lactobacillus strains. This review focuses on the novel and comprehensive knowledge of functional proteomics and metaproteomics of Lactobacillus species. A large list of proteomic case studies of different Lactobacillus species is provided to illustrate the adaptability of the main metabolic pathways (e.g., carbohydrate transport and metabolism, pyruvate metabolism, proteolytic system, amino acid metabolism, and protein synthesis) to various life conditions. These investigations have highlighted that lactobacilli modulate the level of a complex panel of proteins to growth/survive in different ecological niches. In addition to the general regulation and stress response, specific metabolic pathways can be switched on and off, modifying the behavior of the strains. PMID:27001126

  18. Plant functional genomics

    NASA Astrophysics Data System (ADS)

    Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf

    2002-04-01

    Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate for comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot solely be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.

  19. DEFINING THE MANDATE OF PROTEOMICS IN THE POST-GENOMIC ERA: WORKSHOP REPORT

    EPA Science Inventory

    Research in proteomics is the next step after genomics in understanding life processes at the molecular level. In the largest sense proteomics encompasses knowledge of the structure, function and expression of all proteins in the biochemical or biological contexts of all organism...

  20. Genomic, Proteomic, and Metabolomic Data Integration Strategies

    PubMed Central

    Wanichthanarak, Kwanjeera; Fahrmann, Johannes F; Grapov, Dmitry

    2015-01-01

    Robust interpretation of experimental results measuring discreet biological domains remains a significant challenge in the face of complex biochemical regulation processes such as organismal versus tissue versus cellular metabolism, epigenetics, and protein post-translational modification. Integration of analyses carried out across multiple measurement or omic platforms is an emerging approach to help address these challenges. This review focuses on select methods and tools for the integration of metabolomic with genomic and proteomic data using a variety of approaches including biochemical pathway-, ontology-, network-, and empirical-correlation-based methods. PMID:26396492

  1. Genomics and functional genomics with haloarchaea.

    PubMed

    Soppa, J; Baumann, A; Brenneis, M; Dambeck, M; Hering, O; Lange, C

    2008-09-01

    The first haloarchaeal genome was published in 2000 and today five genome sequences are available. Transcriptome and proteome analyses have been established for two and three haloarchaeal species, respectively, and more than 20 studies using these functional genomic approaches have been published in the last two years. These studies gave global overviews of metabolic regulation (aerobic and anaerobic respiration, phototrophy, carbon source usage), stress response (UV, X-rays, transition metals, osmotic and temperature stress), cell cycle-dependent transcript level regulation, and transcript half-lives. The only translatome analysis available for any prokaryotic species revealed that 10 and 20% of all transcripts are translationally regulated in Haloferax volcanii and Halobacterium salinarum, respectively. Very effective methods for the construction of in frame deletion mutants have been established recently for haloarchaea and are intensively used to unravel the biological roles of genes in this group. Bioinformatic analyses include both cross-genome comparisons as well as integration of genomic data with experimental results. The first systems biology approaches have been performed that used experimental data to construct predictive models of gene expression and metabolism, respectively. In this contribution the current status of genomics, functional genomics, and molecular genetics of haloarchaea is summarized and selected examples are discussed. PMID:18493745

  2. Integrative Analysis of Metabolomic, Proteomic and Genomic Data to Reveal Functional Pathways and Candidate Genes for Drip Loss in Pigs.

    PubMed

    Welzenbach, Julia; Neuhoff, Christiane; Heidt, Hanna; Cinar, Mehmet Ulas; Looft, Christian; Schellander, Karl; Tholen, Ernst; Große-Brinkhaus, Christine

    2016-01-01

    The aim of this study was to integrate multi omics data to characterize underlying functional pathways and candidate genes for drip loss in pigs. The consideration of different omics levels allows elucidating the black box of phenotype expression. Metabolite and protein profiling was applied in Musculus longissimus dorsi samples of 97 Duroc × Pietrain pigs. In total, 126 and 35 annotated metabolites and proteins were quantified, respectively. In addition, all animals were genotyped with the porcine 60 k Illumina beadchip. An enrichment analysis resulted in 10 pathways, amongst others, sphingolipid metabolism and glycolysis/gluconeogenesis, with significant influence on drip loss. Drip loss and 22 metabolic components were analyzed as intermediate phenotypes within a genome-wide association study (GWAS). We detected significantly associated genetic markers and candidate genes for drip loss and for most of the metabolic components. On chromosome 18, a region with promising candidate genes was identified based on SNPs associated with drip loss, the protein "phosphoglycerate mutase 2" and the metabolite glycine. We hypothesize that association studies based on intermediate phenotypes are able to provide comprehensive insights in the genetic variation of genes directly involved in the metabolism of performance traits. In this way, the analyses contribute to identify reliable candidate genes. PMID:27589727

  3. Clinical Microfluidics for Neutrophil Genomics and Proteomics

    PubMed Central

    Kotz, Kenneth T.; Xiao, Wenzong; Miller-Graziano, Carol; Qian, Wei-Jun; Russom, Aman; Warner, Elizabeth A.; Moldawer, Lyle L.; De, Asit; Bankey, Paul E.; Petritis, Brianne O.; Camp, David G.; Rosenbach, Alan E.; Goverman, Jeremy; Fagan, Shawn P.; Brownstein, Bernard H.; Irimia, Daniel; Xu, Weihong; Wilhelmy, Julie; Mindrinos, Michael N.; Smith, Richard D.; Davis, Ronald W.; Tompkins, Ronald G.; Toner, Mehmet

    2010-01-01

    Neutrophils play critical roles in modulating the immune response. We present a robust methodology for rapidly isolating neutrophils directly from whole blood and develop ‘on-chip’ processing for mRNA and protein isolation for genomics and proteomics. We validate this device with an ex vivo stimulation experiment and by comparison with standard bulk isolation methodologies. Lastly, we implement this tool as part of a near patient blood processing system within a multi-center clinical study of the immune response to severe trauma and burn injury. The preliminary results from a small cohort of patients in our study and healthy controls show a unique time-dependent gene expression pattern clearly demonstrating the ability of this tool to discriminate temporal transcriptional events of neutrophils within a clinical setting. PMID:20802500

  4. Human sperm chromatin epigenetic potential: genomics, proteomics, and male infertility

    PubMed Central

    Castillo, Judit; Estanyol, Josep Maria; Ballescà, Josep Lluis; Oliva, Rafael

    2015-01-01

    The classical idea about the function of the mammalian sperm chromatin is that it serves to transmit a highly protected and transcriptionally inactive paternal genome, largely condensed by protamines, to the next generation. In addition, recent sperm chromatin genome-wide dissection studies indicate the presence of a differential distribution of the genes and repetitive sequences in the protamine-condensed and histone-condensed sperm chromatin domains, which could be potentially involved in regulatory roles after fertilization. Interestingly, recent proteomic studies have shown that sperm chromatin contains many additional proteins, in addition to the abundant histones and protamines, with specific modifications and chromatin affinity features which are also delivered to the oocyte. Both gene and protein signatures seem to be altered in infertile patients and, as such, are consistent with the potential involvement of the sperm chromatin landscape in early embryo development. This present work reviews the available information on the composition of the human sperm chromatin and its epigenetic potential, with a particular focus on recent results derived from high-throughput genomic and proteomic studies. As a complement, we provide experimental evidence for the detection of phosphorylations and acetylations in human protamine 1 using a mass spectrometry approach. The available data indicate that the sperm chromatin is much more complex than what it was previously thought, raising the possibility that it could also serve to transmit crucial paternal epigenetic information to the embryo. PMID:25926607

  5. GENOMIC AND PROTEOMIC TECHNIQUES APPLIED TO REPRODUCTIVE BIOLOGY

    EPA Science Inventory

    Genomic and proteomic techniques applied to reproductive biology
    John C. Rockett
    Reproductive Toxicology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, United States Environmental Protection Agency, Research Tria...

  6. Integrated proteomic and genomic analysis of colorectal cancer

    Cancer.gov

    Investigators who analyzed 95 human colorectal tumor samples have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, pro

  7. Systems proteomics of liver mitochondria function.

    PubMed

    Williams, Evan G; Wu, Yibo; Jha, Pooja; Dubuis, Sébastien; Blattmann, Peter; Argmann, Carmen A; Houten, Sander M; Amariuta, Tiffany; Wolski, Witold; Zamboni, Nicola; Aebersold, Ruedi; Auwerx, Johan

    2016-06-10

    Recent improvements in quantitative proteomics approaches, including Sequential Window Acquisition of all Theoretical Mass Spectra (SWATH-MS), permit reproducible large-scale protein measurements across diverse cohorts. Together with genomics, transcriptomics, and other technologies, transomic data sets can be generated that permit detailed analyses across broad molecular interaction networks. Here, we examine mitochondrial links to liver metabolism through the genome, transcriptome, proteome, and metabolome of 386 individuals in the BXD mouse reference population. Several links were validated between genetic variants toward transcripts, proteins, metabolites, and phenotypes. Among these, sequence variants in Cox7a2l alter its protein's activity, which in turn leads to downstream differences in mitochondrial supercomplex formation. This data set demonstrates that the proteome can now be quantified comprehensively, serving as a key complement to transcriptomics, genomics, and metabolomics--a combination moving us forward in complex trait analysis. PMID:27284200

  8. Connecting Genomic Alterations to Cancer Biology with Proteomics: The NCI Clinical Proteomic Tumor Analysis Consortium

    SciTech Connect

    Ellis, Matthew; Gillette, Michael; Carr, Steven A.; Paulovich, Amanda G.; Smith, Richard D.; Rodland, Karin D.; Townsend, Reid; Kinsinger, Christopher; Mesri, Mehdi; Rodriguez, Henry; Liebler, Daniel

    2013-10-03

    The National Cancer Institute (NCI) Clinical Proteomic Tumor Analysis Consortium is applying the latest generation of proteomic technologies to genomically annotated tumors from The Cancer Genome Atlas (TCGA) program, a joint initiative of the NCI and the National Human Genome Research Institute. By providing a fully integrated accounting of DNA, RNA, and protein abnormalities in individual tumors, these datasets will illuminate the complex relationship between genomic abnormalities and cancer phenotypes, thus producing biologic insights as well as a wave of novel candidate biomarkers and therapeutic targets amenable to verifi cation using targeted mass spectrometry methods.

  9. Proteomics. Making sense of genomic information for drug discovery.

    PubMed

    Whitelegge, J P; le Coutre, J

    2001-01-01

    As an increasing number of available genomes triggers a gold rush in modern biology, the scientific challenge shifts towards understanding the total of the encoded information, most notably the proteins, their structures, functions and interactions. Currently this work is in its early stages but the near future will bring a merger of biology, engineering and informatics with a far broader impact on society than pure genomics has had so far. The challenge of characterizing the structures and functions of all proteins in a given cell demands technological advances beyond the classical methodologies of protein biochemistry. Mass spectrometry techniques for high-throughput protein identification, including peptide mass fingerprinting, sequence tagging and mass spectrometry on full-length proteins are providing the driving force behind proteomics endeavors. New technologies are needed to move high-resolution protein structure determination to an industrial scale. Nonetheless, improvements in techniques for the separation of intrinsic membrane proteins are enabling proteomics efforts towards identifying drug targets within this important class of biomolecules. Beyond the acquisition of data on sequences, structures and interactions, however, the major work in drug discovery remains: the screening of large candidate compound libraries combined with clever medicinal chemistry that guarantees selective action and defined delivery of the drug. PMID:12173311

  10. The Proteome Folding Project: Proteome-scale prediction of structure and function

    PubMed Central

    Drew, Kevin; Winters, Patrick; Butterfoss, Glenn L.; Berstis, Viktors; Uplinger, Keith; Armstrong, Jonathan; Riffle, Michael; Schweighofer, Erik; Bovermann, Bill; Goodlett, David R.; Davis, Trisha N.; Shasha, Dennis; Malmström, Lars; Bonneau, Richard

    2011-01-01

    The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition, and grid-computing-enabled de novo structure prediction. We predict protein domain boundaries and three-dimensional (3D) structures for protein domains from 94 genomes (including human, Arabidopsis, rice, mouse, fly, yeast, Escherichia coli, and worm). De novo structure predictions were distributed on a grid of more than 1.5 million CPUs worldwide (World Community Grid). We generated significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions. PMID:21824995

  11. Bacillus thuringiensis: a genomics and proteomics perspective.

    PubMed

    Ibrahim, Mohamed A; Griko, Natalya; Junker, Matthew; Bulla, Lee A

    2010-01-01

    Bacillus thuringiensis (Bt) is a unique bacterium in that it shares a common place with a number of chemical compounds which are used commercially to control insects important to agriculture and public health. Although other bacteria, including B. popilliae and B. sphaericus, are used as microbial insecticides, their spectrum of insecticidal activity is quite limited compared to Bt. Importantly, Bt is safe for humans and is the most widely used environmentally compatible biopesticide worldwide. Furthermore, insecticidal Bt genes have been incorporated into several major crops, rendering them insect resistant, and thus providing a model for genetic engineering in agriculture.This review highlights what the authors consider the most relevant issues and topics pertaining to the genomics and proteomics of Bt. At least one of the authors (L.A.B.) has spent most of his professional life studying different aspects of this bacterium with the goal in mind of determining the mechanism(s) by which it kills insects. The other authors have a much shorter experience with Bt but their intellect and personal insight have greatly enriched our understanding of what makes Bt distinctive in the microbial world. Obviously, there is personal interest and bias reflected in this article notwithstanding oversight of a number of published studies. This review contains some material not published elsewhere although several ideas and concepts were developed from a broad base of scientific literature up to 2010. PMID:21327125

  12. Genomics, metagenomics and proteomics in biomining microorganisms.

    PubMed

    Valenzuela, Lissette; Chi, An; Beard, Simon; Orell, Alvaro; Guiliani, Nicolas; Shabanowitz, Jeff; Hunt, Donald F; Jerez, Carlos A

    2006-01-01

    The use of acidophilic, chemolithotrophic microorganisms capable of oxidizing iron and sulfur in industrial processes to recover metals from minerals containing copper, gold and uranium is a well established biotechnology with distinctive advantages over traditional mining. A consortium of different microorganisms participates in the oxidative reactions resulting in the extraction of dissolved metal values from ores. Considerable effort has been spent in the last years to understand the biochemistry of iron and sulfur compounds oxidation, bacteria-mineral interactions (chemotaxis, quorum sensing, adhesion, biofilm formation) and several adaptive responses allowing the microorganisms to survive in a bioleaching environment. All of these are considered key phenomena for understanding the process of biomining. The use of genomics, metagenomics and high throughput proteomics to study the global regulatory responses that the biomining community uses to adapt to their changing environment is just beginning to emerge in the last years. These powerful approaches are reviewed here since they offer the possibility of exciting new findings that will allow analyzing the community as a microbial system, determining the extent to which each of the individual participants contributes to the process, how they evolve in time to keep the conglomerate healthy and therefore efficient during the entire process of bioleaching. PMID:16288845

  13. Defining Genome Maintenance Pathways using Functional Genomic Approaches

    PubMed Central

    Bansbach, Carol E.; Cortez, David

    2011-01-01

    Genome maintenance activities including DNA repair, cell division cycle control, and checkpoint signaling pathways preserve genome integrity and prevent disease. Defects in these pathways cause birth defects, neurodegeneration, premature aging, and cancer. Recent technical advances in functional genomic approaches such as expression profiling, proteomics, and RNA interference (RNAi) technologies have rapidly expanded our knowledge of the proteins that work in these pathways. In this review, we examine the use of these high-throughput methodologies in higher eukaryotic organisms for the interrogation of genome maintenance activities. PMID:21787120

  14. Complete Genome and Proteome of Acholeplasma laidlawii▿†

    PubMed Central

    Lazarev, V. N.; Levitskii, S. A.; Basovskii, Y. I.; Chukin, M. M.; Akopian, T. A.; Vereshchagin, V. V.; Kostrjukova, E. S.; Kovaleva, G. Y.; Kazanov, M. D.; Malko, D. B.; Vitreschak, A. G.; Sernova, N. V.; Gelfand, M. S.; Demina, I. A.; Serebryakova, M. V.; Galyamina, M. A.; Vtyurin, N. N.; Rogov, S. I.; Alexeev, D. G.; Ladygina, V. G.; Govorun, V. M.

    2011-01-01

    We present the complete genome sequence and proteogenomic map for Acholeplasma laidlawii PG-8A (class Mollicutes, order Acholeplasmatales, family Acholeplasmataceae). The genome of A. laidlawii is represented by a single 1,496,992-bp circular chromosome with an average G+C content of 31 mol%. This is the longest genome among the Mollicutes with a known nucleotide sequence. It contains genes of polymerase type I, SOS response, and signal transduction systems, as well as RNA regulatory elements, riboswitches, and T boxes. This demonstrates a significant capability for the regulation of gene expression and mutagenic response to stress. Acholeplasma laidlawii and phytoplasmas are the only Mollicutes known to use the universal genetic code, in which UGA is a stop codon. Within the Mollicutes group, only the sterol-nonrequiring Acholeplasma has the capacity to synthesize saturated fatty acids de novo. Proteomic data were used in the primary annotation of the genome, validating expression of many predicted proteins. We also detected posttranslational modifications of A. laidlawii proteins: phosphorylation and acylation. Seventy-four candidate phosphorylated proteins were found: 16 candidates are proteins unique to A. laidlawii, and 11 of them are surface-anchored or integral membrane proteins, which implies the presence of active signaling pathways. Among 20 acylated proteins, 14 contained palmitic chains, and six contained stearic chains. No residue of linoleic or oleic acid was observed. Acylated proteins were components of mainly sugar and inorganic ion transport systems and were surface-anchored proteins with unknown functions. PMID:21784942

  15. Exploring the post-genomic world: differing explanatory and manipulatory functions of post-genomic sciences

    PubMed Central

    Holmes, Christina; Carlson, Siobhan M.; McDonald, Fiona; Jones, Mavis; Graham, Janice

    2016-01-01

    Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbates this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics. PMID:27134568

  16. University of Victoria Genome British Columbia Proteomics Centre Partners with CPTAC - Office of Cancer Clinical Proteomics Research

    Cancer.gov

    University of Victoria Genome British Columbia Proteomics Centre, a leader in proteomic technology development, has partnered with the U.S. National Cancer Institute (NCI) to make targeted proteomic assays accessible to the community through NCI’s CPTAC Assay Portal.

  17. Proteogenomics: the needs and roles to be filled by proteomics in genome annotation

    SciTech Connect

    Ansong, Charles; Purvine, Samuel O.; Adkins, Joshua N.; Lipton, Mary S.; Smith, Richard D.

    2008-01-01

    While genome sequencing efforts reveal the basic building blocks of life, a genome sequence alone is insufficient for elucidating biological function. Genome annotation – the process of identifying genes and assigning function to each gene in a genome sequence – provides the means to elucidate biological function from sequence. Current state-of-the-art high throughput genome annotation uses a combination of comparative (sequence similarity data) and non-comparative (ab initio gene prediction algorithms) methods to identify open reading frames in genome sequences. Because approaches used to validate the presence of these open reading frames are typically based on the information derived from the annotated genomes, they cannot independently and unequivocally determine whether a predicted open reading frame is translated into a protein. With the ability to directly measure peptides arising from expressed proteins, high throughput liquid chromatography-tandem mass spectrometry-based proteomics, approaches can be used to verify coding regions of a genomic sequence. Here, we highlight several ways in which high throughput tandem mass spectrometry-based proteomics can improve the quality of genome annotations and suggest that it could be efficiently applied during the initial gene calling process so that the improvements are propagated through the subsequent functional annotation process.

  18. The 9th Siena meeting: from genome to proteome: open innovations.

    PubMed

    Godovac-Zimmermann, Jasminka

    2012-12-01

    The Siena Meeting has been held biannually since 1994, when for the first time the concept of the proteome was introduced to a large scientific audience. Over the years, the meeting has grown to be a major international conference in the field of proteomics and has attracted excellent scientists from all corners of the world. The 9th Siena Meeting: 'from Genome to Proteome: Open Innovations' was attended by 300 scientists. There were four plenary and eight parallel sessions with 50 invited talks and three poster sessions with 94 posters covering wide range of functional proteomics, signaling, biomarkers, cancer, neuroscience, glycoproteomics, mass spectrometry and bioinformatics. As in the past, this year's Siena Meeting maintained its tradition of placing science at centre stage, which generated a wide range of discussions of major importance for the future. PMID:23256669

  19. Comparative Bacterial Proteomics: Analysis of the Core Genome Concept

    SciTech Connect

    Callister, Stephen J.; McCue, Lee Ann; Turse, Josh E.; Monroe, Matthew E.; Auberry, Kenneth J.; Smith, Richard D.; Adkins, Joshua N.; Lipton, Mary S.

    2008-02-06

    Comparative bacterial genomic studies commonly predict a set of genes indicative of common ancestry. Experimental validation of the existence of this core genome requires extensive measurement and is not typically undertaken. Enabled by an extensive proteome database development over a six year period, we experimentally verified the expression of proteins predicted from genomic ortholog comparisons among 17 environmental and pathogenic bacteria. More exclusive relationships were observed among the expressed protein content of phenotypically related bacteria, which is indicative of the specific lifestyles associated with these organisms. While genomic studies establish relative orthologous relationships among a set of bacteria and propose a set of ancestral genes, our proteomics study establishes expressed lifestyle differences among conserved genes and proposes a set of expressed ancestral traits.

  20. Highlights of recent articles on data mining in genomics & proteomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This editorial elaborates on investigations consisting of different “OMICS” technologies and their application to biological sciences. In addition, advantages and recent development of the proteomic, genomic and data mining technologies are discussed. This information will be useful to scientists ...

  1. MAPU 2.0: high-accuracy proteomes mapped to genomes

    PubMed Central

    Gnad, Florian; Oroshi, Mario; Birney, Ewan; Mann, Matthias

    2009-01-01

    The MAPU 2.0 database contains proteomes of organelles, tissues and cell types measured by mass spectrometry (MS)-based proteomics. In contrast to other databases it is meant to contain a limited number of experiments and only those with very high-resolution and -accuracy data. MAPU 2.0 displays the proteomes of organelles, tissues and body fluids or conversely displays the occurrence of proteins of interest in all these proteomes. The new release addresses MS-specific problems including ambiguous peptide-to-protein assignments and it provides insight into general functional features on the protein level ranging from gene ontology classification to comprehensive SwissProt annotation. Moreover, the derived proteomic data are used to annotate the genomes using Distributed Annotation Service (DAS) via EnsEMBL services. MAPU 2.0 is a model for a database specifically designed for high-accuracy proteomics and a member of the ProteomExchange Consortium. It is available on line at http://www.mapuproteome.com. PMID:18948283

  2. Comparative Analysis of Genomics and Proteomics in Bacillus thuringiensis 4.0718

    PubMed Central

    Rang, Jie; He, Hao; Wang, Ting; Ding, Xuezhi; Zuo, Mingxing; Quan, Meifang; Sun, Yunjun; Yu, Ziquan; Hu, Shengbiao; Xia, Liqiu

    2015-01-01

    Bacillus thuringiensis is a widely used biopesticide that produced various insecticidal active substances during its life cycle. Separation and purification of numerous insecticide active substances have been difficult because of the relatively short half-life of such substances. On the other hand, substances can be synthetized at different times during development, so samples at different stages have to be studied, further complicating the analysis. A dual genomic and proteomic approach would enhance our ability to identify such substances, and particularily using mass spectrometry-based proteomic methods. The comparative analysis for genomic and proteomic data have showed that not all of the products deduced from the annotated genome could be identified among the proteomic data. For instance, genome annotation results showed that 39 coding sequences in the whole genome were related to insect pathogenicity, including five cry genes. However, Cry2Ab, Cry1Ia, Cytotoxin K, Bacteriocin, Exoenzyme C3 and Alveolysin could not be detected in the proteomic data obtained. The sporulation-related proteins were also compared analysis, results showed that the great majority sporulation-related proteins can be detected by mass spectrometry. This analysis revealed Spo0A~P, SigF, SigE(+), SigK(+) and SigG(+), all known to play an important role in the process of spore formation regulatory network, also were displayed in the proteomic data. Through the comparison of the two data sets, it was possible to infer that some genes were silenced or were expressed at very low levels. For instance, found that cry2Ab seems to lack a functional promoter while cry1Ia may not be expressed due to the presence of transposons. With this comparative study a relatively complete database can be constructed and used to transform hereditary material, thereby prompting the high expression of toxic proteins. A theoretical basis is provided for constructing highly virulent engineered bacteria and for

  3. Annotation of loci from genome-wide association studies using tissue-specific quantitative interaction proteomics.

    PubMed

    Lundby, Alicia; Rossin, Elizabeth J; Steffensen, Annette B; Acha, Moshe Rav; Newton-Cheh, Christopher; Pfeufer, Arne; Lynch, Stacey N; Olesen, Søren-Peter; Brunak, Søren; Ellinor, Patrick T; Jukema, J Wouter; Trompet, Stella; Ford, Ian; Macfarlane, Peter W; Krijthe, Bouwe P; Hofman, Albert; Uitterlinden, André G; Stricker, Bruno H; Nathoe, Hendrik M; Spiering, Wilko; Daly, Mark J; Asselbergs, Folkert W; van der Harst, Pim; Milan, David J; de Bakker, Paul I W; Lage, Kasper; Olsen, Jesper V

    2014-08-01

    Genome-wide association studies (GWAS) have identified thousands of loci associated with complex traits, but it is challenging to pinpoint causal genes in these loci and to exploit subtle association signals. We used tissue-specific quantitative interaction proteomics to map a network of five genes involved in the Mendelian disorder long QT syndrome (LQTS). We integrated the LQTS network with GWAS loci from the corresponding common complex trait, QT-interval variation, to identify candidate genes that were subsequently confirmed in Xenopus laevis oocytes and zebrafish. We used the LQTS protein network to filter weak GWAS signals by identifying single-nucleotide polymorphisms (SNPs) in proximity to genes in the network supported by strong proteomic evidence. Three SNPs passing this filter reached genome-wide significance after replication genotyping. Overall, we present a general strategy to propose candidates in GWAS loci for functional studies and to systematically filter subtle association signals using tissue-specific quantitative interaction proteomics. PMID:24952909

  4. Annotation of loci from genome-wide association studies using tissue-specific quantitative interaction proteomics

    PubMed Central

    Lundby, Alicia; Rossin, Elizabeth J.; Steffensen, Annette B.; Rav Acha, Moshe; Newton-Cheh, Christopher; Pfeufer, Arne; Lynch, Stacey N.; Olesen, Søren-Peter; Brunak, Søren; Ellinor, Patrick T.; Jukema, J.Wouter; Trompet, Stella; Ford, Ian; Macfarlane, Peter W.; Krijthe, Bouwe P.; Hofman, Albert; Uitterlinden, Andre G.; Stricker, Bruno H.; Nathoe, Hendrik M.; Spiering, Wilko; Daly, Mark J.; Asselbergs, Folkert W.; van der Harst, Pim; Milan, David J.; de Bakker, Paul I.W.; Lage, Kasper; Olsen, Jesper V.

    2014-01-01

    Genome-wide association studies (GWAS) have identified thousands of loci associated wtih complex traits, but it is challenging to pinpoint causal genes in these loci and to exploit subtle association signals. We used tissue-specific quantitative interaction proteomics to map a network of five genes involved in the Mendelian disorder long QT syndrome (LQTS). We integrated the LQTS network with GWAS loci from the corresponding common complex trait, QT interval variation, to identify candidate genes that were subsequently confirmed in Xenopus laevis oocytes and zebrafish. We used the LQTS protein network to filter weak GWAS signals by identifying single nucleotide polymorphisms (SNPs) in proximity to genes in the network supported by strong proteomic evidence. Three SNPs passing this filter reached genome-wide significance after replication genotyping. Overall, we present a general strategy to propose candidates in GWAS loci for functional studies and to systematically filter subtle association signals using tissue-specific quantitative interaction proteomics. PMID:24952909

  5. Plant nuclear proteomics for unraveling physiological function.

    PubMed

    Yin, Xiaojian; Komatsu, Setsuko

    2016-09-25

    The nucleus is the subcellular organelle that functions as the regulatory hub of the cell and is responsible for regulating several critical cellular functions, including cell proliferation, gene expression, and cell survival. Nuclear proteomics is a useful approach for investigating the mechanisms underlying plant responses to abiotic stresses, including protein-protein interactions, enzyme activities, and post-translational modifications. Among abiotic stresses, flooding is a major limiting factor for plant growth and yields, particularly for soybean. In this review, plant nuclei purification methods, modifications of plant nuclear proteins, and recent contributions to the field of plant nuclear proteomics are summarized. In addition, to reveal the upstream regulating mechanisms controlling soybean responses to flooding stress, the functions of flooding-responsive nuclear proteins are reviewed based on the results of nuclear proteomic analysis of soybean in the early stages of flooding stress. PMID:27004615

  6. How the proteome packages the genome for cardiovascular development

    PubMed Central

    Karbassi, Elaheh; Vondriska, Thomas M.

    2014-01-01

    The devastating impact of congenital heart defects has made mechanisms of vertebrate heart and vascular development an active area of study. Because myocyte death is a common feature of acquired cardiovascular diseases and the adult heart does not regenerate, the need exists to understand whether features of the developing heart and vasculature—which are more plastic—can be exploited therapeutically in the disease setting. We know that a core network of transcription factors governs commitment to the cardiovascular lineage, and recent studies using genetic loss-of-function approaches and unbiased genomic studies have revealed the role for various chromatin modulatory events. We reason that chromatin structure itself is a causal feature that influences transcriptome complexity along a developmental continuum, and the purpose of this article is to highlight the areas in which ‘omics technologies have the potential to reveal new principles of phenotypic plasticity in development. We review the major mechanisms of chromatin structural regulation, highlighting what is known about their actions to control cardiovascular differentiation. We discuss emergent mechanisms of regulation that have been identified on the basis of genomic and proteomic studies of cardiac nuclei and identify current challenges to an integrated understanding of chromatin structure and cardiovascular phenotype. PMID:25074278

  7. Dicarbonyl proteome and genome damage in metabolic and vascular disease.

    PubMed

    Rabbani, Naila; Thornalley, Paul J

    2014-04-01

    Methylglyoxal is a potent protein-glycating agent. It is an arginine-directed glycating agent and often modifies functionally important sites in proteins. Glycation forms mainly MG-H1 [Nδ-(5-hydro-5-methyl-4-imidazolon-2-yl)ornithine] residues. MG-H1 content of proteins is quantified by stable isotopic dilution analysis-MS/MS and also by immunoblotting with specific monoclonal antibodies. Methylglyoxal-modified proteins undergo cellular proteolysis and release MG-H1 free adduct for excretion. MG-H1 residues have been found in proteins of animals, plants, bacteria, fungi and protoctista. MG-H1 is often the major advanced glycation end-product in proteins of tissues and body fluids, increasing in diabetes and associated vascular complications, renal failure, cirrhosis, Alzheimer's disease, arthritis, Parkinson's disease and aging. Proteins susceptible to methylglyoxal modification with related functional impairment are called the DCP (dicarbonyl proteome). The DCP includes albumin, haemoglobin, transcription factors, mitochondrial proteins, extracellular matrix proteins, lens crystallins and others. DCP component proteins are linked to mitochondrial dysfunction in diabetes and aging, oxidative stress, dyslipidaemia, cell detachment and anoikis and apoptosis. Methylglyoxal also modifies DNA where deoxyguanosine residues are modified to imidazopurinone MGdG {3-(2'-deoxyribosyl)-6,7-dihydro-6,7-dihydroxy-6/7-methylimidazo-[2,3-b]purine-9(8)one} isomers. MGdG was the major quantitative adduct detected in vivo. It was linked to frequency of DNA strand breaks and increased markedly during apoptosis induced by a cell-permeant glyoxalase I inhibitor. Glyoxalase I metabolizes >99% methylglyoxal and thereby protects the proteome and genome. Gene deletion of GLO1 is embryonically lethal and GLO1 silencing increases methylglyoxal concentration, MG-H1 and MGdG, premature aging and disease. Studies of methylglyoxal glycation have importance for human health, longevity and

  8. Leveraging Genomics Software to Improve Proteomics Results

    SciTech Connect

    Fodor, I K; Nelson, D O

    2005-09-06

    Rigorous data analysis techniques are essential in quantifying the differential expression of proteins in biological samples of interest. Statistical methods from the microarray literature were applied to the analysis of two-dimensional difference gel electrophoresis (2-D DIGE) proteomics experiments, in the context of technical variability studies involving human plasma. Protein expression measurements were corrected to account for observed intensity-dependent biases within gels, and normalized to mitigate observed gel to gel variations. The methods improved upon the results achieved using the best currently available 2-D DIGE proteomics software. The spot-wise protein variance was reduced by 10% and the number of apparently differentially expressed proteins was reduced by over 50%.

  9. An empirical strategy for characterizing bacterial proteomes across species in the absence of genomic sequences

    SciTech Connect

    Turse, Joshua E.; Marshall, Matthew J.; Fredrickson, Jim K.; Lipton, Mary S.; Callister, Stephen J.

    2010-11-12

    Current methods in proteomics are dependent on the availability of sequenced genomes to identify proteins. However, genomic sequences are not always available for bacteria or microbial communities, even with high throughput sequencing technology becoming more readily available. Nevertheless, the homology that exists between related bacteria makes possible the extraction of meaningful biological information from an organism’s, or community’s proteome using the genomic sequence of a near neighbor. Here, a cross-organism search strategy was used to look at the amount of proteomics information obtainable with relative genetic distance from a near neighbor organism and to identify proteins in the proteome of minimally characterized environmental isolates. We conclude that closely related organisms with sequenced genomes, can be used to characterize proteomes of organisms with unsequenced genomes. In general, a cross-organism search strategy demonstrates the first step to use of sequences genomes to evaluate the proteomes of environmental bacteria and microbial communities that have no sequenced genome

  10. A pan-cancer proteomic perspective on The Cancer Genome Atlas.

    PubMed

    Akbani, Rehan; Ng, Patrick Kwok Shing; Werner, Henrica M J; Shahmoradgoli, Maria; Zhang, Fan; Ju, Zhenlin; Liu, Wenbin; Yang, Ji-Yeon; Yoshihara, Kosuke; Li, Jun; Ling, Shiyun; Seviour, Elena G; Ram, Prahlad T; Minna, John D; Diao, Lixia; Tong, Pan; Heymach, John V; Hill, Steven M; Dondelinger, Frank; Städler, Nicolas; Byers, Lauren A; Meric-Bernstam, Funda; Weinstein, John N; Broom, Bradley M; Verhaak, Roeland G W; Liang, Han; Mukherjee, Sach; Lu, Yiling; Mills, Gordon B

    2014-01-01

    Protein levels and function are poorly predicted by genomic and transcriptomic analysis of patient tumours. Therefore, direct study of the functional proteome has the potential to provide a wealth of information that complements and extends genomic, epigenomic and transcriptomic analysis in The Cancer Genome Atlas (TCGA) projects. Here we use reverse-phase protein arrays to analyse 3,467 patient samples from 11 TCGA 'Pan-Cancer' diseases, using 181 high-quality antibodies that target 128 total proteins and 53 post-translationally modified proteins. The resultant proteomic data are integrated with genomic and transcriptomic analyses of the same samples to identify commonalities, differences, emergent pathways and network biology within and across tumour lineages. In addition, tissue-specific signals are reduced computationally to enhance biomarker and target discovery spanning multiple tumour lineages. This integrative analysis, with an emphasis on pathways and potentially actionable proteins, provides a framework for determining the prognostic, predictive and therapeutic relevance of the functional proteome. PMID:24871328

  11. Nanopatterned structures for biomolecular analysis toward genomic and proteomic applications

    NASA Astrophysics Data System (ADS)

    Chou, Chia-Fu; Gu, Jian; Wei, Qihuo; Liu, Yingjie; Gupta, Ravi; Nishio, Takeyoshi; Zenhausern, Frederic

    2005-01-01

    We report our fabrication of nanoscale devices using electron beam and nanoimprint lithography (NIL). We focus our study in the emerging fields of NIL, nanophotonics and nanobiotechnology and give a few examples as to how these nanodevices may be applied toward genomic and proteomic applications for molecular analysis. The examples include reverse NIL-fabricated nanofluidic channels for DNA stretching, nanoscale molecular traps constructed from dielectric constrictions for DNA or protein focusing by dielectrophoresis, multi-layer nanoburger and nanoburger multiplets for optimized surface-plasma enhanced Raman scattering for protein detection, and biomolecular motor-based nanosystems. The development of advanced nanopatterning techniques promises reliable and high-throughput manufacturing of nanodevices which could impact significantly on the areas of genomics, proteomics, drug discovery and molecular clinical diagnostics.

  12. Differential Proteomics Analysis of Bacillus amyloliquefaciens and Its Genome-Shuffled Mutant for Improving Surfactin Production

    PubMed Central

    Zhao, Junfeng; Cao, Lin; Zhang, Chong; Zhong, Lei; Lu, Jing; Lu, Zhaoxin

    2014-01-01

    Genome shuffling technology was used as a novel whole-genome engineering approach to rapidly improve the antimicrobial lipopeptide yield of Bacillus amyloliquefaciens. Comparative proteomic analysis of the parental ES-2-4 and genome-shuffled FMB38 strains was conducted to examine the differentially expressed proteins. The proteome was separated by 2-DE (two dimensional electrophoresis) and analyzed by MS (mass spectrum). In the shuffled strain FMB38, 51 differentially expressed protein spots with higher than two-fold spot density were detected by gel image comparison. Forty-six protein spots were detectable by silver staining and further MS analysis. The results demonstrated that among the 46 protein spots expressed particularly induced in the genome-shuffled mutant, 15 were related to metabolism, five to DNA replication, recombination and repair, six to translation and post-translational modifications, one to cell secretion and signal transduction mechanisms, three to surfactin synthesis, two to energy production and conversion, and 14 to others. All these indicated that the metabolic capability of the mutant was improved by the genome shuffling. The study will enable future detailed investigation of gene expression and function linked with surfactin synthesis. The results of proteome analysis may provide information for metabolic engineering of Bacillus amyloliquefaciens for overproduction of surfactin. PMID:25365175

  13. Functional proteomics of barley and barley chloroplasts – strategies, methods and perspectives

    PubMed Central

    Petersen, Jørgen; Rogowska-Wrzesinska, Adelina; Jensen, Ole N.

    2013-01-01

    Barley (Hordeum vulgare) is an important cereal grain that is used in a range of products for animal and human consumption. Crop yield and seed quality has been optimized during decades by plant breeding programs supported by biotechnology and molecular biology techniques. The recently completed whole-genome sequencing of barley revealed approximately 26,100 open reading frames, which provides a foundation for detailed molecular studies of barley by functional genomics and proteomics approaches. Such studies will provide further insights into the mechanisms of, for example, drought and stress tolerance, micronutrient utilization, and photosynthesis in barley. In the present review we present the current state of proteomics research for investigations of barley chloroplasts, i.e., the organelle that contain the photosynthetic apparatus in the plant. We describe several different proteomics strategies and discuss their applications in characterization of the barley chloroplast as well as future perspectives for functional proteomics in barley research. PMID:23515231

  14. Public Access for Teaching Genomics, Proteomics, and Bioinformatics

    PubMed Central

    Campbell, A. Malcolm

    2003-01-01

    When the human genome project was conceived, its leaders wanted all researchers to have equal access to the data and associated research tools. Their vision of equal access provides an unprecedented teaching opportunity. Teachers and students have free access to the same databases that researchers are using. Furthermore, the recent movement to deliver scientific publications freely has presented a second source of current information for teaching. I have developed a genomics course that incorporates many of the public-domain databases, research tools, and peer-reviewed journals. These online resources provide students with exciting entree into the new fields of genomics, proteomics, and bioinformatics. In this essay, I outline how these fields are especially well suited for inclusion in the undergraduate curriculum. Assessment data indicate that my students were able to utilize online information to achieve the educational goals of the course and that the experience positively influenced their perceptions of how they might contribute to biology. PMID:12888845

  15. An integrative genomic and proteomic approach to chemosensitivity prediction

    PubMed Central

    Ma, Yan; Ding, Zhenyu; Qian, Yong; Wan, Ying-Wooi; Tosun, Kursad; Shi, Xianglin; Castranova, Vincent; Harner, E. James; Guo, Nancy I.

    2009-01-01

    New computational approaches are needed to integrate both protein expression and gene expression profiles, extending beyond the correlation analyses of gene and protein expression profiles in the current practices. Here, we developed an algorithm to classify cell line chemosensitivity based on integrated transcriptional and proteomic profiles. We sought to determine whether a combination of gene and protein expression profiles of untreated cells was able to enhance the performance of chemosensitivity prediction. An integrative feature selection scheme was employed to identify chemosensitivity determinants from genome-wide transcriptional profiles and 52 protein expression levels in 60 human cancer cell lines (the NCI-60). A set of 118 anti-cancer drugs whose mechanisms of action were putatively understood was evaluated. Classifiers of the complete range of drug response (sensitive, intermediate, or resistant) were generated for the evaluated anti-cancer drugs, one for each agent. The classifiers were designed to be independent of the cells' tissue origins. The classification accuracy of all the evaluated 118 agents was remarkably better (P<0.001) than that would be achieved by chance. Furthermore, 76 out of the 118 classifiers identified from integrated genomic and protein profiles significantly (P<0.05) improved the accuracy of protein expression-based classifiers identified previously. These results demonstrate that our integrated genomic and proteomic approach enhances the performance of chemosensitivity prediction. This study presents a new analytical framework to identify integrated gene and protein expression signatures for predicting cellular behavior and clinical outcome in general. PMID:19082483

  16. Protein microarrays as tools for functional proteomics.

    PubMed

    LaBaer, Joshua; Ramachandran, Niroshan

    2005-02-01

    Protein microarrays present an innovative and versatile approach to study protein abundance and function at an unprecedented scale. Given the chemical and structural complexity of the proteome, the development of protein microarrays has been challenging. Despite these challenges there has been a marked increase in the use of protein microarrays to map interactions of proteins with various other molecules, and to identify potential disease biomarkers, especially in the area of cancer biology. In this review, we discuss some of the promising advances made in the development and use of protein microarrays. PMID:15701447

  17. The complete genome and proteome of Mycoplasma mobile.

    PubMed

    Jaffe, Jacob D; Stange-Thomann, Nicole; Smith, Cherylyn; DeCaprio, David; Fisher, Sheila; Butler, Jonathan; Calvo, Sarah; Elkins, Tim; FitzGerald, Michael G; Hafez, Nabil; Kodira, Chinnappa D; Major, John; Wang, Shunguang; Wilkinson, Jane; Nicol, Robert; Nusbaum, Chad; Birren, Bruce; Berg, Howard C; Church, George M

    2004-08-01

    Although often considered "minimal" organisms, mycoplasmas show a wide range of diversity with respect to host environment, phenotypic traits, and pathogenicity. Here we report the complete genomic sequence and proteogenomic map for the piscine mycoplasma Mycoplasma mobile, noted for its robust gliding motility. For the first time, proteomic data are used in the primary annotation of a new genome, providing validation of expression for many of the predicted proteins. Several novel features were discovered including a long repeating unit of DNA of approximately 2435 bp present in five complete copies that are shown to code for nearly identical yet uniquely expressed proteins. M. mobile has among the lowest DNA GC contents (24.9%) and most reduced set of tRNAs of any organism yet reported (28). Numerous instances of tandem duplication as well as lateral gene transfer are evident in the genome. The multiple available complete genome sequences for other motile and immotile mycoplasmas enabled us to use comparative genomic and phylogenetic methods to suggest several candidate genes that might be involved in motility. The results of these analyses leave open the possibility that gliding motility might have arisen independently more than once in the mycoplasma lineage. PMID:15289470

  18. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  19. Functional Genomic and Proteomic Analysis Reveals Disruption of Myelin-Related Genes and Translation in a Mouse Model of Early Life Neglect

    PubMed Central

    Bordner, Kelly A.; George, Elizabeth D.; Carlyle, Becky C.; Duque, Alvaro; Kitchen, Robert R.; Lam, TuKiet T.; Colangelo, Christopher M.; Stone, Kathryn L.; Abbott, Thomas B.; Mane, Shrikant M.; Nairn, Angus C.; Simen, Arthur A.

    2011-01-01

    Early life neglect is an important public health problem which can lead to lasting psychological dysfunction. Good animal models are necessary to understand the mechanisms responsible for the behavioral and anatomical pathology that results. We recently described a novel model of early life neglect, maternal separation with early weaning (MSEW), that produces behavioral changes in the mouse that persist into adulthood. To begin to understand the mechanism by which MSEW leads to these changes we applied cDNA microarray, next-generation RNA-sequencing (RNA-seq), label-free proteomics, multiple reaction monitoring (MRM) proteomics, and methylation analysis to tissue samples obtained from medial prefrontal cortex to determine the molecular changes induced by MSEW that persist into adulthood. The results show that MSEW leads to dysregulation of markers of mature oligodendrocytes and genes involved in protein translation and other categories, an apparent downward biasing of translation, and methylation changes in the promoter regions of selected dysregulated genes. These findings are likely to prove useful in understanding the mechanism by which early life neglect affects brain structure, cognition, and behavior. PMID:21629843

  20. proBAMsuite, a Bioinformatics Framework for Genome-Based Representation and Analysis of Proteomics Data*

    PubMed Central

    Wang, Xiaojing; Slebos, Robbert J. C.; Chambers, Matthew C.; Tabb, David L.; Liebler, Daniel C.; Zhang, Bing

    2016-01-01

    To facilitate genome-based representation and analysis of proteomics data, we developed a new bioinformatics framework, proBAMsuite, in which a central component is the protein BAM (proBAM) file format for organizing peptide spectrum matches (PSMs)1 within the context of the genome. proBAMsuite also includes two R packages, proBAMr and proBAMtools, for generating and analyzing proBAM files, respectively. Applying proBAMsuite to three recently published proteomics datasets, we demonstrated its utility in facilitating efficient genome-based sharing, interpretation, and integration of proteomics data. First, the interpretation of proteomics data is significantly enhanced with the rich genomic annotation information. Second, PSMs can be easily reannotated using user-specified gene annotation schemes and assembled into both protein and gene identifications. Third, using the genome as a common reference, proBAMsuite facilitates seamless proteomics and proteogenomics data integration. Finally, proBAM files can be readily visualized in genome browsers and thus bring proteomics data analysis to a general audience beyond the proteomics community. Results from this study establish proBAMsuite as a useful bioinformatics framework for proteomics and proteogenomics research. PMID:26657539

  1. Encapsulated in silica: genome, proteome and physiology of the thermophilic bacterium Anoxybacillus flavithermus

    SciTech Connect

    Saw, Jimmy H; Mountain, Bruce W; Feng, Lu; Omelchenko, Marina V; Saito, Jennifer A; Stott, Matthew B; Li, Dan; Zhao, Guang; Wu, Junli; Galperin, Michael Y; Dunfield, Peter F; Wang, Lei; Alam, Maqsudul

    2008-01-01

    Gram-positive bacteria of the genus Anoxybacillus have been found in diverse thermophilic habitats, such as geothermal hot springs and manure, and in processed foods such as gelatin and milk powder. Anoxybacillus flavithermus is a facultatively anaerobic bacterium found in super-saturated silica solutions and in opaline silica sinter. The ability of A. flavithermus to grow in super-saturated silica solutions makes it an ideal subject to study the processes of sinter formation, which might be similar to the biomineralization processes that occurred at the dawn of life. We report here the complete genome sequence of A. flavithermus strain WK1, isolated from the waste water drain at the Wairakei geothermal power station in New Zealand. It consists of a single chromosome of 2,846,746 base pairs and is predicted to encode 2,863 proteins. In silico genome analysis identified several enzymes that could be involved in silica adaptation and biofilm formation, and their predicted functions were experimentally validated in vitro. Proteomic analysis confirmed the regulation of biofilm-related proteins and crucial enzymes for the synthesis of long-chain polyamines as constituents of silica nanospheres. Microbial fossils preserved in silica and silica sinters are excellent objects for studying ancient life, a new paleobiological frontier. An integrated analysis of the A. flavithermus genome and proteome provides the first glimpse of metabolic adaptation during silicification and sinter formation. Comparative genome analysis suggests an extensive gene loss in the Anoxybacillus/Geobacillus branch after its divergence from other bacilli.

  2. Proteomics-based confirmation of protein expression and correction of annotation errors in the Brucella abortus genome

    PubMed Central

    2010-01-01

    Background Brucellosis is a major bacterial zoonosis affecting domestic livestock and wild mammals, as well as humans around the globe. While conducting proteomics studies to better understand Brucella abortus virulence, we consolidated the proteomic data collected and compared it to publically available genomic data. Results The proteomic data was compiled from several independent comparative studies of Brucella abortus that used either outer membrane blebs, cytosols, or whole bacteria grown in media, as well as intracellular bacteria recovered at different times following macrophage infection. We identified a total of 621 bacterial proteins that were differentially expressed in a condition-specific manner. For 305 of these proteins we provide the first experimental evidence of their expression. Using a custom-built protein sequence database, we uncovered 7 annotation errors. We provide experimental evidence of expression of 5 genes that were originally annotated as non-expressed pseudogenes, as well as start site annotation errors for 2 other genes. Conclusions An essential element for ensuring correct functional studies is the correspondence between reported genome sequences and subsequent proteomics studies. In this study, we have used proteomics evidence to confirm expression of multiple proteins previously considered to be putative, as well as correct annotation errors in the genome of Brucella abortus strain 2308. PMID:20462421

  3. Phenotypic, genomic, transcriptomic and proteomic changes in Bacillus cereus after a short-term space flight

    NASA Astrophysics Data System (ADS)

    Su, Longxiang; Zhou, Lisha; Liu, Jinwen; Cen, Zhong; Wu, Chunyan; Wang, Tong; Zhou, Tao; Chang, De; Guo, Yinghua; Fang, Xiangqun; Wang, Junfeng; Li, Tianzhi; Yin, Sanjun; Dai, Wenkui; Zhou, Yuping; Zhao, Jiao; Fang, Chengxiang; Yang, Ruifu; Liu, Changting

    2014-01-01

    The environment in space could affect microorganisms by changing a variety of features, including proliferation rate, cell physiology, cell metabolism, biofilm production, virulence, and drug resistance. However, the relevant mechanisms remain unclear. To explore the effect of a space environment on Bacillus cereus, a strain of B. cereus was sent to space for 398 h by ShenZhou VIII from November 1, 2011 to November 17, 2011. A ground simulation with similar temperature conditions was simultaneously performed as a control. After the flight, the flight and control strains were further analyzed using phenotypic, genomic, transcriptomic and proteomic techniques to explore the divergence of B. cereus in a space environment. The flight strains exhibited a significantly slower growth rate, a significantly higher amikacin resistance level, and changes in metabolism relative to the ground control strain. After the space flight, three polymorphic loci were found in the flight strains LCT-BC25 and LCT-BC235. A combined transcriptome and proteome analysis was performed, and this analysis revealed that the flight strains had changes in genes/proteins relevant to metabolism. In addition, certain genes/proteins that are relevant to structural function, gene expression modification and translation, and virulence were also altered. Our study represents the first documented analysis of the phenotypic, genomic, transcriptomic, and proteomic changes that occur in B. cereus during space flight, and our results could be beneficial to the field of space microbiology.

  4. Integrating genomics, proteomics and bioinformatics in translational studies of molecular medicine.

    PubMed

    Ostrowski, Jerzy; Wyrwicz, Lucjan S

    2009-09-01

    Understanding the molecular mechanisms of disease requires the introduction of molecular diagnostics into medical practice. Current medicine employs only elements of molecular diagnostics, which are usually applied on the scale of single genes. Medicine in the postgenomic era will utilize thousands of disease-associated molecular markers provided by high-throughput sequencing and functional genomic, proteomic and metabolomic studies. Such a spectrum of techniques will link clinical medicine based on molecularly oriented diagnostics with the prediction and prevention of disease. To achieve this task, large-scale and genome-wide biological and medical data must be combined with biostatistical and bioinformatic analyses to model biological systems. Collecting, cataloging and comparing data from molecular studies, and the subsequent development of conclusions, creates the fundamentals of systems biology. This highly complex analytical process reflects a new scientific paradigm known as integrative genomics. PMID:19732006

  5. Advances in genomics, transcriptomics and proteomics of toxin-producing cyanobacteria.

    PubMed

    D'Agostino, Paul M; Woodhouse, Jason N; Makower, A Katharina; Yeung, Anna C Y; Ongley, Sarah E; Micallef, Melinda L; Moffitt, Michelle C; Neilan, Brett A

    2016-02-01

    A common misconception persists that the genomes of toxic and non-toxic cyanobacterial strains are largely conserved with the exception of the presence or absence of the genes responsible for toxin production. Implementation of -omics era technologies has challenged this paradigm, with comparative analyses providing increased insight into the differences between strains of the same species. The implementation of genomic, transcriptomic and proteomic approaches has revealed distinct profiles between toxin-producing and non-toxic strains. Further, metagenomics and metaproteomics highlight the genomic potential and functional state of toxic bloom events over time. In this review, we highlight how these technologies have shaped our understanding of the complex relationship between these molecules, their producers and the environment at large within which they persist. PMID:26663762

  6. Extracting meaning from functional genomics experiments

    SciTech Connect

    Quackenbush, John . E-mail: johnq@jimmy.harvard.edu

    2005-09-01

    The completion of draft genome sequences for human, mouse, rat, and an increasing number of other species, has provided us with preliminary gene catalogues for many organisms of medical and scientific interests. Interpreting these gene lists in the context of the organism's underlying biology, however, remains difficult. The development of DNA microarrays provided one potential source of data to help interpret gene function; by profiling global patterns of gene expression across diverse conditions, it was hoped that we might be able to develop insight into biological function. But the power of these functional genomics assays, as well as assays in proteomics and metabolomics, is that they primarily give us lists of differentially expressed genes that can be correlated with particular phenotypic states, but which remain difficult to link mechanistically to the biology driving the phenotype.

  7. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer. PMID:26323482

  8. Genome and proteome annotation: organization, interpretation and integration

    PubMed Central

    Reeves, Gabrielle A.; Talavera, David; Thornton, Janet M.

    2008-01-01

    Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations. PMID:19019817

  9. Genomics, proteomics and metabonomics in toxicology: hopefully not 'fashionomics'.

    PubMed

    Pognan, Francois

    2004-10-01

    Genomics, proteomics and metabonomics are applied to toxicology either as stand-alone technologies or in combination, with the intention of providing a more efficient assessment of the potential side effects of new chemical entities. Two different approaches are taken: a predictive/proactive strategy based on a statistical analogy of 'signatures' of drugs to many known toxicant gene or metabolite fingerprints; and a mechanistic/reactive strategy based on the in-depth biological analysis of the gene, protein or metabolite profiles induced by one or a few compounds of interest. This article focuses on the advantages and disadvantages of these technologies, as well as the many hurdles associated with both these approaches in toxicology that have to be considered before applying them to the assessment of future drugs. PMID:15469409

  10. Nongenetic functions of the genome.

    PubMed

    Bustin, Michael; Misteli, Tom

    2016-05-01

    The primary function of the genome is to store, propagate, and express the genetic information that gives rise to a cell's architectural and functional machinery. However, the genome is also a major structural component of the cell. Besides its genetic roles, the genome affects cellular functions by nongenetic means through its physical and structural properties, particularly by exerting mechanical forces and by serving as a scaffold for binding of cellular components. Major cellular processes affected by nongenetic functions of the genome include establishment of nuclear structure, signal transduction, mechanoresponses, cell migration, and vision in nocturnal animals. We discuss the concept, mechanisms, and implications of nongenetic functions of the genome. PMID:27151873

  11. Proteomics in Aquaculture Research: Are We There Yet?

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Proteomics can be defined as the study of the entire proteome, the proteome being the expressed compliment of the genome. Proteomics aims to understand gene function and molecular processes of the living cell through the study of expressed proteins. A review of the literature suggests proteomics i...

  12. Web-Based Arabidopsis Functional and Structural Genomics Resources

    PubMed Central

    Lu, Yan; Last, Robert L.

    2008-01-01

    As plant research moves to a “post-genomic” era, many diverse internet resources become available to the international research community. Arabidopsis thaliana, because of its small size, rapid life cycle and simple genome, has been a model system for decades, with much research funding and many projects devoted to creation of functional and structural genomics resources. Different types of data, including genome, transcriptome, proteome, phenome, metabolome and ionome are stored in these resources. In this chapter, a variety of genomics resources are introduced, with simple descriptions of how some can be accessed by laboratory researchers via the internet. PMID:22303243

  13. Linkage of exposure and effects using genomics, proteomics and metabolomics in small fish models (presentation)

    EPA Science Inventory

    This research project combines the use of whole organism endpoints, genomic, proteomic and metabolomic approaches, and computational modeling in a systems biology approach to 1) identify molecular indicators of exposure and biomarkers of effect to EDCs representing several modes/...

  14. The Genomic and Proteomic Content of Cancer Cell-Derived Exosomes

    PubMed Central

    Henderson, Meredith C.; Azorsa, David O.

    2011-01-01

    Exosomes are secreted membrane vesicles that have been proposed as an effective means to detect a variety of disease states, including cancer. The properties of exosomes, including stability in biological fluids, allow for their efficient isolation and make them an ideal vehicle for studies on early disease detection and evaluation. Much data has been collected over recent years regarding the messenger RNA, microRNA, and protein contents of exosomes. In addition, many studies have described the functional role that exosomes play in disease initiation and progression. Tumor cells have been shown to secrete exosomes, often in increased amounts compared to normal cells, and these exosomes can carry the genomic and proteomic signatures characteristic of the tumor cells from which they were derived. While these unique signatures make exosomes ideal for cancer detection, exosomes derived from cancer cells have also been shown to play a functional role in cancer progression. Here, we review the unique genomic and proteomic contents of exosomes originating from cancer cells as well as their functional effects to promote tumor progression. PMID:22649786

  15. Genomic and Proteomic Determinants of Lower Extremity Revascularization Failure: Rationale and Study Design

    PubMed Central

    Nelson, Peter R.; O'Malley, Kerri A.; Feezor, Robert J.; Moldawer, Lyle L.; Seeger, James M.

    2007-01-01

    This translational research program applies a working model of advanced functional genomics/proteomics and bioinformatics to human peripheral arterial occlusive disease (PAOD). It is a multidisciplinary collaborative effort of clinicians, scientists, and statisticians with an advisory panel comprised of experts in inflammation biology, vascular biology, molecular genetics, bioinformatics, clinical trial design, and epidemiology. The proposed human initiative is designed to study 300 symptomatic patients with PAOD undergoing medical management with or without vascular intervention by either lower extremity angioplasty/stenting or vein graft bypass. The study aims to test the hypothesis that the systemic inflammatory response following vascular intervention influences the local milieu responsible for vascular repair and adaptation. The expectation is that this response is not uniform in all patients, but rather, is modulated by either preoperative genetic predisposition or post-procedure differential regulation of the innate immune response to injury that promotes a maladaptive phenotype leading to intervention failure. Therefore, some of these differences may be present and detectable pre-intervention amenable to class prediction and prospective treatment strategies, while others may be detectable in the early post-procedure period, prior to the onset of clinical failure, permitting interventions to prevent an adverse outcome. The combination of genomic/proteomic data together with functional and quality of life outcome measures to define a critical model for class prediction and analysis should lead to new knowledge about failure mechanisms of vascular intervention and new strategies to improve existing approaches to lower extremity revascularization. PMID:17544028

  16. Methylococcus capsulatus (Bath) from genome to protein function, and vice versa.

    PubMed

    Karlsen, Odd A; Berven, Frode S; Bagstevold, June I; Larsen, Oivind; Jensen, Harald B

    2011-01-01

    The genome sequence of Methylococcus capsulatus (Bath), considered a model methylotroph, was published in 2004 [Ward, N., et al. (2004). Genomic insights into methanotrophy: the complete genome sequence of Methylococcus capsulatus (Bath). PLoS Biol.2, e303]. In the postgenomic era, the challenge is to determine the gene function, and to this end, genomics must be complemented with proteomic approaches. This chapter describes some experimental and computational approaches we have used and developed for the exploration of the genome and proteome of M. capsulatus (Bath). PMID:21419915

  17. Annotation of the zebrafish genome through an integrated transcriptomic and proteomic analysis.

    PubMed

    Kelkar, Dhanashree S; Provost, Elayne; Chaerkady, Raghothama; Muthusamy, Babylakshmi; Manda, Srikanth S; Subbannayya, Tejaswini; Selvan, Lakshmi Dhevi N; Wang, Chieh-Huei; Datta, Keshava K; Woo, Sunghee; Dwivedi, Sutopa B; Renuse, Santosh; Getnet, Derese; Huang, Tai-Chung; Kim, Min-Sik; Pinto, Sneha M; Mitchell, Christopher J; Madugundu, Anil K; Kumar, Praveen; Sharma, Jyoti; Advani, Jayshree; Dey, Gourav; Balakrishnan, Lavanya; Syed, Nazia; Nanjappa, Vishalakshi; Subbannayya, Yashwanth; Goel, Renu; Prasad, T S Keshava; Bafna, Vineet; Sirdeshmukh, Ravi; Gowda, Harsha; Wang, Charles; Leach, Steven D; Pandey, Akhilesh

    2014-11-01

    Accurate annotation of protein-coding genes is one of the primary tasks upon the completion of whole genome sequencing of any organism. In this study, we used an integrated transcriptomic and proteomic strategy to validate and improve the existing zebrafish genome annotation. We undertook high-resolution mass-spectrometry-based proteomic profiling of 10 adult organs, whole adult fish body, and two developmental stages of zebrafish (SAT line), in addition to transcriptomic profiling of six organs. More than 7,000 proteins were identified from proteomic analyses, and ∼ 69,000 high-confidence transcripts were assembled from the RNA sequencing data. Approximately 15% of the transcripts mapped to intergenic regions, the majority of which are likely long non-coding RNAs. These high-quality transcriptomic and proteomic data were used to manually reannotate the zebrafish genome. We report the identification of 157 novel protein-coding genes. In addition, our data led to modification of existing gene structures including novel exons, changes in exon coordinates, changes in frame of translation, translation in annotated UTRs, and joining of genes. Finally, we discovered four instances of genome assembly errors that were supported by both proteomic and transcriptomic data. Our study shows how an integrative analysis of the transcriptome and the proteome can extend our understanding of even well-annotated genomes. PMID:25060758

  18. CGUG: in silico proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to ca. 1.9 Mb

    PubMed Central

    Mahadevan, Padmanabhan; King, John F; Seto, Donald

    2009-01-01

    Background Viruses and small-genome bacteria (~2 megabases and smaller) comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complementary computational tools to analyze. "CoreGenesUniqueGenes" (CGUG) is an in silico genome data mining tool that determines a "core" set of genes from two to five organisms with genomes in this size range. Core and unique genes may reflect similar niches and needs, and may be used in classifying organisms. Findings CGUG is available at as a web-based on-the-fly tool that performs iterative BLASTP analyses using a reference genome and up to four query genomes to provide a table of genes common to these genomes. The result is an in silico display of genomes and their proteomes, allowing for further analysis. CGUG can be used for "genome annotation by homology", as demonstrated with Chlamydophila and Francisella genomes. Conclusion CGUG is used to reanalyze the ICTV-based classifications of bacteriophages, to reconfirm long-standing relationships and to explore new classifications. These genomes have been problematic in the past, due largely to horizontal gene transfers. CGUG is validated as a tool for reannotating small genome bacteria using more up-to-date annotations by similarity or homology. These serve as an entry point for wet-bench experiments to confirm the functions of these "hypothetical" and "unknown" proteins. PMID:19706165

  19. Annotation of Protein Domains Reveals Remarkable Conservation in the Functional Make up of Proteomes Across Superkingdoms

    PubMed Central

    Nasir, Arshan; Naeem, Aisha; Khan, Muhammad Jawad; Lopez-Nicora, Horacio D.; Caetano-Anollés, Gustavo

    2011-01-01

    The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins encoded in the genome of an organism. The molecular functions of proteins are the direct consequence of their structure and structure can be inferred from sequence using hidden Markov models of structural recognition. Here we analyze the functional annotation of protein domain structures in almost a thousand sequenced genomes, exploring the functional and structural diversity of proteomes. We find there is a remarkable conservation in the distribution of domains with respect to the molecular functions they perform in the three superkingdoms of life. In general, most of the protein repertoire is spent in functions related to metabolic processes but there are significant differences in the usage of domains for regulatory and extra-cellular processes both within and between superkingdoms. Our results support the hypotheses that the proteomes of superkingdom Eukarya evolved via genome expansion mechanisms that were directed towards innovating new domain architectures for regulatory and extra/intracellular process functions needed for example to maintain the integrity of multicellular structure or to interact with environmental biotic and abiotic factors (e.g., cell signaling and adhesion, immune responses, and toxin production). Proteomes of microbial superkingdoms Archaea and Bacteria retained fewer numbers of domains and maintained simple and smaller protein repertoires. Viruses appear to play an important role in the evolution of superkingdoms. We finally identify few genomic outliers that deviate significantly from the conserved functional design. These include Nanoarchaeum equitans, proteobacterial symbionts of insects with extremely reduced genomes, Tenericutes and Guillardia theta. These organisms spend most of their domains on information functions, including translation and transcription, rather than on metabolism and harbor a domain repertoire characteristic of

  20. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  1. Screening and Functional Analyses of Nilaparvata lugens Salivary Proteome.

    PubMed

    Huang, Hai-Jian; Liu, Cheng-Wen; Huang, Xiao-Hui; Zhou, Xiang; Zhuo, Ji-Chong; Zhang, Chuan-Xi; Bao, Yan-Yuan

    2016-06-01

    Most phloem-feeding insects secrete gelling and watery saliva during the feeding process. However, the functions of salivary proteins are poorly understood. In this study, our purpose was to reveal the components and functions of saliva in a rice sap-sucking insect pest, Nilaparvata lugens. The accomplishment of the whole genome and transcriptome sequencing in N. lugens would be helpful for elucidating the gene information and expression specificity of the salivary proteins. In this study, we have, for the first time, identified the abundant protein components from gelling and watery saliva in a monophagous sap-sucking insect species through shotgun proteomic detection combined with the genomic and transcriptomic analysis. Eight unknown secreted proteins were limited to N. lugens, indicating species-specific saliva components. A group of annexin-like proteins first identified in the secreted saliva displayed different domain structure and expression specificity with typical insect annexins. Nineteen genes encoding five annexin-like proteins, six salivaps (salivary glands-specific proteins with unknown function), seven putative enzymes, and a mucin-like protein showed salivary gland-specific expression pattern, suggesting their importance in the physiological mechanisms of salivary gland and saliva in this insect species. RNA interference revealed that salivap-3 is a key protein factor in forming the salivary sheath, while annexin-like5 and carbonic anhydrase are indispensable for N. lugens survival. These novel findings will greatly help to clarify the detailed functions of salivary proteins in the physiological process of N. lugens and elucidate the interaction mechanisms between N. lugens and the rice plant, which could provide important targets for the future management of rice pests. PMID:27142481

  2. Genomic and Proteomic Biomarkers for Cancer: A Multitude of Opportunities

    PubMed Central

    Tainsky, Michael A.

    2009-01-01

    Biomarkers are molecular indicators of a biological status, and as biochemical species can be assayed to evaluate the presence of cancer and therapeutic interventions. Through a variety of mechanisms cancer cells provide the biomarker material for their own detection. Biomarkers may be detectable in the blood, other body fluids, or tissues. The expectation is that the level of an informative biomarker is related to the specific type of disease present in the body. Biomarkers have potential both as diagnostic indicators and monitors of the effectiveness of clinical interventions. Biomarkers are also able to stratify cancer patients to the most appropriate treatment. Effective biomarkers for the early detection of cancer should provide a patient with a better outcome which in turn will translate into more efficient delivery of healthcare. Technologies for the early detection of cancer have resulted in reductions in disease-associated mortalities from cancers that are otherwise deadly if allowed to progress. Such screening technologies have proven that early detection will decrease the morbidity and mortality from cancer. An emerging theme in biomarker research is the expectation that panels of biomarker analytes rather than single markers will be needed to have sufficient sensitivity and specificity for the presymptomatic detection of cancer. Biomarkers may provide prognostic information of disease enabling interventions using targeted therapeutic agents as well as course-corrections in cancer treatment. Novel genomic, proteomic and metabolomic technologies are being used to discover and validate tumor biomarkers individually and in panels. PMID:19406210

  3. Encapsulated in silica: genome, proteome and physiology of the thermophilic bacterium Anoxybacillus flavithermus WK1

    PubMed Central

    Saw, Jimmy H; Mountain, Bruce W; Feng, Lu; Omelchenko, Marina V; Hou, Shaobin; Saito, Jennifer A; Stott, Matthew B; Li, Dan; Zhao, Guang; Wu, Junli; Galperin, Michael Y; Koonin, Eugene V; Makarova, Kira S; Wolf, Yuri I; Rigden, Daniel J; Dunfield, Peter F; Wang, Lei; Alam, Maqsudul

    2008-01-01

    Background Gram-positive bacteria of the genus Anoxybacillus have been found in diverse thermophilic habitats, such as geothermal hot springs and manure, and in processed foods such as gelatin and milk powder. Anoxybacillus flavithermus is a facultatively anaerobic bacterium found in super-saturated silica solutions and in opaline silica sinter. The ability of A. flavithermus to grow in super-saturated silica solutions makes it an ideal subject to study the processes of sinter formation, which might be similar to the biomineralization processes that occurred at the dawn of life. Results We report here the complete genome sequence of A. flavithermus strain WK1, isolated from the waste water drain at the Wairakei geothermal power station in New Zealand. It consists of a single chromosome of 2,846,746 base pairs and is predicted to encode 2,863 proteins. In silico genome analysis identified several enzymes that could be involved in silica adaptation and biofilm formation, and their predicted functions were experimentally validated in vitro. Proteomic analysis confirmed the regulation of biofilm-related proteins and crucial enzymes for the synthesis of long-chain polyamines as constituents of silica nanospheres. Conclusions Microbial fossils preserved in silica and silica sinters are excellent objects for studying ancient life, a new paleobiological frontier. An integrated analysis of the A. flavithermus genome and proteome provides the first glimpse of metabolic adaptation during silicification and sinter formation. Comparative genome analysis suggests an extensive gene loss in the Anoxybacillus/Geobacillus branch after its divergence from other bacilli. PMID:19014707

  4. Functional Genomics Tools for Papaya

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With the genome of papaya (Carica papaya L.) sequenced, the study of gene function is becoming an increasing priority. Our research is to develop an RNA-induced gene silencing tool for the study of functional genomics in papaya. We employed agrobacterium leaf infiltration to induce PTGS in '-glucuro...

  5. Proteomics-inferred genome typing (PIGT) demonstrates inter-populationrecombination as a strategy for environmental adaptation

    SciTech Connect

    Denef, Vincent; Verberkmoes, Nathan C; Shah, Manesh B; Abraham, Paul E; Lefsrud, Mark G; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    Analyses of ecological and evolutionary processes that shape microbial consortia are facilitated by comprehensive studies of ecosystems with low species richness. In the current study we evaluated the role of recombination in altering the fitness of chemoautotrophic bacteria in their natural environment. Proteomics-inferred genome typing (PIGT) was used to determine the genomic make-up of Leptospirillum group II populations in 27 biofilms sampled from six locations in the Richmond Mine acid mine drainage system (Iron Mountain, CA) over a four-year period. We observed six distinct genotypes that are recombinants comprised of segments from two parental genotypes. Community genomic analyses revealed additional low abundance recombinant variants. The dominance of some genotypes despite a larger available genome pool, and patterns of spatiotemporal distribution within the ecosystem, indicate selection for distinct recombinants. Genes involved in motility, signal transduction and transport were overrepresented in the tens to hundreds of kilobase recombinant blocks, whereas core metabolic functions were significantly underrepresented. Our findings demonstrate the power of PIGT and reveal that recombination is a mechanism for fine-scale adaptation in this system.

  6. Protein intrinsic disorder within the Potyvirus genus: from proteome-wide analysis to functional annotation.

    PubMed

    Charon, Justine; Theil, Sébastien; Nicaise, Valérie; Michon, Thierry

    2016-02-01

    Within proteins, intrinsically disordered regions (IDRs) are devoid of stable secondary and tertiary structures under physiological conditions and rather exist as dynamic ensembles of inter-converting conformers. Although ubiquitous in all domains of life, the intrinsic disorder content is highly variable in viral genomes. Over the years, functional annotations of disordered regions at the scale of the whole proteome have been conducted for several animal viruses. But to date, similar studies applied to plant viruses are still missing. Based on disorder prediction tools combined with annotation programs and evolutionary studies, we analyzed the intrinsic disorder content in Potyvirus, using a 10-species dataset representative of this genus diversity. In this paper, we revealed that: (i) the Potyvirus proteome displays high disorder content, (ii) disorder is conserved during Potyvirus evolution, suggesting a functional advantage of IDRs, (iii) IDRs evolve faster than ordered regions, and (iv) IDRs may be associated with major biological functions required for the Potyvirus cycle. Notably, the proteins P1, Coat protein (CP) and Viral genome-linked protein (VPg) display a high content of conserved disorder, enriched in specific motifs mimicking eukaryotic functional modules and suggesting strategies of host machinery hijacking. In these three proteins, IDRs are particularly conserved despite their high amino acid polymorphism, indicating a link to adaptive processes. Through this comprehensive study, we further investigate the biological relevance of intrinsic disorder in Potyvirus biology and we propose a functional annotation of potyviral proteome IDRs. PMID:26699268

  7. Genomic and Proteomic Studies on Plesiomonas shigelloides Lipopolysaccharide Core Biosynthesis

    PubMed Central

    Aquilini, Eleonora; Merino, Susana; Regué, Miguel

    2014-01-01

    We report here the identification of waa clusters with the genes required for the biosynthesis of the core lipopolysaccharides (LPS) of two Plesiomonas shigelloides strains. Both P. shigelloides waa clusters shared all of the genes besides the ones flanking waaL. In both strains, all of the genes were found in the waa gene cluster, although one common core biosynthetic gene (wapG) was found in a different chromosome location outside the cluster. Since P. shigelloides and Klebsiella pneumoniae share a core LPS carbohydrate backbone extending up at least to the second outer-core residue, the functions of the common P. shigelloides genes were elucidated by genetic complementation studies using well-defined K. pneumoniae mutants. The function of strain-specific inner- or outer-core genes was identified by using as a surrogate acceptor LPS from three well-defined K. pneumoniae core LPS mutants. Using this strategy, we were able to assign a proteomic function to all of the P. shigelloides waa genes identified in the two strains encoding six new glycosyltransferases (WapA, -B, -C, -D, -F, and -G). P. shigelloides demonstrated an important variety of core LPS structures, despite being a single species of the genus, as well as high homologous recombination in housekeeping genes. PMID:24244003

  8. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics

    PubMed Central

    Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  9. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics.

    PubMed

    Takeuchi, Takeshi; Yamada, Lixy; Shinzato, Chuya; Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  10. ProtoBug: functional families from the complete proteomes of insects.

    PubMed

    Rappoport, Nadav; Linial, Michal

    2015-01-01

    ProtoBug (http://www.protobug.cs.huji.ac.il) is a database and resource of protein families in Arthropod genomes. ProtoBug platform presents the relatedness of complete proteomes from 17 insects as well as a proteome of the crustacean, Daphnia pulex. The represented proteomes from insects include louse, bee, beetle, ants, flies and mosquitoes. Based on an unsupervised clustering method, protein sequences were clustered into a hierarchical tree, called ProtoBug. ProtoBug covers about 300,000 sequences that are partitioned to families. At the default setting, all sequences are partitioned to ∼20,000 families (excluding singletons). From the species perspective, each of the 18 analysed proteomes is composed of 5000-8000 families. In the regime of the advanced operational mode, the ProtoBug provides rich navigation capabilities for touring the hierarchy of the families at any selected resolution. A proteome viewer shows the composition of sequences from any of the 18 analysed proteomes. Using functional annotation from an expert system (Pfam) we assigned domains, families and repeats by 4400 keywords that cover 73% of the sequences. A strict inference protocol is applied for expanding the functional knowledge. Consequently, secured annotations were associated with 81% of the proteins, and with 70% of the families (≥10 proteins each). ProtoBug is a database and webtool with rich visualization and navigation tools. The properties of each family in relation to other families in the ProtoBug tree, and in view of the taxonomy composition are reported. Furthermore, the user can paste its own sequences to find relatedness to any of the ProtoBug families. The database and the navigation tools are the basis for functional discoveries that span 350 million years of evolution of Arthropods. ProtoBug is available with no restriction at: www.protobug.cs.huji.ac.il. Database URL: www.protobug.cs.huji.ac.il PMID:25911153

  11. ProtoBug: functional families from the complete proteomes of insects

    PubMed Central

    Rappoport, Nadav; Linial, Michal

    2015-01-01

    ProtoBug (http://www.protobug.cs.huji.ac.il) is a database and resource of protein families in Arthropod genomes. ProtoBug platform presents the relatedness of complete proteomes from 17 insects as well as a proteome of the crustacean, Daphnia pulex. The represented proteomes from insects include louse, bee, beetle, ants, flies and mosquitoes. Based on an unsupervised clustering method, protein sequences were clustered into a hierarchical tree, called ProtoBug. ProtoBug covers about 300 000 sequences that are partitioned to families. At the default setting, all sequences are partitioned to ∼20 000 families (excluding singletons). From the species perspective, each of the 18 analysed proteomes is composed of 5000–8000 families. In the regime of the advanced operational mode, the ProtoBug provides rich navigation capabilities for touring the hierarchy of the families at any selected resolution. A proteome viewer shows the composition of sequences from any of the 18 analysed proteomes. Using functional annotation from an expert system (Pfam) we assigned domains, families and repeats by 4400 keywords that cover 73% of the sequences. A strict inference protocol is applied for expanding the functional knowledge. Consequently, secured annotations were associated with 81% of the proteins, and with 70% of the families (≥10 proteins each). ProtoBug is a database and webtool with rich visualization and navigation tools. The properties of each family in relation to other families in the ProtoBug tree, and in view of the taxonomy composition are reported. Furthermore, the user can paste its own sequences to find relatedness to any of the ProtoBug families. The database and the navigation tools are the basis for functional discoveries that span 350 million years of evolution of Arthropods. ProtoBug is available with no restriction at: www.protobug.cs.huji.ac.il. Database URL: www.protobug.cs.huji.ac.il. PMID:25911153

  12. Mass spectrometry-based functional proteomics of poly(ADP-ribose) polymerase-1.

    PubMed

    Pic, Emilie; Gagné, Jean-Philippe; Poirier, Guy G

    2011-12-01

    PARP-1 is an abundant nuclear protein that plays an essential role in the regulation of many genome integrity and chromatin-based processes, such as DNA repair, replication or transcriptional regulation. PARP-1 modulates the function of chromatin and nuclear proteins through several poly(ADP-ribose) (pADPr)-dependent pathways. Aside from the clearly established role of PARP-1 in the maintenance of genome stability, PARP-1 also emerged as an important regulator that links chromatin functions with extranuclear compartments. pADPr signaling has notably been found to be responsible for PARP-1-mediated mitochondrial dysfunction and cell death. Defining the mechanisms that govern the intrinsic functions of PARP-1 is fundamental to the understanding of signaling networks regulated by pADPr. The emergence of mass spectrometry-based proteomics and its broad applications in the study of biological systems represents an outstanding opportunity to widen our knowledge of the functional spectrum of PARP-1. In this article, we summarize various PARP-1 targeted proteomics studies and proteome-wide analyses that shed light on its protein interaction partners, expression levels and post-translational modifications. PMID:22087659

  13. Moving from unsequenced to sequenced genome: Reanalysis of the proteome of Leishmania donovani☆

    PubMed Central

    Nirujogi, Raja Sekhar; Pawar, Harsh; Renuse, Santosh; Kumar, Praveen; Chavan, Sandip; Sathe, Gajanan; Sharma, Jyoti; Khobragade, Sweta; Pande, Janhavee; Modak, Bhakti; Prasad, T.S. Keshava; Harsha, H.C.; Patole, Milind S.; Pandey, Akhilesh

    2015-01-01

    The kinetoplastid protozoan parasite, Leishmania donovani, is the causative agent of kala azar or visceral leishmaniasis. Kala azar is a severe form of leishmaniasis that is fatal in the majority of untreated cases. Studies on proteomic analysis of L. donovani thus far have been carried out using homology-based identification based on related Leishmania species (L. infantum, L. major and L. braziliensis) whose genomes have been sequenced. Recently, the genome of L. donovani was fully sequenced and the data became publicly available. We took advantage of the availability of its genomic sequence to carry out a more accurate proteogenomic analysis of L. donovani proteome using our previously generated dataset. This resulted in identification of 17,504 unique peptides upon database-dependent search against the annotated proteins in L. donovani. These peptides were assigned to 3999 unique proteins in L. donovani. 2296 proteins were identified in both the life stages of L. donovani, while 613 and 1090 proteins were identified only from amastigote and promastigote stages, respectively. The proteomic data was also searched against six-frame translated L. donovani genome, which led to 255 genome search-specific peptides (GSSPs) resulting in identification of 20 novel genes and correction of 40 existing gene models in L. donovani. Biological significance Leishmania donovani genome sequencing was recently completed, which permitted us to use a proteogenomic approach to map its proteome and to carry out annotation of it genome. This resulted in mapping of 50% (3999 proteins) of L. donovani proteome. Our study identified 20 novel genes previously not predicted from the L. donovani genome in addition to correcting annotations of 40 existing gene models. The identified proteins may help in better understanding of stage-specific protein expression profiles in L. donovani and to identify novel stage-specific drug targets in L. donovani which could be used in the treatment of

  14. Proteomic and genomic analysis of PITX2 interacting and regulating networks

    PubMed Central

    Huang, Yue; Huang, Kan; Boskovic, Goran; Dementieva, Yulia; Denvir, James; Primerano, Donald A.; Zhu, Guo-Zhang

    2009-01-01

    PITX2 is a homeodomain transcription factor that has a substantial role in cell proliferation and differentiation in various tissues. In this report, we have conducted a systematic study, using proteomic and genomic approaches, to characterize PITX2-interacting proteins and PITX2-regulating genes. We identified four novel PITX2-associated protein partners YB-1, hnRNP K, nucleolin and hnRNP U in mass spectrometry analysis. We also found that overexpression of PITX2 upregulated 868 genes (two-fold to twenty five-fold) and downregulated 191 genes (twofold to fifteen fold) in DNA microarray analysis. These data provide an insightful perspective for further studying PITX2 function and mechanism of action. PMID:19174163

  15. Application of Functional Genomics for Bovine Respiratory Disease Diagnostics

    PubMed Central

    Rai, Aswathy N.; Epperson, William B.; Nanduri, Bindu

    2015-01-01

    Bovine respiratory disease (BRD) is the most common economically important disease affecting cattle. For developing accurate diagnostics that can predict disease susceptibility/resistance and stratification, it is necessary to identify the molecular mechanisms that underlie BRD. To study the complex interactions among the bovine host and the multitude of viral and bacterial pathogens, as well as the environmental factors associated with BRD etiology, genome-scale high-throughput functional genomics methods such as microarrays, RNA-seq, and proteomics are helpful. In this review, we summarize the progress made in our understanding of BRD using functional genomics approaches. We also discuss some of the available bioinformatics resources for analyzing high-throughput data, in the context of biological pathways and molecular interactions. Although resources for studying host response to infection are avail-able, the corresponding information is lacking for majority of BRD pathogens, impeding progress in identifying diagnostic signatures for BRD using functional genomics approaches. PMID:26526746

  16. Application of Functional Genomics for Bovine Respiratory Disease Diagnostics.

    PubMed

    Rai, Aswathy N; Epperson, William B; Nanduri, Bindu

    2015-01-01

    Bovine respiratory disease (BRD) is the most common economically important disease affecting cattle. For developing accurate diagnostics that can predict disease susceptibility/resistance and stratification, it is necessary to identify the molecular mechanisms that underlie BRD. To study the complex interactions among the bovine host and the multitude of viral and bacterial pathogens, as well as the environmental factors associated with BRD etiology, genome-scale high-throughput functional genomics methods such as microarrays, RNA-seq, and proteomics are helpful. In this review, we summarize the progress made in our understanding of BRD using functional genomics approaches. We also discuss some of the available bioinformatics resources for analyzing high-throughput data, in the context of biological pathways and molecular interactions. Although resources for studying host response to infection are avail-able, the corresponding information is lacking for majority of BRD pathogens, impeding progress in identifying diagnostic signatures for BRD using functional genomics approaches. PMID:26526746

  17. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant

    SciTech Connect

    Brooks, Brandon; Mueller, R. S.; Young, Jacque C.; Morowitz, Michael J.; Robert L. Hettich; Banfield, Jillian F.

    2015-07-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13 21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains of Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential.

  18. Quantitative Proteomic Analysis of Mitochondrial Proteins Reveals Pro-Survival Mechanisms in the Perpetuation of Radiation-Induced Genomic Instability

    SciTech Connect

    Thomas, Stefani N.; Waters, Katrina M.; Morgan, William F.; Yang, Austin; Baulch, Janet E.

    2012-07-26

    Radiation induced genomic instability is a well-studied phenomenon that is measured as mitotically heritable genetic alterations observed in the progeny of an irradiated cell. The mechanisms that perpetuate this instability are unclear, however, a role for chronic oxidative stress has consistently been demonstrated. In the chromosomally unstable LS12 cell line, oxidative stress and genomic instability were correlated with mitochondrial dysfunction. To clarify this mitochondrial dysfunction and gain insight into the mechanisms underlying radiation induced genomic instability we have evaluated the mitochondrial sub-proteome and performed quantitative mass spectrometry (MS) analysis of LS12 cells. Of 98 quantified mitochondrial proteins, 17 met criteria for fold changes and reproducibility; and 11 were statistically significant in comparison with the stable parental GM10115 cell line. Previous observations implicated defects in the electron transport chain (ETC) in the LS12 cell mitochondrial dysfunction. Proteomic analysis supports these observations, demonstrating significantly reduced levels of mitochondrial cytochrome c, the intermediary between complexes III and IV of the ETC. Results also suggest that LS12 cells compensate for ETC dysfunction and oxidative stress through increased levels of tricarboxylic acid cycle enzymes and up-regulation of proteins that protect against oxidative stress and apoptosis. More than one cellular defect is likely to contribute to the genomic instability phenotype. These data suggest that LS12 cells have adapted mechanisms that allow survival under sub-optimal conditions of oxidative stress and compromised mitochondrial function to perpetuate genomic instability.

  19. [FUNCTIONAL DIFFERENTIATION IN BRYOZOAN COLONY: A PROTEOMIC ANALYSIS].

    PubMed

    Kutyumov, V A; Maltseva, A L; Kotenko, N; Ostrovsky, A N

    2016-01-01

    Bryozoans are typical modular organisms. They consist of repetitive structural units, the zooids. Bryozoan colonies grow by zooidal budding, with the distribution pattern of the budding loci underlying the diversity of colony forms. Budding is usually restricted to the zooids at the periphery of the colony, which form a "growing edge" or local terminal growth zones. Non-budding parts of the colony can be functionally subdivided, too. In many species colonies consists of regular, often repetitive zones of feeding and non-feeding modules, associated with a periodical degeneration and regeneration of the polypide, retractile tentacle crown with a gut and the accompanying musculature. So, there is functional differentiation in bryozoan colonies but its mechanisms are unknown. Presumably, budding and/or polypide recycling in different colony parts are induced or inhibited by certain determinants of functional specialization. An effective tool of their identification is the comparison of proteomes of functionally different zones. Here we report the results of proteomic analysis of three bryozoan species from the White Sea, which have a different colony form: Flustrellidra hispida, Terminoflustra membranaceotruncata and Securiflustra securifrons. Using differential two-dimensional electrophoresis (2D-DIGE), we compared proteomes of the growing edge and the zones consisting of feeding and non-feeding zooids in these species. We estimated the overall proteome variability, revealed proteins whose relative abundance gradually changed along the proximal-distal colony axis and suggested that they might be involved in the functional differentiation of the colony. PMID:27220253

  20. VESPA: Software to Facilitate Genomic Annotation of Prokaryotic Organisms Through Integration of Proteomic and Transcriptomic Data

    SciTech Connect

    Peterson, Elena S.; McCue, Lee Ann; Rutledge, Alexandra C.; Jensen, Jeffrey L.; Walker, Julia; Kobold, Mark A.; Webb, Samantha R.; Payne, Samuel H.; Ansong, Charles; Adkins, Joshua N.; Cannon, William R.; Webb-Robertson, Bobbie-Jo M.

    2012-04-25

    Visual Exploration and Statistics to Promote Annotation (VESPA) is an interactive visual analysis software tool that facilitates the discovery of structural mis-annotations in prokaryotic genomes. VESPA integrates high-throughput peptide-centric proteomics data and oligo-centric or RNA-Seq transcriptomics data into a genomic context. The data may be interrogated via visual analysis across multiple levels of genomic resolution, linked searches, exports and interaction with BLAST to rapidly identify location of interest within the genome and evaluate potential mis-annotations.

  1. Single-Nucleotide Variations in Cardiac Arrhythmias: Prospects for Genomics and Proteomics Based Biomarker Discovery and Diagnostics

    PubMed Central

    Abunimer, Ayman; Smith, Krista; Wu, Tsung-Jung; Lam, Phuc; Simonyan, Vahan; Mazumder, Raja

    2014-01-01

    Cardiovascular diseases are a large contributor to causes of early death in developed countries. Some of these conditions, such as sudden cardiac death and atrial fibrillation, stem from arrhythmias—a spectrum of conditions with abnormal electrical activity in the heart. Genome-wide association studies can identify single nucleotide variations (SNVs) that may predispose individuals to developing acquired forms of arrhythmias. Through manual curation of published genome-wide association studies, we have collected a comprehensive list of 75 SNVs associated with cardiac arrhythmias. Ten of the SNVs result in amino acid changes and can be used in proteomic-based detection methods. In an effort to identify additional non-synonymous mutations that affect the proteome, we analyzed the post-translational modification S-nitrosylation, which is known to affect cardiac arrhythmias. We identified loss of seven known S-nitrosylation sites due to non-synonymous single nucleotide variations (nsSNVs). For predicted nitrosylation sites we found 1429 proteins where the sites are modified due to nsSNV. Analysis of the predicted S-nitrosylation dataset for over- or under-representation (compared to the complete human proteome) of pathways and functional elements shows significant statistical over-representation of the blood coagulation pathway. Gene Ontology (GO) analysis displays statistically over-represented terms related to muscle contraction, receptor activity, motor activity, cystoskeleton components, and microtubule activity. Through the genomic and proteomic context of SNVs and S-nitrosylation sites presented in this study, researchers can look for variation that can predispose individuals to cardiac arrhythmias. Such attempts to elucidate mechanisms of arrhythmia thereby add yet another useful parameter in predicting susceptibility for cardiac diseases. PMID:24705329

  2. Proteomic profiling reveals insights into Triticeae stigma development and function.

    PubMed

    Nazemof, Nazila; Couroux, Philippe; Rampitsch, Christof; Xing, Tim; Robert, Laurian S

    2014-11-01

    To our knowledge, this study represents the first high-throughput characterization of a stigma proteome in the Triticeae. A total of 2184 triticale mature stigma proteins were identified using three different gel-based approaches combined with mass spectrometry. The great majority of these proteins are described in a Triticeae stigma for the first time. These results revealed many proteins likely to play important roles in stigma development and pollen-stigma interactions, as well as protection against biotic and abiotic stresses. Quantitative comparison of the triticale stigma transcriptome and proteome showed poor correlation, highlighting the importance of having both types of analysis. This work makes a significant contribution towards the elucidation of the Triticeae stigma proteome and provides novel insights into its role in stigma development and function. PMID:25170101

  3. Proteomic profiling reveals insights into Triticeae stigma development and function

    PubMed Central

    Nazemof, Nazila; Couroux, Philippe; Rampitsch, Christof; Xing, Tim; Robert, Laurian S.

    2014-01-01

    To our knowledge, this study represents the first high-throughput characterization of a stigma proteome in the Triticeae. A total of 2184 triticale mature stigma proteins were identified using three different gel-based approaches combined with mass spectrometry. The great majority of these proteins are described in a Triticeae stigma for the first time. These results revealed many proteins likely to play important roles in stigma development and pollen–stigma interactions, as well as protection against biotic and abiotic stresses. Quantitative comparison of the triticale stigma transcriptome and proteome showed poor correlation, highlighting the importance of having both types of analysis. This work makes a significant contribution towards the elucidation of the Triticeae stigma proteome and provides novel insights into its role in stigma development and function. PMID:25170101

  4. The Eukaryotic Flagellum Makes the Day: Novel and Unforeseen Roles Uncovered After Post-Genomics and Proteomics Data

    PubMed Central

    Diniz, Michely C; Pacheco, Ana Carolina L; Farias, Kaio M; de Oliveira, Diana M

    2012-01-01

    This review will summarize and discuss the current biological understanding of the motile eukaryotic flagellum, as posed out by recent advances enabled by post-genomics and proteomics approaches. The organelle, which is crucial for motility, survival, differentiation, reproduction, division and feeding, among other activities, of many eukaryotes, is a great example of a natural nanomachine assembled mostly by proteins (around 350-650 of them) that have been conserved throughout eukaryotic evolution. Flagellar proteins are discussed in terms of their arrangement on to the axoneme, the canonical “9+2” microtubule pattern, and also motor and sensorial elements that have been detected by recent proteomic analyses in organisms such as Chlamydomonas reinhardtii, sea urchin, and trypanosomatids. Such findings can be remarkably matched up to important discoveries in vertebrate and mammalian types as diverse as sperm cells, ciliated kidney epithelia, respiratory and oviductal cilia, and neuro-epithelia, among others. Here we will focus on some exciting work regarding eukaryotic flagellar proteins, particularly using the flagellar proteome of C. reinhardtii as a reference map for exploring motility in function, dysfunction and pathogenic flagellates. The reference map for the eukaryotic flagellar proteome consists of 652 proteins that include known structural and intraflagellar transport (IFT) proteins, less well-characterized signal transduction proteins and flagellar associated proteins (FAPs), besides almost two hundred unannotated conserved proteins, which lately have been the subject of intense investigation and of our present examination. PMID:22708495

  5. A DATABASE FOR TRACKING TOXICOGENOMIC SAMPLES AND PROCEDURES WITH GENOMIC, PROTEOMIC AND METABONOMIC COMPONENTS

    EPA Science Inventory

    A Database for Tracking Toxicogenomic Samples and Procedures with Genomic, Proteomic and Metabonomic Components
    Wenjun Bao1, Jennifer Fostel2, Michael D. Waters2, B. Alex Merrick2, Drew Ekman3, Mitchell Kostich4, Judith Schmid1, David Dix1
    Office of Research and Developmen...

  6. GENOMIC AND PROTEOMIC ANALYSIS OF SURROGATE TISSUES FOR ASSESSING TOXIC EXPOSURES AND DISEASE STATES

    EPA Science Inventory

    Genomic and Proteomic Analysis of Surrogate Tissues for Assessing Toxic Exposures and Disease States
    David J. Dix and John C. Rockett
    Reproductive Toxicology Division, National Health and Environmental Effects Research Laboratory, Office of Research and Development, USEPA, ...

  7. Advances in genomic- and proteomic-based methods to study food-borne pathogens

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In recent years, a number of “omics” technologies, including genomics, proteomics, metabolomics, and others are being utilized to enhance the understanding of the complexities of pathogen behavior at the molecular level and for the development of improved pathogen detection and typing systems. Gene...

  8. USING GENOMICS AND PROTEOMICS TO DIAGNOSE EXPOSURE OF AQUATIC ORGANISMS TO ENVIRONMENTAL CONTAMINANTS

    EPA Science Inventory

    Advances in molecular biology allow the use of cutting-edge genomic and proteomic tools to assess the effects of environmental contaminants on aquatic organisms. Techniques are available to measure changes in expression of single genes (quantitative real-time PCR) or to measure g...

  9. The Changing Face of Scientific Discourse: Analysis of Genomic and Proteomic Database Usage and Acceptance.

    ERIC Educational Resources Information Center

    Brown, Cecelia

    2003-01-01

    Discusses the growth in use and acceptance of Web-based genomic and proteomic databases (GPD) in scholarly communication. Confirms the role of GPD in the scientific literature cycle, suggests GPD are a storage and retrieval mechanism for molecular biology information, and recommends that existing models of scientific communication be updated to…

  10. High-Throughput Cloning and Expression Library Creation for Functional Proteomics

    PubMed Central

    Festa, Fernanda; Steel, Jason; Bian, Xiaofang; Labaer, Joshua

    2013-01-01

    The study of protein function usually requires the use of a cloned version of the gene for protein expression and functional assays. This strategy is particular important when the information available regarding function is limited. The functional characterization of the thousands of newly identified proteins revealed by genomics requires faster methods than traditional single gene experiments, creating the need for fast, flexible and reliable cloning systems. These collections of open reading frame (ORF) clones can be coupled with high-throughput proteomics platforms, such as protein microarrays and cell-based assays, to answer biological questions. In this tutorial we provide the background for DNA cloning, discuss the major high-throughput cloning systems (Gateway® Technology, Flexi® Vector Systems, and Creator™ DNA Cloning System) and compare them side-by-side. We also report an example of high-throughput cloning study and its application in functional proteomics. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP12). Details can be found at http://www.proteomicstutorials.org. PMID:23457047

  11. Human genome protein function database.

    PubMed Central

    Sorenson, D. K.

    1991-01-01

    A database which focuses on the normal functions of the currently-known protein products of the Human Genome was constructed. Information is stored as text, figures, tables, and diagrams. The program contains built-in functions to modify, update, categorize, hypertext, search, create reports, and establish links to other databases. The semi-automated categorization feature of the database program was used to classify these proteins in terms of biomedical functions. PMID:1807638

  12. Applied proteomics: mitochondrial proteins and effect on function.

    PubMed

    Lopez, Mary F; Melov, Simon

    2002-03-01

    The identification of a majority of the polypeptides in mitochondria would be invaluable because they play crucial and diverse roles in many cellular processes and diseases. The endogenous production of reactive oxygen species (ROS) is a major limiter of life as illustrated by studies in which the transgenic overexpression in invertebrates of catalytic antioxidant enzymes results in increased lifespans. Mitochondria have received considerable attention as a principal source---and target---of ROS. Mitochondrial oxidative stress has been implicated in heart disease including myocardial preconditioning, ischemia/reperfusion, and other pathologies. In addition, oxidative stress in the mitochondria is associated with the pathogenesis of Alzheimer's disease, Parkinson's disease, prion diseases, and amyotrophic lateral sclerosis (ALS) as well as aging itself. The rapidly emerging field of proteomics can provide powerful strategies for the characterization of mitochondrial proteins. Current approaches to mitochondrial proteomics include the creation of detailed catalogues of the protein components in a single sample or the identification of differentially expressed proteins in diseased or physiologically altered samples versus a reference control. It is clear that for any proteomics approach prefractionation of complex protein mixtures is essential to facilitate the identification of low-abundance proteins because the dynamic range of protein abundance within cells has been estimated to be as high as 10(7). The opportunities for identification of proteins directly involved in diseases associated with or caused by mitochondrial dysfunction are compelling. Future efforts will focus on linking genomic array information to actual protein levels in mitochondria. PMID:11884366

  13. AgBase: a functional genomics resource for agriculture

    PubMed Central

    McCarthy, Fiona M; Wang, Nan; Magee, G Bryce; Nanduri, Bindu; Lawrence, Mark L; Camon, Evelyn B; Barrell, Daniel G; Hill, David P; Dolan, Mary E; Williams, W Paul; Luthe, Dawn S; Bridges, Susan M; Burgess, Shane C

    2006-01-01

    Background Many agricultural species and their pathogens have sequenced genomes and more are in progress. Agricultural species provide food, fiber, xenotransplant tissues, biopharmaceuticals and biomedical models. Moreover, many agricultural microorganisms are human zoonoses. However, systems biology from functional genomics data is hindered in agricultural species because agricultural genome sequences have relatively poor structural and functional annotation and agricultural research communities are smaller with limited funding compared to many model organism communities. Description To facilitate systems biology in these traditionally agricultural species we have established "AgBase", a curated, web-accessible, public resource for structural and functional annotation of agricultural genomes. The AgBase database includes a suite of computational tools to use GO annotations. We use standardized nomenclature following the Human Genome Organization Gene Nomenclature guidelines and are currently functionally annotating chicken, cow and sheep gene products using the Gene Ontology (GO). The computational tools we have developed accept and batch process data derived from different public databases (with different accession codes), return all existing GO annotations, provide a list of products without GO annotation, identify potential orthologs, model functional genomics data using GO and assist proteomics analysis of ESTs and EST assemblies. Our journal database helps prevent redundant manual GO curation. We encourage and publicly acknowledge GO annotations from researchers and provide a service for researchers interested in GO and analysis of functional genomics data. Conclusion The AgBase database is the first database dedicated to functional genomics and systems biology analysis for agriculturally important species and their pathogens. We use experimental data to improve structural annotation of genomes and to functionally characterize gene products. AgBase is also

  14. Evaluation of a Genome-Scale In Silico Metabolic Model for Geobacter metallireducens Using Proteomic Data from a Field Biostimulation Experiment

    SciTech Connect

    Fang, Yilin; Wilkins, Michael J.; Yabusaki, Steven B.; Lipton, Mary S.; Long, Philip E.

    2012-12-12

    Biomass and shotgun global proteomics data that reflected relative protein abundances from samples collected during the 2008 experiment at the U.S. Department of Energy Integrated Field-Scale Subsurface Research Challenge site in Rifle, Colorado, provided an unprecedented opportunity to validate a genome-scale metabolic model of Geobacter metallireducens and assess its performance with respect to prediction of metal reduction, biomass yield, and growth rate under dynamic field conditions. Reconstructed from annotated genomic sequence, biochemical, and physiological data, the constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes. Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low fluxes through amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment.

  15. Current Approaches on Viral Infection: Proteomics and Functional Validations

    PubMed Central

    Zheng, Jie; Tan, Boon Huan; Sugrue, Richard; Tang, Kai

    2012-01-01

    Viruses could manipulate cellular machinery to ensure their continuous survival and thus become parasites of living organisms. Delineation of sophisticated host responses upon virus infection is a challenging task. It lies in identifying the repertoire of host factors actively involved in the viral infectious cycle and characterizing host responses qualitatively and quantitatively during viral pathogenesis. Mass spectrometry based proteomics could be used to efficiently study pathogen-host interactions and virus-hijacked cellular signaling pathways. Moreover, direct host and viral responses upon infection could be further investigated by activity-based functional validation studies. These approaches involve drug inhibition of secretory pathway, immunofluorescence staining, dominant negative mutant of protein target, real-time PCR, small interfering siRNA-mediated knockdown, and molecular cloning studies. In this way, functional validation could gain novel insights into the high-content proteomic dataset in an unbiased and comprehensive way. PMID:23162545

  16. Functional Genomics in the Study of Mind-Body Therapies

    PubMed Central

    Niles, Halsey; Mehta, Darshan H.; Corrigan, Alexandra A.; Bhasin, Manoj K.; Denninger, John W.

    2014-01-01

    Background Mind-body therapies (MBTs) are used throughout the world in treatment, disease prevention, and health promotion. However, the mechanisms by which MBTs exert their positive effects are not well understood. Investigations into MBTs using functional genomics have revolutionized the understanding of MBT mechanisms and their effects on human physiology. Methods We searched the literature for the effects of MBTs on functional genomics determinants using MEDLINE, supplemented by a manual search of additional journals and a reference list review. Results We reviewed 15 trials that measured global or targeted transcriptomic, epigenomic, or proteomic changes in peripheral blood. Sample sizes ranged from small pilot studies (n=2) to large trials (n=500). While the reliability of individual genes from trial to trial was often inconsistent, genes related to inflammatory response, particularly those involved in the nuclear factor-kappa B (NF-κB) pathway, were consistently downregulated across most studies. Conclusion In general, existing trials focusing on gene expression changes brought about by MBTs have revealed intriguing connections to the immune system through the NF-κB cascade, to telomere maintenance, and to apoptotic regulation. However, these findings are limited to a small number of trials and relatively small sample sizes. More rigorous randomized controlled trials of healthy subjects and specific disease states are warranted. Future research should investigate functional genomics areas both upstream and downstream of MBT-related gene expression changes—from epigenomics to proteomics and metabolomics. PMID:25598735

  17. Rice proteomics: A move toward expanded proteome coverage to comparative and functional proteomics uncovers the mysteries of rice and plant biology.

    PubMed

    Agrawal, Ganesh Kumar; Rakwal, Randeep

    2011-05-01

    Growing rice is an important socio-economic activity. Rice proteomics has achieved a tremendous progress in establishing techniques to proteomes of almost all tissues, organs, and organelles during the past one decade (year 2000-2010). We have compiled these progresses time to time over this period. The present compilation discusses proteomics research in rice published between 1st April 2008 and 30th July 2010. Progress continues mainly towards protein cataloging deep into the proteome with high-confident protein assignment and some functional significance than ever before by (i) identifying previously unreported/low-abundance proteins, (ii) quantifying relative/absolute values of proteins, (iii) assigning protein responses to biotic/abiotic stresses, (iv) protein localization into organelles, (v) validating previous proteomes and eliminating false-positive proteins, and (vi) discovering potential biomarkers for tissues, organs, organelles, and for screening transgenic plants and food-safety evaluation. The notable achievements in global mapping of phosphorylation sites and identifying several novel secreted proteins into the extracellular space are worth appreciating. Our ever-increasing knowledge on the rice proteomics is beginning to impact the biology of not only rice, but also crops and plants. These major achievements will be discussed in this review keeping in mind newcomers, young, and established scientists in proteomics and plants. PMID:21462347

  18. Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics

    PubMed Central

    Fermin, Damian; Allen, Baxter B; Blackwell, Thomas W; Menon, Rajasree; Adamski, Marcin; Xu, Yin; Ulintz, Peter; Omenn, Gilbert S; States, David J

    2006-01-01

    Background Defining the location of genes and the precise nature of gene products remains a fundamental challenge in genome annotation. Interrogating tandem mass spectrometry data using genomic sequence provides an unbiased method to identify novel translation products. A six-frame translation of the entire human genome was used as the query database to search for novel blood proteins in the data from the Human Proteome Organization Plasma Proteome Project. Because this target database is orders of magnitude larger than the databases traditionally employed in tandem mass spectra analysis, careful attention to significance testing is required. Confidence of identification is assessed using our previously described Poisson statistic, which estimates the significance of multi-peptide identifications incorporating the length of the matching sequence, number of spectra searched and size of the target sequence database. Results Applying a false discovery rate threshold of 0.05, we identified 282 significant open reading frames, each containing two or more peptide matches. There were 627 novel peptides associated with these open reading frames that mapped to a unique genomic coordinate placed within the start/stop points of previously annotated genes. These peptides matched 1,110 distinct tandem MS spectra. Peptides fell into four categories based upon where their genomic coordinates placed them relative to annotated exons within the parent gene. Conclusion This work provides evidence for novel alternative splice variants in many previously annotated genes. These findings suggest that annotation of the genome is not yet complete and that proteomics has the potential to further add to our understanding of gene structures. PMID:16646984

  19. Investigation of Yersinia pestis Laboratory Adaptation through a Combined Genomics and Proteomics Approach

    SciTech Connect

    Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.; Kaiser, Brooke LD; Lin, Andy; Hutchison, Janine R.; Melville, Angela M.; Wagner, David M.; Keim, Paul S.; Foster, Jeff; Kreuzer, Helen W.

    2015-11-24

    The bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experiment (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.

  20. Genomic and Proteomic Studies on the Mode of Action of Oxaboroles against the African Trypanosome

    PubMed Central

    Jones, Deuan C.; Foth, Bernardo J.; Urbaniak, Michael D.; Patterson, Stephen; Ong, Han B.; Berriman, Matthew; Fairlamb, Alan H.

    2015-01-01

    SCYX-7158, an oxaborole, is currently in Phase I clinical trials for the treatment of human African trypanosomiasis. Here we investigate possible modes of action against Trypanosoma brucei using orthogonal chemo-proteomic and genomic approaches. SILAC-based proteomic studies using an oxaborole analogue immobilised onto a resin was used either in competition with a soluble oxaborole or an immobilised inactive control to identify thirteen proteins common to both strategies. Cell-cycle analysis of cells incubated with sub-lethal concentrations of an oxaborole identified a subtle but significant accumulation of G2 and >G2 cells. Given the possibility of compromised DNA fidelity, we investigated long-term exposure of T. brucei to oxaboroles by generating resistant cell lines in vitro. Resistance proved more difficult to generate than for drugs currently used in the field, and in one of our three cell lines was unstable. Whole-genome sequencing of the resistant cell lines revealed single nucleotide polymorphisms in 66 genes and several large-scale genomic aberrations. The absence of a simple consistent mechanism among resistant cell lines and the diverse list of binding partners from the proteomic studies suggest a degree of polypharmacology that should reduce the risk of resistance to this compound class emerging in the field. The combined genetic and chemical biology approaches have provided lists of candidates to be investigated for more detailed information on the mode of action of this promising new drug class. PMID:26684831

  1. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant

    PubMed Central

    Brooks, Brandon; Mueller, Ryan S.; Young, Jacque C.; Morowitz, Michael J.; Hettich, Robert L.; Banfield, Jillian F.

    2015-01-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13–21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains of Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential. PMID:26191049

  2. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant

    DOE PAGESBeta

    Brooks, Brandon; Mueller, R. S.; Young, Jacque C.; Morowitz, Michael J.; Robert L. Hettich; Banfield, Jillian F.

    2015-07-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13 21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains ofmore » Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential.« less

  3. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant.

    PubMed

    Brooks, Brandon; Mueller, Ryan S; Young, Jacque C; Morowitz, Michael J; Hettich, Robert L; Banfield, Jillian F

    2015-01-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13-21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains of Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential. PMID:26191049

  4. Cyanobacterial KnowledgeBase (CKB), a Compendium of Cyanobacterial Genomes and Proteomes

    PubMed Central

    Mohandass, Shylajanaciyar; Varadharaj, Sangeetha; Thilagar, Sivasudha; Abdul Kareem, Kaleel Ahamed; Dharmar, Prabaharan; Gopalakrishnan, Subramanian; Lakshmanan, Uma

    2015-01-01

    Cyanobacterial KnowledgeBase (CKB) is a free access database that contains the genomic and proteomic information of 74 fully sequenced cyanobacterial genomes belonging to seven orders. The database also contains tools for sequence analysis. The Species report and the gene report provide details about each species and gene (including sequence features and gene ontology annotations) respectively. The database also includes cyanoBLAST, an advanced tool that facilitates comparative analysis, among cyanobacterial genomes and genomes of E. coli (prokaryote) and Arabidopsis (eukaryote). The database is developed and maintained by the Sub-Distributed Informatics Centre (sponsored by the Department of Biotechnology, Govt. of India) of the National Facility for Marine Cyanobacteria, a facility dedicated to marine cyanobacterial research. CKB is freely available at http://nfmc.res.in/ckb/index.html. PMID:26305368

  5. Functional Insights from Structural Genomics

    SciTech Connect

    Forouhar,F.; Kuzin, A.; Seetharaman, J.; Lee, I.; Zhou, W.; Abashidze, M.; Chen, Y.; Montelione, G.; Tong, L.; et al

    2007-01-01

    Structural genomics efforts have produced structural information, either directly or by modeling, for thousands of proteins over the past few years. While many of these proteins have known functions, a large percentage of them have not been characterized at the functional level. The structural information has provided valuable functional insights on some of these proteins, through careful structural analyses, serendipity, and structure-guided functional screening. Some of the success stories based on structures solved at the Northeast Structural Genomics Consortium (NESG) are reported here. These include a novel methyl salicylate esterase with important role in plant innate immunity, a novel RNA methyltransferase (H. influenzae yggJ (HI0303)), a novel spermidine/spermine N-acetyltransferase (B. subtilis PaiA), a novel methyltransferase or AdoMet binding protein (A. fulgidus AF{_}0241), an ATP:cob(I)alamin adenosyltransferase (B. subtilis YvqK), a novel carboxysome pore (E. coli EutN), a proline racemase homolog with a disrupted active site (B. melitensis BME11586), an FMN-dependent enzyme (S. pneumoniae SP{_}1951), and a 12-stranded {beta}-barrel with a novel fold (V. parahaemolyticus VPA1032).

  6. Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu

    PubMed Central

    Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui

    2015-01-01

    Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat. PMID:26132381

  7. Genome-, Transcriptome- and Proteome-Wide Analyses of the Gliadin Gene Families in Triticum urartu.

    PubMed

    Zhang, Yanlin; Luo, Guangbin; Liu, Dongcheng; Wang, Dongzhi; Yang, Wenlong; Sun, Jiazhu; Zhang, Aimin; Zhan, Kehui

    2015-01-01

    Gliadins are the major components of storage proteins in wheat grains, and they play an essential role in the dough extensibility and nutritional quality of flour. Because of the large number of the gliadin family members, the high level of sequence identity, and the lack of abundant genomic data for Triticum species, identifying the full complement of gliadin family genes in hexaploid wheat remains challenging. Triticum urartu is a wild diploid wheat species and considered the A-genome donor of polyploid wheat species. The accession PI428198 (G1812) was chosen to determine the complete composition of the gliadin gene families in the wheat A-genome using the available draft genome. Using a PCR-based cloning strategy for genomic DNA and mRNA as well as a bioinformatics analysis of genomic sequence data, 28 gliadin genes were characterized. Of these genes, 23 were α-gliadin genes, three were γ-gliadin genes and two were ω-gliadin genes. An RNA sequencing (RNA-Seq) survey of the dynamic expression patterns of gliadin genes revealed that their synthesis in immature grains began prior to 10 days post-anthesis (DPA), peaked at 15 DPA and gradually decreased at 20 DPA. The accumulation of proteins encoded by 16 of the expressed gliadin genes was further verified and quantified using proteomic methods. The phylogenetic analysis demonstrated that the homologs of these α-gliadin genes were present in tetraploid and hexaploid wheat, which was consistent with T. urartu being the A-genome progenitor species. This study presents a systematic investigation of the gliadin gene families in T. urartu that spans the genome, transcriptome and proteome, and it provides new information to better understand the molecular structure, expression profiles and evolution of the gliadin genes in T. urartu and common wheat. PMID:26132381

  8. Novel Multivariate Methods for Integration of Genomics and Proteomics Data: Applications in a Kidney Transplant Rejection Study

    PubMed Central

    Günther, Oliver P.; Shin, Heesun; Ng, Raymond T.; McMaster, W. Robert; McManus, Bruce M.; Keown, Paul A.; Tebbutt, Scott. J.

    2014-01-01

    Abstract Multi-omics research is a key ingredient of data-intensive life sciences research, permitting measurement of biological molecules at different functional levels in the same individual. For a complete picture at the biological systems level, appropriate statistical techniques must however be developed to integrate different ‘omics’ data sets (e.g., genomics and proteomics). We report here multivariate projection-based analyses approaches to genomics and proteomics data sets, using the case study of and applications to observations in kidney transplant patients who experienced an acute rejection event (n=20) versus non-rejecting controls (n=20). In this data sets, we show how these novel methodologies might serve as promising tools for dimension reduction and selection of relevant features for different analytical frameworks. Unsupervised analyses highlighted the importance of post transplant time-of-rejection, while supervised analyses identified gene and protein signatures that together predicted rejection status with little time effect. The selected genes are part of biological pathways that are representative of immune responses. Gene enrichment profiles revealed increases in innate immune responses and neutrophil activities and a depletion of T lymphocyte related processes in rejection samples as compared to controls. In all, this article offers candidate biomarkers for future detection and monitoring of acute kidney transplant rejection, as well as ways forward for methodological advances to better harness multi-omics data sets. PMID:25387159

  9. Global analyses of Ceratocystis cacaofunesta mitochondria: from genome to proteome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background The ascomycete fungus Ceratocystis cacaofunesta is the causal agent of wilt disease in cacao, which results in significant economic losses in the affected producing areas. Despite the economic importance of the Ceratocystis complex of species, no genomic data are available for any of its ...

  10. Public Access for Teaching Genomics, Proteomics, and Bioinformatics

    ERIC Educational Resources Information Center

    Campbell, A. Malcolm

    2003-01-01

    When the human genome project was conceived, its leaders wanted all researchers to have equal access to the data and associated research tools. Their vision of equal access provides an unprecedented teaching opportunity. Teachers and students have free access to the same databases that researchers are using. Furthermore, the recent movement to…

  11. A New Literary Metaphor for the Genome or Proteome

    ERIC Educational Resources Information Center

    Pappas, Gus

    2005-01-01

    Previously, the idea of a blueprint has been used to explain the genome. The concept of a play's cast of characters, the Dramatis Personae, is a more fluid metaphor that allows for mutations and time-dependent phenomena to be taken into account. It also provides an educational and mnemonic exercise for students.

  12. Evolution of early eukaryotic cells: genomes, proteomes, and compartments.

    PubMed

    Bogorad, Lawrence

    2008-01-01

    Eukaryotes arose from an endosymbiotic association of an alpha-proteobacterium-like organism (the ancestor of mitochondria) with a host cell (lacking mitochondria or plastids). Plants arose by the addition of a cyanobacterium-like endosymbiont (the ancestor of plastids) to the two-member association. Each member of the association brought a unique internal environment and a unique genome. Analyses of recently acquired genomic sequences with newly developed algorithms have revealed (a) that the number of endosymbiont genes that remain in eukaryotic cells-principally in the nucleus-is surprisingly large, (b) that protein products of a large number of genes (or their descendents) that entered the association in the genome of the host are now directed to an organelle derived from an endosymbiont, and (c) that protein products of genes traceable to endosymbiont genomes are directed to the nucleo-cytoplasmic compartment. Consideration of these remarkable findings has led to the present suggestion that contemporary eukaryotic cells evolved through continual chance relocation and testing of genes as well as combinations of gene products and biochemical processes in each unique cell compartment derived from a member of the eukaryotic association. Most of these events occurred during about 300 million years, or so, before contemporary forms of eukaryotic cells appear in the fossil record; they continue today. PMID:17912611

  13. Investigation of Yersinia pestis Laboratory Adaptation through a Combined Genomics and Proteomics Approach

    PubMed Central

    Clowers, Brian H.; Deatherage Kaiser, Brooke L.; Lin, Andy; Hutchison, Janine R.; Melville, Angela M.; Wagner, David M.; Keim, Paul S.; Foster, Jeffrey T.; Kreuzer, Helen W.

    2015-01-01

    The bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a parallel serial passage experiment (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing, LC-MS/MS proteomic analysis, and GC/MS metabolomics. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS/MS proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism and cell envelope biogenesis. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement. PMID:26599979

  14. Investigation of Yersinia pestis Laboratory Adaptation through a Combined Genomics and Proteomics Approach.

    PubMed

    Leiser, Owen P; Merkley, Eric D; Clowers, Brian H; Deatherage Kaiser, Brooke L; Lin, Andy; Hutchison, Janine R; Melville, Angela M; Wagner, David M; Keim, Paul S; Foster, Jeffrey T; Kreuzer, Helen W

    2015-01-01

    The bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a parallel serial passage experiment (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing, LC-MS/MS proteomic analysis, and GC/MS metabolomics. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS/MS proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism and cell envelope biogenesis. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement. PMID:26599979

  15. Evaluation of a Genome-Scale In Silico Metabolic Model for Geobacter metallireducens by Using Proteomic Data from a Field Biostimulation Experiment

    PubMed Central

    Fang, Yilin; Yabusaki, Steven B.; Lipton, Mary S.; Long, Philip E.

    2012-01-01

    Accurately predicting the interactions between microbial metabolism and the physical subsurface environment is necessary to enhance subsurface energy development, soil and groundwater cleanup, and carbon management. This study was an initial attempt to confirm the metabolic functional roles within an in silico model using environmental proteomic data collected during field experiments. Shotgun global proteomics data collected during a subsurface biostimulation experiment were used to validate a genome-scale metabolic model of Geobacter metallireducens—specifically, the ability of the metabolic model to predict metal reduction, biomass yield, and growth rate under dynamic field conditions. The constraint-based in silico model of G. metallireducens relates an annotated genome sequence to the physiological functions with 697 reactions controlled by 747 enzyme-coding genes. Proteomic analysis showed that 180 of the 637 G. metallireducens proteins detected during the 2008 experiment were associated with specific metabolic reactions in the in silico model. When the field-calibrated Fe(III) terminal electron acceptor process reaction in a reactive transport model for the field experiments was replaced with the genome-scale model, the model predicted that the largest metabolic fluxes through the in silico model reactions generally correspond to the highest abundances of proteins that catalyze those reactions. Central metabolism predicted by the model agrees well with protein abundance profiles inferred from proteomic analysis. Model discrepancies with the proteomic data, such as the relatively low abundances of proteins associated with amino acid transport and metabolism, revealed pathways or flux constraints in the in silico model that could be updated to more accurately predict metabolic processes that occur in the subsurface environment. PMID:23042184

  16. Exome-based proteogenomics of HEK-293 human cell line: Coding genomic variants identified at the level of shotgun proteome.

    PubMed

    Lobas, Anna A; Karpov, Dmitry S; Kopylov, Arthur T; Solovyeva, Elizaveta M; Ivanov, Mark V; Ilina, Irina Y; Lazarev, Vassily N; Kuznetsova, Ksenia G; Ilgisonis, Ekaterina V; Zgoda, Victor G; Gorshkov, Mikhail V; Moshkovskii, Sergei A

    2016-07-01

    Genomic and proteomic data were integrated into the proteogenomic workflow to identify coding genomic variants of Human Embryonic Kidney 293 (HEK-293) cell line at the proteome level. Shotgun proteome data published by Geiger et al. (2012), Chick et al. (2015), and obtained in this work for HEK-293 were searched against the customized genomic database generated using exome data published by Lin et al. (2014). Overall, 112 unique variants were identified at the proteome level out of ∼1200 coding variants annotated in the exome. Seven identified variants were shared between all the three considered proteomic datasets, and 27 variants were found in any two datasets. Some of the found variants belonged to widely known genomic polymorphisms originated from the germline, while the others were more likely resulting from somatic mutations. At least, eight of the proteins bearing amino acid variants were annotated as cancer-related ones, including p53 tumor suppressor. In all the considered shotgun datasets, the variant peptides were at the ratio of 1:2.5 less likely being identified than the wild-type ones compared with the corresponding theoretical peptides. This can be explained by the presence of the so-called "passenger" mutations in the genes, which were never expressed in HEK-293 cells. All MS data have been deposited in the ProteomeXchange with the dataset identifier PXD002613 (http://proteomecentral.proteomexchange.org/dataset/PXD002613). PMID:27233776

  17. Quantitative reactivity profiling predicts functional cysteines in proteomes

    PubMed Central

    Weerapana, Eranthie; Wang, Chu; Simon, Gabriel M.; Richter, Florian; Khare, Sagar; Dillon, Myles B.D.; Bachovchin, Daniel A.; Mowen, Kerri; Baker, David; Cravatt, Benjamin F.

    2010-01-01

    Cysteine is the most intrinsically nucleophilic amino acid in proteins, where its reactivity is tuned to perform diverse biochemical functions. The absence of a consensus sequence that defines functional cysteines in proteins has hindered their discovery and characterization. Here, we describe a proteomics method to quantitatively profile the intrinsic reactivity of cysteine residues en masse directly in native biological systems. Hyperreactivity was a rare feature among cysteines and found to specify a wide range of activities, including nucleophilic and reductive catalysis and sites of oxidative modification. Hyperreactive cysteines were identified in several proteins of uncharacterized function, including a residue conserved across eukaryotic phylogeny that we show is required for yeast viability and involved in iron-sulfur protein biogenesis. Finally, we demonstrate that quantitative reactivity profiling can also form the basis for screening and functional assignment of cysteines in computationally designed proteins, where it discriminated catalytically active from inactive cysteine hydrolase designs. PMID:21085121

  18. Quantitative reactivity profiling predicts functional cysteines in proteomes.

    PubMed

    Weerapana, Eranthie; Wang, Chu; Simon, Gabriel M; Richter, Florian; Khare, Sagar; Dillon, Myles B D; Bachovchin, Daniel A; Mowen, Kerri; Baker, David; Cravatt, Benjamin F

    2010-12-01

    Cysteine is the most intrinsically nucleophilic amino acid in proteins, where its reactivity is tuned to perform diverse biochemical functions. The absence of a consensus sequence that defines functional cysteines in proteins has hindered their discovery and characterization. Here we describe a proteomics method to profile quantitatively the intrinsic reactivity of cysteine residues en masse directly in native biological systems. Hyper-reactivity was a rare feature among cysteines and it was found to specify a wide range of activities, including nucleophilic and reductive catalysis and sites of oxidative modification. Hyper-reactive cysteines were identified in several proteins of uncharacterized function, including a residue conserved across eukaryotic phylogeny that we show is required for yeast viability and is involved in iron-sulphur protein biogenesis. We also demonstrate that quantitative reactivity profiling can form the basis for screening and functional assignment of cysteines in computationally designed proteins, where it discriminated catalytically active from inactive cysteine hydrolase designs. PMID:21085121

  19. Medulloblastoma Exosome Proteomics Yield Functional Roles for Extracellular Vesicles

    PubMed Central

    Epple, Laura M.; Griffiths, Steve G.; Dechkovskaia, Anjelika M.; Dusto, Nathaniel L.; White, Jason; Ouellette, Rodney J.; Anchordoquy, Thomas J.; Bemis, Lynne T.; Graner, Michael W.

    2012-01-01

    Medulloblastomas are the most prevalent malignant pediatric brain tumors. Survival for these patients has remained largely the same for approximately 20 years, and our therapies for these cancers cause significant health, cognitive, behavioral and developmental sequelae for those who survive the tumor and their treatments. We obviously need a better understanding of the biology of these tumors, particularly with regard to their migratory/invasive behaviors, their proliferative propensity, and their abilities to deflect immune responses. Exosomes, virus-sized membrane vesicles released extracellularly from cells after formation in, and transit thru, the endosomal pathway, may play roles in medulloblastoma pathogenesis but are as yet unstudied in this disease. Here we characterized exosomes from a medulloblastoma cell line with biochemical and proteomic analyses, and included characterization of patient serum exosomes. Further scrutiny of the proteomic data suggested functional properties of the exosomes that are relevant to medulloblastoma tumor biology, including their roles as proliferation stimulants, their activities as attractants for tumor cell migration, and their immune modulatory impacts on lymphocytes. Aspects of this held true for exosomes from other medulloblastoma cell lines as well. Additionally, pathway analyses suggested a possible role for the transcription factor hepatocyte nuclear factor 4 alpha (HNF4A); however, inhibition of the protein’s activity actually increased D283MED cell proliferation/clonogenecity, suggesting that HNF4A may act as a tumor suppressor in this cell line. Our work demonstrates that relevant functional properties of exosomes may be derived from appropriate proteomic analyses, which translate into mechanisms of tumor pathophysiology harbored in these extracellular vesicles. PMID:22848702

  20. Evolution, language and analogy in functional genomics

    NASA Technical Reports Server (NTRS)

    Benner, S. A.; Gaucher, E. A.

    2001-01-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  1. Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled.

    PubMed

    Brbić, Maria; Warnecke, Tobias; Kriško, Anita; Supek, Fran

    2015-06-01

    The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and may confer advantages at the DNA/RNA level. These influences overlap in protein-coding sequences, making it difficult to gauge their relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts. However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little cross-species variability remains unexplained (91% of variance accounted for). Qualitatively similar results were obtained for 49 fungal genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes' ecological preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environment-related AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in overall AAC appear to occur almost exclusively through factors shaping the global oligonucleotide content of the genome. We discuss these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that adaptation takes place at the protein level. PMID:25971281

  2. Uncovering the Potential Pan Proteomes Encoded by Genomic Strand RNAs of Influenza A Viruses

    PubMed Central

    Yang, Chu-Wen; Chen, Mei-Fang

    2016-01-01

    Influenza A virus genomes are composed of eight negative sense RNAs. In total, 16 proteins encoded by eight positive sense RNAs were identified. One putative protein coding sequence (PCS) encoded by genomic strand RNA of segment 8 has been previously proposed. In this study, 95,608, 123,965 and 35,699 genomic strand RNA sequences from influenza A viruses from avian, human and mammalian hosts, respectively, were used to identify PCSs encoded by the genomic strand RNAs. In total, 326,069 PCSs with lengths equal to or longer than 80 amino acids were identified and clustered into 270 PCS groups. Twenty of the 270 PCS groups which have greater than 10% proportion in influenza A viruses from avian, human or mammalian hosts were selected for detailed study. Maps of the 20 PCSGs in the influenza A virus genomes were constructed. The proportions of the 20 PCSGs in influenza A viruses from different hosts and serotypes were analyzed. One secretory and five membrane proteins predicted from the PCS groups encoded by genomic strand RNAs of segments 1, 2, 4, 6, 7 and 8 were identified. These results suggest the possibility of the ambisense nature of the influenza A virus genomic RNAs and a potential coding sequence reservoir encoding potential pan proteomes of influenza A viruses. PMID:26761196

  3. Comparative analysis of predicted plastid-targeted proteomes of sequenced higher plant genomes.

    PubMed

    Schaeffer, Scott; Harper, Artemus; Raja, Rajani; Jaiswal, Pankaj; Dhingra, Amit

    2014-01-01

    Plastids are actively involved in numerous plant processes critical to growth, development and adaptation. They play a primary role in photosynthesis, pigment and monoterpene synthesis, gravity sensing, starch and fatty acid synthesis, as well as oil, and protein storage. We applied two complementary methods to analyze the recently published apple genome (Malus × domestica) to identify putative plastid-targeted proteins, the first using TargetP and the second using a custom workflow utilizing a set of predictive programs. Apple shares roughly 40% of its 10,492 putative plastid-targeted proteins with that of the Arabidopsis (Arabidopsis thaliana) plastid-targeted proteome as identified by the Chloroplast 2010 project and ∼57% of its entire proteome with Arabidopsis. This suggests that the plastid-targeted proteomes between apple and Arabidopsis are different, and interestingly alludes to the presence of differential targeting of homologs between the two species. Co-expression analysis of 2,224 genes encoding putative plastid-targeted apple proteins suggests that they play a role in plant developmental and intermediary metabolism. Further, an inter-specific comparison of Arabidopsis, Prunus persica (Peach), Malus × domestica (Apple), Populus trichocarpa (Black cottonwood), Fragaria vesca (Woodland Strawberry), Solanum lycopersicum (Tomato) and Vitis vinifera (Grapevine) also identified a large number of novel species-specific plastid-targeted proteins. This analysis also revealed the presence of alternatively targeted homologs across species. Two separate analyses revealed that a small subset of proteins, one representing 289 protein clusters and the other 737 unique protein sequences, are conserved between seven plastid-targeted angiosperm proteomes. Majority of the novel proteins were annotated to play roles in stress response, transport, catabolic processes, and cellular component organization. Our results suggest that the current state of knowledge regarding

  4. The genome and proteome of a Campylobacter coli bacteriophage vB_CcoM-IBB_35 reveal unusual features

    PubMed Central

    2012-01-01

    Background Campylobacter is the leading cause of foodborne diseases worldwide. Bacteriophages (phages) are naturally occurring predators of bacteria, ubiquitous in the environment, with high host specificity and thus considered an appealing option to control bacterial pathogens. Nevertheless for an effective use of phages as antimicrobial agents, it is important to understand phage biology which renders crucial the analysis of phage genomes and proteomes. The lack of sequence data from Campylobacter phages adds further importance to these studies. Methods vB_CcoM-IBB_35 is a broad lytic spectrum Myoviridae Campylobacter phage with high potential for therapeutic use. The genome of this phage was obtained by pyrosequencing and the sequence data was further analyzed. The proteomic analysis was performed by SDS-PAGE and Mass spectrometry. Results and conclusions The DNA sequence data of vB_CcoM-IBB_35 consists of five contigs for a total of 172,065 bp with an average GC content of 27%. Attempts to close the gaps between contigs were unsuccessful since the DNA preparations appear to contain substances that inhibited Taq and ϕ29 polymerases. From the 210 identified ORFs, around 60% represent proteins that were not functionally assigned. Homology exists with members of the Teequatrovirinae namely for T4 proteins involved in morphogenesis, nucleotide metabolism, transcription, DNA replication and recombination. Tandem mass spectrometric analysis revealed 38 structural proteins as part of the mature phage particle. Conclusions Genes encoding proteins involved in the carbohydrate metabolism along with several incidences of gene duplications, split genes with inteins and introns have been rarely found in other phage genomes yet are found in this phage. We identified the genes encoding for tail fibres and for the lytic cassette, this later, expressing enzymes for bacterial capsular polysaccharides (CPS) degradation, which has not been reported before for Campylobacter phages

  5. Investigation of Yersinia pestis laboratory adaptation through a combined genomics and proteomics approach

    DOE PAGESBeta

    Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.; Kaiser, Brooke L. Deatherage; Lin, Andy; Hutchison, Janine R.; Melville, Angela M.; Wagner, David M.; Keim, Paul S.; Foster, Jeff; et al

    2015-11-24

    Here, the bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experimentmore » (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.« less

  6. Single-cell-type Proteomics: Toward a Holistic Understanding of Plant Function*

    PubMed Central

    Dai, Shaojun; Chen, Sixue

    2012-01-01

    Multicellular organisms such as plants contain different types of cells with specialized functions. Analyzing the protein characteristics of each type of cell will not only reveal specific cell functions, but also enhance understanding of how an organism works. Most plant proteomics studies have focused on using tissues and organs containing a mixture of different cells. Recent single-cell-type proteomics efforts on pollen grains, guard cells, mesophyll cells, root hairs, and trichomes have shown utility. We expect that high resolution proteomic analyses will reveal novel functions in single cells. This review provides an overview of recent developments in plant single-cell-type proteomics. We discuss application of the approach for understanding important cell functions, and we consider the technical challenges of extending the approach to all plant cell types. Finally, we consider the integration of single-cell-type proteomics with transcriptomics and metabolomics with the goal of providing a holistic understanding of plant function. PMID:22982375

  7. Endosperm and Amyloplast Proteomes of Wheat Grain

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Advances in proteomics and genomics have improved our understanding of the gluten proteins, a complex and functionally important protein group. Proteomic approaches also have been used to identify other proteins that may play roles in wheat flour functionality, to assign genes for gluten proteins to...

  8. A functional proteomics approach to the comprehension of sarcoidosis.

    PubMed

    Landi, C; Bargagli, E; Carleo, A; Bianchi, L; Gagliardi, A; Cillis, G; Perari, M G; Refini, R M; Prasse, A; Bini, L; Rottoli, P

    2015-10-14

    Pulmonary sarcoidosis (Sar) is an idiopathic disease histologically typified by non-caseating epitheliod cell sarcoid granulomas. A cohort of 37 Sar patients with chronic persistent pulmonary disease was described in this study. BAL protein profiles from 9 of these Sar patients were compared with those from 8 smoker (SC) and 10 no-smoker controls (NSC) by proteomic approach. Principal Component Analysis was performed to clusterize the samples in the corresponding conditions highlighting a differential pattern profiles primarily in Sar than SC. Spot identification reveals thirty-four unique proteins involved in lipid, mineral, and vitamin Dmetabolism, and immuneregulation of macrophage function. Enrichment analysis has been elaborated by MetaCore, revealing 14-3-3ε, α1-antitrypsin, GSTP1, and ApoA1 as "central hubs". Process Network as well as Pathway Maps underline proteins involved in immune response and inflammation induced by complement system, innate inflammatory response and IL-6signalling. Disease Biomarker Network highlights Tuberculosis and COPD as pathologies that share biomarkers with sarcoidosis. In conclusion, Sar protein expression profile seems more similar to that of NSC than SC, conversely to other ILDs. Moreover, Disease Biomarker Network revealed several common features between Sar and TB, exhorting to orientate the future proteomics investigations also in comparative BALF analysis of Sar and TB. PMID:26342673

  9. Comparative Analysis of Genomics and Proteomics in the New Isolated Bacillus thuringiensis X022 Revealed the Metabolic Regulation Mechanism of Carbon Flux Following Cu(2+) Treatment.

    PubMed

    Quan, Meifang; Xie, Junyan; Liu, Xuemei; Li, Yang; Rang, Jie; Zhang, Tong; Zhou, Fengjuan; Xia, Liqiu; Hu, Shengbiao; Sun, Yunjun; Ding, Xuezhi

    2016-01-01

    Bacillus thuringiensis (Bt) X022 is a novel strain isolated from soil in China, and showed strong insecticidal activity against several Lepidopteran pests. In this work, we performed whole genome sequencing of this Bt strain using the next-generation sequencing technology, and further conducted a comparative analysis with the proteomics data of the specific spore-release period based on LC-MS/MS approach. The Bt X022 genome consisted of one circular chromosomal DNA and seven plasmids, which were further functionally annotated using the RAST server. Comparative analysis of insecticidal substances showed that X022 contained genes coding for three Cry proteins (Cry1Ac, Cry1Ia and Cry2Ab) and a vegetative insecticidal protein (Vip3A). However, three insecticidal crystal proteins (ICPs) (Cry1Ca, Cry1Ac and Cry1Da) were detected by proteomics in the spore-release period. Moreover, a putative biosynthetic gene cluster and the metabolic pathway for poly-β-hydroxybutyrate in Bt X022 were deduced based on the comparative analysis of genomic and proteomic data, which revealed the metabolic regulation mechanism of carbon flux correlated with increased production of ICPs caused by Cu(2+.) Hence, these results provided a deeper understanding of the genetic background and protein expression profile of Bt X022. This study established a foundation for directed genetic modification and further application of this new isolated Bt strain. PMID:27303381

  10. Comparative Analysis of Genomics and Proteomics in the New Isolated Bacillus thuringiensis X022 Revealed the Metabolic Regulation Mechanism of Carbon Flux Following Cu2+ Treatment

    PubMed Central

    Quan, Meifang; Xie, Junyan; Liu, Xuemei; Li, Yang; Rang, Jie; Zhang, Tong; Zhou, Fengjuan; Xia, Liqiu; Hu, Shengbiao; Sun, Yunjun; Ding, Xuezhi

    2016-01-01

    Bacillus thuringiensis (Bt) X022 is a novel strain isolated from soil in China, and showed strong insecticidal activity against several Lepidopteran pests. In this work, we performed whole genome sequencing of this Bt strain using the next-generation sequencing technology, and further conducted a comparative analysis with the proteomics data of the specific spore-release period based on LC-MS/MS approach. The Bt X022 genome consisted of one circular chromosomal DNA and seven plasmids, which were further functionally annotated using the RAST server. Comparative analysis of insecticidal substances showed that X022 contained genes coding for three Cry proteins (Cry1Ac, Cry1Ia and Cry2Ab) and a vegetative insecticidal protein (Vip3A). However, three insecticidal crystal proteins (ICPs) (Cry1Ca, Cry1Ac and Cry1Da) were detected by proteomics in the spore-release period. Moreover, a putative biosynthetic gene cluster and the metabolic pathway for poly-β-hydroxybutyrate in Bt X022 were deduced based on the comparative analysis of genomic and proteomic data, which revealed the metabolic regulation mechanism of carbon flux correlated with increased production of ICPs caused by Cu2+. Hence, these results provided a deeper understanding of the genetic background and protein expression profile of Bt X022. This study established a foundation for directed genetic modification and further application of this new isolated Bt strain. PMID:27303381

  11. The Discovery of Novel Genomic, Transcriptomic, and Proteomic Biomarkers in Cardiovascular and Peripheral Vascular Disease: The State of the Art

    PubMed Central

    de Franciscis, Stefano; Metzinger, Laurent; Serra, Raffaele

    2016-01-01

    Cardiovascular disease (CD) and peripheral vascular disease (PVD) are leading causes of mortality and morbidity in western countries and also responsible of a huge burden in terms of disability, functional decline, and healthcare costs. Biomarkers are measurable biological elements that reflect particular physiological or pathological states or predisposition towards diseases and they are currently widely studied in medicine and especially in CD. In this context, biomarkers can also be used to assess the severity or the evolution of several diseases, as well as the effectiveness of particular therapies. Genomics, transcriptomics, and proteomics have opened new windows on disease phenomena and may permit in the next future an effective development of novel diagnostic and prognostic medicine in order to better prevent or treat CD. This review will consider the current evidence of novel biomarkers with clear implications in the improvement of risk assessment, prevention strategies, and medical decision making in the field of CD. PMID:27298828

  12. Proteomics Analysis Reveals Overlapping Functions of Clustered Protocadherins*

    PubMed Central

    Han, Meng-Hsuan; Lin, Chengyi; Meng, Shuxia; Wang, Xiaozhong

    2010-01-01

    The three tandem-arrayed protocadherin (Pcdh) gene clusters, namely Pcdh-α, Pcdh-β, and Pcdh-γ, play important roles in the development of the vertebrate central nervous system. To gain insight into the molecular action of PCDHs, we performed a systematic proteomics analysis of PCDH-γ-associated protein complexes. We identified a list of 154 non-redundant proteins in the PCDH-γ complexes. This list includes nearly 30 members of clustered Pcdh-α, -β, and -γ families as core components of the complexes and additionally over 120 putative PCDH-associated proteins. We validated a selected subset of PCDH-γ-associated proteins using specific antibodies. Analysis of the identities of PCDH-associated proteins showed that the majority of them overlap with the proteomic profile of postsynaptic density preparations. Further analysis of membrane protein complexes revealed that several validated PCDH-γ-associated proteins exhibit reduced levels in Pcdh-γ-deficient brain tissues. Therefore, PCDH-γs are required for the integrity of the complexes. However, the size of the overall complexes and the abundance of many other proteins remained unchanged, raising a possibility that PCDH-αs and PCDH-βs might compensate for PCDH-γ function in complex formation. As a test of this idea, RNA interference knockdown of both PCDH-αs and PCDH-γs showed that PCDHs have redundant functions in regulating neuronal survival in the chicken spinal cord. Taken together, our data provide evidence that clustered PCDHs coexist in large protein complexes and have overlapping functions during vertebrate neural development. PMID:19843561

  13. Genome and Proteome of Campylobacter jejuni Bacteriophage NCTC 12673▿†

    PubMed Central

    Kropinski, Andrew M.; Arutyunov, Denis; Foss, Mary; Cunningham, Anna; Ding, Wen; Singh, Amit; Pavlov, Andrey R.; Henry, Matthew; Evoy, Stephane; Kelly, John; Szymanski, Christine M.

    2011-01-01

    Campylobacter jejuni continues to be the leading cause of bacterial food-borne illness worldwide, so improvements to current methods used for bacterial detection and disease prevention are needed. We describe here the genome and proteome of C. jejuni bacteriophage NCTC 12673 and the exploitation of its receptor-binding protein for specific bacterial detection. Remarkably, the 135-kb Myoviridae genome of NCTC 12673 differs greatly from any other proteobacterial phage genome described (including C. jejuni phages CP220 and CPt10) and instead shows closest homology to the cyanobacterial T4-related myophages. The phage genome contains 172 putative open reading frames, including 12 homing endonucleases, no visible means of packaging, and a putative trans-splicing intein. The phage DNA appears to be strongly associated with a protein that interfered with PCR amplification and estimation of the phage genome mass by pulsed-field gel electrophoresis. Identification and analyses of the receptor-binding protein (Gp48) revealed features common to the Salmonella enterica P22 phage tailspike protein, including the ability to specifically recognize a host organism. Bacteriophage receptor-binding proteins may offer promising alternatives for use in pathogen detection platforms. PMID:21965409

  14. Comparative proteomics reveals a significant bias toward alternative protein isoforms with conserved structure and function.

    PubMed

    Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L

    2012-09-01

    Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of "novel" and "putative" protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and

  15. Genomic and Proteomic Analyses of the Fungus Arthrobotrys oligospora Provide Insights into Nematode-Trap Formation

    PubMed Central

    Feng, Yun; Li, Xiaomin; Zou, Chenggang; Xu, Jianping; Ren, Yan; Mi, Qili; Wu, Junli; Liu, Shuqun; Liu, Yu; Huang, Xiaowei; Wang, Haiyan; Niu, Xuemei; Li, Juan; Liang, Lianming; Luo, Yanlu; Ji, Kaifang; Zhou, Wei; Yu, Zefen; Li, Guohong; Liu, Yajun; Li, Lei; Qiao, Min; Feng, Lu; Zhang, Ke-Qin

    2011-01-01

    Nematode-trapping fungi are “carnivorous” and attack their hosts using specialized trapping devices. The morphological development of these traps is the key indicator of their switch from saprophytic to predacious lifestyles. Here, the genome of the nematode-trapping fungus Arthrobotrys oligospora Fres. (ATCC24927) was reported. The genome contains 40.07 Mb assembled sequence with 11,479 predicted genes. Comparative analysis showed that A. oligospora shared many more genes with pathogenic fungi than with non-pathogenic fungi. Specifically, compared to several sequenced ascomycete fungi, the A. oligospora genome has a larger number of pathogenicity-related genes in the subtilisin, cellulase, cellobiohydrolase, and pectinesterase gene families. Searching against the pathogen-host interaction gene database identified 398 homologous genes involved in pathogenicity in other fungi. The analysis of repetitive sequences provided evidence for repeat-induced point mutations in A. oligospora. Proteomic and quantitative PCR (qPCR) analyses revealed that 90 genes were significantly up-regulated at the early stage of trap-formation by nematode extracts and most of these genes were involved in translation, amino acid metabolism, carbohydrate metabolism, cell wall and membrane biogenesis. Based on the combined genomic, proteomic and qPCR data, a model for the formation of nematode trapping device in this fungus was proposed. In this model, multiple fungal signal transduction pathways are activated by its nematode prey to further regulate downstream genes associated with diverse cellular processes such as energy metabolism, biosynthesis of the cell wall and adhesive proteins, cell division, glycerol accumulation and peroxisome biogenesis. This study will facilitate the identification of pathogenicity-related genes and provide a broad foundation for understanding the molecular and evolutionary mechanisms underlying fungi-nematodes interactions. PMID:21909256

  16. Quantitative proteomic analysis of mitochondrial proteins reveals prosurvival mechanisms in the perpetuation of radiation-induced genomic instability

    PubMed Central

    Thomas, Stefani N.; Waters, Katrina M.; Morgan, William F.

    2016-01-01

    Radiation-induced genomic instability is a well-studied phenomenon that is measured as mitotically heritable genetic alterations observed in the progeny of an irradiated cell. The mechanisms that perpetuate this instability are unclear; however, a role for chronic oxidative stress has consistently been demonstrated. In the chromosomally unstable LS12 cell line, oxidative stress and genomic instability were correlated with mitochondrial dysfunction. To clarify this mitochondrial dysfunction and gain insight into the mechanisms underlying radiation-induced genomic instability we have evaluated the mitochondrial subproteome and performed quantitative mass spectrometry analysis of LS12 cells. Of 98 quantified mitochondrial proteins, 17 met criteria for fold changes and reproducibility; and 11 were statistically significant in comparison with the stable parental GM10115 cell line. Previous observations implicated defects in the electron transport chain (ETC) in the LS12 cell mitochondrial dysfunction. Proteomic analysis supports these observations, demonstrating significantly reduced levels of mitochondrial cytochrome c, the intermediary between complexes III and IV of the ETC. Results also suggest that LS12 cells compensate for ETC dysfunction and oxidative stress through increased levels of tricarboxylic acid cycle enzymes and upregulation of proteins that protect against oxidative stress and apoptosis. More than one cellular defect is likely to contribute to the genomic instability phenotype, and evaluation of gene and microRNA expression suggests that epigenetics play a role in the phenotype. These data suggest that LS12 cells have adapted mechanisms that allow survival under suboptimal conditions of oxidative stress and compromised mitochondrial function to perpetuate genomic instability. PMID:22569412

  17. Streptococcus iniae SF1: Complete Genome Sequence, Proteomic Profile, and Immunoprotective Antigens

    PubMed Central

    Zhang, Bao-cun; Zhang, Jian; Sun, Li

    2014-01-01

    Streptococcus iniae is a Gram-positive bacterium that is reckoned one of the most severe aquaculture pathogens. It has a broad host range among farmed marine and freshwater fish and can also cause zoonotic infection in humans. Here we report for the first time the complete genome sequence as well as the host factor-induced proteomic profile of a pathogenic S. iniae strain, SF1, a serotype I isolate from diseased fish. SF1 possesses a single chromosome of 2,149,844 base pairs, which contains 2,125 predicted protein coding sequences (CDS), 12 rRNA genes, and 45 tRNA genes. Among the protein-encoding CDS are genes involved in resource acquisition and utilization, signal sensing and transduction, carbohydrate metabolism, and defense against host immune response. Potential virulence genes include those encoding adhesins, autolysins, toxins, exoenzymes, and proteases. In addition, two putative prophages and a CRISPR-Cas system were found in the genome, the latter containing a CRISPR locus and four cas genes. Proteomic analysis detected 21 secreted proteins whose expressions were induced by host serum. Five of the serum-responsive proteins were subjected to immunoprotective analysis, which revealed that two of the proteins were highly protective against lethal S. iniae challenge when used as purified recombinant subunit vaccines. Taken together, these results provide an important molecular basis for future study of S. iniae in various aspects, in particular those related to pathogenesis and disease control. PMID:24621602

  18. Integration and Querying of Genomic and Proteomic Semantic Annotations for Biomedical Knowledge Extraction.

    PubMed

    Masseroli, Marco; Canakoglu, Arif; Ceri, Stefano

    2016-01-01

    Understanding complex biological phenomena involves answering complex biomedical questions on multiple biomolecular information simultaneously, which are expressed through multiple genomic and proteomic semantic annotations scattered in many distributed and heterogeneous data sources; such heterogeneity and dispersion hamper the biologists' ability of asking global queries and performing global evaluations. To overcome this problem, we developed a software architecture to create and maintain a Genomic and Proteomic Knowledge Base (GPKB), which integrates several of the most relevant sources of such dispersed information (including Entrez Gene, UniProt, IntAct, Expasy Enzyme, GO, GOA, BioCyc, KEGG, Reactome, and OMIM). Our solution is general, as it uses a flexible, modular, and multilevel global data schema based on abstraction and generalization of integrated data features, and a set of automatic procedures for easing data integration and maintenance, also when the integrated data sources evolve in data content, structure, and number. These procedures also assure consistency, quality, and provenance tracking of all integrated data, and perform the semantic closure of the hierarchical relationships of the integrated biomedical ontologies. At http://www.bioinformatics.deib.polimi.it/GPKB/, a Web interface allows graphical easy composition of queries, although complex, on the knowledge base, supporting also semantic query expansion and comprehensive explorative search of the integrated data to better sustain biomedical knowledge extraction. PMID:27045824

  19. Representative proteomes: a stable, scalable and unbiased proteome set for sequence analysis and functional annotation.

    PubMed

    Chen, Chuming; Natale, Darren A; Finn, Robert D; Huang, Hongzhan; Zhang, Jian; Wu, Cathy H; Mazumder, Raja

    2011-01-01

    The accelerating growth in the number of protein sequences taxes both the computational and manual resources needed to analyze them. One approach to dealing with this problem is to minimize the number of proteins subjected to such analysis in a way that minimizes loss of information. To this end we have developed a set of Representative Proteomes (RPs), each selected from a Representative Proteome Group (RPG) containing similar proteomes calculated based on co-membership in UniRef50 clusters. A Representative Proteome is the proteome that can best represent all the proteomes in its group in terms of the majority of the sequence space and information. RPs at 75%, 55%, 35% and 15% co-membership threshold (CMT) are provided to allow users to decrease or increase the granularity of the sequence space based on their requirements. We find that a CMT of 55% (RP55) most closely follows standard taxonomic classifications. Further analysis of this set reveals that sequence space is reduced by more than 80% relative to UniProtKB, while retaining both sequence diversity (over 95% of InterPro domains) and annotation information (93% of experimentally characterized proteins). All sets can be browsed and are available for sequence similarity searches and download at http://www.proteininformationresource.org/rps, while the set of 637 RPs determined using a 55% CMT are also available for text searches. Potential applications include sequence similarity searches, protein classification and targeted protein annotation and characterization. PMID:21556138

  20. Open chromatin reveals the functional maize genome

    PubMed Central

    Rodgers-Melnick, Eli; Vera, Daniel L.; Bass, Hank W.

    2016-01-01

    Cellular processes mediated through nuclear DNA must contend with chromatin. Chromatin structural assays can efficiently integrate information across diverse regulatory elements, revealing the functional noncoding genome. In this study, we use a differential nuclease sensitivity assay based on micrococcal nuclease (MNase) digestion to discover open chromatin regions in the maize genome. We find that maize MNase-hypersensitive (MNase HS) regions localize around active genes and within recombination hotspots, focusing biased gene conversion at their flanks. Although MNase HS regions map to less than 1% of the genome, they consistently explain a remarkably large amount (∼40%) of heritable phenotypic variance in diverse complex traits. MNase HS regions are therefore on par with coding sequences as annotations that demarcate the functional parts of the maize genome. These results imply that less than 3% of the maize genome (coding and MNase HS regions) may give rise to the overwhelming majority of phenotypic variation, greatly narrowing the scope of the functional genome. PMID:27185945

  1. Open chromatin reveals the functional maize genome.

    PubMed

    Rodgers-Melnick, Eli; Vera, Daniel L; Bass, Hank W; Buckler, Edward S

    2016-05-31

    Cellular processes mediated through nuclear DNA must contend with chromatin. Chromatin structural assays can efficiently integrate information across diverse regulatory elements, revealing the functional noncoding genome. In this study, we use a differential nuclease sensitivity assay based on micrococcal nuclease (MNase) digestion to discover open chromatin regions in the maize genome. We find that maize MNase-hypersensitive (MNase HS) regions localize around active genes and within recombination hotspots, focusing biased gene conversion at their flanks. Although MNase HS regions map to less than 1% of the genome, they consistently explain a remarkably large amount (∼40%) of heritable phenotypic variance in diverse complex traits. MNase HS regions are therefore on par with coding sequences as annotations that demarcate the functional parts of the maize genome. These results imply that less than 3% of the maize genome (coding and MNase HS regions) may give rise to the overwhelming majority of phenotypic variation, greatly narrowing the scope of the functional genome. PMID:27185945

  2. Brewing yeast genomes and genome-wide expression and proteome profiling during fermentation.

    PubMed

    Smart, Katherine A

    2007-11-01

    The genome structure, ancestry and instability of the brewing yeast strains have received considerable attention. The hybrid nature of brewing lager yeast strains provides adaptive potential but yields genome instability which can adversely affect fermentation performance. The requirement to differentiate between production strains and assess master cultures for genomic instability has led to significant adoption of specialized molecular tool kits by the industry. Furthermore, the development of genome-wide transcriptional and protein expression technologies has generated significant interest from brewers. The opportunity presented to explore, and the concurrent requirement to understand both, the constraints and potential of their strains to generate existing and new products during fermentation is discussed. PMID:17879324

  3. Resources for Functional Genomics Studies in Drosophila melanogaster

    PubMed Central

    Mohr, Stephanie E.; Hu, Yanhui; Kim, Kevin; Housden, Benjamin E.; Perrimon, Norbert

    2014-01-01

    Drosophila melanogaster has become a system of choice for functional genomic studies. Many resources, including online databases and software tools, are now available to support design or identification of relevant fly stocks and reagents or analysis and mining of existing functional genomic, transcriptomic, proteomic, etc. datasets. These include large community collections of fly stocks and plasmid clones, “meta” information sites like FlyBase and FlyMine, and an increasing number of more specialized reagents, databases, and online tools. Here, we introduce key resources useful to plan large-scale functional genomics studies in Drosophila and to analyze, integrate, and mine the results of those studies in ways that facilitate identification of highest-confidence results and generation of new hypotheses. We also discuss ways in which existing resources can be used and might be improved and suggest a few areas of future development that would further support large- and small-scale studies in Drosophila and facilitate use of Drosophila information by the research community more generally. PMID:24653003

  4. Project management system for structural and functional proteomics: Sesame.

    PubMed

    Zolnai, Zsolt; Lee, Peter T; Li, Jing; Chapman, Michael R; Newman, Craig S; Phillips, George N; Rayment, Ivan; Ulrich, Eldon L; Volkman, Brian F; Markley, John L

    2003-01-01

    A computing infrastructure (Sesame) has been designed to manage and link individual steps in complex projects. Sesame is being developed to support a large-scale structural proteomics pilot project. When complete, the system is expected to manage all steps from target selection to data-bank deposition and report writing. We report here on the design criteria of the Sesame system and on results demonstrating successful achievement of the basic goals of its architecture. The Sesame software package, which follows the client/server paradigm, consists of a framework, which supports secure interactions among the three tiers of the system (the client, server, and database tiers), and application modules that carry out specific tasks. The framework utilizes industry standards. The client tier is written in Java2 and can be accessed anywhere through the Internet. All the development on the server tier is also carried out in Java2 so as to accommodate a wide variety of computer platforms. The database tier employs a commercial database management system. Each Sesame application module consists of a simple user interface in the client tier, corresponding objects in the server tier, and relevant data stored in the centralized database. For security, access to stored data is controlled by access privileges. The system facilitates both local and remote collaborations. Because users interact with the system using Java Web Start or through a web browser, access is limited only by the availability of an Internet connection. We describe several Sesame modules that have been developed to the point where they are being utilized routinely to support steps involved in structural and functional proteomics. This software is available to parties interested in using it and assisting to guide its further development. PMID:12943363

  5. Community genomic and proteomic analysis of chemoautotrophic, iron-oxidizing "Leptospirillum rubarum" (Group II) and Leptospirillum ferrodiazotrophum (Group III) in acid mine drainage biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum Groups II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, CA acid mine drainage (AMD) biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum Groups II and III, respectively and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and > 60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid encodes conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacteria have genes for community-essential functions, including carbon fixation, biosynthesis of vitamins, fatty acids and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum Group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum Group II uses a methyl-dependent and Leptospirillum Group III a methyl-independent response pathway. Although only Leptospirillum Group III can fix nitrogen, these proteins were not identified by proteomics. Abundances of core proteins are similar in all communities, but abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum Groups II and III.

  6. Community Genomic and Proteomic Analyses of Chemoautotrophic Iron-Oxidizing "Leptospirillum rubarum" (Group II) and "Leptospirillum ferrodiazotrophum" (Group III) Bacteria in Acid Mine Drainage Biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum group II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, Iron Mountain, CA, acid mine drainage biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum groups II and III, respectively, and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and >60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid carries conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacterial groups have genes for community-essential functions, including carbon fixation and biosynthesis of vitamins, fatty acids, and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum group II uses a methyl-dependent and Leptospirillum group III a methyl-independent response pathway. Although only Leptospirillum group III can fix nitrogen, these proteins were not identified by proteomics. The abundances of core proteins are similar in all communities, but the abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum groups II and III.

  7. Functional Networks of Highest-Connected Splice Isoforms: From The Chromosome 17 Human Proteome Project.

    PubMed

    Li, Hong-Dong; Menon, Rajasree; Govindarajoo, Brandon; Panwar, Bharat; Zhang, Yang; Omenn, Gilbert S; Guan, Yuanfang

    2015-09-01

    Alternative splicing allows a single gene to produce multiple transcript-level splice isoforms from which the translated proteins may show differences in their expression and function. Identifying the major functional or canonical isoform is important for understanding gene and protein functions. Identification and characterization of splice isoforms is a stated goal of the HUPO Human Proteome Project and of neXtProt. Multiple efforts have catalogued splice isoforms as "dominant", "principal", or "major" isoforms based on expression or evolutionary traits. In contrast, we recently proposed highest connected isoforms (HCIs) as a new class of canonical isoforms that have the strongest interactions in a functional network and revealed their significantly higher (differential) transcript-level expression compared to nonhighest connected isoforms (NCIs) regardless of tissues/cell lines in the mouse. HCIs and their expression behavior in the human remain unexplored. Here we identified HCIs for 6157 multi-isoform genes using a human isoform network that we constructed by integrating a large compendium of heterogeneous genomic data. We present examples for pairs of transcript isoforms of ABCC3, RBM34, ERBB2, and ANXA7. We found that functional networks of isoforms of the same gene can show large differences. Interestingly, differential expression between HCIs and NCIs was also observed in the human on an independent set of 940 RNA-seq samples across multiple tissues, including heart, kidney, and liver. Using proteomic data from normal human retina and placenta, we showed that HCIs are a promising indicator of expressed protein isoforms exemplified by NUDFB6 and M6PR. Furthermore, we found that a significant percentage (20%, p = 0.0003) of human and mouse HCIs are homologues, suggesting their conservation between species. Our identified HCIs expand the repertoire of canonical isoforms and are expected to facilitate studying main protein products, understanding gene

  8. Exploration of Panviral Proteome: High-Throughput Cloning and Functional Implications in Virus-host Interactions

    PubMed Central

    Yu, Xiaobo; Bian, Xiaofang; Throop, Andrea; Song, Lusheng; Moral, Lerys Del; Park, Jin; Seiler, Catherine; Fiacco, Michael; Steel, Jason; Hunter, Preston; Saul, Justin; Wang, Jie; Qiu, Ji; Pipas, James M.; LaBaer, Joshua

    2014-01-01

    Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies. PMID:24955142

  9. Functional Complexity of the Axonal Growth Cone: A Proteomic Analysis

    PubMed Central

    Estrada-Bernal, Adriana; Sanford, Staci D.; Sosa, Lucas J.; Simon, Glenn C.; Hansen, Kirk C.; Pfenninger, Karl H.

    2012-01-01

    The growth cone, the tip of the emerging neurite, plays a crucial role in establishing the wiring of the developing nervous system. We performed an extensive proteomic analysis of axonal growth cones isolated from the brains of fetal Sprague-Dawley rats. Approximately 2000 proteins were identified at ≥99% confidence level. Using informatics, including functional annotation cluster and KEGG pathway analysis, we found great diversity of proteins involved in axonal pathfinding, cytoskeletal remodeling, vesicular traffic and carbohydrate metabolism, as expected. We also found a large and complex array of proteins involved in translation, protein folding, posttranslational processing, and proteasome/ubiquitination-dependent degradation. Immunofluorescence studies performed on hippocampal neurons in culture confirmed the presence in the axonal growth cone of proteins representative of these processes. These analyses also provide evidence for rough endoplasmic reticulum and reveal a reticular structure equipped with Golgi-like functions in the axonal growth cone. Furthermore, Western blot revealed the growth cone enrichment, relative to fetal brain homogenate, of some of the proteins involved in protein synthesis, folding and catabolism. Our study provides a resource for further research and amplifies the relatively recently developed concept that the axonal growth cone is equipped with proteins capable of performing a highly diverse range of functions. PMID:22384089

  10. A peptide identification-free, genome sequence-independent shotgun proteomics workflow for strain-level bacterial differentiation

    PubMed Central

    Shao, Wenguang; Zhang, Min; Lam, Henry; Lau, Stanley C. K.

    2015-01-01

    Shotgun proteomics is an emerging tool for bacterial identification and differentiation. However, the identification of the mass spectra of peptides to genome-derived peptide sequences remains a key issue that limits the use of shotgun proteomics to bacteria with genome sequences available. In this proof-of-concept study, we report a novel bacterial fingerprinting method that enjoys the resolving power and accuracy of mass spectrometry without the burden of peptide identification (i.e. genome sequence-independent). This method uses a similarity-clustering algorithm to search for mass spectra that are derived from the same peptide and merge them into a unique consensus spectrum as the basis to generate proteomic fingerprints of bacterial isolates. In comparison to a traditional peptide identification-based shotgun proteomics workflow and a PCR-based DNA fingerprinting method targeting the repetitive extragenic palindromes elements in bacterial genomes, the novel method generated fingerprints that were richer in information and more discriminative in differentiating E. coli isolates by their animal sources. The novel method is readily deployable to any cultivable bacteria, and may be used for several fields of study such as environmental microbiology, applied microbiology, and clinical microbiology. PMID:26395646

  11. A peptide identification-free, genome sequence-independent shotgun proteomics workflow for strain-level bacterial differentiation.

    PubMed

    Shao, Wenguang; Zhang, Min; Lam, Henry; Lau, Stanley C K

    2015-01-01

    Shotgun proteomics is an emerging tool for bacterial identification and differentiation. However, the identification of the mass spectra of peptides to genome-derived peptide sequences remains a key issue that limits the use of shotgun proteomics to bacteria with genome sequences available. In this proof-of-concept study, we report a novel bacterial fingerprinting method that enjoys the resolving power and accuracy of mass spectrometry without the burden of peptide identification (i.e. genome sequence-independent). This method uses a similarity-clustering algorithm to search for mass spectra that are derived from the same peptide and merge them into a unique consensus spectrum as the basis to generate proteomic fingerprints of bacterial isolates. In comparison to a traditional peptide identification-based shotgun proteomics workflow and a PCR-based DNA fingerprinting method targeting the repetitive extragenic palindromes elements in bacterial genomes, the novel method generated fingerprints that were richer in information and more discriminative in differentiating E. coli isolates by their animal sources. The novel method is readily deployable to any cultivable bacteria, and may be used for several fields of study such as environmental microbiology, applied microbiology, and clinical microbiology. PMID:26395646

  12. Characterization of the Genome, Proteome, and Structure of Yersiniophage ϕR1-37

    PubMed Central

    Hyytiäinen, Heidi J.; Happonen, Lotta J.; Kiljunen, Saija; Datta, Neeta; Mattinen, Laura; Williamson, Kirsty; Kristo, Paula; Szeliga, Magdalena; Kalin-Mänttäri, Laura; Ahola-Iivarinen, Elina; Kalkkinen, Nisse; Butcher, Sarah J.

    2012-01-01

    The bacteriophage vB_YecM-ϕR1-37 (ϕR1-37) is a lytic yersiniophage that can propagate naturally in different Yersinia species carrying the correct lipopolysaccharide receptor. This large-tailed phage has deoxyuridine (dU) instead of thymidine in its DNA. In this study, we determined the genomic sequence of phage ϕR1-37, mapped parts of the phage transcriptome, characterized the phage particle proteome, and characterized the virion structure by cryo-electron microscopy and image reconstruction. The 262,391-bp genome of ϕR1-37 is one of the largest sequenced phage genomes, and it contains 367 putative open reading frames (ORFs) and 5 tRNA genes. Mass-spectrometric analysis identified 69 phage particle structural proteins with the genes scattered throughout the genome. A total of 269 of the ORFs (73%) lack homologues in sequence databases. Based on terminator and promoter sequences identified from the intergenic regions, the phage genome was predicted to consist of 40 to 60 transcriptional units. Image reconstruction revealed that the ϕR1-37 capsid consists of hexameric capsomers arranged on a T=27 lattice similar to the bacteriophage ϕKZ. The tail of ϕR1-37 has a contractile sheath. We conclude that phage ϕR1-37 is a representative of a novel phage type that carries the dU-containing genome in a ϕKZ-like head. PMID:22973030

  13. Ascribing Functions to Genes: Journey Towards Genetic Improvement of Rice Via Functional Genomics.

    PubMed

    Mustafiz, Ananda; Kumari, Sumita; Karan, Ratna

    2016-06-01

    Rice, one of the most important cereal crops for mankind, feeds more than half the world population. Rice has been heralded as a model cereal owing to its small genome size, amenability to easy transformation, high synteny to other cereal crops and availability of complete genome sequence. Moreover, sequence wealth in rice is getting more refined and precise due to resequencing efforts. This humungous resource of sequence data has confronted research fraternity with a herculean challenge as well as an excellent opportunity to functionally validate expressed as well as regulatory portions of the genome. This will not only help us in understanding the genetic basis of plant architecture and physiology but would also steer us towards developing improved cultivars. No single technique can achieve such a mammoth task. Functional genomics through its diverse tools viz. loss and gain of function mutants, multifarious omics strategies like transcriptomics, proteomics, metabolomics and phenomics provide us with the necessary handle. A paradigm shift in technological advances in functional genomics strategies has been instrumental in generating considerable amount of information w.r.t functionality of rice genome. We now have several databases and online resources for functionally validated genes but despite that we are far from reaching the desired milestone of functionally characterizing each and every rice gene. There is an urgent need for a common platform, for information already available in rice, and collaborative efforts between researchers in a concerted manner as well as healthy public-private partnership, for genetic improvement of rice crop better able to handle the pressures of climate change and exponentially increasing population. PMID:27252584

  14. Enriching the annotation of Mycobacterium tuberculosis H37Rv proteome using remote homology detection approaches: insights into structure and function.

    PubMed

    Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy

    2015-01-01

    The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better

  15. Functional genomics and cancer drug target discovery.

    PubMed

    Moody, Susan E; Boehm, Jesse S; Barbie, David A; Hahn, William C

    2010-06-01

    The recent development of technologies for whole-genome sequencing, copy number analysis and expression profiling enables the generation of comprehensive descriptions of cancer genomes. However, although the structural analysis and expression profiling of tumors and cancer cell lines can allow the identification of candidate molecules that are altered in the malignant state, functional analyses are necessary to confirm such genes as oncogenes or tumor suppressors. Moreover, recent research suggests that tumor cells also depend on synthetic lethal targets, which are not mutated or amplified in cancer genomes; functional genomics screening can facilitate the discovery of such targets. This review provides an overview of the tools available for the study of functional genomics, and discusses recent research involving the use of these tools to identify potential novel drug targets in cancer. PMID:20521217

  16. Molecular targets of TBBPA in zebrafish analysed through integration of genomic and proteomic approaches.

    PubMed

    De Wit, Marijke; Keil, Dorien; Remmerie, Noor; van der Ven, Karlijn; van den Brandhof, Evert-Jan; Knapen, Dries; Witters, Erwin; De Coen, Wim

    2008-12-01

    Tetrabromobisphenol-A (TBBPA) is nowadays one of the most frequently used brominated flame retardants (BFRs) and can be considered as a high production volume chemical. Over the last decade, numerous reports of increasing concentrations of BFRs in the environment and humans have been published. However, the toxicological knowledge on TBBPA, and more specifically its molecular mode of action, is rather fragmentary. In this study two populations of adult zebrafish (Danio rerio) were exposed for 14 days to 0.75 microM and 1.5 microM TBBPA. Subsequently, we employed a combined transcriptomic and proteomic approach to evaluate the molecular effects of TBBPA in zebrafish liver. Oligonucleotide microarrays were used to study the effects on gene expression levels. These results were validated through real-time PCR. The proteome of the liver was analysed by means of differential in-gel electrophoresis (DiGE), an innovative application of traditional 2D-PAGE. Combination of the extracted datasets allowed reassembling of individual molecular responses into a comprehensive overview of affected molecular pathways. Interpretation of the results depicted an interference of thyroid and Vitamin A homeostasis in the exposed zebrafish, TBBPA also elicited responses indicating onset of oxidative stress and general stress responses. Additionally, numerous differentially expressed transcripts could be associated with defence mechanisms or corresponded to metabolizing enzymes. Furthermore, cellular metabolism was clearly affected, illustrated as disturbance of e.g. lipid, carbohydrate, and organic acid metabolic processes. Summarizing, these results enabled us to hypothesize several working mechanisms of TBBPA and demonstrated the potential of a combined genome and proteome approach to generate detailed mechanistic toxicological information. PMID:18976794

  17. Statistical Methods for Proteomic Biomarker Discovery based on Feature Extraction or Functional Modeling Approaches*

    PubMed Central

    Morris, Jeffrey S.

    2012-01-01

    In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational aspects of comparative proteomic studies, and summarizes contributions I along with numerous collaborators have made. First, there is an overview of comparative proteomics technologies, followed by a discussion of important experimental design and preprocessing issues that must be considered before statistical analysis can be done. Next, the two key approaches to analyzing proteomics data, feature extraction and functional modeling, are described. Feature extraction involves detection and quantification of discrete features like peaks or spots that theoretically correspond to different proteins in the sample. After an overview of the feature extraction approach, specific methods for mass spectrometry (Cromwell) and 2D gel electrophoresis (Pinnacle) are described. The functional modeling approach involves modeling the proteomic data in their entirety as functions or images. A general discussion of the approach is followed by the presentation of a specific method that can be applied, wavelet-based functional mixed models, and its extensions. All methods are illustrated by application to two example proteomic data sets, one from mass spectrometry and one from 2D gel electrophoresis. While the specific methods

  18. Statistical Methods for Proteomic Biomarker Discovery based on Feature Extraction or Functional Modeling Approaches.

    PubMed

    Morris, Jeffrey S

    2012-01-01

    In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational aspects of comparative proteomic studies, and summarizes contributions I along with numerous collaborators have made. First, there is an overview of comparative proteomics technologies, followed by a discussion of important experimental design and preprocessing issues that must be considered before statistical analysis can be done. Next, the two key approaches to analyzing proteomics data, feature extraction and functional modeling, are described. Feature extraction involves detection and quantification of discrete features like peaks or spots that theoretically correspond to different proteins in the sample. After an overview of the feature extraction approach, specific methods for mass spectrometry (Cromwell) and 2D gel electrophoresis (Pinnacle) are described. The functional modeling approach involves modeling the proteomic data in their entirety as functions or images. A general discussion of the approach is followed by the presentation of a specific method that can be applied, wavelet-based functional mixed models, and its extensions. All methods are illustrated by application to two example proteomic data sets, one from mass spectrometry and one from 2D gel electrophoresis. While the specific methods

  19. Target selection for structural genomics based on combining fold recognition and crystallisation prediction methods: application to the human proteome.

    PubMed

    Bray, James E

    2012-03-01

    The objective of this study is to automatically identify regions of the human proteome that are suitable for 3D structure determination by X-ray crystallography and to annotate them according to their likelihood to produce diffraction quality crystals. The results provide a powerful tool for structural genomics laboratories who wish to select human proteins based on the statistical likelihood of crystallisation success. Combining fold recognition and crystallisation prediction algorithms enables the efficient calculation of the crystallisability of the entire human proteome. This novel study estimates that there are approximately 40,000 crystallisable regions in the human proteome. Currently, only 15% of these regions (approx. 6,000 sequences) have been solved to at least 95% sequence identity. The remaining unsolved regions have been categorised into 5 crystallisation classes and an integral membrane protein (IMP) class, based on established structure prediction, crystallisation prediction and transmembrane (TM) helix prediction algorithms. Approximately 750 unsolved regions (2% of the proteome) have been identified as having a PDB fold representative (template) and an 'optimal' likelihood of crystallisation. At the other end of the spectrum, more than 10,500 non-IMP regions with a PDB template are classified as 'very difficult' to crystallise (26%) and almost 2,500 regions (6%) were predicted to contain at least 3 TM helices. The 3D-SPECS (3D Structural Proteomics Explorer with Crystallisation Scores) website contains crystallisation predictions for the entire human proteome and can be found at http://www.bioinformaticsplus.org/3dspecs. PMID:22354707

  20. metabolicMine: an integrated genomics, genetics and proteomics data warehouse for common metabolic disease research.

    PubMed

    Lyne, Mike; Smith, Richard N; Lyne, Rachel; Aleksic, Jelena; Hu, Fengyuan; Kalderimis, Alex; Stepan, Radek; Micklem, Gos

    2013-01-01

    Common metabolic and endocrine diseases such as diabetes affect millions of people worldwide and have a major health impact, frequently leading to complications and mortality. In a search for better prevention and treatment, there is ongoing research into the underlying molecular and genetic bases of these complex human diseases, as well as into the links with risk factors such as obesity. Although an increasing number of relevant genomic and proteomic data sets have become available, the quantity and diversity of the data make their efficient exploitation challenging. Here, we present metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases. Developed in collaboration with leading UK metabolic disease groups, metabolicMine integrates data sets from a range of experiments and model organisms alongside tools for exploring them. The current version brings together information covering genes, proteins, orthologues, interactions, gene expression, pathways, ontologies, diseases, genome-wide association studies and single nucleotide polymorphisms. Although the emphasis is on human data, key data sets from mouse and rat are included. These are complemented by interoperation with the RatMine rat genomics database, with a corresponding mouse version under development by the Mouse Genome Informatics (MGI) group. The web interface contains a number of features including keyword search, a library of Search Forms, the QueryBuilder and list analysis tools. This provides researchers with many different ways to analyse, view and flexibly export data. Programming interfaces and automatic code generation in several languages are supported, and many of the features of the web interface are available through web services. The combination of diverse data sets integrated with analysis tools and a powerful query system makes metabolicMine a valuable research resource. The web interface makes it accessible to first

  1. metabolicMine: an integrated genomics, genetics and proteomics data warehouse for common metabolic disease research

    PubMed Central

    Lyne, Mike; Smith, Richard N; Lyne, Rachel; Aleksic, Jelena; Hu, Fengyuan; Kalderimis, Alex; Stepan, Radek; Micklem, Gos

    2013-01-01

    Common metabolic and endocrine diseases such as diabetes affect millions of people worldwide and have a major health impact, frequently leading to complications and mortality. In a search for better prevention and treatment, there is ongoing research into the underlying molecular and genetic bases of these complex human diseases, as well as into the links with risk factors such as obesity. Although an increasing number of relevant genomic and proteomic data sets have become available, the quantity and diversity of the data make their efficient exploitation challenging. Here, we present metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases. Developed in collaboration with leading UK metabolic disease groups, metabolicMine integrates data sets from a range of experiments and model organisms alongside tools for exploring them. The current version brings together information covering genes, proteins, orthologues, interactions, gene expression, pathways, ontologies, diseases, genome-wide association studies and single nucleotide polymorphisms. Although the emphasis is on human data, key data sets from mouse and rat are included. These are complemented by interoperation with the RatMine rat genomics database, with a corresponding mouse version under development by the Mouse Genome Informatics (MGI) group. The web interface contains a number of features including keyword search, a library of Search Forms, the QueryBuilder and list analysis tools. This provides researchers with many different ways to analyse, view and flexibly export data. Programming interfaces and automatic code generation in several languages are supported, and many of the features of the web interface are available through web services. The combination of diverse data sets integrated with analysis tools and a powerful query system makes metabolicMine a valuable research resource. The web interface makes it accessible to first

  2. Functional proteomic and interactome analysis of proteins associated with beef tenderness in angus cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Beef is a source of high quality protein for the human population, and beef tenderness has significant influence on beef palatability, consumer expectation and industry profitability. To further elucidate the factors affecting beef tenderness, functional proteomics and bioinformatics interactome ana...

  3. From genomes to function: haloarchaea as model organisms.

    PubMed

    Soppa, Jörg

    2006-03-01

    Haloarchaea are adapted to high-salt environments and accumulate equally high salt concentrations in the cytoplasm. The genomes of representatives of six haloarchaeal genera have been fully or partially sequenced, allowing the analysis of haloarchaeal properties in silico. Transcriptome and proteome analyses have been established for Halobacterium salinarum and Haloferax volcanii. Genetic systems are available including methods that allow the fast in-frame deletion or modification of chromosomal genes. The high-efficiency transformation system of Hf. volcanii allows the isolation of genes essential for a biological process by complementation of loss-of-function mutants. For the analysis of haloarchaeal biology many molecular genetic, biochemical, structural and cell biological methods have been adapted to application at high salt concentrations. Recently it has become clear that several different mechanisms allow the adaptation of proteins to the high salt concentration of the cytoplasm. Taken together, the wealth of techniques available make haloarchaea excellent archaeal model species. PMID:16514139

  4. Genomic, Proteomic, and Metabolite Characterization of Gemfibrozil-Degrading Organism Bacillus sp. GeD10.

    PubMed

    Kjeldal, Henrik; Zhou, Nicolette A; Wissenbach, Dirk K; von Bergen, Martin; Gough, Heidi L; Nielsen, Jeppe L

    2016-01-19

    Gemfibrozil is a widely used hypolipidemic and triglyceride lowering drug. Excess of the drug is excreted and discharged into the environment primarily via wastewater treatment plant effluents. Bacillus sp. GeD10, a gemfibrozil-degrader, was previously isolated from activated sludge. It is the first identified bacterium capable of degrading gemfibrozil. Gemfibrozil degradation by Bacillus sp. GeD10 was here studied through genome sequencing, quantitative proteomics and metabolite analysis. From the bacterial proteome of Bacillus sp. GeD10 1974 proteins were quantified, of which 284 proteins were found to be overabundant by more than 2-fold (FDR corrected p-value ≤0.032, fold change (log2) ≥ 1) in response to gemfibrozil exposure. Metabolomic analysis identified two hydroxylated intermediates as well as a glucuronidated hydroxyl-metabolite of gemfibrozil. Overall, gemfibrozil exposure in Bacillus sp. GeD10 increased the abundance of several enzymes potentially involved in gemfibrozil degradation as well as resulted in the production of several gemfibrozil metabolites. The potential catabolic pathway/modification included ring-hydroxylation preparing the substrate for subsequent ring cleavage by a meta-cleaving enzyme. The identified genes may allow for monitoring of potential gemfibrozil-degrading organisms in situ and increase the understanding of microbial processing of trace level contaminants. This study represents the first omics study on a gemfibrozil-degrading bacterium. PMID:26683816

  5. Microbial proteomics: the quiet revolution

    SciTech Connect

    Seraphin, Bertrand; Hettich, Robert {Bob} L

    2012-01-01

    Technological developments in DNA sequencing and their application to study thousands of microbial genomes or even microbial ecosystems still today often make the headlines of general newspapers and scientific journals. These revolutionary changes are hiding another revolution that is unfolding more quietly in the background: the development of microbial proteomics to study genome expression products. It is important to recognize that while DNA sequencing reveals extensive details about the genomic potential of an organism or community, proteomic measurements reveal the functional gene products that are present and operational under specific environmental conditions, and thus perhaps better characterize the critical biomolecules that execute the life processes (enzymes, signaling, structural factors, etc.).

  6. Proteomics for Validation of Automated Gene Model Predictions

    SciTech Connect

    Zhou, Kemin; Panisko, Ellen A.; Magnuson, Jon K.; Baker, Scott E.; Grigoriev, Igor V.

    2008-02-14

    High-throughput liquid chromatography mass spectrometry (LC-MS)-based proteomic analysis has emerged as a powerful tool for functional annotation of genome sequences. These analyses complement the bioinformatic and experimental tools used for deriving, verifying, and functionally annotating models of genes and their transcripts. Furthermore, proteomics extends verification and functional annotation to the level of the translation product of the gene model.

  7. The Present and Future of Biomarkers in Prostate Cancer: Proteomics, Genomics, and Immunology Advancements

    PubMed Central

    Gaudreau, Pierre-Olivier; Stagg, John; Soulières, Denis; Saad, Fred

    2016-01-01

    Prostate cancer (PC) is the second most common form of cancer in men worldwide. Biomarkers have emerged as essential tools for treatment and assessment since the variability of disease behavior, the cost and diversity of treatments, and the related impairment of quality of life have given rise to a need for a personalized approach. High-throughput technology platforms in proteomics and genomics have accelerated the development of biomarkers. Furthermore, recent successes of several new agents in PC, including immunotherapy, have stimulated the search for predictors of response and resistance and have improved the understanding of the biological mechanisms at work. This review provides an overview of currently established biomarkers in PC, as well as a selection of the most promising biomarkers within these particular fields of development. PMID:27168728

  8. Biomarkers for Bone Tumors: Discovery from Genomics and Proteomics Studies and Their Challenges

    PubMed Central

    Wan-Ibrahim, Wan I; Singh, Vivek A; Hashim, Onn H; Abdul-Rahman, Puteri S

    2015-01-01

    Diagnosis of bone tumor currently relies on imaging and biopsy, and hence, the need to find less invasive ways for its accurate detection. More recently, numerous promising deoxyribonucleic acid (DNA) and protein biomarkers with significant prognostic, diagnostic and/or predictive abilities for various types of bone tumors have been identified from genomics and proteomics studies. This article reviewed the putative biomarkers for the more common types of bone tumors (that is, osteosarcoma, Ewing sarcoma, chondrosarcoma [malignant] and giant cell tumor [benign]) that were unveiled from the studies. The benefits and drawbacks of these biomarkers, as well as the technology platforms involved in the research, were also discussed. Challenges faced in the biomarker discovery studies and the problems in their translation from the bench to the clinical settings were also addressed. PMID:26581086

  9. The Present and Future of Biomarkers in Prostate Cancer: Proteomics, Genomics, and Immunology Advancements.

    PubMed

    Gaudreau, Pierre-Olivier; Stagg, John; Soulières, Denis; Saad, Fred

    2016-01-01

    Prostate cancer (PC) is the second most common form of cancer in men worldwide. Biomarkers have emerged as essential tools for treatment and assessment since the variability of disease behavior, the cost and diversity of treatments, and the related impairment of quality of life have given rise to a need for a personalized approach. High-throughput technology platforms in proteomics and genomics have accelerated the development of biomarkers. Furthermore, recent successes of several new agents in PC, including immunotherapy, have stimulated the search for predictors of response and resistance and have improved the understanding of the biological mechanisms at work. This review provides an overview of currently established biomarkers in PC, as well as a selection of the most promising biomarkers within these particular fields of development. PMID:27168728

  10. Genome-Wide Transcriptome and Proteome Analysis on Different Developmental Stages of Cordyceps militaris

    PubMed Central

    Yin, Yalin; Yu, Guojun; Chen, Yijie; Jiang, Shuai; Wang, Man; Jin, Yanxia; Lan, Xianqing; Liang, Yi; Sun, Hui

    2012-01-01

    Background Cordyceps militaris, an ascomycete caterpillar fungus, has been used as a traditional Chinese medicine for many years owing to its anticancer and immunomodulatory activities. Currently, artificial culturing of this beneficial fungus has been widely used and can meet the market, but systematic molecular studies on the developmental stages of cultured C. militaris at transcriptional and translational levels have not been determined. Methodology/Principal Findings We utilized high-throughput Illumina sequencing to obtain the transcriptomes of C. militaris mycelium and fruiting body. All clean reads were mapped to C. militaris genome and most of the reads showed perfect coverage. Alternative splicing and novel transcripts were predicted to enrich the database. Gene expression analysis revealed that 2,113 genes were up-regulated in mycelium and 599 in fruiting body. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were performed to analyze the genes with expression differences. Moreover, the putative cordycepin metabolism difference between different developmental stages was studied. In addition, the proteome data of mycelium and fruiting body were obtained by one-dimensional gel electrophoresis (1-DGE) coupled with nano-electrospray ionization liquid chromatography tandem mass spectrometry (nESI-LC-MS/MS). 359 and 214 proteins were detected from mycelium and fruiting body respectively. GO, KEGG and Cluster of Orthologous Groups (COG) analysis were further conducted to better understand their difference. We analyzed the amounts of some noteworthy proteins in these two samples including lectin, superoxide dismutase, glycoside hydrolase and proteins involved in cordycepin metabolism, providing important information for further protein studies. Conclusions/Significance The results reveal the difference in gene expression between the mycelium and fruiting body of artificially cultivated C. militaris by transcriptome and proteome

  11. Identification and Validation of Specific Markers of Bacillus anthracis Spores by Proteomics and Genomics Approaches*

    PubMed Central

    Chenau, Jérôme; Fenaille, François; Caro, Valérie; Haustant, Michel; Diancourt, Laure; Klee, Silke R.; Junot, Christophe; Ezan, Eric; Goossens, Pierre L.; Becher, François

    2014-01-01

    Bacillus anthracis is the causative bacteria of anthrax, an acute and often fatal disease in humans. The infectious agent, the spore, represents a real bioterrorism threat and its specific identification is crucial. However, because of the high genomic relatedness within the Bacillus cereus group, it is still a real challenge to identify B. anthracis spores confidently. Mass spectrometry-based tools represent a powerful approach to the efficient discovery and identification of such protein markers. Here we undertook comparative proteomics analyses of Bacillus anthracis, cereus and thuringiensis spores to identify proteoforms unique to B. anthracis. The marker discovery pipeline developed combined peptide- and protein-centric approaches using liquid chromatography coupled to tandem mass spectrometry experiments using a high resolution/high mass accuracy LTQ-Orbitrap instrument. By combining these data with those from complementary bioinformatics approaches, we were able to highlight a dozen novel proteins consistently observed across all the investigated B. anthracis spores while being absent in B. cereus/thuringiensis spores. To further demonstrate the relevance of these markers and their strict specificity to B. anthracis, the number of strains studied was extended to 55, by including closely related strains such as B. thuringiensis 9727, and above all the B. cereus biovar anthracis CI, CA strains that possess pXO1- and pXO2-like plasmids. Under these conditions, the combination of proteomics and genomics approaches confirms the pertinence of 11 markers. Genes encoding these 11 markers are located on the chromosome, which provides additional targets complementary to the commonly used plasmid-encoded markers. Last but not least, we also report the development of a targeted liquid chromatography coupled to tandem mass spectrometry method involving the selection reaction monitoring mode for the monitoring of the 4 most suitable protein markers. Within a proof

  12. Functional Module Search in Protein Networks based on Semantic Similarity Improves the Analysis of Proteomics Data*

    PubMed Central

    Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus

    2014-01-01

    The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Albeit quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1) and demonstrate that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868

  13. Proteomic Insight into the Molecular Function of the Vitreous

    PubMed Central

    Skeie, Jessica M.; Roybal, C. Nathaniel; Mahajan, Vinit B.

    2015-01-01

    The human vitreous contains primarily water, but also contains proteins which have yet to be fully characterized. To gain insight into the four vitreous substructures and their potential functions, we isolated and analyzed the vitreous protein profiles of three non-diseased human eyes. The four analyzed substructures were the anterior hyaloid, the vitreous cortex, the vitreous core, and the vitreous base. Proteins were separated by multidimensional liquid chromatography and identified by tandem mass spectrometry. Bioinformatics tools then extracted the expression profiles, signaling pathways, and interactomes unique to each tissue. From each substructure, a mean of 2,062 unique proteins were identified, with many being differentially expressed in a specific substructure: 278 proteins were unique to the anterior hyaloid, 322 to the vitreous cortex, 128 to the vitreous base, and 136 to the vitreous core. When the identified proteins were organized according to relevant functional pathways and networks, key patterns appeared. The blood coagulation pathway and extracellular matrix turnover networks were highly represented. Oxidative stress regulation and energy metabolism proteins were distributed throughout the vitreous. Immune functions were represented by high levels of immunoglobulin, the complement pathway, damage-associated molecular patterns (DAMPs), and evolutionarily conserved antimicrobial proteins. The majority of vitreous proteins detected were intracellular proteins, some of which originate from the retina, including rhodopsin (RHO), phosphodiesterase 6 (PDE6), and glial fibrillary acidic protein (GFAP). This comprehensive analysis uncovers a picture of the vitreous as a biologically active tissue, where proteins localize to distinct substructures to protect the intraocular tissues from infection, oxidative stress, and energy disequilibrium. It also reveals the retina as a potential source of inflammatory mediators. The vitreous proteome catalogues the

  14. Aging synaptic mitochondria exhibit dynamic proteomic changes while maintaining bioenergetic function

    PubMed Central

    Stauch, Kelly L.; Purnell, Phillip R.; Fox, Howard S.

    2014-01-01

    Aging correlates with a progressive impairment of mitochondrial homeostasis and is an influential factor for several forms of neurodegeneration. However, the mechanisms underlying age-related alterations in synaptosomal mitochondria, a neuronal mitochondria population highly susceptible to insults and critical for brain function, remain incompletely understood. Therefore this study investigates the synaptic mitochondrial proteomic and bioenergetic alterations that occur with age. The utilization of a state of the art quantitative proteomics approach allowed for the comparison of protein expression levels in synaptic mitochondria isolated from 5 (mature), 12 (old), and 24 (aged) month old mice. During the process of aging we find that dynamic proteomic alterations occur in synaptic mitochondria. Despite direct (mitochondrial DNA deletions) and indirect (increased antioxidant protein levels) signs of mitochondrial damage in the aged mice, there was an overall maintenance of mitochondrial function. Therefore the synaptic mitochondrial proteomic changes that occur with aging correlate with preservation of synaptic mitochondrial function. PMID:24827396

  15. Functional environmental proteomics: elucidating the role of a c-type cytochrome abundant during uranium bioremediation.

    PubMed

    Yun, Jiae; Malvankar, Nikhil S; Ueki, Toshiyuki; Lovley, Derek R

    2016-02-01

    Studies with pure cultures of dissimilatory metal-reducing microorganisms have demonstrated that outer-surface c-type cytochromes are important electron transfer agents for the reduction of metals, but previous environmental proteomic studies have typically not recovered cytochrome sequences from subsurface environments in which metal reduction is important. Gel-separation, heme-staining and mass spectrometry of proteins in groundwater from in situ uranium bioremediation experiments identified a putative c-type cytochrome, designated Geobacter subsurface c-type cytochrome A (GscA), encoded within the genome of strain M18, a Geobacter isolate previously recovered from the site. Homologs of GscA were identified in the genomes of other Geobacter isolates in the phylogenetic cluster known as subsurface clade 1, which predominates in a diversity of Fe(III)-reducing subsurface environments. Most of the gscA sequences recovered from groundwater genomic DNA clustered in a tight phylogenetic group closely related to strain M18. GscA was most abundant in groundwater samples in which Geobacter sp. predominated. Expression of gscA in a strain of Geobacter sulfurreducens that lacked the gene for the c-type cytochrome OmcS, thought to facilitate electron transfer from conductive pili to Fe(III) oxide, restored the capacity for Fe(III) oxide reduction. Atomic force microscopy provided evidence that GscA was associated with the pili. These results demonstrate that a c-type cytochrome with an apparent function similar to that of OmcS is abundant when Geobacter sp. are abundant in the subsurface, providing insight into the mechanisms for the growth of subsurface Geobacter sp. on Fe(III) oxide and suggesting an approach for functional analysis of other Geobacter proteins found in the subsurface. PMID:26140532

  16. Functional analysis of natural microbial consortia using community proteomics

    SciTech Connect

    Verberkmoes, Nathan C; Denef, Vincent; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    It is widely accepted that microbial communities, not individual microorganisms, are the relevant ecological units, yet what we know about metabolic functioning of microbial communities could be written on a postage stamp. The recent developments in comprehensive molecular methods promise to gain a better understanding of how organisms within communities interact and how communities and their populations evolve. Here we highlight recent advances and insights gathered by the application of proteogenomics to microbial communities. We explore the history of how unrelated fields of microbial ecology, genomics, biological mass spectrometry and informatics converge to the development of a new field of metaproteomics.

  17. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    PubMed Central

    Lee, Joungmin; Jang, Yu-Sin; Han, Mee-Jung; Kim, Jin Young

    2016-01-01

    ABSTRACT Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kbp plasmid. The results of genomic analyses suggested that C. tyrobutyricum produces butyrate from butyryl-coenzyme A (butyryl-CoA) through acetate reassimilation by CoA transferase, differently from Clostridium acetobutylicum, which uses the phosphotransbutyrylase-butyrate kinase pathway; this was validated by reverse transcription-PCR (RT-PCR) of related genes, protein expression levels, in vitro CoA transferase assay, and fed-batch fermentation. In addition, the changes in protein expression levels during the course of batch fermentations on glucose were examined by shotgun proteomics. Unlike C. acetobutylicum, the expression levels of proteins involved in glycolytic and fermentative pathways in C. tyrobutyricum did not decrease even at the stationary phase. Proteins related to energy conservation mechanisms, including Rnf complex, NfnAB, and pyruvate-phosphate dikinase that are absent in C. acetobutylicum, were identified. Such features explain why this organism can produce butyric acid to a much higher titer and better tolerate toxic metabolites. This study presenting the complete genome sequence, global protein expression profiles, and genome-based metabolic characteristics during the batch fermentation of C. tyrobutyricum will be valuable in designing strategies for metabolic engineering of this strain. PMID:27302759

  18. Genomic, Proteomic, and Biochemical Analysis of the Organohalide Respiratory Pathway in Desulfitobacterium dehalogenans

    PubMed Central

    van de Pas, Bram A.; Atteia, Ariane; Krab, Klaas; Hagen, Wilfred R.; Goodwin, Lynne; Chain, Patrick; Boeren, Sjef; Maphosa, Farai; Schraa, Gosse; de Vos, Willem M.; van der Oost, John; Smidt, Hauke

    2014-01-01

    Desulfitobacterium dehalogenans is able to grow by organohalide respiration using 3-chloro-4-hydroxyphenyl acetate (Cl-OHPA) as an electron acceptor. We used a combination of genome sequencing, biochemical analysis of redox active components, and shotgun proteomics to study elements of the organohalide respiratory electron transport chain. The genome of Desulfitobacterium dehalogenans JW/IU-DC1T consists of a single circular chromosome of 4,321,753 bp with a GC content of 44.97%. The genome contains 4,252 genes, including six rRNA operons and six predicted reductive dehalogenases. One of the reductive dehalogenases, CprA, is encoded by a well-characterized cprTKZEBACD gene cluster. Redox active components were identified in concentrated suspensions of cells grown on formate and Cl-OHPA or formate and fumarate, using electron paramagnetic resonance (EPR), visible spectroscopy, and high-performance liquid chromatography (HPLC) analysis of membrane extracts. In cell suspensions, these components were reduced upon addition of formate and oxidized after addition of Cl-OHPA, indicating involvement in organohalide respiration. Genome analysis revealed genes that likely encode the identified components of the electron transport chain from formate to fumarate or Cl-OHPA. Data presented here suggest that the first part of the electron transport chain from formate to fumarate or Cl-OHPA is shared. Electrons are channeled from an outward-facing formate dehydrogenase via menaquinones to a fumarate reductase located at the cytoplasmic face of the membrane. When Cl-OHPA is the terminal electron acceptor, electrons are transferred from menaquinones to outward-facing CprA, via an as-yet-unidentified membrane complex, and potentially an extracellular flavoprotein acting as an electron shuttle between the quinol dehydrogenase membrane complex and CprA. PMID:25512312

  19. Discovery and annotation of small proteins using genomics, proteomics and computational approaches

    SciTech Connect

    Yang, Xiaohan; Tschaplinski, Timothy J.; Hurst, Gregory B.; Jawdy, Sara; Abraham, Paul E.; Lankford, Patricia K.; Adams, Rachel M.; Shah, Manesh B.; Hettich, Robert L.; Lindquist, Erika; Kalluri, Udaya C.; Gunter, Lee E.; Pennacchio, Christa; Tuskan, Gerald A.

    2011-03-02

    Small proteins (10 200 amino acids aa in length) encoded by short open reading frames (sORF) play important regulatory roles in various biological processes, including tumor progression, stress response, flowering, and hormone signaling. However, ab initio discovery of small proteins has been relatively overlooked. Recent advances in deep transcriptome sequencing make it possible to efficiently identify sORFs at the genome level. In this study, we obtained 2.6 million expressed sequence tag (EST) reads from Populus deltoides leaf transcriptome and reconstructed full-length transcripts from the EST sequences. We identified an initial set of 12,852 sORFs encoding proteins of 10 200 aa in length. Three computational approaches were then used to enrich for bona fide protein-coding sORFs from the initial sORF set: (1) codingpotential prediction, (2) evolutionary conservation between P. deltoides and other plant species, and (3) gene family clustering within P. deltoides. As a result, a high-confidence sORF candidate set containing 1469 genes was obtained. Analysis of the protein domains, non-protein-coding RNA motifs, sequence length distribution, and protein mass spectrometry data supported this high-confidence sORF set. In the high-confidence sORF candidate set, known protein domains were identified in 1282 genes (higher-confidence sORF candidate set), out of which 611 genes, designated as highest-confidence candidate sORF set, were supported by proteomics data. Of the 611 highest-confidence candidate sORF genes, 56 were new to the current Populus genome annotation. This study not only demonstrates that there are potential sORF candidates to be annotated in sequenced genomes, but also presents an efficient strategy for discovery of sORFs in species with no genome annotation yet available.

  20. Proteome analysis of functionally differentiated bovine (Bos indicus) mammary epithelial cells isolated from milk.

    PubMed

    Janjanam, Jagadeesh; Jamwal, Manu; Singh, Surender; Kumar, Saravanan; Panigrahi, Aswini K; Hariprasad, Gururao; Jena, Manoj K; Anand, Vijay; Kumar, Sudarshan; Kaushik, Jai K; Dang, Ajay K; Mukesh, Manishi; Mishra, Bishnu P; Srinivasan, Alagiri; Reddy, Vanga S; Mohanty, Ashok K

    2013-11-01

    Mammary gland is made up of a branching network of ducts that end in alveoli. Terminally differentiated mammary epithelial cells (MECs) constitute the innermost layer of aveoli. They are milk-secreting cuboidal cells that secrete milk proteins during lactation. Little is known about the expression profile of proteins in the metabolically active MECs during lactation or their functional role in the lactation process. In the present investigation, we have reported the proteome map of MECs in lactating cows using 2DE MALDI-TOF/TOF MS and 1D-Gel-LC-MS/MS. MECs were isolated from milk using immunomagnetic beads and confirmed by RT-PCR and Western blotting. The 1D-Gel-LC-MS/MS and 2DE-MS/MS based approaches led to identification of 431 and 134 proteins, respectively, with a total of 497 unique proteins. Proteins identified in this study were clustered into functional groups using bioinformatics tools. Pathway analysis of the identified proteins revealed 28 pathways (p < 0.05) providing evidence for involvement of various proteins in lactation function. This study further provides experimental evidence for the presence of many proteins that have been predicted in annotated bovine genome. The data generated further provide a set of bovine MEC-specific proteins that will help the researchers to understand the molecular events taking place during lactation. PMID:24030930

  1. Proteome-wide dataset supporting functional study of tyrosine kinases in breast cancer.

    PubMed

    Angelopoulos, Nicos; Stebbing, Justin; Xu, Yichen; Giamas, Georgios; Zhang, Hua

    2016-06-01

    Tyrosine kinases (TKs) play an essential role in regulating various cellular activities and dysregulation of TK signaling contributes to oncogenesis. However, less than half of the TKs have been thoroughly studied. Through a combined use of RNAi and stable isotope labeling with amino acids in cell culture (SILAC)-based quantitative proteomics, a global functional proteomic landscape of TKs in breast cancer was recently revealed highlighting a comprehensive and highly integrated signaling network regulated by TKs (Stebbing et al., 2015) [1]. We collate the enormous amount of the proteomic data in an open access platform, providing a valuable resource for studying the function of TKs in cancer and benefiting the science community. Here we present a detailed description related to this study (Stebbing et al., 2015) [1] and the raw data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the identifier PXD002065. PMID:27054188

  2. Proteome-wide dataset supporting functional study of tyrosine kinases in breast cancer

    PubMed Central

    Angelopoulos, Nicos; Stebbing, Justin; Xu, Yichen; Giamas, Georgios; Zhang, Hua

    2016-01-01

    Tyrosine kinases (TKs) play an essential role in regulating various cellular activities and dysregulation of TK signaling contributes to oncogenesis. However, less than half of the TKs have been thoroughly studied. Through a combined use of RNAi and stable isotope labeling with amino acids in cell culture (SILAC)-based quantitative proteomics, a global functional proteomic landscape of TKs in breast cancer was recently revealed highlighting a comprehensive and highly integrated signaling network regulated by TKs (Stebbing et al., 2015) [1]. We collate the enormous amount of the proteomic data in an open access platform, providing a valuable resource for studying the function of TKs in cancer and benefiting the science community. Here we present a detailed description related to this study (Stebbing et al., 2015) [1] and the raw data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the identifier PXD002065. PMID:27054188

  3. AgBase: a functional genomics resource for agriculture

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Many agricultural species and their pathogens have sequenced genomes and more are in progress. However, systems biology from functional genomics data is hindered in agricultural species because agricultural genome sequences have relatively poor structural and functional annotation and agricultural ...

  4. Comparative Genome and Proteome Analysis of Anopheles gambiae and Drosophila melanogaster

    NASA Astrophysics Data System (ADS)

    Zdobnov, Evgeny M.; von Mering, Christian; Letunic, Ivica; Torrents, David; Suyama, Mikita; Copley, Richard R.; Christophides, George K.; Thomasova, Dana; Holt, Robert A.; Subramanian, G. Mani; Mueller, Hans-Michael; Dimopoulos, George; Law, John H.; Wells, Michael A.; Birney, Ewan; Charlab, Rosane; Halpern, Aaron L.; Kokoza, Elena; Kraft, Cheryl L.; Lai, Zhongwu; Lewis, Suzanna; Louis, Christos; Barillas-Mury, Carolina; Nusskern, Deborah; Rubin, Gerald M.; Salzberg, Steven L.; Sutton, Granger G.; Topalis, Pantelis; Wides, Ron; Wincker, Patrick; Yandell, Mark; Collins, Frank H.; Ribeiro, Jose; Gelbart, William M.; Kafatos, Fotis C.; Bork, Peer

    2002-10-01

    Comparison of the genomes and proteomes of the two diptera Anopheles gambiae and Drosophila melanogaster, which diverged about 250 million years ago, reveals considerable similarities. However, numerous differences are also observed; some of these must reflect the selection and subsequent adaptation associated with different ecologies and life strategies. Almost half of the genes in both genomes are interpreted as orthologs and show an average sequence identity of about 56%, which is slightly lower than that observed between the orthologs of the pufferfish and human (diverged about 450 million years ago). This indicates that these two insects diverged considerably faster than vertebrates. Aligned sequences reveal that orthologous genes have retained only half of their intron/exon structure, indicating that intron gains or losses have occurred at a rate of about one per gene per 125 million years. Chromosomal arms exhibit significant remnants of homology between the two species, although only 34% of the genes colocalize in small ``microsyntenic'' clusters, and major interarm transfers as well as intra-arm shuffling of gene order are detected.

  5. Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster.

    PubMed

    Zdobnov, Evgeny M; von Mering, Christian; Letunic, Ivica; Torrents, David; Suyama, Mikita; Copley, Richard R; Christophides, George K; Thomasova, Dana; Holt, Robert A; Subramanian, G Mani; Mueller, Hans-Michael; Dimopoulos, George; Law, John H; Wells, Michael A; Birney, Ewan; Charlab, Rosane; Halpern, Aaron L; Kokoza, Elena; Kraft, Cheryl L; Lai, Zhongwu; Lewis, Suzanna; Louis, Christos; Barillas-Mury, Carolina; Nusskern, Deborah; Rubin, Gerald M; Salzberg, Steven L; Sutton, Granger G; Topalis, Pantelis; Wides, Ron; Wincker, Patrick; Yandell, Mark; Collins, Frank H; Ribeiro, Jose; Gelbart, William M; Kafatos, Fotis C; Bork, Peer

    2002-10-01

    Comparison of the genomes and proteomes of the two diptera Anopheles gambiae and Drosophila melanogaster, which diverged about 250 million years ago, reveals considerable similarities. However, numerous differences are also observed; some of these must reflect the selection and subsequent adaptation associated with different ecologies and life strategies. Almost half of the genes in both genomes are interpreted as orthologs and show an average sequence identity of about 56%, which is slightly lower than that observed between the orthologs of the pufferfish and human (diverged about 450 million years ago). This indicates that these two insects diverged considerably faster than vertebrates. Aligned sequences reveal that orthologous genes have retained only half of their intron/exon structure, indicating that intron gains or losses have occurred at a rate of about one per gene per 125 million years. Chromosomal arms exhibit significant remnants of homology between the two species, although only 34% of the genes colocalize in small "microsyntenic" clusters, and major interarm transfers as well as intra-arm shuffling of gene order are detected. PMID:12364792

  6. Tomato functional genomics database (TFGD): a comprehensive collection and analysis package for tomato functional genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...

  7. Ossifying Fibromyxoid Tumor of Soft Parts: A Clinicopathologic, Proteomic and Genomic Study

    PubMed Central

    Graham, Rondell P.D.; Dry, Sarah; Li, Xinmin; Binder, Scott; Bahrami, Armita; Raimondi, Susana C.; Dogan, Ahmet; Chakraborty, Subhankar; Souchek, Joshua J.; Folpe, Andrew L.

    2011-01-01

    Ossifying fibromyxoid tumor of soft parts (OFMT) is a rare soft tissue and bone tumor of borderline malignancy displaying an uncertain line of differentiation. The existence of fully malignant OFMT is controversial. In order to better understand the natural history and line of differentiation taken by OFMT, we studied 46 cases by light microscopic, immunohistochemical (IHC), genomic, proteomic, and fluorescent in situ hybridization (FISH) methods. Cases were classified according to the 2003 Folpe and Weiss system. Clinical and follow-up information was obtained. IHC for S-100 protein, desmin, epithelial membrane antigen (EMA), cytokeratins, smooth muscle actin (SMA), INI-1, neurofilament protein (NFP), CD56d excitatory amino acid transporter-4 (EAAT4), and MUC4 was performed on formalin-fixed, paraffin-embedded (FFPE) tissues. Gene expression profiling and proteomic studies were performed on FFPE tissues from 13 and 5 cases, respectively. FISH for INI-1 was performed on 10 cases. The 46 tumors arose in 29 males and 17 females (median age 52 years, range 39-63 years) and involved the proximal (N=17) and distal extremities (N=13), head and neck (N=9) and trunk (N=5). Median tumor size was 5.4 cm (range 1.0-21.0 cm). Cases were classified as typical OFMT (26 of 46, 57%), atypical OFMT (5 of 46, 11%) and malignant OFMT (15 of 46 cases, 32%). Clinical follow-up (27 cases, median 55 months duration) showed all patients with typical and atypical OFMT to be alive without disease. Adverse events, including 3 local recurrences, 3 metastases, and 3 deaths, were seen only in malignant OFMT. IHC results were: S100 protein (30/41, 73%), desmin (15/39, 38%), cytokeratin (4/35 11%), EMA (5/32, 16%), SMA (2/34, 6%), INI-1 (lost in mosaic pattern in 14/19, 74%), EAAT4 (31/39, 80%), MUC4 (3/14, 21%), NFP (8/10, 80%) and CD56 (6/14, 43%). Gene expression profiling showed typical and malignant OFMT to cluster together, distinct from schwannian tumors. Proteomic study showed expression

  8. Assessing Susceptibility to Age-related Macular Degeneration with Proteomic and Genomic Biomarkers*

    PubMed Central

    Gu, Jiayin; Pauer, Gayle J. T.; Yue, Xiuzhen; Narendra, Umadevi; Sturgill, Gwen M.; Bena, James; Gu, Xiaorong; Peachey, Neal S.; Salomon, Robert G.; Hagstrom, Stephanie A.; Crabb, John W.

    2009-01-01

    Age-related macular degeneration (AMD) is a progressive disease and major cause of severe visual loss. Toward the discovery of tools for early identification of AMD susceptibility, we evaluated the combined predictive capability of proteomic and genomic AMD biomarkers. We quantified plasma carboxyethylpyrrole (CEP) oxidative protein modifications and CEP autoantibodies by ELISA in 916 AMD and 488 control donors. CEP adducts are uniquely generated from oxidation of docosahexaenoate-containing lipids that are abundant in the retina. Mean CEP adduct and autoantibody levels were found to be elevated in AMD plasma by ∼60 and ∼30%, respectively. The odds ratio for both CEP markers elevated was 3-fold greater or more in AMD than in control patients. Genotyping was performed for AMD risk polymorphisms associated with age-related maculopathy susceptibility 2 (ARMS2), high temperature requirement factor A1 (HTRA1), complement factor H, and complement C3, and the risk of AMD was predicted based on genotype alone or in combination with the CEP markers. The AMD risk predicted for those exhibiting elevated CEP markers and risk genotypes was 2–3-fold greater than the risk based on genotype alone. AMD donors carrying the ARMS2 and HTRA1 risk alleles were the most likely to exhibit elevated CEP markers. The results compellingly demonstrate higher mean CEP marker levels in AMD plasma over a broad age range. Receiver operating characteristic curves suggest that CEP markers alone can discriminate between AMD and control plasma donors with ∼76% accuracy and in combination with genomic markers provide up to ∼80% discrimination accuracy. Plasma CEP marker levels were altered slightly by several demographic and health factors that warrant further study. We conclude that CEP plasma biomarkers, particularly in combination with genomic markers, offer a potential early warning system for assessing susceptibility to this blinding, multifactorial disease. PMID:19202148

  9. Discovery and annotation of small proteins using genomics, proteomics, and computational approaches

    SciTech Connect

    Yang, Xiaohan; Tschaplinski, Timothy J; Hurst, Gregory {Greg} B; Jawdy, Sara; Abraham, Paul E; Lankford, Patricia K; Adams, Rachel M; Shah, Manesh B; Hettich, Robert {Bob} L; Kalluri, Udaya C; Gunter, Lee E; Pennacchio, Christa; Tuskan, Gerald A

    2011-01-01

    Small proteins (10 200 amino acids (AA) in length) encoded by short open reading frames (sORF) play important regulatory roles in various biological processes, including tumor progression, stress response, flowering and hormone signaling. However, ab initio discovery of small proteins has been relatively overlooked. Recent advances in deep transcriptome sequencing make it possible to efficiently identify sORFs at the genome level. In this study, we obtained ~2.6 million expressed sequence tag (EST) reads from Populus deltoides leaf transcriptome and reconstructed full-length transcripts from the EST sequences. We identified an initial set of 12,852 sORFs encoding proteins of 10 200 AA in length. Three computational approaches were then used to enrich for bona fide protein-coding sORFs from the initial sORF set: 1) coding-potential prediction, 2) evolutionary conservation between P. deltoides and other plant species, and 3) gene family clustering within P. deltoides. As a result, a high-confidence sORF candidate set containing 1,469 genes was obtained. Analysis of the protein domains, non-protein-coding RNA motifs, sequence length distribution, and protein mass spectrometry data supported this high-confidence sORF set. In the high-confidence sORF candidate set, known protein domains were identified in 1,282 genes (higher-confidence sORF candidate set), out of which 611 genes, designated as highest-confidence candidate sORF set, were also supported by proteomics data. This study not only demonstrates that there are potential sORF candidates to be annotated in sequenced genomes, but also presents an efficient strategy for discovery of sORFs in species with no genome annotation yet available.

  10. Proteomics of ovarian cancer: functional insights and clinical applications

    PubMed Central

    Rodland, Karin D.

    2016-01-01

    In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicability of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification of aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. We propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs. PMID:25736266

  11. Proteomics of ovarian cancer: functional insights and clinical applications

    DOE PAGESBeta

    Elzek, Mohamed A.; Rodland, Karin D.

    2015-03-04

    In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicability of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification ofmore » aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. In conclusion, we propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs.« less

  12. Proteomics of ovarian cancer: functional insights and clinical applications

    SciTech Connect

    Elzek, Mohamed A.; Rodland, Karin D.

    2015-03-04

    In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicability of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification of aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. In conclusion, we propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs.

  13. Proteomics of ovarian cancer: functional insights and clinical applications

    SciTech Connect

    Elzek, Mohamed A.; Rodland, Karin D.

    2015-03-01

    In the past decade, there has been an increasing interest in applying proteomics to assist in understanding the pathogenesis of ovarian cancer, elucidating the mechanism of drug resistance, and in the development of biomarkers for early detection of ovarian cancer. Although ovarian cancer is a spectrum of different diseases, the strategies for diagnosis and treatment with surgery and adjuvant therapy are similar across ovarian cancer types, increasing the general applicabil- ity of discoveries made through proteomics research. While proteomic experiments face many difficulties which slow the pace of clinical applications, recent advances in proteomic technology contribute significantly to the identification of aberrant proteins and networks which can serve as targets for biomarker development and individualized therapies. This review provides a summary of the literature on proteomics’ contributions to ovarian cancer research and highlights the current issues, future directions, and challenges. We propose that protein-level characterization of primary lesion in ovarian cancer can decipher the mystery of this disease, improve diagnostic tools, and lead to more effective screening programs.

  14. From genome to function: the Arabidopsis aquaporins

    PubMed Central

    Quigley, Francoise; Rosenberg, Joshua M; Shachar-Hill, Yair; Bohnert, Hans J

    2002-01-01

    Background In the post-genomic era newly sequenced genomes can be used to deduce organismal functions from our knowledge of other systems. Here we apply this approach to analyzing the aquaporin gene family in Arabidopsis thaliana. The aquaporins are intrinsic membrane proteins that have been characterized as facilitators of water flux. Originally termed major intrinsic proteins (MIPs), they are now also known as water channels, glycerol facilitators and aqua-glyceroporins, yet recent data suggest that they facilitate the movement of other low-molecular-weight metabolites as well. Results The Arabidopsis genome contains 38 sequences with homology to aquaporin in four subfamilies, termed PIP, TIP, NIP and SIP. We have analyzed aquaporin family structure and expression using the A. thaliana genome sequence, and introduce a new NMR approach for the purpose of analyzing water movement in plant roots in vivo. Conclusions Our preliminary data indicate a strongly transcellular component for the flux of water in roots. PMID:11806824

  15. Proteomic analysis uncovers a metabolic phenotype in C. elegans after nhr-40 reduction of function

    SciTech Connect

    Pohludka, Michal; Simeckova, Katerina; Vohanka, Jaroslav; Yilma, Petr; Novak, Petr; Krause, Michael W.; Kostrouchova, Marta; Kostrouch, Zdenek

    2008-09-12

    Caenorhabditis elegans has an unexpectedly large number (284) of genes encoding nuclear hormone receptors, most of which are nematode-specific and are of unknown function. We have exploited comparative two-dimensional chromatography of synchronized cultures of wild type C. elegans larvae and a mutant in nhr-40 to determine if proteomic approaches will provide additional insight into gene function. Chromatofocusing, followed by reversed-phase chromatography and mass spectrometry, identified altered chromatographic patterns for a set of proteins, many of which function in muscle and metabolism. Prompted by the proteomic analysis, we find that the penetrance of the developmental phenotypes in the mutant is enhanced at low temperatures and by food restriction. The combination of our phenotypic and proteomic analysis strongly suggests that NHR-40 provides a link between metabolism and muscle development. Our results highlight the utility of comparative two-dimensional chromatography to provide a relatively rapid method to gain insight into gene function.

  16. The Karyote physico-chemical genomic, proteomic, metabolic cell modeling system.

    PubMed

    Ortoleva, P; Berry, E; Brun, Y; Fan, J; Fontus, M; Hubbard, K; Jaqaman, K; Jarymowycz, L; Navid, A; Sayyed-Ahmad, A; Shreif, Z; Stanley, F; Tuncay, K; Weitzke, E; Wu, L-C

    2003-01-01

    Modeling approaches to the dynamics of a living cell are presented that are strongly based on its underlying physical and chemical processes and its hierarchical spatio-temporal organization. Through the inclusion of a broad spectrum of processes and a rigorous analysis of the multiple scale nature of cellular dynamics, we are attempting to advance cell modeling and its applications. The presentation focuses on our cell modeling system, which integrates data archiving and quantitative physico-chemical modeling and information theory to provide a seamless approach to the modeling/data analysis endeavor. Thereby the rapidly growing mess of genomic, proteomic, metabolic, and cell physiological data can be automatically used to develop and calibrate a predictive cell model. The discussion focuses on the Karyote cell modeling system and an introduction to the CellX and VirusX models. The Karyote software system integrates three elements: (1) a model-building and data archiving module that allows one to define a cell type to be modeled through its reaction network, structure, and transport processes as well as to choose the surrounding medium and other parameters of the phenomenon to be modeled; (2) a genomic, proteomic, metabolic cell simulator that solves the equations of metabolic reaction, transcription/translation polymerization and the exchange of molecules between parts of the cell and with the surrounding medium; and (3) an information theory module (ITM) that automates model calibration and development, and integrates a variety of data types with the cell dynamic computations. In Karyote, reactions may be fast (equilibrated) or slow (finite rate), and the special effects of enzymes and other minority species yielding steady-state cycles of arbitrary complexities are accounted for. These features of the dynamics are handled via rigorous multiple scale analysis. A user interface allows for an automated generation and solution of the equations of multiple timescale

  17. Establishing Research Strategies, Methodologies and Technologies to Link Genomics and Proteomics to Seagrass Productivity, Community Metabolism, and Ecosystem Carbon Fluxes

    PubMed Central

    Mazzuca, Silvia; Björk, M.; Beer, S.; Felisberto, P.; Gobert, S.; Procaccini, G.; Runcie, J.; Silva, J.; Borges, A. V.; Brunet, C.; Buapet, P.; Champenois, W.; Costa, M. M.; D’Esposito, D.; Gullström, M.; Lejeune, P.; Lepoint, G.; Olivé, I.; Rasmusson, L. M.; Richir, J.; Ruocco, M.; Serra, I. A.; Spadafora, A.; Santos, Rui

    2013-01-01

    A complete understanding of the mechanistic basis of marine ecosystem functioning is only possible through integrative and interdisciplinary research. This enables the prediction of change and possibly the mitigation of the consequences of anthropogenic impacts. One major aim of the European Cooperation in Science and Technology (COST) Action ES0609 “Seagrasses productivity. From genes to ecosystem management,” is the calibration and synthesis of various methods and the development of innovative techniques and protocols for studying seagrass ecosystems. During 10 days, 20 researchers representing a range of disciplines (molecular biology, physiology, botany, ecology, oceanography, and underwater acoustics) gathered at The Station de Recherches Sous-marines et Océanographiques (STARESO, Corsica) to study together the nearby Posidonia oceanica meadow. STARESO is located in an oligotrophic area classified as “pristine site” where environmental disturbances caused by anthropogenic pressure are exceptionally low. The healthy P. oceanica meadow, which grows in front of the research station, colonizes the sea bottom from the surface to 37 m depth. During the study, genomic and proteomic approaches were integrated with ecophysiological and physical approaches with the aim of understanding changes in seagrass productivity and metabolism at different depths and along daily cycles. In this paper we report details on the approaches utilized and we forecast the potential of the data that will come from this synergistic approach not only for P. oceanica but for seagrasses in general. PMID:23515425

  18. Functional genomics of the chicken - a model organism

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The chicken has reached model organism status after genome sequencing and development of high-throughput tools for the exploration of functional elements of the genome. Functional genomics focuses on understanding the function and regulation of genes and gene products on a global or genome-wide scal...

  19. The Use of Functional Genomics in Conjunction with Metabolomics for Mycobacterium tuberculosis Research

    PubMed Central

    Swanepoel, Conrad C.

    2014-01-01

    Tuberculosis (TB), caused by Mycobacterium tuberculosis, is a fatal infectious disease, resulting in 1.4 million deaths globally per annum. Over the past three decades, genomic studies have been conducted in an attempt to elucidate the functionality of the genome of the pathogen. However, many aspects of this complex genome remain largely unexplored, as approaches like genomics, proteomics, and transcriptomics have failed to characterize them successfully. In turn, metabolomics, which is relatively new to the “omics” revolution, has shown great potential for investigating biological systems or their modifications. Furthermore, when these data are interpreted in combination with previously acquired genomics, proteomics and transcriptomics data, using what is termed a systems biology approach, a more holistic understanding of these systems can be achieved. In this review we discuss how metabolomics has contributed so far to characterizing TB, with emphasis on the resulting improved elucidation of M. tuberculosis in terms of (1) metabolism, (2) growth and replication, (3) pathogenicity, and (4) drug resistance, from the perspective of systems biology. PMID:24771957

  20. An evolutionary classification of genomic function.

    PubMed

    Graur, Dan; Zheng, Yichen; Azevedo, Ricardo B R

    2015-03-01

    The pronouncements of the ENCODE Project Consortium regarding "junk DNA" exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into "functional DNA," that is, DNA sequences that have a selected-effect function, and "rubbish DNA," that is, sequences that do not. Functional DNA is further subdivided into "literal DNA" and "indifferent DNA." In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into "junk DNA" and "garbage DNA." Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA. PMID:25635041

  1. Functional proteomic and structural insights into molecular recognition in the nitrilase family enzymes

    PubMed Central

    Barglow, Katherine T.; Saikatendu, Kumar S.; Bracey, Michael H.; Huey, Ruth; Morris, Garrett M.; Olson, Arthur J.; Stevens, Raymond C.; Cravatt, Benjamin F.

    2009-01-01

    Nitrilases are a large and diverse family of non-peptidic C-N hydrolases. The mammalian genome encodes eight nitrilase enzymes, several of which remain poorly characterized. Prominent among these are nitrilase-1 (Nit1) and nitrilase-2 (Nit2), which, despite having been shown to exert effects on cell growth and possibly serving as tumor suppressor genes, are without known substrates or selective inhibitors. In previous studies, we identified several nitrilases, including Nit1 and Nit2, as targets for dipeptide-chloroacetamide activity-based proteomics probes. Here, we have used these probes, in combination with high-resolution crystallography and molecular modeling, to systematically map the active site of Nit2 and identify residues involved in molecular recognition. We report the 1.4 Å crystal structure of mouse Nit2, and use this structure to identify residues that discriminate probe-labeling between the Nit1 and Nit2 enzymes. Interestingly, some of these residues are conserved across all vertebrate Nit2 enzymes and, conversely, not found in any vertebrate Nit1 enzymes, suggesting that they are key discriminators of molecular recognition between these otherwise highly homologous enzymes. Our findings thus point to a limited set of active site residues that establish distinct patterns of molecular recognition among nitrilases and provide chemical probes to selectively perturb the function of these enzymes in biological systems. PMID:19053248

  2. Rhodopseudomonas palustris CGA010 Proteome Implicates Extracytoplasmic Function Sigma Factor in Stress Response

    SciTech Connect

    Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.; Perry, Leslie M.; Pan, Chongle; Lankford, Patricia K.; Pelletier, Dale A.

    2015-04-08

    Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σRPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensus sequence GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σRPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.

  3. Rhodopseudomonas palustris CGA010 Proteome Implicates Extracytoplasmic Function Sigma Factor in Stress Response

    DOE PAGESBeta

    Allen, Michael S.; Hurst, Gregory B.; Lu, Tse-Yuan S.; Perry, Leslie M.; Pan, Chongle; Lankford, Patricia K.; Pelletier, Dale A.

    2015-04-08

    Rhodopseudomonas palustris encodes 16 extracytoplasmic function (ECF) σ factors. In this paper, to begin to investigate the regulatory network of one of these ECF σ factors, the whole proteome of R. palustris CGA010 was quantitatively analyzed by tandem mass spectrometry from cultures episomally expressing the ECF σRPA4225 (ecfT) versus a WT control. Among the proteins with the greatest increase in abundance were catalase KatE, trehalose synthase, a DPS-like protein, and several regulatory proteins. Alignment of the cognate promoter regions driving expression of several upregulated proteins suggested a conserved binding motif in the -35 and -10 regions with the consensus sequencemore » GGAAC-18N-TT. Additionally, the putative anti-σ factor RPA4224, whose gene is contained in the same predicted operon as RPA4225, was identified as interacting directly with the predicted response regulator RPA4223 by mass spectrometry of affinity-isolated protein complexes. Furthermore, another gene (RPA4226) coding for a protein that contains a cytoplasmic histidine kinase domain is located immediately upstream of RPA4225. The genomic organization of orthologs for these four genes is conserved in several other strains of R. palustris as well as in closely related α-Proteobacteria. Finally, taken together, these data suggest that ECF σRPA4225 and the three additional genes make up a sigma factor mimicry system in R. palustris.« less

  4. Identification of Multiple Metabolic Enzymes from Mice Cochleae Tissue Using a Novel Functional Proteomics Technology

    PubMed Central

    Wang, David L.; Li, Hui; Liang, Ruqiang; Bao, Jianxin

    2015-01-01

    A new type of technology in proteomics was developed in order to separate a complex protein mixture and analyze protein functions systematically. The technology combines the ability of two-dimensional gel electrophoresis (2-DE) to separate proteins with a protein elution plate (PEP) to recover active proteins for functional analysis and mass spectrometry (MS)-based identification. In order to demonstrate the feasibility of this functional proteomics approach, NADH and NADPH-dependent oxidases, major redox enzyme families, were identified from mice cochlear tissue after a specific drug treatment. By comparing the enzymatic activity between mice that were treated with a drug and a control group significant changes were observed. Using MS, five NADH-dependent oxidases were identified that showed highly altered enzymatic activities due to the drug treatment. In essence, the PEP technology allows for a systematic analysis of a large enzyme family from a complex proteome, providing insights in understanding the mechanism of drug treatment. PMID:25811366

  5. Green systems biology - From single genomes, proteomes and metabolomes to ecosystems research and biotechnology.

    PubMed

    Weckwerth, Wolfram

    2011-12-10

    biochemical networks up to whole species populations. This process relies on the development of new technologies for the analysis of molecular data, especially genomics, metabolomics and proteomics data. The ambitious aim of these non-targeted 'omic' technologies is to extend our understanding beyond the analysis of separated parts of the system, in contrast to traditional reductionistic hypothesis-driven approaches. The consequent integration of genotyping, pheno/morphotyping and the analysis of the molecular phenotype using metabolomics, proteomics and transcriptomics will reveal a novel understanding of plant metabolism and its interaction with the environment. The analysis of single model systems - plants, fungi, animals and bacteria - will finally emerge in the analysis of populations of plants and other organisms and their adaptation to the ecological niche. In parallel, this novel understanding of ecophysiology will translate into knowledge-based approaches in crop plant biotechnology and marker- or genome-assisted breeding approaches. In this review the foundations of green systems biology are described and applications in ecosystems research are presented. Knowledge exchange of ecosystems research and green biotechnology merging into green systems biology is anticipated based on the principles of natural variation, biodiversity and the genotype-phenotype environment relationship as the fundamental drivers of ecology and evolution. PMID:21802534

  6. Genomic and Proteomic Analyses of the Agarolytic System Expressed by Saccharophagus degradans 2-40†

    PubMed Central

    Ekborg, Nathan A.; Taylor, Larry E.; Longmire, Atkinson G.; Henrissat, Bernard; Weiner, Ronald M.; Hutcheson, Steven W.

    2006-01-01

    Saccharophagus degradans 2-40 (formerly Microbulbifer degradans 2-40) is a marine gamma-subgroup proteobacterium capable of degrading many complex polysaccharides, such as agar. While several agarolytic systems have been characterized biochemically, the genetics of agarolytic systems have been only partially determined. By use of genomic, proteomic, and genetic approaches, the components of the S. degradans 2-40 agarolytic system were identified. Five agarases were identified in the S. degradans 2-40 genome. Aga50A and Aga50D include GH50 domains. Aga86C and Aga86E contain GH86 domains, whereas Aga16B carries a GH16 domain. Novel family 6 carbohydrate binding modules (CBM6) were identified in Aga16B and Aga86E. Aga86C has an amino-terminal acylation site, suggesting that it is surface associated. Aga16B, Aga86C, and Aga86E were detected by mass spectrometry in agarolytic fractions obtained from culture filtrates of agar-grown cells. Deletion analysis revealed that aga50A and aga86E were essential for the metabolism of agarose. Aga16B was shown to endolytically degrade agarose to release neoagarotetraose, similarly to a β-agarase I, whereas Aga86E was demonstrated to exolytically degrade agarose to form neoagarobiose. The agarolytic system of S. degradans 2-40 is thus predicted to be composed of a secreted endo-acting GH16-dependent depolymerase, a surface-associated GH50-dependent depolymerase, an exo-acting GH86-dependent agarase, and an α-neoagarobiose hydrolase to release galactose from agarose. PMID:16672483

  7. Network Analysis of Epidermal Growth Factor Signaling using Integrated Genomic, Proteomic and Phosphorylation Data

    SciTech Connect

    Waters, Katrina M.; Liu, Tao; Quesenberry, Ryan D.; Willse, Alan R.; Bandyopadhyay, Somnath; Kathmann, Loel E.; Weber, Thomas J.; Smith, Richard D.; Wiley, H. S.; Thrall, Brian D.

    2012-03-29

    To understand how integration of multiple data types can help decipher cellular responses at the systems level, we analyzed the mitogenic response of human mammary epithelial cells to epidermal growth factor (EGF) using whole genome microarrays, mass spectrometry-based proteomics and large-scale western blots with over 1000 antibodies. A time course analysis revealed significant differences in the expression of 3172 genes and 596 proteins, including protein phosphorylation changes measured by western blot. Integration of these disparate data types showed that each contributed qualitatively different components to the observed cell response to EGF and that varying degrees of concordance in gene expression and protein abundance measurements could be linked to specific biological processes. Networks inferred from individual data types were relatively limited, whereas networks derived from the integrated data recapitulated the known major cellular responses to EGF and exhibited more highly connected signaling nodes than networks derived from any individual dataset. While cell cycle regulatory pathways were altered as anticipated, we found the most robust response to mitogenic concentrations of EGF was induction of matrix metalloprotease cascades, highlighting the importance of the EGFR system as a regulator of the extracellular environment. These results demonstrate the value of integrating multiple levels of biological information to more accurately reconstruct networks of cellular response.

  8. Application of functional genomics and proteomics to plant cryopreservation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Plant cryobiology has primarily emerged from the classical fields of cryobiology and plant stress physiology. Cryopreservation tools are now available to geneticists for germplasm preservation and the field itself is advancing significantly through the use of molecular techniques. Long-term preser...

  9. Application of Functional Genomics and Proteomics to Plant Cryopreservation

    PubMed Central

    Volk, Gayle M

    2010-01-01

    Plant cryobiology has primarily emerged from the classical fields of cryobiology and plant stress physiology. Cryopreservation tools are now available to geneticists for germplasm preservation and the field itself is advancing significantly through the use of molecular techniques. Long-term preservation of vegetatively propagated tissues can minimize the risks of long-term maintenance under tissue culture or field conditions. Cells can be successfully cryopreserved when the adverse affects of ice crystal formation are mitigated by the removal of water or procedures to limit ice formation and crystal growth. The addition of cryoprotectant solutions to hydrated cells may improve the survival of microdissected shoot tips or embryonic axes. Recent discoveries in the genetic pathways leading to cold acclimation and freezing tolerance suggest the involvement of key cold-regulated genes in the acquisition of cold tolerance in plant tissues. Model systems of banana and Arabidopsis have revealed the involvement of genes and proteins in the glycolytic and other metabolic pathways, particularly processes involved in dehydration tolerance, osmoprotection, and membrane transport. Furthermore, successful recovery appears to be dependent upon the presence of antioxidant protection from reactive oxygen species. Characterization of specific genes and proteins will lead to significant advances in plant cryobiology research. PMID:20808520

  10. Functional Annotation Analytics of Rhodopseudomonas palustris Genomes

    PubMed Central

    Simmons, Shaneka S.; Isokpehi, Raphael D.; Brown, Shyretha D.; McAllister, Donee L.; Hall, Charnia C.; McDuffy, Wanaki M.; Medley, Tamara L.; Udensi, Udensi K.; Rajnarayanan, Rajendram V.; Ayensu, Wellington K.; Cohly, Hari H.P.

    2011-01-01

    Rhodopseudomonas palustris, a nonsulphur purple photosynthetic bacteria, has been extensively investigated for its metabolic versatility including ability to produce hydrogen gas from sunlight and biomass. The availability of the finished genome sequences of six R. palustris strains (BisA53, BisB18, BisB5, CGA009, HaA2 and TIE-1) combined with online bioinformatics software for integrated analysis presents new opportunities to determine the genomic basis of metabolic versatility and ecological lifestyles of the bacteria species. The purpose of this investigation was to compare the functional annotations available for multiple R. palustris genomes to identify annotations that can be further investigated for strain-specific or uniquely shared phenotypic characteristics. A total of 2,355 protein family Pfam domain annotations were clustered based on presence or absence in the six genomes. The clustering process identified groups of functional annotations including those that could be verified as strain-specific or uniquely shared phenotypes. For example, genes encoding water/glycerol transport were present in the genome sequences of strains CGA009 and BisB5, but absent in strains BisA53, BisB18, HaA2 and TIE-1. Protein structural homology modeling predicted that the two orthologous 240 aa R. palustris aquaporins have water-specific transport function. Based on observations in other microbes, the presence of aquaporin in R. palustris strains may improve freeze tolerance in natural conditions of rapid freezing such as nitrogen fixation at low temperatures where access to liquid water is a limiting factor for nitrogenase activation. In the case of adaptive loss of aquaporin genes, strains may be better adapted to survive in conditions of high-sugar content such as fermentation of biomass for biohydrogen production. Finally, web-based resources were developed to allow for interactive, user-defined selection of the relationship between protein family annotations and the R

  11. Non-coding genome functions in diabetes.

    PubMed

    Cebola, Inês; Pasquali, Lorenzo

    2016-01-01

    Most of the genetic variation associated with diabetes, through genome-wide association studies, does not reside in protein-coding regions, making the identification of functional variants and their eventual translation to the clinic challenging. In recent years, high-throughput sequencing-based methods have enabled genome-scale high-resolution epigenomic profiling in a variety of human tissues, allowing the exploration of the human genome outside of the well-studied coding regions. These experiments unmasked tens of thousands of regulatory elements across several cell types, including diabetes-relevant tissues, providing new insights into their mechanisms of gene regulation. Regulatory landscapes are highly dynamic and cell-type specific and, being sensitive to DNA sequence variation, can vary with individual genomes. The scientific community is now in place to exploit the regulatory maps of tissues central to diabetes etiology, such as pancreatic progenitors and adult islets. This giant leap forward in the understanding of pancreatic gene regulation is revolutionizing our capacity to discriminate between functional and non-functional non-coding variants, opening opportunities to uncover regulatory links between sequence variation and diabetes susceptibility. In this review, we focus on the non-coding regulatory landscape of the pancreatic endocrine cells and provide an overview of the recent developments in this field. PMID:26438568

  12. A transgenic perspective on plant functional genomics.

    PubMed

    Pereira, A

    2000-01-01

    Transgenic crops are very much in the news due to the increasing public debate on their acceptance. In the scientific community though, transgenic plants are proving to be powerful tools to study various aspects of plant sciences. The emerging scientific revolution sparked by genomics based technologies is producing enormous amounts of DNA sequence information that, together with plant transformation methodology, is opening up new experimental opportunities for functional genomics analysis. An overview is provided here on the use of transgenic technology for the functional analysis of plant genes in model plants and a link made to their utilization in transgenic crops. In transgenic plants, insertional mutagenesis using heterologous maize transposons or Agrobacterium mediated T-DNA insertions, have been valuable tools for the identification and isolation of genes that display a mutant phenotype. To discover functions of genes that do not display phenotypes when mutated, insertion sequences have been engineered to monitor or change the expression pattern of adjacent genes. These gene detector insertions can detect adjacent promoters, enhancers or gene exons and precisely reflect the expression pattern of the tagged gene. Activation tag insertions can mis-express the adjacent gene and confer dominant phenotypes that help bridge the phenotype gap. Employment of various forms of gene silencing technology broadens the scope of recovering knockout phenotypes for genes with redundant function. All these transgenic strategies describing gene-phenotype relationships can be addressed by high throughput reverse genetics methods that will help provide functions to the genes discovered by genome sequencing. The gene functions discovered by insertional mutagenesis and silencing strategies along with expression pattern analysis will provide an integrated functional genomics perspective and offer unique applications in transgenic crops. PMID:11131004

  13. Current Progress in Tonoplast Proteomics Reveals Insights into the Function of the Large Central Vacuole

    PubMed Central

    Trentmann, Oliver; Haferkamp, Ilka

    2013-01-01

    Vacuoles of plants fulfill various biologically important functions, like turgor generation and maintenance, detoxification, solute sequestration, or protein storage. Different types of plant vacuoles (lytic versus protein storage) are characterized by different functional properties apparently caused by a different composition/abundance and regulation of transport proteins in the surrounding membrane, the tonoplast. Proteome analyses allow the identification of vacuolar proteins and provide an informative basis for assigning observed transport processes to specific carriers or channels. This review summarizes techniques required for vacuolar proteome analyses, like e.g., isolation of the large central vacuole or tonoplast membrane purification. Moreover, an overview about diverse published vacuolar proteome studies is provided. It becomes evident that qualitative proteomes from different plant species represent just the tip of the iceberg. During the past few years, mass spectrometry achieved immense improvement concerning its accuracy, sensitivity, and application. As a consequence, modern tonoplast proteome approaches are suited for detecting alterations in membrane protein abundance in response to changing environmental/physiological conditions and help to clarify the regulation of tonoplast transport processes. PMID:23459586

  14. Integration of genomic and proteomic data to identify candidate genes in HT-29 cells after incubation with Bifidobacterium bifidum ATCC 29521.

    PubMed

    Wang, Bao-Gui; Wu, Yaoping; Qiu, Liang; Shah, Nagendra P; Xu, Feng; Wei, Hua

    2016-09-01

    As the predominant group inhabiting the human gastrointestinal tract, bifidobacteria play a vital role in human nutrition, therapeutics, and health by shaping and maintaining the gut ecosystem, reducing blood cholesterol, and promoting the supply of nutrients. The interaction between bacterial cells and human intestinal epithelial cell lines has been studied for decades in an attempt to understand the mechanisms of action. These studies, however, have been limited by lack of genomic and proteomic database to aid in achieving comprehensive understanding of these mechanisms at molecular levels. Microarray data (GSE: 74119) coupled with isobaric tags for relative and absolute quantitation (iTRAQ) were performed to detect differentially expressed genes and proteins in HT-29 cells after incubation with Bifidobacterium bifidum. Real-time quantitative PCR, gene ontology, and Kyoto Encyclopedia of Genes and Genomes analyses were further conducted for mRNA validation, functional annotation, and pathway identification, respectively. According to the results of microarray, 1,717 differentially expressed genes, including 1,693 upregulated and 24 downregulated genes, were selected and classified by the gene ontology database. The iTRAQ analysis identified 43 differentially expressed proteins, where 29 proteins were upregulated and 14 proteins were downregulated. Eighty-two candidate genes showing consistent differences with microarray and iTRAQ were further validated in HT-29 and Caco-2 cells by real-time quantitative PCR. Nine of the top genes showing interesting results with high confidence were further investigated in vivo in mice intestine samples. Integration of genomic and proteomic data provides an approach to identify candidate genes that are more likely to function in ubiquitin-mediated proteolysis, positive regulation of apoptosis, membrane proteins, and transferase catalysis. These findings might contribute to our understanding of molecular mechanisms regulating the

  15. Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity

    PubMed Central

    2010-01-01

    We consider the problem of biological complexity via a projection of protein-coding genes of complex organisms onto the functional space of the proteome. The latter can be defined as a set of all functions committed by proteins of an organism. Alternative splicing (AS) allows an organism to generate diverse mature RNA transcripts from a single mRNA strand and thus it could be one of the key mechanisms of increasing of functional complexity of the organism's proteome and a driving force of biological evolution. Thus, the projection of transcription units (TU) and alternative splice-variant (SV) forms onto proteome functional space could generate new types of relational networks (e.g. SV-protein function networks, SFN) and lead to discoveries of novel evolutionarily conservative functional modules. Such types of networks might provide new reliable characteristics of organism complexity and a better understanding of the evolutionary integration and plasticity of interconnection of genome-transcriptome-proteome functions. Results We use the InterPro and UniProt databases to attribute descriptive features (keywords) to protein sequences. UniProt database includes a controlled and curated vocabulary of specific descriptors or keywords. The keywords have been assigned to a protein sequence via conserved domains or via similarity with annotated sequences. Then we consider the unique combinations of keywords as the protein functional labels (FL), which characterize the biological functions of the given protein and construct the contingency tables and graphs providing the projections of transcription units (TU) and alternative splice-variants (SV) onto all FL of the proteome of a given organism. We constructed SFNs for organisms with different evolutionary history and levels of complexity, and performed detailed statistical parameterization of the networks. Conclusions The application of the algorithm to organisms with different evolutionary history and level of biological

  16. Application of an improved proteomics method for abundant protein cleanup: molecular and genomic mechanisms study in plant defense.

    PubMed

    Zhang, Yixiang; Gao, Peng; Xing, Zhuo; Jin, Shumei; Chen, Zhide; Liu, Lantao; Constantino, Nasie; Wang, Xinwang; Shi, Weibing; Yuan, Joshua S; Dai, Susie Y

    2013-11-01

    High abundance proteins like ribulose-1,5-bisphosphate carboxylase oxygenase (Rubisco) impose a consistent challenge for the whole proteome characterization using shot-gun proteomics. To address this challenge, we developed and evaluated Polyethyleneimine Assisted Rubisco Cleanup (PARC) as a new method by combining both abundant protein removal and fractionation. The new approach was applied to a plant insect interaction study to validate the platform and investigate mechanisms for plant defense against herbivorous insects. Our results indicated that PARC can effectively remove Rubisco, improve the protein identification, and discover almost three times more differentially regulated proteins. The significantly enhanced shot-gun proteomics performance was translated into in-depth proteomic and molecular mechanisms for plant insect interaction, where carbon re-distribution was used to play an essential role. Moreover, the transcriptomic validation also confirmed the reliability of PARC analysis. Finally, functional studies were carried out for two differentially regulated genes as revealed by PARC analysis. Insect resistance was induced by over-expressing either jacalin-like or cupin-like genes in rice. The results further highlighted that PARC can serve as an effective strategy for proteomics analysis and gene discovery. PMID:23943779

  17. Application of an Improved Proteomics Method for Abundant Protein Cleanup: Molecular and Genomic Mechanisms Study in Plant Defense*

    PubMed Central

    Zhang, Yixiang; Gao, Peng; Xing, Zhuo; Jin, Shumei; Chen, Zhide; Liu, Lantao; Constantino, Nasie; Wang, Xinwang; Shi, Weibing; Yuan, Joshua S.; Dai, Susie Y.

    2013-01-01

    High abundance proteins like ribulose-1,5-bisphosphate carboxylase oxygenase (Rubisco) impose a consistent challenge for the whole proteome characterization using shot-gun proteomics. To address this challenge, we developed and evaluated Polyethyleneimine Assisted Rubisco Cleanup (PARC) as a new method by combining both abundant protein removal and fractionation. The new approach was applied to a plant insect interaction study to validate the platform and investigate mechanisms for plant defense against herbivorous insects. Our results indicated that PARC can effectively remove Rubisco, improve the protein identification, and discover almost three times more differentially regulated proteins. The significantly enhanced shot-gun proteomics performance was translated into in-depth proteomic and molecular mechanisms for plant insect interaction, where carbon re-distribution was used to play an essential role. Moreover, the transcriptomic validation also confirmed the reliability of PARC analysis. Finally, functional studies were carried out for two differentially regulated genes as revealed by PARC analysis. Insect resistance was induced by over-expressing either jacalin-like or cupin-like genes in rice. The results further highlighted that PARC can serve as an effective strategy for proteomics analysis and gene discovery. PMID:23943779

  18. Genomic and proteomic characterization of “Candidatus Nitrosopelagicus brevis”: An ammonia-oxidizing archaeon from the open ocean

    PubMed Central

    Santoro, Alyson E.; Dupont, Christopher L.; Richter, R. Alex; Craig, Matthew T.; Carini, Paul; McIlvin, Matthew R.; Yang, Youngik; Orsi, William D.; Moran, Dawn M.; Saito, Mak A.

    2015-01-01

    Thaumarchaeota are among the most abundant microbial cells in the ocean, but difficulty in cultivating marine Thaumarchaeota has hindered investigation into the physiological and evolutionary basis of their success. We report here a closed genome assembled from a highly enriched culture of the ammonia-oxidizing pelagic thaumarchaeon CN25, originating from the open ocean. The CN25 genome exhibits strong evidence of genome streamlining, including a 1.23-Mbp genome, a high coding density, and a low number of paralogous genes. Proteomic analysis recovered nearly 70% of the predicted proteins encoded by the genome, demonstrating that a high fraction of the genome is translated. In contrast to other minimal marine microbes that acquire, rather than synthesize, cofactors, CN25 encodes and expresses near-complete biosynthetic pathways for multiple vitamins. Metagenomic fragment recruitment indicated the presence of DNA sequences >90% identical to the CN25 genome throughout the oligotrophic ocean. We propose the provisional name “Candidatus Nitrosopelagicus brevis” str. CN25 for this minimalist marine thaumarchaeon and suggest it as a potential model system for understanding archaeal adaptation to the open ocean. PMID:25587132

  19. Proteomic analysis of the royal jelly and characterization of the functions of its derivation glands in the honeybee.

    PubMed

    Fujita, Toshiyuki; Kozuka-Hata, Hiroko; Ao-Kondo, Hiroko; Kunieda, Takekazu; Oyama, Masaaki; Kubo, Takeo

    2013-01-01

    To identify candidate royal jelly (RJ) proteins that might affect the physiologic status of honeybee colony members, we used shotgun proteomics to comprehensively identify the RJ proteome as well as proteomes of the hypopharyngeal gland (HpG), postcerebral gland (PcG), and thoracic gland (TG), from which RJ proteins are assumed to be derived. We identified a total of 38 nonredundant RJ proteins, including 22 putative secretory proteins and Insulin-like growth factor-binding protein complex acid labile subunit. Among them, 9 proteins were newly identified from RJ. Comparison of the RJ proteome with the HpG, PcG, and TG proteomes revealed that 17 of the 22 putative secretory RJ proteins were derived from some of these glands, suggesting that the RJ proteome is a cocktail of proteins from these three glands. Furthermore, pathway analysis suggested that the HpG proteome represents the molecular basis of the extremely high protein-synthesizing ability, whereas the PcG proteome suggests that the PcG functions as a reservoir for the volatile compounds and a primer pheromone. Finally, to further characterize the possible total RJ proteome, we identified putative secretory proteins in the proteomes of these three glands. This will be useful for predicting novel RJ protein components in future studies. PMID:23157659

  20. Using CAVE technology for functional genomics studies.

    PubMed

    Sensen, Christoph W

    2002-01-01

    We have established the first Java 3D-enabled CAVE (CAVE automated virtual environment). The Java application programming interface allows the complete separation of the program development from the program execution, opening new application domains for the CAVE technology. Programs can be developed on any Java-enabled computer platform, including Windows, Macintosh, and Linux workstations, and executed in the CAVE without modification. The introduction of Java, one of the major programming environments for bioinformatics, into the CAVE environment allows the rapid development applications for genome research, especially for the analysis of the spatial and temporal data that are being produced by functional genomics experiments. The CAVE technology will play a major role in the modeling of biological systems that is necessary to understand how these systems are organized and how they function. PMID:12614491

  1. [Genomic, proteomic and metabolomic predictors of nonalcoholic fatty liver disease development in obese patients. Part I].

    PubMed

    Chernyak, O O; Sentsova, T B; Vorozhko, I V; Tutelyan, V A; Gapparova, K M; Isakov, V A

    2015-01-01

    The prevention, diagnosis and treatment of diseases associated to obesity require a qualitative increase of efficiency. There are still disputable questions about diagnostic significance of some molecules, including genomic, proteomic and metabolomic biomarkers. We observed 72 obese patients (20 men and 52 women, mean age--41.3 +/- 2.5) and performed ultrasound elastography and ultrasound of liver. We have identified two groups of patients: Group 1 consisted of 50 obese patients without complications (BMI 43.2 +/- 0.6), group 2 consisted of 22 patients with obesity complicated with nonalcoholic fatty liver disease (BMI 45.8 +/- 2.3). Determination of the adipokines (adiponectin, ghrelin, resistin, visfatin, and apelin), cytokine (interleukin--6, TNFalpha) oxidized lipoproteins (oxLDL), adhesion molecule sICAM (soluble intercellular cell adhesion molecule), fatty acid transporter L-FABP in serum was performed by ELISA. The study of the lipid metabolism involved determination of the concentration of total cholesterols, triglycerides, low and high density lipoproteins (LDL and HDL) by turbidimetry and spectrophotometry by analyzer. In addition, we conducted analysis of polymorphic alleles epsilon2, epsilon3, episolon4 of ApoE gene using polymerase chain reaction. Our data indicate that reducing the concentration of adiponectin (0.46-1.71 mcg/ml), increasing the level of glucose (5.57-6.25 mmol/l), triglycerides (2.06-3.94 mmol/l), TNFalpha (5.07-16.68 pg/ml) and L-FABP (11.62-23.76 pg/ml) are predictors of nonalcoholic fatty liver disease in obese patients, and the presence of genotype epsilon3/epsilon4 of ApoE gene is a poor prognostic marker of severity of nonalcoholic fatty liver disease. PMID:26852528

  2. Biomarkers for pancreatic cancer: Recent achievements in proteomics and genomics through classical and multivariate statistical methods

    PubMed Central

    Marengo, Emilio; Robotti, Elisa

    2014-01-01

    Pancreatic cancer (PC) is one of the most aggressive and lethal neoplastic diseases. A valid alternative to the usual invasive diagnostic tools would certainly be the determination of biomarkers in peripheral fluids to provide less invasive tools for early diagnosis. Nowadays, biomarkers are generally investigated mainly in peripheral blood and tissues through high-throughput omics techniques comparing control vs pathological samples. The results can be evaluated by two main strategies: (1) classical methods in which the identification of significant biomarkers is accomplished by monovariate statistical tests where each biomarker is considered as independent from the others; and (2) multivariate methods, taking into consideration the correlations existing among the biomarkers themselves. This last approach is very powerful since it allows the identification of pools of biomarkers with diagnostic and prognostic performances which are superior to single markers in terms of sensitivity, specificity and robustness. Multivariate techniques are usually applied with variable selection procedures to provide a restricted set of biomarkers with the best predictive ability; however, standard selection methods are usually aimed at the identification of the smallest set of variables with the best predictive ability and exhaustivity is usually neglected. The exhaustive search for biomarkers is instead an important alternative to standard variable selection since it can provide information about the etiology of the pathology by producing a comprehensive set of markers. In this review, the most recent applications of the omics techniques (proteomics, genomics and metabolomics) to the identification of exploratory biomarkers for PC will be presented with particular regard to the statistical methods adopted for their identification. The basic theory related to classical and multivariate methods for identification of biomarkers is presented and then, the most recent applications in

  3. Salivary Proteomic and Genomic Biomarkers for Primary Sjögren’s Syndrome

    PubMed Central

    Hu, Shen; Wang, Jianghua; Meijer, Jiska; Ieong, Sonya; Xie, Yongming; Yu, Tianwei; Zhou, Hui; Henry, Sharon; Vissink, Arjan; Pijpe, Justin; Kallenberg, Cees; Elashoff, David; Loo, Joseph A.; Wong, David T.

    2010-01-01

    Objective To identify a panel of protein and messenger RNA (mRNA) biomarkers in human whole saliva (WS) that may be used in the detection of primary Sjögren’s syndrome (SS). Methods Mass spectrometry and expression microarray profiling were used to identify candidate protein and mRNA biomarkers of primary SS in WS samples. Validation of the discovered mRNA and protein biomarkers was also demonstrated using real-time quantitative polymerase chain reaction and immunoblotting techniques. Results Sixteen WS proteins were found to be down-regulated and 25 WS proteins were found to be up-regulated in primary SS patients compared with matched healthy control subjects. These proteins reflected the damage of glandular cells and inflammation of the oral cavity system in patients with primary SS. In addition, 16 WS peptides (10 up-regulated and 6 down-regulated in primary SS) were found at significantly different levels (P <0.05) in primary SS patients and controls. Using stringent criteria (3-fold change; P <0.0005), 27 mRNA in saliva samples were found to be significantly up-regulated in the primary SS patients. Strikingly, 19 of 27 genes that were found to be overex-pressed were interferon-inducible or were related to lymphocyte filtration and antigen presentation known to be involved in the pathogenesis of primary SS. Conclusion Our preliminary study has indicated that WS from patients with primary SS contains molecular signatures that reflect damaged glandular cells and an activated immune response in this autoimmune disease. These candidate proteomic and genomic biomarkers may improve the clinical detection of primary SS once they have been further validated. We also found that WS contains more informative proteins, peptides, and mRNA, as compared with gland-specific saliva, that can be used in generating candidate biomarkers for the detection of primary SS. PMID:17968930

  4. Investigation of Yersinia pestis laboratory adaptation through a combined genomics and proteomics approach

    SciTech Connect

    Leiser, Owen P.; Merkley, Eric D.; Clowers, Brian H.; Kaiser, Brooke L. Deatherage; Lin, Andy; Hutchison, Janine R.; Melville, Angela M.; Wagner, David M.; Keim, Paul S.; Foster, Jeff; Kreuzer, Helen W.

    2015-11-24

    Here, the bacterial pathogen Yersinia pestis, the cause of plague in humans and animals, normally has a sylvatic lifestyle, cycling between fleas and mammals. In contrast, laboratory-grown Y. pestis experiences a more constant environment and conditions that it would not normally encounter. The transition from the natural environment to the laboratory results in a vastly different set of selective pressures, and represents what could be considered domestication. Understanding the kinds of adaptations Y. pestis undergoes as it becomes domesticated will contribute to understanding the basic biology of this important pathogen. In this study, we performed a Parallel Serial Passage Experiment (PSPE) to explore the mechanisms by which Y. pestis adapts to laboratory conditions, hypothesizing that cells would undergo significant changes in virulence and nutrient acquisition systems. Two wild strains were serially passaged in 12 independent populations each for ~750 generations, after which each population was analyzed using whole-genome sequencing. We observed considerable parallel evolution in the endpoint populations, detecting multiple independent mutations in ail, pepA, and zwf, suggesting that specific selective pressures are shaping evolutionary responses. Complementary LC-MS-based proteomic data provide physiological context to the observed mutations, and reveal regulatory changes not necessarily associated with specific mutations, including changes in amino acid metabolism, envelope biogenesis, iron storage and acquisition, and a type VI secretion system. Proteomic data support hypotheses generated by genomic data in addition to suggesting future mechanistic studies, indicating that future whole-genome sequencing studies be designed to leverage proteomics as a critical complement.

  5. Orchidstra: an integrated orchid functional genomics database.

    PubMed

    Su, Chun-lin; Chao, Ya-Ting; Yen, Shao-Hua; Chen, Chun-Yi; Chen, Wan-Chieh; Chang, Yao-Chien Alex; Shih, Ming-Che

    2013-02-01

    A specialized orchid database, named Orchidstra (URL: http://orchidstra.abrc.sinica.edu.tw), has been constructed to collect, annotate and share genomic information for orchid functional genomics studies. The Orchidaceae is a large family of Angiosperms that exhibits extraordinary biodiversity in terms of both the number of species and their distribution worldwide. Orchids exhibit many unique biological features; however, investigation of these traits is currently constrained due to the limited availability of genomic information. Transcriptome information for five orchid species and one commercial hybrid has been included in the Orchidstra database. Altogether, these comprise >380,000 non-redundant orchid transcript sequences, of which >110,000 are protein-coding genes. Sequences from the transcriptome shotgun assembly (TSA) were obtained either from output reads from next-generation sequencing technologies assembled into contigs, or from conventional cDNA library approaches. An annotation pipeline using Gene Ontology, KEGG and Pfam was built to assign gene descriptions and functional annotation to protein-coding genes. Deep sequencing of small RNA was also performed for Phalaenopsis aphrodite to search for microRNAs (miRNAs), extending the information archived for this species to miRNA annotation, precursors and putative target genes. The P. aphrodite transcriptome information was further used to design probes for an oligonucleotide microarray, and expression profiling analysis was carried out. The intensities of hybridized probes derived from microarray assays of various tissues were incorporated into the database as part of the functional evidence. In the future, the content of the Orchidstra database will be expanded with transcriptome data and genomic information from more orchid species. PMID:23324169

  6. Microbial community proteomics for characterizing the range of metabolic functions and activities of human gut microbiota

    SciTech Connect

    Xiong, Weili; Abraham, Paul E.; Li, Zhou; Pan, Chongle; Robert L. Hettich

    2015-01-01

    We found that the human gastrointestinal (GI) tract is a complex, dynamic ecosystem that consists of a carefully tuned balance of human host and microbiota membership. The microbiome component is not insignificant, but rather provides important functions that are absolutely critical to many aspects of human health, including nutrient transformation and absorption, drug metabolism, pathogen defense, and immune system development. Microbial community proteomics (sometimes referred to as metaproteomics) provides a powerful approach to measure the range and details of human gut microbiota functions and metabolic activities, revealing information about microbiome development and stability especially with regard to human health vs. disease states. In most cases, both microbial and human proteins are extracted from fecal samples and then measured by the high performance MS-based proteomics technology. We review the field of human gut microbiome community proteomics, with a focus on the experimental and informatics considerations involved in characterizing systems that range from low complexity defined model gut microbiota in gnotobiotic mice, to the simple gut microbiota in the GI tract of newborn infants, and finally to the complex gut microbiota in adults. Moreover, the current state-of-the-art in experimental and bioinformatics capabilities for community proteomics enable a detailed measurement of the gut microbiota, yielding valuable insights into the broad functional profiles of even complex microbiota. Future developments are likely to expand into improved analysis throughput and coverage depth, as well as post-translational modification characterizations.

  7. Microbial community proteomics for characterizing the range of metabolic functions and activities of human gut microbiota

    DOE PAGESBeta

    Xiong, Weili; Abraham, Paul E.; Li, Zhou; Pan, Chongle; Robert L. Hettich

    2015-01-01

    We found that the human gastrointestinal (GI) tract is a complex, dynamic ecosystem that consists of a carefully tuned balance of human host and microbiota membership. The microbiome component is not insignificant, but rather provides important functions that are absolutely critical to many aspects of human health, including nutrient transformation and absorption, drug metabolism, pathogen defense, and immune system development. Microbial community proteomics (sometimes referred to as metaproteomics) provides a powerful approach to measure the range and details of human gut microbiota functions and metabolic activities, revealing information about microbiome development and stability especially with regard to human health vs.more » disease states. In most cases, both microbial and human proteins are extracted from fecal samples and then measured by the high performance MS-based proteomics technology. We review the field of human gut microbiome community proteomics, with a focus on the experimental and informatics considerations involved in characterizing systems that range from low complexity defined model gut microbiota in gnotobiotic mice, to the simple gut microbiota in the GI tract of newborn infants, and finally to the complex gut microbiota in adults. Moreover, the current state-of-the-art in experimental and bioinformatics capabilities for community proteomics enable a detailed measurement of the gut microbiota, yielding valuable insights into the broad functional profiles of even complex microbiota. Future developments are likely to expand into improved analysis throughput and coverage depth, as well as post-translational modification characterizations.« less

  8. Polyploidy and the proteome.

    PubMed

    Soltis, Douglas E; Misra, Biswapriya B; Shan, Shengchen; Chen, Sixue; Soltis, Pamela S

    2016-08-01

    Although major advances have been made during the past 20 years in our understanding of the genetic and genomic consequences of polyploidy, our knowledge of polyploidy and the proteome is in its infancy. One of our goals is to stimulate additional study, particularly broad-scale proteomic analyses of polyploids and their progenitors. Although it may be too early to generalize regarding the extent to which transcriptomic data are predictive of the proteome of polyploids, it is clear that the proteome does not always reflect the transcriptome. Despite limited data, important observations on the proteomes of polyploids are emerging. In some cases, proteomic profiles show qualitatively and/or quantitatively non-additive patterns, and proteomic novelty has been observed. Allopolyploids generally combine the parental contributions, but there is evidence of parental dominance of one contributing genome in some allopolyploids. Autopolyploids are typically qualitatively identical to but quantitatively different from their parents. There is also evidence of parental legacy at the proteomic level. Proteomes clearly provide insights into the consequences of genomic merger and doubling beyond what is obtained from genomic and/or transcriptomic data. Translating proteomic changes in polyploids to differences in morphology and physiology remains the holy grail of polyploidy--this daunting task of linking genotype to proteome to phenotype should emerge as a focus of polyploidy research in the next decade. This article is part of a Special Issue entitled: Plant Proteomics--a bridge between fundamental processes and crop production, edited by Dr. Hans-Peter Mock. PMID:26993527

  9. Functional genomics for food fermentation processes.

    PubMed

    Smid, E J; Hugenholtz, J

    2010-01-01

    This review describes recent scientific and technological drivers of food fermentation research. In addition, a number of practical implications of the results of this development will be highlighted. The first part of the manuscript elaborates on the message that genome sequence information gives us an unprecedented view on the biodiversity of microbes in food fermentation. This information can be made applicable for tailoring relevant characteristics of food products through fermentation. The second part deals with the integration of genome sequence data into metabolic models and the use of these models for a number of topics that are relevant for food fermentation processes. The final part will be about metagenomics approaches to reveal the complexity and understand the functionality of undefined complex microbial consortia used in a diverse range of food fermentation processes. PMID:22129346

  10. Thermus thermophilus bacteriophage ϕYS40 genome and proteomic characterization of virions

    PubMed Central

    Naryshkina, Tatyana; Liu, Jing; Florens, Laurence; Swanson, Selene K.; Pavlov, Andrey R.; Pavlova, Nadejda V.; Inman, Ross; Kozyavkin, Sergei A.; Washburn, Michael; Mushegian, Arcady; Severinov, Konstantin

    2006-01-01

    We determined the sequence of the 152,372-bp genome of ϕYS40, a lytic tailed bacteriophage of Thermus thermophilus. The genome contains 170 putative open reading frames and three tRNA genes. Functions for 25% of ϕYS40 gene products were predicted on the basis of similarity to proteins of known function from diverse phages and bacteria. ϕYS40 encodes a cluster of proteins involved in nucleotide salvage, such as flavin-dependent thymidylate synthase, thymidylate kinase, ribonucleotide reductase, and deoxycytidylate deaminase, and in DNA replication, such as DNA primase, helicase, type A DNA polymerase, and predicted terminal protein involved in initiation of DNA synthesis. The structural genes of ϕYS40, most of which have no similarity to sequences in public databases, were identified by mass-spectrometric analysis of purified virions. Various ϕYS40 proteins have different phylogenetic neighbors, including Myovirus, Podovirus, and Siphovirus gene products, bacterial genes, and in one case, a dUTPase from a eukaryotic virus. ϕYS40 has apparently arisen through multiple acts of recombination between different phage genomes as well as through acquisition of bacterial genes. PMID:17027029