Sample records for integrated omics analysis

  1. CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002.

    PubMed

    Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong

    2015-01-01

    Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present CyanOmics, a database based on the results of Synechococcus sp. PCC 7002 omics studies. CyanOmics comprises one genomic dataset, 29 transcriptomic datasets and one proteomic dataset and should prove useful for systematic and comprehensive analysis of all those data. Powerful browsing and searching tools are integrated to help users directly access information of interest with enhanced visualization of the analytical results. Furthermore, Blast is included for sequence-based similarity searching and Cluster 3.0, as well as the R hclust function is provided for cluster analyses, to increase CyanOmics's usefulness. To the best of our knowledge, it is the first integrated omics analysis database for cyanobacteria. This database should further understanding of the transcriptional patterns, and proteomic profiling of Synechococcus sp. PCC 7002 and other cyanobacteria. Additionally, the entire database framework is applicable to any sequenced prokaryotic genome and could be applied to other integrated omics analysis projects. Database URL: http://lag.ihb.ac.cn/cyanomics. © The Author(s) 2015. Published by Oxford University Press.

  2. MONGKIE: an integrated tool for network analysis and visualization for multi-omics data.

    PubMed

    Jang, Yeongjun; Yu, Namhee; Seo, Jihae; Kim, Sun; Lee, Sanghyuk

    2016-03-18

    Network-based integrative analysis is a powerful technique for extracting biological insights from multilayered omics data such as somatic mutations, copy number variations, and gene expression data. However, integrated analysis of multi-omics data is quite complicated and can hardly be done in an automated way. Thus, a powerful interactive visual mining tool supporting diverse analysis algorithms for identification of driver genes and regulatory modules is much needed. Here, we present a software platform that integrates network visualization with omics data analysis tools seamlessly. The visualization unit supports various options for displaying multi-omics data as well as unique network models for describing sophisticated biological networks such as complex biomolecular reactions. In addition, we implemented diverse in-house algorithms for network analysis including network clustering and over-representation analysis. Novel functions include facile definition and optimized visualization of subgroups, comparison of a series of data sets in an identical network by data-to-visual mapping and subsequent overlaying function, and management of custom interaction networks. Utility of MONGKIE for network-based visual data mining of multi-omics data was demonstrated by analysis of the TCGA glioblastoma data. MONGKIE was developed in Java based on the NetBeans plugin architecture, thus being OS-independent with intrinsic support of module extension by third-party developers. We believe that MONGKIE would be a valuable addition to network analysis software by supporting many unique features and visualization options, especially for analysing multi-omics data sets in cancer and other diseases. .

  3. MinOmics, an Integrative and Immersive Tool for Multi-Omics Analysis.

    PubMed

    Maes, Alexandre; Martinez, Xavier; Druart, Karen; Laurent, Benoist; Guégan, Sean; Marchand, Christophe H; Lemaire, Stéphane D; Baaden, Marc

    2018-06-21

    Proteomic and transcriptomic technologies resulted in massive biological datasets, their interpretation requiring sophisticated computational strategies. Efficient and intuitive real-time analysis remains challenging. We use proteomic data on 1417 proteins of the green microalga Chlamydomonas reinhardtii to investigate physicochemical parameters governing selectivity of three cysteine-based redox post translational modifications (PTM): glutathionylation (SSG), nitrosylation (SNO) and disulphide bonds (SS) reduced by thioredoxins. We aim to understand underlying molecular mechanisms and structural determinants through integration of redox proteome data from gene- to structural level. Our interactive visual analytics approach on an 8.3 m2 display wall of 25 MPixel resolution features stereoscopic three dimensions (3D) representation performed by UnityMol WebGL. Virtual reality headsets complement the range of usage configurations for fully immersive tasks. Our experiments confirm that fast access to a rich cross-linked database is necessary for immersive analysis of structural data. We emphasize the possibility to display complex data structures and relationships in 3D, intrinsic to molecular structure visualization, but less common for omics-network analysis. Our setup is powered by MinOmics, an integrated analysis pipeline and visualization framework dedicated to multi-omics analysis. MinOmics integrates data from various sources into a materialized physical repository. We evaluate its performance, a design criterion for the framework.

  4. omicsNPC: Applying the Non-Parametric Combination Methodology to the Integrative Analysis of Heterogeneous Omics Data

    PubMed Central

    Karathanasis, Nestoras; Tsamardinos, Ioannis

    2016-01-01

    Background The advance of omics technologies has made possible to measure several data modalities on a system of interest. In this work, we illustrate how the Non-Parametric Combination methodology, namely NPC, can be used for simultaneously assessing the association of different molecular quantities with an outcome of interest. We argue that NPC methods have several potential applications in integrating heterogeneous omics technologies, as for example identifying genes whose methylation and transcriptional levels are jointly deregulated, or finding proteins whose abundance shows the same trends of the expression of their encoding genes. Results We implemented the NPC methodology within “omicsNPC”, an R function specifically tailored for the characteristics of omics data. We compare omicsNPC against a range of alternative methods on simulated as well as on real data. Comparisons on simulated data point out that omicsNPC produces unbiased / calibrated p-values and performs equally or significantly better than the other methods included in the study; furthermore, the analysis of real data show that omicsNPC (a) exhibits higher statistical power than other methods, (b) it is easily applicable in a number of different scenarios, and (c) its results have improved biological interpretability. Conclusions The omicsNPC function competitively behaves in all comparisons conducted in this study. Taking into account that the method (i) requires minimal assumptions, (ii) it can be used on different studies designs and (iii) it captures the dependences among heterogeneous data modalities, omicsNPC provides a flexible and statistically powerful solution for the integrative analysis of different omics data. PMID:27812137

  5. A retrospective likelihood approach for efficient integration of multiple omics factors in case-control association studies.

    PubMed

    Balliu, Brunilda; Tsonaka, Roula; Boehringer, Stefan; Houwing-Duistermaat, Jeanine

    2015-03-01

    Integrative omics, the joint analysis of outcome and multiple types of omics data, such as genomics, epigenomics, and transcriptomics data, constitute a promising approach for powerful and biologically relevant association studies. These studies often employ a case-control design, and often include nonomics covariates, such as age and gender, that may modify the underlying omics risk factors. An open question is how to best integrate multiple omics and nonomics information to maximize statistical power in case-control studies that ascertain individuals based on the phenotype. Recent work on integrative omics have used prospective approaches, modeling case-control status conditional on omics, and nonomics risk factors. Compared to univariate approaches, jointly analyzing multiple risk factors with a prospective approach increases power in nonascertained cohorts. However, these prospective approaches often lose power in case-control studies. In this article, we propose a novel statistical method for integrating multiple omics and nonomics factors in case-control association studies. Our method is based on a retrospective likelihood function that models the joint distribution of omics and nonomics factors conditional on case-control status. The new method provides accurate control of Type I error rate and has increased efficiency over prospective approaches in both simulated and real data. © 2015 Wiley Periodicals, Inc.

  6. Review of Statistical Learning Methods in Integrated Omics Studies (An Integrated Information Science).

    PubMed

    Zeng, Irene Sui Lan; Lumley, Thomas

    2018-01-01

    Integrated omics is becoming a new channel for investigating the complex molecular system in modern biological science and sets a foundation for systematic learning for precision medicine. The statistical/machine learning methods that have emerged in the past decade for integrated omics are not only innovative but also multidisciplinary with integrated knowledge in biology, medicine, statistics, machine learning, and artificial intelligence. Here, we review the nontrivial classes of learning methods from the statistical aspects and streamline these learning methods within the statistical learning framework. The intriguing findings from the review are that the methods used are generalizable to other disciplines with complex systematic structure, and the integrated omics is part of an integrated information science which has collated and integrated different types of information for inferences and decision making. We review the statistical learning methods of exploratory and supervised learning from 42 publications. We also discuss the strengths and limitations of the extended principal component analysis, cluster analysis, network analysis, and regression methods. Statistical techniques such as penalization for sparsity induction when there are fewer observations than the number of features and using Bayesian approach when there are prior knowledge to be integrated are also included in the commentary. For the completeness of the review, a table of currently available software and packages from 23 publications for omics are summarized in the appendix.

  7. Data integration in the era of omics: current and future challenges

    PubMed Central

    2014-01-01

    To integrate heterogeneous and large omics data constitutes not only a conceptual challenge but a practical hurdle in the daily analysis of omics data. With the rise of novel omics technologies and through large-scale consortia projects, biological systems are being further investigated at an unprecedented scale generating heterogeneous and often large data sets. These data-sets encourage researchers to develop novel data integration methodologies. In this introduction we review the definition and characterize current efforts on data integration in the life sciences. We have used a web-survey to assess current research projects on data-integration to tap into the views, needs and challenges as currently perceived by parts of the research community. PMID:25032990

  8. [Applications of meta-analysis in multi-omics].

    PubMed

    Han, Mingfei; Zhu, Yunping

    2014-07-01

    As a statistical method integrating multi-features and multi-data, meta-analysis was introduced to the field of life science in the 1990s. With the rapid advances in high-throughput technologies, life omics, the core of which are genomics, transcriptomics and proteomics, is becoming the new hot spot of life science. Although the fast output of massive data has promoted the development of omics study, it results in excessive data that are difficult to integrate systematically. In this case, meta-analysis is frequently applied to analyze different types of data and is improved continuously. Here, we first summarize the representative meta-analysis methods systematically, and then study the current applications of meta-analysis in various omics fields, finally we discuss the still-existing problems and the future development of meta-analysis.

  9. PaintOmics 3: a web resource for the pathway analysis and visualization of multi-omics data.

    PubMed

    Hernández-de-Diego, Rafael; Tarazona, Sonia; Martínez-Mira, Carlos; Balzano-Nogueira, Leandro; Furió-Tarí, Pedro; Pappas, Georgios J; Conesa, Ana

    2018-05-25

    The increasing availability of multi-omic platforms poses new challenges to data analysis. Joint visualization of multi-omics data is instrumental in better understanding interconnections across molecular layers and in fully utilizing the multi-omic resources available to make biological discoveries. We present here PaintOmics 3, a web-based resource for the integrated visualization of multiple omic data types onto KEGG pathway diagrams. PaintOmics 3 combines server-end capabilities for data analysis with the potential of modern web resources for data visualization, providing researchers with a powerful framework for interactive exploration of their multi-omics information. Unlike other visualization tools, PaintOmics 3 covers a comprehensive pathway analysis workflow, including automatic feature name/identifier conversion, multi-layered feature matching, pathway enrichment, network analysis, interactive heatmaps, trend charts, and more. It accepts a wide variety of omic types, including transcriptomics, proteomics and metabolomics, as well as region-based approaches such as ATAC-seq or ChIP-seq data. The tool is freely available at www.paintomics.org.

  10. Integration, warehousing, and analysis strategies of Omics data.

    PubMed

    Gedela, Srinubabu

    2011-01-01

    "-Omics" is a current suffix for numerous types of large-scale biological data generation procedures, which naturally demand the development of novel algorithms for data storage and analysis. With next generation genome sequencing burgeoning, it is pivotal to decipher a coding site on the genome, a gene's function, and information on transcripts next to the pure availability of sequence information. To explore a genome and downstream molecular processes, we need umpteen results at the various levels of cellular organization by utilizing different experimental designs, data analysis strategies and methodologies. Here comes the need for controlled vocabularies and data integration to annotate, store, and update the flow of experimental data. This chapter explores key methodologies to merge Omics data by semantic data carriers, discusses controlled vocabularies as eXtensible Markup Languages (XML), and provides practical guidance, databases, and software links supporting the integration of Omics data.

  11. Integrative Analysis of Omics Big Data.

    PubMed

    Yu, Xiang-Tian; Zeng, Tao

    2018-01-01

    The diversity and huge omics data take biology and biomedicine research and application into a big data era, just like that popular in human society a decade ago. They are opening a new challenge from horizontal data ensemble (e.g., the similar types of data collected from different labs or companies) to vertical data ensemble (e.g., the different types of data collected for a group of person with match information), which requires the integrative analysis in biology and biomedicine and also asks for emergent development of data integration to address the great changes from previous population-guided to newly individual-guided investigations.Data integration is an effective concept to solve the complex problem or understand the complicate system. Several benchmark studies have revealed the heterogeneity and trade-off that existed in the analysis of omics data. Integrative analysis can combine and investigate many datasets in a cost-effective reproducible way. Current integration approaches on biological data have two modes: one is "bottom-up integration" mode with follow-up manual integration, and the other one is "top-down integration" mode with follow-up in silico integration.This paper will firstly summarize the combinatory analysis approaches to give candidate protocol on biological experiment design for effectively integrative study on genomics and then survey the data fusion approaches to give helpful instruction on computational model development for biological significance detection, which have also provided newly data resources and analysis tools to support the precision medicine dependent on the big biomedical data. Finally, the problems and future directions are highlighted for integrative analysis of omics big data.

  12. Sarcoidosis Related Novel Candidate Genes Identified by Multi-Omics Integrative Analyses.

    PubMed

    Hočevar, Keli; Maver, Aleš; Kunej, Tanja; Peterlin, Borut

    2018-05-01

    Sarcoidosis is a multifactorial systemic disease characterized by granulomatous inflammation and greatly impacting on global public health. The etiology and mechanisms of sarcoidosis are not fully understood. Recent high-throughput biological research has generated vast amounts of multi-omics big data on sarcoidosis, but their significance remains to be determined. We sought to identify novel candidate regions, and genes consistently altered in heterogeneous omics studies so as to reveal the underlying molecular mechanisms. We conducted a comprehensive integrative literature analysis on global data on sarcoidosis, including genomic, transcriptomic, proteomic, and phenomic studies. We performed positional integration analysis of 38 eligible datasets originating from 17 different biological layers. Using the integration interval length of 50 kb, we identified 54 regions reaching significance value p ≤ 0.0001 and 15 regions with significance value p ≤ 0.00001, when applying more stringent criteria. Secondary literature analysis of the top 20 regions, with the most significant accumulation of signals, revealed several novel candidate genes for which associations with sarcoidosis have not yet been established, but have considerable support for their involvement based on omic data. These new plausible candidate genes include NELFE, CFB, EGFL7, AGPAT2, FKBPL, NRC3, and NEU1. Furthermore, annotated data were prepared to enable custom visualization and browsing of these sarcoidosis related omics evidence in the University of California Santa Cruz (UCSC) Genome Browser. Further multi-omics approaches are called for sarcoidosis biomarkers and diagnostic and therapeutic innovation. Our approach for harnessing multi-omics data and the findings presented herein reflect important steps toward understanding the etiology and underlying pathological mechanisms of sarcoidosis.

  13. Integrative eQTL analysis of tumor and host omics data in individuals with bladder cancer.

    PubMed

    Pineda, Silvia; Van Steen, Kristel; Malats, Núria

    2017-09-01

    Integrative analyses of several omics data are emerging. The data are usually generated from the same source material (i.e., tumor sample) representing one level of regulation. However, integrating different regulatory levels (i.e., blood) with those from tumor may also reveal important knowledge about the human genetic architecture. To model this multilevel structure, an integrative-expression quantitative trait loci (eQTL) analysis applying two-stage regression (2SR) was proposed. This approach first regressed tumor gene expression levels with tumor markers and the adjusted residuals from the previous model were then regressed with the germline genotypes measured in blood. Previously, we demonstrated that penalized regression methods in combination with a permutation-based MaxT method (Global-LASSO) is a promising tool to fix some of the challenges that high-throughput omics data analysis imposes. Here, we assessed whether Global-LASSO can also be applied when tumor and blood omics data are integrated. We further compared our strategy with two 2SR-approaches, one using multiple linear regression (2SR-MLR) and other using LASSO (2SR-LASSO). We applied the three models to integrate genomic, epigenomic, and transcriptomic data from tumor tissue with blood germline genotypes from 181 individuals with bladder cancer included in the TCGA Consortium. Global-LASSO provided a larger list of eQTLs than the 2SR methods, identified a previously reported eQTLs in prostate stem cell antigen (PSCA), and provided further clues on the complexity of APBEC3B loci, with a minimal false-positive rate not achieved by 2SR-MLR. It also represents an important contribution for omics integrative analysis because it is easy to apply and adaptable to any type of data. © 2017 WILEY PERIODICALS, INC.

  14. Robust Selection Algorithm (RSA) for Multi-Omic Biomarker Discovery; Integration with Functional Network Analysis to Identify miRNA Regulated Pathways in Multiple Cancers.

    PubMed

    Sehgal, Vasudha; Seviour, Elena G; Moss, Tyler J; Mills, Gordon B; Azencott, Robert; Ram, Prahlad T

    2015-01-01

    MicroRNAs (miRNAs) play a crucial role in the maintenance of cellular homeostasis by regulating the expression of their target genes. As such, the dysregulation of miRNA expression has been frequently linked to cancer. With rapidly accumulating molecular data linked to patient outcome, the need for identification of robust multi-omic molecular markers is critical in order to provide clinical impact. While previous bioinformatic tools have been developed to identify potential biomarkers in cancer, these methods do not allow for rapid classification of oncogenes versus tumor suppressors taking into account robust differential expression, cutoffs, p-values and non-normality of the data. Here, we propose a methodology, Robust Selection Algorithm (RSA) that addresses these important problems in big data omics analysis. The robustness of the survival analysis is ensured by identification of optimal cutoff values of omics expression, strengthened by p-value computed through intensive random resampling taking into account any non-normality in the data and integration into multi-omic functional networks. Here we have analyzed pan-cancer miRNA patient data to identify functional pathways involved in cancer progression that are associated with selected miRNA identified by RSA. Our approach demonstrates the way in which existing survival analysis techniques can be integrated with a functional network analysis framework to efficiently identify promising biomarkers and novel therapeutic candidates across diseases.

  15. From data to knowledge: The future of multi-omics data analysis for the rhizosphere

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Allen White, Richard; Borkum, Mark I.; Rivas-Ubach, Albert

    The rhizosphere is the interface between a plant's roots and its surrounding soil. The rhizosphere microbiome, a complex microbial ecosystem, nourishes the terrestrial biosphere. Integrated multi-omics is a modern approach to systems biology that analyzes and interprets the datasets of multiple -omes of both individual organisms and multi-organism communities and consortia. The successful usage and application of integrated multi-omics to rhizospheric science is predicated upon the availability of rhizosphere-specific data, metadata and software. This review analyzes the availability of multi-omics data, metadata and software for rhizospheric science, identifying potential issues, challenges and opportunities.

  16. CircadiOmics: circadian omic web portal.

    PubMed

    Ceglia, Nicholas; Liu, Yu; Chen, Siwei; Agostinelli, Forest; Eckel-Mahan, Kristin; Sassone-Corsi, Paolo; Baldi, Pierre

    2018-06-15

    Circadian rhythms play a fundamental role at all levels of biological organization. Understanding the mechanisms and implications of circadian oscillations continues to be the focus of intense research. However, there has been no comprehensive and integrated way for accessing and mining all circadian omic datasets. The latest release of CircadiOmics (http://circadiomics.ics.uci.edu) fills this gap for providing the most comprehensive web server for studying circadian data. The newly updated version contains high-throughput 227 omic datasets corresponding to over 74 million measurements sampled over 24 h cycles. Users can visualize and compare oscillatory trajectories across species, tissues and conditions. Periodicity statistics (e.g. period, amplitude, phase, P-value, q-value etc.) obtained from BIO_CYCLE and other methods are provided for all samples in the repository and can easily be downloaded in the form of publication-ready figures and tables. New features and substantial improvements in performance and data volume make CircadiOmics a powerful web portal for integrated analysis of circadian omic data.

  17. Analysis of metabolomic data: tools, current strategies and future challenges for omics data integration.

    PubMed

    Cambiaghi, Alice; Ferrario, Manuela; Masseroli, Marco

    2017-05-01

    Metabolomics is a rapidly growing field consisting of the analysis of a large number of metabolites at a system scale. The two major goals of metabolomics are the identification of the metabolites characterizing each organism state and the measurement of their dynamics under different situations (e.g. pathological conditions, environmental factors). Knowledge about metabolites is crucial for the understanding of most cellular phenomena, but this information alone is not sufficient to gain a comprehensive view of all the biological processes involved. Integrated approaches combining metabolomics with transcriptomics and proteomics are thus required to obtain much deeper insights than any of these techniques alone. Although this information is available, multilevel integration of different 'omics' data is still a challenge. The handling, processing, analysis and integration of these data require specialized mathematical, statistical and bioinformatics tools, and several technical problems hampering a rapid progress in the field exist. Here, we review four main tools for number of users or provided features (MetaCoreTM, MetaboAnalyst, InCroMAP and 3Omics) out of the several available for metabolomic data analysis and integration with other 'omics' data, highlighting their strong and weak aspects; a number of related issues affecting data analysis and integration are also identified and discussed. Overall, we provide an objective description of how some of the main currently available software packages work, which may help the experimental practitioner in the choice of a robust pipeline for metabolomic data analysis and integration. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. LinkedOmics: analyzing multi-omics data within and across 32 cancer types.

    PubMed

    Vasaikar, Suhas V; Straub, Peter; Wang, Jing; Zhang, Bing

    2018-01-04

    The LinkedOmics database contains multi-omics data and clinical data for 32 cancer types and a total of 11 158 patients from The Cancer Genome Atlas (TCGA) project. It is also the first multi-omics database that integrates mass spectrometry (MS)-based global proteomics data generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) on selected TCGA tumor samples. In total, LinkedOmics has more than a billion data points. To allow comprehensive analysis of these data, we developed three analysis modules in the LinkedOmics web application. The LinkFinder module allows flexible exploration of associations between a molecular or clinical attribute of interest and all other attributes, providing the opportunity to analyze and visualize associations between billions of attribute pairs for each cancer cohort. The LinkCompare module enables easy comparison of the associations identified by LinkFinder, which is particularly useful in multi-omics and pan-cancer analyses. The LinkInterpreter module transforms identified associations into biological understanding through pathway and network analysis. Using five case studies, we demonstrate that LinkedOmics provides a unique platform for biologists and clinicians to access, analyze and compare cancer multi-omics data within and across tumor types. LinkedOmics is freely available at http://www.linkedomics.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Integrated omics analysis of specialized metabolism in medicinal plants.

    PubMed

    Rai, Amit; Saito, Kazuki; Yamazaki, Mami

    2017-05-01

    Medicinal plants are a rich source of highly diverse specialized metabolites with important pharmacological properties. Until recently, plant biologists were limited in their ability to explore the biosynthetic pathways of these metabolites, mainly due to the scarcity of plant genomics resources. However, recent advances in high-throughput large-scale analytical methods have enabled plant biologists to discover biosynthetic pathways for important plant-based medicinal metabolites. The reduced cost of generating omics datasets and the development of computational tools for their analysis and integration have led to the elucidation of biosynthetic pathways of several bioactive metabolites of plant origin. These discoveries have inspired synthetic biology approaches to develop microbial systems to produce bioactive metabolites originating from plants, an alternative sustainable source of medicinally important chemicals. Since the demand for medicinal compounds are increasing with the world's population, understanding the complete biosynthesis of specialized metabolites becomes important to identify or develop reliable sources in the future. Here, we review the contributions of major omics approaches and their integration to our understanding of the biosynthetic pathways of bioactive metabolites. We briefly discuss different approaches for integrating omics datasets to extract biologically relevant knowledge and the application of omics datasets in the construction and reconstruction of metabolic models. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  20. Unsupervised multiple kernel learning for heterogeneous data integration.

    PubMed

    Mariette, Jérôme; Villa-Vialaneix, Nathalie

    2018-03-15

    Recent high-throughput sequencing advances have expanded the breadth of available omics datasets and the integrated analysis of multiple datasets obtained on the same samples has allowed to gain important insights in a wide range of applications. However, the integration of various sources of information remains a challenge for systems biology since produced datasets are often of heterogeneous types, with the need of developing generic methods to take their different specificities into account. We propose a multiple kernel framework that allows to integrate multiple datasets of various types into a single exploratory analysis. Several solutions are provided to learn either a consensus meta-kernel or a meta-kernel that preserves the original topology of the datasets. We applied our framework to analyse two public multi-omics datasets. First, the multiple metagenomic datasets, collected during the TARA Oceans expedition, was explored to demonstrate that our method is able to retrieve previous findings in a single kernel PCA as well as to provide a new image of the sample structures when a larger number of datasets are included in the analysis. To perform this analysis, a generic procedure is also proposed to improve the interpretability of the kernel PCA in regards with the original data. Second, the multi-omics breast cancer datasets, provided by The Cancer Genome Atlas, is analysed using a kernel Self-Organizing Maps with both single and multi-omics strategies. The comparison of these two approaches demonstrates the benefit of our integration method to improve the representation of the studied biological system. Proposed methods are available in the R package mixKernel, released on CRAN. It is fully compatible with the mixOmics package and a tutorial describing the approach can be found on mixOmics web site http://mixomics.org/mixkernel/. jerome.mariette@inra.fr or nathalie.villa-vialaneix@inra.fr. Supplementary data are available at Bioinformatics online.

  1. Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification.

    PubMed

    Wu, Dingming; Wang, Dongfang; Zhang, Michael Q; Gu, Jin

    2015-12-01

    One major goal of large-scale cancer omics study is to identify molecular subtypes for more accurate cancer diagnoses and treatments. To deal with high-dimensional cancer multi-omics data, a promising strategy is to find an effective low-dimensional subspace of the original data and then cluster cancer samples in the reduced subspace. However, due to data-type diversity and big data volume, few methods can integrative and efficiently find the principal low-dimensional manifold of the high-dimensional cancer multi-omics data. In this study, we proposed a novel low-rank approximation based integrative probabilistic model to fast find the shared principal subspace across multiple data types: the convexity of the low-rank regularized likelihood function of the probabilistic model ensures efficient and stable model fitting. Candidate molecular subtypes can be identified by unsupervised clustering hundreds of cancer samples in the reduced low-dimensional subspace. On testing datasets, our method LRAcluster (low-rank approximation based multi-omics data clustering) runs much faster with better clustering performances than the existing method. Then, we applied LRAcluster on large-scale cancer multi-omics data from TCGA. The pan-cancer analysis results show that the cancers of different tissue origins are generally grouped as independent clusters, except squamous-like carcinomas. While the single cancer type analysis suggests that the omics data have different subtyping abilities for different cancer types. LRAcluster is a very useful method for fast dimension reduction and unsupervised clustering of large-scale multi-omics data. LRAcluster is implemented in R and freely available via http://bioinfo.au.tsinghua.edu.cn/software/lracluster/ .

  2. MPLEx: a Robust and Universal Protocol for Single-Sample Integrative Proteomic, Metabolomic, and Lipidomic Analyses

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nakayasu, Ernesto S.; Nicora, Carrie D.; Sims, Amy C.

    2016-05-03

    ABSTRACT Integrative multi-omics analyses can empower more effective investigation and complete understanding of complex biological systems. Despite recent advances in a range of omics analyses, multi-omic measurements of the same sample are still challenging and current methods have not been well evaluated in terms of reproducibility and broad applicability. Here we adapted a solvent-based method, widely applied for extracting lipids and metabolites, to add proteomics to mass spectrometry-based multi-omics measurements. Themetabolite,protein, andlipidextraction (MPLEx) protocol proved to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues. To illustrate the utility of thismore » protocol, an integrative multi-omics analysis was performed using a lung epithelial cell line infected with Middle East respiratory syndrome coronavirus, which showed the impact of this virus on the host glycolytic pathway and also suggested a role for lipids during infection. The MPLEx method is a simple, fast, and robust protocol that can be applied for integrative multi-omic measurements from diverse sample types (e.g., environmental,in vitro, and clinical). IMPORTANCEIn systems biology studies, the integration of multiple omics measurements (i.e., genomics, transcriptomics, proteomics, metabolomics, and lipidomics) has been shown to provide a more complete and informative view of biological pathways. Thus, the prospect of extracting different types of molecules (e.g., DNAs, RNAs, proteins, and metabolites) and performing multiple omics measurements on single samples is very attractive, but such studies are challenging due to the fact that the extraction conditions differ according to the molecule type. Here, we adapted an organic solvent-based extraction method that demonstrated broad applicability and robustness, which enabled comprehensive proteomics, metabolomics, and lipidomics analyses from the same sample.« less

  3. Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.

    PubMed

    Argelaguet, Ricard; Velten, Britta; Arnol, Damien; Dietrich, Sascha; Zenz, Thorsten; Marioni, John C; Buettner, Florian; Huber, Wolfgang; Stegle, Oliver

    2018-06-20

    Multi-omics studies promise the improved characterization of biological processes across molecular layers. However, methods for the unsupervised integration of the resulting heterogeneous data sets are lacking. We present Multi-Omics Factor Analysis (MOFA), a computational method for discovering the principal sources of variation in multi-omics data sets. MOFA infers a set of (hidden) factors that capture biological and technical sources of variability. It disentangles axes of heterogeneity that are shared across multiple modalities and those specific to individual data modalities. The learnt factors enable a variety of downstream analyses, including identification of sample subgroups, data imputation and the detection of outlier samples. We applied MOFA to a cohort of 200 patient samples of chronic lymphocytic leukaemia, profiled for somatic mutations, RNA expression, DNA methylation and ex vivo drug responses. MOFA identified major dimensions of disease heterogeneity, including immunoglobulin heavy-chain variable region status, trisomy of chromosome 12 and previously underappreciated drivers, such as response to oxidative stress. In a second application, we used MOFA to analyse single-cell multi-omics data, identifying coordinated transcriptional and epigenetic changes along cell differentiation. © 2018 The Authors. Published under the terms of the CC BY 4.0 license.

  4. CAS-viewer: web-based tool for splicing-guided integrative analysis of multi-omics cancer data.

    PubMed

    Han, Seonggyun; Kim, Dongwook; Kim, Youngjun; Choi, Kanghoon; Miller, Jason E; Kim, Dokyoon; Lee, Younghee

    2018-04-20

    The Cancer Genome Atlas (TCGA) project is a public resource that provides transcriptomic, DNA sequence, methylation, and clinical data for 33 cancer types. Transforming the large size and high complexity of TCGA cancer genome data into integrated knowledge can be useful to promote cancer research. Alternative splicing (AS) is a key regulatory mechanism of genes in human cancer development and in the interaction with epigenetic factors. Therefore, AS-guided integration of existing TCGA data sets will make it easier to gain insight into the genetic architecture of cancer risk and related outcomes. There are already existing tools analyzing and visualizing alternative mRNA splicing patterns for large-scale RNA-seq experiments. However, these existing web-based tools are limited to the analysis of individual TCGA data sets at a time, such as only transcriptomic information. We implemented CAS-viewer (integrative analysis of Cancer genome data based on Alternative Splicing), a web-based tool leveraging multi-cancer omics data from TCGA. It illustrates alternative mRNA splicing patterns along with methylation, miRNAs, and SNPs, and then provides an analysis tool to link differential transcript expression ratio to methylation, miRNA, and splicing regulatory elements for 33 cancer types. Moreover, one can analyze AS patterns with clinical data to identify potential transcripts associated with different survival outcome for each cancer. CAS-viewer is a web-based application for transcript isoform-driven integration of multi-omics data in multiple cancer types and will aid in the visualization and possible discovery of biomarkers for cancer by integrating multi-omics data from TCGA.

  5. Integrated network analysis and effective tools in plant systems biology

    PubMed Central

    Fukushima, Atsushi; Kanaya, Shigehiko; Nishida, Kozo

    2014-01-01

    One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms. PMID:25408696

  6. Toward more transparent and reproducible omics studies through a common metadata checklist and data publications.

    PubMed

    Kolker, Eugene; Özdemir, Vural; Martens, Lennart; Hancock, William; Anderson, Gordon; Anderson, Nathaniel; Aynacioglu, Sukru; Baranova, Ancha; Campagna, Shawn R; Chen, Rui; Choiniere, John; Dearth, Stephen P; Feng, Wu-Chun; Ferguson, Lynnette; Fox, Geoffrey; Frishman, Dmitrij; Grossman, Robert; Heath, Allison; Higdon, Roger; Hutz, Mara H; Janko, Imre; Jiang, Lihua; Joshi, Sanjay; Kel, Alexander; Kemnitz, Joseph W; Kohane, Isaac S; Kolker, Natali; Lancet, Doron; Lee, Elaine; Li, Weizhong; Lisitsa, Andrey; Llerena, Adrian; Macnealy-Koch, Courtney; Marshall, Jean-Claude; Masuzzo, Paola; May, Amanda; Mias, George; Monroe, Matthew; Montague, Elizabeth; Mooney, Sean; Nesvizhskii, Alexey; Noronha, Santosh; Omenn, Gilbert; Rajasimha, Harsha; Ramamoorthy, Preveen; Sheehan, Jerry; Smarr, Larry; Smith, Charles V; Smith, Todd; Snyder, Michael; Rapole, Srikanth; Srivastava, Sanjeeva; Stanberry, Larissa; Stewart, Elizabeth; Toppo, Stefano; Uetz, Peter; Verheggen, Kenneth; Voy, Brynn H; Warnich, Louise; Wilhelm, Steven W; Yandl, Gregory

    2014-01-01

    Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.

  7. Toward More Transparent and Reproducible Omics Studies Through a Common Metadata Checklist and Data Publications.

    PubMed

    Kolker, Eugene; Özdemir, Vural; Martens, Lennart; Hancock, William; Anderson, Gordon; Anderson, Nathaniel; Aynacioglu, Sukru; Baranova, Ancha; Campagna, Shawn R; Chen, Rui; Choiniere, John; Dearth, Stephen P; Feng, Wu-Chun; Ferguson, Lynnette; Fox, Geoffrey; Frishman, Dmitrij; Grossman, Robert; Heath, Allison; Higdon, Roger; Hutz, Mara H; Janko, Imre; Jiang, Lihua; Joshi, Sanjay; Kel, Alexander; Kemnitz, Joseph W; Kohane, Isaac S; Kolker, Natali; Lancet, Doron; Lee, Elaine; Li, Weizhong; Lisitsa, Andrey; Llerena, Adrian; MacNealy-Koch, Courtney; Marshall, Jean-Claude; Masuzzo, Paola; May, Amanda; Mias, George; Monroe, Matthew; Montague, Elizabeth; Mooney, Sean; Nesvizhskii, Alexey; Noronha, Santosh; Omenn, Gilbert; Rajasimha, Harsha; Ramamoorthy, Preveen; Sheehan, Jerry; Smarr, Larry; Smith, Charles V; Smith, Todd; Snyder, Michael; Rapole, Srikanth; Srivastava, Sanjeeva; Stanberry, Larissa; Stewart, Elizabeth; Toppo, Stefano; Uetz, Peter; Verheggen, Kenneth; Voy, Brynn H; Warnich, Louise; Wilhelm, Steven W; Yandl, Gregory

    2013-12-01

    Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.

  8. [Progress in omics research of Aspergillus niger].

    PubMed

    Sui, Yufei; Ouyang, Liming; Lu, Hongzhong; Zhuang, Yingping; Zhang, Siliang

    2016-08-25

    Aspergillus niger, as an important industrial fermentation strain, is widely applied in the production of organic acids and industrial enzymes. With the development of diverse omics technologies, the data of genome, transcriptome, proteome and metabolome of A. niger are increasing continuously, which declared the coming era of big data for the research in fermentation process of A. niger. The data analysis from single omics and the comparison of multi-omics, to the integrations of multi-omics based on the genome-scale metabolic network model largely extends the intensive and systematic understanding of the efficient production mechanism of A. niger. It also provides possibilities for the reasonable global optimization of strain performance by genetic modification and process regulation. We reviewed and summarized progress in omics research of A. niger, and proposed the development direction of omics research on this cell factory.

  9. The Promise of Multi-Omics and Clinical Data Integration to Identify and Target Personalized Healthcare Approaches in Autism Spectrum Disorders

    PubMed Central

    Higdon, Roger; Earl, Rachel K.; Stanberry, Larissa; Hudac, Caitlin M.; Montague, Elizabeth; Stewart, Elizabeth; Janko, Imre; Choiniere, John; Broomall, William; Kolker, Natali

    2015-01-01

    Abstract Complex diseases are caused by a combination of genetic and environmental factors, creating a difficult challenge for diagnosis and defining subtypes. This review article describes how distinct disease subtypes can be identified through integration and analysis of clinical and multi-omics data. A broad shift toward molecular subtyping of disease using genetic and omics data has yielded successful results in cancer and other complex diseases. To determine molecular subtypes, patients are first classified by applying clustering methods to different types of omics data, then these results are integrated with clinical data to characterize distinct disease subtypes. An example of this molecular-data-first approach is in research on Autism Spectrum Disorder (ASD), a spectrum of social communication disorders marked by tremendous etiological and phenotypic heterogeneity. In the case of ASD, omics data such as exome sequences and gene and protein expression data are combined with clinical data such as psychometric testing and imaging to enable subtype identification. Novel ASD subtypes have been proposed, such as CHD8, using this molecular subtyping approach. Broader use of molecular subtyping in complex disease research is impeded by data heterogeneity, diversity of standards, and ineffective analysis tools. The future of molecular subtyping for ASD and other complex diseases calls for an integrated resource to identify disease mechanisms, classify new patients, and inform effective treatment options. This in turn will empower and accelerate precision medicine and personalized healthcare. PMID:25831060

  10. MPLEx: a Robust and Universal Protocol for Single-Sample Integrative Proteomic, Metabolomic, and Lipidomic Analyses

    PubMed Central

    Nakayasu, Ernesto S.; Nicora, Carrie D.; Sims, Amy C.; Burnum-Johnson, Kristin E.; Kim, Young-Mo; Kyle, Jennifer E.; Matzke, Melissa M.; Shukla, Anil K.; Chu, Rosalie K.; Schepmoes, Athena A.; Jacobs, Jon M.; Baric, Ralph S.; Webb-Robertson, Bobbie-Jo; Smith, Richard D.

    2016-01-01

    ABSTRACT Integrative multi-omics analyses can empower more effective investigation and complete understanding of complex biological systems. Despite recent advances in a range of omics analyses, multi-omic measurements of the same sample are still challenging and current methods have not been well evaluated in terms of reproducibility and broad applicability. Here we adapted a solvent-based method, widely applied for extracting lipids and metabolites, to add proteomics to mass spectrometry-based multi-omics measurements. The metabolite, protein, and lipid extraction (MPLEx) protocol proved to be robust and applicable to a diverse set of sample types, including cell cultures, microbial communities, and tissues. To illustrate the utility of this protocol, an integrative multi-omics analysis was performed using a lung epithelial cell line infected with Middle East respiratory syndrome coronavirus, which showed the impact of this virus on the host glycolytic pathway and also suggested a role for lipids during infection. The MPLEx method is a simple, fast, and robust protocol that can be applied for integrative multi-omic measurements from diverse sample types (e.g., environmental, in vitro, and clinical). IMPORTANCE In systems biology studies, the integration of multiple omics measurements (i.e., genomics, transcriptomics, proteomics, metabolomics, and lipidomics) has been shown to provide a more complete and informative view of biological pathways. Thus, the prospect of extracting different types of molecules (e.g., DNAs, RNAs, proteins, and metabolites) and performing multiple omics measurements on single samples is very attractive, but such studies are challenging due to the fact that the extraction conditions differ according to the molecule type. Here, we adapted an organic solvent-based extraction method that demonstrated broad applicability and robustness, which enabled comprehensive proteomics, metabolomics, and lipidomics analyses from the same sample. Author Video: An author video summary of this article is available. PMID:27822525

  11. An overview on forensic analysis devoted to analytical chemists.

    PubMed

    Castillo-Peinado, L S; Luque de Castro, M D

    2017-05-15

    The present article has as main aim to show analytical chemists interested in forensic analysis the world they will face if decision in favor of being a forensic analytical chemist is adopted. With this purpose, the most outstanding aspects of forensic analysis in dealing with sampling (involving both bodily and no bodily samples), sample preparation, and analytical equipment used in detection, identification and quantitation of key sample components are critically discussed. The role of the great omics in forensic analysis, and the growing role of the youngest of the great omics -metabolomics- are also discussed. The foreseeable role of integrative omics is also outlined. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Towards more transparent and reproducible omics studies through a common metadata checklist and data publications

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kolker, Eugene; Ozdemir, Vural; Martens , Lennart

    Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies omics studies are becoming increasingly prevalent yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research,. These three essential steps require consistent generation, capture, and distribution of the metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologiesmore » and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. This omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.« less

  13. Toward More Transparent and Reproducible Omics Studies Through a Common Metadata Checklist and Data Publications

    PubMed Central

    Özdemir, Vural; Martens, Lennart; Hancock, William; Anderson, Gordon; Anderson, Nathaniel; Aynacioglu, Sukru; Baranova, Ancha; Campagna, Shawn R.; Chen, Rui; Choiniere, John; Dearth, Stephen P.; Feng, Wu-Chun; Ferguson, Lynnette; Fox, Geoffrey; Frishman, Dmitrij; Grossman, Robert; Heath, Allison; Higdon, Roger; Hutz, Mara H.; Janko, Imre; Jiang, Lihua; Joshi, Sanjay; Kel, Alexander; Kemnitz, Joseph W.; Kohane, Isaac S.; Kolker, Natali; Lancet, Doron; Lee, Elaine; Li, Weizhong; Lisitsa, Andrey; Llerena, Adrian; MacNealy-Koch, Courtney; Marshall, Jean-Claude; Masuzzo, Paola; May, Amanda; Mias, George; Monroe, Matthew; Montague, Elizabeth; Mooney, Sean; Nesvizhskii, Alexey; Noronha, Santosh; Omenn, Gilbert; Rajasimha, Harsha; Ramamoorthy, Preveen; Sheehan, Jerry; Smarr, Larry; Smith, Charles V.; Smith, Todd; Snyder, Michael; Rapole, Srikanth; Srivastava, Sanjeeva; Stanberry, Larissa; Stewart, Elizabeth; Toppo, Stefano; Uetz, Peter; Verheggen, Kenneth; Voy, Brynn H.; Warnich, Louise; Wilhelm, Steven W.; Yandl, Gregory

    2014-01-01

    Abstract Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement. PMID:24456465

  14. MetaComp: comprehensive analysis software for comparative meta-omics including comparative metagenomics.

    PubMed

    Zhai, Peng; Yang, Longshu; Guo, Xiao; Wang, Zhe; Guo, Jiangtao; Wang, Xiaoqi; Zhu, Huaiqiu

    2017-10-02

    During the past decade, the development of high throughput nucleic sequencing and mass spectrometry analysis techniques have enabled the characterization of microbial communities through metagenomics, metatranscriptomics, metaproteomics and metabolomics data. To reveal the diversity of microbial communities and interactions between living conditions and microbes, it is necessary to introduce comparative analysis based upon integration of all four types of data mentioned above. Comparative meta-omics, especially comparative metageomics, has been established as a routine process to highlight the significant differences in taxon composition and functional gene abundance among microbiota samples. Meanwhile, biologists are increasingly concerning about the correlations between meta-omics features and environmental factors, which may further decipher the adaptation strategy of a microbial community. We developed a graphical comprehensive analysis software named MetaComp comprising a series of statistical analysis approaches with visualized results for metagenomics and other meta-omics data comparison. This software is capable to read files generated by a variety of upstream programs. After data loading, analyses such as multivariate statistics, hypothesis testing of two-sample, multi-sample as well as two-group sample and a novel function-regression analysis of environmental factors are offered. Here, regression analysis regards meta-omic features as independent variable and environmental factors as dependent variables. Moreover, MetaComp is capable to automatically choose an appropriate two-group sample test based upon the traits of input abundance profiles. We further evaluate the performance of its choice, and exhibit applications for metagenomics, metaproteomics and metabolomics samples. MetaComp, an integrative software capable for applying to all meta-omics data, originally distills the influence of living environment on microbial community by regression analysis. Moreover, since the automatically chosen two-group sample test is verified to be outperformed, MetaComp is friendly to users without adequate statistical training. These improvements are aiming to overcome the new challenges under big data era for all meta-omics data. MetaComp is available at: http://cqb.pku.edu.cn/ZhuLab/MetaComp/ and https://github.com/pzhaipku/MetaComp/ .

  15. Establishing multiple omics baselines for three Southeast Asian populations in the Singapore Integrative Omics Study.

    PubMed

    Saw, Woei-Yuh; Tantoso, Erwin; Begum, Husna; Zhou, Lihan; Zou, Ruiyang; He, Cheng; Chan, Sze Ling; Tan, Linda Wei-Lin; Wong, Lai-Ping; Xu, Wenting; Moong, Don Kyin Nwe; Lim, Yenly; Li, Bowen; Pillai, Nisha Esakimuthu; Peterson, Trevor A; Bielawny, Tomasz; Meikle, Peter J; Mundra, Piyushkumar A; Lim, Wei-Yen; Luo, Ma; Chia, Kee-Seng; Ong, Rick Twee-Hee; Brunham, Liam R; Khor, Chiea-Chuen; Too, Heng Phon; Soong, Richie; Wenk, Markus R; Little, Peter; Teo, Yik-Ying

    2017-09-21

    The Singapore Integrative Omics Study provides valuable insights on establishing population reference measurement in 364 Chinese, Malay, and Indian individuals. These measurements include > 2.5 millions genetic variants, 21,649 transcripts expression, 282 lipid species quantification, and 284 clinical, lifestyle, and dietary variables. This concept paper introduces the depth of the data resource, and investigates the extent of ethnic variation at these omics and non-omics biomarkers. It is evident that there are specific biomarkers in each of these platforms to differentiate between the ethnicities, and intra-population analyses suggest that Chinese and Indians are the most biologically homogeneous and heterogeneous, respectively, of the three groups. Consistent patterns of correlations between lipid species also suggest the possibility of lipid tagging to simplify future lipidomics assays. The Singapore Integrative Omics Study is expected to allow the characterization of intra-omic and inter-omic correlations within and across all three ethnic groups through a systems biology approach.The Singapore Genome Variation projects characterized the genetics of Singapore's Chinese, Malay, and Indian populations. The Singapore Integrative Omics Study introduced here goes further in providing multi-omic measurements in individuals from these populations, including genetic, transcriptome, lipidome, and lifestyle data, and will facilitate the study of common diseases in Asian communities.

  16. Integrative Exploratory Analysis of Two or More Genomic Datasets.

    PubMed

    Meng, Chen; Culhane, Aedin

    2016-01-01

    Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on a same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing a more easily identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed to this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.

  17. shinyGISPA: A web application for characterizing phenotype by gene sets using multiple omics data combinations.

    PubMed

    Dwivedi, Bhakti; Kowalski, Jeanne

    2018-01-01

    While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/.

  18. shinyGISPA: A web application for characterizing phenotype by gene sets using multiple omics data combinations

    PubMed Central

    Dwivedi, Bhakti

    2018-01-01

    While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/. PMID:29415010

  19. Dimension reduction techniques for the integrative analysis of multi-omics data

    PubMed Central

    Zeleznik, Oana A.; Thallinger, Gerhard G.; Kuster, Bernhard; Gholami, Amin M.

    2016-01-01

    State-of-the-art next-generation sequencing, transcriptomics, proteomics and other high-throughput ‘omics' technologies enable the efficient generation of large experimental data sets. These data may yield unprecedented knowledge about molecular pathways in cells and their role in disease. Dimension reduction approaches have been widely used in exploratory analysis of single omics data sets. This review will focus on dimension reduction approaches for simultaneous exploratory analyses of multiple data sets. These methods extract the linear relationships that best explain the correlated structure across data sets, the variability both within and between variables (or observations) and may highlight data issues such as batch effects or outliers. We explore dimension reduction techniques as one of the emerging approaches for data integration, and how these can be applied to increase our understanding of biological systems in normal physiological function and disease. PMID:26969681

  20. Techniques for integrating ‐omics data

    PubMed Central

    Akula, Siva Prasad; Miriyala, Raghava Naidu; Thota, Hanuman; Rao, Allam Appa; Gedela, Srinubabu

    2009-01-01

    The challenge for -omics research is to tackle the problem of fragmentation of knowledge by integrating several sources of heterogeneous information into a coherent entity. It is widely recognized that successful data integration is one of the keys to improve productivity for stored data. Through proper data integration tools and algorithms, researchers may correlate relationships that enable them to make better and faster decisions. The need for data integration is essential for present ‐omics community, because ‐omics data is currently spread world wide in wide variety of formats. These formats can be integrated and migrated across platforms through different techniques and one of the important techniques often used is XML. XML is used to provide a document markup language that is easier to learn, retrieve, store and transmit. It is semantically richer than HTML. Here, we describe bio warehousing, database federation, controlled vocabularies and highlighting the XML application to store, migrate and validate -omics data. PMID:19255651

  1. Techniques for integrating -omics data.

    PubMed

    Akula, Siva Prasad; Miriyala, Raghava Naidu; Thota, Hanuman; Rao, Allam Appa; Gedela, Srinubabu

    2009-01-01

    The challenge for -omics research is to tackle the problem of fragmentation of knowledge by integrating several sources of heterogeneous information into a coherent entity. It is widely recognized that successful data integration is one of the keys to improve productivity for stored data. Through proper data integration tools and algorithms, researchers may correlate relationships that enable them to make better and faster decisions. The need for data integration is essential for present -omics community, because -omics data is currently spread world wide in wide variety of formats. These formats can be integrated and migrated across platforms through different techniques and one of the important techniques often used is XML. XML is used to provide a document markup language that is easier to learn, retrieve, store and transmit. It is semantically richer than HTML. Here, we describe bio warehousing, database federation, controlled vocabularies and highlighting the XML application to store, migrate and validate -omics data.

  2. Using "Omics" and Integrated Multi-Omics Approaches to Guide Probiotic Selection to Mitigate Chytridiomycosis and Other Emerging Infectious Diseases.

    PubMed

    Rebollar, Eria A; Antwis, Rachael E; Becker, Matthew H; Belden, Lisa K; Bletz, Molly C; Brucker, Robert M; Harrison, Xavier A; Hughey, Myra C; Kueneman, Jordan G; Loudon, Andrew H; McKenzie, Valerie; Medina, Daniel; Minbiole, Kevin P C; Rollins-Smith, Louise A; Walke, Jenifer B; Weiss, Sophie; Woodhams, Douglas C; Harris, Reid N

    2016-01-01

    Emerging infectious diseases in wildlife are responsible for massive population declines. In amphibians, chytridiomycosis caused by Batrachochytrium dendrobatidis, Bd, has severely affected many amphibian populations and species around the world. One promising management strategy is probiotic bioaugmentation of antifungal bacteria on amphibian skin. In vivo experimental trials using bioaugmentation strategies have had mixed results, and therefore a more informed strategy is needed to select successful probiotic candidates. Metagenomic, transcriptomic, and metabolomic methods, colloquially called "omics," are approaches that can better inform probiotic selection and optimize selection protocols. The integration of multiple omic data using bioinformatic and statistical tools and in silico models that link bacterial community structure with bacterial defensive function can allow the identification of species involved in pathogen inhibition. We recommend using 16S rRNA gene amplicon sequencing and methods such as indicator species analysis, the Kolmogorov-Smirnov Measure, and co-occurrence networks to identify bacteria that are associated with pathogen resistance in field surveys and experimental trials. In addition to 16S amplicon sequencing, we recommend approaches that give insight into symbiont function such as shotgun metagenomics, metatranscriptomics, or metabolomics to maximize the probability of finding effective probiotic candidates, which can then be isolated in culture and tested in persistence and clinical trials. An effective mitigation strategy to ameliorate chytridiomycosis and other emerging infectious diseases is necessary; the advancement of omic methods and the integration of multiple omic data provide a promising avenue toward conservation of imperiled species.

  3. Integrating multi-omic features exploiting Chromosome Conformation Capture data.

    PubMed

    Merelli, Ivan; Tordini, Fabio; Drocco, Maurizio; Aldinucci, Marco; Liò, Pietro; Milanesi, Luciano

    2015-01-01

    The representation, integration, and interpretation of omic data is a complex task, in particular considering the huge amount of information that is daily produced in molecular biology laboratories all around the world. The reason is that sequencing data regarding expression profiles, methylation patterns, and chromatin domains is difficult to harmonize in a systems biology view, since genome browsers only allow coordinate-based representations, discarding functional clusters created by the spatial conformation of the DNA in the nucleus. In this context, recent progresses in high throughput molecular biology techniques and bioinformatics have provided insights into chromatin interactions on a larger scale and offer a formidable support for the interpretation of multi-omic data. In particular, a novel sequencing technique called Chromosome Conformation Capture allows the analysis of the chromosome organization in the cell's natural state. While performed genome wide, this technique is usually called Hi-C. Inspired by service applications such as Google Maps, we developed NuChart, an R package that integrates Hi-C data to describe the chromosomal neighborhood starting from the information about gene positions, with the possibility of mapping on the achieved graphs genomic features such as methylation patterns and histone modifications, along with expression profiles. In this paper we show the importance of the NuChart application for the integration of multi-omic data in a systems biology fashion, with particular interest in cytogenetic applications of these techniques. Moreover, we demonstrate how the integration of multi-omic data can provide useful information in understanding why genes are in certain specific positions inside the nucleus and how epigenetic patterns correlate with their expression.

  4. Omics Integration in Biology and Medicine Workshop | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The focus of this meeting will be on the emerging field of integrating disparate omic data from genomics, proteomics, glycomics, etc. in order to better understand key biological processes and also improve clinical practice. Discussants will focus on identifying the technical and biological barriers in omic integration, with solutions to build a consensus towards data integration in bioscience and to better define phenotypes.

  5. BioVLAB-mCpG-SNP-EXPRESS: A system for multi-level and multi-perspective analysis and exploration of DNA methylation, sequence variation (SNPs), and gene expression from multi-omics data.

    PubMed

    Chae, Heejoon; Lee, Sangseon; Seo, Seokjun; Jung, Daekyoung; Chang, Hyeonsook; Nephew, Kenneth P; Kim, Sun

    2016-12-01

    Measuring gene expression, DNA sequence variation, and DNA methylation status is routinely done using high throughput sequencing technologies. To analyze such multi-omics data and explore relationships, reliable bioinformatics systems are much needed. Existing systems are either for exploring curated data or for processing omics data in the form of a library such as R. Thus scientists have much difficulty in investigating relationships among gene expression, DNA sequence variation, and DNA methylation using multi-omics data. In this study, we report a system called BioVLAB-mCpG-SNP-EXPRESS for the integrated analysis of DNA methylation, sequence variation (SNPs), and gene expression for distinguishing cellular phenotypes at the pairwise and multiple phenotype levels. The system can be deployed on either the Amazon cloud or a publicly available high-performance computing node, and the data analysis and exploration of the analysis result can be conveniently done using a web-based interface. In order to alleviate analysis complexity, all the process are fully automated, and graphical workflow system is integrated to represent real-time analysis progression. The BioVLAB-mCpG-SNP-EXPRESS system works in three stages. First, it processes and analyzes multi-omics data as input in the form of the raw data, i.e., FastQ files. Second, various integrated analyses such as methylation vs. gene expression and mutation vs. methylation are performed. Finally, the analysis result can be explored in a number of ways through a web interface for the multi-level, multi-perspective exploration. Multi-level interpretation can be done by either gene, gene set, pathway or network level and multi-perspective exploration can be explored from either gene expression, DNA methylation, sequence variation, or their relationship perspective. The utility of the system is demonstrated by performing analysis of phenotypically distinct 30 breast cancer cell line data set. BioVLAB-mCpG-SNP-EXPRESS is available at http://biohealth.snu.ac.kr/software/biovlab_mcpg_snp_express/. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Multi-omics facilitated variable selection in Cox-regression model for cancer prognosis prediction.

    PubMed

    Liu, Cong; Wang, Xujun; Genchev, Georgi Z; Lu, Hui

    2017-07-15

    New developments in high-throughput genomic technologies have enabled the measurement of diverse types of omics biomarkers in a cost-efficient and clinically-feasible manner. Developing computational methods and tools for analysis and translation of such genomic data into clinically-relevant information is an ongoing and active area of investigation. For example, several studies have utilized an unsupervised learning framework to cluster patients by integrating omics data. Despite such recent advances, predicting cancer prognosis using integrated omics biomarkers remains a challenge. There is also a shortage of computational tools for predicting cancer prognosis by using supervised learning methods. The current standard approach is to fit a Cox regression model by concatenating the different types of omics data in a linear manner, while penalty could be added for feature selection. A more powerful approach, however, would be to incorporate data by considering relationships among omics datatypes. Here we developed two methods: a SKI-Cox method and a wLASSO-Cox method to incorporate the association among different types of omics data. Both methods fit the Cox proportional hazards model and predict a risk score based on mRNA expression profiles. SKI-Cox borrows the information generated by these additional types of omics data to guide variable selection, while wLASSO-Cox incorporates this information as a penalty factor during model fitting. We show that SKI-Cox and wLASSO-Cox models select more true variables than a LASSO-Cox model in simulation studies. We assess the performance of SKI-Cox and wLASSO-Cox using TCGA glioblastoma multiforme and lung adenocarcinoma data. In each case, mRNA expression, methylation, and copy number variation data are integrated to predict the overall survival time of cancer patients. Our methods achieve better performance in predicting patients' survival in glioblastoma and lung adenocarcinoma. Copyright © 2017. Published by Elsevier Inc.

  7. Pathview Web: user friendly pathway visualization and data integration

    PubMed Central

    Pant, Gaurav; Bhavnasi, Yeshvant K.; Blanchard, Steven G.; Brouwer, Cory

    2017-01-01

    Abstract Pathway analysis is widely used in omics studies. Pathway-based data integration and visualization is a critical component of the analysis. To address this need, we recently developed a novel R package called Pathview. Pathview maps, integrates and renders a large variety of biological data onto molecular pathway graphs. Here we developed the Pathview Web server, as to make pathway visualization and data integration accessible to all scientists, including those without the special computing skills or resources. Pathview Web features an intuitive graphical web interface and a user centered design. The server not only expands the core functions of Pathview, but also provides many useful features not available in the offline R package. Importantly, the server presents a comprehensive workflow for both regular and integrated pathway analysis of multiple omics data. In addition, the server also provides a RESTful API for programmatic access and conveniently integration in third-party software or workflows. Pathview Web is openly and freely accessible at https://pathview.uncc.edu/. PMID:28482075

  8. A correlated meta-analysis strategy for data mining "OMIC" scans.

    PubMed

    Province, Michael A; Borecki, Ingrid B

    2013-01-01

    Meta-analysis is becoming an increasingly popular and powerful tool to integrate findings across studies and OMIC dimensions. But there is the danger that hidden dependencies between putatively "independent" studies can cause inflation of type I error, due to reinforcement of the evidence from false-positive findings. We present here a simple method for conducting meta-analyses that automatically estimates the degree of any such non-independence between OMIC scans and corrects the inference for it, retaining the proper type I error structure. The method does not require the original data from the source studies, but operates only on summary analysis results from these in OMIC scans. The method is applicable in a wide variety of situations including combining GWAS and or sequencing scan results across studies with dependencies due to overlapping subjects, as well as to scans of correlated traits, in a meta-analysis scan for pleiotropic genetic effects. The method correctly detects which scans are actually independent in which case it yields the traditional meta-analysis, so it may safely be used in all cases, when there is even a suspicion of correlation amongst scans.

  9. OmicsNet: a web-based tool for creation and visual analysis of biological networks in 3D space.

    PubMed

    Zhou, Guangyan; Xia, Jianguo

    2018-06-07

    Biological networks play increasingly important roles in omics data integration and systems biology. Over the past decade, many excellent tools have been developed to support creation, analysis and visualization of biological networks. However, important limitations remain: most tools are standalone programs, the majority of them focus on protein-protein interaction (PPI) or metabolic networks, and visualizations often suffer from 'hairball' effects when networks become large. To help address these limitations, we developed OmicsNet - a novel web-based tool that allows users to easily create different types of molecular interaction networks and visually explore them in a three-dimensional (3D) space. Users can upload one or multiple lists of molecules of interest (genes/proteins, microRNAs, transcription factors or metabolites) to create and merge different types of biological networks. The 3D network visualization system was implemented using the powerful Web Graphics Library (WebGL) technology that works natively in most major browsers. OmicsNet supports force-directed layout, multi-layered perspective layout, as well as spherical layout to help visualize and navigate complex networks. A rich set of functions have been implemented to allow users to perform coloring, shading, topology analysis, and enrichment analysis. OmicsNet is freely available at http://www.omicsnet.ca.

  10. New strategy for drug discovery by large-scale association analysis of molecular networks of different species.

    PubMed

    Zhang, Bo; Fu, Yingxue; Huang, Chao; Zheng, Chunli; Wu, Ziyin; Zhang, Wenjuan; Yang, Xiaoyan; Gong, Fukai; Li, Yuerong; Chen, Xiaoyu; Gao, Shuo; Chen, Xuetong; Li, Yan; Lu, Aiping; Wang, Yonghua

    2016-02-25

    The development of modern omics technology has not significantly improved the efficiency of drug development. Rather precise and targeted drug discovery remains unsolved. Here a large-scale cross-species molecular network association (CSMNA) approach for targeted drug screening from natural sources is presented. The algorithm integrates molecular network omics data from humans and 267 plants and microbes, establishing the biological relationships between them and extracting evolutionarily convergent chemicals. This technique allows the researcher to assess targeted drugs for specific human diseases based on specific plant or microbe pathways. In a perspective validation, connections between the plant Halliwell-Asada (HA) cycle and the human Nrf2-ARE pathway were verified and the manner by which the HA cycle molecules act on the human Nrf2-ARE pathway as antioxidants was determined. This shows the potential applicability of this approach in drug discovery. The current method integrates disparate evolutionary species into chemico-biologically coherent circuits, suggesting a new cross-species omics analysis strategy for rational drug development.

  11. Fluxomics - connecting 'omics analysis and phenotypes.

    PubMed

    Winter, Gal; Krömer, Jens O

    2013-07-01

    In our modern 'omics era, metabolic flux analysis (fluxomics) represents the physiological counterpart of its siblings transcriptomics, proteomics and metabolomics. Fluxomics integrates in vivo measurements of metabolic fluxes with stoichiometric network models to allow the determination of absolute flux through large networks of the central carbon metabolism. There are many approaches to implement fluxomics including flux balance analysis (FBA), (13) C fluxomics and (13) C-constrained FBA as well as many experimental settings for flux measurement including dynamic, stationary and semi-stationary. Here we outline the principles of the different approaches and their relative advantages. We demonstrate the unique contribution of flux analysis for phenotype elucidation using a thoroughly studied metabolic reaction as a case study, the microbial aerobic/anaerobic shift, highlighting the importance of flux analysis as a single layer of data as well as interlaced in multi-omics studies. © 2012 John Wiley & Sons Ltd and Society for Applied Microbiology.

  12. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis

    DOE PAGES

    Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.; ...

    2018-04-20

    A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less

  13. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Jack P.; Matthews, Megan L.; Williams, Cranos M.

    A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux,more » metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.« less

  14. Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis.

    PubMed

    Wang, Jack P; Matthews, Megan L; Williams, Cranos M; Shi, Rui; Yang, Chenmin; Tunlaya-Anukit, Sermsawat; Chen, Hsi-Chuan; Li, Quanzi; Liu, Jie; Lin, Chien-Yuan; Naik, Punith; Sun, Ying-Hsuan; Loziuk, Philip L; Yeh, Ting-Feng; Kim, Hoon; Gjersing, Erica; Shollenberger, Todd; Shuford, Christopher M; Song, Jina; Miller, Zachary; Huang, Yung-Yun; Edmunds, Charles W; Liu, Baoguang; Sun, Yi; Lin, Ying-Chung Jimmy; Li, Wei; Chen, Hao; Peszlen, Ilona; Ducoste, Joel J; Ralph, John; Chang, Hou-Min; Muddiman, David C; Davis, Mark F; Smith, Chris; Isik, Fikret; Sederoff, Ronald; Chiang, Vincent L

    2018-04-20

    A multi-omics quantitative integrative analysis of lignin biosynthesis can advance the strategic engineering of wood for timber, pulp, and biofuels. Lignin is polymerized from three monomers (monolignols) produced by a grid-like pathway. The pathway in wood formation of Populus trichocarpa has at least 21 genes, encoding enzymes that mediate 37 reactions on 24 metabolites, leading to lignin and affecting wood properties. We perturb these 21 pathway genes and integrate transcriptomic, proteomic, fluxomic and phenomic data from 221 lines selected from ~2000 transgenics (6-month-old). The integrative analysis estimates how changing expression of pathway gene or gene combination affects protein abundance, metabolic-flux, metabolite concentrations, and 25 wood traits, including lignin, tree-growth, density, strength, and saccharification. The analysis then predicts improvements in any of these 25 traits individually or in combinations, through engineering expression of specific monolignol genes. The analysis may lead to greater understanding of other pathways for improved growth and adaptation.

  15. Identifying novel glioma associated pathways based on systems biology level meta-analysis.

    PubMed

    Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong

    2013-01-01

    With recent advances in microarray technology, including genomics, proteomics, and metabolomics, it brings a great challenge for integrating this "-omics" data to analysis complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecule mechanism underlying glioma remains very important. To date, most studies focus on detecting the differentially expressed genes in glioma. However, the meta-analysis for pathway analysis based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach by integrating three types of omics data to identify common pathways in glioma. Firstly, the meta-analysis has been performed to study the overlapping of signatures at different levels based on the microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways were found in GeneGO database that shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for the further pathway enrichment analysis. As a result, we suggest 5 of these pathways could be served as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that the meta-analysis based on systems biology level provide a more useful approach to study the molecule mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggest some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.

  16. MOPED 2.5—An Integrated Multi-Omics Resource: Multi-Omics Profiling Expression Database Now Includes Transcriptomics Data

    PubMed Central

    Montague, Elizabeth; Stanberry, Larissa; Higdon, Roger; Janko, Imre; Lee, Elaine; Anderson, Nathaniel; Choiniere, John; Stewart, Elizabeth; Yandl, Gregory; Broomall, William; Kolker, Natali

    2014-01-01

    Abstract Multi-omics data-driven scientific discovery crucially rests on high-throughput technologies and data sharing. Currently, data are scattered across single omics repositories, stored in varying raw and processed formats, and are often accompanied by limited or no metadata. The Multi-Omics Profiling Expression Database (MOPED, http://moped.proteinspire.org) version 2.5 is a freely accessible multi-omics expression database. Continual improvement and expansion of MOPED is driven by feedback from the Life Sciences Community. In order to meet the emergent need for an integrated multi-omics data resource, MOPED 2.5 now includes gene relative expression data in addition to protein absolute and relative expression data from over 250 large-scale experiments. To facilitate accurate integration of experiments and increase reproducibility, MOPED provides extensive metadata through the Data-Enabled Life Sciences Alliance (DELSA Global, http://delsaglobal.org) metadata checklist. MOPED 2.5 has greatly increased the number of proteomics absolute and relative expression records to over 500,000, in addition to adding more than four million transcriptomics relative expression records. MOPED has an intuitive user interface with tabs for querying different types of omics expression data and new tools for data visualization. Summary information including expression data, pathway mappings, and direct connection between proteins and genes can be viewed on Protein and Gene Details pages. These connections in MOPED provide a context for multi-omics expression data exploration. Researchers are encouraged to submit omics data which will be consistently processed into expression summaries. MOPED as a multi-omics data resource is a pivotal public database, interdisciplinary knowledge resource, and platform for multi-omics understanding. PMID:24910945

  17. Meta-analysis of pathway enrichment: combining independent and dependent omics data sets.

    PubMed

    Kaever, Alexander; Landesfeind, Manuel; Feussner, Kirstin; Morgenstern, Burkhard; Feussner, Ivo; Meinicke, Peter

    2014-01-01

    A major challenge in current systems biology is the combination and integrative analysis of large data sets obtained from different high-throughput omics platforms, such as mass spectrometry based Metabolomics and Proteomics or DNA microarray or RNA-seq-based Transcriptomics. Especially in the case of non-targeted Metabolomics experiments, where it is often impossible to unambiguously map ion features from mass spectrometry analysis to metabolites, the integration of more reliable omics technologies is highly desirable. A popular method for the knowledge-based interpretation of single data sets is the (Gene) Set Enrichment Analysis. In order to combine the results from different analyses, we introduce a methodical framework for the meta-analysis of p-values obtained from Pathway Enrichment Analysis (Set Enrichment Analysis based on pathways) of multiple dependent or independent data sets from different omics platforms. For dependent data sets, e.g. obtained from the same biological samples, the framework utilizes a covariance estimation procedure based on the nonsignificant pathways in single data set enrichment analysis. The framework is evaluated and applied in the joint analysis of Metabolomics mass spectrometry and Transcriptomics DNA microarray data in the context of plant wounding. In extensive studies of simulated data set dependence, the introduced correlation could be fully reconstructed by means of the covariance estimation based on pathway enrichment. By restricting the range of p-values of pathways considered in the estimation, the overestimation of correlation, which is introduced by the significant pathways, could be reduced. When applying the proposed methods to the real data sets, the meta-analysis was shown not only to be a powerful tool to investigate the correlation between different data sets and summarize the results of multiple analyses but also to distinguish experiment-specific key pathways.

  18. "Omics" of maize stress response for sustainable food production: opportunities and challenges.

    PubMed

    Gong, Fangping; Yang, Le; Tai, Fuju; Hu, Xiuli; Wang, Wei

    2014-12-01

    Maize originated in the highlands of Mexico approximately 8700 years ago and is one of the most commonly grown cereal crops worldwide, followed by wheat and rice. Abiotic stresses (primarily drought, salinity, and high and low temperatures), together with biotic stresses (primarily fungi, viruses, and pests), negatively affect maize growth, development, and eventually production. To understand the response of maize to abiotic and biotic stresses and its mechanism of stress tolerance, high-throughput omics approaches have been used in maize stress studies. Integrated omics approaches are crucial for dissecting the temporal and spatial system-level changes that occur in maize under various stresses. In this comprehensive analysis, we review the primary types of stresses that threaten sustainable maize production; underscore the recent advances in maize stress omics, especially proteomics; and discuss the opportunities, challenges, and future directions of maize stress omics, with a view to sustainable food production. The knowledge gained from studying maize stress omics is instrumental for improving maize to cope with various stresses and to meet the food demands of the exponentially growing global population. Omics systems science offers actionable potential solutions for sustainable food production, and we present maize as a notable case study.

  19. From data to knowledge: The future of multi-omics data analysis for the rhizosphere

    DOE PAGES

    Allen White III, Richard; Borkum, Mark I.; Rivas-Ubach, Albert; ...

    2017-05-04

    The rhizosphere is the interface between the root system of a plant and its surrounding soil. The microbiome of the rhizosphere, which is the totality of all microbes present there, represents a complex microbial ecosystem that nourishes the terrestrial biosphere. In order to untangle the complexity of the rhizosphere, and of the rhizospheric microbiome in particular, an integrated multi-omics approach can be applied to reveal the composition of the rhizospheric microbiome (through 16S ribosomal amplicons and metagenomics), the functional properties of the microbiome (through metatranscriptomics and metaproteomics), and the signaling network within the rhizosphere (through metametabolomics). The successful application ofmore » integrated multi-omics to rhizospheric science depends on the availability of rhizosphere-specific data and on the appropriate software used to analyze omics data from the rhizosphere. Here, we analyze the availability of software suites that are normally applied to surrogate disciplines (e.g., soil and plants) but which can be used for rhizospheric science. We also identify potential issues, challenges, and opportunities for rhizosphere science.« less

  20. Integrated metabolic flux and omics analysis of Synechocystis sp. PCC 6803 under mixotrophic and photoheterotrophic conditions.

    PubMed

    Nakajima, Tsubasa; Kajihata, Shuichi; Yoshikawa, Katsunori; Matsuda, Fumio; Furusawa, Chikara; Hirasawa, Takashi; Shimizu, Hiroshi

    2014-09-01

    Cyanobacteria have flexible metabolic capability that enables them to adapt to various environments. To investigate their underlying metabolic regulation mechanisms, we performed an integrated analysis of metabolic flux using transcriptomic and metabolomic data of a cyanobacterium Synechocystis sp. PCC 6803, under mixotrophic and photoheterotrophic conditions. The integrated analysis indicated drastic metabolic flux changes, with much smaller changes in gene expression levels and metabolite concentrations between the conditions, suggesting that the flux change was not caused mainly by the expression levels of the corresponding genes. Under photoheterotrophic conditions, created by the addition of the photosynthesis inhibitor atrazine in mixotrophic conditions, the result of metabolic flux analysis indicated the significant repression of carbon fixation and the activation of the oxidative pentose phosphate pathway (PPP). Moreover, we observed gluconeogenic activity of upstream of glycolysis, which enhanced the flux of the oxidative PPP to compensate for NADPH depletion due to the inhibition of the light reaction of photosynthesis. 'Omics' data suggested that these changes were probably caused by the repression of the gap1 gene, which functions as a control valve in the metabolic network. Since metabolic flux is the outcome of a complicated interplay of cellular components, integrating metabolic flux with other 'omics' layers can identify metabolic changes and narrow down these regulatory mechanisms more effectively. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  1. Poly-Omic Prediction of Complex Traits: OmicKriging

    PubMed Central

    Wheeler, Heather E.; Aquino-Michaels, Keston; Gamazon, Eric R.; Trubetskoy, Vassily V.; Dolan, M. Eileen; Huang, R. Stephanie; Cox, Nancy J.; Im, Hae Kyung

    2014-01-01

    High-confidence prediction of complex traits such as disease risk or drug response is an ultimate goal of personalized medicine. Although genome-wide association studies have discovered thousands of well-replicated polymorphisms associated with a broad spectrum of complex traits, the combined predictive power of these associations for any given trait is generally too low to be of clinical relevance. We propose a novel systems approach to complex trait prediction, which leverages and integrates similarity in genetic, transcriptomic, or other omics-level data. We translate the omic similarity into phenotypic similarity using a method called Kriging, commonly used in geostatistics and machine learning. Our method called OmicKriging emphasizes the use of a wide variety of systems-level data, such as those increasingly made available by comprehensive surveys of the genome, transcriptome, and epigenome, for complex trait prediction. Furthermore, our OmicKriging framework allows easy integration of prior information on the function of subsets of omics-level data from heterogeneous sources without the sometimes heavy computational burden of Bayesian approaches. Using seven disease datasets from the Wellcome Trust Case Control Consortium (WTCCC), we show that OmicKriging allows simple integration of sparse and highly polygenic components yielding comparable performance at a fraction of the computing time of a recently published Bayesian sparse linear mixed model method. Using a cellular growth phenotype, we show that integrating mRNA and microRNA expression data substantially increases performance over either dataset alone. Using clinical statin response, we show improved prediction over existing methods. PMID:24799323

  2. Pathview Web: user friendly pathway visualization and data integration.

    PubMed

    Luo, Weijun; Pant, Gaurav; Bhavnasi, Yeshvant K; Blanchard, Steven G; Brouwer, Cory

    2017-07-03

    Pathway analysis is widely used in omics studies. Pathway-based data integration and visualization is a critical component of the analysis. To address this need, we recently developed a novel R package called Pathview. Pathview maps, integrates and renders a large variety of biological data onto molecular pathway graphs. Here we developed the Pathview Web server, as to make pathway visualization and data integration accessible to all scientists, including those without the special computing skills or resources. Pathview Web features an intuitive graphical web interface and a user centered design. The server not only expands the core functions of Pathview, but also provides many useful features not available in the offline R package. Importantly, the server presents a comprehensive workflow for both regular and integrated pathway analysis of multiple omics data. In addition, the server also provides a RESTful API for programmatic access and conveniently integration in third-party software or workflows. Pathview Web is openly and freely accessible at https://pathview.uncc.edu/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Software for the Integration of Multiomics Experiments in Bioconductor.

    PubMed

    Ramos, Marcel; Schiffer, Lucas; Re, Angela; Azhar, Rimsha; Basunia, Azfar; Rodriguez, Carmen; Chan, Tiffany; Chapman, Phil; Davis, Sean R; Gomez-Cabrero, David; Culhane, Aedin C; Haibe-Kains, Benjamin; Hansen, Kasper D; Kodali, Hanish; Louis, Marie S; Mer, Arvind S; Riester, Markus; Morgan, Martin; Carey, Vince; Waldron, Levi

    2017-11-01

    Multiomics experiments are increasingly commonplace in biomedical research and add layers of complexity to experimental design, data integration, and analysis. R and Bioconductor provide a generic framework for statistical analysis and visualization, as well as specialized data classes for a variety of high-throughput data types, but methods are lacking for integrative analysis of multiomics experiments. The MultiAssayExperiment software package, implemented in R and leveraging Bioconductor software and design principles, provides for the coordinated representation of, storage of, and operation on multiple diverse genomics data. We provide the unrestricted multiple 'omics data for each cancer tissue in The Cancer Genome Atlas as ready-to-analyze MultiAssayExperiment objects and demonstrate in these and other datasets how the software simplifies data representation, statistical analysis, and visualization. The MultiAssayExperiment Bioconductor package reduces major obstacles to efficient, scalable, and reproducible statistical analysis of multiomics data and enhances data science applications of multiple omics datasets. Cancer Res; 77(21); e39-42. ©2017 AACR . ©2017 American Association for Cancer Research.

  4. Integration of multi-omics data for integrative gene regulatory network inference.

    PubMed

    Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun; Kang, Mingon

    2017-01-01

    Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called 'multi-omics data', that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN's capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed.

  5. Integration of multi-omics data for integrative gene regulatory network inference

    PubMed Central

    Zarayeneh, Neda; Ko, Euiseong; Oh, Jung Hun; Suh, Sang; Liu, Chunyu; Gao, Jean; Kim, Donghyun

    2017-01-01

    Gene regulatory networks provide comprehensive insights and indepth understanding of complex biological processes. The molecular interactions of gene regulatory networks are inferred from a single type of genomic data, e.g., gene expression data in most research. However, gene expression is a product of sequential interactions of multiple biological processes, such as DNA sequence variations, copy number variations, histone modifications, transcription factors, and DNA methylations. The recent rapid advances of high-throughput omics technologies enable one to measure multiple types of omics data, called ‘multi-omics data’, that represent the various biological processes. In this paper, we propose an Integrative Gene Regulatory Network inference method (iGRN) that incorporates multi-omics data and their interactions in gene regulatory networks. In addition to gene expressions, copy number variations and DNA methylations were considered for multi-omics data in this paper. The intensive experiments were carried out with simulation data, where iGRN’s capability that infers the integrative gene regulatory network is assessed. Through the experiments, iGRN shows its better performance on model representation and interpretation than other integrative methods in gene regulatory network inference. iGRN was also applied to a human brain dataset of psychiatric disorders, and the biological network of psychiatric disorders was analysed. PMID:29354189

  6. Omics Workshop Videocast Available | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The Omics Integration in Biology and Medicine Workshop, held on June 19th and 20th is now available for viewing on NIH Videocast: Day 1 and Day 2.  The workshop focused on the emerging field of integrating disparate omic data from genomics, proteomics, glycomics, etc. in order to better understand key biological processes and also improve clinical practice.

  7. Analysis Commons, A Team Approach to Discovery in a Big-Data Environment for Genetic Epidemiology

    PubMed Central

    Brody, Jennifer A.; Morrison, Alanna C.; Bis, Joshua C.; O'Connell, Jeffrey R.; Brown, Michael R.; Huffman, Jennifer E.; Ames, Darren C.; Carroll, Andrew; Conomos, Matthew P.; Gabriel, Stacey; Gibbs, Richard A.; Gogarten, Stephanie M.; Gupta, Namrata; Jaquish, Cashell E.; Johnson, Andrew D.; Lewis, Joshua P.; Liu, Xiaoming; Manning, Alisa K.; Papanicolaou, George J.; Pitsillides, Achilleas N.; Rice, Kenneth M.; Salerno, William; Sitlani, Colleen M.; Smith, Nicholas L.; Heckbert, Susan R.; Laurie, Cathy C.; Mitchell, Braxton D.; Vasan, Ramachandran S.; Rich, Stephen S.; Rotter, Jerome I.; Wilson, James G.; Boerwinkle, Eric; Psaty, Bruce M.; Cupples, L. Adrienne

    2017-01-01

    Summary paragraph The exploding volume of whole-genome sequence (WGS) and multi-omics data requires new approaches for analysis. As one solution, we have created a cloud-based Analysis Commons, which brings together genotype and phenotype data from multiple studies in a setting that is accessible by multiple investigators. This framework addresses many of the challenges of multi-center WGS analyses, including data sharing mechanisms, phenotype harmonization, integrated multi-omics analyses, annotation, and computational flexibility. In this setting, the computational pipeline facilitates a sequence-to-discovery analysis workflow illustrated here by an analysis of plasma fibrinogen levels in 3996 individuals from the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) WGS program. The Analysis Commons represents a novel model for transforming WGS resources from a massive quantity of phenotypic and genomic data into knowledge of the determinants of health and disease risk in diverse human populations. PMID:29074945

  8. Integrated omics dissection of proteome dynamics during cardiac remodeling.

    PubMed

    Lau, Edward; Cao, Quan; Lam, Maggie P Y; Wang, Jie; Ng, Dominic C M; Bleakley, Brian J; Lee, Jessica M; Liem, David A; Wang, Ding; Hermjakob, Henning; Ping, Peipei

    2018-01-09

    Transcript abundance and protein abundance show modest correlation in many biological models, but how this impacts disease signature discovery in omics experiments is rarely explored. Here we report an integrated omics approach, incorporating measurements of transcript abundance, protein abundance, and protein turnover to map the landscape of proteome remodeling in a mouse model of pathological cardiac hypertrophy. Analyzing the hypertrophy signatures that are reproducibly discovered from each omics data type across six genetic strains of mice, we find that the integration of transcript abundance, protein abundance, and protein turnover data leads to 75% gain in discovered disease gene candidates. Moreover, the inclusion of protein turnover measurements allows discovery of post-transcriptional regulations across diverse pathways, and implicates distinct disease proteins not found in steady-state transcript and protein abundance data. Our results suggest that multi-omics investigations of proteome dynamics provide important insights into disease pathogenesis in vivo.

  9. A practical data processing workflow for multi-OMICS projects.

    PubMed

    Kohl, Michael; Megger, Dominik A; Trippler, Martin; Meckel, Hagen; Ahrens, Maike; Bracht, Thilo; Weber, Frank; Hoffmann, Andreas-Claudius; Baba, Hideo A; Sitek, Barbara; Schlaak, Jörg F; Meyer, Helmut E; Stephan, Christian; Eisenacher, Martin

    2014-01-01

    Multi-OMICS approaches aim on the integration of quantitative data obtained for different biological molecules in order to understand their interrelation and the functioning of larger systems. This paper deals with several data integration and data processing issues that frequently occur within this context. To this end, the data processing workflow within the PROFILE project is presented, a multi-OMICS project that aims on identification of novel biomarkers and the development of new therapeutic targets for seven important liver diseases. Furthermore, a software called CrossPlatformCommander is sketched, which facilitates several steps of the proposed workflow in a semi-automatic manner. Application of the software is presented for the detection of novel biomarkers, their ranking and annotation with existing knowledge using the example of corresponding Transcriptomics and Proteomics data sets obtained from patients suffering from hepatocellular carcinoma. Additionally, a linear regression analysis of Transcriptomics vs. Proteomics data is presented and its performance assessed. It was shown, that for capturing profound relations between Transcriptomics and Proteomics data, a simple linear regression analysis is not sufficient and implementation and evaluation of alternative statistical approaches are needed. Additionally, the integration of multivariate variable selection and classification approaches is intended for further development of the software. Although this paper focuses only on the combination of data obtained from quantitative Proteomics and Transcriptomics experiments, several approaches and data integration steps are also applicable for other OMICS technologies. Keeping specific restrictions in mind the suggested workflow (or at least parts of it) may be used as a template for similar projects that make use of different high throughput techniques. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Omics-Based Strategies in Precision Medicine: Toward a Paradigm Shift in Inborn Errors of Metabolism Investigations

    PubMed Central

    Tebani, Abdellah; Afonso, Carlos; Marret, Stéphane; Bekri, Soumeya

    2016-01-01

    The rise of technologies that simultaneously measure thousands of data points represents the heart of systems biology. These technologies have had a huge impact on the discovery of next-generation diagnostics, biomarkers, and drugs in the precision medicine era. Systems biology aims to achieve systemic exploration of complex interactions in biological systems. Driven by high-throughput omics technologies and the computational surge, it enables multi-scale and insightful overviews of cells, organisms, and populations. Precision medicine capitalizes on these conceptual and technological advancements and stands on two main pillars: data generation and data modeling. High-throughput omics technologies allow the retrieval of comprehensive and holistic biological information, whereas computational capabilities enable high-dimensional data modeling and, therefore, accessible and user-friendly visualization. Furthermore, bioinformatics has enabled comprehensive multi-omics and clinical data integration for insightful interpretation. Despite their promise, the translation of these technologies into clinically actionable tools has been slow. In this review, we present state-of-the-art multi-omics data analysis strategies in a clinical context. The challenges of omics-based biomarker translation are discussed. Perspectives regarding the use of multi-omics approaches for inborn errors of metabolism (IEM) are presented by introducing a new paradigm shift in addressing IEM investigations in the post-genomic era. PMID:27649151

  11. Omics-Based Strategies in Precision Medicine: Toward a Paradigm Shift in Inborn Errors of Metabolism Investigations.

    PubMed

    Tebani, Abdellah; Afonso, Carlos; Marret, Stéphane; Bekri, Soumeya

    2016-09-14

    The rise of technologies that simultaneously measure thousands of data points represents the heart of systems biology. These technologies have had a huge impact on the discovery of next-generation diagnostics, biomarkers, and drugs in the precision medicine era. Systems biology aims to achieve systemic exploration of complex interactions in biological systems. Driven by high-throughput omics technologies and the computational surge, it enables multi-scale and insightful overviews of cells, organisms, and populations. Precision medicine capitalizes on these conceptual and technological advancements and stands on two main pillars: data generation and data modeling. High-throughput omics technologies allow the retrieval of comprehensive and holistic biological information, whereas computational capabilities enable high-dimensional data modeling and, therefore, accessible and user-friendly visualization. Furthermore, bioinformatics has enabled comprehensive multi-omics and clinical data integration for insightful interpretation. Despite their promise, the translation of these technologies into clinically actionable tools has been slow. In this review, we present state-of-the-art multi-omics data analysis strategies in a clinical context. The challenges of omics-based biomarker translation are discussed. Perspectives regarding the use of multi-omics approaches for inborn errors of metabolism (IEM) are presented by introducing a new paradigm shift in addressing IEM investigations in the post-genomic era.

  12. G-DOC Plus - an integrative bioinformatics platform for precision medicine.

    PubMed

    Bhuvaneshwar, Krithika; Belouali, Anas; Singh, Varun; Johnson, Robert M; Song, Lei; Alaoui, Adil; Harris, Michael A; Clarke, Robert; Weiner, Louis M; Gusev, Yuriy; Madhavan, Subha

    2016-04-30

    G-DOC Plus is a data integration and bioinformatics platform that uses cloud computing and other advanced computational tools to handle a variety of biomedical BIG DATA including gene expression arrays, NGS and medical images so that they can be analyzed in the full context of other omics and clinical information. G-DOC Plus currently holds data from over 10,000 patients selected from private and public resources including Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and the recently added datasets from REpository for Molecular BRAin Neoplasia DaTa (REMBRANDT), caArray studies of lung and colon cancer, ImmPort and the 1000 genomes data sets. The system allows researchers to explore clinical-omic data one sample at a time, as a cohort of samples; or at the level of population, providing the user with a comprehensive view of the data. G-DOC Plus tools have been leveraged in cancer and non-cancer studies for hypothesis generation and validation; biomarker discovery and multi-omics analysis, to explore somatic mutations and cancer MRI images; as well as for training and graduate education in bioinformatics, data and computational sciences. Several of these use cases are described in this paper to demonstrate its multifaceted usability. G-DOC Plus can be used to support a variety of user groups in multiple domains to enable hypothesis generation for precision medicine research. The long-term vision of G-DOC Plus is to extend this translational bioinformatics platform to stay current with emerging omics technologies and analysis methods to continue supporting novel hypothesis generation, analysis and validation for integrative biomedical research. By integrating several aspects of the disease and exposing various data elements, such as outpatient lab workup, pathology, radiology, current treatments, molecular signatures and expected outcomes over a web interface, G-DOC Plus will continue to strengthen precision medicine research. G-DOC Plus is available at: https://gdoc.georgetown.edu .

  13. Integrated data analysis for genome-wide research.

    PubMed

    Steinfath, Matthias; Repsilber, Dirk; Scholz, Matthias; Walther, Dirk; Selbig, Joachim

    2007-01-01

    Integrated data analysis is introduced as the intermediate level of a systems biology approach to analyse different 'omics' datasets, i.e., genome-wide measurements of transcripts, protein levels or protein-protein interactions, and metabolite levels aiming at generating a coherent understanding of biological function. In this chapter we focus on different methods of correlation analyses ranging from simple pairwise correlation to kernel canonical correlation which were recently applied in molecular biology. Several examples are presented to illustrate their application. The input data for this analysis frequently originate from different experimental platforms. Therefore, preprocessing steps such as data normalisation and missing value estimation are inherent to this approach. The corresponding procedures, potential pitfalls and biases, and available software solutions are reviewed. The multiplicity of observations obtained in omics-profiling experiments necessitates the application of multiple testing correction techniques.

  14. Omics and multi-omics approaches to study the biosynthesis of secondary metabolites in microorganisms.

    PubMed

    Palazzotto, Emilia; Weber, Tilmann

    2018-04-12

    Natural products produced by microorganisms represent the main source of bioactive molecules. The development of high-throughput (omics) techniques have importantly contributed to the renaissance of new antibiotic discovery increasing our understanding of complex mechanisms controlling the expression of biosynthetic gene clusters (BGCs) encoding secondary metabolites. In this context this review highlights recent progress in the use and integration of 'omics' approaches with focuses on genomics, transcriptomics, proteomics metabolomics meta-omics and combined omics as powerful strategy to discover new antibiotics. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. “Omics” of Maize Stress Response for Sustainable Food Production: Opportunities and Challenges

    PubMed Central

    Gong, Fangping; Yang, Le; Tai, Fuju; Hu, Xiuli

    2014-01-01

    Abstract Maize originated in the highlands of Mexico approximately 8700 years ago and is one of the most commonly grown cereal crops worldwide, followed by wheat and rice. Abiotic stresses (primarily drought, salinity, and high and low temperatures), together with biotic stresses (primarily fungi, viruses, and pests), negatively affect maize growth, development, and eventually production. To understand the response of maize to abiotic and biotic stresses and its mechanism of stress tolerance, high-throughput omics approaches have been used in maize stress studies. Integrated omics approaches are crucial for dissecting the temporal and spatial system-level changes that occur in maize under various stresses. In this comprehensive analysis, we review the primary types of stresses that threaten sustainable maize production; underscore the recent advances in maize stress omics, especially proteomics; and discuss the opportunities, challenges, and future directions of maize stress omics, with a view to sustainable food production. The knowledge gained from studying maize stress omics is instrumental for improving maize to cope with various stresses and to meet the food demands of the exponentially growing global population. Omics systems science offers actionable potential solutions for sustainable food production, and we present maize as a notable case study. PMID:25401749

  16. Integrated 'omics analysis reveals new drug-induced mitochondrial perturbations in human hepatocytes.

    PubMed

    Wolters, Jarno E J; van Breda, Simone G J; Grossmann, Jonas; Fortes, Claudia; Caiment, Florian; Kleinjans, Jos C S

    2018-06-01

    We performed a multiple 'omics study by integrating data on epigenomic, transcriptomic, and proteomic perturbations associated with mitochondrial dysfunction in primary human hepatocytes caused by the liver toxicant valproic acid (VPA), to deeper understand downstream events following epigenetic alterations in the mitochondrial genome. Furthermore, we investigated persistence of cross-omics changes after terminating drug treatment. Upon transient methylation changes of mitochondrial genes during VPA-treatment, increasing complexities of gene-interaction networks across time were demonstrated, which normalized during washout. Furthermore, co-expression between genes and their corresponding proteins increased across time. Additionally, in relation to persistently decreased ATP production, we observed decreased expression of mitochondrial complex I and III-V genes. Persistent transcripts and proteins were related to citric acid cycle and β-oxidation. In particular, we identified a potential novel mitochondrial-nuclear signaling axis, MT-CO2-FN1-MYC-CPT1. In summary, this cross-omics study revealed dynamic responses of the mitochondrial epigenome to an impulse toxicant challenge resulting in persistent mitochondrial dysfunctioning. Moreover, this approach allowed for discriminating between the toxic effect of VPA and adaptation. Copyright © 2018 Elsevier B.V. All rights reserved.

  17. Arrowland v1.0

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    BIRKEL, GARRETT; GARCIA MARTIN, HECTOR; MORRELL, WILLIAM

    "Arrowland" is a web-based software application primarily for mapping, integrating and visualizing a variety of metabolism data of living organisms, including but not limited to metabolomics, proteomics, transcriptomics and fluxomics. This software application makes multi-omics data analysis intuitive and interactive. It improves data sharing and communication by enabling users to visualize their omics data using a web browser (on a PC or mobile device). It increases user's productivity by simplifying multi-omics data analysis using well developed maps as a guide. Users using this tool can gain insights into their data sets that would be difficult or even impossible to teasemore » out by looking at raw number, or using their currently existing toolchains to generate static single-use maps. Arrowland helps users save time by visualizing relative changes in different conditions or over time, and helps users to produce more significant insights faster. Preexisting maps decrease the learning curve for beginners in the omics field. Sets of multi-omics data are presented in the browser, as a two-dimensional flowchart resembling a map, with varying levels of detail information, based on the scaling of the map. Users can pan and zoom to explore different maps, compare maps, upload their own research data sets onto desired maps, alter map appearance in ways that facilitate interpretation, visualization and analysis of the given data, and export data, reports and actionable items to help the user initiative.« less

  18. Network Analysis of Rodent Transcriptomes in Spaceflight

    NASA Technical Reports Server (NTRS)

    Ramachandran, Maya; Fogle, Homer; Costes, Sylvain

    2017-01-01

    Network analysis methods leverage prior knowledge of cellular systems and the statistical and conceptual relationships between analyte measurements to determine gene connectivity. Correlation and conditional metrics are used to infer a network topology and provide a systems-level context for cellular responses. Integration across multiple experimental conditions and omics domains can reveal the regulatory mechanisms that underlie gene expression. GeneLab has assembled rich multi-omic (transcriptomics, proteomics, epigenomics, and epitranscriptomics) datasets for multiple murine tissues from the Rodent Research 1 (RR-1) experiment. RR-1 assesses the impact of 37 days of spaceflight on gene expression across a variety of tissue types, such as adrenal glands, quadriceps, gastrocnemius, tibalius anterior, extensor digitorum longus, soleus, eye, and kidney. Network analysis is particularly useful for RR-1 -omics datasets because it reinforces subtle relationships that may be overlooked in isolated analyses and subdues confounding factors. Our objective is to use network analysis to determine potential target nodes for therapeutic intervention and identify similarities with existing disease models. Multiple network algorithms are used for a higher confidence consensus.

  19. Integrated ‘omics analysis for studying the microbial community response to a pH perturbation of a cellulose-degrading bioreactor culture

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Boaro, Amy A.; Kim, Young-Mo; Konopka, Allan

    2014-12-01

    Integrated ‘omics have been used on pure cultures and co-cultures, yet they have not been applied to complex microbial communities to examine questions of perturbation response. In this study, we used integrated ‘omics to measure the perturbation response of a cellulose-degrading bioreactor community fed with microcrystalline cellulose (Avicel). We predicted that a pH decrease by addition of a pulse of acid would reduce microbial community diversity and temporarily reduce reactor function such as cellulose degradation. However, 16S rDNA pyrosequencing results revealed increased alpha diversity in the microbial community after the perturbation, and a persistence of the dominant community members overmore » the duration of the experiment. Proteomics results showed a decrease in activity of proteins associated with Fibrobacter succinogenes two days after the perturbation followed by increased protein abundances six days after the perturbation. The decrease in cellulolytic activity suggested by the proteomics was confirmed by the accumulation of Avicel in the reactor. Metabolomics showed a pattern similar to that of the proteome, with amino acid production decreasing two days after the perturbation and increasing after six days. This study demonstrated that community ‘omics data provides valuable information about the interactions and function of anaerobic cellulolytic community members after a perturbation.« less

  20. Omics Data Complementarity Underlines Functional Cross-Communication in Yeast.

    PubMed

    Malod-Dognin, Noël; Pržulj, Nataša

    2017-06-10

    Mapping the complete functional layout of a cell and understanding the cross-talk between different processes are fundamental challenges. They elude us because of the incompleteness and noisiness of molecular data and because of the computational intractability of finding the exact answer. We perform a simple integration of three types of baker's yeast omics data to elucidate the functional organization and lines of cross-functional communication. We examine protein-protein interaction (PPI), co-expression (COEX) and genetic interaction (GI) data, and explore their relationship with the gold standard of functional organization, the Gene Ontology (GO). We utilize a simple framework that identifies functional cross-communication lines in each of the three data types, in GO, and collectively in the integrated model of the three omics data types; we present each of them in our new Functional Organization Map (FOM) model. We compare the FOMs of the three omics datasets with the FOM of GO and find that GI is in best agreement with GO, followed COEX and PPI. We integrate the three FOMs into a unified FOM and find that it is in better agreement with the FOM of GO than those of any omics dataset alone, demonstrating functional complementarity of different omics data.

  1. Integrative Analysis of Longitudinal Metabolomics Data from a Personal Multi-Omics Profile

    PubMed Central

    Stanberry, Larissa; Mias, George I.; Haynes, Winston; Higdon, Roger; Snyder, Michael; Kolker, Eugene

    2013-01-01

    The integrative personal omics profile (iPOP) is a pioneering study that combines genomics, transcriptomics, proteomics, metabolomics and autoantibody profiles from a single individual over a 14-month period. The observation period includes two episodes of viral infection: a human rhinovirus and a respiratory syncytial virus. The profile studies give an informative snapshot into the biological functioning of an organism. We hypothesize that pathway expression levels are associated with disease status. To test this hypothesis, we use biological pathways to integrate metabolomics and proteomics iPOP data. The approach computes the pathways’ differential expression levels at each time point, while taking into account the pathway structure and the longitudinal design. The resulting pathway levels show strong association with the disease status. Further, we identify temporal patterns in metabolite expression levels. The changes in metabolite expression levels also appear to be consistent with the disease status. The results of the integrative analysis suggest that changes in biological pathways may be used to predict and monitor the disease. The iPOP experimental design, data acquisition and analysis issues are discussed within the broader context of personal profiling. PMID:24958148

  2. Omics integrating physical techniques: aged Piedmontese meat analysis.

    PubMed

    Lana, Alessandro; Longo, Valentina; Dalmasso, Alessandra; D'Alessandro, Angelo; Bottero, Maria Teresa; Zolla, Lello

    2015-04-01

    Piedmontese meat tenderness becomes higher by extending the ageing period after slaughter up to 44 days. Classical physical analysis only partially explain this evidence, so in order to discover the reason of the potential beneficial effects of prolonged ageing, we performed omic analysis in the Longissimus thoracis muscle by examining main biochemical changes through mass spectrometry-based metabolomics and proteomics. We observed a progressive decline in myofibrillar structural integrity (underpinning meat tenderness) and impaired energy metabolism. Markers of autophagic responses (e.g. serine and glutathione metabolism) and nitrogen metabolism (urea cycle intermediates) accumulated until the end of the assayed period. Key metabolites such as glutamate, a mediator of the appreciated umami taste of the meat, were found to constantly accumulate until day 44. Finally, statistical analyses revealed that glutamate, serine and arginine could serve as good predictors of ultimate meat quality parameters, even though further studies are mandatory. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. [Proteomics and personalized medicine].

    PubMed

    Rocchiccioli, Silvia; Tedeschi, Lorena; Citti, Lorenzo; Cecchettini, Antonella

    2013-05-01

    With the disclosure of the human genome a new era for bio-medicine has arisen, characterized by the challenge to investigate pathogenic mechanisms, studying simultaneously metabolites, DNA, RNA, and proteins. As a result, the "omics" revolution boomed, giving birth to a new medicine named "omics-based medicine". Among the other "omics", proteomics has been widely used in medicine, since it can produce a more "holistic" overview of a disease and provide a "constellation" of possible specific markers, a molecular fingerprinting that defines the clinical condition of an individual. Endpoint of this comprehensive and detailed analysis is the "diagnostic-omics", i.e. the achievement of personalized diagnoses with obvious benefits for prevention and therapy and this goal can be reached only with a perfect integration between clinicians and proteomists. To impact on the possible key factors involved in the pathological processes, oligonucleotide-based knock-down strategies can be helpful. They exploit omics-derived molecular tools (antisense, siRNA, ribozymes, decoys, and aptamers) that can be used to inhibit, at transcriptional or post-transcriptional levels, the events leading to protein synthesis, thus decreasing its expression. The identification of the pivotal mechanisms involved in diseases using global, "scenic" approaches such as the "omics" ones, and the subsequent validation and detailed description of the processes by specific molecular tools, can result in a more preventive, predictive and personalized medicine.

  4. Modelling plankton ecosystems in the meta-omics era. Are we ready?

    PubMed

    Stec, Krzysztof Franciszek; Caputi, Luigi; Buttigieg, Pier Luigi; D'Alelio, Domenico; Ibarbalz, Federico Matias; Sullivan, Matthew B; Chaffron, Samuel; Bowler, Chris; Ribera d'Alcalà, Maurizio; Iudicone, Daniele

    2017-04-01

    Recent progress in applying meta-omics approaches to the study of marine ecosystems potentially allows scientists to study the genetic and functional diversity of plankton at an unprecedented depth and with enhanced precision. However, while a range of persistent technical issues still need to be resolved, a much greater obstacle currently preventing a complete and integrated view of the marine ecosystem is the absence of a clear conceptual framework. Herein, we discuss the knowledge that has thus far been derived from conceptual and statistical modelling of marine plankton ecosystems, and illustrate the potential power of integrated meta-omics approaches in the field. We then propose the use of a semantic framework is necessary to support integrative ecological modelling in the meta-omics era, particularly when having to face the increased interdisciplinarity needed to address global issues related to climate change. Copyright © 2017. Published by Elsevier B.V.

  5. Metabolic engineering with plants for a sustainable biobased economy.

    PubMed

    Yoon, Jong Moon; Zhao, Le; Shanks, Jacqueline V

    2013-01-01

    Plants are bona fide sustainable organisms because they accumulate carbon and synthesize beneficial metabolites from photosynthesis. To meet the challenges to food security and health threatened by increasing population growth and depletion of nonrenewable natural resources, recent metabolic engineering efforts have shifted from single pathways to holistic approaches with multiple genes owing to integration of omics technologies. Successful engineering of plants results in the high yield of biomass components for primary food sources and biofuel feedstocks, pharmaceuticals, and platform chemicals through synthetic biology and systems biology strategies. Further discovery of undefined biosynthesis pathways in plants, integrative analysis of discrete omics data, and diversified process developments for production of platform chemicals are essential to overcome the hurdles for sustainable production of value-added biomolecules from plants.

  6. Multiomics Data Triangulation for Asthma Candidate Biomarkers and Precision Medicine.

    PubMed

    Pecak, Matija; Korošec, Peter; Kunej, Tanja

    2018-06-01

    Asthma is a common complex disorder and has been subject to intensive omics research for disease susceptibility and therapeutic innovation. Candidate biomarkers of asthma and its precision treatment demand that they stand the test of multiomics data triangulation before they can be prioritized for clinical applications. We classified the biomarkers of asthma after a search of the literature and based on whether or not a given biomarker candidate is reported in multiple omics platforms and methodologies, using PubMed and Web of Science, we identified omics studies of asthma conducted on diverse platforms using keywords, such as asthma, genomics, metabolomics, and epigenomics. We extracted data about asthma candidate biomarkers from 73 articles and developed a catalog of 190 potential asthma biomarkers (167 human, 23 animal data), comprising DNA loci, transcripts, proteins, metabolites, epimutations, and noncoding RNAs. The data were sorted according to 13 omics types: genomics, epigenomics, transcriptomics, proteomics, interactomics, metabolomics, ncRNAomics, glycomics, lipidomics, environmental omics, pharmacogenomics, phenomics, and integrative omics. Importantly, we found that 10 candidate biomarkers were apparent in at least two or more omics levels, thus promising potential for further biomarker research and development and precision medicine applications. This multiomics catalog reported herein for the first time contributes to future decision-making on prioritization of biomarkers and validation efforts for precision medicine in asthma. The findings may also facilitate meta-analyses and integrative omics studies in the future.

  7. Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits.

    PubMed

    Wu, Yang; Zeng, Jian; Zhang, Futao; Zhu, Zhihong; Qi, Ting; Zheng, Zhili; Lloyd-Jones, Luke R; Marioni, Riccardo E; Martin, Nicholas G; Montgomery, Grant W; Deary, Ian J; Wray, Naomi R; Visscher, Peter M; McRae, Allan F; Yang, Jian

    2018-03-02

    The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.

  8. HARNESSING BIG DATA FOR PRECISION MEDICINE: INFRASTRUCTURES AND APPLICATIONS.

    PubMed

    Yu, Kun-Hsing; Hart, Steven N; Goldfeder, Rachel; Zhang, Qiangfeng Cliff; Parker, Stephen C J; Snyder, Michael

    2017-01-01

    Precision medicine is a health management approach that accounts for individual differences in genetic backgrounds and environmental exposures. With the recent advancements in high-throughput omics profiling technologies, collections of large study cohorts, and the developments of data mining algorithms, big data in biomedicine is expected to provide novel insights into health and disease states, which can be translated into personalized disease prevention and treatment plans. However, petabytes of biomedical data generated by multiple measurement modalities poses a significant challenge for data analysis, integration, storage, and result interpretation. In addition, patient privacy preservation, coordination between participating medical centers and data analysis working groups, as well as discrepancies in data sharing policies remain important topics of discussion. In this workshop, we invite experts in omics integration, biobank research, and data management to share their perspectives on leveraging big data to enable precision medicine.Workshop website: http://tinyurl.com/PSB17BigData; HashTag: #PSB17BigData.

  9. EU Framework 6 Project: Predictive Toxicology (PredTox)-overview and outcome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Suter, Laura, E-mail: Laura.suter-dick@roche.com; Schroeder, Susanne; Meyer, Kirstin

    2011-04-15

    In this publication, we report the outcome of the integrated EU Framework 6 Project: Predictive Toxicology (PredTox), including methodological aspects and overall conclusions. Specific details including data analysis and interpretation are reported in separate articles in this issue. The project, partly funded by the EU, was carried out by a consortium of 15 pharmaceutical companies, 2 SMEs, and 3 universities. The effects of 16 test compounds were characterized using conventional toxicological parameters and 'omics' technologies. The three major observed toxicities, liver hypertrophy, bile duct necrosis and/or cholestasis, and kidney proximal tubular damage were analyzed in detail. The combined approach ofmore » 'omics' and conventional toxicology proved a useful tool for mechanistic investigations and the identification of putative biomarkers. In our hands and in combination with histopathological assessment, target organ transcriptomics was the most prolific approach for the generation of mechanistic hypotheses. Proteomics approaches were relatively time-consuming and required careful standardization. NMR-based metabolomics detected metabolite changes accompanying histopathological findings, providing limited additional mechanistic information. Conversely, targeted metabolite profiling with LC/GC-MS was very useful for the investigation of bile duct necrosis/cholestasis. In general, both proteomics and metabolomics were supportive of other findings. Thus, the outcome of this program indicates that 'omics' technologies can help toxicologists to make better informed decisions during exploratory toxicological studies. The data support that hypothesis on mode of action and discovery of putative biomarkers are tangible outcomes of integrated 'omics' analysis. Qualification of biomarkers remains challenging, in particular in terms of identification, mechanistic anchoring, appropriate specificity, and sensitivity.« less

  10. Multi -omics and metabolic modelling pipelines: challenges and tools for systems microbiology.

    PubMed

    Fondi, Marco; Liò, Pietro

    2015-02-01

    Integrated -omics approaches are quickly spreading across microbiology research labs, leading to (i) the possibility of detecting previously hidden features of microbial cells like multi-scale spatial organization and (ii) tracing molecular components across multiple cellular functional states. This promises to reduce the knowledge gap between genotype and phenotype and poses new challenges for computational microbiologists. We underline how the capability to unravel the complexity of microbial life will strongly depend on the integration of the huge and diverse amount of information that can be derived today from -omics experiments. In this work, we present opportunities and challenges of multi -omics data integration in current systems biology pipelines. We here discuss which layers of biological information are important for biotechnological and clinical purposes, with a special focus on bacterial metabolism and modelling procedures. A general review of the most recent computational tools for performing large-scale datasets integration is also presented, together with a possible framework to guide the design of systems biology experiments by microbiologists. Copyright © 2015. Published by Elsevier GmbH.

  11. imDEV: a graphical user interface to R multivariate analysis tools in Microsoft Excel

    USDA-ARS?s Scientific Manuscript database

    Interactive modules for data exploration and visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data sets with a user-friendly interface. Individual modules were designed to provide toolsets to enable interactive ...

  12. Imaging and the completion of the omics paradigm in breast cancer.

    PubMed

    Leithner, D; Horvat, J V; Ochoa-Albiztegui, R E; Thakur, S; Wengert, G; Morris, E A; Helbich, T H; Pinker, K

    2018-06-08

    Within the field of oncology, "omics" strategies-genomics, transcriptomics, proteomics, metabolomics-have many potential applications and may significantly improve our understanding of the underlying processes of cancer development and progression. Omics strategies aim to develop meaningful imaging biomarkers for breast cancer (BC) by rapid assessment of large datasets with different biological information. In BC the paradigm of omics technologies has always favored the integration of multiple layers of omics data to achieve a complete portrait of BC. Advances in medical imaging technologies, image analysis, and the development of high-throughput methods that can extract and correlate multiple imaging parameters with "omics" data have ushered in a new direction in medical research. Radiogenomics is a novel omics strategy that aims to correlate imaging characteristics (i. e., the imaging phenotype) with underlying gene expression patterns, gene mutations, and other genome-related characteristics. Radiogenomics not only represents the evolution in the radiology-pathology correlation from the anatomical-histological level to the molecular level, but it is also a pivotal step in the omics paradigm in BC in order to fully characterize BC. Armed with modern analytical software tools, radiogenomics leads to new discoveries of quantitative and qualitative imaging biomarkers that offer hitherto unprecedented insights into the complex tumor biology and facilitate a deeper understanding of cancer development and progression. The field of radiogenomics in breast cancer is rapidly evolving, and results from previous studies are encouraging. It can be expected that radiogenomics will play an important role in the future and has the potential to revolutionize the diagnosis, treatment, and prognosis of BC patients. This article aims to give an overview of breast radiogenomics, its current role, future applications, and challenges.

  13. Plant Omics Data Center: An Integrated Web Repository for Interspecies Gene Expression Networks with NLP-Based Curation

    PubMed Central

    Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro

    2015-01-01

    Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. PMID:25505034

  14. A-WINGS: an integrated genome database for Pleurocybella porrigens (Angel's wing oyster mushroom, Sugihiratake).

    PubMed

    Yamamoto, Naoki; Suzuki, Tomohiro; Kobayashi, Masaaki; Dohra, Hideo; Sasaki, Yohei; Hirai, Hirofumi; Yokoyama, Koji; Kawagishi, Hirokazu; Yano, Kentaro

    2014-12-03

    The angel's wing oyster mushroom (Pleurocybella porrigens, Sugihiratake) is a well-known delicacy. However, its potential risk in acute encephalopathy was recently revealed by a food poisoning incident. To disclose the genes underlying the accident and provide mechanistic insight, we seek to develop an information infrastructure containing omics data. In our previous work, we sequenced the genome and transcriptome using next-generation sequencing techniques. The next step in achieving our goal is to develop a web database to facilitate the efficient mining of large-scale omics data and identification of genes specifically expressed in the mushroom. This paper introduces a web database A-WINGS (http://bioinf.mind.meiji.ac.jp/a-wings/) that provides integrated genomic and transcriptomic information for the angel's wing oyster mushroom. The database contains structure and functional annotations of transcripts and gene expressions. Functional annotations contain information on homologous sequences from NCBI nr and UniProt, Gene Ontology, and KEGG Orthology. Digital gene expression profiles were derived from RNA sequencing (RNA-seq) analysis in the fruiting bodies and mycelia. The omics information stored in the database is freely accessible through interactive and graphical interfaces by search functions that include 'GO TREE VIEW' browsing, keyword searches, and BLAST searches. The A-WINGS database will accelerate omics studies on specific aspects of the angel's wing oyster mushroom and the family Tricholomataceae.

  15. Accessing and integrating data and knowledge for biomedical research.

    PubMed

    Burgun, A; Bodenreider, O

    2008-01-01

    To review the issues that have arisen with the advent of translational research in terms of integration of data and knowledge, and survey current efforts to address these issues. Using examples form the biomedical literature, we identified new trends in biomedical research and their impact on bioinformatics. We analyzed the requirements for effective knowledge repositories and studied issues in the integration of biomedical knowledge. New diagnostic and therapeutic approaches based on gene expression patterns have brought about new issues in the statistical analysis of data, and new workflows are needed are needed to support translational research. Interoperable data repositories based on standard annotations, infrastructures and services are needed to support the pooling and meta-analysis of data, as well as their comparison to earlier experiments. High-quality, integrated ontologies and knowledge bases serve as a source of prior knowledge used in combination with traditional data mining techniques and contribute to the development of more effective data analysis strategies. As biomedical research evolves from traditional clinical and biological investigations towards omics sciences and translational research, specific needs have emerged, including integrating data collected in research studies with patient clinical data, linking omics knowledge with medical knowledge, modeling the molecular basis of diseases, and developing tools that support in-depth analysis of research data. As such, translational research illustrates the need to bridge the gap between bioinformatics and medical informatics, and opens new avenues for biomedical informatics research.

  16. Alzheimer's disease in the omics era.

    PubMed

    Sancesario, Giulia M; Bernardini, Sergio

    2018-06-18

    Recent progresses in high-throughput technologies have led to a new scenario in investigating pathologies, named the "Omics era", which integrate the opportunity to collect large amounts of data and information at the molecular and protein levels together with the development of novel computational and statistical tools that are able to analyze and filter such data. Subsequently, advances in genotyping arrays, next generation sequencing, mass spectrometry technology, and bioinformatics allowed for the simultaneous large-scale study of thousands of genes (genomics), epigenetics factors (epigenomics), RNA (transcriptomics), metabolites (metabolomics) and proteins(proteomics), with the possibility of integrating multiple types of omics data ("multi -omics"). All of these technological innovations have modified the approach to the study of complex diseases, such as Alzheimer's Disease (AD), thus representing a promising tool to investigate the relationship between several molecular pathways in AD as well as other pathologies. This review focuses on the current knowledge on the pathology of AD, the recent findings from Omics sciences, and the challenge of the use of Big Data. We then focus on future perspectives for Omics sciences, such as the discovery of novel diagnostic biomarkers or drugs. Copyright © 2018. Published by Elsevier Inc.

  17. Advances in Omics and Bioinformatics Tools for Systems Analyses of Plant Functions

    PubMed Central

    Mochida, Keiichi; Shinozaki, Kazuo

    2011-01-01

    Omics and bioinformatics are essential to understanding the molecular systems that underlie various plant functions. Recent game-changing sequencing technologies have revitalized sequencing approaches in genomics and have produced opportunities for various emerging analytical applications. Driven by technological advances, several new omics layers such as the interactome, epigenome and hormonome have emerged. Furthermore, in several plant species, the development of omics resources has progressed to address particular biological properties of individual species. Integration of knowledge from omics-based research is an emerging issue as researchers seek to identify significance, gain biological insights and promote translational research. From these perspectives, we provide this review of the emerging aspects of plant systems research based on omics and bioinformatics analyses together with their associated resources and technological advances. PMID:22156726

  18. Genome-environment interactions and prospective technology assessment: evolution from pharmacogenomics to nutrigenomics and ecogenomics.

    PubMed

    Ozdemir, Vural; Motulsky, Arno G; Kolker, Eugene; Godard, Béatrice

    2009-02-01

    The relationships between food, nutrition science, and health outcomes have been mapped over the past century. Genomic variation among individuals and populations is a new factor that enriches and challenges our understanding of these complex relationships. Hence, the confluence of nutritional science and genomics-nutrigenomics--was the focus of the OMICS: A Journal of Integrative Biology in December 2008 (Part 1). The 2009 Special Issue (Part 2) concludes the analysis of nutrigenomics research and innovations. Together, these two issues expand the scope and depth of critical scholarship in nutrigenomics, in keeping with an integrated multidisciplinary analysis across the bioscience, omics technology, social, ethical, intellectual property and policy dimensions. Historically, the field of pharmacogenetics provided the first examples of specifically identifiable gene variants predisposing to unexpected responses to drugs since the 1950s. Brewer coined the term ecogenetics in 1971 to broaden the concept of gene-environment interactions from drugs and nutrition to include environmental agents in general. In the mid-1990s, introduction of high-throughput technologies led to the terms pharmacogenomics, nutrigenomics and ecogenomics to describe, respectively, the contribution of genomic variability to differential responses to drugs, food, and environment defined in the broadest sense. The distinctions, if any, between these newer fields (e.g., nutrigenomics) and their predecessors (e.g., nutrigenetics) remain to be delineated. For nutrigenomics, its reliance on genome-wide analyses may lead to detection of new biological mechanisms governing host response to food. Recognizing "genome-environment interactions" as the conceptual thread that connects and runs through pharmacogenomics, nutrigenomics, and ecogenomics may contribute toward anticipatory governance and prospective real-time analysis of these omics fields. Such real-time analysis of omics technologies and innovations is crucial, because it can influence and positively shape them as these approaches develop, and help avoid predictable pitfalls, and thus ensure their effective and ethical application in the laboratory, clinic, and society.

  19. Metabolic Reconstruction of Setaria italica: A Systems Biology Approach for Integrating Tissue-Specific Omics and Pathway Analysis of Bioenergy Grasses.

    PubMed

    de Oliveira Dal'Molin, Cristiana G; Orellana, Camila; Gebbie, Leigh; Steen, Jennifer; Hodson, Mark P; Chrysanthopoulos, Panagiotis; Plan, Manuel R; McQualter, Richard; Palfreyman, Robin W; Nielsen, Lars K

    2016-01-01

    The urgent need for major gains in industrial crops productivity and in biofuel production from bioenergy grasses have reinforced attention on understanding C4 photosynthesis. Systems biology studies of C4 model plants may reveal important features of C4 metabolism. Here we chose foxtail millet (Setaria italica), as a C4 model plant and developed protocols to perform systems biology studies. As part of the systems approach, we have developed and used a genome-scale metabolic reconstruction in combination with the use of multi-omics technologies to gain more insights into the metabolism of S. italica. mRNA, protein, and metabolite abundances, were measured in mature and immature stem/leaf phytomers, and the multi-omics data were integrated into the metabolic reconstruction framework to capture key metabolic features in different developmental stages of the plant. RNA-Seq reads were mapped to the S. italica resulting for 83% coverage of the protein coding genes of S. italica. Besides revealing similarities and differences in central metabolism of mature and immature tissues, transcriptome analysis indicates significant gene expression of two malic enzyme isoforms (NADP- ME and NAD-ME). Although much greater expression levels of NADP-ME genes are observed and confirmed by the correspondent protein abundances in the samples, the expression of multiple genes combined to the significant abundance of metabolites that participates in C4 metabolism of NAD-ME and NADP-ME subtypes suggest that S. italica may use mixed decarboxylation modes of C4 photosynthetic pathways under different plant developmental stages. The overall analysis also indicates different levels of regulation in mature and immature tissues in carbon fixation, glycolysis, TCA cycle, amino acids, fatty acids, lignin, and cellulose syntheses. Altogether, the multi-omics analysis reveals different biological entities and their interrelation and regulation over plant development. With this study, we demonstrated that this systems approach is powerful enough to complement the functional metabolic annotation of bioenergy grasses.

  20. Metabolic Reconstruction of Setaria italica: A Systems Biology Approach for Integrating Tissue-Specific Omics and Pathway Analysis of Bioenergy Grasses

    PubMed Central

    de Oliveira Dal'Molin, Cristiana G.; Orellana, Camila; Gebbie, Leigh; Steen, Jennifer; Hodson, Mark P.; Chrysanthopoulos, Panagiotis; Plan, Manuel R.; McQualter, Richard; Palfreyman, Robin W.; Nielsen, Lars K.

    2016-01-01

    The urgent need for major gains in industrial crops productivity and in biofuel production from bioenergy grasses have reinforced attention on understanding C4 photosynthesis. Systems biology studies of C4 model plants may reveal important features of C4 metabolism. Here we chose foxtail millet (Setaria italica), as a C4 model plant and developed protocols to perform systems biology studies. As part of the systems approach, we have developed and used a genome-scale metabolic reconstruction in combination with the use of multi-omics technologies to gain more insights into the metabolism of S. italica. mRNA, protein, and metabolite abundances, were measured in mature and immature stem/leaf phytomers, and the multi-omics data were integrated into the metabolic reconstruction framework to capture key metabolic features in different developmental stages of the plant. RNA-Seq reads were mapped to the S. italica resulting for 83% coverage of the protein coding genes of S. italica. Besides revealing similarities and differences in central metabolism of mature and immature tissues, transcriptome analysis indicates significant gene expression of two malic enzyme isoforms (NADP- ME and NAD-ME). Although much greater expression levels of NADP-ME genes are observed and confirmed by the correspondent protein abundances in the samples, the expression of multiple genes combined to the significant abundance of metabolites that participates in C4 metabolism of NAD-ME and NADP-ME subtypes suggest that S. italica may use mixed decarboxylation modes of C4 photosynthetic pathways under different plant developmental stages. The overall analysis also indicates different levels of regulation in mature and immature tissues in carbon fixation, glycolysis, TCA cycle, amino acids, fatty acids, lignin, and cellulose syntheses. Altogether, the multi-omics analysis reveals different biological entities and their interrelation and regulation over plant development. With this study, we demonstrated that this systems approach is powerful enough to complement the functional metabolic annotation of bioenergy grasses. PMID:27559337

  1. Five omic technologies are concordant in differentiating the biochemical characteristics of the berries of five grapevine (Vitis vinifera L.) cultivars.

    PubMed

    Ghan, Ryan; Van Sluyter, Steven C; Hochberg, Uri; Degu, Asfaw; Hopper, Daniel W; Tillet, Richard L; Schlauch, Karen A; Haynes, Paul A; Fait, Aaron; Cramer, Grant R

    2015-11-16

    Grape cultivars and wines are distinguishable by their color, flavor and aroma profiles. Omic analyses (transcripts, proteins and metabolites) are powerful tools for assessing biochemical differences in biological systems. Berry skins of red- (Cabernet Sauvignon, Merlot, Pinot Noir) and white-skinned (Chardonnay, Semillon) wine grapes were harvested near optimum maturity (°Brix-to-titratable acidity ratio) from the same experimental vineyard. The cultivars were exposed to a mild, seasonal water-deficit treatment from fruit set until harvest in 2011. Identical sample aliquots were analyzed for transcripts by grapevine whole-genome oligonucleotide microarray and RNAseq technologies, proteins by nano-liquid chromatography-mass spectroscopy, and metabolites by gas chromatography-mass spectroscopy and liquid chromatography-mass spectroscopy. Principal components analysis of each of five Omic technologies showed similar results across cultivars in all Omic datasets. Comparison of the processed data of genes mapped in RNAseq and microarray data revealed a strong Pearson's correlation (0.80). The exclusion of probesets associated with genes with potential for cross-hybridization on the microarray improved the correlation to 0.93. The overall concordance of protein with transcript data was low with a Pearson's correlation of 0.27 and 0.24 for the RNAseq and microarray data, respectively. Integration of metabolite with protein and transcript data produced an expected model of phenylpropanoid biosynthesis, which distinguished red from white grapes, yet provided detail of individual cultivar differences. The mild water deficit treatment did not significantly alter the abundance of proteins or metabolites measured in the five cultivars, but did have a small effect on gene expression. The five Omic technologies were consistent in distinguishing cultivar variation. There was high concordance between transcriptomic technologies, but generally protein abundance did not correlate well with transcript abundance. The integration of multiple high-throughput Omic datasets revealed complex biochemical variation amongst five cultivars of an ancient and economically important crop species.

  2. GWAS in a Box: Statistical and Visual Analytics of Structured Associations via GenAMap

    PubMed Central

    Xing, Eric P.; Curtis, Ross E.; Schoenherr, Georg; Lee, Seunghak; Yin, Junming; Puniyani, Kriti; Wu, Wei; Kinnaird, Peter

    2014-01-01

    With the continuous improvement in genotyping and molecular phenotyping technology and the decreasing typing cost, it is expected that in a few years, more and more clinical studies of complex diseases will recruit thousands of individuals for pan-omic genetic association analyses. Hence, there is a great need for algorithms and software tools that could scale up to the whole omic level, integrate different omic data, leverage rich structure information, and be easily accessible to non-technical users. We present GenAMap, an interactive analytics software platform that 1) automates the execution of principled machine learning methods that detect genome- and phenome-wide associations among genotypes, gene expression data, and clinical or other macroscopic traits, and 2) provides new visualization tools specifically designed to aid in the exploration of association mapping results. Algorithmically, GenAMap is based on a new paradigm for GWAS and PheWAS analysis, termed structured association mapping, which leverages various structures in the omic data. We demonstrate the function of GenAMap via a case study of the Brem and Kruglyak yeast dataset, and then apply it on a comprehensive eQTL analysis of the NIH heterogeneous stock mice dataset and report some interesting findings. GenAMap is available from http://sailing.cs.cmu.edu/genamap. PMID:24905018

  3. Bivalve Omics: State of the Art and Potential Applications for the Biomonitoring of Harmful Marine Compounds

    PubMed Central

    Suárez-Ulloa, Victoria; Fernández-Tajes, Juan; Manfrin, Chiara; Gerdol, Marco; Venier, Paola; Eirín-López, José M.

    2013-01-01

    The extraordinary progress experienced by sequencing technologies and bioinformatics has made the development of omic studies virtually ubiquitous in all fields of life sciences nowadays. However, scientific attention has been quite unevenly distributed throughout the different branches of the tree of life, leaving molluscs, one of the most diverse animal groups, relatively unexplored and without representation within the narrow collection of well established model organisms. Within this Phylum, bivalve molluscs play a fundamental role in the functioning of the marine ecosystem, constitute very valuable commercial resources in aquaculture, and have been widely used as sentinel organisms in the biomonitoring of marine pollution. Yet, it has only been very recently that this complex group of organisms became a preferential subject for omic studies, posing new challenges for their integrative characterization. The present contribution aims to give a detailed insight into the state of the art of the omic studies and functional information analysis of bivalve molluscs, providing a timely perspective on the available data resources and on the current and prospective applications for the biomonitoring of harmful marine compounds. PMID:24189277

  4. From Sample to Multi-Omics Conclusions in under 48 Hours

    PubMed Central

    Navas-Molina, Jose A.; Hyde, Embriette R.; Vázquez-Baeza, Yoshiki; Humphrey, Greg; Gaffney, James; Minich, Jeremiah J.; Melnik, Alexey V.; Herschend, Jakob; DeReus, Jeff; Durant, Austin; Dutton, Rachel J.; Khosroheidari, Mahdieh; Green, Clifford; da Silva, Ricardo; Dorrestein, Pieter C.; Knight, Rob

    2016-01-01

    ABSTRACT Multi-omics methods have greatly advanced our understanding of the biological organism and its microbial associates. However, they are not routinely used in clinical or industrial applications, due to the length of time required to generate and analyze omics data. Here, we applied a novel integrated omics pipeline for the analysis of human and environmental samples in under 48 h. Human subjects that ferment their own foods provided swab samples from skin, feces, oral cavity, fermented foods, and household surfaces to assess the impact of home food fermentation on their microbial and chemical ecology. These samples were analyzed with 16S rRNA gene sequencing, inferred gene function profiles, and liquid chromatography-tandem mass spectrometry (LC-MS/MS) metabolomics through the Qiita, PICRUSt, and GNPS pipelines, respectively. The human sample microbiomes clustered with the corresponding sample types in the American Gut Project (http://www.americangut.org), and the fermented food samples produced a separate cluster. The microbial communities of the household surfaces were primarily sourced from the fermented foods, and their consumption was associated with increased gut microbial diversity. Untargeted metabolomics revealed that human skin and fermented food samples had separate chemical ecologies and that stool was more similar to fermented foods than to other sample types. Metabolites from the fermented foods, including plant products such as procyanidin and pheophytin, were present in the skin and stool samples of the individuals consuming the foods. Some food metabolites were modified during digestion, and others were detected in stool intact. This study represents a first-of-its-kind analysis of multi-omics data that achieved time intervals matching those of classic microbiological culturing. IMPORTANCE Polymicrobial infections are difficult to diagnose due to the challenge in comprehensively cultivating the microbes present. Omics methods, such as 16S rRNA sequencing, metagenomics, and metabolomics, can provide a more complete picture of a microbial community and its metabolite production, without the biases and selectivity of microbial culture. However, these advanced methods have not been applied to clinical or industrial microbiology or other areas where complex microbial dysbioses require immediate intervention. The reason for this is the length of time required to generate and analyze omics data. Here, we describe the development and application of a pipeline for multi-omics data analysis in time frames matching those of the culture-based approaches often used for these applications. This study applied multi-omics methods effectively in clinically relevant time frames and sets a precedent toward their implementation in clinical medicine and industrial microbiology. PMID:27822524

  5. Integrative FourD omics approach profiles the target network of the carbon storage regulatory system

    PubMed Central

    Sowa, Steven W.; Gelderman, Grant; Leistra, Abigail N.; Buvanendiran, Aishwarya; Lipp, Sarah; Pitaktong, Areen; Vakulskas, Christopher A.; Romeo, Tony; Baldea, Michael

    2017-01-01

    Abstract Multi-target regulators represent a largely untapped area for metabolic engineering and anti-bacterial development. These regulators are complex to characterize because they often act at multiple levels, affecting proteins, transcripts and metabolites. Therefore, single omics experiments cannot profile their underlying targets and mechanisms. In this work, we used an Integrative FourD omics approach (INFO) that consists of collecting and analyzing systems data throughout multiple time points, using multiple genetic backgrounds, and multiple omics approaches (transcriptomics, proteomics and high throughput sequencing crosslinking immunoprecipitation) to evaluate simultaneous changes in gene expression after imposing an environmental stress that accentuates the regulatory features of a network. Using this approach, we profiled the targets and potential regulatory mechanisms of a global regulatory system, the well-studied carbon storage regulatory (Csr) system of Escherichia coli, which is widespread among bacteria. Using 126 sets of proteomics and transcriptomics data, we identified 136 potential direct CsrA targets, including 50 novel ones, categorized their behaviors into distinct regulatory patterns, and performed in vivo fluorescence-based follow up experiments. The results of this work validate 17 novel mRNAs as authentic direct CsrA targets and demonstrate a generalizable strategy to integrate multiple lines of omics data to identify a core pool of regulator targets. PMID:28126921

  6. Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction

    PubMed Central

    Kim, Dokyoon; Joung, Je-Gun; Sohn, Kyung-Ah; Shin, Hyunjung; Park, Yu Rang; Ritchie, Marylyn D; Kim, Ju Han

    2015-01-01

    Objective Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Methods Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Results Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Conclusions Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. PMID:25002459

  7. Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction.

    PubMed

    Kim, Dokyoon; Joung, Je-Gun; Sohn, Kyung-Ah; Shin, Hyunjung; Park, Yu Rang; Ritchie, Marylyn D; Kim, Ju Han

    2015-01-01

    Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. © The Author 2014. Published by Oxford University Press on behalf of the American Medical Informatics Association.

  8. Unveiling network-based functional features through integration of gene expression into protein networks.

    PubMed

    Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali

    2018-06-01

    Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. Metabolic Network Modeling of Microbial Communities

    PubMed Central

    Biggs, Matthew B.; Medlock, Gregory L.; Kolling, Glynis L.

    2015-01-01

    Genome-scale metabolic network reconstructions and constraint-based analysis are powerful methods that have the potential to make functional predictions about microbial communities. Current use of genome-scale metabolic networks to characterize the metabolic functions of microbial communities includes species compartmentalization, separating species-level and community-level objectives, dynamic analysis, the “enzyme-soup” approach, multi-scale modeling, and others. There are many challenges inherent to the field, including a need for tools that accurately assign high-level omics signals to individual community members, new automated reconstruction methods that rival manual curation, and novel algorithms for integrating omics data and engineering communities. As technologies and modeling frameworks improve, we expect that there will be proportional advances in the fields of ecology, health science, and microbial community engineering. PMID:26109480

  10. Single Cell Multi-Omics Technology: Methodology and Application.

    PubMed

    Hu, Youjin; An, Qin; Sheu, Katherine; Trejo, Brandon; Fan, Shuxin; Guo, Ying

    2018-01-01

    In the era of precision medicine, multi-omics approaches enable the integration of data from diverse omics platforms, providing multi-faceted insight into the interrelation of these omics layers on disease processes. Single cell sequencing technology can dissect the genotypic and phenotypic heterogeneity of bulk tissue and promises to deepen our understanding of the underlying mechanisms governing both health and disease. Through modification and combination of single cell assays available for transcriptome, genome, epigenome, and proteome profiling, single cell multi-omics approaches have been developed to simultaneously and comprehensively study not only the unique genotypic and phenotypic characteristics of single cells, but also the combined regulatory mechanisms evident only at single cell resolution. In this review, we summarize the state-of-the-art single cell multi-omics methods and discuss their applications, challenges, and future directions.

  11. Single Cell Multi-Omics Technology: Methodology and Application

    PubMed Central

    Hu, Youjin; An, Qin; Sheu, Katherine; Trejo, Brandon; Fan, Shuxin; Guo, Ying

    2018-01-01

    In the era of precision medicine, multi-omics approaches enable the integration of data from diverse omics platforms, providing multi-faceted insight into the interrelation of these omics layers on disease processes. Single cell sequencing technology can dissect the genotypic and phenotypic heterogeneity of bulk tissue and promises to deepen our understanding of the underlying mechanisms governing both health and disease. Through modification and combination of single cell assays available for transcriptome, genome, epigenome, and proteome profiling, single cell multi-omics approaches have been developed to simultaneously and comprehensively study not only the unique genotypic and phenotypic characteristics of single cells, but also the combined regulatory mechanisms evident only at single cell resolution. In this review, we summarize the state-of-the-art single cell multi-omics methods and discuss their applications, challenges, and future directions. PMID:29732369

  12. Plant Omics Data Center: an integrated web repository for interspecies gene expression networks with NLP-based curation.

    PubMed

    Ohyanagi, Hajime; Takano, Tomoyuki; Terashima, Shin; Kobayashi, Masaaki; Kanno, Maasa; Morimoto, Kyoko; Kanegae, Hiromi; Sasaki, Yohei; Saito, Misa; Asano, Satomi; Ozaki, Soichi; Kudo, Toru; Yokoyama, Koji; Aya, Koichiro; Suwabe, Keita; Suzuki, Go; Aoki, Koh; Kubo, Yasutaka; Watanabe, Masao; Matsuoka, Makoto; Yano, Kentaro

    2015-01-01

    Comprehensive integration of large-scale omics resources such as genomes, transcriptomes and metabolomes will provide deeper insights into broader aspects of molecular biology. For better understanding of plant biology, we aim to construct a next-generation sequencing (NGS)-derived gene expression network (GEN) repository for a broad range of plant species. So far we have incorporated information about 745 high-quality mRNA sequencing (mRNA-Seq) samples from eight plant species (Arabidopsis thaliana, Oryza sativa, Solanum lycopersicum, Sorghum bicolor, Vitis vinifera, Solanum tuberosum, Medicago truncatula and Glycine max) from the public short read archive, digitally profiled the entire set of gene expression profiles, and drawn GENs by using correspondence analysis (CA) to take advantage of gene expression similarities. In order to understand the evolutionary significance of the GENs from multiple species, they were linked according to the orthology of each node (gene) among species. In addition to other gene expression information, functional annotation of the genes will facilitate biological comprehension. Currently we are improving the given gene annotations with natural language processing (NLP) techniques and manual curation. Here we introduce the current status of our analyses and the web database, PODC (Plant Omics Data Center; http://bioinf.mind.meiji.ac.jp/podc/), now open to the public, providing GENs, functional annotations and additional comprehensive omics resources. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.

  13. Integrative Analysis of “-Omics” Data Using Penalty Functions

    PubMed Central

    Zhao, Qing; Shi, Xingjie; Huang, Jian; Liu, Jin; Li, Yang; Ma, Shuangge

    2014-01-01

    In the analysis of omics data, integrative analysis provides an effective way of pooling information across multiple datasets or multiple correlated responses, and can be more effective than single-dataset (response) analysis. Multiple families of integrative analysis methods have been proposed in the literature. The current review focuses on the penalization methods. Special attention is paid to sparse meta-analysis methods that pool summary statistics across datasets, and integrative analysis methods that pool raw data across datasets. We discuss their formulation and rationale. Beyond “standard” penalized selection, we also review contrasted penalization and Laplacian penalization which accommodate finer data structures. The computational aspects, including computational algorithms and tuning parameter selection, are examined. This review concludes with possible limitations and extensions. PMID:25691921

  14. Big Data Transforms Discovery-Utilization Therapeutics Continuum.

    PubMed

    Waldman, S A; Terzic, A

    2016-03-01

    Enabling omic technologies adopt a holistic view to produce unprecedented insights into the molecular underpinnings of health and disease, in part, by generating massive high-dimensional biological data. Leveraging these systems-level insights as an engine driving the healthcare evolution is maximized through integration with medical, demographic, and environmental datasets from individuals to populations. Big data analytics has accordingly emerged to add value to the technical aspects of storage, transfer, and analysis required for merging vast arrays of omic-, clinical-, and eco-datasets. In turn, this new field at the interface of biology, medicine, and information science is systematically transforming modern therapeutics across discovery, development, regulation, and utilization. © 2015 ASCPT.

  15. Accessing and Integrating Data and Knowledge for Biomedical Research

    PubMed Central

    Burgun, A.; Bodenreider, O.

    2008-01-01

    Summary Objectives To review the issues that have arisen with the advent of translational research in terms of integration of data and knowledge, and survey current efforts to address these issues. Methods Using examples form the biomedical literature, we identified new trends in biomedical research and their impact on bioinformatics. We analyzed the requirements for effective knowledge repositories and studied issues in the integration of biomedical knowledge. Results New diagnostic and therapeutic approaches based on gene expression patterns have brought about new issues in the statistical analysis of data, and new workflows are needed are needed to support translational research. Interoperable data repositories based on standard annotations, infrastructures and services are needed to support the pooling and meta-analysis of data, as well as their comparison to earlier experiments. High-quality, integrated ontologies and knowledge bases serve as a source of prior knowledge used in combination with traditional data mining techniques and contribute to the development of more effective data analysis strategies. Conclusion As biomedical research evolves from traditional clinical and biological investigations towards omics sciences and translational research, specific needs have emerged, including integrating data collected in research studies with patient clinical data, linking omics knowledge with medical knowledge, modeling the molecular basis of diseases, and developing tools that support in-depth analysis of research data. As such, translational research illustrates the need to bridge the gap between bioinformatics and medical informatics, and opens new avenues for biomedical informatics research. PMID:18660883

  16. Contributions of polyunsaturated fatty acids (PUFA) on cerebral neurobiology: an integrated omics approach with epigenomic focus

    DTIC Science & Technology

    2016-12-01

    acids (PUFA) on cerebral neurobiology: an integrated omics approach with epigenomic focus Nabarun Chakrabortya,b, Seid Muhiea,b, Raina Kumara,c, Aarti...C57BL/6j mice fed on any of these three diets from their neonatal age to midlife. Integrating the multiomics data, we focused on the genes encoding both...been evaluated in the context of a wide variety of health issues [23]. The escalated risks of pathological and psychological disease have been

  17. Integrative FourD omics approach profiles the target network of the carbon storage regulatory system.

    PubMed

    Sowa, Steven W; Gelderman, Grant; Leistra, Abigail N; Buvanendiran, Aishwarya; Lipp, Sarah; Pitaktong, Areen; Vakulskas, Christopher A; Romeo, Tony; Baldea, Michael; Contreras, Lydia M

    2017-02-28

    Multi-target regulators represent a largely untapped area for metabolic engineering and anti-bacterial development. These regulators are complex to characterize because they often act at multiple levels, affecting proteins, transcripts and metabolites. Therefore, single omics experiments cannot profile their underlying targets and mechanisms. In this work, we used an Integrative FourD omics approach (INFO) that consists of collecting and analyzing systems data throughout multiple time points, using multiple genetic backgrounds, and multiple omics approaches (transcriptomics, proteomics and high throughput sequencing crosslinking immunoprecipitation) to evaluate simultaneous changes in gene expression after imposing an environmental stress that accentuates the regulatory features of a network. Using this approach, we profiled the targets and potential regulatory mechanisms of a global regulatory system, the well-studied carbon storage regulatory (Csr) system of Escherichia coli, which is widespread among bacteria. Using 126 sets of proteomics and transcriptomics data, we identified 136 potential direct CsrA targets, including 50 novel ones, categorized their behaviors into distinct regulatory patterns, and performed in vivo fluorescence-based follow up experiments. The results of this work validate 17 novel mRNAs as authentic direct CsrA targets and demonstrate a generalizable strategy to integrate multiple lines of omics data to identify a core pool of regulator targets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Reproducibility and Transparency of Omics Research - Impacts on Human Health Risk Assessment

    EPA Science Inventory

    Omics technologies are becoming more widely used in toxicology, necessitating their consideration in human health hazard and risk assessment programs. Today, risk assessors in the United States Environmental Protection Agency’s Integrated Risk Information System (IRIS) Toxicologi...

  19. Framework for the quantitative weight-of-evidence analysis of 'omics data for regulatory purposes.

    PubMed

    Bridges, Jim; Sauer, Ursula G; Buesen, Roland; Deferme, Lize; Tollefsen, Knut E; Tralau, Tewes; van Ravenzwaay, Ben; Poole, Alan; Pemberton, Mark

    2017-12-01

    A framework for the quantitative weight-of-evidence (QWoE) analysis of 'omics data for regulatory purposes is presented. The QWoE framework encompasses seven steps to evaluate 'omics data (also together with non-'omics data): (1) Hypothesis formulation, identification and weighting of lines of evidence (LoEs). LoEs conjoin different (types of) studies that are used to critically test the hypothesis. As an essential component of the QWoE framework, step 1 includes the development of templates for scoring sheets that predefine scoring criteria with scores of 0-4 to enable a quantitative determination of study quality and data relevance; (2) literature searches and categorisation of studies into the pre-defined LoEs; (3) and (4) quantitative assessment of study quality and data relevance using the respective pre-defined scoring sheets for each study; (5) evaluation of LoE-specific strength of evidence based upon the study quality and study relevance scores of the studies conjoined in the respective LoE; (6) integration of the strength of evidence from the individual LoEs to determine the overall strength of evidence; (7) characterisation of uncertainties and conclusion on the QWoE. To put the QWoE framework in practice, case studies are recommended to confirm the relevance of its different steps, or to adapt them as necessary. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  20. An Integrative Bioinformatics Approach for Knowledge Discovery

    NASA Astrophysics Data System (ADS)

    Peña-Castillo, Lourdes; Phan, Sieu; Famili, Fazel

    The vast amount of data being generated by large scale omics projects and the computational approaches developed to deal with this data have the potential to accelerate the advancement of our understanding of the molecular basis of genetic diseases. This better understanding may have profound clinical implications and transform the medical practice; for instance, therapeutic management could be prescribed based on the patient’s genetic profile instead of being based on aggregate data. Current efforts have established the feasibility and utility of integrating and analysing heterogeneous genomic data to identify molecular associations to pathogenesis. However, since these initiatives are data-centric, they either restrict the research community to specific data sets or to a certain application domain, or force researchers to develop their own analysis tools. To fully exploit the potential of omics technologies, robust computational approaches need to be developed and made available to the community. This research addresses such challenge and proposes an integrative approach to facilitate knowledge discovery from diverse datasets and contribute to the advancement of genomic medicine.

  1. Integrated omics for the identification of key functionalities in biological wastewater treatment microbial communities.

    PubMed

    Narayanasamy, Shaman; Muller, Emilie E L; Sheik, Abdul R; Wilmes, Paul

    2015-05-01

    Biological wastewater treatment plants harbour diverse and complex microbial communities which prominently serve as models for microbial ecology and mixed culture biotechnological processes. Integrated omic analyses (combined metagenomics, metatranscriptomics, metaproteomics and metabolomics) are currently gaining momentum towards providing enhanced understanding of community structure, function and dynamics in situ as well as offering the potential to discover novel biological functionalities within the framework of Eco-Systems Biology. The integration of information from genome to metabolome allows the establishment of associations between genetic potential and final phenotype, a feature not realizable by only considering single 'omes'. Therefore, in our opinion, integrated omics will become the future standard for large-scale characterization of microbial consortia including those underpinning biological wastewater treatment processes. Systematically obtained time and space-resolved omic datasets will allow deconvolution of structure-function relationships by identifying key members and functions. Such knowledge will form the foundation for discovering novel genes on a much larger scale compared with previous efforts. In general, these insights will allow us to optimize microbial biotechnological processes either through better control of mixed culture processes or by use of more efficient enzymes in bioengineering applications. © 2015 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.

  2. An Updated Taxonomy and a Graphical Summary Tool for Optimal Classification and Comprehension of Omics Research.

    PubMed

    Pirih, Nina; Kunej, Tanja

    2018-05-01

    The volume of publications and the type of research approaches used in omics system sciences are vast and continue to expand rapidly. This increased complexity and heterogeneity of omics data are challenging data extraction, sensemaking, analyses, knowledge translation, and interpretation. An extended and dynamic taxonomy for the classification and summary of omics studies are essential. We present an updated taxonomy for classification of omics research studies based on four criteria: (1) type and number of genomic loci in a research study, (2) number of species and biological samples, (3) the type of omics technology (e.g., genomics, transcriptomics, and proteomics) and omics technology application type (e.g., pharmacogenomics and nutrigenomics), and (4) phenotypes. In addition, we present a graphical summary approach that enables the researchers to define the main characteristics of their study in a single figure, and offers the readers to rapidly grasp the published study and omics data. We searched the PubMed and the Web of Science from 09/2002 to 02/2018, including research and review articles, and identified 90 scientific publications. We propose a call toward omics studies' standardization for reporting in scientific literature. We anticipate the proposed classification scheme will usefully contribute to improved classification of published reports in genomics and other omics fields, and help data extraction from publications for future multiomics data integration.

  3. Incorporating deep learning into the analysis of diverse livestock data

    USDA-ARS?s Scientific Manuscript database

    Technological advances in high-throughput phenotyping and multiple omics fields have led to an explosion in the volume of data across the whole spectrum of biology, allowing researchers to integrate data of different types to inform hypotheses and expand the scope of their research questions. Howeve...

  4. Brain Radiation Information Data Exchange (BRIDE): integration of experimental data from low-dose ionising radiation research for pathway discovery.

    PubMed

    Karapiperis, Christos; Kempf, Stefan J; Quintens, Roel; Azimzadeh, Omid; Vidal, Victoria Linares; Pazzaglia, Simonetta; Bazyka, Dimitry; Mastroberardino, Pier G; Scouras, Zacharias G; Tapio, Soile; Benotmane, Mohammed Abderrafi; Ouzounis, Christos A

    2016-05-11

    The underlying molecular processes representing stress responses to low-dose ionising radiation (LDIR) in mammals are just beginning to be understood. In particular, LDIR effects on the brain and their possible association with neurodegenerative disease are currently being explored using omics technologies. We describe a light-weight approach for the storage, analysis and distribution of relevant LDIR omics datasets. The data integration platform, called BRIDE, contains information from the literature as well as experimental information from transcriptomics and proteomics studies. It deploys a hybrid, distributed solution using both local storage and cloud technology. BRIDE can act as a knowledge broker for LDIR researchers, to facilitate molecular research on the systems biology of LDIR response in mammals. Its flexible design can capture a range of experimental information for genomics, epigenomics, transcriptomics, and proteomics. The data collection is available at: .

  5. Integrating Omics Technologies to Study Pulmonary Physiology and Pathology at the Systems Level

    PubMed Central

    Pathak, Ravi Ramesh; Davé, Vrushank

    2014-01-01

    Assimilation and integration of “omics” technologies, including genomics, epigenomics, proteomics, and metabolomics has readily altered the landscape of medical research in the last decade. The vast and complex nature of omics data can only be interpreted by linking molecular information at the organismic level, forming the foundation of systems biology. Research in pulmonary biology/medicine has necessitated integration of omics, network, systems and computational biology data to differentially diagnose, interpret, and prognosticate pulmonary diseases, facilitating improvement in therapy and treatment modalities. This review describes how to leverage this emerging technology in understanding pulmonary diseases at the systems level –called a “systomic” approach. Considering the operational wholeness of cellular and organ systems, diseased genome, proteome, and the metabolome needs to be conceptualized at the systems level to understand disease pathogenesis and progression. Currently available omics technology and resources require a certain degree of training and proficiency in addition to dedicated hardware and applications, making them relatively less user friendly for the pulmonary biologist and clinicians. Herein, we discuss the various strategies, computational tools and approaches required to study pulmonary diseases at the systems level for biomedical scientists and clinical researchers. PMID:24802001

  6. Multi-platform 'Omics Analysis of Human Ebola Virus Disease Pathogenesis.

    PubMed

    Eisfeld, Amie J; Halfmann, Peter J; Wendler, Jason P; Kyle, Jennifer E; Burnum-Johnson, Kristin E; Peralta, Zuleyma; Maemura, Tadashi; Walters, Kevin B; Watanabe, Tokiko; Fukuyama, Satoshi; Yamashita, Makoto; Jacobs, Jon M; Kim, Young-Mo; Casey, Cameron P; Stratton, Kelly G; Webb-Robertson, Bobbie-Jo M; Gritsenko, Marina A; Monroe, Matthew E; Weitz, Karl K; Shukla, Anil K; Tian, Mingyuan; Neumann, Gabriele; Reed, Jennifer L; van Bakel, Harm; Metz, Thomas O; Smith, Richard D; Waters, Katrina M; N'jai, Alhaji; Sahr, Foday; Kawaoka, Yoshihiro

    2017-12-13

    The pathogenesis of human Ebola virus disease (EVD) is complex. EVD is characterized by high levels of virus replication and dissemination, dysregulated immune responses, extensive virus- and host-mediated tissue damage, and disordered coagulation. To clarify how host responses contribute to EVD pathophysiology, we performed multi-platform 'omics analysis of peripheral blood mononuclear cells and plasma from EVD patients. Our results indicate that EVD molecular signatures overlap with those of sepsis, imply that pancreatic enzymes contribute to tissue damage in fatal EVD, and suggest that Ebola virus infection may induce aberrant neutrophils whose activity could explain hallmarks of fatal EVD. Moreover, integrated biomarker prediction identified putative biomarkers from different data platforms that differentiated survivors and fatalities early after infection. This work reveals insight into EVD pathogenesis, suggests an effective approach for biomarker identification, and provides an important community resource for further analysis of human EVD severity. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Current dichotomy between traditional molecular biological and omic research in cancer biology and pharmacology.

    PubMed

    Reinhold, William C

    2015-12-10

    There is currently a split within the cancer research community between traditional molecular biological hypothesis-driven and the more recent "omic" forms or research. While the molecular biological approach employs the tried and true single alteration-single response formulations of experimentation, the omic employs broad-based assay or sample collection approaches that generate large volumes of data. How to integrate the benefits of these two approaches in an efficient and productive fashion remains an outstanding issue. Ideally, one would merge the understandability, exactness, simplicity, and testability of the molecular biological approach, with the larger amounts of data, simultaneous consideration of multiple alterations, consideration of genes both of known interest along with the novel, cross-sample comparisons among cell lines and patient samples, and consideration of directed questions while simultaneously gaining exposure to the novel provided by the omic approach. While at the current time integration of the two disciplines remains problematic, attempts to do so are ongoing, and will be necessary for the understanding of the large cell line screens including the Developmental Therapeutics Program's NCI-60, the Broad Institute's Cancer Cell Line Encyclopedia, and the Wellcome Trust Sanger Institute's Cancer Genome Project, as well as the the Cancer Genome Atlas clinical samples project. Going forward there is significant benefit to be had from the integration of the molecular biological and the omic forms or research, with the desired goal being improved translational understanding and application.

  8. Integrating Omic Technologies into Aquatic Ecological Risk Assessment and Environmental Monitoring: Hurdles, Achievements and Future Outlook

    EPA Science Inventory

    In this commentary we present the findings from an international consortium on fish toxicogenomics sponsored by the UK Natural Environment Research Council (NERC) with an objective of moving omic technologies into chemical risk assessment and environmental monitoring. Objectiv...

  9. Integrating Omic Technologies into Aquatic Ecological Risk Assessment and Environmental Monitoring: Hurdles, Achievements and Future Outlook

    EPA Science Inventory

    Background: In this commentary we present the findings from an international consortium on fish toxicogenomics sponsored by the UK Natural Environment Research Council (NERC) with a remit of moving omic technologies into chemical risk assessment and environmental monitoring. Obj...

  10. Integrative analysis of transcriptomic and metabolomic data via sparse canonical correlation analysis with incorporation of biological information.

    PubMed

    Safo, Sandra E; Li, Shuzhao; Long, Qi

    2018-03-01

    Integrative analysis of high dimensional omics data is becoming increasingly popular. At the same time, incorporating known functional relationships among variables in analysis of omics data has been shown to help elucidate underlying mechanisms for complex diseases. In this article, our goal is to assess association between transcriptomic and metabolomic data from a Predictive Health Institute (PHI) study that includes healthy adults at a high risk of developing cardiovascular diseases. Adopting a strategy that is both data-driven and knowledge-based, we develop statistical methods for sparse canonical correlation analysis (CCA) with incorporation of known biological information. Our proposed methods use prior network structural information among genes and among metabolites to guide selection of relevant genes and metabolites in sparse CCA, providing insight on the molecular underpinning of cardiovascular disease. Our simulations demonstrate that the structured sparse CCA methods outperform several existing sparse CCA methods in selecting relevant genes and metabolites when structural information is informative and are robust to mis-specified structural information. Our analysis of the PHI study reveals that a number of gene and metabolic pathways including some known to be associated with cardiovascular diseases are enriched in the set of genes and metabolites selected by our proposed approach. © 2017, The International Biometric Society.

  11. Omics AnalySIs System for PRecision Oncology (OASISPRO): A Web-based Omics Analysis Tool for Clinical Phenotype Prediction.

    PubMed

    Yu, Kun-Hsing; Fitzpatrick, Michael R; Pappas, Luke; Chan, Warren; Kung, Jessica; Snyder, Michael

    2017-09-12

    Precision oncology is an approach that accounts for individual differences to guide cancer management. Omics signatures have been shown to predict clinical traits for cancer patients. However, the vast amount of omics information poses an informatics challenge in systematically identifying patterns associated with health outcomes, and no general-purpose data-mining tool exists for physicians, medical researchers, and citizen scientists without significant training in programming and bioinformatics. To bridge this gap, we built the Omics AnalySIs System for PRecision Oncology (OASISPRO), a web-based system to mine the quantitative omics information from The Cancer Genome Atlas (TCGA). This system effectively visualizes patients' clinical profiles, executes machine-learning algorithms of choice on the omics data, and evaluates the prediction performance using held-out test sets. With this tool, we successfully identified genes strongly associated with tumor stage, and accurately predicted patients' survival outcomes in many cancer types, including mesothelioma and adrenocortical carcinoma. By identifying the links between omics and clinical phenotypes, this system will facilitate omics studies on precision cancer medicine and contribute to establishing personalized cancer treatment plans. This web-based tool is available at http://tinyurl.com/oasispro ;source codes are available at http://tinyurl.com/oasisproSourceCode . © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  12. An Integrated Systems Genetics and Omics Toolkit to Probe Gene Function.

    PubMed

    Li, Hao; Wang, Xu; Rukina, Daria; Huang, Qingyao; Lin, Tao; Sorrentino, Vincenzo; Zhang, Hongbo; Bou Sleiman, Maroun; Arends, Danny; McDaid, Aaron; Luan, Peiling; Ziari, Naveed; Velázquez-Villegas, Laura A; Gariani, Karim; Kutalik, Zoltan; Schoonjans, Kristina; Radcliffe, Richard A; Prins, Pjotr; Morgenthaler, Stephan; Williams, Robert W; Auwerx, Johan

    2018-01-24

    Identifying genetic and environmental factors that impact complex traits and common diseases is a high biomedical priority. Here, we developed, validated, and implemented a series of multi-layered systems approaches, including (expression-based) phenome-wide association, transcriptome-/proteome-wide association, and (reverse-) mediation analysis, in an open-access web server (systems-genetics.org) to expedite the systems dissection of gene function. We applied these approaches to multi-omics datasets from the BXD mouse genetic reference population, and identified and validated associations between genes and clinical and molecular phenotypes, including previously unreported links between Rpl26 and body weight, and Cpt1a and lipid metabolism. Furthermore, through mediation and reverse-mediation analysis we established regulatory relations between genes, such as the co-regulation of BCKDHA and BCKDHB protein levels, and identified targets of transcription factors E2F6, ZFP277, and ZKSCAN1. Our multifaceted toolkit enabled the identification of gene-gene and gene-phenotype links that are robust and that translate well across populations and species, and can be universally applied to any populations with multi-omics datasets. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Second Era of OMICS in Caries Research: Moving Past the Phase of Disillusionment.

    PubMed

    Nascimento, M M; Zaura, E; Mira, A; Takahashi, N; Ten Cate, J M

    2017-07-01

    Novel approaches using OMICS techniques enable a collective assessment of multiple related biological units, including genes, gene expression, proteins, and metabolites. In the past decade, next-generation sequencing ( NGS) technologies were improved by longer sequence reads and the development of genome databases and user-friendly pipelines for data analysis, all accessible at lower cost. This has generated an outburst of high-throughput data. The application of OMICS has provided more depth to existing hypotheses as well as new insights in the etiology of dental caries. For example, the determination of complete bacterial microbiomes of oral samples rather than selected species, together with oral metatranscriptome and metabolome analyses, supports the viewpoint of dysbiosis of the supragingival biofilms. In addition, metabolome studies have been instrumental in disclosing the contributions of major pathways for central carbon and amino acid metabolisms to biofilm pH homeostasis. New, often noncultured, oral streptococci have been identified, and their phenotypic characterization has revealed candidates for probiotic therapy. Although findings from OMICS research have been greatly informative, problems related to study design, data quality, integration, and reproducibility still need to be addressed. Also, the emergence and continuous updates of these computationally demanding technologies require expertise in advanced bioinformatics for reliable interpretation of data. Despite the obstacles cited above, OMICS research is expected to encourage the discovery of novel caries biomarkers and the development of next-generation diagnostics and therapies for caries control. These observations apply equally to the study of other oral diseases.

  14. Clinical multi-omics strategies for the effective cancer management.

    PubMed

    Yoo, Byong Chul; Kim, Kyung-Hee; Woo, Sang Myung; Myung, Jae Kyung

    2017-08-15

    Cancer is a global health issue as a multi-factorial complex disease, and early detection and novel therapeutic strategies are required for more effective cancer management. With the development of systemic analytical -omics strategies, the therapeutic approach and study of the molecular mechanisms of carcinogenesis and cancer progression have moved from hypothesis-driven targeted investigations to data-driven untargeted investigations focusing on the integrated diagnosis, treatment, and prevention of cancer in individual patients. Predictive, preventive, and personalized medicine (PPPM) is a promising new approach to reduce the burden of cancer and facilitate more accurate prognosis, diagnosis, as well as effective treatment. Here we review the fundamentals of, and new developments in, -omics technologies, together with the key role of a variety of practical -omics strategies in PPPM for cancer treatment and diagnosis. In this review, a comprehensive and critical overview of the systematic strategy for predictive, preventive, and personalized medicine (PPPM) for cancer disease was described in a view of cancer prognostic prediction, diagnostics, and prevention as well as cancer therapy and drug responses. We have discussed multi-dimensional data obtained from various resources and integration of multisciplinary -omics strategies with computational method which could contribute the more effective PPPM for cancer. This review has provided the novel insights of the current applications of each and combined -omics technologies, which showed their powerful potential for the establishment of PPPM for cancer. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. An inference method from multi-layered structure of biomedical data.

    PubMed

    Kim, Myungjun; Nam, Yonghyun; Shin, Hyunjung

    2017-05-18

    Biological system is a multi-layered structure of omics with genome, epigenome, transcriptome, metabolome, proteome, etc., and can be further stretched to clinical/medical layers such as diseasome, drugs, and symptoms. One advantage of omics is that we can figure out an unknown component or its trait by inferring from known omics components. The component can be inferred by the ones in the same level of omics or the ones in different levels. To implement the inference process, an algorithm that can be applied to the multi-layered complex system is required. In this study, we develop a semi-supervised learning algorithm that can be applied to the multi-layered complex system. In order to verify the validity of the inference, it was applied to the prediction problem of disease co-occurrence with a two-layered network composed of symptom-layer and disease-layer. The symptom-disease layered network obtained a fairly high value of AUC, 0.74, which is regarded as noticeable improvement when comparing 0.59 AUC of single-layered disease network. If further stretched to whole layered structure of omics, the proposed method is expected to produce more promising results. This research has novelty in that it is a new integrative algorithm that incorporates the vertical structure of omics data, on contrary to other existing methods that integrate the data in parallel fashion. The results can provide enhanced guideline for disease co-occurrence prediction, thereby serve as a valuable tool for inference process of multi-layered biological system.

  16. Taking Bioinformatics to Systems Medicine.

    PubMed

    van Kampen, Antoine H C; Moerland, Perry D

    2016-01-01

    Systems medicine promotes a range of approaches and strategies to study human health and disease at a systems level with the aim of improving the overall well-being of (healthy) individuals, and preventing, diagnosing, or curing disease. In this chapter we discuss how bioinformatics critically contributes to systems medicine. First, we explain the role of bioinformatics in the management and analysis of data. In particular we show the importance of publicly available biological and clinical repositories to support systems medicine studies. Second, we discuss how the integration and analysis of multiple types of omics data through integrative bioinformatics may facilitate the determination of more predictive and robust disease signatures, lead to a better understanding of (patho)physiological molecular mechanisms, and facilitate personalized medicine. Third, we focus on network analysis and discuss how gene networks can be constructed from omics data and how these networks can be decomposed into smaller modules. We discuss how the resulting modules can be used to generate experimentally testable hypotheses, provide insight into disease mechanisms, and lead to predictive models. Throughout, we provide several examples demonstrating how bioinformatics contributes to systems medicine and discuss future challenges in bioinformatics that need to be addressed to enable the advancement of systems medicine.

  17. Applied choline-omics: lessons from human metabolic studies for the integration of genomics research into nutrition practice.

    PubMed

    West, Allyson A; Caudill, Marie A

    2014-08-01

    Nutritional genomics, defined as the study of reciprocal interactions among nutrients, metabolic intermediates, and the genome, along with other closely related nutritional -omic fields (eg, epigenomics, transcriptomics, and metabolomics) have become vital areas of nutrition study and knowledge. Utilizing results from human metabolic research on the essential nutrient choline, this article illustrates how nutrigenetic, nutrigenomic, and inter-related -omic research has provided new insights into choline metabolism and its effect on physiologic processes. Findings from highlighted choline research are also discussed in the context of translation to clinical and public health nutrition applications. Overall, this article underscores the utility of -omic research methods in elucidating nutrient metabolism as well as the potential for nutritional -omic concepts and discoveries to be broadly applied in nutritional practice. Copyright © 2014 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.

  18. Making the Most of Omics for Symbiosis Research

    PubMed Central

    Chaston, J.; Douglas, A.E.

    2012-01-01

    Omics, including genomics, proteomics and metabolomics, enable us to explain symbioses in terms of the underlying molecules and their interactions. The central task is to transform molecular catalogs of genes, metabolites etc. into a dynamic understanding of symbiosis function. We review four exemplars of omics studies that achieve this goal, through defined biological questions relating to metabolic integration and regulation of animal-microbial symbioses, the genetic autonomy of bacterial symbionts, and symbiotic protection of animal hosts from pathogens. As omic datasets become increasingly complex, computationally-sophisticated downstream analyses are essential to reveal interactions not evident to visual inspection of the data. We discuss two approaches, phylogenomics and transcriptional clustering, that can divide the primary output of omics studies – long lists of factors – into manageable subsets, and we describe how they have been applied to analyze large datasets and generate testable hypotheses. PMID:22983030

  19. Multi-omics approach to elucidate the gut microbiota activity: Metaproteomics and metagenomics connection.

    PubMed

    Guirro, Maria; Costa, Andrea; Gual-Grau, Andreu; Mayneris-Perxachs, Jordi; Torrell, Helena; Herrero, Pol; Canela, Núria; Arola, Lluís

    2018-02-10

    Over the last few years, the application of high-throughput meta-omics methods has provided great progress in improving the knowledge of the gut ecosystem and linking its biodiversity to host health conditions, offering complementary support to classical microbiology. Gut microbiota plays a crucial role in relevant diseases such as obesity or cardiovascular disease (CVD), and its regulation is closely influenced by several factors, such as dietary composition. In fact, polyphenol-rich diets are the most palatable treatment to prevent hypertension associated with CVD, although the polyphenol-microbiota interactions have not been completely elucidated. For this reason, the aim of this study was to evaluate microbiota effect in obese rats supplemented by hesperidin, after being fed with cafeteria or standard diet, using a multi meta-omics approaches combining strategy of metagenomics and metaproteomics analysis. We reported that cafeteria diet induces obesity, resulting in changes in the microbiota composition, which are related to functional alterations at proteome level. In addition, hesperidin supplementation alters microbiota diversity and also proteins involved in important metabolic pathways. Overall, going deeper into strategies to integrate omics sciences is necessary to understand the complex relationships between the host, gut microbiota, and diet. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. From Genomics to Omics Landscapes of Parkinson's Disease: Revealing the Molecular Mechanisms

    PubMed Central

    Redenšek, Sara; Dolžan, Vita

    2018-01-01

    Abstract Molecular mechanisms of Parkinson's disease (PD) have already been investigated in various different omics landscapes. We reviewed the literature about different omics approaches between November 2005 and November 2017 to depict the main pathological pathways for PD development. In total, 107 articles exploring different layers of omics data associated with PD were retrieved. The studies were grouped into 13 omics layers: genomics–DNA level, transcriptomics, epigenomics, proteomics, ncRNomics, interactomics, metabolomics, glycomics, lipidomics, phenomics, environmental omics, pharmacogenomics, and integromics. We discussed characteristics of studies from different landscapes, such as main findings, number of participants, sample type, methodology, and outcome. We also performed curation and preliminary synthesis of multiple omics data, and identified overlapping results, which could lead toward selection of biomarkers for further validation of PD risk loci. Biomarkers could support the development of targeted prognostic/diagnostic panels as a tool for early diagnosis and prediction of progression rate and prognosis. This review presents an example of a comprehensive approach to revealing the underlying processes and risk factors of a complex disease. It urges scientists to structure the already known data and integrate it into a meaningful context. PMID:29356624

  1. A generic Transcriptomics Reporting Framework (TRF) for 'omics data processing and analysis.

    PubMed

    Gant, Timothy W; Sauer, Ursula G; Zhang, Shu-Dong; Chorley, Brian N; Hackermüller, Jörg; Perdichizzi, Stefania; Tollefsen, Knut E; van Ravenzwaay, Ben; Yauk, Carole; Tong, Weida; Poole, Alan

    2017-12-01

    A generic Transcriptomics Reporting Framework (TRF) is presented that lists parameters that should be reported in 'omics studies used in a regulatory context. The TRF encompasses the processes from transcriptome profiling from data generation to a processed list of differentially expressed genes (DEGs) ready for interpretation. Included within the TRF is a reference baseline analysis (RBA) that encompasses raw data selection; data normalisation; recognition of outliers; and statistical analysis. The TRF itself does not dictate the methodology for data processing, but deals with what should be reported. Its principles are also applicable to sequencing data and other 'omics. In contrast, the RBA specifies a simple data processing and analysis methodology that is designed to provide a comparison point for other approaches and is exemplified here by a case study. By providing transparency on the steps applied during 'omics data processing and analysis, the TRF will increase confidence processing of 'omics data, and regulatory use. Applicability of the TRF is ensured by its simplicity and generality. The TRF can be applied to all types of regulatory 'omics studies, and it can be executed using different commonly available software tools. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.

  2. Advances in Integrating Traditional and Omic Biomarkers When Analyzing the Effects of the Mediterranean Diet Intervention in Cardiovascular Prevention

    PubMed Central

    Fitó, Montserrat; Melander, Olle; Martínez, José Alfredo; Toledo, Estefanía; Carpéné, Christian; Corella, Dolores

    2016-01-01

    Intervention with Mediterranean diet (MedDiet) has provided a high level of evidence in primary prevention of cardiovascular events. Besides enhancing protection from classical risk factors, an improvement has also been described in a number of non-classical ones. Benefits have been reported on biomarkers of oxidation, inflammation, cellular adhesion, adipokine production, and pro-thrombotic state. Although the benefits of the MedDiet have been attributed to its richness in antioxidants, the mechanisms by which it exercises its beneficial effects are not well known. It is thought that the integration of omics including genomics, transcriptomics, epigenomics, and metabolomics, into studies analyzing nutrition and cardiovascular diseases will provide new clues regarding these mechanisms. However, omics integration is still in its infancy. Currently, some single-omics analyses have provided valuable data, mostly in the field of genomics. Thus, several gene-diet interactions in determining both intermediate (plasma lipids, etc.) and final cardiovascular phenotypes (stroke, myocardial infarction, etc.) have been reported. However, few studies have analyzed changes in gene expression and, moreover very few have focused on epigenomic or metabolomic biomarkers related to the MedDiet. Nevertheless, these preliminary results can help to better understand the inter-individual differences in cardiovascular risk and dietary response for further applications in personalized nutrition. PMID:27598147

  3. Perspectives of Physiology as a Discipline from Senior-Level Millennial-Generation Students

    ERIC Educational Resources Information Center

    Steury, Michael D.; Poteracki, James M.; Kelly, Kevin L.; Wehrwein, Erica A.

    2015-01-01

    In the last several decades, there has been a shift in the mindset of research structure from classical "systems or integrative biology" to more molecular focused "-omics" study. A recent topic of debate in physiological societies has been whether or not the "-omic" revolution has delivered in its promises in both…

  4. Putative regulatory sites unraveled by network-embedded thermodynamic analysis of metabolome data

    PubMed Central

    Kümmel, Anne; Panke, Sven; Heinemann, Matthias

    2006-01-01

    As one of the most recent members of the omics family, large-scale quantitative metabolomics data are currently complementing our systems biology data pool and offer the chance to integrate the metabolite level into the functional analysis of cellular networks. Network-embedded thermodynamic analysis (NET analysis) is presented as a framework for mechanistic and model-based analysis of these data. By coupling the data to an operating metabolic network via the second law of thermodynamics and the metabolites' Gibbs energies of formation, NET analysis allows inferring functional principles from quantitative metabolite data; for example it identifies reactions that are subject to active allosteric or genetic regulation as exemplified with quantitative metabolite data from Escherichia coli and Saccharomyces cerevisiae. Moreover, the optimization framework of NET analysis was demonstrated to be a valuable tool to systematically investigate data sets for consistency, for the extension of sub-omic metabolome data sets and for resolving intracompartmental concentrations from cell-averaged metabolome data. Without requiring any kind of kinetic modeling, NET analysis represents a perfectly scalable and unbiased approach to uncover insights from quantitative metabolome data. PMID:16788595

  5. Toward the Replacement of Animal Experiments through the Bioinformatics-driven Analysis of 'Omics' Data from Human Cell Cultures.

    PubMed

    Grafström, Roland C; Nymark, Penny; Hongisto, Vesa; Spjuth, Ola; Ceder, Rebecca; Willighagen, Egon; Hardy, Barry; Kaski, Samuel; Kohonen, Pekka

    2015-11-01

    This paper outlines the work for which Roland Grafström and Pekka Kohonen were awarded the 2014 Lush Science Prize. The research activities of the Grafström laboratory have, for many years, covered cancer biology studies, as well as the development and application of toxicity-predictive in vitro models to determine chemical safety. Through the integration of in silico analyses of diverse types of genomics data (transcriptomic and proteomic), their efforts have proved to fit well into the recently-developed Adverse Outcome Pathway paradigm. Genomics analysis within state-of-the-art cancer biology research and Toxicology in the 21st Century concepts share many technological tools. A key category within the Three Rs paradigm is the Replacement of animals in toxicity testing with alternative methods, such as bioinformatics-driven analyses of data obtained from human cell cultures exposed to diverse toxicants. This work was recently expanded within the pan-European SEURAT-1 project (Safety Evaluation Ultimately Replacing Animal Testing), to replace repeat-dose toxicity testing with data-rich analyses of sophisticated cell culture models. The aims and objectives of the SEURAT project have been to guide the application, analysis, interpretation and storage of 'omics' technology-derived data within the service-oriented sub-project, ToxBank. Particularly addressing the Lush Science Prize focus on the relevance of toxicity pathways, a 'data warehouse' that is under continuous expansion, coupled with the development of novel data storage and management methods for toxicology, serve to address data integration across multiple 'omics' technologies. The prize winners' guiding principles and concepts for modern knowledge management of toxicological data are summarised. The translation of basic discovery results ranged from chemical-testing and material-testing data, to information relevant to human health and environmental safety. 2015 FRAME.

  6. The evolution of analytical chemistry methods in foodomics.

    PubMed

    Gallo, Monica; Ferranti, Pasquale

    2016-01-08

    The methodologies of food analysis have greatly evolved over the past 100 years, from basic assays based on solution chemistry to those relying on the modern instrumental platforms. Today, the development and optimization of integrated analytical approaches based on different techniques to study at molecular level the chemical composition of a food may allow to define a 'food fingerprint', valuable to assess nutritional value, safety and quality, authenticity and security of foods. This comprehensive strategy, defined foodomics, includes emerging work areas such as food chemistry, phytochemistry, advanced analytical techniques, biosensors and bioinformatics. Integrated approaches can help to elucidate some critical issues in food analysis, but also to face the new challenges of a globalized world: security, sustainability and food productions in response to environmental world-wide changes. They include the development of powerful analytical methods to ensure the origin and quality of food, as well as the discovery of biomarkers to identify potential food safety problems. In the area of nutrition, the future challenge is to identify, through specific biomarkers, individual peculiarities that allow early diagnosis and then a personalized prognosis and diet for patients with food-related disorders. Far from the aim of an exhaustive review of the abundant literature dedicated to the applications of omic sciences in food analysis, we will explore how classical approaches, such as those used in chemistry and biochemistry, have evolved to intersect with the new omics technologies to produce a progress in our understanding of the complexity of foods. Perhaps most importantly, a key objective of the review will be to explore the development of simple and robust methods for a fully applied use of omics data in food science. Copyright © 2015 Elsevier B.V. All rights reserved.

  7. Use of Free/Libre Open Source Software in Sepsis "-Omics" Research: A Bibliometric, Comparative Analysis Among the United States, EU-28 Member States, and China.

    PubMed

    Evangelatos, Nikolaos; Satyamourthy, Kapaettu; Levidou, Georgia; Brand, Helmut; Bauer, Pia; Kouskouti, Christina; Brand, Angela

    2018-05-01

    "-Omics" systems sciences are at the epicenter of personalized medicine and public health, and drivers of knowledge-based biotechnology innovation. Bioinformatics, a core component of omics research, is one of the disciplines that first employed Free/Libre Open Source Software (FLOSS), and thus provided a fertile ground for its further development. Understanding the use and characteristics of FLOSS deployed in the omics field is valuable for future innovation strategies, policy and funding priorities. We conducted a bibliometric, longitudinal study of the use of FLOSS in sepsis omics research from 2011 to 2015 in the United States, EU-28 and China. Because sepsis is an interdisciplinary field at the intersection of multiple omics technologies and medical specialties, it was chosen as a model innovation ecosystem for this empirical analysis, which used publicly available data. Despite development of and competition from proprietary commercial software, scholars in omics continue to employ FLOSS routinely, and independent of the type of omics technology they work with. The number of articles using FLOSS increased significantly over time in the EU-28, as opposed to the United States and China (R = 0.96, p = 0.004). Furthermore, in an era where sharing of knowledge is being strongly advocated and promoted by public agencies and social institutions, we discuss possible correlations between the use of FLOSS and various funding sources in omics research. These observations and analyses provide new insights into the use of FLOSS in sepsis omics research across three (supra)national regions. Further benchmarking studies are warranted for FLOSS trends in other omics fields and geographical settings. These could, in time, lead to the development of new composite innovation and technology use metrics in omics systems sciences and bioinformatics communities.

  8. Clustering multilayer omics data using MuNCut.

    PubMed

    Teran Hidalgo, Sebastian J; Ma, Shuangge

    2018-03-14

    Omics profiling is now a routine component of biomedical studies. In the analysis of omics data, clustering is an essential step and serves multiple purposes including for example revealing the unknown functionalities of omics units, assisting dimension reduction in outcome model building, and others. In the most recent omics studies, a prominent trend is to conduct multilayer profiling, which collects multiple types of genetic, genomic, epigenetic and other measurements on the same subjects. In the literature, clustering methods tailored to multilayer omics data are still limited. Directly applying the existing clustering methods to multilayer omics data and clustering each layer first and then combing across layers are both "suboptimal" in that they do not accommodate the interconnections within layers and across layers in an informative way. In this study, we develop the MuNCut (Multilayer NCut) clustering approach. It is tailored to multilayer omics data and sufficiently accounts for both across- and within-layer connections. It is based on the novel NCut technique and also takes advantages of regularized sparse estimation. It has an intuitive formulation and is computationally very feasible. To facilitate implementation, we develop the function muncut in the R package NcutYX. Under a wide spectrum of simulation settings, it outperforms competitors. The analysis of TCGA (The Cancer Genome Atlas) data on breast cancer and cervical cancer shows that MuNCut generates biologically meaningful results which differ from those using the alternatives. We propose a more effective clustering analysis of multiple omics data. It provides a new venue for jointly analyzing genetic, genomic, epigenetic and other measurements.

  9. Data on master regulators and transcription factor binding sites found by upstream analysis of multi-omics data on methotrexate resistance of colon cancer.

    PubMed

    Kel, AlexanderE

    2017-02-01

    Computational analysis of master regulators through the search for transcription factor binding sites followed by analysis of signal transduction networks of a cell is a new approach of causal analysis of multi-omics data. This paper contains results on analysis of multi-omics data that include transcriptomics, proteomics and epigenomics data of methotrexate (MTX) resistant colon cancer cell line. The data were used for analysis of mechanisms of resistance and for prediction of potential drug targets and promising compounds for reverting the MTX resistance of these cancer cells. We present all results of the analysis including the lists of identified transcription factors and their binding sites in genome and the list of predicted master regulators - potential drug targets. This data was generated in the study recently published in the article "Multi-omics "Upstream Analysis" of regulatory genomic regions helps identifying targets against methotrexate resistance of colon cancer" (Kel et al., 2016) [4]. These data are of interest for researchers from the field of multi-omics data analysis and for biologists who are interested in identification of novel drug targets against NTX resistance.

  10. Extracellular Enzyme Composition and Functional Characteristics of Aspergillus niger An-76 Induced by Food Processing Byproducts and Based on Integrated Functional Omics.

    PubMed

    Liu, Lin; Gong, Weili; Sun, Xiaomeng; Chen, Guanjun; Wang, Lushan

    2018-02-07

    Byproducts of food processing can be utilized for the production of high-value-added enzyme cocktails. In this study, we utilized integrated functional omics technology to analyze composition and functional characteristics of extracellular enzymes produced by Aspergillus niger grown on food processing byproducts. The results showed that oligosaccharides constituted by arabinose, xylose, and glucose in wheat bran were able to efficiently induce the production of extracellular enzymes of A. niger. Compared with other substrates, wheat bran was more effective at inducing the secretion of β-glucosidases from GH1 and GH3 families, as well as >50% of proteases from A1-family aspartic proteases. Compared with proteins induced by single wheat bran or soybean dregs, the protein yield induced by their mixture was doubled, and the time required to reach peak enzyme activity was shortened by 25%. This study provided a technical platform for the complex formulation of various substrates and functional analysis of extracellular enzymes.

  11. A systemic identification approach for primary transcription start site of Arabidopsis miRNAs from multidimensional omics data.

    PubMed

    You, Qi; Yan, Hengyu; Liu, Yue; Yi, Xin; Zhang, Kang; Xu, Wenying; Su, Zhen

    2017-05-01

    The 22-nucleotide non-coding microRNAs (miRNAs) are mostly transcribed by RNA polymerase II and are similar to protein-coding genes. Unlike the clear process from stem-loop precursors to mature miRNAs, the primary transcriptional regulation of miRNA, especially in plants, still needs to be further clarified, including the original transcription start site, functional cis-elements and primary transcript structures. Due to several well-characterized transcription signals in the promoter region, we proposed a systemic approach integrating multidimensional "omics" (including genomics, transcriptomics, and epigenomics) data to improve the genome-wide identification of primary miRNA transcripts. Here, we used the model plant Arabidopsis thaliana to improve the ability to identify candidate promoter locations in intergenic miRNAs and to determine rules for identifying primary transcription start sites of miRNAs by integrating high-throughput omics data, such as the DNase I hypersensitive sites, chromatin immunoprecipitation-sequencing of polymerase II and H3K4me3, as well as high throughput transcriptomic data. As a result, 93% of refined primary transcripts could be confirmed by the primer pairs from a previous study. Cis-element and secondary structure analyses also supported the feasibility of our results. This work will contribute to the primary transcriptional regulatory analysis of miRNAs, and the conserved regulatory pattern may be a suitable miRNA characteristic in other plant species.

  12. The Human Blood Metabolome-Transcriptome Interface

    PubMed Central

    Schramm, Katharina; Adamski, Jerzy; Gieger, Christian; Herder, Christian; Carstensen, Maren; Peters, Annette; Rathmann, Wolfgang; Roden, Michael; Strauch, Konstantin; Suhre, Karsten; Kastenmüller, Gabi; Prokisch, Holger; Theis, Fabian J.

    2015-01-01

    Biological systems consist of multiple organizational levels all densely interacting with each other to ensure function and flexibility of the system. Simultaneous analysis of cross-sectional multi-omics data from large population studies is a powerful tool to comprehensively characterize the underlying molecular mechanisms on a physiological scale. In this study, we systematically analyzed the relationship between fasting serum metabolomics and whole blood transcriptomics data from 712 individuals of the German KORA F4 cohort. Correlation-based analysis identified 1,109 significant associations between 522 transcripts and 114 metabolites summarized in an integrated network, the ‘human blood metabolome-transcriptome interface’ (BMTI). Bidirectional causality analysis using Mendelian randomization did not yield any statistically significant causal associations between transcripts and metabolites. A knowledge-based interpretation and integration with a genome-scale human metabolic reconstruction revealed systematic signatures of signaling, transport and metabolic processes, i.e. metabolic reactions mainly belonging to lipid, energy and amino acid metabolism. Moreover, the construction of a network based on functional categories illustrated the cross-talk between the biological layers at a pathway level. Using a transcription factor binding site enrichment analysis, this pathway cross-talk was further confirmed at a regulatory level. Finally, we demonstrated how the constructed networks can be used to gain novel insights into molecular mechanisms associated to intermediate clinical traits. Overall, our results demonstrate the utility of a multi-omics integrative approach to understand the molecular mechanisms underlying both normal physiology and disease. PMID:26086077

  13. Resource recovery from wastewater: application of meta-omics to phosphorus and carbon management.

    PubMed

    Sales, Christopher M; Lee, Patrick K H

    2015-06-01

    A growing trend at wastewater treatment plants is the recovery of resources and energy from wastewater. Enhanced biological phosphorus removal and anaerobic digestion are two established biotechnology approaches for the recovery of phosphorus and carbon, respectively. Meta-omics approaches (meta-genomics, transcriptomics, proteomics, and metabolomics) are providing novel biological insights into these complex biological systems. In particular, genome-centric metagenomics analyses are revealing the function and physiology of individual community members. Querying transcripts, proteins and metabolites are emerging techniques that can inform the cellular responses under different conditions. Overall, meta-omics approaches are shedding light into complex microbial communities once regarded as 'blackboxes', but challenges remain to integrate information from meta-omics into engineering design and operation guidelines. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Systems metabolic engineering: the creation of microbial cell factories by rational metabolic design and evolution.

    PubMed

    Furusawa, Chikara; Horinouchi, Takaaki; Hirasawa, Takashi; Shimizu, Hiroshi

    2013-01-01

    It is widely acknowledged that in order to establish sustainable societies, production processes should shift from petrochemical-based processes to bioprocesses. Because bioconversion technologies, in which biomass resources are converted to valuable materials, are preferable to processes dependent on fossil resources, the former should be further developed. The following two approaches can be adopted to improve cellular properties and obtain high productivity and production yield of target products: (1) optimization of cellular metabolic pathways involved in various bioprocesses and (2) creation of stress-tolerant cells that can be active even under severe stress conditions in the bioprocesses. Recent progress in omics analyses has facilitated the analysis of microorganisms based on bioinformatics data for molecular breeding and bioprocess development. Systems metabolic engineering is a new area of study, and it has been defined as a methodology in which metabolic engineering and systems biology are integrated to upgrade the designability of industrially useful microorganisms. This chapter discusses multi-omics analyses and rational design methods for molecular breeding. The first is an example of the rational design of metabolic networks for target production by flux balance analysis using genome-scale metabolic models. Recent progress in the development of genome-scale metabolic models and the application of these models to the design of desirable metabolic networks is also described in this example. The second is an example of evolution engineering with omics analyses for the creation of stress-tolerant microorganisms. Long-term culture experiments to obtain the desired phenotypes and omics analyses to identify the phenotypic changes are described here.

  15. Omics Profiling in Precision Oncology*

    PubMed Central

    Yu, Kun-Hsing; Snyder, Michael

    2016-01-01

    Cancer causes significant morbidity and mortality worldwide, and is the area most targeted in precision medicine. Recent development of high-throughput methods enables detailed omics analysis of the molecular mechanisms underpinning tumor biology. These studies have identified clinically actionable mutations, gene and protein expression patterns associated with prognosis, and provided further insights into the molecular mechanisms indicative of cancer biology and new therapeutics strategies such as immunotherapy. In this review, we summarize the techniques used for tumor omics analysis, recapitulate the key findings in cancer omics studies, and point to areas requiring further research on precision oncology. PMID:27099341

  16. Identification of key regulators of pancreatic cancer progression through multidimensional systems-level analysis.

    PubMed

    Rajamani, Deepa; Bhasin, Manoj K

    2016-05-03

    Pancreatic cancer is an aggressive cancer with dismal prognosis, urgently necessitating better biomarkers to improve therapeutic options and early diagnosis. Traditional approaches of biomarker detection that consider only one aspect of the biological continuum like gene expression alone are limited in their scope and lack robustness in identifying the key regulators of the disease. We have adopted a multidimensional approach involving the cross-talk between the omics spaces to identify key regulators of disease progression. Multidimensional domain-specific disease signatures were obtained using rank-based meta-analysis of individual omics profiles (mRNA, miRNA, DNA methylation) related to pancreatic ductal adenocarcinoma (PDAC). These domain-specific PDAC signatures were integrated to identify genes that were affected across multiple dimensions of omics space in PDAC (genes under multiple regulatory controls, GMCs). To further pin down the regulators of PDAC pathophysiology, a systems-level network was generated from knowledge-based interaction information applied to the above identified GMCs. Key regulators were identified from the GMC network based on network statistics and their functional importance was validated using gene set enrichment analysis and survival analysis. Rank-based meta-analysis identified 5391 genes, 109 miRNAs and 2081 methylation-sites significantly differentially expressed in PDAC (false discovery rate ≤ 0.05). Bimodal integration of meta-analysis signatures revealed 1150 and 715 genes regulated by miRNAs and methylation, respectively. Further analysis identified 189 altered genes that are commonly regulated by miRNA and methylation, hence considered GMCs. Systems-level analysis of the scale-free GMCs network identified eight potential key regulator hubs, namely E2F3, HMGA2, RASA1, IRS1, NUAK1, ACTN1, SKI and DLL1, associated with important pathways driving cancer progression. Survival analysis on individual key regulators revealed that higher expression of IRS1 and DLL1 and lower expression of HMGA2, ACTN1 and SKI were associated with better survival probabilities. It is evident from the results that our hierarchical systems-level multidimensional analysis approach has been successful in isolating the converging regulatory modules and associated key regulatory molecules that are potential biomarkers for pancreatic cancer progression.

  17. Functional Analysis of OMICs Data and Small Molecule Compounds in an Integrated "Knowledge-Based" Platform.

    PubMed

    Dubovenko, Alexey; Nikolsky, Yuri; Rakhmatulin, Eugene; Nikolskaya, Tatiana

    2017-01-01

    Analysis of NGS and other sequencing data, gene variants, gene expression, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high fidelity annotated knowledgebase of protein interactions, pathways, and functional ontologies. This knowledgebase has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here, we present MetaCore™ and Key Pathway Advisor (KPA), an integrated platform for functional data analysis. On the content side, MetaCore and KPA encompass a comprehensive database of molecular interactions of different types, pathways, network models, and ten functional ontologies covering human, mouse, and rat genes. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical "interactome" tool for the identification of over- and under-connected proteins in the dataset, and a biological network analysis module made up of network generation algorithms and filters. The suite also features Advanced Search, an application for combinatorial search of the database content, as well as a Java-based tool called Pathway Map Creator for drawing and editing custom pathway maps. Applications of MetaCore and KPA include molecular mode of action of disease research, identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds and clinical applications (analysis of large cohorts of patients, and translational and personalized medicine).

  18. Epigenetic and genetic dissections of UV-induced global gene dysregulation in skin cells through multi-omics analyses

    PubMed Central

    Shen, Yao; Stanislauskas, Milda; Li, Gen; Zheng, Deyou; Liu, Liang

    2017-01-01

    To elucidate the complex molecular mechanisms underlying the adverse effects UV radiation (UVR) on skin homeostasis, we performed multi-omics studies to characterize UV-induced genetic and epigenetic changes. Human keratinocytes from a single donor treated with or without UVR were analyzed by RNA-seq, exome-seq, and H3K27ac ChIP-seq at 4 h and 72 h following UVR. Compared to the relatively moderate mutagenic effects of UVR, acute UV exposure induced substantial epigenomic and transcriptomic alterations, illuminating a previously underappreciated role of epigenomic and transcriptomic instability in skin pathogenesis. Integration of the multi-omics data revealed that UVR-induced transcriptional dysregulation of a subset of genes was attributable to either genetic mutations or global redistribution of H3K27ac. H3K27ac redistribution further led to the formation of distinctive super enhancers in UV-irradiated cells. Our analysis also identified several new UV target genes, including CYP24A1, GJA5, SLAMF7 and ETV1, which were frequently dysregulated in human squamous cell carcinomas, highlighting their potential as new molecular targets for prevention or treatment of UVR-induced skin cancers. Taken together, our concurrent multi-omics analyses provide new mechanistic insights into the complex molecular networks underlying UV photobiological effects, which have important implications in understanding its impact on skin homeostasis and pathogenesis. PMID:28211524

  19. Data processing, multi-omic pathway mapping, and metabolite activity analysis using XCMS Online

    PubMed Central

    Forsberg, Erica M; Huan, Tao; Rinehart, Duane; Benton, H Paul; Warth, Benedikt; Hilmers, Brian; Siuzdak, Gary

    2018-01-01

    Systems biology is the study of complex living organisms, and as such, analysis on a systems-wide scale involves the collection of information-dense data sets that are representative of an entire phenotype. To uncover dynamic biological mechanisms, bioinformatics tools have become essential to facilitating data interpretation in large-scale analyses. Global metabolomics is one such method for performing systems biology, as metabolites represent the downstream functional products of ongoing biological processes. We have developed XCMS Online, a platform that enables online metabolomics data processing and interpretation. A systems biology workflow recently implemented within XCMS Online enables rapid metabolic pathway mapping using raw metabolomics data for investigating dysregulated metabolic processes. In addition, this platform supports integration of multi-omic (such as genomic and proteomic) data to garner further systems-wide mechanistic insight. Here, we provide an in-depth procedure showing how to effectively navigate and use the systems biology workflow within XCMS Online without a priori knowledge of the platform, including uploading liquid chromatography (LCLC)–mass spectrometry (MS) data from metabolite-extracted biological samples, defining the job parameters to identify features, correcting for retention time deviations, conducting statistical analysis of features between sample classes and performing predictive metabolic pathway analysis. Additional multi-omics data can be uploaded and overlaid with previously identified pathways to enhance systems-wide analysis of the observed dysregulations. We also describe unique visualization tools to assist in elucidation of statistically significant dysregulated metabolic pathways. Parameter input takes 5–10 min, depending on user experience; data processing typically takes 1–3 h, and data analysis takes ~30 min. PMID:29494574

  20. An Integrated Omics Analysis: Impact of Microgravity on Host Response to Lipopolysaccharide in vitro

    DTIC Science & Technology

    2014-08-07

    on host response to lipopolysaccharide in vitro 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) Nabarun Chakraborty...49. Mateescu B, Batista L, Cardon M, Gruosso T, de Feraudy Y, Mariani O, Nicolas A, Meyniel JP, Cottu P, Sastre Garau X, Mechta Grigoriou F: miR 141

  1. Integrative analysis of multi-omics data for identifying multi-markers for diagnosing pancreatic cancer

    PubMed Central

    2015-01-01

    Background microRNA (miRNA) expression plays an influential role in cancer classification and malignancy, and miRNAs are feasible as alternative diagnostic markers for pancreatic cancer, a highly aggressive neoplasm with silent early symptoms, high metastatic potential, and resistance to conventional therapies. Methods In this study, we evaluated the benefits of multi-omics data analysis by integrating miRNA and mRNA expression data in pancreatic cancer. Using support vector machine (SVM) modelling and leave-one-out cross validation (LOOCV), we evaluated the diagnostic performance of single- or multi-markers based on miRNA and mRNA expression profiles from 104 PDAC tissues and 17 benign pancreatic tissues. For selecting even more reliable and robust markers, we performed validation by independent datasets from the Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) data depositories. For validation, miRNA activity was estimated by miRNA-target gene interaction and mRNA expression datasets in pancreatic cancer. Results Using a comprehensive identification approach, we successfully identified 705 multi-markers having powerful diagnostic performance for PDAC. In addition, these marker candidates annotated with cancer pathways using gene ontology analysis. Conclusions Our prediction models have strong potential for the diagnosis of pancreatic cancer. PMID:26328610

  2. An integrated bioinformatics approach to improve two-color microarray quality-control: impact on biological conclusions.

    PubMed

    van Haaften, Rachel I M; Luceri, Cristina; van Erk, Arie; Evelo, Chris T A

    2009-06-01

    Omics technology used for large-scale measurements of gene expression is rapidly evolving. This work pointed out the need of an extensive bioinformatics analyses for array quality assessment before and after gene expression clustering and pathway analysis. A study focused on the effect of red wine polyphenols on rat colon mucosa was used to test the impact of quality control and normalisation steps on the biological conclusions. The integration of data visualization, pathway analysis and clustering revealed an artifact problem that was solved with an adapted normalisation. We propose a possible point to point standard analysis procedure, based on a combination of clustering and data visualization for the analysis of microarray data.

  3. The Encyclopedia of Systems Biology and OMICS (first presentation) and The ISA Infrastructure for Multi-omics Data (second presentation) (GSC8 Meeting)

    ScienceCinema

    Kolker, Eugene; Sansone, Susanna

    2018-01-15

    The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year funding "Research Coordination Network" from the National Science Foundation and was organized held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Eugene Kolker from Seattle Children's Hospital briefly discusses "The Encyclopedia of Systems Biology and OMICS," followed by Susanna Sansone from the EBI on "The ISA Infrastructure for multi-omics data" at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, CA. on Sept. 11, 2009.

  4. Differential Expression and Functional Analysis of High-Throughput -Omics Data Using Open Source Tools.

    PubMed

    Kebschull, Moritz; Fittler, Melanie Julia; Demmer, Ryan T; Papapanou, Panos N

    2017-01-01

    Today, -omics analyses, including the systematic cataloging of messenger RNA and microRNA sequences or DNA methylation patterns in a cell population, organ, or tissue sample, allow for an unbiased, comprehensive genome-level analysis of complex diseases, offering a large advantage over earlier "candidate" gene or pathway analyses. A primary goal in the analysis of these high-throughput assays is the detection of those features among several thousand that differ between different groups of samples. In the context of oral biology, our group has successfully utilized -omics technology to identify key molecules and pathways in different diagnostic entities of periodontal disease.A major issue when inferring biological information from high-throughput -omics studies is the fact that the sheer volume of high-dimensional data generated by contemporary technology is not appropriately analyzed using common statistical methods employed in the biomedical sciences.In this chapter, we outline a robust and well-accepted bioinformatics workflow for the initial analysis of -omics data generated using microarrays or next-generation sequencing technology using open-source tools. Starting with quality control measures and necessary preprocessing steps for data originating from different -omics technologies, we next outline a differential expression analysis pipeline that can be used for data from both microarray and sequencing experiments, and offers the possibility to account for random or fixed effects. Finally, we present an overview of the possibilities for a functional analysis of the obtained data.

  5. Omics for aquatic ecotoxicology: Control of extraneous variability to enhance the analysis of environmental effects

    EPA Science Inventory

    There are multiple sources of biological and technical variation in a typical ecotoxicology study that may not be revealed by traditional endpoints but that become apparent in an omics dataset. As researchers increasingly apply omics technologies to environmental studies, it will...

  6. Studying Cellular Signal Transduction with OMIC Technologies.

    PubMed

    Landry, Benjamin D; Clarke, David C; Lee, Michael J

    2015-10-23

    In the gulf between genotype and phenotype exists proteins and, in particular, protein signal transduction systems. These systems use a relatively limited parts list to respond to a much longer list of extracellular, environmental, and/or mechanical cues with rapidity and specificity. Most signaling networks function in a highly non-linear and often contextual manner. Furthermore, these processes occur dynamically across space and time. Because of these complexities, systems and "OMIC" approaches are essential for the study of signal transduction. One challenge in using OMIC-scale approaches to study signaling is that the "signal" can take different forms in different situations. Signals are encoded in diverse ways such as protein-protein interactions, enzyme activities, localizations, or post-translational modifications to proteins. Furthermore, in some cases, signals may be encoded only in the dynamics, duration, or rates of change of these features. Accordingly, systems-level analyses of signaling may need to integrate multiple experimental and/or computational approaches. As the field has progressed, the non-triviality of integrating experimental and computational analyses has become apparent. Successful use of OMIC methods to study signaling will require the "right" experiments and the "right" modeling approaches, and it is critical to consider both in the design phase of the project. In this review, we discuss common OMIC and modeling approaches for studying signaling, emphasizing the philosophical and practical considerations for effectively merging these two types of approaches to maximize the probability of obtaining reliable and novel insights into signaling biology. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Mechanistic analysis of multi-omics datasets to generate kinetic parameters for constraint-based metabolic models.

    PubMed

    Cotten, Cameron; Reed, Jennifer L

    2013-01-30

    Constraint-based modeling uses mass balances, flux capacity, and reaction directionality constraints to predict fluxes through metabolism. Although transcriptional regulation and thermodynamic constraints have been integrated into constraint-based modeling, kinetic rate laws have not been extensively used. In this study, an in vivo kinetic parameter estimation problem was formulated and solved using multi-omic data sets for Escherichia coli. To narrow the confidence intervals for kinetic parameters, a series of kinetic model simplifications were made, resulting in fewer kinetic parameters than the full kinetic model. These new parameter values are able to account for flux and concentration data from 20 different experimental conditions used in our training dataset. Concentration estimates from the simplified kinetic model were within one standard deviation for 92.7% of the 790 experimental measurements in the training set. Gibbs free energy changes of reaction were calculated to identify reactions that were often operating close to or far from equilibrium. In addition, enzymes whose activities were positively or negatively influenced by metabolite concentrations were also identified. The kinetic model was then used to calculate the maximum and minimum possible flux values for individual reactions from independent metabolite and enzyme concentration data that were not used to estimate parameter values. Incorporating these kinetically-derived flux limits into the constraint-based metabolic model improved predictions for uptake and secretion rates and intracellular fluxes in constraint-based models of central metabolism. This study has produced a method for in vivo kinetic parameter estimation and identified strategies and outcomes of kinetic model simplification. We also have illustrated how kinetic constraints can be used to improve constraint-based model predictions for intracellular fluxes and biomass yield and identify potential metabolic limitations through the integrated analysis of multi-omics datasets.

  8. Listeriomics: an Interactive Web Platform for Systems Biology of Listeria

    PubMed Central

    Koutero, Mikael; Tchitchek, Nicolas; Cerutti, Franck; Lechat, Pierre; Maillet, Nicolas; Hoede, Claire; Chiapello, Hélène; Gaspin, Christine

    2017-01-01

    ABSTRACT As for many model organisms, the amount of Listeria omics data produced has recently increased exponentially. There are now >80 published complete Listeria genomes, around 350 different transcriptomic data sets, and 25 proteomic data sets available. The analysis of these data sets through a systems biology approach and the generation of tools for biologists to browse these various data are a challenge for bioinformaticians. We have developed a web-based platform, named Listeriomics, that integrates different tools for omics data analyses, i.e., (i) an interactive genome viewer to display gene expression arrays, tiling arrays, and sequencing data sets along with proteomics and genomics data sets; (ii) an expression and protein atlas that connects every gene, small RNA, antisense RNA, or protein with the most relevant omics data; (iii) a specific tool for exploring protein conservation through the Listeria phylogenomic tree; and (iv) a coexpression network tool for the discovery of potential new regulations. Our platform integrates all the complete Listeria species genomes, transcriptomes, and proteomes published to date. This website allows navigation among all these data sets with enriched metadata in a user-friendly format and can be used as a central database for systems biology analysis. IMPORTANCE In the last decades, Listeria has become a key model organism for the study of host-pathogen interactions, noncoding RNA regulation, and bacterial adaptation to stress. To study these mechanisms, several genomics, transcriptomics, and proteomics data sets have been produced. We have developed Listeriomics, an interactive web platform to browse and correlate these heterogeneous sources of information. Our website will allow listeriologists and microbiologists to decipher key regulation mechanism by using a systems biology approach. PMID:28317029

  9. Mechanistic analysis of multi-omics datasets to generate kinetic parameters for constraint-based metabolic models

    PubMed Central

    2013-01-01

    Background Constraint-based modeling uses mass balances, flux capacity, and reaction directionality constraints to predict fluxes through metabolism. Although transcriptional regulation and thermodynamic constraints have been integrated into constraint-based modeling, kinetic rate laws have not been extensively used. Results In this study, an in vivo kinetic parameter estimation problem was formulated and solved using multi-omic data sets for Escherichia coli. To narrow the confidence intervals for kinetic parameters, a series of kinetic model simplifications were made, resulting in fewer kinetic parameters than the full kinetic model. These new parameter values are able to account for flux and concentration data from 20 different experimental conditions used in our training dataset. Concentration estimates from the simplified kinetic model were within one standard deviation for 92.7% of the 790 experimental measurements in the training set. Gibbs free energy changes of reaction were calculated to identify reactions that were often operating close to or far from equilibrium. In addition, enzymes whose activities were positively or negatively influenced by metabolite concentrations were also identified. The kinetic model was then used to calculate the maximum and minimum possible flux values for individual reactions from independent metabolite and enzyme concentration data that were not used to estimate parameter values. Incorporating these kinetically-derived flux limits into the constraint-based metabolic model improved predictions for uptake and secretion rates and intracellular fluxes in constraint-based models of central metabolism. Conclusions This study has produced a method for in vivo kinetic parameter estimation and identified strategies and outcomes of kinetic model simplification. We also have illustrated how kinetic constraints can be used to improve constraint-based model predictions for intracellular fluxes and biomass yield and identify potential metabolic limitations through the integrated analysis of multi-omics datasets. PMID:23360254

  10. Uncovering Hidden Layers of Cell Cycle Regulation through Integrative Multi-omic Analysis

    PubMed Central

    Aviner, Ranen; Shenoy, Anjana; Elroy-Stein, Orna; Geiger, Tamar

    2015-01-01

    Studying the complex relationship between transcription, translation and protein degradation is essential to our understanding of biological processes in health and disease. The limited correlations observed between mRNA and protein abundance suggest pervasive regulation of post-transcriptional steps and support the importance of profiling mRNA levels in parallel to protein synthesis and degradation rates. In this work, we applied an integrative multi-omic approach to study gene expression along the mammalian cell cycle through side-by-side analysis of mRNA, translation and protein levels. Our analysis sheds new light on the significant contribution of both protein synthesis and degradation to the variance in protein expression. Furthermore, we find that translation regulation plays an important role at S-phase, while progression through mitosis is predominantly controlled by changes in either mRNA levels or protein stability. Specific molecular functions are found to be co-regulated and share similar patterns of mRNA, translation and protein expression along the cell cycle. Notably, these include genes and entire pathways not previously implicated in cell cycle progression, demonstrating the potential of this approach to identify novel regulatory mechanisms beyond those revealed by traditional expression profiling. Through this three-level analysis, we characterize different mechanisms of gene expression, discover new cycling gene products and highlight the importance and utility of combining datasets generated using different techniques that monitor distinct steps of gene expression. PMID:26439921

  11. Visualising associations between paired ‘omics’ data sets

    PubMed Central

    2012-01-01

    Background Each omics platform is now able to generate a large amount of data. Genomics, proteomics, metabolomics, interactomics are compiled at an ever increasing pace and now form a core part of the fundamental systems biology framework. Recently, several integrative approaches have been proposed to extract meaningful information. However, these approaches lack of visualisation outputs to fully unravel the complex associations between different biological entities. Results The multivariate statistical approaches ‘regularized Canonical Correlation Analysis’ and ‘sparse Partial Least Squares regression’ were recently developed to integrate two types of highly dimensional ‘omics’ data and to select relevant information. Using the results of these methods, we propose to revisit few graphical outputs to better understand the relationships between two ‘omics’ data and to better visualise the correlation structure between the different biological entities. These graphical outputs include Correlation Circle plots, Relevance Networks and Clustered Image Maps. We demonstrate the usefulness of such graphical outputs on several biological data sets and further assess their biological relevance using gene ontology analysis. Conclusions Such graphical outputs are undoubtedly useful to aid the interpretation of these promising integrative analysis tools and will certainly help in addressing fundamental biological questions and understanding systems as a whole. Availability The graphical tools described in this paper are implemented in the freely available R package mixOmics and in its associated web application. PMID:23148523

  12. Multi-omics analysis provides insight to the Ignicoccus hospitalis-Nanoarchaeum equitans association.

    PubMed

    Rawle, Rachel A; Hamerly, Timothy; Tripet, Brian P; Giannone, Richard J; Wurch, Louie; Hettich, Robert L; Podar, Mircea; Copié, Valerie; Bothner, Brian

    2017-09-01

    Studies of interspecies interactions are inherently difficult due to the complex mechanisms which enable these relationships. A model system for studying interspecies interactions is the marine hyperthermophiles Ignicoccus hospitalis and Nanoarchaeum equitans. Recent independently-conducted 'omics' analyses have generated insights into the molecular factors modulating this association. However, significant questions remain about the nature of the interactions between these archaea. We jointly analyzed multiple levels of omics datasets obtained from published, independent transcriptomics, proteomics, and metabolomics analyses. DAVID identified functionally-related groups enriched when I. hospitalis is grown alone or in co-culture with N. equitans. Enriched molecular pathways were subsequently visualized using interaction maps generated using STRING. Key findings of our multi-level omics analysis indicated that I. hospitalis provides precursors to N. equitans for energy metabolism. Analysis indicated an overall reduction in diversity of metabolic precursors in the I. hospitalis-N. equitans co-culture, which has been connected to the differential use of ribosomal subunits and was previously unnoticed. We also identified differences in precursors linked to amino acid metabolism, NADH metabolism, and carbon fixation, providing new insights into the metabolic adaptions of I. hospitalis enabling the growth of N. equitans. This multi-omics analysis builds upon previously identified cellular patterns while offering new insights into mechanisms that enable the I. hospitalis-N. equitans association. Our study applies statistical and visualization techniques to a mixed-source omics dataset to yield a more global insight into a complex system, that was not readily discernable from separate omics studies. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Computational biology for cardiovascular biomarker discovery.

    PubMed

    Azuaje, Francisco; Devaux, Yvan; Wagner, Daniel

    2009-07-01

    Computational biology is essential in the process of translating biological knowledge into clinical practice, as well as in the understanding of biological phenomena based on the resources and technologies originating from the clinical environment. One such key contribution of computational biology is the discovery of biomarkers for predicting clinical outcomes using 'omic' information. This process involves the predictive modelling and integration of different types of data and knowledge for screening, diagnostic or prognostic purposes. Moreover, this requires the design and combination of different methodologies based on statistical analysis and machine learning. This article introduces key computational approaches and applications to biomarker discovery based on different types of 'omic' data. Although we emphasize applications in cardiovascular research, the computational requirements and advances discussed here are also relevant to other domains. We will start by introducing some of the contributions of computational biology to translational research, followed by an overview of methods and technologies used for the identification of biomarkers with predictive or classification value. The main types of 'omic' approaches to biomarker discovery will be presented with specific examples from cardiovascular research. This will include a review of computational methodologies for single-source and integrative data applications. Major computational methods for model evaluation will be described together with recommendations for reporting models and results. We will present recent advances in cardiovascular biomarker discovery based on the combination of gene expression and functional network analyses. The review will conclude with a discussion of key challenges for computational biology, including perspectives from the biosciences and clinical areas.

  14. Research from the NASA Twins Study and Omics in Support of Mars Missions

    NASA Technical Reports Server (NTRS)

    Kundrot, C.; Shelhamer, M.; Scott, G.

    2015-01-01

    The NASA Twins Study, NASA's first foray into integrated omic studies in humans, illustrates how an integrated omics approach can be brought to bear on the challenges to human health and performance on a Mars mission. The NASA Twins Study involves US Astronaut Scott Kelly and his identical twin brother, Mark Kelly, a retired US Astronaut. No other opportunity to study a twin pair for a prolonged period with one subject in space and one on the ground is available for the foreseeable future. A team of 10 principal investigators are conducting the Twins Study, examining a very broad range of biological functions including the genome, epigenome, transcriptome, proteome, metabolome, gut microbiome, immunological response to vaccinations, indicators of atherosclerosis, physiological fluid shifts, and cognition. A novel aspect of the study is the integrated study of molecular, physiological, cognitive, and microbiological properties. Major sample and data collection from both subjects for this study began approximately six months before Scott Kelly's one year mission on the ISS, continue while Scott Kelly is in flight and will conclude approximately six months after his return to Earth. Mark Kelly will remain on Earth during this study, in a lifestyle unconstrained by this study, thereby providing a measure of normal variation in the properties being studied. An overview of initial results and the future plans will be described as well as the technological and ethical issues raised for spaceflight studies involving omics.

  15. The MPLEx Protocol for Multi-omic Analyses of Soil Samples

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nicora, Carrie D.; Burnum-Johnson, Kristin E.; Nakayasu, Ernesto S.

    Mass spectrometry (MS)-based integrated metaproteomic, metabolomic and lipidomic (multi-omic) studies are transforming our ability to understand and characterize microbial communities in environmental and biological systems. These measurements are even enabling enhanced analyses of complex soil microbial communities, which are the most complex microbial systems known to date. Multi-omic analyses, however, do have sample preparation challenges since separate extractions are typically needed for each omic study, thereby greatly amplifying the preparation time and amount of sample required. To address this limitation, a 3-in-1 method for simultaneous metabolite, protein, and lipid extraction (MPLEx) from the exact same soil sample was created bymore » adapting a solvent-based approach. This MPLEx protocol has proven to be simple yet robust for many sample types and even when utilized for limited quantities of complex soil samples. The MPLEx method also greatly enabled the rapid multi-omic measurements needed to gain a better understanding of the members of each microbial community, while evaluating the changes taking place upon biological and environmental perturbations.« less

  16. The Encyclopedia of Systems Biology and OMICS (first presentation) and The ISA Infrastructure for Multi-omics Data (second presentation) (GSC8 Meeting)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kolker, Eugene; Sansone, Susanna

    2011-09-11

    The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year funding "Research Coordination Network" from the National Science Foundation and was organized held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Eugene Kolker from Seattle Children's Hospital briefly discusses "The Encyclopedia of Systems Biology and OMICS," followed by Susanna Sansone from the EBI on "Themore » ISA Infrastructure for multi-omics data" at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, CA. on Sept. 11, 2009.« less

  17. Ready to put metadata on the post-2015 development agenda? Linking data publications to responsible innovation and science diplomacy.

    PubMed

    Özdemir, Vural; Kolker, Eugene; Hotez, Peter J; Mohin, Sophie; Prainsack, Barbara; Wynne, Brian; Vayena, Effy; Coşkun, Yavuz; Dereli, Türkay; Huzair, Farah; Borda-Rodriguez, Alexander; Bragazzi, Nicola Luigi; Faris, Jack; Ramesar, Raj; Wonkam, Ambroise; Dandara, Collet; Nair, Bipin; Llerena, Adrián; Kılıç, Koray; Jain, Rekha; Reddy, Panga Jaipal; Gollapalli, Kishore; Srivastava, Sanjeeva; Kickbusch, Ilona

    2014-01-01

    Metadata refer to descriptions about data or as some put it, "data about data." Metadata capture what happens on the backstage of science, on the trajectory from study conception, design, funding, implementation, and analysis to reporting. Definitions of metadata vary, but they can include the context information surrounding the practice of science, or data generated as one uses a technology, including transactional information about the user. As the pursuit of knowledge broadens in the 21(st) century from traditional "science of whats" (data) to include "science of hows" (metadata), we analyze the ways in which metadata serve as a catalyst for responsible and open innovation, and by extension, science diplomacy. In 2015, the United Nations Millennium Development Goals (MDGs) will formally come to an end. Therefore, we propose that metadata, as an ingredient of responsible innovation, can help achieve the Sustainable Development Goals (SDGs) on the post-2015 agenda. Such responsible innovation, as a collective learning process, has become a key component, for example, of the European Union's 80 billion Euro Horizon 2020 R&D Program from 2014-2020. Looking ahead, OMICS: A Journal of Integrative Biology, is launching an initiative for a multi-omics metadata checklist that is flexible yet comprehensive, and will enable more complete utilization of single and multi-omics data sets through data harmonization and greater visibility and accessibility. The generation of metadata that shed light on how omics research is carried out, by whom and under what circumstances, will create an "intervention space" for integration of science with its socio-technical context. This will go a long way to addressing responsible innovation for a fairer and more transparent society. If we believe in science, then such reflexive qualities and commitments attained by availability of omics metadata are preconditions for a robust and socially attuned science, which can then remain broadly respected, independent, and responsibly innovative. "In Sierra Leone, we have not too much electricity. The lights will come on once in a week, and the rest of the month, dark[ness]. So I made my own battery to power light in people's houses." Kelvin Doe (Global Minimum, 2012) MIT Visiting Young Innovator Cambridge, USA, and Sierra Leone "An important function of the (Global) R&D Observatory will be to provide support and training to build capacity in the collection and analysis of R&D flows, and how to link them to the product pipeline." World Health Organization (2013) Draft Working Paper on a Global Health R&D Observatory.

  18. Enlightening discriminative network functional modules behind Principal Component Analysis separation in differential-omic science studies

    PubMed Central

    Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio

    2017-01-01

    Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094

  19. Educating future nursing scientists: Recommendations for integrating omics content in PhD programs.

    PubMed

    Conley, Yvette P; Heitkemper, Margaret; McCarthy, Donna; Anderson, Cindy M; Corwin, Elizabeth J; Daack-Hirsch, Sandra; Dorsey, Susan G; Gregory, Katherine E; Groer, Maureen W; Henly, Susan J; Landers, Timothy; Lyon, Debra E; Taylor, Jacquelyn Y; Voss, Joachim

    2015-01-01

    Preparing the next generation of nursing scientists to conduct high-impact, competitive, sustainable, innovative, and interdisciplinary programs of research requires that the curricula for PhD programs keep pace with emerging areas of knowledge and health care/biomedical science. A field of inquiry that holds great potential to influence our understanding of the underlying biology and mechanisms of health and disease is omics. For the purpose of this article, omics refers to genomics, transcriptomics, proteomics, epigenomics, exposomics, microbiomics, and metabolomics. Traditionally, most PhD programs in schools of nursing do not incorporate this content into their core curricula. As part of the Council for the Advancement of Nursing Science's Idea Festival for Nursing Science Education, a work group charged with addressing omics preparation for the next generation of nursing scientists was convened. The purpose of this article is to describe key findings and recommendations from the work group that unanimously and enthusiastically support the incorporation of omics content into the curricula of PhD programs in nursing. The work group also calls to action faculty in schools of nursing to develop strategies to enable students needing immersion in omics science and methods to execute their research goals. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Computational Tools for Parsimony Phylogenetic Analysis of Omics Data

    PubMed Central

    Salazar, Jose; Amri, Hakima; Noursi, David

    2015-01-01

    Abstract High-throughput assays from genomics, proteomics, metabolomics, and next generation sequencing produce massive omics datasets that are challenging to analyze in biological or clinical contexts. Thus far, there is no publicly available program for converting quantitative omics data into input formats to be used in off-the-shelf robust phylogenetic programs. To the best of our knowledge, this is the first report on creation of two Windows-based programs, OmicsTract and SynpExtractor, to address this gap. We note, as a way of introduction and development of these programs, that one particularly useful bioinformatics inferential modeling is the phylogenetic cladogram. Cladograms are multidimensional tools that show the relatedness between subgroups of healthy and diseased individuals and the latter's shared aberrations; they also reveal some characteristics of a disease that would not otherwise be apparent by other analytical methods. The OmicsTract and SynpExtractor were written for the respective tasks of (1) accommodating advanced phylogenetic parsimony analysis (through standard programs of MIX [from PHYLIP] and TNT), and (2) extracting shared aberrations at the cladogram nodes. OmicsTract converts comma-delimited data tables through assigning each data point into a binary value (“0” for normal states and “1” for abnormal states) then outputs the converted data tables into the proper input file formats for MIX or with embedded commands for TNT. SynapExtractor uses outfiles from MIX and TNT to extract the shared aberrations of each node of the cladogram, matching them with identifying labels from the dataset and exporting them into a comma-delimited file. Labels may be gene identifiers in gene-expression datasets or m/z values in mass spectrometry datasets. By automating these steps, OmicsTract and SynpExtractor offer a veritable opportunity for rapid and standardized phylogenetic analyses of omics data; their model can also be extended to next generation sequencing (NGS) data. We make OmicsTract and SynpExtractor publicly and freely available for non-commercial use in order to strengthen and build capacity for the phylogenetic paradigm of omics analysis. PMID:26230532

  1. METABOLOMICS IN MEDICAL SCIENCES--TRENDS, CHALLENGES AND PERSPECTIVES.

    PubMed

    Klupczyńska, Agnieszka; Dereziński, Paweł; Kokot, Zenon J

    2015-01-01

    Metabolomics is the latest of the "omic" technologies that involves comprehensive analysis of small molecule metabolites of an organism or a specific biological sample. Metabolomics provides an insight into the cell status and describes an actual health condition of organisms. Analysis of metabolome offers a unique opportunity to study the influence of genetic variation, disease, applied treatment or diet on endogenous metabolic state of organisms. There are many areas that might benefit from metabolomic research. In the article some applications of this novel "omic" technology in the field of medical sciences are presented. One of the most popular aims of metabolomic studies is biomarker discovery. Despite using the state-of-art analytical techniques along with advanced bioinformatic tools, metabolomic experiments encounter numerous difficulties and pitfalls. Challenges that researchers in the field of analysis of metabolome have to face include i.a., technical limitations, bioinformatic challenges and integration with other "omic" sciences. One of the grand challenges for studies in the field of metabolomics is to tackle the problem of data analysis, which is probably the most time consuming stage of metabolomic workflow and requires close collaboration between analysts, clinicians and experts in chemometric analysis. Implementation of metabolomics into clinical practice will be dependent on establishment of standardized protocols in analytical performance and data analysis and development of fit-for-purpose biomarker method validation. Metabolomics allows to achieve a sophisticated level of information about biological systems and opens up new perspectives in many fields of medicine, especially in oncology. Apart from its extensive cognitive significance, metabolomics manifests also a practical importance as it may lead to design of new non-invasive, sensitive and specific diagnostic techniques and development of new therapies.

  2. A Multi-layered Quantitative In Vivo Expression Atlas of the Podocyte Unravels Kidney Disease Candidate Genes.

    PubMed

    Rinschen, Markus M; Gödel, Markus; Grahammer, Florian; Zschiedrich, Stefan; Helmstädter, Martin; Kretz, Oliver; Zarei, Mostafa; Braun, Daniela A; Dittrich, Sebastian; Pahmeyer, Caroline; Schroder, Patricia; Teetzen, Carolin; Gee, HeonYung; Daouk, Ghaleb; Pohl, Martin; Kuhn, Elisa; Schermer, Bernhard; Küttner, Victoria; Boerries, Melanie; Busch, Hauke; Schiffer, Mario; Bergmann, Carsten; Krüger, Marcus; Hildebrandt, Friedhelm; Dengjel, Joern; Benzing, Thomas; Huber, Tobias B

    2018-05-22

    Damage to and loss of glomerular podocytes has been identified as the culprit lesion in progressive kidney diseases. Here, we combine mass spectrometry-based proteomics with mRNA sequencing, bioinformatics, and hypothesis-driven studies to provide a comprehensive and quantitative map of mammalian podocytes that identifies unanticipated signaling pathways. Comparison of the in vivo datasets with proteomics data from podocyte cell cultures showed a limited value of available cell culture models. Moreover, in vivo stable isotope labeling by amino acids uncovered surprisingly rapid synthesis of mitochondrial proteins under steady-state conditions that was perturbed under autophagy-deficient, disease-susceptible conditions. Integration of acquired omics dimensions suggested FARP1 as a candidate essential for podocyte function, which could be substantiated by genetic analysis in humans and knockdown experiments in zebrafish. This work exemplifies how the integration of multi-omics datasets can identify a framework of cell-type-specific features relevant for organ health and disease. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Improved bacteriophage genome data is necessary for integrating viral and bacterial ecology.

    PubMed

    Bibby, Kyle

    2014-02-01

    The recent rise in "omics"-enabled approaches has lead to improved understanding in many areas of microbial ecology. However, despite the importance that viruses play in a broad microbial ecology context, viral ecology remains largely not integrated into high-throughput microbial ecology studies. A fundamental hindrance to the integration of viral ecology into omics-enabled microbial ecology studies is the lack of suitable reference bacteriophage genomes in reference databases-currently, only 0.001% of bacteriophage diversity is represented in genome sequence databases. This commentary serves to highlight this issue and to promote bacteriophage genome sequencing as a valuable scientific undertaking to both better understand bacteriophage diversity and move towards a more holistic view of microbial ecology.

  4. Abiotic Stress Responses and Microbe-Mediated Mitigation in Plants: The Omics Strategies

    PubMed Central

    Meena, Kamlesh K.; Sorty, Ajay M.; Bitla, Utkarsh M.; Choudhary, Khushboo; Gupta, Priyanka; Pareek, Ashwani; Singh, Dhananjaya P.; Prabha, Ratna; Sahu, Pramod K.; Gupta, Vijai K.; Singh, Harikesh B.; Krishanani, Kishor K.; Minhas, Paramjit S.

    2017-01-01

    Abiotic stresses are the foremost limiting factors for agricultural productivity. Crop plants need to cope up adverse external pressure created by environmental and edaphic conditions with their intrinsic biological mechanisms, failing which their growth, development, and productivity suffer. Microorganisms, the most natural inhabitants of diverse environments exhibit enormous metabolic capabilities to mitigate abiotic stresses. Since microbial interactions with plants are an integral part of the living ecosystem, they are believed to be the natural partners that modulate local and systemic mechanisms in plants to offer defense under adverse external conditions. Plant-microbe interactions comprise complex mechanisms within the plant cellular system. Biochemical, molecular and physiological studies are paving the way in understanding the complex but integrated cellular processes. Under the continuous pressure of increasing climatic alterations, it now becomes more imperative to define and interpret plant-microbe relationships in terms of protection against abiotic stresses. At the same time, it also becomes essential to generate deeper insights into the stress-mitigating mechanisms in crop plants for their translation in higher productivity. Multi-omics approaches comprising genomics, transcriptomics, proteomics, metabolomics and phenomics integrate studies on the interaction of plants with microbes and their external environment and generate multi-layered information that can answer what is happening in real-time within the cells. Integration, analysis and decipherization of the big-data can lead to a massive outcome that has significant chance for implementation in the fields. This review summarizes abiotic stresses responses in plants in-terms of biochemical and molecular mechanisms followed by the microbe-mediated stress mitigation phenomenon. We describe the role of multi-omics approaches in generating multi-pronged information to provide a better understanding of plant–microbe interactions that modulate cellular mechanisms in plants under extreme external conditions and help to optimize abiotic stresses. Vigilant amalgamation of these high-throughput approaches supports a higher level of knowledge generation about root-level mechanisms involved in the alleviation of abiotic stresses in organisms. PMID:28232845

  5. Abiotic Stress Responses and Microbe-Mediated Mitigation in Plants: The Omics Strategies.

    PubMed

    Meena, Kamlesh K; Sorty, Ajay M; Bitla, Utkarsh M; Choudhary, Khushboo; Gupta, Priyanka; Pareek, Ashwani; Singh, Dhananjaya P; Prabha, Ratna; Sahu, Pramod K; Gupta, Vijai K; Singh, Harikesh B; Krishanani, Kishor K; Minhas, Paramjit S

    2017-01-01

    Abiotic stresses are the foremost limiting factors for agricultural productivity. Crop plants need to cope up adverse external pressure created by environmental and edaphic conditions with their intrinsic biological mechanisms, failing which their growth, development, and productivity suffer. Microorganisms, the most natural inhabitants of diverse environments exhibit enormous metabolic capabilities to mitigate abiotic stresses. Since microbial interactions with plants are an integral part of the living ecosystem, they are believed to be the natural partners that modulate local and systemic mechanisms in plants to offer defense under adverse external conditions. Plant-microbe interactions comprise complex mechanisms within the plant cellular system. Biochemical, molecular and physiological studies are paving the way in understanding the complex but integrated cellular processes. Under the continuous pressure of increasing climatic alterations, it now becomes more imperative to define and interpret plant-microbe relationships in terms of protection against abiotic stresses. At the same time, it also becomes essential to generate deeper insights into the stress-mitigating mechanisms in crop plants for their translation in higher productivity. Multi-omics approaches comprising genomics, transcriptomics, proteomics, metabolomics and phenomics integrate studies on the interaction of plants with microbes and their external environment and generate multi-layered information that can answer what is happening in real-time within the cells. Integration, analysis and decipherization of the big-data can lead to a massive outcome that has significant chance for implementation in the fields. This review summarizes abiotic stresses responses in plants in-terms of biochemical and molecular mechanisms followed by the microbe-mediated stress mitigation phenomenon. We describe the role of multi-omics approaches in generating multi-pronged information to provide a better understanding of plant-microbe interactions that modulate cellular mechanisms in plants under extreme external conditions and help to optimize abiotic stresses. Vigilant amalgamation of these high-throughput approaches supports a higher level of knowledge generation about root-level mechanisms involved in the alleviation of abiotic stresses in organisms.

  6. GeneLab

    NASA Technical Reports Server (NTRS)

    Berrios, Daniel C.; Thompson, Terri G.

    2015-01-01

    NASA GeneLab is expected to capture and distribute omics data and experimental and process conditions most relevant to research community in their statistical and theoretical analysis of NASAs omics data.

  7. TransOmic analysis of forebrain sections in Sp2 conditional knockout embryonic mice using IR-MALDESI imaging of lipids and LC-MS/MS label-free proteomics

    PubMed Central

    Loziuk, Philip; Meier, Florian; Johnson, Caroline

    2016-01-01

    Quantitative methods for detection of biological molecules are needed more than ever before in the emerging age of “omics” and “big data.” Here, we provide an integrated approach for systematic analysis of the “lipidome” in tissue. To test our approach in a biological context, we utilized brain tissue selectively deficient for the transcription factor Specificity Protein 2 (Sp2). Conditional deletion of Sp2 in the mouse cerebral cortex results in developmental deficiencies including disruption of lipid metabolism. Silver (Ag) cationization was implemented for infrared matrix-assisted laser desorption electrospray ionization (IR-MALDESI) to enhance the ion abundances for olefinic lipids, as these have been linked to regulation by Sp2. Combining Ag-doped and conventional IR-MALDESI imaging, this approach was extended to IR-MALDESI imaging of embryonic mouse brains. Further, our imaging technique was combined with bottom-up shotgun proteomic LC-MS/MS analysis and western blot for comparing Sp2 conditional knockout (Sp2-cKO) and wild-type (WT) cortices of tissue sections. This provided an integrated omics dataset which revealed many specific changes to fundamental cellular processes and biosynthetic pathways. In particular, step-specific altered abundances of nucleotides, lipids, and associated proteins were observed in the cerebral cortices of Sp2-cKO embryos. PMID:26942738

  8. Integrated multi-omics analyses reveal the biochemical mechanisms and phylogenetic relevance of anaerobic androgen biodegradation in the environment

    PubMed Central

    Yang, Fu-Chun; Chen, Yi-Lung; Tang, Sen-Lin; Yu, Chang-Ping; Wang, Po-Hsiang; Ismail, Wael; Wang, Chia-Hsiang; Ding, Jiun-Yan; Yang, Cheng-Yu; Yang, Chia-Ying; Chiang, Yin-Ru

    2016-01-01

    Steroid hormones, such as androgens, are common surface-water contaminants. However, literature on the ecophysiological relevance of steroid-degrading organisms in the environment, particularly in anoxic ecosystems, is extremely limited. We previously reported that Steroidobacter denitrificans anaerobically degrades androgens through the 2,3-seco pathway. In this study, the genome of Sdo. denitrificans was completely sequenced. Transcriptomic data revealed gene clusters that were distinctly expressed during anaerobic growth on testosterone. We isolated and characterized the bifunctional 1-testosterone hydratase/dehydrogenase, which is essential for anaerobic degradation of steroid A-ring. Because of apparent substrate preference of this molybdoenzyme, corresponding genes, along with the signature metabolites of the 2,3-seco pathway, were used as biomarkers to investigate androgen biodegradation in the largest sewage treatment plant in Taipei, Taiwan. Androgen metabolite analysis indicated that denitrifying bacteria in anoxic sewage use the 2,3-seco pathway to degrade androgens. Metagenomic analysis and PCR-based functional assays showed androgen degradation in anoxic sewage by Thauera spp. through the action of 1-testosterone hydratase/dehydrogenase. Our integrative ‘omics' approach can be used for culture-independent investigations of the microbial degradation of structurally complex compounds where isotope-labeled substrates are not easily available. PMID:26872041

  9. Creation of a digital slide and tissue microarray resource from a multi-institutional predictive toxicology study in the rat: an initial report from the PredTox group.

    PubMed

    Mulrane, Laoighse; Rexhepaj, Elton; Smart, Valerie; Callanan, John J; Orhan, Diclehan; Eldem, Türkan; Mally, Angela; Schroeder, Susanne; Meyer, Kirstin; Wendt, Maria; O'Shea, Donal; Gallagher, William M

    2008-08-01

    The widespread use of digital slides has only recently come to the fore with the development of high-throughput scanners and high performance viewing software. This development, along with the optimisation of compression standards and image transfer techniques, has allowed the technology to be used in wide reaching applications including integration of images into hospital information systems and histopathological training, as well as the development of automated image analysis algorithms for prediction of histological aberrations and quantification of immunohistochemical stains. Here, the use of this technology in the creation of a comprehensive library of images of preclinical toxicological relevance is demonstrated. The images, acquired using the Aperio ScanScope CS and XT slide acquisition systems, form part of the ongoing EU FP6 Integrated Project, Innovative Medicines for Europe (InnoMed). In more detail, PredTox (abbreviation for Predictive Toxicology) is a subproject of InnoMed and comprises a consortium of 15 industrial (13 large pharma, 1 technology provider and 1 SME) and three academic partners. The primary aim of this consortium is to assess the value of combining data generated from 'omics technologies (proteomics, transcriptomics, metabolomics) with the results from more conventional toxicology methods, to facilitate further informed decision making in preclinical safety evaluation. A library of 1709 scanned images was created of full-face sections of liver and kidney tissue specimens from male Wistar rats treated with 16 proprietary and reference compounds of known toxicity; additional biological materials from these treated animals were separately used to create 'omics data, that will ultimately be used to populate an integrated toxicological database. In respect to assessment of the digital slides, a web-enabled digital slide management system, Digital SlideServer (DSS), was employed to enable integration of the digital slide content into the 'omics database and to facilitate remote viewing by pathologists connected with the project. DSS also facilitated manual annotation of digital slides by the pathologists, specifically in relation to marking particular lesions of interest. Tissue microarrays (TMAs) were constructed from the specimens for the purpose of creating a repository of tissue from animals used in the study with a view to later-stage biomarker assessment. As the PredTox consortium itself aims to identify new biomarkers of toxicity, these TMAs will be a valuable means of validation. In summary, a large repository of histological images was created enabling the subsequent pathological analysis of samples through remote viewing and, along with the utilisation of TMA technology, will allow the validation of biomarkers identified by the PredTox consortium. The population of the PredTox database with these digitised images represents the creation of the first toxicological database integrating 'omics and preclinical data with histological images.

  10. ICM: a web server for integrated clustering of multi-dimensional biomedical data.

    PubMed

    He, Song; He, Haochen; Xu, Wenjian; Huang, Xin; Jiang, Shuai; Li, Fei; He, Fuchu; Bo, Xiaochen

    2016-07-08

    Large-scale efforts for parallel acquisition of multi-omics profiling continue to generate extensive amounts of multi-dimensional biomedical data. Thus, integrated clustering of multiple types of omics data is essential for developing individual-based treatments and precision medicine. However, while rapid progress has been made, methods for integrated clustering are lacking an intuitive web interface that facilitates the biomedical researchers without sufficient programming skills. Here, we present a web tool, named Integrated Clustering of Multi-dimensional biomedical data (ICM), that provides an interface from which to fuse, cluster and visualize multi-dimensional biomedical data and knowledge. With ICM, users can explore the heterogeneity of a disease or a biological process by identifying subgroups of patients. The results obtained can then be interactively modified by using an intuitive user interface. Researchers can also exchange the results from ICM with collaborators via a web link containing a Project ID number that will directly pull up the analysis results being shared. ICM also support incremental clustering that allows users to add new sample data into the data of a previous study to obtain a clustering result. Currently, the ICM web server is available with no login requirement and at no cost at http://biotech.bmi.ac.cn/icm/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Progress in oral personalized medicine: contribution of 'omics'.

    PubMed

    Glurich, Ingrid; Acharya, Amit; Brilliant, Murray H; Shukla, Sanjay K

    2015-01-01

    Precision medicine (PM), representing clinically applicable personalized medicine, proactively integrates and interprets multidimensional personal health data, including clinical, 'omics', and environmental profiles, into clinical practice. Realization of PM remains in progress. The focus of this review is to provide a descriptive narrative overview of: 1) the current status of oral personalized medicine; and 2) recent advances in genomics and related 'omic' and emerging research domains contributing to advancing oral-systemic PM, with special emphasis on current understanding of oral microbiomes. A scan of peer-reviewed literature describing oral PM or 'omic'-based research conducted on humans/data published in English within the last 5 years in journals indexed in the PubMed database was conducted using mesh search terms. An evidence-based approach was used to report on recent advances with potential to advance PM in the context of historical critical and systematic reviews to delineate current state-of-the-art technologies. Special focus was placed on oral microbiome research associated with health and disease states, emerging research domains, and technological advances, which are positioning realization of PM. This review summarizes: 1) evolving conceptualization of personalized medicine; 2) emerging insight into roles of oral infectious and inflammatory processes as contributors to both oral and systemic diseases; 3) community shifts in microbiota that may contribute to disease; 4) evidence pointing to new uncharacterized potential oral pathogens; 5) advances in technological approaches to 'omics' research that will accelerate PM; 6) emerging research domains that expand insights into host-microbe interaction including inter-kingdom communication, systems and network analysis, and salivaomics; and 7) advances in informatics and big data analysis capabilities to facilitate interpretation of host and microbiome-associated datasets. Furthermore, progress in clinically applicable screening assays and biomarker definition to inform clinical care are briefly explored. Advancement of oral PM currently remains in research and discovery phases. Although substantive progress has been made in advancing the understanding of the role of microbiome dynamics in health and disease and is being leveraged to advance early efforts at clinical translation, further research is required to discern interpretable constituency patterns in the complex interactions of these microbial communities in health and disease. Advances in biotechnology and bioinformatics facilitating novel approaches to rapid analysis and interpretation of large datasets are providing new insights into oral health and disease, potentiating clinical application and advancing realization of PM within the next decade.

  12. Mechanism of cisplatin proximal tubule toxicity revealed by integrating transcriptomics, proteomics, metabolomics and biokinetics.

    PubMed

    Wilmes, Anja; Bielow, Chris; Ranninger, Christina; Bellwon, Patricia; Aschauer, Lydia; Limonciel, Alice; Chassaigne, Hubert; Kristl, Theresa; Aiche, Stephan; Huber, Christian G; Guillou, Claude; Hewitt, Philipp; Leonard, Martin O; Dekant, Wolfgang; Bois, Frederic; Jennings, Paul

    2015-12-25

    Cisplatin is one of the most widely used chemotherapeutic agents for the treatment of solid tumours. The major dose-limiting factor is nephrotoxicity, in particular in the proximal tubule. Here, we use an integrated omics approach, including transcriptomics, proteomics and metabolomics coupled to biokinetics to identify cell stress response pathways induced by cisplatin. The human renal proximal tubular cell line RPTEC/TERT1 was treated with sub-cytotoxic concentrations of cisplatin (0.5 and 2 μM) in a daily repeat dose treating regime for up to 14 days. Biokinetic analysis showed that cisplatin was taken up from the basolateral compartment, transported to the apical compartment, and accumulated in cells over time. This is in line with basolateral uptake of cisplatin via organic cation transporter 2 and bioactivation via gamma-glutamyl transpeptidase located on the apical side of proximal tubular cells. Cisplatin affected several pathways including, p53 signalling, Nrf2 mediated oxidative stress response, mitochondrial processes, mTOR and AMPK signalling. In addition, we identified novel pathways changed by cisplatin, including eIF2 signalling, actin nucleation via the ARP/WASP complex and regulation of cell polarization. In conclusion, using an integrated omic approach together with biokinetics we have identified both novel and established mechanisms of cisplatin toxicity. Copyright © 2014 Elsevier Ltd. All rights reserved.

  13. Sparking Thinking: Studying Modern Precision Medicine Will Accelerate the Progression of Traditional Chinese Medicine Patterns.

    PubMed

    Liu, Bao-Cheng; Ji, Guang

    2017-07-01

    Incorporating "-omics" studies with environmental interactions could help elucidate the biological mechanisms responsible for Traditional Chinese Medicine (TCM) patterns. Based on the authors' own experiences, this review outlines a model of an ideal combination of "-omics" biomarkers, environmental factors, and TCM pattern classifications; provides a narrative review of the relevant genetic and TCM studies; and lists several successful integrative examples. Two integration tools are briefly introduced. The first is the integration of modern devices into objective diagnostic methods of TCM patterning, which would improve current clinical decision-making and practice. The second is the use of biobanks and data platforms, which could broadly support biological and medical research. Such efforts will transform current medical management and accelerate the progression of precision medicine.

  14. Big Data Transforms Discovery-Utilization Therapeutics Continuum

    PubMed Central

    Waldman, SA; Terzic, A

    2015-01-01

    Enabling omic technologies adopt a holistic view to produce unprecedented insights into the molecular underpinnings of health and disease, in part, by generating massive high-dimensional biological data. Leveraging these systems-level insights as an engine driving the healthcare evolution is maximized through integration with medical, demographic, and environmental datasets from individuals to populations. Big data analytics has accordingly emerged to add value to the technical aspects of storage, transfer, and analysis required for merging vast arrays of omic-, clinical- and eco-datasets. In turn, this new field at the interface of biology, medicine, and information science is systematically transforming modern therapeutics across discovery, development, regulation, and utilization. “…a man's discourse was like to a rich Persian carpet, the beautiful figures and patterns of which can be shown only by spreading and extending it out; when it is contracted and folded up, they are obscured and lost” Themistocles quoted by Plutarch AD 46 – AD 120 PMID:26888297

  15. Omics Research on the International Space Station

    NASA Technical Reports Server (NTRS)

    Love, John

    2015-01-01

    The International Space Station (ISS) is an orbiting laboratory whose goals include advancing science and technology research. Completion of ISS assembly ushered a new era focused on utilization, encompassing multiple disciplines such as Biology and Biotechnology, Physical Sciences, Technology Development and Demonstration, Human Research, Earth and Space Sciences, and Educational Activities. The research complement planned for upcoming ISS Expeditions 45&46 includes several investigations in the new field of omics, which aims to collectively characterize sets of biomolecules (e.g., genomic, epigenomic, transcriptomic, proteomic, and metabolomic products) that translate into organismic structure and function. For example, Multi-Omics is a JAXA investigation that analyzes human microbial metabolic cross-talk in the space ecosystem by evaluating data from immune dysregulation biomarkers, metabolic profiles, and microbiota composition. The NASA OsteoOmics investigation studies gravitational regulation of osteoblast genomics and metabolism. Tissue Regeneration uses pan-omics approaches with cells cultured in bioreactors to characterize factors involved in mammalian bone tissue regeneration in microgravity. Rodent Research-3 includes an experiment that implements pan-omics to evaluate therapeutically significant molecular circuits, markers, and biomaterials associated with microgravity wound healing and tissue regeneration in bone defective rodents. The JAXA Mouse Epigenetics investigation examines molecular alterations in organ specific gene expression patterns and epigenetic modifications, and analyzes murine germ cell development during long term spaceflight. Lastly, Twins Study ("Differential effects of homozygous twin astronauts associated with differences in exposure to spaceflight factors"), NASA's first foray into human omics research, applies integrated analyses to assess biomolecular responses to physical, physiological, and environmental stressors associated with spaceflight.

  16. Mastitomics, the integrated omics of bovine milk in an experimental model of Streptococcus uberis mastitis: 2. Label-free relative quantitative proteomics.

    PubMed

    Mudaliar, Manikhandan; Tassi, Riccardo; Thomas, Funmilola C; McNeilly, Tom N; Weidt, Stefan K; McLaughlin, Mark; Wilson, David; Burchmore, Richard; Herzyk, Pawel; Eckersall, P David; Zadoks, Ruth N

    2016-08-16

    Mastitis, inflammation of the mammary gland, is the most common and costly disease of dairy cattle in the western world. It is primarily caused by bacteria, with Streptococcus uberis as one of the most prevalent causative agents. To characterize the proteome during Streptococcus uberis mastitis, an experimentally induced model of intramammary infection was used. Milk whey samples obtained from 6 cows at 6 time points were processed using label-free relative quantitative proteomics. This proteomic analysis complements clinical, bacteriological and immunological studies as well as peptidomic and metabolomic analysis of the same challenge model. A total of 2552 non-redundant bovine peptides were identified, and from these, 570 bovine proteins were quantified. Hierarchical cluster analysis and principal component analysis showed clear clustering of results by stage of infection, with similarities between pre-infection and resolution stages (0 and 312 h post challenge), early infection stages (36 and 42 h post challenge) and late infection stages (57 and 81 h post challenge). Ingenuity pathway analysis identified upregulation of acute phase protein pathways over the course of infection, with dominance of different acute phase proteins at different time points based on differential expression analysis. Antimicrobial peptides, notably cathelicidins and peptidoglycan recognition protein, were upregulated at all time points post challenge and peaked at 57 h, which coincided with 10 000-fold decrease in average bacterial counts. The integration of clinical, bacteriological, immunological and quantitative proteomics and other-omic data provides a more detailed systems level view of the host response to mastitis than has been achieved previously.

  17. The PerkinElmer Omics Laboratory (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Smith, Todd

    2017-12-22

    Todd Smith of the PerkinElmer Omics Laboratory gives a talk about his lab and its work at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  18. The PerkinElmer Omics Laboratory (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smith, Todd

    2012-06-01

    Todd Smith of the PerkinElmer Omics Laboratory gives a talk about his lab and its work at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  19. The state of rhizospheric science in the era of multi-omics: A practical guide to omics technologies

    DOE PAGES

    White, Richard Allen; Rivas-Ubach, Albert; Borkum, Mark I.; ...

    2017-05-06

    Over the past century, the significance of the rhizosphere has been increasingly recognized by the scientific community. Furthermore, this complex biological system is comprised of vast interconnected networks of microbial organisms that interact directly with their plant hosts, including archaea, bacteria, fungi, picoeukaryotes, and viruses. The rhizosphere provides a nutritional base to the terrestrial biosphere, and is integral to plant growth, crop production, and ecosystem health. There is little mechanistic understanding of the rhizosphere, but, and that constitutes a critical knowledge gap. It inhibits our ability to predict and control the terrestrial ecosystem to achieve desirable outcomes, such as bioenergymore » production, crop yield maximization, and soil-based carbon sequestration. Multi-omics have the potential to significantly advance our knowledge of rhizospheric science. Our review covers multi-omic techniques and technologies; methods and protocols for specific rhizospheric science questions; and the challenges to be addressed during this century of rhizospheric science.« less

  20. The state of rhizospheric science in the era of multi-omics: A practical guide to omics technologies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    White, Richard Allen; Rivas-Ubach, Albert; Borkum, Mark I.

    Over the past century, the significance of the rhizosphere has been increasingly recognized by the scientific community. Furthermore, this complex biological system is comprised of vast interconnected networks of microbial organisms that interact directly with their plant hosts, including archaea, bacteria, fungi, picoeukaryotes, and viruses. The rhizosphere provides a nutritional base to the terrestrial biosphere, and is integral to plant growth, crop production, and ecosystem health. There is little mechanistic understanding of the rhizosphere, but, and that constitutes a critical knowledge gap. It inhibits our ability to predict and control the terrestrial ecosystem to achieve desirable outcomes, such as bioenergymore » production, crop yield maximization, and soil-based carbon sequestration. Multi-omics have the potential to significantly advance our knowledge of rhizospheric science. Our review covers multi-omic techniques and technologies; methods and protocols for specific rhizospheric science questions; and the challenges to be addressed during this century of rhizospheric science.« less

  1. The state of rhizospheric science in the era of multi-omics: A practical guide to omics technologies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    White, Richard Allen; Rivas-Ubach, Albert; Borkum, Mark I.

    Over the past century, the significance of the rhizosphere as a complex, biological system, comprised of vast, interconnected networks of microbial organisms that interact directly with their plant hosts (e.g., archæa, bacteria, fungi, eukaryotes, and viruses) has been increasingly recognized by the scientific community. Providing a nutritional base to the terrestrial biosphere, the rhizosphere is integral to plant growth, crop production and ecosystem health. Lack of mechanistic understanding of the rhizosphere constitutes a critical knowledge gap, inhibiting our ability to predict and control the terrestrial ecosystem in order to achieve desirable outcomes (e.g., bioenergy production, crop yield maximization, and soilbasedmore » carbon sequestration). Application of multi-omics has the potential to significantly advance our knowledge of rhizospheric science. This review covers: cutting- and bleeding-edge, multi-omic techniques and technologies; methods and protocols for specific rhizospheric science questions; and, challenges to be addressed during this century of rhizospheric science.« less

  2. Linking Ecological, Environmental and Biogeochemical Data with Multi'omics Analysis

    NASA Astrophysics Data System (ADS)

    Hasler-Sheetal, H.; Castorani, M. C.; Fragner, L.; Zeng, Y.; Holmer, M.; Glud, R. N.; Weckwerth, W.; Canfield, D. E.

    2016-02-01

    The integrated analysis of multi'omics and environmental data provides a holistic understanding of biological processes and has been proven to be challenging. Here we present our research concept for conducting multi-omics experiments and linking them to environmental data. Hypoxia, reduced light availability and species interaction - all amplified by global warming - cause a global decline of seagrasses. Metabolic mechanisms for coping with these global threats are largely unknown and multi'omics approaches can be an important approach for generating this insight. We applied GC, LC-qTOF-MS and bioinformatics to investigate the effects of environmental pressure on metabolites present in seagrasses. In a first experiment we assessed the metabolomics response of the seagrass Zostera marina towards anoxia and showed that photosynthetically derived oxygen could satisfy the oxygen demand in the leaves. But accumulation of fermentation products in the roots showed that the rhizosphere was under anoxic stress. In contrast nocturnal anoxia caused a biphasic shift in the metabolome of roots and leaves. This nocturnal reprogramming of the metabolome under anoxia indicates a mitigation mechanism to avoid the toxic effects. A pathway enrichment analysis proposes the alanine shunt, the GABA shunt and the 2-oxoglutarate shunt as such mitigation mechanisms that alleviate pyruvate levels and lead to carbon and nitrogen storage during anoxia. In a second experiment, varying light exposure and species interaction of Z. marina with the blue mussel Mytilus edulis - a co-occurring species in seagrass systems - resulted in treatment specific metabolic fingerprints in seagrass. Light modified the metabolic fingerprint expressed in Z. marina to the presence of mussels, indicating varying physiological responses to mussels in normal and low light regimes. Multivariate data-analysis indicated light exposure as main driver (45%) and mussel presence as minor driver (13%) for the metabolic responses. Traditional plant performance parameters exhibited light dependent variation but in contrast to the metabolome none of these parameters were dependent on the presence of M. edulis. This demonstrates the applicability of metabolomics to reveal hidden effects of environmental pressure on seagrasses.

  3. Reconstruction of genome-scale human metabolic models using omics data.

    PubMed

    Ryu, Jae Yong; Kim, Hyun Uk; Lee, Sang Yup

    2015-08-01

    The impact of genome-scale human metabolic models on human systems biology and medical sciences is becoming greater, thanks to increasing volumes of model building platforms and publicly available omics data. The genome-scale human metabolic models started with Recon 1 in 2007, and have since been used to describe metabolic phenotypes of healthy and diseased human tissues and cells, and to predict therapeutic targets. Here we review recent trends in genome-scale human metabolic modeling, including various generic and tissue/cell type-specific human metabolic models developed to date, and methods, databases and platforms used to construct them. For generic human metabolic models, we pay attention to Recon 2 and HMR 2.0 with emphasis on data sources used to construct them. Draft and high-quality tissue/cell type-specific human metabolic models have been generated using these generic human metabolic models. Integration of tissue/cell type-specific omics data with the generic human metabolic models is the key step, and we discuss omics data and their integration methods to achieve this task. The initial version of the tissue/cell type-specific human metabolic models can further be computationally refined through gap filling, reaction directionality assignment and the subcellular localization of metabolic reactions. We review relevant tools for this model refinement procedure as well. Finally, we suggest the direction of further studies on reconstructing an improved human metabolic model.

  4. Multi-omics analysis provides insight to the Ignicoccus hospitalis - Nanoarchaeum equitans association

    DOE PAGES

    Rawle, Rachel A.; Hamerly, Timothy; Tripet, Brian P.; ...

    2017-06-04

    Studies of interspecies interactions are inherently difficult due to the complex mechanisms which enable these relationships. A model system for studying interspecies interactions is the marine hyperthermophiles Ignicoccus hospitalis and Nanoarchaeum equitans. Recent independently-conducted ‘omics’ analyses have generated insights into the molecular factors modulating this association. However, significant questions remain about the nature of the interactions between these archaea. We jointly analyzed multiple levels of omics datasets obtained from published, independent transcriptomics, proteomics, and metabolomics analyses. DAVID identified functionally-related groups enriched when I. hospitalis is grown alone or in co-culture with N. equitans. Enriched molecular pathways were subsequently visualized usingmore » interaction maps generated using STRING. Key findings of our multi-level omics analysis indicated that I. hospitalis provides precursors to N. equitans for energy metabolism. Analysis indicated an overall reduction in diversity of metabolic precursors in the I. hospitalis–N. equitans co-culture, which has been connected to the differential use of ribosomal subunits and was previously unnoticed. We also identified differences in precursors linked to amino acid metabolism, NADH metabolism, and carbon fixation, providing new insights into the metabolic adaptions of I. hospitalis enabling the growth of N. equitans. In conclusion, this multi-omics analysis builds upon previously identified cellular patterns while offering new insights into mechanisms that enable the I. hospitalis–N. equitans association. This study applies statistical and visualization techniques to a mixed-source omics dataset to yield a more global insight into a complex system, that was not readily discernable from separate omics studies.« less

  5. Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework.

    PubMed

    Voillet, Valentin; Besse, Philippe; Liaubet, Laurence; San Cristobal, Magali; González, Ignacio

    2016-10-03

    In omics data integration studies, it is common, for a variety of reasons, for some individuals to not be present in all data tables. Missing row values are challenging to deal with because most statistical methods cannot be directly applied to incomplete datasets. To overcome this issue, we propose a multiple imputation (MI) approach in a multivariate framework. In this study, we focus on multiple factor analysis (MFA) as a tool to compare and integrate multiple layers of information. MI involves filling the missing rows with plausible values, resulting in M completed datasets. MFA is then applied to each completed dataset to produce M different configurations (the matrices of coordinates of individuals). Finally, the M configurations are combined to yield a single consensus solution. We assessed the performance of our method, named MI-MFA, on two real omics datasets. Incomplete artificial datasets with different patterns of missingness were created from these data. The MI-MFA results were compared with two other approaches i.e., regularized iterative MFA (RI-MFA) and mean variable imputation (MVI-MFA). For each configuration resulting from these three strategies, the suitability of the solution was determined against the true MFA configuration obtained from the original data and a comprehensive graphical comparison showing how the MI-, RI- or MVI-MFA configurations diverge from the true configuration was produced. Two approaches i.e., confidence ellipses and convex hulls, to visualize and assess the uncertainty due to missing values were also described. We showed how the areas of ellipses and convex hulls increased with the number of missing individuals. A free and easy-to-use code was proposed to implement the MI-MFA method in the R statistical environment. We believe that MI-MFA provides a useful and attractive method for estimating the coordinates of individuals on the first MFA components despite missing rows. MI-MFA configurations were close to the true configuration even when many individuals were missing in several data tables. This method takes into account the uncertainty of MI-MFA configurations induced by the missing rows, thereby allowing the reliability of the results to be evaluated.

  6. Genelab: Scientific Partnerships and an Open-Access Database to Maximize Usage of Omics Data from Space Biology Experiments

    NASA Technical Reports Server (NTRS)

    Reinsch, S. S.; Galazka, J..; Berrios, D. C; Chakravarty, K.; Fogle, H.; Lai, S.; Bokyo, V.; Timucin, L. R.; Tran, P.; Skidmore, M.

    2016-01-01

    NASA's mission includes expanding our understanding of biological systems to improve life on Earth and to enable long-duration human exploration of space. The GeneLab Data System (GLDS) is NASA's premier open-access omics data platform for biological experiments. GLDS houses standards-compliant, high-throughput sequencing and other omics data from spaceflight-relevant experiments. The GeneLab project at NASA-Ames Research Center is developing the database, and also partnering with spaceflight projects through sharing or augmentation of experiment samples to expand omics analyses on precious spaceflight samples. The partnerships ensure that the maximum amount of data is garnered from spaceflight experiments and made publically available as rapidly as possible via the GLDS. GLDS Version 1.0, went online in April 2015. Software updates and new data releases occur at least quarterly. As of October 2016, the GLDS contains 80 datasets and has search and download capabilities. Version 2.0 is slated for release in September of 2017 and will have expanded, integrated search capabilities leveraging other public omics databases (NCBI GEO, PRIDE, MG-RAST). Future versions in this multi-phase project will provide a collaborative platform for omics data analysis. Data from experiments that explore the biological effects of the spaceflight environment on a wide variety of model organisms are housed in the GLDS including data from rodents, invertebrates, plants and microbes. Human datasets are currently limited to those with anonymized data (e.g., from cultured cell lines). GeneLab ensures prompt release and open access to high-throughput genomics, transcriptomics, proteomics, and metabolomics data from spaceflight and ground-based simulations of microgravity, radiation or other space environment factors. The data are meticulously curated to assure that accurate experimental and sample processing metadata are included with each data set. GLDS download volumes indicate strong interest of the scientific community in these data. To date GeneLab has partnered with multiple experiments including two plant (Arabidopsis thaliana) experiments, two mice experiments, and several microbe experiments. GeneLab optimized protocols in the rodent partnerships for maximum yield of RNA, DNA and protein from tissues harvested and preserved during the SpaceX-4 mission, as well as from tissues from mice that were frozen intact during spaceflight and later dissected on the ground. Analysis of GeneLab data will contribute fundamental knowledge of how the space environment affects biological systems, and as well as yield terrestrial benefits resulting from mitigation strategies to prevent effects observed during exposure to space environments.

  7. [Clinical value evaluation of Chinese herbal formula in context of multi-omics network].

    PubMed

    Li, Bing; Han, Fei; Wang, Zhong; Wang, Yong-Yan

    2017-03-01

    Clinical value evaluation is the key issue to solve the problems such as high repetition rate, fuzzy clinical positioning, broad indications and unclear clinical values in Chinese herbal formula(Chinese patent medicine). By analyzing the challenges and opportunities of Chinese herbal formula in clinical value evaluation, this paper introduced a strategy of multi-omic network analysis. Through comparative analysis of three stroke treatment formulas, we suggested their different characteristic advantages for variant symptoms or phenotypes of stroke, which may provide reference for rational clinical choice. Such multi-omic network analysis strategy may open a unique angle of view for clinical evaluation and comparison of Chinese herbal formula. Copyright© by the Chinese Pharmaceutical Association.

  8. IPF-LASSO: Integrative L 1-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data

    PubMed Central

    Jiang, Xiaoyu; Fuchs, Mathias

    2017-01-01

    As modern biotechnologies advance, it has become increasingly frequent that different modalities of high-dimensional molecular data (termed “omics” data in this paper), such as gene expression, methylation, and copy number, are collected from the same patient cohort to predict the clinical outcome. While prediction based on omics data has been widely studied in the last fifteen years, little has been done in the statistical literature on the integration of multiple omics modalities to select a subset of variables for prediction, which is a critical task in personalized medicine. In this paper, we propose a simple penalized regression method to address this problem by assigning different penalty factors to different data modalities for feature selection and prediction. The penalty factors can be chosen in a fully data-driven fashion by cross-validation or by taking practical considerations into account. In simulation studies, we compare the prediction performance of our approach, called IPF-LASSO (Integrative LASSO with Penalty Factors) and implemented in the R package ipflasso, with the standard LASSO and sparse group LASSO. The use of IPF-LASSO is also illustrated through applications to two real-life cancer datasets. All data and codes are available on the companion website to ensure reproducibility. PMID:28546826

  9. The KUPNetViz: a biological network viewer for multiple -omics datasets in kidney diseases.

    PubMed

    Moulos, Panagiotis; Klein, Julie; Jupp, Simon; Stevens, Robert; Bascands, Jean-Loup; Schanstra, Joost P

    2013-07-24

    Constant technological advances have allowed scientists in biology to migrate from conventional single-omics to multi-omics experimental approaches, challenging bioinformatics to bridge this multi-tiered information. Ongoing research in renal biology is no exception. The results of large-scale and/or high throughput experiments, presenting a wealth of information on kidney disease are scattered across the web. To tackle this problem, we recently presented the KUPKB, a multi-omics data repository for renal diseases. In this article, we describe KUPNetViz, a biological graph exploration tool allowing the exploration of KUPKB data through the visualization of biomolecule interactions. KUPNetViz enables the integration of multi-layered experimental data over different species, renal locations and renal diseases to protein-protein interaction networks and allows association with biological functions, biochemical pathways and other functional elements such as miRNAs. KUPNetViz focuses on the simplicity of its usage and the clarity of resulting networks by reducing and/or automating advanced functionalities present in other biological network visualization packages. In addition, it allows the extrapolation of biomolecule interactions across different species, leading to the formulations of new plausible hypotheses, adequate experiment design and to the suggestion of novel biological mechanisms. We demonstrate the value of KUPNetViz by two usage examples: the integration of calreticulin as a key player in a larger interaction network in renal graft rejection and the novel observation of the strong association of interleukin-6 with polycystic kidney disease. The KUPNetViz is an interactive and flexible biological network visualization and exploration tool. It provides renal biologists with biological network snapshots of the complex integrated data of the KUPKB allowing the formulation of new hypotheses in a user friendly manner.

  10. The KUPNetViz: a biological network viewer for multiple -omics datasets in kidney diseases

    PubMed Central

    2013-01-01

    Background Constant technological advances have allowed scientists in biology to migrate from conventional single-omics to multi-omics experimental approaches, challenging bioinformatics to bridge this multi-tiered information. Ongoing research in renal biology is no exception. The results of large-scale and/or high throughput experiments, presenting a wealth of information on kidney disease are scattered across the web. To tackle this problem, we recently presented the KUPKB, a multi-omics data repository for renal diseases. Results In this article, we describe KUPNetViz, a biological graph exploration tool allowing the exploration of KUPKB data through the visualization of biomolecule interactions. KUPNetViz enables the integration of multi-layered experimental data over different species, renal locations and renal diseases to protein-protein interaction networks and allows association with biological functions, biochemical pathways and other functional elements such as miRNAs. KUPNetViz focuses on the simplicity of its usage and the clarity of resulting networks by reducing and/or automating advanced functionalities present in other biological network visualization packages. In addition, it allows the extrapolation of biomolecule interactions across different species, leading to the formulations of new plausible hypotheses, adequate experiment design and to the suggestion of novel biological mechanisms. We demonstrate the value of KUPNetViz by two usage examples: the integration of calreticulin as a key player in a larger interaction network in renal graft rejection and the novel observation of the strong association of interleukin-6 with polycystic kidney disease. Conclusions The KUPNetViz is an interactive and flexible biological network visualization and exploration tool. It provides renal biologists with biological network snapshots of the complex integrated data of the KUPKB allowing the formulation of new hypotheses in a user friendly manner. PMID:23883183

  11. Integrative Sparse K-Means With Overlapping Group Lasso in Genomic Applications for Disease Subtype Discovery

    PubMed Central

    Huo, Zhiguang; Tseng, George

    2017-01-01

    Cancer subtypes discovery is the first step to deliver personalized medicine to cancer patients. With the accumulation of massive multi-level omics datasets and established biological knowledge databases, omics data integration with incorporation of rich existing biological knowledge is essential for deciphering a biological mechanism behind the complex diseases. In this manuscript, we propose an integrative sparse K-means (is-K means) approach to discover disease subtypes with the guidance of prior biological knowledge via sparse overlapping group lasso. An algorithm using an alternating direction method of multiplier (ADMM) will be applied for fast optimization. Simulation and three real applications in breast cancer and leukemia will be used to compare is-K means with existing methods and demonstrate its superior clustering accuracy, feature selection, functional annotation of detected molecular features and computing efficiency. PMID:28959370

  12. Integrative Sparse K-Means With Overlapping Group Lasso in Genomic Applications for Disease Subtype Discovery.

    PubMed

    Huo, Zhiguang; Tseng, George

    2017-06-01

    Cancer subtypes discovery is the first step to deliver personalized medicine to cancer patients. With the accumulation of massive multi-level omics datasets and established biological knowledge databases, omics data integration with incorporation of rich existing biological knowledge is essential for deciphering a biological mechanism behind the complex diseases. In this manuscript, we propose an integrative sparse K -means (is- K means) approach to discover disease subtypes with the guidance of prior biological knowledge via sparse overlapping group lasso. An algorithm using an alternating direction method of multiplier (ADMM) will be applied for fast optimization. Simulation and three real applications in breast cancer and leukemia will be used to compare is- K means with existing methods and demonstrate its superior clustering accuracy, feature selection, functional annotation of detected molecular features and computing efficiency.

  13. The -omics Era- Toward a Systems-Level Understanding of Streptomyces

    PubMed Central

    Zhou, Zhan; Gu, Jianying; Du, Yi-Ling; Li, Yong-Quan; Wang, Yufeng

    2011-01-01

    Streptomyces is a group of soil bacteria of medicinal, economic, ecological, and industrial importance. It is renowned for its complex biology in gene regulation, antibiotic production, morphological differentiation, and stress response. In this review, we provide an overview of the recent advances in Streptomyces biology inspired by -omics based high throughput technologies. In this post-genomic era, vast amounts of data have been integrated to provide significant new insights into the fundamental mechanisms of system control and regulation dynamics of Streptomyces. PMID:22379394

  14. Automatic Tool for Local Assembly Structures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Whole community shotgun sequencing of total DNA (i.e. metagenomics) and total RNA (i.e. metatranscriptomics) has provided a wealth of information in the microbial community structure, predicted functions, metabolic networks, and is even able to reconstruct complete genomes directly. Here we present ATLAS (Automatic Tool for Local Assembly Structures) a comprehensive pipeline for assembly, annotation, genomic binning of metagenomic and metatranscriptomic data with an integrated framework for Multi-Omics. This will provide an open source tool for the Multi-Omic community at large.

  15. UniVIO: A Multiple Omics Database with Hormonome and Transcriptome Data from Rice

    PubMed Central

    Sakurai, Tetsuya; Sakakibara, Hitoshi

    2013-01-01

    Plant hormones play important roles as signaling molecules in the regulation of growth and development by controlling the expression of downstream genes. Since the hormone signaling system represents a complex network involving functional cross-talk through the mutual regulation of signaling and metabolism, a comprehensive and integrative analysis of plant hormone concentrations and gene expression is important for a deeper understanding of hormone actions. We have developed a database named Uniformed Viewer for Integrated Omics (UniVIO: http://univio.psc.riken.jp/), which displays hormone-metabolome (hormonome) and transcriptome data in a single formatted (uniformed) heat map. At the present time, hormonome and transcriptome data obtained from 14 organ parts of rice plants at the reproductive stage and seedling shoots of three gibberellin signaling mutants are included in the database. The hormone concentration and gene expression data can be searched by substance name, probe ID, gene locus ID or gene description. A correlation search function has been implemented to enable users to obtain information of correlated substance accumulation and gene expression. In the correlation search, calculation method, range of correlation coefficient and plant samples can be selected freely. PMID:23314752

  16. RepExplore: addressing technical replicate variance in proteomics and metabolomics data analysis.

    PubMed

    Glaab, Enrico; Schneider, Reinhard

    2015-07-01

    High-throughput omics datasets often contain technical replicates included to account for technical sources of noise in the measurement process. Although summarizing these replicate measurements by using robust averages may help to reduce the influence of noise on downstream data analysis, the information on the variance across the replicate measurements is lost in the averaging process and therefore typically disregarded in subsequent statistical analyses.We introduce RepExplore, a web-service dedicated to exploit the information captured in the technical replicate variance to provide more reliable and informative differential expression and abundance statistics for omics datasets. The software builds on previously published statistical methods, which have been applied successfully to biomedical omics data but are difficult to use without prior experience in programming or scripting. RepExplore facilitates the analysis by providing a fully automated data processing and interactive ranking tables, whisker plot, heat map and principal component analysis visualizations to interpret omics data and derived statistics. Freely available at http://www.repexplore.tk enrico.glaab@uni.lu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  17. Biogeochemical Typing of Paddy Field by a Data-Driven Approach Revealing Sub-Systems within a Complex Environment - A Pipeline to Filtrate, Organize and Frame Massive Dataset from Multi-Omics Analyses

    PubMed Central

    Ogawa, Diogo M. O.; Moriya, Shigeharu; Tsuboi, Yuuri; Date, Yasuhiro; Prieto-da-Silva, Álvaro R. B.; Rádis-Baptista, Gandhi; Yamane, Tetsuo; Kikuchi, Jun

    2014-01-01

    We propose the technique of biogeochemical typing (BGC typing) as a novel methodology to set forth the sub-systems of organismal communities associated to the correlated chemical profiles working within a larger complex environment. Given the intricate characteristic of both organismal and chemical consortia inherent to the nature, many environmental studies employ the holistic approach of multi-omics analyses undermining as much information as possible. Due to the massive amount of data produced applying multi-omics analyses, the results are hard to visualize and to process. The BGC typing analysis is a pipeline built using integrative statistical analysis that can treat such huge datasets filtering, organizing and framing the information based on the strength of the various mutual trends of the organismal and chemical fluctuations occurring simultaneously in the environment. To test our technique of BGC typing, we choose a rich environment abounding in chemical nutrients and organismal diversity: the surficial freshwater from Japanese paddy fields and surrounding waters. To identify the community consortia profile we employed metagenomics as high throughput sequencing (HTS) for the fragments amplified from Archaea rRNA, universal 16S rRNA and 18S rRNA; to assess the elemental content we employed ionomics by inductively coupled plasma optical emission spectroscopy (ICP-OES); and for the organic chemical profile, metabolomics employing both Fourier transformed infrared (FT-IR) spectroscopy and proton nuclear magnetic resonance (1H-NMR) all these analyses comprised our multi-omics dataset. The similar trends between the community consortia against the chemical profiles were connected through correlation. The result was then filtered, organized and framed according to correlation strengths and peculiarities. The output gave us four BGC types displaying uniqueness in community and chemical distribution, diversity and richness. We conclude therefore that the BGC typing is a successful technique for elucidating the sub-systems of organismal communities with associated chemical profiles in complex ecosystems. PMID:25330259

  18. Biogeochemical typing of paddy field by a data-driven approach revealing sub-systems within a complex environment--a pipeline to filtrate, organize and frame massive dataset from multi-omics analyses.

    PubMed

    Ogawa, Diogo M O; Moriya, Shigeharu; Tsuboi, Yuuri; Date, Yasuhiro; Prieto-da-Silva, Álvaro R B; Rádis-Baptista, Gandhi; Yamane, Tetsuo; Kikuchi, Jun

    2014-01-01

    We propose the technique of biogeochemical typing (BGC typing) as a novel methodology to set forth the sub-systems of organismal communities associated to the correlated chemical profiles working within a larger complex environment. Given the intricate characteristic of both organismal and chemical consortia inherent to the nature, many environmental studies employ the holistic approach of multi-omics analyses undermining as much information as possible. Due to the massive amount of data produced applying multi-omics analyses, the results are hard to visualize and to process. The BGC typing analysis is a pipeline built using integrative statistical analysis that can treat such huge datasets filtering, organizing and framing the information based on the strength of the various mutual trends of the organismal and chemical fluctuations occurring simultaneously in the environment. To test our technique of BGC typing, we choose a rich environment abounding in chemical nutrients and organismal diversity: the surficial freshwater from Japanese paddy fields and surrounding waters. To identify the community consortia profile we employed metagenomics as high throughput sequencing (HTS) for the fragments amplified from Archaea rRNA, universal 16S rRNA and 18S rRNA; to assess the elemental content we employed ionomics by inductively coupled plasma optical emission spectroscopy (ICP-OES); and for the organic chemical profile, metabolomics employing both Fourier transformed infrared (FT-IR) spectroscopy and proton nuclear magnetic resonance (1H-NMR) all these analyses comprised our multi-omics dataset. The similar trends between the community consortia against the chemical profiles were connected through correlation. The result was then filtered, organized and framed according to correlation strengths and peculiarities. The output gave us four BGC types displaying uniqueness in community and chemical distribution, diversity and richness. We conclude therefore that the BGC typing is a successful technique for elucidating the sub-systems of organismal communities with associated chemical profiles in complex ecosystems.

  19. Traditional Chinese medicine ZHENG and Omics convergence: a systems approach to post-genomics medicine in a global world.

    PubMed

    Wang, Peng; Chen, Zhen

    2013-09-01

    Traditional Chinese medicine (TCM) is a comprehensive system of medical practice that has been used to diagnose, treat, and prevent illnesses for more than 3000 years. ZHENG (also known as "syndrome") differentiation remains the essence of TCM. In China, TCM shares equal status, and integrated with Western medicine in the healthcare system to treat many types of diseases. Yet, compared to biomolecular science and Western medicine, the ZHENG/TCM approach to diagnostics might appear unobjective, but offers at the same time long-standing clinical and phenotypic-rich insights. With the current globalization of life sciences and the arrival of "Big Data" research and development, these two silos of medical lore are rapidly coalescing. The applications of multi-omics strategies to TCM have begun to provide novel insights into the essence and molecular basis of TCM ZHENG. We searched the Chinese electronic databases and PubMed for published articles related to "Omics" and "TCM ZHENG" and observed a dramatic increase in studies over the past few years. In this article, we provide a timely synthesis of the lessons learned, and the emerging applications of omics science in TCM ZHENG research. We suggest that the global health scholarship and the field of "developing world Omics" can usefully draw from TCM, and vice versa.

  20. Mergeomics: a web server for identifying pathological pathways, networks, and key regulators via multidimensional data integration.

    PubMed

    Arneson, Douglas; Bhattacharya, Anindya; Shu, Le; Mäkinen, Ville-Petteri; Yang, Xia

    2016-09-09

    Human diseases are commonly the result of multidimensional changes at molecular, cellular, and systemic levels. Recent advances in genomic technologies have enabled an outpour of omics datasets that capture these changes. However, separate analyses of these various data only provide fragmented understanding and do not capture the holistic view of disease mechanisms. To meet the urgent needs for tools that effectively integrate multiple types of omics data to derive biological insights, we have developed Mergeomics, a computational pipeline that integrates multidimensional disease association data with functional genomics and molecular networks to retrieve biological pathways, gene networks, and central regulators critical for disease development. To make the Mergeomics pipeline available to a wider research community, we have implemented an online, user-friendly web server ( http://mergeomics. idre.ucla.edu/ ). The web server features a modular implementation of the Mergeomics pipeline with detailed tutorials. Additionally, it provides curated genomic resources including tissue-specific expression quantitative trait loci, ENCODE functional annotations, biological pathways, and molecular networks, and offers interactive visualization of analytical results. Multiple computational tools including Marker Dependency Filtering (MDF), Marker Set Enrichment Analysis (MSEA), Meta-MSEA, and Weighted Key Driver Analysis (wKDA) can be used separately or in flexible combinations. User-defined summary-level genomic association datasets (e.g., genetic, transcriptomic, epigenomic) related to a particular disease or phenotype can be uploaded and computed real-time to yield biologically interpretable results, which can be viewed online and downloaded for later use. Our Mergeomics web server offers researchers flexible and user-friendly tools to facilitate integration of multidimensional data into holistic views of disease mechanisms in the form of tissue-specific key regulators, biological pathways, and gene networks.

  1. Incorporating zebrafish omics into chemical biology and toxicology.

    PubMed

    Sukardi, Hendrian; Ung, Choong Yong; Gong, Zhiyuan; Lam, Siew Hong

    2010-03-01

    In this communication, we describe the general aspects of omics approaches for analyses of transcriptome, proteome, and metabolome, and how they can be strategically incorporated into chemical screening and perturbation studies using the zebrafish system. Pharmacological efficacy and selectivity of chemicals can be evaluated based on chemical-induced phenotypic effects; however, phenotypic observation has limitations in identifying mechanistic action of chemicals. We suggest adapting gene-expression-based high-throughput screening as a complementary strategy to zebrafish-phenotype-based screening for mechanistic insights about the mode of action and toxicity of a chemical, large-scale predictive applications and comparative analysis of chemical-induced omics signatures, which are useful to identify conserved biological responses, signaling pathways, and biomarkers. The potential mechanistic, predictive, and comparative applications of omics approaches can be implemented in the zebrafish system. Examples of these using the omics approaches in zebrafish, including data of ours and others, are presented and discussed. Omics also facilitates the translatability of zebrafish studies across species through comparison of conserved chemical-induced responses. This review is intended to update interested readers with the current omics approaches that have been applied in chemical studies on zebrafish and their potential in enhancing discovery in chemical biology.

  2. Repression of Septin9 and Septin2 suppresses tumor growth of human glioblastoma cells.

    PubMed

    Xu, Dongchao; Liu, Ajuan; Wang, Xuan; Chen, Yidan; Shen, Yunyun; Tan, Zhou; Qiu, Mengsheng

    2018-05-01

    Glioblastoma (GBM) is the most common primary malignancy of the central nervous system (CNS) with <10% 5-year survival rate. The growth and invasion of GBM cells into normal brain make the resection and treatment difficult. A better understanding of the biology of GBM cells is crucial to the targeted therapies for the disease. In this study, we identified Septin9 (SEPT9) and Septin2 (SEPT2) as GBM-related genes through integrated multi-omics analysis across independent transcriptomic and proteomic studies. Further studies revealed that expression of SEPT9 and SEPT2 was elevated in glioma tissues and cell lines (A172, U87-MG). Knockdown of SEPT9 and SEPT2 in A172/U87-MG was able to inhibit GBM cell proliferation and arrest cell cycle progression in the S phase in a synergistic mechanism. Moreover, suppression of SEPT9 and SEPT2 decreased the GBM cell invasive capability and significantly impaired the growth of glioma xenografts in nude mice. Furthermore, the decrease in GBM cell growth caused by SEPT9 and SEPT2 RNAi appears to involve two parallel signaling pathway including the p53/p21 axis and MEK/ERK activation. Together, our integration of multi-omics analysis has revealed previously unrecognized synergistic role of SEPT9 and SEPT2 in GBM, and provided novel insights into the targeted therapy of GBM.

  3. Integrated multi-omics analysis supports role of lysophosphatidylcholine and related glycerophospholipids in the Lotus japonicus-Glomus intraradices mycorrhizal symbiosis.

    PubMed

    Vijayakumar, Vinod; Liebisch, Gerhard; Buer, Benjamin; Xue, Li; Gerlach, Nina; Blau, Samira; Schmitz, Jessica; Bucher, Marcel

    2016-02-01

    Interaction of plant roots with arbuscular mycorrhizal fungi (AMF) is a complex trait resulting in cooperative interactions among the two symbionts including bidirectional exchange of resources. To study arbuscular mycorrhizal symbiosis (AMS) trait variation in the model plant Lotus japonicus, we performed an integrated multi-omics analysis with a focus on plant and fungal phospholipid (PL) metabolism and biological significance of lysophosphatidylcholine (LPC). Our results support the role of LPC as a bioactive compound eliciting cellular and molecular response mechanisms in Lotus. Evidence is provided for large interspecific chemical diversity of LPC species among mycorrhizae with related AMF species. Lipid, gene expression and elemental profiling emphasize the Lotus-Glomus intraradices interaction as distinct from other arbuscular mycorrhizal (AM) interactions. In G. intraradices, genes involved in fatty acid (FA) elongation and biosynthesis of unsaturated FAs were enhanced, while in Lotus, FA synthesis genes were up-regulated during AMS. Furthermore, FAS protein localization to mitochondria suggests FA biosynthesis and elongation may also occur in AMF. Our results suggest the existence of interspecific partitioning of PL resources for generation of LPC and novel candidate bioactive PLs in the Lotus-G. intraradices symbiosis. Moreover, the data advocate research with phylogenetically diverse Glomeromycota species for a broader understanding of the molecular underpinnings of AMS. © 2015 John Wiley & Sons Ltd.

  4. ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding.

    PubMed

    Guhlin, Joseph; Silverstein, Kevin A T; Zhou, Peng; Tiffin, Peter; Young, Nevin D

    2017-08-10

    Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries.

  5. Metabolic Model-Based Integration of Microbiome Taxonomic and Metabolomic Profiles Elucidates Mechanistic Links between Ecological and Metabolic Variation.

    PubMed

    Noecker, Cecilia; Eng, Alexander; Srinivasan, Sujatha; Theriot, Casey M; Young, Vincent B; Jansson, Janet K; Fredricks, David N; Borenstein, Elhanan

    2016-01-01

    Multiple molecular assays now enable high-throughput profiling of the ecology, metabolic capacity, and activity of the human microbiome. However, to date, analyses of such multi-omic data typically focus on statistical associations, often ignoring extensive prior knowledge of the mechanisms linking these various facets of the microbiome. Here, we introduce a comprehensive framework to systematically link variation in metabolomic data with community composition by utilizing taxonomic, genomic, and metabolic information. Specifically, we integrate available and inferred genomic data, metabolic network modeling, and a method for predicting community-wide metabolite turnover to estimate the biosynthetic and degradation potential of a given community. Our framework then compares variation in predicted metabolic potential with variation in measured metabolites' abundances to evaluate whether community composition can explain observed shifts in the community metabolome, and to identify key taxa and genes contributing to the shifts. Focusing on two independent vaginal microbiome data sets, each pairing 16S community profiling with large-scale metabolomics, we demonstrate that our framework successfully recapitulates observed variation in 37% of metabolites. Well-predicted metabolite variation tends to result from disease-associated metabolism. We further identify several disease-enriched species that contribute significantly to these predictions. Interestingly, our analysis also detects metabolites for which the predicted variation negatively correlates with the measured variation, suggesting environmental control points of community metabolism. Applying this framework to gut microbiome data sets reveals similar trends, including prediction of bile acid metabolite shifts. This framework is an important first step toward a system-level multi-omic integration and an improved mechanistic understanding of the microbiome activity and dynamics in health and disease. Studies characterizing both the taxonomic composition and metabolic profile of various microbial communities are becoming increasingly common, yet new computational methods are needed to integrate and interpret these data in terms of known biological mechanisms. Here, we introduce an analytical framework to link species composition and metabolite measurements, using a simple model to predict the effects of community ecology on metabolite concentrations and evaluating whether these predictions agree with measured metabolomic profiles. We find that a surprisingly large proportion of metabolite variation in the vaginal microbiome can be predicted based on species composition (including dramatic shifts associated with disease), identify putative mechanisms underlying these predictions, and evaluate the roles of individual bacterial species and genes. Analysis of gut microbiome data using this framework recovers similar community metabolic trends. This framework lays the foundation for model-based multi-omic integrative studies, ultimately improving our understanding of microbial community metabolism.

  6. Metabolic Model-Based Integration of Microbiome Taxonomic and Metabolomic Profiles Elucidates Mechanistic Links between Ecological and Metabolic Variation

    PubMed Central

    Noecker, Cecilia; Eng, Alexander; Srinivasan, Sujatha; Theriot, Casey M.; Young, Vincent B.; Jansson, Janet K.; Fredricks, David N.

    2016-01-01

    ABSTRACT Multiple molecular assays now enable high-throughput profiling of the ecology, metabolic capacity, and activity of the human microbiome. However, to date, analyses of such multi-omic data typically focus on statistical associations, often ignoring extensive prior knowledge of the mechanisms linking these various facets of the microbiome. Here, we introduce a comprehensive framework to systematically link variation in metabolomic data with community composition by utilizing taxonomic, genomic, and metabolic information. Specifically, we integrate available and inferred genomic data, metabolic network modeling, and a method for predicting community-wide metabolite turnover to estimate the biosynthetic and degradation potential of a given community. Our framework then compares variation in predicted metabolic potential with variation in measured metabolites’ abundances to evaluate whether community composition can explain observed shifts in the community metabolome, and to identify key taxa and genes contributing to the shifts. Focusing on two independent vaginal microbiome data sets, each pairing 16S community profiling with large-scale metabolomics, we demonstrate that our framework successfully recapitulates observed variation in 37% of metabolites. Well-predicted metabolite variation tends to result from disease-associated metabolism. We further identify several disease-enriched species that contribute significantly to these predictions. Interestingly, our analysis also detects metabolites for which the predicted variation negatively correlates with the measured variation, suggesting environmental control points of community metabolism. Applying this framework to gut microbiome data sets reveals similar trends, including prediction of bile acid metabolite shifts. This framework is an important first step toward a system-level multi-omic integration and an improved mechanistic understanding of the microbiome activity and dynamics in health and disease. IMPORTANCE Studies characterizing both the taxonomic composition and metabolic profile of various microbial communities are becoming increasingly common, yet new computational methods are needed to integrate and interpret these data in terms of known biological mechanisms. Here, we introduce an analytical framework to link species composition and metabolite measurements, using a simple model to predict the effects of community ecology on metabolite concentrations and evaluating whether these predictions agree with measured metabolomic profiles. We find that a surprisingly large proportion of metabolite variation in the vaginal microbiome can be predicted based on species composition (including dramatic shifts associated with disease), identify putative mechanisms underlying these predictions, and evaluate the roles of individual bacterial species and genes. Analysis of gut microbiome data using this framework recovers similar community metabolic trends. This framework lays the foundation for model-based multi-omic integrative studies, ultimately improving our understanding of microbial community metabolism. PMID:27239563

  7. Mastitomics, the integrated omics of bovine milk in an experimental model of Streptococcus uberis mastitis: 2. Label-free relative quantitative proteomics† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c6mb00290k Click here for additional data file.

    PubMed Central

    Mudaliar, Manikhandan; Tassi, Riccardo; Thomas, Funmilola C.; McNeilly, Tom N.; Weidt, Stefan K.; McLaughlin, Mark; Wilson, David; Burchmore, Richard; Herzyk, Pawel; Eckersall, P. David

    2016-01-01

    Mastitis, inflammation of the mammary gland, is the most common and costly disease of dairy cattle in the western world. It is primarily caused by bacteria, with Streptococcus uberis as one of the most prevalent causative agents. To characterize the proteome during Streptococcus uberis mastitis, an experimentally induced model of intramammary infection was used. Milk whey samples obtained from 6 cows at 6 time points were processed using label-free relative quantitative proteomics. This proteomic analysis complements clinical, bacteriological and immunological studies as well as peptidomic and metabolomic analysis of the same challenge model. A total of 2552 non-redundant bovine peptides were identified, and from these, 570 bovine proteins were quantified. Hierarchical cluster analysis and principal component analysis showed clear clustering of results by stage of infection, with similarities between pre-infection and resolution stages (0 and 312 h post challenge), early infection stages (36 and 42 h post challenge) and late infection stages (57 and 81 h post challenge). Ingenuity pathway analysis identified upregulation of acute phase protein pathways over the course of infection, with dominance of different acute phase proteins at different time points based on differential expression analysis. Antimicrobial peptides, notably cathelicidins and peptidoglycan recognition protein, were upregulated at all time points post challenge and peaked at 57 h, which coincided with 10 000-fold decrease in average bacterial counts. The integration of clinical, bacteriological, immunological and quantitative proteomics and other-omic data provides a more detailed systems level view of the host response to mastitis than has been achieved previously. PMID:27412694

  8. Integrated multi-omic analyses in Biomphalaria-Schistosoma dialogue reveal the immunobiological significance of FREP-SmPoMuc interaction.

    PubMed

    Portet, Anaïs; Pinaud, Silvain; Tetreau, Guillaume; Galinier, Richard; Cosseau, Céline; Duval, David; Grunau, Christoph; Mitta, Guillaume; Gourbal, Benjamin

    2017-10-01

    The fresh water snail Biomphalaria glabrata is one of the vectors of the trematode pathogen Schistosoma mansoni, which is one of the agents responsible of human schistosomiasis. In this host-parasite interaction, co-evolutionary dynamic results into an infectivity mosaic known as compatibility polymorphism. Integrative approaches including large scale molecular approaches have been conducted in recent years to improve our understanding of the mechanisms underlying compatibility. This review presents the combination of integrated Multi-Omic approaches leading to the discovery of two repertoires of polymorphic and/or diversified interacting molecules: the parasite antigens S. mansoni polymorphic mucins (SmPoMucs) and the B. glabrata immune receptors fibrinogen-related proteins (FREPs). We argue that their interactions may be major components for defining the compatible/incompatible status of a specific snail/schistosome combination. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Integrated Approaches to Drug Discovery for Oxidative Stress-Related Retinal Diseases.

    PubMed

    Nishimura, Yuhei; Hara, Hideaki

    2016-01-01

    Excessive oxidative stress induces dysregulation of functional networks in the retina, resulting in retinal diseases such as glaucoma, age-related macular degeneration, and diabetic retinopathy. Although various therapies have been developed to reduce oxidative stress in retinal diseases, most have failed to show efficacy in clinical trials. This may be due to oversimplification of target selection for such a complex network as oxidative stress. Recent advances in high-throughput technologies have facilitated the collection of multilevel omics data, which has driven growth in public databases and in the development of bioinformatics tools. Integration of the knowledge gained from omics databases can be used to generate disease-related biological networks and to identify potential therapeutic targets within the networks. Here, we provide an overview of integrative approaches in the drug discovery process and provide simple examples of how the approaches can be exploited to identify oxidative stress-related targets for retinal diseases.

  10. Integrated Approaches to Drug Discovery for Oxidative Stress-Related Retinal Diseases

    PubMed Central

    Hara, Hideaki

    2016-01-01

    Excessive oxidative stress induces dysregulation of functional networks in the retina, resulting in retinal diseases such as glaucoma, age-related macular degeneration, and diabetic retinopathy. Although various therapies have been developed to reduce oxidative stress in retinal diseases, most have failed to show efficacy in clinical trials. This may be due to oversimplification of target selection for such a complex network as oxidative stress. Recent advances in high-throughput technologies have facilitated the collection of multilevel omics data, which has driven growth in public databases and in the development of bioinformatics tools. Integration of the knowledge gained from omics databases can be used to generate disease-related biological networks and to identify potential therapeutic targets within the networks. Here, we provide an overview of integrative approaches in the drug discovery process and provide simple examples of how the approaches can be exploited to identify oxidative stress-related targets for retinal diseases. PMID:28053689

  11. Integrative Approaches to Understanding the Pathogenic Role of Genetic Variation in Rheumatic Diseases.

    PubMed

    Laufer, Vincent A; Chen, Jake Y; Langefeld, Carl D; Bridges, S Louis

    2017-08-01

    The use of high-throughput omics may help to understand the contribution of genetic variants to the pathogenesis of rheumatic diseases. We discuss the concept of missing heritability: that genetic variants do not explain the heritability of rheumatoid arthritis and related rheumatologic conditions. In addition to an overview of how integrative data analysis can lead to novel insights into mechanisms of rheumatic diseases, we describe statistical approaches to prioritizing genetic variants for future functional analyses. We illustrate how analyses of large datasets provide hope for improved approaches to the diagnosis, treatment, and prevention of rheumatic diseases. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Application of omics data in regulatory toxicology: report of an international BfR expert workshop.

    PubMed

    Marx-Stoelting, P; Braeuning, A; Buhrke, T; Lampen, A; Niemann, L; Oelgeschlaeger, M; Rieke, S; Schmidt, F; Heise, T; Pfeil, R; Solecki, R

    2015-11-01

    Advances in omics techniques and molecular toxicology are necessary to provide new perspectives for regulatory toxicology. By the application of modern molecular techniques, more mechanistic information should be gained to support standard toxicity studies and to contribute to a reduction and refinement of animal experiments required for certain regulatory purposes. The relevance and applicability of data obtained by omics methods to regulatory purposes such as grouping of chemicals, mode of action analysis or classification and labelling needs further improvement, defined validation and cautious expert judgment. Based on the results of an international expert workshop organized 2014 by the Federal Institute for Risk Assessment in Berlin, this paper is aimed to provide a critical overview of the regulatory relevance and reliability of omics methods, basic requirements on data quality and validation, as well as regulatory criteria to decide which effects observed by omics methods should be considered adverse or non-adverse. As a way forward, it was concluded that the inclusion of omics data can facilitate a more flexible approach for regulatory risk assessment and may help to reduce or refine animal testing.

  13. CPTAC Develops LinkedOmics – Public Web Portal to Analyze Multi-Omics Data Within and Across Cancer Types | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    Multi-omics analysis has grown in popularity among biomedical researchers given the comprehensive characterization of thousands of molecular attributes in addition to clinical attributes. Several data portals have been devised to make these datasets directly available to the cancer research community. However, none of the existing data portals allow systematic exploration and interpretation of the complex relationships between the vast amount of clinical and molecular attributes. CPTAC investigator Dr.

  14. Microarray platform for omics analysis

    NASA Astrophysics Data System (ADS)

    Mecklenburg, Michael; Xie, Bin

    2001-09-01

    Microarray technology has revolutionized genetic analysis. However, limitations in genome analysis has lead to renewed interest in establishing 'omic' strategies. As we enter the post-genomic era, new microarray technologies are needed to address these new classes of 'omic' targets, such as proteins, as well as lipids and carbohydrates. We have developed a microarray platform that combines self- assembling monolayers with the biotin-streptavidin system to provide a robust, versatile immobilization scheme. A hydrophobic film is patterned on the surface creating an array of tension wells that eliminates evaporation effects thereby reducing the shear stress to which biomolecules are exposed to during immobilization. The streptavidin linker layer makes it possible to adapt and/or develop microarray based assays using virtually any class of biomolecules including: carbohydrates, peptides, antibodies, receptors, as well as them ore traditional DNA based arrays. Our microarray technology is designed to furnish seamless compatibility across the various 'omic' platforms by providing a common blueprint for fabricating and analyzing arrays. The prototype microarray uses a microscope slide footprint patterned with 2 by 96 flat wells. Data on the microarray platform will be presented.

  15. Comprehensive two-dimensional gas chromatography and food sensory properties: potential and challenges.

    PubMed

    Cordero, Chiara; Kiefl, Johannes; Schieberle, Peter; Reichenbach, Stephen E; Bicchi, Carlo

    2015-01-01

    Modern omics disciplines dealing with food flavor focus the analytical efforts on the elucidation of sensory-active compounds, including all possible stimuli of multimodal perception (aroma, taste, texture, etc.) by means of a comprehensive, integrated treatment of sample constituents, such as physicochemical properties, concentration in the matrix, and sensory properties (odor/taste quality, perception threshold). Such analyses require detailed profiling of known bioactive components as well as advanced fingerprinting techniques to catalog sample constituents comprehensively, quantitatively, and comparably across samples. Multidimensional analytical platforms support comprehensive investigations required for flavor analysis by combining information on analytes' identities, physicochemical behaviors (volatility, polarity, partition coefficient, and solubility), concentration, and odor quality. Unlike other omics, flavor metabolomics and sensomics include the final output of the biological phenomenon (i.e., sensory perceptions) as an additional analytical dimension, which is specifically and exclusively triggered by the chemicals analyzed. However, advanced omics platforms, which are multidimensional by definition, pose challenging issues not only in terms of coupling with detection systems and sample preparation, but also in terms of data elaboration and processing. The large number of variables collected during each analytical run provides a high level of information, but requires appropriate strategies to exploit fully this potential. This review focuses on advances in comprehensive two-dimensional gas chromatography and analytical platforms combining two-dimensional gas chromatography with olfactometry, chemometrics, and quantitative assays for food sensory analysis to assess the quality of a given product. We review instrumental advances and couplings, automation in sample preparation, data elaboration, and a selection of applications.

  16. PANDA-view: An easy-to-use tool for statistical analysis and visualization of quantitative proteomics data.

    PubMed

    Chang, Cheng; Xu, Kaikun; Guo, Chaoping; Wang, Jinxia; Yan, Qi; Zhang, Jian; He, Fuchu; Zhu, Yunping

    2018-05-22

    Compared with the numerous software tools developed for identification and quantification of -omics data, there remains a lack of suitable tools for both downstream analysis and data visualization. To help researchers better understand the biological meanings in their -omics data, we present an easy-to-use tool, named PANDA-view, for both statistical analysis and visualization of quantitative proteomics data and other -omics data. PANDA-view contains various kinds of analysis methods such as normalization, missing value imputation, statistical tests, clustering and principal component analysis, as well as the most commonly-used data visualization methods including an interactive volcano plot. Additionally, it provides user-friendly interfaces for protein-peptide-spectrum representation of the quantitative proteomics data. PANDA-view is freely available at https://sourceforge.net/projects/panda-view/. 1987ccpacer@163.com and zhuyunping@gmail.com. Supplementary data are available at Bioinformatics online.

  17. [Alternatives to animal testing].

    PubMed

    Fabre, Isabelle

    2009-11-01

    The use of alternative methods to animal testing are an integral part of the 3Rs concept (refine, reduce, replace) defined by Russel & Burch in 1959. These approaches include in silico methods (databases and computer models), in vitro physicochemical analysis, biological methods using bacteria or isolated cells, reconstructed enzyme systems, and reconstructed tissues. Emerging "omic" methods used in integrated approaches further help to reduce animal use, while stem cells offer promising approaches to toxicologic and pathophysiologic studies, along with organotypic cultures and bio-artificial organs. Only a few alternative methods can so far be used in stand-alone tests as substitutes for animal testing. The best way to use these methods is to integrate them in tiered testing strategies (ITS), in which animals are only used as a last resort.

  18. Novel promoters and coding first exons in DLG2 linked to developmental disorders and intellectual disability.

    PubMed

    Reggiani, Claudio; Coppens, Sandra; Sekhara, Tayeb; Dimov, Ivan; Pichon, Bruno; Lufin, Nicolas; Addor, Marie-Claude; Belligni, Elga Fabia; Digilio, Maria Cristina; Faletra, Flavio; Ferrero, Giovanni Battista; Gerard, Marion; Isidor, Bertrand; Joss, Shelagh; Niel-Bütschi, Florence; Perrone, Maria Dolores; Petit, Florence; Renieri, Alessandra; Romana, Serge; Topa, Alexandra; Vermeesch, Joris Robert; Lenaerts, Tom; Casimir, Georges; Abramowicz, Marc; Bontempi, Gianluca; Vilain, Catheline; Deconinck, Nicolas; Smits, Guillaume

    2017-07-19

    Tissue-specific integrative omics has the potential to reveal new genic elements important for developmental disorders. Two pediatric patients with global developmental delay and intellectual disability phenotype underwent array-CGH genetic testing, both showing a partial deletion of the DLG2 gene. From independent human and murine omics datasets, we combined copy number variations, histone modifications, developmental tissue-specific regulation, and protein data to explore the molecular mechanism at play. Integrating genomics, transcriptomics, and epigenomics data, we describe two novel DLG2 promoters and coding first exons expressed in human fetal brain. Their murine conservation and protein-level evidence allowed us to produce new DLG2 gene models for human and mouse. These new genic elements are deleted in 90% of 29 patients (public and in-house) showing partial deletion of the DLG2 gene. The patients' clinical characteristics expand the neurodevelopmental phenotypic spectrum linked to DLG2 gene disruption to cognitive and behavioral categories. While protein-coding genes are regarded as well known, our work shows that integration of multiple omics datasets can unveil novel coding elements. From a clinical perspective, our work demonstrates that two new DLG2 promoters and exons are crucial for the neurodevelopmental phenotypes associated with this gene. In addition, our work brings evidence for the lack of cross-annotation in human versus mouse reference genomes and nucleotide versus protein databases.

  19. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.

    Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less

  20. Time to "go large" on biofilm research: advantages of an omics approach.

    PubMed

    Azevedo, Nuno F; Lopes, Susana P; Keevil, Charles W; Pereira, Maria O; Vieira, Maria J

    2009-04-01

    In nature, the biofilm mode of life is of great importance in the cell cycle for many microorganisms. Perhaps because of biofilm complexity and variability, the characterization of a given microbial system, in terms of biofilm formation potential, structure and associated physiological activity, in a large-scale, standardized and systematic manner has been hindered by the absence of high-throughput methods. This outlook is now starting to change as new methods involving the utilization of microtiter-plates and automated spectrophotometry and microscopy systems are being developed to perform large-scale testing of microbial biofilms. Here, we evaluate if the time is ripe to start an integrated omics approach, i.e., the generation and interrogation of large datasets, to biofilms--"biofomics". This omics approach would bring much needed insight into how biofilm formation ability is affected by a number of environmental, physiological and mutational factors and how these factors interplay between themselves in a standardized manner. This could then lead to the creation of a database where biofilm signatures are identified and interrogated. Nevertheless, and before embarking on such an enterprise, the selection of a versatile, robust, high-throughput biofilm growing device and of appropriate methods for biofilm analysis will have to be performed. Whether such device and analytical methods are already available, particularly for complex heterotrophic biofilms is, however, very debatable.

  1. Overview of 'Omics Technologies for Military Occupational Health Surveillance and Medicine.

    PubMed

    Bradburne, Christopher; Graham, David; Kingston, H M; Brenner, Ruth; Pamuku, Matt; Carruth, Lucy

    2015-10-01

    Systems biology ('omics) technologies are emerging as tools for the comprehensive analysis and monitoring of human health. In order for these tools to be used in military medicine, clinical sampling and biobanking will need to be optimized to be compatible with downstream processing and analysis for each class of molecule measured. This article provides an overview of 'omics technologies, including instrumentation, tools, and methods, and their potential application for warfighter exposure monitoring. We discuss the current state and the potential utility of personalized data from a variety of 'omics sources including genomics, epigenomics, transcriptomics, metabolomics, proteomics, lipidomics, and efforts to combine their use. Issues in the "sample-to-answer" workflow, including collection and biobanking are discussed, as well as national efforts for standardization and clinical interpretation. Establishment of these emerging capabilities, along with accurate xenobiotic monitoring, for the Department of Defense could provide new and effective tools for environmental health monitoring at all duty stations, including deployed locations. Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.

  2. Cross-Platform Toxicogenomics for the Prediction of Non-Genotoxic Hepatocarcinogenesis in Rat

    PubMed Central

    Metzger, Ute; Templin, Markus F.; Plummer, Simon; Ellinger-Ziegelbauer, Heidrun; Zell, Andreas

    2014-01-01

    In the area of omics profiling in toxicology, i.e. toxicogenomics, characteristic molecular profiles have previously been incorporated into prediction models for early assessment of a carcinogenic potential and mechanism-based classification of compounds. Traditionally, the biomarker signatures used for model construction were derived from individual high-throughput techniques, such as microarrays designed for monitoring global mRNA expression. In this study, we built predictive models by integrating omics data across complementary microarray platforms and introduced new concepts for modeling of pathway alterations and molecular interactions between multiple biological layers. We trained and evaluated diverse machine learning-based models, differing in the incorporated features and learning algorithms on a cross-omics dataset encompassing mRNA, miRNA, and protein expression profiles obtained from rat liver samples treated with a heterogeneous set of substances. Most of these compounds could be unambiguously classified as genotoxic carcinogens, non-genotoxic carcinogens, or non-hepatocarcinogens based on evidence from published studies. Since mixed characteristics were reported for the compounds Cyproterone acetate, Thioacetamide, and Wy-14643, we reclassified these compounds as either genotoxic or non-genotoxic carcinogens based on their molecular profiles. Evaluating our toxicogenomics models in a repeated external cross-validation procedure, we demonstrated that the prediction accuracy of our models could be increased by joining the biomarker signatures across multiple biological layers and by adding complex features derived from cross-platform integration of the omics data. Furthermore, we found that adding these features resulted in a better separation of the compound classes and a more confident reclassification of the three undefined compounds as non-genotoxic carcinogens. PMID:24830643

  3. Reverse engineering biomolecular systems using -omic data: challenges, progress and opportunities.

    PubMed

    Quo, Chang F; Kaddi, Chanchala; Phan, John H; Zollanvari, Amin; Xu, Mingqing; Wang, May D; Alterovitz, Gil

    2012-07-01

    Recent advances in high-throughput biotechnologies have led to the rapid growing research interest in reverse engineering of biomolecular systems (REBMS). 'Data-driven' approaches, i.e. data mining, can be used to extract patterns from large volumes of biochemical data at molecular-level resolution while 'design-driven' approaches, i.e. systems modeling, can be used to simulate emergent system properties. Consequently, both data- and design-driven approaches applied to -omic data may lead to novel insights in reverse engineering biological systems that could not be expected before using low-throughput platforms. However, there exist several challenges in this fast growing field of reverse engineering biomolecular systems: (i) to integrate heterogeneous biochemical data for data mining, (ii) to combine top-down and bottom-up approaches for systems modeling and (iii) to validate system models experimentally. In addition to reviewing progress made by the community and opportunities encountered in addressing these challenges, we explore the emerging field of synthetic biology, which is an exciting approach to validate and analyze theoretical system models directly through experimental synthesis, i.e. analysis-by-synthesis. The ultimate goal is to address the present and future challenges in reverse engineering biomolecular systems (REBMS) using integrated workflow of data mining, systems modeling and synthetic biology.

  4. Digital transplantation pathology: combining whole slide imaging, multiplex staining and automated image analysis.

    PubMed

    Isse, K; Lesniak, A; Grama, K; Roysam, B; Minervini, M I; Demetris, A J

    2012-01-01

    Conventional histopathology is the gold standard for allograft monitoring, but its value proposition is increasingly questioned. "-Omics" analysis of tissues, peripheral blood and fluids and targeted serologic studies provide mechanistic insights into allograft injury not currently provided by conventional histology. Microscopic biopsy analysis, however, provides valuable and unique information: (a) spatial-temporal relationships; (b) rare events/cells; (c) complex structural context; and (d) integration into a "systems" model. Nevertheless, except for immunostaining, no transformative advancements have "modernized" routine microscopy in over 100 years. Pathologists now team with hardware and software engineers to exploit remarkable developments in digital imaging, nanoparticle multiplex staining, and computational image analysis software to bridge the traditional histology-global "-omic" analyses gap. Included are side-by-side comparisons, objective biopsy finding quantification, multiplexing, automated image analysis, and electronic data and resource sharing. Current utilization for teaching, quality assurance, conferencing, consultations, research and clinical trials is evolving toward implementation for low-volume, high-complexity clinical services like transplantation pathology. Cost, complexities of implementation, fluid/evolving standards, and unsettled medical/legal and regulatory issues remain as challenges. Regardless, challenges will be overcome and these technologies will enable transplant pathologists to increase information extraction from tissue specimens and contribute to cross-platform biomarker discovery for improved outcomes. ©Copyright 2011 The American Society of Transplantation and the American Society of Transplant Surgeons.

  5. The Perseus computational platform for comprehensive analysis of (prote)omics data.

    PubMed

    Tyanova, Stefka; Temu, Tikira; Sinitcyn, Pavel; Carlson, Arthur; Hein, Marco Y; Geiger, Tamar; Mann, Matthias; Cox, Jürgen

    2016-09-01

    A main bottleneck in proteomics is the downstream biological analysis of highly multivariate quantitative protein abundance data generated using mass-spectrometry-based analysis. We developed the Perseus software platform (http://www.perseus-framework.org) to support biological and biomedical researchers in interpreting protein quantification, interaction and post-translational modification data. Perseus contains a comprehensive portfolio of statistical tools for high-dimensional omics data analysis covering normalization, pattern recognition, time-series analysis, cross-omics comparisons and multiple-hypothesis testing. A machine learning module supports the classification and validation of patient groups for diagnosis and prognosis, and it also detects predictive protein signatures. Central to Perseus is a user-friendly, interactive workflow environment that provides complete documentation of computational methods used in a publication. All activities in Perseus are realized as plugins, and users can extend the software by programming their own, which can be shared through a plugin store. We anticipate that Perseus's arsenal of algorithms and its intuitive usability will empower interdisciplinary analysis of complex large data sets.

  6. Challenges in Biomarker Discovery: Combining Expert Insights with Statistical Analysis of Complex Omics Data

    PubMed Central

    McDermott, Jason E.; Wang, Jing; Mitchell, Hugh; Webb-Robertson, Bobbie-Jo; Hafen, Ryan; Ramey, John; Rodland, Karin D.

    2012-01-01

    Introduction The advent of high throughput technologies capable of comprehensive analysis of genes, transcripts, proteins and other significant biological molecules has provided an unprecedented opportunity for the identification of molecular markers of disease processes. However, it has simultaneously complicated the problem of extracting meaningful molecular signatures of biological processes from these complex datasets. The process of biomarker discovery and characterization provides opportunities for more sophisticated approaches to integrating purely statistical and expert knowledge-based approaches. Areas covered In this review we will present examples of current practices for biomarker discovery from complex omic datasets and the challenges that have been encountered in deriving valid and useful signatures of disease. We will then present a high-level review of data-driven (statistical) and knowledge-based methods applied to biomarker discovery, highlighting some current efforts to combine the two distinct approaches. Expert opinion Effective, reproducible and objective tools for combining data-driven and knowledge-based approaches to identify predictive signatures of disease are key to future success in the biomarker field. We will describe our recommendations for possible approaches to this problem including metrics for the evaluation of biomarkers. PMID:23335946

  7. Challenges in Biomarker Discovery: Combining Expert Insights with Statistical Analysis of Complex Omics Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McDermott, Jason E.; Wang, Jing; Mitchell, Hugh D.

    2013-01-01

    The advent of high throughput technologies capable of comprehensive analysis of genes, transcripts, proteins and other significant biological molecules has provided an unprecedented opportunity for the identification of molecular markers of disease processes. However, it has simultaneously complicated the problem of extracting meaningful signatures of biological processes from these complex datasets. The process of biomarker discovery and characterization provides opportunities both for purely statistical and expert knowledge-based approaches and would benefit from improved integration of the two. Areas covered In this review we will present examples of current practices for biomarker discovery from complex omic datasets and the challenges thatmore » have been encountered. We will then present a high-level review of data-driven (statistical) and knowledge-based methods applied to biomarker discovery, highlighting some current efforts to combine the two distinct approaches. Expert opinion Effective, reproducible and objective tools for combining data-driven and knowledge-based approaches to biomarker discovery and characterization are key to future success in the biomarker field. We will describe our recommendations of possible approaches to this problem including metrics for the evaluation of biomarkers.« less

  8. Systems analysis of multiple regulator perturbations allows discovery of virulence factors in Salmonella

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yoon, Hyunjin; Ansong, Charles; McDermott, Jason E.

    Background: Systemic bacterial infections are highly regulated and complex processes that are orchestrated by numerous virulence factors. Genes that are coordinately controlled by the set of regulators required for systemic infection are potentially required for pathogenicity. Results: In this study we present a systems biology approach in which sample-matched multi-omic measurements of fourteen virulence-essential regulator mutants were coupled with computational network analysis to efficiently identify Salmonella virulence factors. Immunoblot experiments verified network-predicted virulence factors and a subset was determined to be secreted into the host cytoplasm, suggesting that they are virulence factors directly interacting with host cellular components. Two ofmore » these, SrfN and PagK2, were required for full mouse virulence and were shown to be translocated independent of either of the type III secretion systems in Salmonella or the type III injectisome-related flagellar mechanism. Conclusions: Integrating multi-omic datasets from Salmonella mutants lacking virulence regulators not only identified novel virulence factors but also defined a new class of translocated effectors involved in pathogenesis. The success of this strategy at discovery of known and novel virulence factors suggests that the approach may have applicability for other bacterial pathogens.« less

  9. Understanding and Designing the Strategies for the Microbe-Mediated Remediation of Environmental Contaminants Using Omics Approaches.

    PubMed

    Malla, Muneer A; Dubey, Anamika; Yadav, Shweta; Kumar, Ashwani; Hashem, Abeer; Abd Allah, Elsayed Fathi

    2018-01-01

    Rapid industrialization and population explosion has resulted in the generation and dumping of various contaminants into the environment. These harmful compounds deteriorate the human health as well as the surrounding environments. Current research aims to harness and enhance the natural ability of different microbes to metabolize these toxic compounds. Microbial-mediated bioremediation offers great potential to reinstate the contaminated environments in an ecologically acceptable approach. However, the lack of the knowledge regarding the factors controlling and regulating the growth, metabolism, and dynamics of diverse microbial communities in the contaminated environments often limits its execution. In recent years the importance of advanced tools such as genomics, proteomics, transcriptomics, metabolomics, and fluxomics has increased to design the strategies to treat these contaminants in ecofriendly manner. Previously researchers has largely focused on the environmental remediation using single omics-approach, however the present review specifically addresses the integrative role of the multi-omics approaches in microbial-mediated bioremediation. Additionally, we discussed how the multi-omics approaches help to comprehend and explore the structural and functional aspects of the microbial consortia in response to the different environmental pollutants and presented some success stories by using these approaches.

  10. Omics of the marine medaka (Oryzias melastigma) and its relevance to marine environmental research.

    PubMed

    Kim, Bo-Mi; Kim, Jaebum; Choi, Ik-Young; Raisuddin, Sheikh; Au, Doris W T; Leung, Kenneth M Y; Wu, Rudolf S S; Rhee, Jae-Sung; Lee, Jae-Seong

    2016-02-01

    In recent years, the marine medaka (Oryzias melastigma), also known as the Indian medaka or brackish medaka, has been recognized as a model fish species for ecotoxicology and environmental research in the Asian region. O. melastigma has several promising features for research, which include a short generation period (3-4 months), daily spawning, small size (3-4 cm), transparent embryos, sexual dimorphism, and ease of mass culture in the laboratory. There have been extensive transcriptome and genome studies on the marine medaka in the past decade. Such omics data can be useful in understanding the signal transduction pathways of small teleosts in response to environmental stressors. An omics-integrated approach in the study of the marine medaka is important for strengthening its role as a small fish model for marine environmental studies. In this review, we present current omics information about the marine medaka and discuss its potential applications in the study of various molecular pathways that can be targets of marine environmental stressors, such as chemical pollutants. We believe that this review will encourage the use of this small fish as a model species in marine environmental research. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. The Twins Study: NASA's First Foray into 21st Century Omics Research

    NASA Technical Reports Server (NTRS)

    Kundrot, C. E.; Shelhamer, M.; Scott, G. B. I.

    2015-01-01

    The full array of 21st century omics-based research methods should be intelligently employed to reduce the health and performance risks that astronauts will be exposed to during exploration missions beyond low Earth Orbit. In March of 2015, US Astronaut Scott Kelly will launch to the International Space Station for a one year mission while his twin brother, Mark Kelly, a retired US Astronaut, remains on the ground. This situation presents an extremely rare flight opportunity to perform an integrated omics-based demonstration pilot study involving identical twin astronauts. A group of 10 principal investigators has been competitively selected, funded, and teamed together to form the Twins Study. A very broad range of biological function are being examined including the genome, epigenome, transcriptome, proteome, metabolome, gut microbiome, immunological response to vaccinations, indicators of atherosclerosis, physiological fluid shifts, and cognition. The plans for the Twins Study and an overview of initial results will be described as well as the technological and ethical issues raised for such spaceflight studies. An anticipated outcome of the Twins Study is that it will place NASA on a trajectory of using omics-based information to develop precision countermeasures for individual astronauts.

  12. Understanding and Designing the Strategies for the Microbe-Mediated Remediation of Environmental Contaminants Using Omics Approaches

    PubMed Central

    Malla, Muneer A.; Dubey, Anamika; Yadav, Shweta; Kumar, Ashwani; Hashem, Abeer; Abd_Allah, Elsayed Fathi

    2018-01-01

    Rapid industrialization and population explosion has resulted in the generation and dumping of various contaminants into the environment. These harmful compounds deteriorate the human health as well as the surrounding environments. Current research aims to harness and enhance the natural ability of different microbes to metabolize these toxic compounds. Microbial-mediated bioremediation offers great potential to reinstate the contaminated environments in an ecologically acceptable approach. However, the lack of the knowledge regarding the factors controlling and regulating the growth, metabolism, and dynamics of diverse microbial communities in the contaminated environments often limits its execution. In recent years the importance of advanced tools such as genomics, proteomics, transcriptomics, metabolomics, and fluxomics has increased to design the strategies to treat these contaminants in ecofriendly manner. Previously researchers has largely focused on the environmental remediation using single omics-approach, however the present review specifically addresses the integrative role of the multi-omics approaches in microbial-mediated bioremediation. Additionally, we discussed how the multi-omics approaches help to comprehend and explore the structural and functional aspects of the microbial consortia in response to the different environmental pollutants and presented some success stories by using these approaches. PMID:29915565

  13. The emergence of molecular profiling and omics techniques in seagrass biology; furthering our understanding of seagrasses.

    PubMed

    Davey, Peter A; Pernice, Mathieu; Sablok, Gaurav; Larkum, Anthony; Lee, Huey Tyng; Golicz, Agnieszka; Edwards, David; Dolferus, Rudy; Ralph, Peter

    2016-09-01

    Seagrass meadows are disappearing at alarming rates as a result of increasing coastal development and climate change. The emergence of omics and molecular profiling techniques in seagrass research is timely, providing a new opportunity to address such global issues. Whilst these applications have transformed terrestrial plant research, they have only emerged in seagrass research within the past decade; In this time frame we have observed a significant increase in the number of publications in this nascent field, and as of this year the first genome of a seagrass species has been sequenced. In this review, we focus on the development of omics and molecular profiling and the utilization of molecular markers in the field of seagrass biology. We highlight the advances, merits and pitfalls associated with such technology, and importantly we identify and address the knowledge gaps, which to this day prevent us from understanding seagrasses in a holistic manner. By utilizing the powers of omics and molecular profiling technologies in integrated strategies, we will gain a better understanding of how these unique plants function at the molecular level and how they respond to on-going disturbance and climate change events.

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Noecker, Cecilia; Eng, Alexander; Srinivasan, Sujatha

    ABSTRACT Multiple molecular assays now enable high-throughput profiling of the ecology, metabolic capacity, and activity of the human microbiome. However, to date, analyses of such multi-omic data typically focus on statistical associations, often ignoring extensive prior knowledge of the mechanisms linking these various facets of the microbiome. Here, we introduce a comprehensive framework to systematically link variation in metabolomic data with community composition by utilizing taxonomic, genomic, and metabolic information. Specifically, we integrate available and inferred genomic data, metabolic network modeling, and a method for predicting community-wide metabolite turnover to estimate the biosynthetic and degradation potential of a given community.more » Our framework then compares variation in predicted metabolic potential with variation in measured metabolites’ abundances to evaluate whether community composition can explain observed shifts in the community metabolome, and to identify key taxa and genes contributing to the shifts. Focusing on two independent vaginal microbiome data sets, each pairing 16S community profiling with large-scale metabolomics, we demonstrate that our framework successfully recapitulates observed variation in 37% of metabolites. Well-predicted metabolite variation tends to result from disease-associated metabolism. We further identify several disease-enriched species that contribute significantly to these predictions. Interestingly, our analysis also detects metabolites for which the predicted variation negatively correlates with the measured variation, suggesting environmental control points of community metabolism. Applying this framework to gut microbiome data sets reveals similar trends, including prediction of bile acid metabolite shifts. This framework is an important first step toward a system-level multi-omic integration and an improved mechanistic understanding of the microbiome activity and dynamics in health and disease. IMPORTANCEStudies characterizing both the taxonomic composition and metabolic profile of various microbial communities are becoming increasingly common, yet new computational methods are needed to integrate and interpret these data in terms of known biological mechanisms. Here, we introduce an analytical framework to link species composition and metabolite measurements, using a simple model to predict the effects of community ecology on metabolite concentrations and evaluating whether these predictions agree with measured metabolomic profiles. We find that a surprisingly large proportion of metabolite variation in the vaginal microbiome can be predicted based on species composition (including dramatic shifts associated with disease), identify putative mechanisms underlying these predictions, and evaluate the roles of individual bacterial species and genes. Analysis of gut microbiome data using this framework recovers similar community metabolic trends. This framework lays the foundation for model-based multi-omic integrative studies, ultimately improving our understanding of microbial community metabolism.« less

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rawle, Rachel A.; Hamerly, Timothy; Tripet, Brian P.

    Studies of interspecies interactions are inherently difficult due to the complex mechanisms which enable these relationships. A model system for studying interspecies interactions is the marine hyperthermophiles Ignicoccus hospitalis and Nanoarchaeum equitans. Recent independently-conducted ‘omics’ analyses have generated insights into the molecular factors modulating this association. However, significant questions remain about the nature of the interactions between these archaea. We jointly analyzed multiple levels of omics datasets obtained from published, independent transcriptomics, proteomics, and metabolomics analyses. DAVID identified functionally-related groups enriched when I. hospitalis is grown alone or in co-culture with N. equitans. Enriched molecular pathways were subsequently visualized usingmore » interaction maps generated using STRING. Key findings of our multi-level omics analysis indicated that I. hospitalis provides precursors to N. equitans for energy metabolism. Analysis indicated an overall reduction in diversity of metabolic precursors in the I. hospitalis–N. equitans co-culture, which has been connected to the differential use of ribosomal subunits and was previously unnoticed. We also identified differences in precursors linked to amino acid metabolism, NADH metabolism, and carbon fixation, providing new insights into the metabolic adaptions of I. hospitalis enabling the growth of N. equitans. In conclusion, this multi-omics analysis builds upon previously identified cellular patterns while offering new insights into mechanisms that enable the I. hospitalis–N. equitans association. This study applies statistical and visualization techniques to a mixed-source omics dataset to yield a more global insight into a complex system, that was not readily discernable from separate omics studies.« less

  16. Evaluation and integration of functional annotation pipelines for newly sequenced organisms: the potato genome as a test case.

    PubMed

    Amar, David; Frades, Itziar; Danek, Agnieszka; Goldberg, Tatyana; Sharma, Sanjeev K; Hedley, Pete E; Proux-Wera, Estelle; Andreasson, Erik; Shamir, Ron; Tzfadia, Oren; Alexandersson, Erik

    2014-12-05

    For most organisms, even if their genome sequence is available, little functional information about individual genes or proteins exists. Several annotation pipelines have been developed for functional analysis based on sequence, 'omics', and literature data. However, researchers encounter little guidance on how well they perform. Here, we used the recently sequenced potato genome as a case study. The potato genome was selected since its genome is newly sequenced and it is a non-model plant even if there is relatively ample information on individual potato genes, and multiple gene expression profiles are available. We show that the automatic gene annotations of potato have low accuracy when compared to a "gold standard" based on experimentally validated potato genes. Furthermore, we evaluate six state-of-the-art annotation pipelines and show that their predictions are markedly dissimilar (Jaccard similarity coefficient of 0.27 between pipelines on average). To overcome this discrepancy, we introduce a simple GO structure-based algorithm that reconciles the predictions of the different pipelines. We show that the integrated annotation covers more genes, increases by over 50% the number of highly co-expressed GO processes, and obtains much higher agreement with the gold standard. We find that different annotation pipelines produce different results, and show how to integrate them into a unified annotation that is of higher quality than each single pipeline. We offer an improved functional annotation of both PGSC and ITAG potato gene models, as well as tools that can be applied to additional pipelines and improve annotation in other organisms. This will greatly aid future functional analysis of '-omics' datasets from potato and other organisms with newly sequenced genomes. The new potato annotations are available with this paper.

  17. Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data.

    PubMed

    Nguyen, Tin; Diaz, Diana; Tagett, Rebecca; Draghici, Sorin

    2016-07-12

    MicroRNAs (miRNAs) are small non-coding RNA molecules whose primary function is to regulate the expression of gene products via hybridization to mRNA transcripts, resulting in suppression of translation or mRNA degradation. Although miRNAs have been implicated in complex diseases, including cancer, their impact on distinct biological pathways and phenotypes is largely unknown. Current integration approaches require sample-matched miRNA/mRNA datasets, resulting in limited applicability in practice. Since these approaches cannot integrate heterogeneous information available across independent experiments, they neither account for bias inherent in individual studies, nor do they benefit from increased sample size. Here we present a novel framework able to integrate miRNA and mRNA data (vertical data integration) available in independent studies (horizontal meta-analysis) allowing for a comprehensive analysis of the given phenotypes. To demonstrate the utility of our method, we conducted a meta-analysis of pancreatic and colorectal cancer, using 1,471 samples from 15 mRNA and 14 miRNA expression datasets. Our two-dimensional data integration approach greatly increases the power of statistical analysis and correctly identifies pathways known to be implicated in the phenotypes. The proposed framework is sufficiently general to integrate other types of data obtained from high-throughput assays.

  18. A System Biology Perspective on Environment-Host-Microbe Interactions.

    PubMed

    Chen, Lianmin; Garmaeva, Sanzhima; Zherankova, Alexandra; Fu, Jingyuan; Wijmenga, Cisca

    2018-04-16

    A vast, complex and dynamic consortium of microorganisms known as the gut microbiome colonizes the human gut. Over the past few decades we have developed an increased awareness of its important role in human health. In this review we discuss the role of the gut microbiome in complex diseases and the possible causal scenarios behind its interactions with the host genome and environmental factors. We then propose a new analysis framework that combines a systems biology approach, cross-kingdom integration of multiple levels of omics data, and innovative in vitro models to yield an integrated picture of human host-microbe interactions. This new framework will lay the foundation for the development of the next phase in personalized medicine.

  19. Integrated analysis of miRNA and mRNA expression data identifies multiple miRNAs regulatory networks for the tumorigenesis of colorectal cancer.

    PubMed

    Xu, Peng; Wang, Junhua; Sun, Bo; Xiao, Zhongdang

    2018-06-15

    Investigating the potential biological function of differential changed genes through integrating multiple omics data including miRNA and mRNA expression profiles, is always hot topic. However, how to evaluate the repression effect on target genes integrating miRNA and mRNA expression profiles are not fully solved. In this study, we provide an analyzing method by integrating both miRNAs and mRNAs expression data simultaneously. Difference analysis was adopted based on the repression score, then significantly repressed mRNAs were screened out by DEGseq. Pathway analysis for the significantly repressed mRNAs shows that multiple pathways such as MAPK signaling pathway, TGF-beta signaling pathway and so on, may correlated to the colorectal cancer(CRC). Focusing on the MAPK signaling pathway, a miRNA-mRNA network that centering the cell fate genes was constructed. Finally, the miRNA-mRNAs that potentially important in the CRC carcinogenesis were screened out and scored by impact index. Copyright © 2018 Elsevier B.V. All rights reserved.

  20. Seeing the wood for the trees: a forest of methods for optimization and omic-network integration in metabolic modelling.

    PubMed

    Vijayakumar, Supreeta; Conway, Max; Lió, Pietro; Angione, Claudio

    2017-05-30

    Metabolic modelling has entered a mature phase with dozens of methods and software implementations available to the practitioner and the theoretician. It is not easy for a modeller to be able to see the wood (or the forest) for the trees. Driven by this analogy, we here present a 'forest' of principal methods used for constraint-based modelling in systems biology. This provides a tree-based view of methods available to prospective modellers, also available in interactive version at http://modellingmetabolism.net, where it will be kept updated with new methods after the publication of the present manuscript. Our updated classification of existing methods and tools highlights the most promising in the different branches, with the aim to develop a vision of how existing methods could hybridize and become more complex. We then provide the first hands-on tutorial for multi-objective optimization of metabolic models in R. We finally discuss the implementation of multi-view machine learning approaches in poly-omic integration. Throughout this work, we demonstrate the optimization of trade-offs between multiple metabolic objectives, with a focus on omic data integration through machine learning. We anticipate that the combination of a survey, a perspective on multi-view machine learning and a step-by-step R tutorial should be of interest for both the beginner and the advanced user. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  1. Developing a 'personalome' for precision medicine: emerging methods that compute interpretable effect sizes from single-subject transcriptomes.

    PubMed

    Vitali, Francesca; Li, Qike; Schissler, A Grant; Berghout, Joanne; Kenost, Colleen; Lussier, Yves A

    2017-12-18

    The development of computational methods capable of analyzing -omics data at the individual level is critical for the success of precision medicine. Although unprecedented opportunities now exist to gather data on an individual's -omics profile ('personalome'), interpreting and extracting meaningful information from single-subject -omics remain underdeveloped, particularly for quantitative non-sequence measurements, including complete transcriptome or proteome expression and metabolite abundance. Conventional bioinformatics approaches have largely been designed for making population-level inferences about 'average' disease processes; thus, they may not adequately capture and describe individual variability. Novel approaches intended to exploit a variety of -omics data are required for identifying individualized signals for meaningful interpretation. In this review-intended for biomedical researchers, computational biologists and bioinformaticians-we survey emerging computational and translational informatics methods capable of constructing a single subject's 'personalome' for predicting clinical outcomes or therapeutic responses, with an emphasis on methods that provide interpretable readouts. (i) the single-subject analytics of the transcriptome shows the greatest development to date and, (ii) the methods were all validated in simulations, cross-validations or independent retrospective data sets. This survey uncovers a growing field that offers numerous opportunities for the development of novel validation methods and opens the door for future studies focusing on the interpretation of comprehensive 'personalomes' through the integration of multiple -omics, providing valuable insights into individual patient outcomes and treatments. © The Author 2017. Published by Oxford University Press.

  2. Next generation sequencing and its applications in HPVassociated cancers

    PubMed Central

    Tuna, Musaffe; Amos, Christopher I.

    2017-01-01

    Approximately 18% of all human cancers have a viral etiology, and human papillomavirus (HPV) has been identified as one of the most prevalent viruses that plays causative role in nearly all cervical cancers and, in addition, in subset of head and neck, anal, penile and vulvar cancers. The recent introduction of next generation sequencing (NGS) and other omics approaches have resulted in comprehensive knowledge on the pathogenesis of HPV-driven tumors. Specifically, these approaches have provided detailed information on genomic HPV integration sites, disrupted genes and pathways, and common and distinct genetic and epigenetic alterations in different human HPV-associated cancers. This review focuses on HPV integration sites, its concomitantly disrupted genes and pathways and its functional consequences in both cervical and head and neck cancers. Integration of NGS data with other omics and clinical data is crucial to better understand the pathophysiology of each individual malignancy and, based on this, to select targets and to design effective personalized treatment options. PMID:27784002

  3. Next generation sequencing and its applications in HPV-associated cancers.

    PubMed

    Tuna, Musaffe; Amos, Christopher I

    2017-01-31

    Approximately 18% of all human cancers have a viral etiology, and human papillomavirus (HPV) has been identified as one of the most prevalent viruses that plays causative role in nearly all cervical cancers and, in addition, in subset of head and neck, anal, penile and vulvar cancers. The recent introduction of next generation sequencing (NGS) and other 'omics' approaches have resulted in comprehensive knowledge on the pathogenesis of HPV-driven tumors. Specifically, these approaches have provided detailed information on genomic HPV integration sites, disrupted genes and pathways, and common and distinct genetic and epigenetic alterations in different human HPV-associated cancers. This review focuses on HPV integration sites, its concomitantly disrupted genes and pathways and its functional consequences in both cervical and head and neck cancers. Integration of NGS data with other 'omics' and clinical data is crucial to better understand the pathophysiology of each individual malignancy and, based on this, to select targets and to design effective personalized treatment options.

  4. Integrating data from biological experiments into metabolic networks with the DBE information system.

    PubMed

    Borisjuk, Ljudmilla; Hajirezaei, Mohammad-Reza; Klukas, Christian; Rolletschek, Hardy; Schreiber, Falk

    2005-01-01

    Modern 'omics'-technologies result in huge amounts of data about life processes. For analysis and data mining purposes this data has to be considered in the context of the underlying biological networks. This work presents an approach for integrating data from biological experiments into metabolic networks by mapping the data onto network elements and visualising the data enriched networks automatically. This methodology is implemented in DBE, an information system that supports the analysis and visualisation of experimental data in the context of metabolic networks. It consists of five parts: (1) the DBE-Database for consistent data storage, (2) the Excel-Importer application for the data import, (3) the DBE-Website as the interface for the system, (4) the DBE-Pictures application for the up- and download of binary (e. g. image) files, and (5) DBE-Gravisto, a network analysis and graph visualisation system. The usability of this approach is demonstrated in two examples.

  5. Veterinary Medicine and Multi-Omics Research for Future Nutrition Targets: Metabolomics and Transcriptomics of the Common Degenerative Mitral Valve Disease in Dogs.

    PubMed

    Li, Qinghong; Freeman, Lisa M; Rush, John E; Huggins, Gordon S; Kennedy, Adam D; Labuda, Jeffrey A; Laflamme, Dorothy P; Hannah, Steven S

    2015-08-01

    Canine degenerative mitral valve disease (DMVD) is the most common form of heart disease in dogs. The objective of this study was to identify cellular and metabolic pathways that play a role in DMVD by performing metabolomics and transcriptomics analyses on serum and tissue (mitral valve and left ventricle) samples previously collected from dogs with DMVD or healthy hearts. Gas or liquid chromatography followed by mass spectrophotometry were used to identify metabolites in serum. Transcriptomics analysis of tissue samples was completed using RNA-seq, and selected targets were confirmed by RT-qPCR. Random Forest analysis was used to classify the metabolites that best predicted the presence of DMVD. Results identified 41 known and 13 unknown serum metabolites that were significantly different between healthy and DMVD dogs, representing alterations in fat and glucose energy metabolism, oxidative stress, and other pathways. The three metabolites with the greatest single effect in the Random Forest analysis were γ-glutamylmethionine, oxidized glutathione, and asymmetric dimethylarginine. Transcriptomics analysis identified 812 differentially expressed transcripts in left ventricle samples and 263 in mitral valve samples, representing changes in energy metabolism, antioxidant function, nitric oxide signaling, and extracellular matrix homeostasis pathways. Many of the identified alterations may benefit from nutritional or medical management. Our study provides evidence of the growing importance of integrative approaches in multi-omics research in veterinary and nutritional sciences.

  6. Principles of proteome allocation are revealed using proteomic data and genome-scale models

    PubMed Central

    Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; Ebrahim, Ali; Saunders, Michael A.; Palsson, Bernhard O.

    2016-01-01

    Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thus represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. This flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models. PMID:27857205

  7. Information Commons for Rice (IC4R)

    PubMed Central

    2016-01-01

    Rice is the most important staple food for a large part of the world's human population and also a key model organism for plant research. Here, we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase featuring adoption of an extensible and sustainable architecture that integrates multiple omics data through community-contributed modules. Each module is developed and maintained by different committed groups, deals with data collection, processing and visualization, and delivers data on-demand via web services. In the current version, IC4R incorporates a variety of rice data through multiple committed modules, including genome-wide expression profiles derived entirely from RNA-Seq data, resequencing-based genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures and gene annotations contributed by the rice research community. Unlike extant related databases, IC4R is designed for scalability and sustainability and thus also features collaborative integration of rice data and low costs for database update and maintenance. Future directions of IC4R include incorporation of other omics data and association of multiple omics data with agronomically important traits, dedicating to build IC4R into a valuable knowledgebase for both basic and translational researches in rice. PMID:26519466

  8. Principles of proteome allocation are revealed using proteomic data and genome-scale models

    DOE PAGES

    Yang, Laurence; Yurkovich, James T.; Lloyd, Colton J.; ...

    2016-11-18

    Integrating omics data to refine or make context-specific models is an active field of constraint-based modeling. Proteomics now cover over 95% of the Escherichia coli proteome by mass. Genome-scale models of Metabolism and macromolecular Expression (ME) compute proteome allocation linked to metabolism and fitness. Using proteomics data, we formulated allocation constraints for key proteome sectors in the ME model. The resulting calibrated model effectively computed the “generalist” (wild-type) E. coli proteome and phenotype across diverse growth environments. Across 15 growth conditions, prediction errors for growth rate and metabolic fluxes were 69% and 14% lower, respectively. The sector-constrained ME model thusmore » represents a generalist ME model reflecting both growth rate maximization and “hedging” against uncertain environments and stresses, as indicated by significant enrichment of these sectors for the general stress response sigma factor σS. Finally, the sector constraints represent a general formalism for integrating omics data from any experimental condition into constraint-based ME models. The constraints can be fine-grained (individual proteins) or coarse-grained (functionally-related protein groups) as demonstrated here. Furthermore, this flexible formalism provides an accessible approach for narrowing the gap between the complexity captured by omics data and governing principles of proteome allocation described by systems-level models.« less

  9. Big Data Analytics in Medicine and Healthcare.

    PubMed

    Ristevski, Blagoj; Chen, Ming

    2018-05-10

    This paper surveys big data with highlighting the big data analytics in medicine and healthcare. Big data characteristics: value, volume, velocity, variety, veracity and variability are described. Big data analytics in medicine and healthcare covers integration and analysis of large amount of complex heterogeneous data such as various - omics data (genomics, epigenomics, transcriptomics, proteomics, metabolomics, interactomics, pharmacogenomics, diseasomics), biomedical data and electronic health records data. We underline the challenging issues about big data privacy and security. Regarding big data characteristics, some directions of using suitable and promising open-source distributed data processing software platform are given.

  10. High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software.

    PubMed

    Fabregat-Traver, Diego; Sharapov, Sodbo Zh; Hayward, Caroline; Rudan, Igor; Campbell, Harry; Aulchenko, Yurii; Bientinesi, Paolo

    2014-01-01

    To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the 'omics' context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http: //www.genabel.org/packages/OmicABEL.

  11. GeneLab: A Systems Biology Platform for Spaceflight Omics Data

    NASA Technical Reports Server (NTRS)

    Reinsch, Sigrid S.; Lai, San-Huei; Chen, Rick; Thompson, Terri; Berrios, Daniel; Fogle, Homer; Marcu, Oana; Timucin, Linda; Chakravarty, Kaushik; Coughlan, Joseph

    2015-01-01

    NASA's mission includes expanding our understanding of biological systems to improve life on Earth and to enable long-duration human exploration of space. Resources to support large numbers of spaceflight investigations are limited. NASA's GeneLab project is maximizing the science output from these experiments by: (1) developing a unique public bioinformatics database that includes space bioscience relevant "omics" data (genomics, transcriptomics, proteomics, and metabolomics) and experimental metadata; (2) partnering with NASA-funded flight experiments through bio-sample sharing or sample augmentation to expedite omics data input to the GeneLab database; and (3) developing community-driven reference flight experiments. The first database, GeneLab Data System Version 1.0, went online in April 2015. V1.0 contains numerous flight datasets and has search and download capabilities. Version 2.0 will be released in 2016 and will link to analytic tools. In 2015 Genelab partnered with two Biological Research in Canisters experiments (BBRIC-19 and BRIC-20) which examine responses of Arabidopsis thaliana to spaceflight. GeneLab also partnered with Rodent Research-1 (RR1), the maiden flight to test the newly developed rodent habitat. GeneLab developed protocols for maxiumum yield of RNA, DNA and protein from precious RR-1 tissues harvested and preserved during the SpaceX-4 mission, as well as from tissues from mice that were frozen intact during spaceflight and later dissected. GeneLab is establishing partnerships with at least three planned flights for 2016. Organism-specific nationwide Science Definition Teams (SDTs) will define future GeneLab dedicated missions and ensure the broader scientific impact of the GeneLab missions. GeneLab ensures prompt release and open access to all high-throughput omics data from spaceflight and ground-based simulations of microgravity and radiation. Overall, GeneLab will facilitate the generation and query of parallel multi-omics data, and deep curation of metadata for integrative analysis, allowing researchers to uncover cellular networks as observed in systems biology platforms. Consequently, the scientific community will have access to a more complete picture of functional and regulatory networks responsive to the spaceflight environment.. Analysis of GeneLab data will contribute fundamental knowledge of how the space environment affects biological systems, and enable emerging terrestrial benefits resulting from mitigation strategies to prevent effects observed during exposure to space. As a result, open access to the data will foster new hypothesis-driven research for future spaceflight studies spanning basic science to translational science.

  12. Morphomics: An integral part of systems biology of the human placenta.

    PubMed

    Mayhew, T M

    2015-04-01

    The placenta is a transient organ the functioning of which has health consequences far beyond the embryo/fetus. Understanding the biology of any system (organ, organism, single cell, etc) requires a comprehensive and inclusive approach which embraces all the biomedical disciplines and 'omic' technologies and then integrates information obtained from all of them. Among the latest 'omics' is morphomics. The terms morphome and morphomics have been applied incoherently in biology and biomedicine but, recently, they have been given clear and widescale definitions. Morphomics is placed in the context of other 'omics' and its pertinent technologies and tools for sampling and quantitation are reviewed. Emphasis is accorded to the importance of random sampling principles in systems biology and the value of combining 3D quantification with alternative imaging techniques to advance knowledge and understanding of the human placental morphome. By analogy to other 'omes', the morphome is the totality of morphological features within a system and morphomics is the systematic study of those structures. Information about structure is required at multiple levels of resolution in order to understand better the processes by which a given system alters with time, experimental treatment or environmental insult. Therefore, morphomics research includes all imaging techniques at all levels of achievable resolution from gross anatomy and medical imaging, via optical and electron microscopy, to molecular characterisation. Quantification is an important element of all 'omics' studies and, because biological systems exist and operate in 3-dimensional (3D) space, precise descriptions of form, content and spatial relationships require the quantification of structure in 3D. These considerations are relevant to future study contributions to the Human Placenta Project. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. Gut Microbiota Dysbiosis as Risk and Premorbid Factors of IBD and IBS Along the Childhood-Adulthood Transition.

    PubMed

    Putignani, Lorenza; Del Chierico, Federica; Vernocchi, Pamela; Cicala, Michele; Cucchiara, Salvatore; Dallapiccola, Bruno

    2016-02-01

    Gastrointestinal disorders, although clinically heterogeneous, share pathogenic mechanisms, including genetic susceptibility, impaired gut barrier function, altered microbiota, and environmental triggers (infections, social and behavioral factors, epigenetic control, and diet). Gut microbiota has been studied for inflammatory bowel disease (IBD) and irritable bowel syndrome (IBS) in either children or adults, while modifiable gut microbiota features, acting as risk and premorbid factors along the childhood-adulthood transition, have not been thoroughly investigated so far. Indeed, the relationship between variations of the entire host/microbiota/environmental scenario and clinical phenotypes is still not fully understood. In this respect, tracking gut dysbiosis grading may help deciphering host phenotype-genotype associations and microbiota shifts in an integrated top-down omics-based approach within large-scale pediatric and adult case-control cohorts. Large-scale gut microbiota signatures and host inflammation patterns may be integrated with dietary habits, under genetic and epigenetic constraints, providing gut dysbiosis profiles acting as risk predictors of IBD or IBS in preclinical cases. Tracking dysbiosis supports new personalized/stratified IBD and IBS prevention programmes, generating Decision Support System tools. They include (1) high risk or flare-up recurrence -omics-based dysbiosis profiles; (2) microbial and molecular biomarkers of health and disease; (3) -omics-based pipelines for laboratory medicine diagnostics; (4) health apps for self-management of score-based dietary profiles, which can be shared with clinicians for nutritional habit and lifestyle amendment; (5) -omics profiling data warehousing and public repositories for IBD and IBS profile consultation. Dysbiosis-related indexes can represent novel laboratory and clinical medicine tools preventing or postponing the disease, finally interfering with its natural history.

  14. Integration of multi-omics techniques and physiological phenotyping within a holistic phenomics approach to study senescence in model and crop plants.

    PubMed

    Großkinsky, Dominik K; Syaifullah, Syahnada Jaya; Roitsch, Thomas

    2018-02-12

    The study of senescence in plants is complicated by diverse levels of temporal and spatial dynamics as well as the impact of external biotic and abiotic factors and crop plant management. Whereas the molecular mechanisms involved in developmentally regulated leaf senescence are very well understood, in particular in the annual model plant species Arabidopsis, senescence of other organs such as the flower, fruit, and root is much less studied as well as senescence in perennials such as trees. This review addresses the need for the integration of multi-omics techniques and physiological phenotyping into holistic phenomics approaches to dissect the complex phenomenon of senescence. That became feasible through major advances in the establishment of various, complementary 'omics' technologies. Such an interdisciplinary approach will also need to consider knowledge from the animal field, in particular in relation to novel regulators such as small, non-coding RNAs, epigenetic control and telomere length. Such a characterization of phenotypes via the acquisition of high-dimensional datasets within a systems biology approach will allow us to systematically characterize the various programmes governing senescence beyond leaf senescence in Arabidopsis and to elucidate the underlying molecular processes. Such a multi-omics approach is expected to also spur the application of results from model plants to agriculture and their verification for sustainable and environmentally friendly improvement of crop plant stress resilience and productivity and contribute to improvements based on postharvest physiology for the food industry and the benefit of its customers. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  15. A Syst-OMICS Approach to Ensuring Food Safety and Reducing the Economic Burden of Salmonellosis.

    PubMed

    Emond-Rheault, Jean-Guillaume; Jeukens, Julie; Freschi, Luca; Kukavica-Ibrulj, Irena; Boyle, Brian; Dupont, Marie-Josée; Colavecchio, Anna; Barrere, Virginie; Cadieux, Brigitte; Arya, Gitanjali; Bekal, Sadjia; Berry, Chrystal; Burnett, Elton; Cavestri, Camille; Chapin, Travis K; Crouse, Alanna; Daigle, France; Danyluk, Michelle D; Delaquis, Pascal; Dewar, Ken; Doualla-Bell, Florence; Fliss, Ismail; Fong, Karen; Fournier, Eric; Franz, Eelco; Garduno, Rafael; Gill, Alexander; Gruenheid, Samantha; Harris, Linda; Huang, Carol B; Huang, Hongsheng; Johnson, Roger; Joly, Yann; Kerhoas, Maud; Kong, Nguyet; Lapointe, Gisèle; Larivière, Line; Loignon, Stéphanie; Malo, Danielle; Moineau, Sylvain; Mottawea, Walid; Mukhopadhyay, Kakali; Nadon, Céline; Nash, John; Ngueng Feze, Ida; Ogunremi, Dele; Perets, Ann; Pilar, Ana V; Reimer, Aleisha R; Robertson, James; Rohde, John; Sanderson, Kenneth E; Song, Lingqiao; Stephan, Roger; Tamber, Sandeep; Thomassin, Paul; Tremblay, Denise; Usongo, Valentine; Vincent, Caroline; Wang, Siyun; Weadge, Joel T; Wiedmann, Martin; Wijnands, Lucas; Wilson, Emily D; Wittum, Thomas; Yoshida, Catherine; Youfsi, Khadija; Zhu, Lei; Weimer, Bart C; Goodridge, Lawrence; Levesque, Roger C

    2017-01-01

    The Salmonella Syst-OMICS consortium is sequencing 4,500 Salmonella genomes and building an analysis pipeline for the study of Salmonella genome evolution, antibiotic resistance and virulence genes. Metadata, including phenotypic as well as genomic data, for isolates of the collection are provided through the Salmonella Foodborne Syst-OMICS database (SalFoS), at https://salfos.ibis.ulaval.ca/. Here, we present our strategy and the analysis of the first 3,377 genomes. Our data will be used to draw potential links between strains found in fresh produce, humans, animals and the environment. The ultimate goals are to understand how Salmonella evolves over time, improve the accuracy of diagnostic methods, develop control methods in the field, and identify prognostic markers for evidence-based decisions in epidemiology and surveillance.

  16. Text Mining in Cancer Gene and Pathway Prioritization

    PubMed Central

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes. PMID:25392685

  17. Text mining in cancer gene and pathway prioritization.

    PubMed

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes.

  18. BiofOmics: a Web platform for the systematic and standardized collection of high-throughput biofilm data.

    PubMed

    Lourenço, Anália; Ferreira, Andreia; Veiga, Nuno; Machado, Idalina; Pereira, Maria Olivia; Azevedo, Nuno F

    2012-01-01

    Consortia of microorganisms, commonly known as biofilms, are attracting much attention from the scientific community due to their impact in human activity. As biofilm research grows to be a data-intensive discipline, the need for suitable bioinformatics approaches becomes compelling to manage and validate individual experiments, and also execute inter-laboratory large-scale comparisons. However, biofilm data is widespread across ad hoc, non-standardized individual files and, thus, data interchange among researchers, or any attempt of cross-laboratory experimentation or analysis, is hardly possible or even attempted. This paper presents BiofOmics, the first publicly accessible Web platform specialized in the management and analysis of data derived from biofilm high-throughput studies. The aim is to promote data interchange across laboratories, implementing collaborative experiments, and enable the development of bioinformatics tools in support of the processing and analysis of the increasing volumes of experimental biofilm data that are being generated. BiofOmics' data deposition facility enforces data structuring and standardization, supported by controlled vocabulary. Researchers are responsible for the description of the experiments, their results and conclusions. BiofOmics' curators interact with submitters only to enforce data structuring and the use of controlled vocabulary. Then, BiofOmics' search facility makes publicly available the profile and data associated with a submitted study so that any researcher can profit from these standardization efforts to compare similar studies, generate new hypotheses to be tested or even extend the conditions experimented in the study. BiofOmics' novelty lies in its support to standardized data deposition, the availability of computerizable data files and the free-of-charge dissemination of biofilm studies across the community. Hopefully, this will open promising research possibilities, namely the comparison of results between different laboratories, the reproducibility of methods within and between laboratories, and the development of guidelines and standardized protocols for biofilm formation operating procedures and analytical methods.

  19. Integrated omics analyses of retrograde signaling mutant delineate interrelated stress-response strata.

    PubMed

    Bjornson, Marta; Balcke, Gerd Ulrich; Xiao, Yanmei; de Souza, Amancio; Wang, Jin-Zheng; Zhabinskaya, Dina; Tagkopoulos, Ilias; Tissier, Alain; Dehesh, Katayoon

    2017-07-01

    To maintain homeostasis in the face of intrinsic and extrinsic insults, cells have evolved elaborate quality control networks to resolve damage at multiple levels. Interorganellar communication is a key requirement for this maintenance, however the underlying mechanisms of this communication have remained an enigma. Here we integrate the outcome of transcriptomic, proteomic, and metabolomics analyses of genotypes including ceh1, a mutant with constitutively elevated levels of both the stress-specific plastidial retrograde signaling metabolite methyl-erythritol cyclodiphosphate (MEcPP) and the defense hormone salicylic acid (SA), as well as the high MEcPP but SA deficient genotype ceh1/eds16, along with corresponding controls. Integration of multi-omic analyses enabled us to delineate the function of MEcPP from SA, and expose the compartmentalized role of this retrograde signaling metabolite in induction of distinct but interdependent signaling cascades instrumental in adaptive responses. Specifically, here we identify strata of MEcPP-sensitive stress-response cascades, among which we focus on selected pathways including organelle-specific regulation of jasmonate biosynthesis; simultaneous induction of synthesis and breakdown of SA; and MEcPP-mediated alteration of cellular redox status in particular glutathione redox balance. Collectively, these integrated multi-omic analyses provided a vehicle to gain an in-depth knowledge of genome-metabolism interactions, and to further probe the extent of these interactions and delineate their functional contributions. Through this approach we were able to pinpoint stress-mediated transcriptional and metabolic signatures and identify the downstream processes modulated by the independent or overlapping functions of MEcPP and SA in adaptive responses. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  20. A Review of Recent Advancement in Integrating Omics Data with Literature Mining towards Biomedical Discoveries

    PubMed Central

    Raja, Kalpana; Patrick, Matthew; Gao, Yilin; Madu, Desmond; Yang, Yuyang

    2017-01-01

    In the past decade, the volume of “omics” data generated by the different high-throughput technologies has expanded exponentially. The managing, storing, and analyzing of this big data have been a great challenge for the researchers, especially when moving towards the goal of generating testable data-driven hypotheses, which has been the promise of the high-throughput experimental techniques. Different bioinformatics approaches have been developed to streamline the downstream analyzes by providing independent information to interpret and provide biological inference. Text mining (also known as literature mining) is one of the commonly used approaches for automated generation of biological knowledge from the huge number of published articles. In this review paper, we discuss the recent advancement in approaches that integrate results from omics data and information generated from text mining approaches to uncover novel biomedical information. PMID:28331849

  1. Biological Networks for Cancer Candidate Biomarkers Discovery

    PubMed Central

    Yan, Wenying; Xue, Wenjin; Chen, Jiajia; Hu, Guang

    2016-01-01

    Due to its extraordinary heterogeneity and complexity, cancer is often proposed as a model case of a systems biology disease or network disease. There is a critical need of effective biomarkers for cancer diagnosis and/or outcome prediction from system level analyses. Methods based on integrating omics data into networks have the potential to revolutionize the identification of cancer biomarkers. Deciphering the biological networks underlying cancer is undoubtedly important for understanding the molecular mechanisms of the disease and identifying effective biomarkers. In this review, the networks constructed for cancer biomarker discovery based on different omics level data are described and illustrated from recent advances in the field. PMID:27625573

  2. Manipulating environmental stresses and stress tolerance of microalgae for enhanced production of lipids and value-added products-A review.

    PubMed

    Chen, Bailing; Wan, Chun; Mehmood, Muhammad Aamer; Chang, Jo-Shu; Bai, Fengwu; Zhao, Xinqing

    2017-11-01

    Microalgae have promising potential to produce lipids and a variety of high-value chemicals. Suitable stress conditions such as nitrogen starvation and high salinity could stimulate synthesis and accumulation of lipids and high-value products by microalgae, therefore, various stress-modification strategies were developed to manipulate and optimize cultivation processes to enhance bioproduction efficiency. On the other hand, advancements in omics-based technologies have boosted the research to globally understand microalgal gene regulation under stress conditions, which enable further improvement of production efficiency via genetic engineering. Moreover, integration of multi-omics data, synthetic biology design, and genetic engineering manipulations exhibits a tremendous potential in the betterment of microalgal biorefinery. This review discusses the process manipulation strategies and omics studies on understanding the regulation of metabolite biosynthesis under various stressful conditions, and proposes genetic engineering of microalgae to improve bioproduction via manipulating stress tolerance. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.

    PubMed

    Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali

    2018-04-01

    Indian sandalwood ( Santalum album ) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.

  4. EXPOsOMICS: final policy workshop and stakeholder consultation.

    PubMed

    Turner, Michelle C; Vineis, Paolo; Seleiro, Eduardo; Dijmarescu, Michaela; Balshaw, David; Bertollini, Roberto; Chadeau-Hyam, Marc; Gant, Timothy; Gulliver, John; Jeong, Ayoung; Kyrtopoulos, Soterios; Martuzzi, Marco; Miller, Gary W; Nawrot, Timothy; Nieuwenhuijsen, Mark; Phillips, David H; Probst-Hensch, Nicole; Samet, Jonathan; Vermeulen, Roel; Vlaanderen, Jelle; Vrijheid, Martine; Wild, Christopher; Kogevinas, Manolis

    2018-02-15

    The final meeting of the EXPOsOMICS project "Final Policy Workshop and Stakeholder Consultation" took place 28-29 March 2017 to present the main results of the project and discuss their implications both for future research and for regulatory and policy activities. This paper summarizes presentations and discussions at the meeting related with the main results and advances in exposome research achieved through the EXPOsOMICS project; on other parallel research initiatives on the study of the exposome in Europe and in the United States and their complementarity to EXPOsOMICS; lessons learned from these early studies on the exposome and how they may shape the future of research on environmental exposure assessment; and finally the broader implications of exposome research for risk assessment and policy development on environmental exposures. The main results of EXPOsOMICS in relation to studies of the external exposome and internal exposome in relation to both air pollution and water contaminants were presented as well as new technologies for environmental health research (adductomics) and advances in statistical methods. Although exposome research strengthens the scientific basis for policy development, there is a need in terms of showing added value for public health to: improve communication of research results to non-scientific audiences; target research to the broader landscape of societal challenges; and draw applicable conclusions. Priorities for future work include the development and standardization of methodologies and technologies for assessing the external and internal exposome, improved data sharing and integration, and the demonstration of the added value of exposome science over conventional approaches in answering priority policy questions.

  5. Integrative analysis of multi-omics data reveals distinct impacts of DDB1-CUL4 associated factors in human lung adenocarcinomas.

    PubMed

    Yan, Hong; Bi, Lei; Wang, Yunshan; Zhang, Xia; Hou, Zhibo; Wang, Qian; Snijders, Antoine M; Mao, Jian-Hua

    2017-03-23

    Many DDB1-CUL4 associated factors (DCAFs) have been identified and serve as substrate receptors. Although the oncogenic role of CUL4A has been well established, specific DCAFs involved in cancer development remain largely unknown. Here we infer the potential impact of 19 well-defined DCAFs in human lung adenocarcinomas (LuADCs) using integrative omics analyses, and discover that mRNA levels of DTL, DCAF4, 12 and 13 are consistently elevated whereas VBRBP is reduced in LuADCs compared to normal lung tissues. The transcriptional levels of DCAFs are significantly correlated with their gene copy number variations. SKIP2, DTL, DCAF6, 7, 8, 13 and 17 are frequently gained whereas VPRBP, PHIP, DCAF10, 12 and 15 are frequently lost. We find that only transcriptional level of DTL is robustly, significantly and negatively correlated with overall survival across independent datasets. Moreover, DTL-correlated genes are enriched in cell cycle and DNA repair pathways. We also identified that the levels of 25 proteins were significantly associated with DTL overexpression in LuADCs, which include significant decreases in protein level of the tumor supressor genes such as PDCD4, NKX2-1 and PRKAA1. Our results suggest that different CUL4-DCAF axis plays the distinct roles in LuADC development with possible relevance for therapeutic target development.

  6. Systems Medicine: The Future of Medical Genomics, Healthcare, and Wellness.

    PubMed

    Saqi, Mansoor; Pellet, Johann; Roznovat, Irina; Mazein, Alexander; Ballereau, Stéphane; De Meulder, Bertrand; Auffray, Charles

    2016-01-01

    Recent advances in genomics have led to the rapid and relatively inexpensive collection of patient molecular data including multiple types of omics data. The integration of these data with clinical measurements has the potential to impact on our understanding of the molecular basis of disease and on disease management. Systems medicine is an approach to understanding disease through an integration of large patient datasets. It offers the possibility for personalized strategies for healthcare through the development of a new taxonomy of disease. Advanced computing will be an important component in effectively implementing systems medicine. In this chapter we describe three computational challenges associated with systems medicine: disease subtype discovery using integrated datasets, obtaining a mechanistic understanding of disease, and the development of an informatics platform for the mining, analysis, and visualization of data emerging from translational medicine studies.

  7. Bioinformatics-based tools in drug discovery: the cartography from single gene to integrative biological networks.

    PubMed

    Ramharack, Pritika; Soliman, Mahmoud E S

    2018-06-01

    Originally developed for the analysis of biological sequences, bioinformatics has advanced into one of the most widely recognized domains in the scientific community. Despite this technological evolution, there is still an urgent need for nontoxic and efficient drugs. The onus now falls on the 'omics domain to meet this need by implementing bioinformatics techniques that will allow for the introduction of pioneering approaches in the rational drug design process. Here, we categorize an updated list of informatics tools and explore the capabilities of integrative bioinformatics in disease control. We believe that our review will serve as a comprehensive guide toward bioinformatics-oriented disease and drug discovery research. Copyright © 2018 Elsevier Ltd. All rights reserved.

  8. Tools for the functional interpretation of metabolomic experiments.

    PubMed

    Chagoyen, Monica; Pazos, Florencio

    2013-11-01

    The so-called 'omics' approaches used in modern biology aim at massively characterizing the molecular repertories of living systems at different levels. Metabolomics is one of the last additions to the 'omics' family and it deals with the characterization of the set of metabolites in a given biological system. As metabolomic techniques become more massive and allow characterizing larger sets of metabolites, automatic methods for analyzing these sets in order to obtain meaningful biological information are required. Only recently the first tools specifically designed for this task in metabolomics appeared. They are based on approaches previously used in transcriptomics and other 'omics', such as annotation enrichment analysis. These, together with generic tools for metabolic analysis and visualization not specifically designed for metabolomics will for sure be in the toolbox of the researches doing metabolomic experiments in the near future.

  9. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aurich, Maike K.; Fleming, Ronan M. T.; Thiele, Ines

    Metabolomic data sets provide a direct read-out of cellular phenotypes and are increasingly generated to study biological questions. Previous work, by us and others, revealed the potential of analyzing extracellular metabolomic data in the context of the metabolic model using constraint-based modeling. With the MetaboTools, we make our methods available to the broader scientific community. The MetaboTools consist of a protocol, a toolbox, and tutorials of two use cases. The protocol describes, in a step-wise manner, the workflow of data integration, and computational analysis. The MetaboTools comprise the Matlab code required to complete the workflow described in the protocol. Tutorialsmore » explain the computational steps for integration of two different data sets and demonstrate a comprehensive set of methods for the computational analysis of metabolic models and stratification thereof into different phenotypes. The presented workflow supports integrative analysis of multiple omics data sets. Importantly, all analysis tools can be applied to metabolic models without performing the entire workflow. Taken together, the MetaboTools constitute a comprehensive guide to the intra-model analysis of extracellular metabolomic data from microbial, plant, or human cells. In conclusion, this computational modeling resource offers a broad set of computational analysis tools for a wide biomedical and non-biomedical research community.« less

  10. Network-Based Integration of Disparate Omic Data To Identify "Silent Players" in Cancer

    PubMed Central

    Ruffalo, Matthew

    2015-01-01

    Development of high-throughput monitoring technologies enables interrogation of cancer samples at various levels of cellular activity. Capitalizing on these developments, various public efforts such as The Cancer Genome Atlas (TCGA) generate disparate omic data for large patient cohorts. As demonstrated by recent studies, these heterogeneous data sources provide the opportunity to gain insights into the molecular changes that drive cancer pathogenesis and progression. However, these insights are limited by the vast search space and as a result low statistical power to make new discoveries. In this paper, we propose methods for integrating disparate omic data using molecular interaction networks, with a view to gaining mechanistic insights into the relationship between molecular changes at different levels of cellular activity. Namely, we hypothesize that genes that play a role in cancer development and progression may be implicated by neither frequent mutation nor differential expression, and that network-based integration of mutation and differential expression data can reveal these “silent players”. For this purpose, we utilize network-propagation algorithms to simulate the information flow in the cell at a sample-specific resolution. We then use the propagated mutation and expression signals to identify genes that are not necessarily mutated or differentially expressed genes, but have an essential role in tumor development and patient outcome. We test the proposed method on breast cancer and glioblastoma multiforme data obtained from TCGA. Our results show that the proposed method can identify important proteins that are not readily revealed by molecular data, providing insights beyond what can be gleaned by analyzing different types of molecular data in isolation. PMID:26683094

  11. The Air Force In Silico -- Computational Biology in 2025

    DTIC Science & Technology

    2007-11-01

    and chromosome) these new fields are commonly referred to as “~omics.” Proteomics , transcriptomics, metabolomics , epigenomics, physiomics... Bioinformatics , 2006, http://journal.imbio.de/ http://www-bm.ipk-gatersleben.de/stable/php/ journal /articles/pdf/jib-22.pdf (accessed 30 September...Chirino, G. Tansley and I. Dryden, “The implications for Bioinformatics of integration across physical scales,” Journal of Integrative Bioinformatics

  12. Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction

    PubMed Central

    Luo, Yuan; Ahmad, Faraz S.; Shah, Sanjiv J.

    2017-01-01

    Heart failure with preserved ejection fraction (HFpEF) is a heterogeneous clinical syndrome that may benefit from improved subtyping in order to better characterize its pathophysiology and to develop novel targeted therapies. The United States Precision Medicine Initiative comes amid the rapid growth in quantity and modality of clinical data for HFpEF patients ranging from deep phenotypic to trans-omic data. Tensor factorization, a form of machine learning, allows for the integration of multiple data modalities to derive clinically relevant HFpEF subtypes that may have significant differences in underlying pathophysiology and differential response to therapies. Tensor factorization also allows for better interpretability by supporting dimensionality reduction and identifying latent groups of data for meaningful summarization of both features and disease outcomes. In this narrative review, we analyze the modest literature on the application of tensor factorization to related biomedical fields including genotyping and phenotyping. Based on the cited work including work of our own, we suggest multiple tensor factorization formulations capable of integrating the deep phenotypic and trans-omic modalities of data for HFpEF, or accounting for interactions between genetic variants at different -omic hierarchies. We encourage extensive experimental studies to tackle challenges in applying tensor factorization for precision medicine in HFpEF, including effectively incorporating existing medical knowledge, properly accounting for uncertainty, and efficiently enforcing sparsity for better interpretability. PMID:28116551

  13. Decoding genes with coexpression networks and metabolomics - 'majority report by precogs'.

    PubMed

    Saito, Kazuki; Hirai, Masami Y; Yonekura-Sakakibara, Keiko

    2008-01-01

    Following the sequencing of whole genomes of model plants, high-throughput decoding of gene function is a major challenge in modern plant biology. In view of remarkable technical advances in transcriptomics and metabolomics, integrated analysis of these 'omics' by data-mining informatics is an excellent tool for prediction and identification of gene function, particularly for genes involved in complicated metabolic pathways. The availability of Arabidopsis public transcriptome datasets containing data of >1000 microarrays reinforces the potential for prediction of gene function by transcriptome coexpression analysis. Here, we review the strategy of combining transcriptome and metabolome as a powerful technology for studying the functional genomics of model plants and also crop and medicinal plants.

  14. Risk Assessment and Communication Tools for Genotype Associations with Multifactorial Phenotypes: The Concept of “Edge Effect” and Cultivating an Ethical Bridge between Omics Innovations and Society

    PubMed Central

    Suarez-Kurtz, Guilherme; Stenne, Raphaëlle; Somogyi, Andrew A.; Someya, Toshiyuki; Kayaalp, S. Oğuz; Kolker, Eugene

    2009-01-01

    Abstract Applications of omics technologies in the postgenomics era swiftly expanded from rare monogenic disorders to multifactorial common complex diseases, pharmacogenomics, and personalized medicine. Already, there are signposts indicative of further omics technology investment in nutritional sciences (nutrigenomics), environmental health/ecology (ecogenomics), and agriculture (agrigenomics). Genotype–phenotype association studies are a centerpiece of translational research in omics science. Yet scientific and ethical standards and ways to assess and communicate risk information obtained from association studies have been neglected to date. This is a significant gap because association studies decisively influence which genetic loci become genetic tests in the clinic or products in the genetic test marketplace. A growing challenge concerns the interpretation of large overlap typically observed in distribution of quantitative traits in a genetic association study with a polygenic/multifactorial phenotype. To remedy the shortage of risk assessment and communication tools for association studies, this paper presents the concept of edge effect. That is, the shift in population edges of a multifactorial quantitative phenotype is a more sensitive measure (than population averages) to gauge the population level impact and by extension, policy significance of an omics marker. Empirical application of the edge effect concept is illustrated using an original analysis of warfarin pharmacogenomics and the VKORC1 genetic variation in a Brazilian population sample. These edge effect analyses are examined in relation to regulatory guidance development for association studies. We explain that omics science transcends the conventional laboratory bench space and includes a highly heterogeneous cast of stakeholders in society who have a plurality of interests that are often in conflict. Hence, communication of risk information in diagnostic medicine also demands attention to processes involved in production of knowledge and human values embedded in scientific practice, for example, why, how, by whom, and to what ends association studies are conducted, and standards are developed (or not). To ensure sustainability of omics innovations and forecast their trajectory, we need interventions to bridge the gap between omics laboratory and society. Appreciation of scholarship in history of omics science is one remedy to responsibly learn from the past to ensure a sustainable future in omics fields, both emerging (nutrigenomics, ecogenomics), and those that are more established (pharmacogenomics). Another measure to build public trust and sustainability of omics fields could be legislative initiatives to create a multidisciplinary oversight body, at arm's length from conflict of interests, to carry out independent, impartial, and transparent innovation analyses and prospective technology assessment. PMID:19290811

  15. Risk assessment and communication tools for genotype associations with multifactorial phenotypes: the concept of "edge effect" and cultivating an ethical bridge between omics innovations and society.

    PubMed

    Ozdemir, Vural; Suarez-Kurtz, Guilherme; Stenne, Raphaëlle; Somogyi, Andrew A; Someya, Toshiyuki; Kayaalp, S Oğuz; Kolker, Eugene

    2009-02-01

    Applications of omics technologies in the postgenomics era swiftly expanded from rare monogenic disorders to multifactorial common complex diseases, pharmacogenomics, and personalized medicine. Already, there are signposts indicative of further omics technology investment in nutritional sciences (nutrigenomics), environmental health/ecology (ecogenomics), and agriculture (agrigenomics). Genotype-phenotype association studies are a centerpiece of translational research in omics science. Yet scientific and ethical standards and ways to assess and communicate risk information obtained from association studies have been neglected to date. This is a significant gap because association studies decisively influence which genetic loci become genetic tests in the clinic or products in the genetic test marketplace. A growing challenge concerns the interpretation of large overlap typically observed in distribution of quantitative traits in a genetic association study with a polygenic/multifactorial phenotype. To remedy the shortage of risk assessment and communication tools for association studies, this paper presents the concept of edge effect. That is, the shift in population edges of a multifactorial quantitative phenotype is a more sensitive measure (than population averages) to gauge the population level impact and by extension, policy significance of an omics marker. Empirical application of the edge effect concept is illustrated using an original analysis of warfarin pharmacogenomics and the VKORC1 genetic variation in a Brazilian population sample. These edge effect analyses are examined in relation to regulatory guidance development for association studies. We explain that omics science transcends the conventional laboratory bench space and includes a highly heterogeneous cast of stakeholders in society who have a plurality of interests that are often in conflict. Hence, communication of risk information in diagnostic medicine also demands attention to processes involved in production of knowledge and human values embedded in scientific practice, for example, why, how, by whom, and to what ends association studies are conducted, and standards are developed (or not). To ensure sustainability of omics innovations and forecast their trajectory, we need interventions to bridge the gap between omics laboratory and society. Appreciation of scholarship in history of omics science is one remedy to responsibly learn from the past to ensure a sustainable future in omics fields, both emerging (nutrigenomics, ecogenomics), and those that are more established (pharmacogenomics). Another measure to build public trust and sustainability of omics fields could be legislative initiatives to create a multidisciplinary oversight body, at arm's length from conflict of interests, to carry out independent, impartial, and transparent innovation analyses and prospective technology assessment.

  16. PyPanda: a Python package for gene regulatory network reconstruction

    PubMed Central

    van IJzendoorn, David G.P.; Glass, Kimberly; Quackenbush, John; Kuijjer, Marieke L.

    2016-01-01

    Summary: PANDA (Passing Attributes between Networks for Data Assimilation) is a gene regulatory network inference method that uses message-passing to integrate multiple sources of ‘omics data. PANDA was originally coded in C ++. In this application note we describe PyPanda, the Python version of PANDA. PyPanda runs considerably faster than the C ++ version and includes additional features for network analysis. Availability and implementation: The open source PyPanda Python package is freely available at http://github.com/davidvi/pypanda. Contact: mkuijjer@jimmy.harvard.edu or d.g.p.van_ijzendoorn@lumc.nl PMID:27402905

  17. PyPanda: a Python package for gene regulatory network reconstruction.

    PubMed

    van IJzendoorn, David G P; Glass, Kimberly; Quackenbush, John; Kuijjer, Marieke L

    2016-11-01

    PANDA (Passing Attributes between Networks for Data Assimilation) is a gene regulatory network inference method that uses message-passing to integrate multiple sources of 'omics data. PANDA was originally coded in C ++. In this application note we describe PyPanda, the Python version of PANDA. PyPanda runs considerably faster than the C ++ version and includes additional features for network analysis. The open source PyPanda Python package is freely available at http://github.com/davidvi/pypanda CONTACT: mkuijjer@jimmy.harvard.edu or d.g.p.van_ijzendoorn@lumc.nl. © The Author 2016. Published by Oxford University Press.

  18. Beyond DNA Sequencing in Space: Current and Future Omics Capabilities of the Biomolecule Sequencer Payload

    NASA Technical Reports Server (NTRS)

    Wallace, Sarah

    2017-01-01

    Why do we need a DNA sequencer to support the human exploration of space? (A) Operational environmental monitoring; (1) Identification of contaminating microbes, (2) Infectious disease diagnosis, (3) Reduce down mass (sample return for environmental monitoring, crew health, etc.). (B) Research; (1) Human, (2) Animal, (3) Microbes/Cell lines, (4) Plant. (C) Med Ops; (1) Response to countermeasures, (2) Radiation, (3) Real-time analysis can influence medical intervention. (C) Support astrobiology science investigations; (1) Technology superiorly suited to in situ nucleic acid-based life detection, (2) Functional testing for integration into robotics for extraplanetary exploration mission.

  19. Integrative analysis of gene expression and DNA methylation using unsupervised feature extraction for detecting candidate cancer biomarkers.

    PubMed

    Moon, Myungjin; Nakai, Kenta

    2018-04-01

    Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.

  20. Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors

    PubMed Central

    2014-01-01

    In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions. PMID:25071867

  1. Omics databases on kidney disease: where they can be found and how to benefit from them.

    PubMed

    Papadopoulos, Theofilos; Krochmal, Magdalena; Cisek, Katryna; Fernandes, Marco; Husi, Holger; Stevens, Robert; Bascands, Jean-Loup; Schanstra, Joost P; Klein, Julie

    2016-06-01

    In the recent decades, the evolution of omics technologies has led to advances in all biological fields, creating a demand for effective storage, management and exchange of rapidly generated data and research discoveries. To address this need, the development of databases of experimental outputs has become a common part of scientific practice in order to serve as knowledge sources and data-sharing platforms, providing information about genes, transcripts, proteins or metabolites. In this review, we present omics databases available currently, with a special focus on their application in kidney research and possibly in clinical practice. Databases are divided into two categories: general databases with a broad information scope and kidney-specific databases distinctively concentrated on kidney pathologies. In research, databases can be used as a rich source of information about pathophysiological mechanisms and molecular targets. In the future, databases will support clinicians with their decisions, providing better and faster diagnoses and setting the direction towards more preventive, personalized medicine. We also provide a test case demonstrating the potential of biological databases in comparing multi-omics datasets and generating new hypotheses to answer a critical and common diagnostic problem in nephrology practice. In the future, employment of databases combined with data integration and data mining should provide powerful insights into unlocking the mysteries of kidney disease, leading to a potential impact on pharmacological intervention and therapeutic disease management.

  2. Integrative Genomics and Computational Systems Medicine

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McDermott, Jason E.; Huang, Yufei; Zhang, Bing

    The exponential growth in generation of large amounts of genomic data from biological samples has driven the emerging field of systems medicine. This field is promising because it improves our understanding of disease processes at the systems level. However, the field is still in its young stage. There exists a great need for novel computational methods and approaches to effectively utilize and integrate various omics data.

  3. Integration of somatic mutation, expression and functional data reveals potential driver genes predictive of breast cancer survival.

    PubMed

    Suo, Chen; Hrydziuszko, Olga; Lee, Donghwan; Pramana, Setia; Saputra, Dhany; Joshi, Himanshu; Calza, Stefano; Pawitan, Yudi

    2015-08-15

    Genome and transcriptome analyses can be used to explore cancers comprehensively, and it is increasingly common to have multiple omics data measured from each individual. Furthermore, there are rich functional data such as predicted impact of mutations on protein coding and gene/protein networks. However, integration of the complex information across the different omics and functional data is still challenging. Clinical validation, particularly based on patient outcomes such as survival, is important for assessing the relevance of the integrated information and for comparing different procedures. An analysis pipeline is built for integrating genomic and transcriptomic alterations from whole-exome and RNA sequence data and functional data from protein function prediction and gene interaction networks. The method accumulates evidence for the functional implications of mutated potential driver genes found within and across patients. A driver-gene score (DGscore) is developed to capture the cumulative effect of such genes. To contribute to the score, a gene has to be frequently mutated, with high or moderate mutational impact at protein level, exhibiting an extreme expression and functionally linked to many differentially expressed neighbors in the functional gene network. The pipeline is applied to 60 matched tumor and normal samples of the same patient from The Cancer Genome Atlas breast-cancer project. In clinical validation, patients with high DGscores have worse survival than those with low scores (P = 0.001). Furthermore, the DGscore outperforms the established expression-based signatures MammaPrint and PAM50 in predicting patient survival. In conclusion, integration of mutation, expression and functional data allows identification of clinically relevant potential driver genes in cancer. The documented pipeline including annotated sample scripts can be found in http://fafner.meb.ki.se/biostatwiki/driver-genes/. yudi.pawitan@ki.se Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Validation of a novel biomarker panel for the detection of ovarian cancer

    PubMed Central

    Leung, Felix; Bernardini, Marcus Q.; Brown, Marshall D.; Zheng, Yingye; Molina, Rafael; Bast, Robert C.; Davis, Gerard; Serra, Stefano; Diamandis, Eleftherios P.; Kulasingam, Vathany

    2016-01-01

    Background Ovarian cancer (OvCa) is the most lethal gynecological malignancy. Our integrated -omics approach to OvCa biomarker discovery has identified kallikrein 6 (KLK6) and folate-receptor 1 (FOLR1) as promising candidates but these markers require further validation. Methods KLK6, FOLR1 CA125 and HE4 were investigated in three independent serum cohorts with a total of 20 healthy controls, 150 benign controls and 216 OvCa patients. The serum biomarker levels were determined by ELISA or automated immunoassay. Results All biomarkers demonstrated elevations in the sera of OvCa patients compared to controls (p<0.01). Overall, CA125 and HE4 displayed the strongest ability (AUC 0.80 and 0.82, respectively) to identify OvCa patients and the addition of HE4 to CA125 improved the sensitivity from 36% to 67% at a set specificity of 95%. As well, the combination of HE4 and FOLR1 was a strong predictor of OvCa diagnosis, displaying comparable sensitivity (65%) to the best performing CA125-based models (67%) at a set specificity of 95%. Conclusions The markers identified through our integrated –omics approach performed similarly to the clinically-approved markers CA125 and HE4. Furthermore, HE4 represents a powerful diagnostic marker for OvCa and should be used more routinely in a clinical setting. Impact The implications of our study are two-fold: (1) we have demonstrated the strengths of HE4 alone and in combination with CA125, lending credence to increasing its usage in the clinic; and (2) we have demonstrated the clinical utility of our integrated –omics approach to identifying novel serum markers with comparable performance to clinical markers. PMID:27448593

  5. MetaboTools: A comprehensive toolbox for analysis of genome-scale metabolic models

    DOE PAGES

    Aurich, Maike K.; Fleming, Ronan M. T.; Thiele, Ines

    2016-08-03

    Metabolomic data sets provide a direct read-out of cellular phenotypes and are increasingly generated to study biological questions. Previous work, by us and others, revealed the potential of analyzing extracellular metabolomic data in the context of the metabolic model using constraint-based modeling. With the MetaboTools, we make our methods available to the broader scientific community. The MetaboTools consist of a protocol, a toolbox, and tutorials of two use cases. The protocol describes, in a step-wise manner, the workflow of data integration, and computational analysis. The MetaboTools comprise the Matlab code required to complete the workflow described in the protocol. Tutorialsmore » explain the computational steps for integration of two different data sets and demonstrate a comprehensive set of methods for the computational analysis of metabolic models and stratification thereof into different phenotypes. The presented workflow supports integrative analysis of multiple omics data sets. Importantly, all analysis tools can be applied to metabolic models without performing the entire workflow. Taken together, the MetaboTools constitute a comprehensive guide to the intra-model analysis of extracellular metabolomic data from microbial, plant, or human cells. In conclusion, this computational modeling resource offers a broad set of computational analysis tools for a wide biomedical and non-biomedical research community.« less

  6. Spotlight on environmental omics and toxicology: a long way in a short time.

    PubMed

    Martyniuk, Christopher J; Simmons, Denina B

    2016-09-01

    The applications for high throughput omics technologies in environmental science have increased dramatically in recent years. Transcriptomics, proteomics, and metabolomics have been used to study how chemicals in our environment affect both aquatic and terrestrial organisms, and the characterization of molecular initiating events is a significant goal in toxicology to better predict adverse responses to toxicants. This special journal edition demonstrates the scope of the science that leverages omics-based methods in both laboratory and wild populations within the context of environmental toxicology, ranging from fish to mammals. It is important to recognize that the environment comprises one axis of the One Health concept - the idea that human health is unequivocally intertwined to our environment and to the organisms that inhabit that environment. We have much to learn from a comparative approach, and studies that integrate the transcriptome, proteome, and the metabolome are expected to offer the most detailed mechanism-based adverse outcome pathways that are applicable for use in both environmental monitoring and risk assessment. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Improving the discoverability, accessibility, and citability of omics datasets: a case report.

    PubMed

    Darlington, Yolanda F; Naumov, Alexey; McOwiti, Apollo; Kankanamge, Wasula H; Becnel, Lauren B; McKenna, Neil J

    2017-03-01

    Although omics datasets represent valuable assets for hypothesis generation, model testing, and data validation, the infrastructure supporting their reuse lacks organization and consistency. Using nuclear receptor signaling transcriptomic datasets as proof of principle, we developed a model to improve the discoverability, accessibility, and citability of published omics datasets. Primary datasets were retrieved from archives, processed to extract data points, then subjected to metadata enrichment and gap filling. The resulting secondary datasets were exposed on responsive web pages to support mining of gene lists, discovery of related datasets, and single-click citation integration with popular reference managers. Automated processes were established to embed digital object identifier-driven links to the secondary datasets in associated journal articles, small molecule and gene-centric databases, and a dataset search engine. Our model creates multiple points of access to reprocessed and reannotated derivative datasets across the digital biomedical research ecosystem, promoting their visibility and usability across disparate research communities. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  8. Computational Omics Pre-Awardees | Office of Cancer Clinical Proteomics Research

    Cancer.gov

    The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) is pleased to announce the pre-awardees of the Computational Omics solicitation. Working with NVIDIA Foundation's Compute the Cure initiative and Leidos Biomedical Research Inc., the NCI, through this solicitation, seeks to leverage computational efforts to provide tools for the mining and interpretation of large-scale publicly available ‘omics’ datasets.

  9. Systems Biology Approaches for Host–Fungal Interactions: An Expanding Multi-Omics Frontier

    PubMed Central

    Culibrk, Luka; Croft, Carys A.

    2016-01-01

    Abstract Opportunistic fungal infections are an increasing threat for global health, and for immunocompromised patients in particular. These infections are characterized by interaction between fungal pathogen and host cells. The exact mechanisms and the attendant variability in host and fungal pathogen interaction remain to be fully elucidated. The field of systems biology aims to characterize a biological system, and utilize this knowledge to predict the system's response to stimuli such as fungal exposures. A multi-omics approach, for example, combining data from genomics, proteomics, metabolomics, would allow a more comprehensive and pan-optic “two systems” biology of both the host and the fungal pathogen. In this review and literature analysis, we present highly specialized and nascent methods for analysis of multiple -omes of biological systems, in addition to emerging single-molecule visualization techniques that may assist in determining biological relevance of multi-omics data. We provide an overview of computational methods for modeling of gene regulatory networks, including some that have been applied towards the study of an interacting host and pathogen. In sum, comprehensive characterizations of host–fungal pathogen systems are now possible, and utilization of these cutting-edge multi-omics strategies may yield advances in better understanding of both host biology and fungal pathogens at a systems scale. PMID:26885725

  10. Experimental Systems-Biology Approaches for Clostridia-Based Bioenergy Production

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Papoutsakis, Elefterios

    This is the final project report for project "Experimental Systems-Biology Approaches for Clostridia-Based Bioenergy Production" for the funding period of 9/1/12 to 2/28/2015 (three years with a 6-month no-cost extension) OVERVIEW AND PROJECT GOALS The bottleneck of achieving higher rates and titers of toxic metabolites (such as solvents and carboxylic acids that can used as biofuels or biofuel precursors) can be overcome by engineering the stress response system. Thus, understanding and modeling the response of cells to toxic metabolites is a problem of great fundamental and practical significance. In this project, our goal is to dissect at the molecular systemsmore » level and build models (conceptual and quantitative) for the stress response of C. acetobutylicum (Cac) to its two toxic metabolites: butanol (BuOH) and butyrate (BA). Transcriptional (RNAseq and microarray based), proteomic and fluxomic data and their analysis are key requirements for this goal. Transcriptional data from mid-exponential cultures of Cac under 4 different levels of BuOH and BA stress was obtained using both microarrays (Papoutsakis group) and deep sequencing (RNAseq; Meyers and Papoutsakis groups). These two sets of data do not only serve to validate each other, but are also used for identification of stress-induced changes in transcript levels, small regulatory RNAs, & in transcriptional start sites. Quantitative proteomic data (Lee group), collected using the iTRAQ technology, are essential for understanding of protein levels and turnover under stress and the various protein-protein interactions that orchestrate the stress response. Metabolic flux changes (Antoniewicz group) of core pathways, which provide important information on the re-allocation of energy and carbon resources under metabolite stress, were examined using 13C-labelled chemicals. Omics data are integrated at different levels and scales. At the metabolic-pathway level, omics data are integrated into a 2nd generation genome-scale model (GSM) (Maranas group). Omics data are also integrated using bioinformatics (Wu and Huang group), whereby regulatory details of gene and protein expression, protein-protein interactions and metabolic flux regulation are incorporated. The PI (Papoutsakis) facilitated project integration through monthly meeting and reports, conference calls, and collaborative manuscript preparation. The five groups collaborated extensively and made a large number of presentations in national and international meetings. It has also published several papers, with several more in the preparation stage. Several PhD, MS and postdoctoral students were trained as part of this collaborative and interdisciplinary project.« less

  11. Multilayered Genetic and Omics Dissection of Mitochondrial Activity in a Mouse Reference Population

    PubMed Central

    Wu, Yibo; Williams, Evan G.; Dubuis, Sébastien; Mottis, Adrienne; Jovaisaite, Virginija; Houten, Sander M.; Argmann, Carmen A.; Faridi, Pouya; Wolski, Witold; Kutalik, Zoltán; Zamboni, Nicola; Auwerx, Johan; Aebersold, Ruedi

    2014-01-01

    SUMMARY The manner by which genotype and environment affect complex phenotypes is one of the fundamental questions in biology. In this study, we quantified the transcriptome—a subset of the metabolome—and, using targeted proteomics, quantified a subset of the liver proteome from 40 strains of the BXD mouse genetic reference population on two diverse diets. We discovered dozens of transcript, protein, and metabolite QTLs, several of which linked to metabolic phenotypes. Most prominently, Dhtkd1 was identified as a primary regulator of 2-aminoadipate, explaining variance in fasted glucose and diabetes status in both mice and humans. These integrated molecular profiles also allowed further characterization of complex pathways, particularly the mitochondrial unfolded protein response (UPRmt). UPRmt shows strikingly variant responses at the transcript and protein level that are remarkably conserved among C. elegans, mice, and humans. Overall, these examples demonstrate the value of an integrated multilayered omics approach to characterize complex metabolic phenotypes. PMID:25215496

  12. Harnessing Integrative Omics to Facilitate Molecular Imaging of the Human Epidermal Growth Factor Receptor Family for Precision Medicine.

    PubMed

    Pool, Martin; de Boer, H Rudolf; Hooge, Marjolijn N Lub-de; van Vugt, Marcel A T M; de Vries, Elisabeth G E

    2017-01-01

    Cancer is a growing problem worldwide. The cause of death in cancer patients is often due to treatment-resistant metastatic disease. Many molecularly targeted anticancer drugs have been developed against 'oncogenic driver' pathways. However, these treatments are usually only effective in properly selected patients. Resistance to molecularly targeted drugs through selective pressure on acquired mutations or molecular rewiring can hinder their effectiveness. This review summarizes how molecular imaging techniques can potentially facilitate the optimal implementation of targeted agents. Using the human epidermal growth factor receptor (HER) family as a model in (pre)clinical studies, we illustrate how molecular imaging may be employed to characterize whole body target expression as well as monitor drug effectiveness and the emergence of tumor resistance. We further discuss how an integrative omics discovery platform could guide the selection of 'effect sensors' - new molecular imaging targets - which are dynamic markers that indicate treatment effectiveness or resistance.

  13. Childhood obesity: a systems medicine approach.

    PubMed

    Stone, William L; Schetzina, Karen; Stuart, Charles

    2016-06-01

    Childhood obesity and its sequelae are a major public health problem in both the USA and globally. This review will focus on a systems medicine approach to obesity. Systems medicine is an integrative approach utilizing the vast amount of data garnered from "omics" technology and integrating these data with conventional pathophysiology as well as diverse environmental factors such as diet, exercise, community dynamics and the intestinal microbiome. Omics technology includes genomics, epigenomics, metagenomics, metabolomics and proteomics. In addition to unraveling etiology, the goals of a systems medicine approach are to provide actionable and evidenced-based clinical approaches. In the case of childhood obesity, an additional goal is characterizing measureable risk factors/biomarkers for obesity at the earliest possible age and devising age-appropriate optimal intervention strategies. It is also important to establish the age at which interventions could be critical. As discussed below, it is possible that some of the pathophysiological and epigenetic changes resulting from childhood obesity could become more irreversible the longer the obesity remains untreated.

  14. NaviCom: a web application to create interactive molecular network portraits using multi-level omics data.

    PubMed

    Dorel, Mathurin; Viara, Eric; Barillot, Emmanuel; Zinovyev, Andrei; Kuperstein, Inna

    2017-01-01

    Human diseases such as cancer are routinely characterized by high-throughput molecular technologies, and multi-level omics data are accumulated in public databases at increasing rate. Retrieval and visualization of these data in the context of molecular network maps can provide insights into the pattern of regulation of molecular functions reflected by an omics profile. In order to make this task easy, we developed NaviCom, a Python package and web platform for visualization of multi-level omics data on top of biological network maps. NaviCom is bridging the gap between cBioPortal, the most used resource of large-scale cancer omics data and NaviCell, a data visualization web service that contains several molecular network map collections. NaviCom proposes several standardized modes of data display on top of molecular network maps, allowing addressing specific biological questions. We illustrate how users can easily create interactive network-based cancer molecular portraits via NaviCom web interface using the maps of Atlas of Cancer Signalling Network (ACSN) and other maps. Analysis of these molecular portraits can help in formulating a scientific hypothesis on the molecular mechanisms deregulated in the studied disease. NaviCom is available at https://navicom.curie.fr. © The Author(s) 2017. Published by Oxford University Press.

  15. 'Omic' genetic technologies for herbal medicines in psychiatry.

    PubMed

    Sarris, Jerome; Ng, Chee Hong; Schweitzer, Isaac

    2012-04-01

    The field of genetics, which includes the use of 'omic' technologies, is an evolving area of science that has emerging application in phytotherapy. Omic studies include pharmacogenomics, proteomics and metabolomics. Herbal medicines, as monotherapies, or complex formulations such as traditional Chinese herbal prescriptions, may benefit from omic studies, and this new field may be termed 'herbomics'. Applying herbomics in the field of psychiatry may provide answers about which herbal interventions may be effective for individuals, which genetic processes are triggered, and the subsequent neurochemical pathways of activity. The use of proteomic technology can explore the differing epigenetic effects on neurochemical gene expression between individual herbs, isolated constituents and complex formulae. The possibilities of side effects or insufficient response to the herb can also be assessed via pharmacogenomic analysis of polymorphisms of cytochrome P450 liver enzymes or P-glycoprotein. While another novel application of omic technology is for the validation of the concept of synergy in individual herbal extracts and prescriptive formulations. Chronic administration of psychotropic herbal medicines may discover important effects on chromatin remodelling via modification of histone and DNA methylation. This paper focuses on the emerging field of herbomics, and is to our knowledge the first publication to explore this in the area of psychiatry. Copyright © 2011 John Wiley & Sons, Ltd.

  16. Integrative analysis of multi-omics data reveals distinct impacts of DDB1-CUL4 associated factors in human lung adenocarcinomas

    DOE PAGES

    Yan, Hong; Bi, Lei; Wang, Yunshan; ...

    2017-03-23

    Many DDB1-CUL4 associated factors (DCAFs) have been identified and serve as substrate receptors. Although the oncogenic role of CUL4A has been well established, specific DCAFs involved in cancer development remain largely unknown. Here we infer the potential impact of 19 well-defined DCAFs in human lung adenocarcinomas (LuADCs) using integrative omics analyses, and discover that mRNA levels of DTL, DCAF4, 12 and 13 are consistently elevated whereas VBRBP is reduced in LuADCs compared to normal lung tissues. The transcriptional levels of DCAFs are significantly correlated with their gene copy number variations. SKIP2, DTL, DCAF6, 7, 8, 13 and 17 are frequentlymore » gained whereas VPRBP, PHIP, DCAF10, 12 and 15 are frequently lost. We find that only transcriptional level of DTL is robustly, significantly and negatively correlated with overall survival across independent datasets. Moreover, DTL-correlated genes are enriched in cell cycle and DNA repair pathways. We also identified that the levels of 25 proteins were significantly associated with DTL overexpression in LuADCs, which include significant decreases in protein level of the tumor supressor genes such as PDCD4, NKX2-1 and PRKAA1. In conclusion, our results suggest that different CUL4-DCAF axis plays the distinct roles in LuADC development with possible relevance for therapeutic target development.« less

  17. High-throughput bioinformatics with the Cyrille2 pipeline system

    PubMed Central

    Fiers, Mark WEJ; van der Burgt, Ate; Datema, Erwin; de Groot, Joost CW; van Ham, Roeland CHJ

    2008-01-01

    Background Modern omics research involves the application of high-throughput technologies that generate vast volumes of data. These data need to be pre-processed, analyzed and integrated with existing knowledge through the use of diverse sets of software tools, models and databases. The analyses are often interdependent and chained together to form complex workflows or pipelines. Given the volume of the data used and the multitude of computational resources available, specialized pipeline software is required to make high-throughput analysis of large-scale omics datasets feasible. Results We have developed a generic pipeline system called Cyrille2. The system is modular in design and consists of three functionally distinct parts: 1) a web based, graphical user interface (GUI) that enables a pipeline operator to manage the system; 2) the Scheduler, which forms the functional core of the system and which tracks what data enters the system and determines what jobs must be scheduled for execution, and; 3) the Executor, which searches for scheduled jobs and executes these on a compute cluster. Conclusion The Cyrille2 system is an extensible, modular system, implementing the stated requirements. Cyrille2 enables easy creation and execution of high throughput, flexible bioinformatics pipelines. PMID:18269742

  18. Biotic interactions as drivers of algal origin and evolution.

    PubMed

    Brodie, Juliet; Ball, Steven G; Bouget, François-Yves; Chan, Cheong Xin; De Clerck, Olivier; Cock, J Mark; Gachon, Claire; Grossman, Arthur R; Mock, Thomas; Raven, John A; Saha, Mahasweta; Smith, Alison G; Vardi, Assaf; Yoon, Hwan Su; Bhattacharya, Debashish

    2017-11-01

    Contents 670 I. 671 II. 671 III. 676 IV. 678 678 References 678 SUMMARY: Biotic interactions underlie life's diversity and are the lynchpin to understanding its complexity and resilience within an ecological niche. Algal biologists have embraced this paradigm, and studies building on the explosive growth in omics and cell biology methods have facilitated the in-depth analysis of nonmodel organisms and communities from a variety of ecosystems. In turn, these advances have enabled a major revision of our understanding of the origin and evolution of photosynthesis in eukaryotes, bacterial-algal interactions, control of massive algal blooms in the ocean, and the maintenance and degradation of coral reefs. Here, we review some of the most exciting developments in the field of algal biotic interactions and identify challenges for scientists in the coming years. We foresee the development of an algal knowledgebase that integrates ecosystem-wide omics data and the development of molecular tools/resources to perform functional analyses of individuals in isolation and in populations. These assets will allow us to move beyond mechanistic studies of a single species towards understanding the interactions amongst algae and other organisms in both the laboratory and the field. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  19. GeneLab Analysis Working Group Kick-Off Meeting

    NASA Technical Reports Server (NTRS)

    Costes, Sylvain V.

    2018-01-01

    Goals to achieve for GeneLab AWG - GL vision - Review of GeneLab AWG charter Timeline and milestones for 2018 Logistics - Monthly Meeting - Workshop - Internship - ASGSR Introduction of team leads and goals of each group Introduction of all members Q/A Three-tier Client Strategy to Democratize Data Physiological changes, pathway enrichment, differential expression, normalization, processing metadata, reproducibility, Data federation/integration with heterogeneous bioinformatics external databases The GLDS currently serves over 100 omics investigations to the biomedical community via open access. In order to expand the scope of metadata record searches via the GLDS, we designed a metadata warehouse that collects and updates metadata records from external systems housing similar data. To demonstrate the capabilities of federated search and retrieval of these data, we imported metadata records from three open-access data systems into the GLDS metadata warehouse: NCBI's Gene Expression Omnibus (GEO), EBI's PRoteomics IDEntifications (PRIDE) repository, and the Metagenomics Analysis server (MG-RAST). Each of these systems defines metadata for omics data sets differently. One solution to bridge such differences is to employ a common object model (COM) to which each systems' representation of metadata can be mapped. Warehoused metadata records are then transformed at ETL to this single, common representation. Queries generated via the GLDS are then executed against the warehouse, and matching records are shown in the COM representation (Fig. 1). While this approach is relatively straightforward to implement, the volume of the data in the omics domain presents challenges in dealing with latency and currency of records. Furthermore, the lack of a coordinated has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta-investigations are key to corroborating findings from many kinds of assays and translating them into systems biology knowledge and, eventually, therapeutics.

  20. Systems Toxicology: Real World Applications and Opportunities.

    PubMed

    Hartung, Thomas; FitzGerald, Rex E; Jennings, Paul; Mirams, Gary R; Peitsch, Manuel C; Rostami-Hodjegan, Amin; Shah, Imran; Wilks, Martin F; Sturla, Shana J

    2017-04-17

    Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized from empirical end points to describing modes of action as adverse outcome pathways and perturbed networks. Toward this aim, Systems Toxicology entails the integration of in vitro and in vivo toxicity data with computational modeling. This evolving approach depends critically on data reliability and relevance, which in turn depends on the quality of experimental models and bioanalysis techniques used to generate toxicological data. Systems Toxicology involves the use of large-scale data streams ("big data"), such as those derived from omics measurements that require computational means for obtaining informative results. Thus, integrative analysis of multiple molecular measurements, particularly acquired by omics strategies, is a key approach in Systems Toxicology. In recent years, there have been significant advances centered on in vitro test systems and bioanalytical strategies, yet a frontier challenge concerns linking observed network perturbations to phenotypes, which will require understanding pathways and networks that give rise to adverse responses. This summary perspective from a 2016 Systems Toxicology meeting, an international conference held in the Alps of Switzerland, describes the limitations and opportunities of selected emerging applications in this rapidly advancing field. Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized, from empirical end points to pathways of toxicity. This requires the integration of in vitro and in vivo data with computational modeling. Test systems and bioanalytical technologies have made significant advances, but ensuring data reliability and relevance is an ongoing concern. The major challenge facing the new pathway approach is determining how to link observed network perturbations to phenotypic toxicity.

  1. Systems Toxicology: Real World Applications and Opportunities

    PubMed Central

    2017-01-01

    Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized from empirical end points to describing modes of action as adverse outcome pathways and perturbed networks. Toward this aim, Systems Toxicology entails the integration of in vitro and in vivo toxicity data with computational modeling. This evolving approach depends critically on data reliability and relevance, which in turn depends on the quality of experimental models and bioanalysis techniques used to generate toxicological data. Systems Toxicology involves the use of large-scale data streams (“big data”), such as those derived from omics measurements that require computational means for obtaining informative results. Thus, integrative analysis of multiple molecular measurements, particularly acquired by omics strategies, is a key approach in Systems Toxicology. In recent years, there have been significant advances centered on in vitro test systems and bioanalytical strategies, yet a frontier challenge concerns linking observed network perturbations to phenotypes, which will require understanding pathways and networks that give rise to adverse responses. This summary perspective from a 2016 Systems Toxicology meeting, an international conference held in the Alps of Switzerland, describes the limitations and opportunities of selected emerging applications in this rapidly advancing field. Systems Toxicology aims to change the basis of how adverse biological effects of xenobiotics are characterized, from empirical end points to pathways of toxicity. This requires the integration of in vitro and in vivo data with computational modeling. Test systems and bioanalytical technologies have made significant advances, but ensuring data reliability and relevance is an ongoing concern. The major challenge facing the new pathway approach is determining how to link observed network perturbations to phenotypic toxicity. PMID:28362102

  2. RCAS: an RNA centric annotation system for transcriptome-wide regions of interest.

    PubMed

    Uyar, Bora; Yusuf, Dilmurat; Wurmus, Ricardo; Rajewsky, Nikolaus; Ohler, Uwe; Akalin, Altuna

    2017-06-02

    In the field of RNA, the technologies for studying the transcriptome have created a tremendous potential for deciphering the puzzles of the RNA biology. Along with the excitement, the unprecedented volume of RNA related omics data is creating great challenges in bioinformatics analyses. Here, we present the RNA Centric Annotation System (RCAS), an R package, which is designed to ease the process of creating gene-centric annotations and analysis for the genomic regions of interest obtained from various RNA-based omics technologies. The design of RCAS is modular, which enables flexible usage and convenient integration with other bioinformatics workflows. RCAS is an R/Bioconductor package but we also created graphical user interfaces including a Galaxy wrapper and a stand-alone web service. The application of RCAS on published datasets shows that RCAS is not only able to reproduce published findings but also helps generate novel knowledge and hypotheses. The meta-gene profiles, gene-centric annotation, motif analysis and gene-set analysis provided by RCAS provide contextual knowledge which is necessary for understanding the functional aspects of different biological events that involve RNAs. In addition, the array of different interfaces and deployment options adds the convenience of use for different levels of users. RCAS is available at http://bioconductor.org/packages/release/bioc/html/RCAS.html and http://rcas.mdc-berlin.de. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Comparative multi-omics systems analysis of Escherichia coli strains B and K-12.

    PubMed

    Yoon, Sung Ho; Han, Mee-Jung; Jeong, Haeyoung; Lee, Choong Hoon; Xia, Xiao-Xia; Lee, Dae-Hee; Shim, Ji Hoon; Lee, Sang Yup; Oh, Tae Kwang; Kim, Jihyun F

    2012-05-25

    Elucidation of a genotype-phenotype relationship is critical to understand an organism at the whole-system level. Here, we demonstrate that comparative analyses of multi-omics data combined with a computational modeling approach provide a framework for elucidating the phenotypic characteristics of organisms whose genomes are sequenced. We present a comprehensive analysis of genome-wide measurements incorporating multifaceted holistic data - genome, transcriptome, proteome, and phenome - to determine the differences between Escherichia coli B and K-12 strains. A genome-scale metabolic network of E. coli B was reconstructed and used to identify genetic bases of the phenotypes unique to B compared with K-12 through in silico complementation testing. This systems analysis revealed that E. coli B is well-suited for production of recombinant proteins due to a greater capacity for amino acid biosynthesis, fewer proteases, and lack of flagella. Furthermore, E. coli B has an additional type II secretion system and a different cell wall and outer membrane composition predicted to be more favorable for protein secretion. In contrast, E. coli K-12 showed a higher expression of heat shock genes and was less susceptible to certain stress conditions. This integrative systems approach provides a high-resolution system-wide view and insights into why two closely related strains of E. coli, B and K-12, manifest distinct phenotypes. Therefore, systematic understanding of cellular physiology and metabolism of the strains is essential not only to determine culture conditions but also to design recombinant hosts.

  4. Comparative multi-omics systems analysis of Escherichia coli strains B and K-12

    PubMed Central

    2012-01-01

    Background Elucidation of a genotype-phenotype relationship is critical to understand an organism at the whole-system level. Here, we demonstrate that comparative analyses of multi-omics data combined with a computational modeling approach provide a framework for elucidating the phenotypic characteristics of organisms whose genomes are sequenced. Results We present a comprehensive analysis of genome-wide measurements incorporating multifaceted holistic data - genome, transcriptome, proteome, and phenome - to determine the differences between Escherichia coli B and K-12 strains. A genome-scale metabolic network of E. coli B was reconstructed and used to identify genetic bases of the phenotypes unique to B compared with K-12 through in silico complementation testing. This systems analysis revealed that E. coli B is well-suited for production of recombinant proteins due to a greater capacity for amino acid biosynthesis, fewer proteases, and lack of flagella. Furthermore, E. coli B has an additional type II secretion system and a different cell wall and outer membrane composition predicted to be more favorable for protein secretion. In contrast, E. coli K-12 showed a higher expression of heat shock genes and was less susceptible to certain stress conditions. Conclusions This integrative systems approach provides a high-resolution system-wide view and insights into why two closely related strains of E. coli, B and K-12, manifest distinct phenotypes. Therefore, systematic understanding of cellular physiology and metabolism of the strains is essential not only to determine culture conditions but also to design recombinant hosts. PMID:22632713

  5. Metabolomics and Integrative Omics for the Development of Thai Traditional Medicine

    PubMed Central

    Khoomrung, Sakda; Wanichthanarak, Kwanjeera; Nookaew, Intawat; Thamsermsang, Onusa; Seubnooch, Patcharamon; Laohapand, Tawee; Akarasereenont, Pravit

    2017-01-01

    In recent years, interest in studies of traditional medicine in Asian and African countries has gradually increased due to its potential to complement modern medicine. In this review, we provide an overview of Thai traditional medicine (TTM) current development, and ongoing research activities of TTM related to metabolomics. This review will also focus on three important elements of systems biology analysis of TTM including analytical techniques, statistical approaches and bioinformatics tools for handling and analyzing untargeted metabolomics data. The main objective of this data analysis is to gain a comprehensive understanding of the system wide effects that TTM has on individuals. Furthermore, potential applications of metabolomics and systems medicine in TTM will also be discussed. PMID:28769804

  6. Validation of standard operating procedures in a multicenter retrospective study to identify -omics biomarkers for chronic low back pain.

    PubMed

    Dagostino, Concetta; De Gregori, Manuela; Gieger, Christian; Manz, Judith; Gudelj, Ivan; Lauc, Gordan; Divizia, Laura; Wang, Wei; Sim, Moira; Pemberton, Iain K; MacDougall, Jane; Williams, Frances; Van Zundert, Jan; Primorac, Dragan; Aulchenko, Yurii; Kapural, Leonardo; Allegri, Massimo

    2017-01-01

    Chronic low back pain (CLBP) is one of the most common medical conditions, ranking as the greatest contributor to global disability and accounting for huge societal costs based on the Global Burden of Disease 2010 study. Large genetic and -omics studies provide a promising avenue for the screening, development and validation of biomarkers useful for personalized diagnosis and treatment (precision medicine). Multicentre studies are needed for such an effort, and a standardized and homogeneous approach is vital for recruitment of large numbers of participants among different centres (clinical and laboratories) to obtain robust and reproducible results. To date, no validated standard operating procedures (SOPs) for genetic/-omics studies in chronic pain have been developed. In this study, we validated an SOP model that will be used in the multicentre (5 centres) retrospective "PainOmics" study, funded by the European Community in the 7th Framework Programme, which aims to develop new biomarkers for CLBP through three different -omics approaches: genomics, glycomics and activomics. The SOPs describe the specific procedures for (1) blood collection, (2) sample processing and storage, (3) shipping details and (4) cross-check testing and validation before assays that all the centres involved in the study have to follow. Multivariate analysis revealed the absolute specificity and homogeneity of the samples collected by the five centres for all genetics, glycomics and activomics analyses. The SOPs used in our multicenter study have been validated. Hence, they could represent an innovative tool for the correct management and collection of reliable samples in other large-omics-based multicenter studies.

  7. Rebooting Bioresilience: A Multi-OMICS Approach to Tackle Global Catastrophic Biological Risks and Next-Generation Biothreats.

    PubMed

    Kambouris, Manousos E; Manoussopoulos, Yiannis; Kantzanou, Maria; Velegraki, Aristea; Gaitanis, Georgios; Arabatzis, Michalis; Patrinos, George P

    2018-01-01

    Global Catastrophic Biological Risks (GCBRs) refer to biological events-natural, deliberate, and accidental-of a global and lasting impact. This challenges the life scientists to raise their game on two hitherto neglected innovation frontiers: a veritable "futures" thinking to "think the unthinkable," and "systems thinking" so as to see both the trees and the forest when it comes to GCBRs. This innovation analysis article outlines the promise of Omics systems science biotechnologies, for example, to deploy rapid fire diagnostics for health security crises at GCBR level, possibly involving neopathogens and/or incurring epidemics (e.g., severe acute respiratory syndrome [SARS] and Ebola) that collectively threaten the lives of global society and interdependent biological ecosystems. Moreover, Omics encourages thinking beyond immediacy and in long-term strategies for biopreparedness and response innovation when the timelines are aggressive and compressed in response to crises such as GCBRs, but also to non-global but surging, multiple threats occurring as successive, overlapping, or distinct events, rather than as distinct entities-a prospect enforcing a reboot in Bioresilience. We define Next-Generation Bioresilience as "a systems approach against natural, accidental and perpetrated GCBRs using Omics technologies, and a shift in mentality, whereby the systems approach is expanded to include multiple plausible futures and expose unchecked assumptions attendant to risks, beyond technological determinism." In sum, it is time to think about the realistic potential of Omics biotechnologies beyond clinical practice and precision medicine so as to harness the opportunities and address the uncertainties associated not only with GCBRs but also with other emerging Omics applications in health and society.

  8. Improving -Omics-Based Research and Precision Health in Minority Populations: Recommendations for Nurse Scientists.

    PubMed

    Taylor, Jacquelyn Y; Barcelona de Mendoza, Veronica

    2018-01-01

    The purpose of this article is to provide an overview of the role of nurse scientists in -omics-based research and to promote discussion around the conduct of -omics-based nursing research in minority communities. Nurses are advocates, educators, practitioners, scientists, and researchers, and are crucial to the design and successful implementation of -omics studies, particularly including minority communities. The contribution of nursing in this area of research is crucial to reducing health disparities. In this article, challenges in the conduct of -omics-based research in minority communities are discussed, and recommendations for improving diversity among nurse scientists, study participants, and utilization of training and continuing education programs in -omics are provided. Many opportunities exist for nurses to increase their knowledge in -omics and to continue to build the ranks of nurse scientists as leaders in -omics-based research. In order to work successfully with communities of color, nurse scientists must advocate for participation in the Precision Medicine Initiative, improve representation of nurse faculty of color, and increase utilization of training programs in -omics and lead such initiatives. All nursing care has the potential to be affected by the era of -omics and precision health. By taking an inclusive approach to diversity in nursing and -omics research, nurses will be well placed to be leaders in reducing health disparities through research, practice, and education. © 2017 Sigma Theta Tau International.

  9. Mega-analysis of Odds Ratio: A Convergent Method for a Deep Understanding of the Genetic Evidence in Schizophrenia.

    PubMed

    Jia, Peilin; Chen, Xiangning; Xie, Wei; Kendler, Kenneth S; Zhao, Zhongming

    2018-06-20

    Numerous high-throughput omics studies have been conducted in schizophrenia, providing an accumulated catalog of susceptible variants and genes. The results from these studies, however, are highly heterogeneous. The variants and genes nominated by different omics studies often have limited overlap with each other. There is thus a pressing need for integrative analysis to unify the different types of data and provide a convergent view of schizophrenia candidate genes (SZgenes). In this study, we collected a comprehensive, multidimensional dataset, including 7819 brain-expressed genes. The data hosted genome-wide association evidence in genetics (eg, genotyping data, copy number variations, de novo mutations), epigenetics, transcriptomics, and literature mining. We developed a method named mega-analysis of odds ratio (MegaOR) to prioritize SZgenes. Application of MegaOR in the multidimensional data resulted in consensus sets of SZgenes (up to 530), each enriched with dense, multidimensional evidence. We proved that these SZgenes had highly tissue-specific expression in brain and nerve and had intensive interactions that were significantly stronger than chance expectation. Furthermore, we found these SZgenes were involved in human brain development by showing strong spatiotemporal expression patterns; these characteristics were replicated in independent brain expression datasets. Finally, we found the SZgenes were enriched in critical functional gene sets involved in neuronal activities, ligand gated ion signaling, and fragile X mental retardation protein targets. In summary, MegaOR analysis reported consensus sets of SZgenes with enriched association evidence to schizophrenia, providing insights into the pathophysiology underlying schizophrenia.

  10. Deciphering the complex: methodological overview of statistical models to derive OMICS-based biomarkers.

    PubMed

    Chadeau-Hyam, Marc; Campanella, Gianluca; Jombart, Thibaut; Bottolo, Leonardo; Portengen, Lutzen; Vineis, Paolo; Liquet, Benoit; Vermeulen, Roel C H

    2013-08-01

    Recent technological advances in molecular biology have given rise to numerous large-scale datasets whose analysis imposes serious methodological challenges mainly relating to the size and complex structure of the data. Considerable experience in analyzing such data has been gained over the past decade, mainly in genetics, from the Genome-Wide Association Study era, and more recently in transcriptomics and metabolomics. Building upon the corresponding literature, we provide here a nontechnical overview of well-established methods used to analyze OMICS data within three main types of regression-based approaches: univariate models including multiple testing correction strategies, dimension reduction techniques, and variable selection models. Our methodological description focuses on methods for which ready-to-use implementations are available. We describe the main underlying assumptions, the main features, and advantages and limitations of each of the models. This descriptive summary constitutes a useful tool for driving methodological choices while analyzing OMICS data, especially in environmental epidemiology, where the emergence of the exposome concept clearly calls for unified methods to analyze marginally and jointly complex exposure and OMICS datasets. Copyright © 2013 Wiley Periodicals, Inc.

  11. Intestinal meta-transcriptome comparison reveals disparate antiviral transcriptional response and its association with Mitochondria in chicken immunity development

    USDA-ARS?s Scientific Manuscript database

    Background: Availability of a large number of data sets in public repositories and the advances in integrating multi-omics methods have greatly advanced our understanding of biological organisms and microbial associates, as well as large subcellular organelles, such as mitochondria. Mitochondrial ...

  12. Analysis of integrated multiple 'omics' datasets reveals the mechanisms of initiation and determination in the formation of tuberous roots in Rehmannia glutinosa.

    PubMed

    Li, Mingjie; Yang, Yanhui; Li, Xinyu; Gu, Li; Wang, Fengji; Feng, Fajie; Tian, Yunhe; Wang, Fengqing; Wang, Xiaoran; Lin, Wenxiong; Chen, Xinjian; Zhang, Zhongyi

    2015-09-01

    All tuberous roots in Rehmannia glutinosa originate from the expansion of fibrous roots (FRs), but not all FRs can successfully transform into tuberous roots. This study identified differentially expressed genes and proteins associated with the expansion of FRs, by comparing the tuberous root at expansion stages (initiated tuberous root, ITRs) and FRs at the seedling stage (initiated FRs, IFRs). The role of miRNAs in the expansion of FRs was also explored using the sRNA transcriptome and degradome to identify miRNAs and their target genes that were differentially expressed between ITRs and FRs at the mature stage (unexpanded FRs, UFRs, which are unable to expand into ITRs). A total of 6032 genes and 450 proteins were differentially expressed between ITRs and IFRs. Integrated analyses of these data revealed several genes and proteins involved in light signalling, hormone response, and signal transduction that might participate in the induction of tuberous root formation. Several genes related to cell division and cell wall metabolism were involved in initiating the expansion of IFRs. Of 135 miRNAs differentially expressed between ITRs and UFRs, there were 27 miRNAs whose targets were specifically identified in the degradome. Analysis of target genes showed that several miRNAs specifically expressed in UFRs were involved in the degradation of key genes required for the formation of tuberous roots. As far as could be ascertained, this is the first time that the miRNAs that control the transition of FRs to tuberous roots in R. glutinosa have been identified. This comprehensive analysis of 'omics' data sheds new light on the mechanisms involved in the regulation of tuberous roots formation. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  13. Identifying candidate driver genes by integrative ovarian cancer genomics data

    NASA Astrophysics Data System (ADS)

    Lu, Xinguo; Lu, Jibo

    2017-08-01

    Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.

  14. Unraveling snake venom complexity with 'omics' approaches: challenges and perspectives.

    PubMed

    Zelanis, André; Tashima, Alexandre Keiji

    2014-09-01

    The study of snake venom proteomes (venomics) has been experiencing a burst of reports, however the comprehensive knowledge of the dynamic range of proteins present within a single venom, the set of post-translational modifications (PTMs) as well as the lack of a comprehensive database related to venom proteins are among the main challenges in venomics research. The phenotypic plasticity in snake venom proteomes together with their inherent toxin proteoform diversity, points out to the use of integrative analysis in order to better understand their actual complexity. In this regard, such a systems venomics task should encompass the integration of data from transcriptomic and proteomic studies (specially the venom gland proteome), the identification of biological PTMs, and the estimation of artifactual proteomes and peptidomes generated by sample handling procedures. Copyright © 2014 Elsevier Ltd. All rights reserved.

  15. Integrated cellular network of transcription regulations and protein-protein interactions

    PubMed Central

    2010-01-01

    Background With the accumulation of increasing omics data, a key goal of systems biology is to construct networks at different cellular levels to investigate cellular machinery of the cell. However, there is currently no satisfactory method to construct an integrated cellular network that combines the gene regulatory network and the signaling regulatory pathway. Results In this study, we integrated different kinds of omics data and developed a systematic method to construct the integrated cellular network based on coupling dynamic models and statistical assessments. The proposed method was applied to S. cerevisiae stress responses, elucidating the stress response mechanism of the yeast. From the resulting integrated cellular network under hyperosmotic stress, the highly connected hubs which are functionally relevant to the stress response were identified. Beyond hyperosmotic stress, the integrated network under heat shock and oxidative stress were also constructed and the crosstalks of these networks were analyzed, specifying the significance of some transcription factors to serve as the decision-making devices at the center of the bow-tie structure and the crucial role for rapid adaptation scheme to respond to stress. In addition, the predictive power of the proposed method was also demonstrated. Conclusions We successfully construct the integrated cellular network which is validated by literature evidences. The integration of transcription regulations and protein-protein interactions gives more insight into the actual biological network and is more predictive than those without integration. The method is shown to be powerful and flexible and can be used under different conditions and for different species. The coupling dynamic models of the whole integrated cellular network are very useful for theoretical analyses and for further experiments in the fields of network biology and synthetic biology. PMID:20211003

  16. Integrated cellular network of transcription regulations and protein-protein interactions.

    PubMed

    Wang, Yu-Chao; Chen, Bor-Sen

    2010-03-08

    With the accumulation of increasing omics data, a key goal of systems biology is to construct networks at different cellular levels to investigate cellular machinery of the cell. However, there is currently no satisfactory method to construct an integrated cellular network that combines the gene regulatory network and the signaling regulatory pathway. In this study, we integrated different kinds of omics data and developed a systematic method to construct the integrated cellular network based on coupling dynamic models and statistical assessments. The proposed method was applied to S. cerevisiae stress responses, elucidating the stress response mechanism of the yeast. From the resulting integrated cellular network under hyperosmotic stress, the highly connected hubs which are functionally relevant to the stress response were identified. Beyond hyperosmotic stress, the integrated network under heat shock and oxidative stress were also constructed and the crosstalks of these networks were analyzed, specifying the significance of some transcription factors to serve as the decision-making devices at the center of the bow-tie structure and the crucial role for rapid adaptation scheme to respond to stress. In addition, the predictive power of the proposed method was also demonstrated. We successfully construct the integrated cellular network which is validated by literature evidences. The integration of transcription regulations and protein-protein interactions gives more insight into the actual biological network and is more predictive than those without integration. The method is shown to be powerful and flexible and can be used under different conditions and for different species. The coupling dynamic models of the whole integrated cellular network are very useful for theoretical analyses and for further experiments in the fields of network biology and synthetic biology.

  17. Data mining in newt-omics, the repository for omics data from the newt.

    PubMed

    Looso, Mario; Braun, Thomas

    2015-01-01

    Salamanders are an excellent model organism to study regenerative processes due to their unique ability to regenerate lost appendages or organs. Straightforward bioinformatics tools to analyze and take advantage of the growing number of "omics" studies performed in salamanders were lacking so far. To overcome this limitation, we have generated a comprehensive data repository for the red-spotted newt Notophthalmus viridescens, named newt-omics, merging omics style datasets on the transcriptome and proteome level including expression values and annotations. The resource is freely available via a user-friendly Web-based graphical user interface ( http://newt-omics.mpi-bn.mpg.de) that allows access and queries to the database without prior bioinformatical expertise. The repository is updated regularly, incorporating new published datasets from omics technologies.

  18. Insights into the Ecology and Evolution of Polyploid Plants through Network Analysis.

    PubMed

    Gallagher, Joseph P; Grover, Corrinne E; Hu, Guanjing; Wendel, Jonathan F

    2016-06-01

    Polyploidy is a widespread phenomenon throughout eukaryotes, with important ecological and evolutionary consequences. Although genes operate as components of complex pathways and networks, polyploid changes in genes and gene expression have typically been evaluated as either individual genes or as a part of broad-scale analyses. Network analysis has been fruitful in associating genomic and other 'omic'-based changes with phenotype for many systems. In polyploid species, network analysis has the potential not only to facilitate a better understanding of the complex 'omic' underpinnings of phenotypic and ecological traits common to polyploidy, but also to provide novel insight into the interaction among duplicated genes and genomes. This adds perspective to the global patterns of expression (and other 'omic') change that accompany polyploidy and to the patterns of recruitment and/or loss of genes following polyploidization. While network analysis in polyploid species faces challenges common to other analyses of duplicated genomes, present technologies combined with thoughtful experimental design provide a powerful system to explore polyploid evolution. Here, we demonstrate the utility and potential of network analysis to questions pertaining to polyploidy with an example involving evolution of the transgressively superior cotton fibres found in polyploid Gossypium hirsutum. By combining network analysis with prior knowledge, we provide further insights into the role of profilins in fibre domestication and exemplify the potential for network analysis in polyploid species. © 2016 John Wiley & Sons Ltd.

  19. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis

    PubMed Central

    Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md. Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj

    2016-01-01

    Current Zika virus (ZIKV) outbreaks that spread in several areas of Africa, Southeast Asia, and in pacific islands is declared as a global health emergency by World Health Organization (WHO). It causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform; ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to the ZIKV genomic, proteomic and therapeutic knowledge. It comprises of whole genome sequences, their respective functional information regarding proteins, genes, and structural content. Additionally, it also delivers sophisticated analysis such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at whole genome and proteome level with user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates. PMID:27633273

  20. Evaluation of hierarchical models for integrative genomic analyses.

    PubMed

    Denis, Marie; Tadesse, Mahlet G

    2016-03-01

    Advances in high-throughput technologies have led to the acquisition of various types of -omic data on the same biological samples. Each data type gives independent and complementary information that can explain the biological mechanisms of interest. While several studies performing independent analyses of each dataset have led to significant results, a better understanding of complex biological mechanisms requires an integrative analysis of different sources of data. Flexible modeling approaches, based on penalized likelihood methods and expectation-maximization (EM) algorithms, are studied and tested under various biological relationship scenarios between the different molecular features and their effects on a clinical outcome. The models are applied to genomic datasets from two cancer types in the Cancer Genome Atlas project: glioblastoma multiforme and ovarian serous cystadenocarcinoma. The integrative models lead to improved model fit and predictive performance. They also provide a better understanding of the biological mechanisms underlying patients' survival. Source code implementing the integrative models is freely available at https://github.com/mgt000/IntegrativeAnalysis along with example datasets and sample R script applying the models to these data. The TCGA datasets used for analysis are publicly available at https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp marie.denis@cirad.fr or mgt26@georgetown.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. BiNA: A Visual Analytics Tool for Biological Network Data

    PubMed Central

    Gerasch, Andreas; Faber, Daniel; Küntzer, Jan; Niermann, Peter; Kohlbacher, Oliver; Lenhof, Hans-Peter; Kaufmann, Michael

    2014-01-01

    Interactive visual analysis of biological high-throughput data in the context of the underlying networks is an essential task in modern biomedicine with applications ranging from metabolic engineering to personalized medicine. The complexity and heterogeneity of data sets require flexible software architectures for data analysis. Concise and easily readable graphical representation of data and interactive navigation of large data sets are essential in this context. We present BiNA - the Biological Network Analyzer - a flexible open-source software for analyzing and visualizing biological networks. Highly configurable visualization styles for regulatory and metabolic network data offer sophisticated drawings and intuitive navigation and exploration techniques using hierarchical graph concepts. The generic projection and analysis framework provides powerful functionalities for visual analyses of high-throughput omics data in the context of networks, in particular for the differential analysis and the analysis of time series data. A direct interface to an underlying data warehouse provides fast access to a wide range of semantically integrated biological network databases. A plugin system allows simple customization and integration of new analysis algorithms or visual representations. BiNA is available under the 3-clause BSD license at http://bina.unipax.info/. PMID:24551056

  2. Innovative Tools and Technology for Analysis of Single Cells and Cell-Cell Interaction.

    PubMed

    Konry, Tania; Sarkar, Saheli; Sabhachandani, Pooja; Cohen, Noa

    2016-07-11

    Heterogeneity in single-cell responses and intercellular interactions results from complex regulation of cell-intrinsic and environmental factors. Single-cell analysis allows not only detection of individual cellular characteristics but also correlation of genetic content with phenotypic traits in the same cell. Technological advances in micro- and nanofabrication have benefited single-cell analysis by allowing precise control of the localized microenvironment, cell manipulation, and sensitive detection capabilities. Additionally, microscale techniques permit rapid, high-throughput, multiparametric screening that has become essential for -omics research. This review highlights innovative applications of microscale platforms in genetic, proteomic, and metabolic detection in single cells; cell sorting strategies; and heterotypic cell-cell interaction. We discuss key design aspects of single-cell localization and isolation in microfluidic systems, dynamic and endpoint analyses, and approaches that integrate highly multiplexed detection of various intracellular species.

  3. Deciphering Biochemical Network: from particles to planes then to spaces

    NASA Astrophysics Data System (ADS)

    Ye, Xinhao; Zhang, Siliang; Engineer Research CenterBiotechnology, National

    2004-03-01

    Today when we are still infatuated with the booming systematic fashion in life science, we, especially as biologist, ironically have fallen down into a sub-systematic maze. That is, although rapid advances in "omics" sciences ceaselessly provided so-called global or large-scale maps to exhibit the corresponding subnet, seldom paid attention to connecting these distinct but close-knit functional modules. Fortunately, a group of physicists recently cast off this natural moat and integrated multi-scale biological network into a simple life's pyramid. However, if extended this pyramid to a 3D structure in view of XYZ axis constructed by the temporal, spatial and organized characteristics respectively, it should be noted that this from-universal-to-particular pyramid is only a transverse section while the achievements in diverse "omics" sciences consist of relative longitudinal ones. On that footing, if analogizing the development of systems biology in last decades as a huge leap from discrete particles (typically in "a paper = a gene" era) to several planes (that is relative to corresponding OMICS science), we might rationally predict a next "space" era is coming soon to untangle and map the multi-tiered biological network really in a whole.

  4. -Omic and Electronic Health Record Big Data Analytics for Precision Medicine.

    PubMed

    Wu, Po-Yen; Cheng, Chih-Wen; Kaddi, Chanchala D; Venugopalan, Janani; Hoffman, Ryan; Wang, May D

    2017-02-01

    Rapid advances of high-throughput technologies and wide adoption of electronic health records (EHRs) have led to fast accumulation of -omic and EHR data. These voluminous complex data contain abundant information for precision medicine, and big data analytics can extract such knowledge to improve the quality of healthcare. In this paper, we present -omic and EHR data characteristics, associated challenges, and data analytics including data preprocessing, mining, and modeling. To demonstrate how big data analytics enables precision medicine, we provide two case studies, including identifying disease biomarkers from multi-omic data and incorporating -omic information into EHR. Big data analytics is able to address -omic and EHR data challenges for paradigm shift toward precision medicine. Big data analytics makes sense of -omic and EHR data to improve healthcare outcome. It has long lasting societal impact.

  5. Next generation microbiological risk assessment-Potential of omics data for hazard characterisation.

    PubMed

    Haddad, Nabila; Johnson, Nick; Kathariou, Sophia; Métris, Aline; Phister, Trevor; Pielaat, Annemarie; Tassou, Chrysoula; Wells-Bennik, Marjon H J; Zwietering, Marcel H

    2018-04-12

    According to the World Health Organization estimates in 2015, 600 million people fall ill every year from contaminated food and 420,000 die. Microbial risk assessment (MRA) was developed as a tool to reduce and prevent risks presented by pathogens and/or their toxins. MRA is organized in four steps to analyse information and assist in both designing appropriate control options and implementation of regulatory decisions and programs. Among the four steps, hazard characterisation is performed to establish the probability and severity of a disease outcome, which is determined as function of the dose of toxin and/or pathogen ingested. This dose-response relationship is subject to both variability and uncertainty. The purpose of this review/opinion article is to discuss how Next Generation Omics can impact hazard characterisation and, more precisely, how it can improve our understanding of variability and limit the uncertainty in the dose-response relation. The expansion of omics tools (e.g. genomics, transcriptomics, proteomics and metabolomics) allows for a better understanding of pathogenicity mechanisms and virulence levels of bacterial strains. Detection and identification of virulence genes, comparative genomics, analyses of mRNA and protein levels and the development of biomarkers can help in building a mechanistic dose-response model to predict disease severity. In this respect, systems biology can help to identify critical system characteristics that confer virulence and explain variability between strains. Despite challenges in the integration of omics into risk assessment, some omics methods have already been used by regulatory agencies for hazard identification. Standardized methods, reproducibility and datasets obtained from realistic conditions remain a challenge, and are needed to improve accuracy of hazard characterisation. When these improvements are realized, they will allow the health authorities and government policy makers to prioritize hazards more accurately and thus refine surveillance programs with the collaboration of all stakeholders of the food chain. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  6. Somatic Embryogenesis in Coffee: The Evolution of Biotechnology and the Integration of Omics Technologies Offer Great Opportunities

    PubMed Central

    Campos, Nádia A.; Panis, Bart; Carpentier, Sebastien C.

    2017-01-01

    One of the most important crops cultivated around the world is coffee. There are two main cultivated species, Coffea arabica and C. canephora. Both species are difficult to improve through conventional breeding, taking at least 20 years to produce a new cultivar. Biotechnological tools such as genetic transformation, micropropagation and somatic embryogenesis (SE) have been extensively studied in order to provide practical results for coffee improvement. While genetic transformation got many attention in the past and is booming with the CRISPR technology, micropropagation and SE are still the major bottle neck and urgently need more attention. The methodologies to induce SE and the further development of the embryos are genotype-dependent, what leads to an almost empirical development of specific protocols for each cultivar or clone. This is a serious limitation and excludes a general comprehensive understanding of the process as a whole. The aim of this review is to provide an overview of which achievements and molecular insights have been gained in (coffee) somatic embryogenesis and encourage researchers to invest further in the in vitro technology and combine it with the latest omics techniques (genomics, transcriptomics, proteomics, metabolomics, and phenomics). We conclude that the evolution of biotechnology and the integration of omics technologies offer great opportunities to (i) optimize the production process of SE and the subsequent conversion into rooted plantlets and (ii) to screen for possible somaclonal variation. However, currently the usage of the latest biotechnology did not pass the stage beyond proof of potential and needs to further improve. PMID:28871271

  7. Metabolomics in transfusion medicine.

    PubMed

    Nemkov, Travis; Hansen, Kirk C; Dumont, Larry J; D'Alessandro, Angelo

    2016-04-01

    Biochemical investigations on the regulatory mechanisms of red blood cell (RBC) and platelet (PLT) metabolism have fostered a century of advances in the field of transfusion medicine. Owing to these advances, storage of RBCs and PLT concentrates has become a lifesaving practice in clinical and military settings. There, however, remains room for improvement, especially with regard to the introduction of novel storage and/or rejuvenation solutions, alternative cell processing strategies (e.g., pathogen inactivation technologies), and quality testing (e.g., evaluation of novel containers with alternative plasticizers). Recent advancements in mass spectrometry-based metabolomics and systems biology, the bioinformatics integration of omics data, promise to speed up the design and testing of innovative storage strategies developed to improve the quality, safety, and effectiveness of blood products. Here we review the currently available metabolomics technologies and briefly describe the routine workflow for transfusion medicine-relevant studies. The goal is to provide transfusion medicine experts with adequate tools to navigate through the otherwise overwhelming amount of metabolomics data burgeoning in the field during the past few years. Descriptive metabolomics data have represented the first step omics researchers have taken into the field of transfusion medicine. However, to up the ante, clinical and omics experts will need to merge their expertise to investigate correlative and mechanistic relationships among metabolic variables and transfusion-relevant variables, such as 24-hour in vivo recovery for transfused RBCs. Integration with systems biology models will potentially allow for in silico prediction of metabolic phenotypes, thus streamlining the design and testing of alternative storage strategies and/or solutions. © 2015 AABB.

  8. Assessment of pleiotropic transcriptome perturbations in Arabidopsis engineered for indirect insect defence.

    PubMed

    Houshyani, Benyamin; van der Krol, Alexander R; Bino, Raoul J; Bouwmeester, Harro J

    2014-06-19

    Molecular characterization is an essential step of risk/safety assessment of genetically modified (GM) crops. Holistic approaches for molecular characterization using omics platforms can be used to confirm the intended impact of the genetic engineering, but can also reveal the unintended changes at the omics level as a first assessment of potential risks. The potential of omics platforms for risk assessment of GM crops has rarely been used for this purpose because of the lack of a consensus reference and statistical methods to judge the significance or importance of the pleiotropic changes in GM plants. Here we propose a meta data analysis approach to the analysis of GM plants, by measuring the transcriptome distance to untransformed wild-types. In the statistical analysis of the transcriptome distance between GM and wild-type plants, values are compared with naturally occurring transcriptome distances in non-GM counterparts obtained from a database. Using this approach we show that the pleiotropic effect of genes involved in indirect insect defence traits is substantially equivalent to the variation in gene expression occurring naturally in Arabidopsis. Transcriptome distance is a useful screening method to obtain insight in the pleiotropic effects of genetic modification.

  9. Omics strategies for revealing Yersinia pestis virulence

    PubMed Central

    Yang, Ruifu; Du, Zongmin; Han, Yanping; Zhou, Lei; Song, Yajun; Zhou, Dongsheng; Cui, Yujun

    2012-01-01

    Omics has remarkably changed the way we investigate and understand life. Omics differs from traditional hypothesis-driven research because it is a discovery-driven approach. Mass datasets produced from omics-based studies require experts from different fields to reveal the salient features behind these data. In this review, we summarize omics-driven studies to reveal the virulence features of Yersinia pestis through genomics, trascriptomics, proteomics, interactomics, etc. These studies serve as foundations for further hypothesis-driven research and help us gain insight into Y. pestis pathogenesis. PMID:23248778

  10. GDCRNATools: an R/Bioconductor package for integrative analysis of lncRNA, miRNA, and mRNA data in GDC.

    PubMed

    Li, Ruidong; Qu, Han; Wang, Shibo; Wei, Julong; Zhang, Le; Ma, Renyuan; Lu, Jianming; Zhu, Jianguo; Zhong, Wei-De; Jia, Zhenyu

    2018-03-02

    The large-scale multidimensional omics data in the Genomic Data Commons (GDC) provides opportunities to investigate the crosstalk among different RNA species and their regulatory mechanisms in cancers. Easy-to-use bioinformatics pipelines are needed to facilitate such studies. We have developed a user-friendly R/Bioconductor package, named GDCRNATools, for downloading, organizing, and analyzing RNA data in GDC with an emphasis on deciphering the lncRNA-mRNA related competing endogenous RNAs (ceRNAs) regulatory network in cancers. Many widely used bioinformatics tools and databases are utilized in our package. Users can easily pack preferred downstream analysis pipelines or integrate their own pipelines into the workflow. Interactive shiny web apps built in GDCRNATools greatly improve visualization of results from the analysis. GDCRNATools is an R/Bioconductor package that is freely available at Bioconductor (http://bioconductor.org/packages/devel/bioc/html/GDCRNATools.html). Detailed instructions, manual and example code are also available in Github (https://github.com/Jialab-UCR/GDCRNATools). arthur.jia@ucr.edu or zhongwd2009@live.cn or doctorzhujianguo@163.com.

  11. The contribution of genetics and environment to obesity.

    PubMed

    Albuquerque, David; Nóbrega, Clévio; Manco, Licínio; Padez, Cristina

    2017-09-01

    Obesity is a global health problem mainly attributed to lifestyle changes such as diet, low physical activity or socioeconomics factors. However, several evidences consistently showed that genetics contributes significantly to the weight-gain susceptibility. A systematic literature search of most relevant original, review and meta-analysis, restricted to English was conducted in PubMed, Web of Science and Google scholar up to May 2017 concerning the contribution of genetics and environmental factors to obesity. Several evidences suggest that obesogenic environments contribute to the development of an obese phenotype. However, not every individual from the same population, despite sharing the same obesogenic environment, develop obesity. After more than 10 years of investigation on the genetics of obesity, the variants found associated with obesity represent only 3% of the estimated BMI-heritability, which is around 47-80%. Moreover, genetic factors per se were unable to explain the rapid spread of obesity prevalence. The integration of multi-omics data enables scientists having a better picture and to elucidate unknown pathways contributing to obesity. New studies based on case-control or gene candidate approach will be important to identify new variants associated with obesity susceptibility and consequently unveiling its genetic architecture. This will lead to an improvement of our understanding about underlying mechanisms involved in development and origin of the actual obesity epidemic. The integration of several omics will also provide insights about the interplay between genes and environments contributing to the obese phenotype. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com

  12. RAMONA: a Web application for gene set analysis on multilevel omics data.

    PubMed

    Sass, Steffen; Buettner, Florian; Mueller, Nikola S; Theis, Fabian J

    2015-01-01

    Decreasing costs of modern high-throughput experiments allow for the simultaneous analysis of altered gene activity on various molecular levels. However, these multi-omics approaches lead to a large amount of data, which is hard to interpret for a non-bioinformatician. Here, we present the remotely accessible multilevel ontology analysis (RAMONA). It offers an easy-to-use interface for the simultaneous gene set analysis of combined omics datasets and is an extension of the previously introduced MONA approach. RAMONA is based on a Bayesian enrichment method for the inference of overrepresented biological processes among given gene sets. Overrepresentation is quantified by interpretable term probabilities. It is able to handle data from various molecular levels, while in parallel coping with redundancies arising from gene set overlaps and related multiple testing problems. The comprehensive output of RAMONA is easy to interpret and thus allows for functional insight into the affected biological processes. With RAMONA, we provide an efficient implementation of the Bayesian inference problem such that ontologies consisting of thousands of terms can be processed in the order of seconds. RAMONA is implemented as ASP.NET Web application and publicly available at http://icb.helmholtz-muenchen.de/ramona. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  13. Computational challenges in modeling gene regulatory events.

    PubMed

    Pataskar, Abhijeet; Tiwari, Vijay K

    2016-10-19

    Cellular transcriptional programs driven by genetic and epigenetic mechanisms could be better understood by integrating "omics" data and subsequently modeling the gene-regulatory events. Toward this end, computational biology should keep pace with evolving experimental procedures and data availability. This article gives an exemplified account of the current computational challenges in molecular biology.

  14. MicroRNA regulation of bovine monocyte inflammatory and metabolic networks in an in vivo infection model

    USDA-ARS?s Scientific Manuscript database

    Bovine mastitis is an inflammation-driven disease of the bovine mammary gland that costs the global dairy industry several billion dollars per annum. Because disease susceptibility is a multi-factorial complex phenotype, a multi-omic integrative biology approach is required to dissect the multilayer...

  15. Multidimensional approaches for studying plant defence against insects: from ecology to omics and synthetic biology.

    PubMed

    Barah, Pankaj; Bones, Atle M

    2015-02-01

    The biggest challenge for modern biology is to integrate multidisciplinary approaches towards understanding the organizational and functional complexity of biological systems at different hierarchies, starting from the subcellular molecular mechanisms (microscopic) to the functional interactions of ecological communities (macroscopic). The plant-insect interaction is a good model for this purpose with the availability of an enormous amount of information at the molecular and the ecosystem levels. Changing global climatic conditions are abruptly resetting plant-insect interactions. Integration of discretely located heterogeneous information from the ecosystem to genes and pathways will be an advantage to understand the complexity of plant-insect interactions. This review will present the recent developments in omics-based high-throughput experimental approaches, with particular emphasis on studying plant defence responses against insect attack. The review highlights the importance of using integrative systems approaches to study plant-insect interactions from the macroscopic to the microscopic level. We analyse the current efforts in generating, integrating and modelling multiomics data to understand plant-insect interaction at a systems level. As a future prospect, we highlight the growing interest in utilizing the synthetic biology platform for engineering insect-resistant plants. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  16. Crop improvement using life cycle datasets acquired under field conditions.

    PubMed

    Mochida, Keiichi; Saisho, Daisuke; Hirayama, Takashi

    2015-01-01

    Crops are exposed to various environmental stresses in the field throughout their life cycle. Modern plant science has provided remarkable insights into the molecular networks of plant stress responses in laboratory conditions, but the responses of different crops to environmental stresses in the field need to be elucidated. Recent advances in omics analytical techniques and information technology have enabled us to integrate data from a spectrum of physiological metrics of field crops. The interdisciplinary efforts of plant science and data science enable us to explore factors that affect crop productivity and identify stress tolerance-related genes and alleles. Here, we describe recent advances in technologies that are key components for data driven crop design, such as population genomics, chronological omics analyses, and computer-aided molecular network prediction. Integration of the outcomes from these technologies will accelerate our understanding of crop phenology under practical field situations and identify key characteristics to represent crop stress status. These elements would help us to genetically engineer "designed crops" to prevent yield shortfalls because of environmental fluctuations due to future climate change.

  17. Protein Interactome of Muscle Invasive Bladder Cancer

    PubMed Central

    Bhat, Akshay; Heinzel, Andreas; Mayer, Bernd; Perco, Paul; Mühlberger, Irmgard; Husi, Holger; Merseburger, Axel S.; Zoidakis, Jerome; Vlahou, Antonia; Schanstra, Joost P.; Mischak, Harald; Jankowski, Vera

    2015-01-01

    Muscle invasive bladder carcinoma is a complex, multifactorial disease caused by disruptions and alterations of several molecular pathways that result in heterogeneous phenotypes and variable disease outcome. Combining this disparate knowledge may offer insights for deciphering relevant molecular processes regarding targeted therapeutic approaches guided by molecular signatures allowing improved phenotype profiling. The aim of the study is to characterize muscle invasive bladder carcinoma on a molecular level by incorporating scientific literature screening and signatures from omics profiling. Public domain omics signatures together with molecular features associated with muscle invasive bladder cancer were derived from literature mining to provide 286 unique protein-coding genes. These were integrated in a protein-interaction network to obtain a molecular functional map of the phenotype. This feature map educated on three novel disease-associated pathways with plausible involvement in bladder cancer, namely Regulation of actin cytoskeleton, Neurotrophin signalling pathway and Endocytosis. Systematic integration approaches allow to study the molecular context of individual features reported as associated with a clinical phenotype and could potentially help to improve the molecular mechanistic description of the disorder. PMID:25569276

  18. -Omic and Electronic Health Records Big Data Analytics for Precision Medicine

    PubMed Central

    Wu, Po-Yen; Cheng, Chih-Wen; Kaddi, Chanchala D.; Venugopalan, Janani; Hoffman, Ryan; Wang, May D.

    2017-01-01

    Objective Rapid advances of high-throughput technologies and wide adoption of electronic health records (EHRs) have led to fast accumulation of -omic and EHR data. These voluminous complex data contain abundant information for precision medicine, and big data analytics can extract such knowledge to improve the quality of health care. Methods In this article, we present -omic and EHR data characteristics, associated challenges, and data analytics including data pre-processing, mining, and modeling. Results To demonstrate how big data analytics enables precision medicine, we provide two case studies, including identifying disease biomarkers from multi-omic data and incorporating -omic information into EHR. Conclusion Big data analytics is able to address –omic and EHR data challenges for paradigm shift towards precision medicine. Significance Big data analytics makes sense of –omic and EHR data to improve healthcare outcome. It has long lasting societal impact. PMID:27740470

  19. Biomolecular Analysis Capability for Cellular and Omics Research on the International Space Station

    NASA Technical Reports Server (NTRS)

    Guinart-Ramirez, Y.; Cooley, V. M.; Love, J. E.

    2016-01-01

    International Space Station (ISS) assembly complete ushered a new era focused on utilization of this state-of-the-art orbiting laboratory to advance science and technology research in a wide array of disciplines, with benefits to Earth and space exploration. ISS enabling capability for research in cellular and molecular biology includes equipment for in situ, on-orbit analysis of biomolecules. Applications of this growing capability range from biomedicine and biotechnology to the emerging field of Omics. For example, Biomolecule Sequencer is a space-based miniature DNA sequencer that provides nucleotide sequence data for entire samples, which may be used for purposes such as microorganism identification and astrobiology. It complements the use of WetLab-2 SmartCycler"TradeMark", which extracts RNA and provides real-time quantitative gene expression data analysis from biospecimens sampled or cultured onboard the ISS, for downlink to ground investigators, with applications ranging from clinical tissue evaluation to multigenerational assessment of organismal alterations. And the Genes in Space-1 investigation, aimed at examining epigenetic changes, employs polymerase chain reaction to detect immune system alterations. In addition, an increasing assortment of tools to visualize the subcellular distribution of tagged macromolecules is becoming available onboard the ISS. For instance, the NASA LMM (Light Microscopy Module) is a flexible light microscopy imaging facility that enables imaging of physical and biological microscopic phenomena in microgravity. Another light microscopy system modified for use in space to image life sciences payloads is initially used by the Heart Cells investigation ("Effects of Microgravity on Stem Cell-Derived Cardiomyocytes for Human Cardiovascular Disease Modeling and Drug Discovery"). Also, the JAXA Microscope system can perform remotely controllable light, phase-contrast, and fluorescent observations. And upcoming confocal microscopy capability will allow for optical sectioning of biological tissues to determine microanatomical localization of biomarkers. Furthermore, NASA's geneLAB effort addresses integration of genomic, epigenomic, transcriptomic, proteomic and metabolomic datasets, by applying an innovative open source science platform for multi-investigator high throughput utilization of the ISS. In sum, the expanding ISS capability for analysis of biomolecules is enabling innovative research in a broad spectrum of areas such as cellular and molecular biology, biotechnology, tissue engineering, biomedicine, and Omics, providing manifold benefits for humanity.

  20. ICan: an integrated co-alteration network to identify ovarian cancer-related genes.

    PubMed

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.

  1. ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes

    PubMed Central

    Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan

    2015-01-01

    Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614

  2. Integrated metatranscriptomics and metaproteomics for the characterization of bacterial microbiota in unfed Ixodes ricinus.

    PubMed

    Hernández-Jarguín, Angélica; Díaz-Sánchez, Sandra; Villar, Margarita; de la Fuente, José

    2018-05-05

    An innovative metaomics approach integrating metatranscriptomics and metaproteomics was used to characterize bacterial communities in the microbiota of the Lyme borreliosis spirochete vector, Ixodes ricinus (Acari: Ixodidae). Whole internal tissues and salivary glands from unfed larvae and female ticks, respectively were used. Reused I. ricinus RNA-sequencing data for metranscriptomics analysis together with metaproteomics provided a better characterization of tick bacterial microbiota by increasing bacteria identification and support for identified bacteria with putative functional implications. The results showed the presence of symbiotic, commensal, soil, environmental, and pathogenic bacteria in the I. ricinus microbiota, including previously unrecognized commensal and soil microorganisms. The results of the metaomics approach may have implications in the characterization of putative mechanisms by which pathogen infection manipulates tick microbiota to facilitate infection. Metaomics approaches integrating different omics datasets would provide a better description of tick microbiota compositions, and insights into tick interactions with microbiota, pathogens and hosts. Copyright © 2018 Elsevier GmbH. All rights reserved.

  3. ECOMICS: A Web-Based Toolkit for Investigating the Biomolecular Web in Ecosystems Using a Trans-omics Approach

    PubMed Central

    Morioka, Yusuke; Everroad, R. Craig; Shino, Amiu; Matsushima, Akihiro; Haruna, Hideaki; Moriya, Shigeharu; Toyoda, Tetsuro; Kikuchi, Jun

    2012-01-01

    Ecosystems can be conceptually thought of as interconnected environmental and metabolic systems, in which small molecules to macro-molecules interact through diverse networks. State-of-the-art technologies in post-genomic science offer ways to inspect and analyze this biomolecular web using omics-based approaches. Exploring useful genes and enzymes, as well as biomass resources responsible for anabolism and catabolism within ecosystems will contribute to a better understanding of environmental functions and their application to biotechnology. Here we present ECOMICS, a suite of web-based tools for ECosystem trans-OMICS investigation that target metagenomic, metatranscriptomic, and meta-metabolomic systems, including biomacromolecular mixtures derived from biomass. ECOMICS is made of four integrated webtools. E-class allows for the sequence-based taxonomic classification of eukaryotic and prokaryotic ribosomal data and the functional classification of selected enzymes. FT2B allows for the digital processing of NMR spectra for downstream metabolic or chemical phenotyping. Bm-Char allows for statistical assignment of specific compounds found in lignocellulose-based biomass, and HetMap is a data matrix generator and correlation calculator that can be applied to trans-omics datasets as analyzed by these and other web tools. This web suite is unique in that it allows for the monitoring of biomass metabolism in a particular environment, i.e., from macromolecular complexes (FT2DB and Bm-Char) to microbial composition and degradation (E-class), and makes possible the understanding of relationships between molecular and microbial elements (HetMap). This website is available to the public domain at: https://database.riken.jp/ecomics/. PMID:22319563

  4. MOPED enables discoveries through consistently processed proteomics data

    PubMed Central

    Higdon, Roger; Stewart, Elizabeth; Stanberry, Larissa; Haynes, Winston; Choiniere, John; Montague, Elizabeth; Anderson, Nathaniel; Yandl, Gregory; Janko, Imre; Broomall, William; Fishilevich, Simon; Lancet, Doron; Kolker, Natali; Kolker, Eugene

    2014-01-01

    The Model Organism Protein Expression Database (MOPED, http://moped.proteinspire.org), is an expanding proteomics resource to enable biological and biomedical discoveries. MOPED aggregates simple, standardized and consistently processed summaries of protein expression and metadata from proteomics (mass spectrometry) experiments from human and model organisms (mouse, worm and yeast). The latest version of MOPED adds new estimates of protein abundance and concentration, as well as relative (differential) expression data. MOPED provides a new updated query interface that allows users to explore information by organism, tissue, localization, condition, experiment, or keyword. MOPED supports the Human Proteome Project’s efforts to generate chromosome and diseases specific proteomes by providing links from proteins to chromosome and disease information, as well as many complementary resources. MOPED supports a new omics metadata checklist in order to harmonize data integration, analysis and use. MOPED’s development is driven by the user community, which spans 90 countries guiding future development that will transform MOPED into a multi-omics resource. MOPED encourages users to submit data in a simple format. They can use the metadata a checklist generate a data publication for this submission. As a result, MOPED will provide even greater insights into complex biological processes and systems and enable deeper and more comprehensive biological and biomedical discoveries. PMID:24350770

  5. DiseaseConnect: a comprehensive web server for mechanism-based disease–disease connections

    PubMed Central

    Liu, Chun-Chi; Tseng, Yu-Ting; Li, Wenyuan; Wu, Chia-Yu; Mayzus, Ilya; Rzhetsky, Andrey; Sun, Fengzhu; Waterman, Michael; Chen, Jeremy J. W.; Chaudhary, Preet M.; Loscalzo, Joseph; Crandall, Edward; Zhou, Xianghong Jasmine

    2014-01-01

    The DiseaseConnect (http://disease-connect.org) is a web server for analysis and visualization of a comprehensive knowledge on mechanism-based disease connectivity. The traditional disease classification system groups diseases with similar clinical symptoms and phenotypic traits. Thus, diseases with entirely different pathologies could be grouped together, leading to a similar treatment design. Such problems could be avoided if diseases were classified based on their molecular mechanisms. Connecting diseases with similar pathological mechanisms could inspire novel strategies on the effective repositioning of existing drugs and therapies. Although there have been several studies attempting to generate disease connectivity networks, they have not yet utilized the enormous and rapidly growing public repositories of disease-related omics data and literature, two primary resources capable of providing insights into disease connections at an unprecedented level of detail. Our DiseaseConnect, the first public web server, integrates comprehensive omics and literature data, including a large amount of gene expression data, Genome-Wide Association Studies catalog, and text-mined knowledge, to discover disease–disease connectivity via common molecular mechanisms. Moreover, the clinical comorbidity data and a comprehensive compilation of known drug–disease relationships are additionally utilized for advancing the understanding of the disease landscape and for facilitating the mechanism-based development of new drug treatments. PMID:24895436

  6. CombiROC: an interactive web tool for selecting accurate marker combinations of omics data.

    PubMed

    Mazzara, Saveria; Rossi, Riccardo L; Grifantini, Renata; Donizetti, Simone; Abrignani, Sergio; Bombaci, Mauro

    2017-03-30

    Diagnostic accuracy can be improved considerably by combining multiple markers, whose performance in identifying diseased subjects is usually assessed via receiver operating characteristic (ROC) curves. The selection of multimarker signatures is a complicated process that requires integration of data signatures with sophisticated statistical methods. We developed a user-friendly tool, called CombiROC, to help researchers accurately determine optimal markers combinations from diverse omics methods. With CombiROC data from different domains, such as proteomics and transcriptomics, can be analyzed using sensitivity/specificity filters: the number of candidate marker panels rising from combinatorial analysis is easily optimized bypassing limitations imposed by the nature of different experimental approaches. Leaving to the user full control on initial selection stringency, CombiROC computes sensitivity and specificity for all markers combinations, performances of best combinations and ROC curves for automatic comparisons, all visualized in a graphic interface. CombiROC was designed without hard-coded thresholds, allowing a custom fit to each specific data: this dramatically reduces the computational burden and lowers the false negative rates given by fixed thresholds. The application was validated with published data, confirming the marker combination already originally described or even finding new ones. CombiROC is a novel tool for the scientific community freely available at http://CombiROC.eu.

  7. Sparse generalized linear model with L0 approximation for feature selection and prediction with big omics data.

    PubMed

    Liu, Zhenqiu; Sun, Fengzhu; McGovern, Dermot P

    2017-01-01

    Feature selection and prediction are the most important tasks for big data mining. The common strategies for feature selection in big data mining are L 1 , SCAD and MC+. However, none of the existing algorithms optimizes L 0 , which penalizes the number of nonzero features directly. In this paper, we develop a novel sparse generalized linear model (GLM) with L 0 approximation for feature selection and prediction with big omics data. The proposed approach approximate the L 0 optimization directly. Even though the original L 0 problem is non-convex, the problem is approximated by sequential convex optimizations with the proposed algorithm. The proposed method is easy to implement with only several lines of code. Novel adaptive ridge algorithms ( L 0 ADRIDGE) for L 0 penalized GLM with ultra high dimensional big data are developed. The proposed approach outperforms the other cutting edge regularization methods including SCAD and MC+ in simulations. When it is applied to integrated analysis of mRNA, microRNA, and methylation data from TCGA ovarian cancer, multilevel gene signatures associated with suboptimal debulking are identified simultaneously. The biological significance and potential clinical importance of those genes are further explored. The developed Software L 0 ADRIDGE in MATLAB is available at https://github.com/liuzqx/L0adridge.

  8. Omics-based approaches reveal phospholipids remodeling of Rhizopus oryzae responding to furfural stress for fumaric acid-production from xylose.

    PubMed

    Pan, Xinrong; Liu, Huanhuan; Liu, Jiao; Wang, Cheng; Wen, Jianping

    2016-12-01

    In order to relieve the toxicity of furfural on Rhizopus oryzae fermentation, the molecular mechanism of R. oryzae responding to furfural stress for fumaric acid-production was investigated by omics-based approaches. In metabolomics analysis, 29 metabolites including amino acid, sugars, polyols and fatty acids showed significant changes for maintaining the basic cell metabolism at the cost of lowering fumaric acid production. To further uncover the survival mechanism, lipidomics was carried out, revealing that phosphatidylcholine, phosphatidylglycerol, phosphatidylinositol and polyunsaturated acyl chains might be closely correlated with R. oryzae's adapting to furfural stress. Based on the above omics analysis, lecithin, inositol and soybean oil were exogenously supplemented separately with an optimized concentration in the presence of furfural, which increased fumaric acid titer from 5.78g/L to 10.03g/L, 10.05g/L and 12.13g/L (increased by 73.5%, 73.8% and 110%, respectively). These findings provide a methodological guidance for hemicellulose-fumaric acid development. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. A Systematic Approach to Time-series Metabolite Profiling and RNA-seq Analysis of Chinese Hamster Ovary Cell Culture.

    PubMed

    Hsu, Han-Hsiu; Araki, Michihiro; Mochizuki, Masao; Hori, Yoshimi; Murata, Masahiro; Kahar, Prihardi; Yoshida, Takanobu; Hasunuma, Tomohisa; Kondo, Akihiko

    2017-03-02

    Chinese hamster ovary (CHO) cells are the primary host used for biopharmaceutical protein production. The engineering of CHO cells to produce higher amounts of biopharmaceuticals has been highly dependent on empirical approaches, but recent high-throughput "omics" methods are changing the situation in a rational manner. Omics data analyses using gene expression or metabolite profiling make it possible to identify key genes and metabolites in antibody production. Systematic omics approaches using different types of time-series data are expected to further enhance understanding of cellular behaviours and molecular networks for rational design of CHO cells. This study developed a systematic method for obtaining and analysing time-dependent intracellular and extracellular metabolite profiles, RNA-seq data (enzymatic mRNA levels) and cell counts from CHO cell cultures to capture an overall view of the CHO central metabolic pathway (CMP). We then calculated correlation coefficients among all the profiles and visualised the whole CMP by heatmap analysis and metabolic pathway mapping, to classify genes and metabolites together. This approach provides an efficient platform to identify key genes and metabolites in CHO cell culture.

  10. High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software

    PubMed Central

    Fabregat-Traver, Diego; Sharapov, Sodbo Zh.; Hayward, Caroline; Rudan, Igor; Campbell, Harry; Aulchenko, Yurii; Bientinesi, Paolo

    2014-01-01

    To raise the power of genome-wide association studies (GWAS) and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the ’omics’ context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http: //www.genabel.org/packages/OmicABEL. PMID:25717363

  11. What can an ecophysiological approach tell us about the physiological responses of marine invertebrates to hypoxia?

    PubMed

    Spicer, John I

    2014-01-01

    Hypoxia (low O2) is a common and natural feature of many marine environments. However, human-induced hypoxia has been on the rise over the past half century and is now recognised as a major problem in the world's seas and oceans. Whilst we have information on how marine invertebrates respond physiologically to hypoxia in the laboratory, we still lack understanding of how they respond to such stress in the wild (now and in the future). Consequently, here the question 'what can an ecophysiological approach tell us about physiological responses of marine invertebrates to hypoxia' is addressed. How marine invertebrates work in the wild when challenged with hypoxia is explored using four case studies centred on different hypoxic environments. The recent integration of the various -omics into ecophysiology is discussed, and a number of advantages of, and challenges to, successful integration are suggested. The case studies and -omic/physiology integration data are used to inform the concluding part of the review, where it is suggested that physiological responses to hypoxia in the wild are not always the same as those predicted from laboratory experiments. This is due to behaviour in the wild modifying responses, and therefore more than one type of 'experimental' approach is essential to reliably determine the actual response. It is also suggested that assuming it is known what a measured response is 'for' can be misleading and that taking parodies of ecophysiology seriously may impede research progress. This review finishes with the suggestion that an -omics approach is, and is becoming, a powerful method of understanding the response of marine invertebrates to environmental hypoxia and may be an ideal way of studying hypoxic responses in the wild. Despite centring on physiological responses to hypoxia, the review hopefully serves as a contribution to the discussion of what (animal) ecophysiology looks like (or should look like) in the 21st century.

  12. An integrated "omics" approach to the characterization of maize (Zea mays L.) mutants deficient in the expression of two genes encoding cytosolic glutamine synthetase.

    PubMed

    Amiour, Nardjis; Imbaud, Sandrine; Clément, Gilles; Agier, Nicolas; Zivy, Michel; Valot, Benoît; Balliau, Thierry; Quilleré, Isabelle; Tercé-Laforgue, Thérèse; Dargel-Graffin, Céline; Hirel, Bertrand

    2014-11-20

    To identify the key elements controlling grain production in maize, it is essential to have an integrated view of the responses to alterations in the main steps of nitrogen assimilation by modification of gene expression. Two maize mutant lines (gln1.3 and gln1.4), deficient in two genes encoding cytosolic glutamine synthetase, a key enzyme involved in nitrogen assimilation, were previously characterized by a reduction of kernel size in the gln1.4 mutant and by a reduction of kernel number in the gln1.3 mutant. In this work, the differences in leaf gene transcripts, proteins and metabolite accumulation in gln1.3 and gln1.4 mutants were studied at two key stages of plant development, in order to identify putative candidate genes, proteins and metabolic pathways contributing on one hand to the control of plant development and on the other to grain production. The most interesting finding in this study is that a number of key plant processes were altered in the gln1.3 and gln1.4 mutants, including a number of major biological processes such as carbon metabolism and transport, cell wall metabolism, and several metabolic pathways and stress responsive and regulatory elements. We also found that the two mutants share common or specific characteristics across at least two or even three of the "omics" considered at the vegetative stage of plant development, or during the grain filling period. This is the first comprehensive molecular and physiological characterization of two cytosolic glutamine synthetase maize mutants using a combined transcriptomic, proteomic and metabolomic approach. We find that the integration of the three "omics" procedures is not straight forward, since developmental and mutant-specific levels of regulation seem to occur from gene expression to metabolite accumulation. However, their potential use is discussed with a view to improving our understanding of nitrogen assimilation and partitioning and its impact on grain production.

  13. Perspective for Aquaponic Systems: "Omic" Technologies for Microbial Community Analysis.

    PubMed

    Munguia-Fragozo, Perla; Alatorre-Jacome, Oscar; Rico-Garcia, Enrique; Torres-Pacheco, Irineo; Cruz-Hernandez, Andres; Ocampo-Velazquez, Rosalia V; Garcia-Trejo, Juan F; Guevara-Gonzalez, Ramon G

    2015-01-01

    Aquaponics is the combined production of aquaculture and hydroponics, connected by a water recirculation system. In this productive system, the microbial community is responsible for carrying out the nutrient dynamics between the components. The nutrimental transformations mainly consist in the transformation of chemical species from toxic compounds into available nutrients. In this particular field, the microbial research, the "Omic" technologies will allow a broader scope of studies about a current microbial profile inside aquaponics community, even in those species that currently are unculturable. This approach can also be useful to understand complex interactions of living components in the system. Until now, the analog studies were made to set up the microbial characterization on recirculation aquaculture systems (RAS). However, microbial community composition of aquaponics is still unknown. "Omic" technologies like metagenomic can help to reveal taxonomic diversity. The perspectives are also to begin the first attempts to sketch the functional diversity inside aquaponic systems and its ecological relationships. The knowledge of the emergent properties inside the microbial community, as well as the understanding of the biosynthesis pathways, can derive in future biotechnological applications. Thus, the aim of this review is to show potential applications of current "Omic" tools to characterize the microbial community in aquaponic systems.

  14. Methods to isolate a large amount of generative cells, sperm cells and vegetative nuclei from tomato pollen for "omics" analysis.

    PubMed

    Lu, Yunlong; Wei, Liqin; Wang, Tai

    2015-01-01

    The development of sperm cells (SCs) from microspores involves a set of finely regulated molecular and cellular events and the coordination of these events. The mechanisms underlying these events and their interconnections remain a major challenge. Systems analysis of genome-wide molecular networks and functional modules with high-throughput "omics" approaches is crucial for understanding the mechanisms; however, this study is hindered because of the difficulty in isolating a large amount of cells of different types, especially generative cells (GCs), from the pollen. Here, we optimized the conditions of tomato pollen germination and pollen tube growth to allow for long-term growth of pollen tubes in vitro with SCs generated in the tube. Using this culture system, we developed methods for isolating GCs, SCs and vegetative cell nuclei (VN) from just-germinated tomato pollen grains and growing pollen tubes and their purification by Percoll density gradient centrifugation. The purity and viability of isolated GCs and SCs were confirmed by microscopy examination and fluorescein diacetate staining, respectively, and the integrity of VN was confirmed by propidium iodide staining. We could obtain about 1.5 million GCs and 2.0 million SCs each from 180 mg initiated pollen grains, and 10 million VN from 270 mg initiated pollen grains germinated in vitro in each experiment. These methods provide the necessary preconditions for systematic biology studies of SC development and differentiation in higher plants.

  15. Identifying core gene modules in glioblastoma based on multilayer factor-mediated dysfunctional regulatory networks through integrating multi-dimensional genomic data

    PubMed Central

    Ping, Yanyan; Deng, Yulan; Wang, Li; Zhang, Hongyi; Zhang, Yong; Xu, Chaohan; Zhao, Hongying; Fan, Huihui; Yu, Fulong; Xiao, Yun; Li, Xia

    2015-01-01

    The driver genetic aberrations collectively regulate core cellular processes underlying cancer development. However, identifying the modules of driver genetic alterations and characterizing their functional mechanisms are still major challenges for cancer studies. Here, we developed an integrative multi-omics method CMDD to identify the driver modules and their affecting dysregulated genes through characterizing genetic alteration-induced dysregulated networks. Applied to glioblastoma (GBM), the CMDD identified a core gene module of 17 genes, including seven known GBM drivers, and their dysregulated genes. The module showed significant association with shorter survival of GBM. When classifying driver genes in the module into two gene sets according to their genetic alteration patterns, we found that one gene set directly participated in the glioma pathway, while the other indirectly regulated the glioma pathway, mostly, via their dysregulated genes. Both of the two gene sets were significant contributors to survival and helpful for classifying GBM subtypes, suggesting their critical roles in GBM pathogenesis. Also, by applying the CMDD to other six cancers, we identified some novel core modules associated with overall survival of patients. Together, these results demonstrate integrative multi-omics data can identify driver modules and uncover their dysregulated genes, which is useful for interpreting cancer genome. PMID:25653168

  16. MDAS: an integrated system for metabonomic data analysis.

    PubMed

    Liu, Juan; Li, Bo; Xiong, Jiang-Hui

    2009-03-01

    Metabonomics, the latest 'omics' research field, shows great promise as a tool in biomarker discovery, drug efficacy and toxicity analysis, disease diagnosis and prognosis. One of the major challenges now facing researchers is how to process this data to yield useful information about a biological system, e.g., the mechanism of diseases. Traditional methods employed in metabonomic data analysis use multivariate analysis methods developed independently in chemometrics research. Additionally, with the development of machine learning approaches, some methods such as SVMs also show promise for use in metabonomic data analysis. Aside from the application of general multivariate analysis and machine learning methods to this problem, there is also a need for an integrated tool customized for metabonomic data analysis which can be easily used by biologists to reveal interesting patterns in metabonomic data.In this paper, we present a novel software tool MDAS (Metabonomic Data Analysis System) for metabonomic data analysis which integrates traditional chemometrics methods and newly introduced machine learning approaches. MDAS contains a suite of functional models for metabonomic data analysis and optimizes the flow of data analysis. Several file formats can be accepted as input. The input data can be optionally preprocessed and can then be processed with operations such as feature analysis and dimensionality reduction. The data with reduced dimensionalities can be used for training or testing through machine learning models. The system supplies proper visualization for data preprocessing, feature analysis, and classification which can be a powerful function for users to extract knowledge from the data. MDAS is an integrated platform for metabonomic data analysis, which transforms a complex analysis procedure into a more formalized and simplified one. The software package can be obtained from the authors.

  17. Galaxy-M: a Galaxy workflow for processing and analyzing direct infusion and liquid chromatography mass spectrometry-based metabolomics data.

    PubMed

    Davidson, Robert L; Weber, Ralf J M; Liu, Haoyu; Sharma-Oates, Archana; Viant, Mark R

    2016-01-01

    Metabolomics is increasingly recognized as an invaluable tool in the biological, medical and environmental sciences yet lags behind the methodological maturity of other omics fields. To achieve its full potential, including the integration of multiple omics modalities, the accessibility, standardization and reproducibility of computational metabolomics tools must be improved significantly. Here we present our end-to-end mass spectrometry metabolomics workflow in the widely used platform, Galaxy. Named Galaxy-M, our workflow has been developed for both direct infusion mass spectrometry (DIMS) and liquid chromatography mass spectrometry (LC-MS) metabolomics. The range of tools presented spans from processing of raw data, e.g. peak picking and alignment, through data cleansing, e.g. missing value imputation, to preparation for statistical analysis, e.g. normalization and scaling, and principal components analysis (PCA) with associated statistical evaluation. We demonstrate the ease of using these Galaxy workflows via the analysis of DIMS and LC-MS datasets, and provide PCA scores and associated statistics to help other users to ensure that they can accurately repeat the processing and analysis of these two datasets. Galaxy and data are all provided pre-installed in a virtual machine (VM) that can be downloaded from the GigaDB repository. Additionally, source code, executables and installation instructions are available from GitHub. The Galaxy platform has enabled us to produce an easily accessible and reproducible computational metabolomics workflow. More tools could be added by the community to expand its functionality. We recommend that Galaxy-M workflow files are included within the supplementary information of publications, enabling metabolomics studies to achieve greater reproducibility.

  18. A machine learning heuristic to identify biologically relevant and minimal biomarker panels from omics data

    PubMed Central

    2015-01-01

    Background Investigations into novel biomarkers using omics techniques generate large amounts of data. Due to their size and numbers of attributes, these data are suitable for analysis with machine learning methods. A key component of typical machine learning pipelines for omics data is feature selection, which is used to reduce the raw high-dimensional data into a tractable number of features. Feature selection needs to balance the objective of using as few features as possible, while maintaining high predictive power. This balance is crucial when the goal of data analysis is the identification of highly accurate but small panels of biomarkers with potential clinical utility. In this paper we propose a heuristic for the selection of very small feature subsets, via an iterative feature elimination process that is guided by rule-based machine learning, called RGIFE (Rule-guided Iterative Feature Elimination). We use this heuristic to identify putative biomarkers of osteoarthritis (OA), articular cartilage degradation and synovial inflammation, using both proteomic and transcriptomic datasets. Results and discussion Our RGIFE heuristic increased the classification accuracies achieved for all datasets when no feature selection is used, and performed well in a comparison with other feature selection methods. Using this method the datasets were reduced to a smaller number of genes or proteins, including those known to be relevant to OA, cartilage degradation and joint inflammation. The results have shown the RGIFE feature reduction method to be suitable for analysing both proteomic and transcriptomics data. Methods that generate large ‘omics’ datasets are increasingly being used in the area of rheumatology. Conclusions Feature reduction methods are advantageous for the analysis of omics data in the field of rheumatology, as the applications of such techniques are likely to result in improvements in diagnosis, treatment and drug discovery. PMID:25923811

  19. GENEASE: Real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.

    PubMed

    Ghandikota, Sudhir; Hershey, Gurjit K Khurana; Mersha, Tesfaye B

    2018-03-24

    Advances in high-throughput sequencing technologies have made it possible to generate multiple omics data at an unprecedented rate and scale. The accumulation of these omics data far outpaces the rate at which biologists can mine and generate new hypothesis to test experimentally. There is an urgent need to develop a myriad of powerful tools to efficiently and effectively search and filter these resources to address specific post-GWAS functional genomics questions. However, to date, these resources are scattered across several databases and often lack a unified portal for data annotation and analytics. In addition, existing tools to analyze and visualize these databases are highly fragmented, resulting researchers to access multiple applications and manual interventions for each gene or variant in an ad hoc fashion until all the questions are answered. In this study, we present GENEASE, a web-based one-stop bioinformatics tool designed to not only query and explore multi-omics and phenotype databases (e.g., GTEx, ClinVar, dbGaP, GWAS Catalog, ENCODE, Roadmap Epigenomics, KEGG, Reactome, Gene and Phenotype Ontology) in a single web interface but also to perform seamless post genome-wide association downstream functional and overlap analysis for non-coding regulatory variants. GENEASE accesses over 50 different databases in public domain including model organism-specific databases to facilitate gene/variant and disease exploration, enrichment and overlap analysis in real time. It is a user-friendly tool with point-and-click interface containing links for support information including user manual and examples. GENEASE can be accessed freely at http://research.cchmc.org/mershalab/genease_new/login.html. Tesfaye.Mersha@cchmc.org, Sudhir.Ghandikota@cchmc.org. Supplementary data are available at Bioinformatics online.

  20. Permafrost Meta-Omics and Climate Change

    NASA Astrophysics Data System (ADS)

    Mackelprang, Rachel; Saleska, Scott R.; Jacobsen, Carsten Suhr; Jansson, Janet K.; Taş, Neslihan

    2016-06-01

    Permanently frozen soil, or permafrost, covers a large portion of the Earth's terrestrial surface and represents a unique environment for cold-adapted microorganisms. As permafrost thaws, previously protected organic matter becomes available for microbial degradation. Microbes that decompose soil carbon produce carbon dioxide and other greenhouse gases, contributing substantially to climate change. Next-generation sequencing and other -omics technologies offer opportunities to discover the mechanisms by which microbial communities regulate the loss of carbon and the emission of greenhouse gases from thawing permafrost regions. Analysis of nucleic acids and proteins taken directly from permafrost-associated soils has provided new insights into microbial communities and their functions in Arctic environments that are increasingly impacted by climate change. In this article we review current information from various molecular -omics studies on permafrost microbial ecology and explore the relevance of these insights to our current understanding of the dynamics of permafrost loss due to climate change.

  1. Omics on bioleaching: current and future impacts.

    PubMed

    Martinez, Patricio; Vera, Mario; Bobadilla-Fazzini, Roberto A

    2015-10-01

    Bioleaching corresponds to the microbial-catalyzed process of conversion of insoluble metals into soluble forms. As an applied biotechnology globally used, it represents an extremely interesting field of research where omics techniques can be applied in terms of knowledge development, but moreover in terms of process design, control, and optimization. In this mini-review, the current state of genomics, proteomics, and metabolomics of bioleaching and the major impacts of these analytical methods at industrial scale are highlighted. In summary, genomics has been essential in the determination of the biodiversity of leaching processes and for development of conceptual and functional metabolic models. Proteomic impacts are mostly related to microbe-mineral interaction analysis, including copper resistance and biofilm formation. Early steps of metabolomics in the field of bioleaching have shown a significant potential for the use of metabolites as industrial biomarkers. Development directions are given in order to enhance the future impacts of the omics in biohydrometallurgy.

  2. Growing trend of CE at the omics level: the frontier of systems biology--an update.

    PubMed

    Ban, Eunmi; Park, Soo Hyun; Kang, Min-Jung; Lee, Hyun-Jung; Song, Eun Joo; Yoo, Young Sook

    2012-01-01

    Omics is the study of proteins, peptides, genes, and metabolites in living organisms. Systems biology aims to understand the system through the study of the relationship between elements such as genes and proteins in biological system. Recently, systems biology emerged as the result of the advanced development of high-throughput analysis technologies such as DNA sequencers, DNA arrays, and mass spectrometry for omics studies. Among a number of analytical tools and technologies, CE and CE coupled to MS are promising and relatively rapidly developing tools with the potential to provide qualitative and quantitative analyses of biological molecules. With an emphasis on CE for systems biology, this review summarizes the method developments and applications of CE for the genomic, transcriptomic, proteomic, and metabolomic studies focusing on the drug discovery and disease diagnosis and therapies since 2009. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation.

    PubMed

    Komaki, Shohei; Shiwa, Yuh; Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Otomo, Ryo; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Sasaki, Makoto; Shimizu, Atsushi

    2018-01-01

    We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4 + T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.

  4. PAGER 2.0: an update to the pathway, annotated-list and gene-signature electronic repository for Human Network Biology

    PubMed Central

    Yue, Zongliang; Zheng, Qi; Neylon, Michael T; Yoo, Minjae; Shin, Jimin; Zhao, Zhiying; Tan, Aik Choon

    2018-01-01

    Abstract Integrative Gene-set, Network and Pathway Analysis (GNPA) is a powerful data analysis approach developed to help interpret high-throughput omics data. In PAGER 1.0, we demonstrated that researchers can gain unbiased and reproducible biological insights with the introduction of PAGs (Pathways, Annotated-lists and Gene-signatures) as the basic data representation elements. In PAGER 2.0, we improve the utility of integrative GNPA by significantly expanding the coverage of PAGs and PAG-to-PAG relationships in the database, defining a new metric to quantify PAG data qualities, and developing new software features to simplify online integrative GNPA. Specifically, we included 84 282 PAGs spanning 24 different data sources that cover human diseases, published gene-expression signatures, drug–gene, miRNA–gene interactions, pathways and tissue-specific gene expressions. We introduced a new normalized Cohesion Coefficient (nCoCo) score to assess the biological relevance of genes inside a PAG, and RP-score to rank genes and assign gene-specific weights inside a PAG. The companion web interface contains numerous features to help users query and navigate the database content. The database content can be freely downloaded and is compatible with third-party Gene Set Enrichment Analysis tools. We expect PAGER 2.0 to become a major resource in integrative GNPA. PAGER 2.0 is available at http://discovery.informatics.uab.edu/PAGER/. PMID:29126216

  5. imDEV: a graphical user interface to R multivariate analysis tools in Microsoft Excel.

    PubMed

    Grapov, Dmitry; Newman, John W

    2012-09-01

    Interactive modules for Data Exploration and Visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data through a user-friendly interface. Individual modules enables interactive and dynamic analyses of large data by interfacing R's multivariate statistics and highly customizable visualizations with the spreadsheet environment, aiding robust inferences and generating information-rich data visualizations. This tool provides access to multiple comparisons with false discovery correction, hierarchical clustering, principal and independent component analyses, partial least squares regression and discriminant analysis, through an intuitive interface for creating high-quality two- and a three-dimensional visualizations including scatter plot matrices, distribution plots, dendrograms, heat maps, biplots, trellis biplots and correlation networks. Freely available for download at http://sourceforge.net/projects/imdev/. Implemented in R and VBA and supported by Microsoft Excel (2003, 2007 and 2010).

  6. How next-generation sequencing and multiscale data analysis will transform infectious disease management.

    PubMed

    Pak, Theodore R; Kasarskis, Andrew

    2015-12-01

    Recent reviews have examined the extent to which routine next-generation sequencing (NGS) on clinical specimens will improve the capabilities of clinical microbiology laboratories in the short term, but do not explore integrating NGS with clinical data from electronic medical records (EMRs), immune profiling data, and other rich datasets to create multiscale predictive models. This review introduces a range of "omics" and patient data sources relevant to managing infections and proposes 3 potentially disruptive applications for these data in the clinical workflow. The combined threats of healthcare-associated infections and multidrug-resistant organisms may be addressed by multiscale analysis of NGS and EMR data that is ideally updated and refined over time within each healthcare organization. Such data and analysis should form the cornerstone of future learning health systems for infectious disease. © The Author 2015. Published by Oxford University Press on behalf of the Infectious Diseases Society of America.

  7. A comparison of graph- and kernel-based -omics data integration algorithms for classifying complex traits.

    PubMed

    Yan, Kang K; Zhao, Hongyu; Pang, Herbert

    2017-12-06

    High-throughput sequencing data are widely collected and analyzed in the study of complex diseases in quest of improving human health. Well-studied algorithms mostly deal with single data source, and cannot fully utilize the potential of these multi-omics data sources. In order to provide a holistic understanding of human health and diseases, it is necessary to integrate multiple data sources. Several algorithms have been proposed so far, however, a comprehensive comparison of data integration algorithms for classification of binary traits is currently lacking. In this paper, we focus on two common classes of integration algorithms, graph-based that depict relationships with subjects denoted by nodes and relationships denoted by edges, and kernel-based that can generate a classifier in feature space. Our paper provides a comprehensive comparison of their performance in terms of various measurements of classification accuracy and computation time. Seven different integration algorithms, including graph-based semi-supervised learning, graph sharpening integration, composite association network, Bayesian network, semi-definite programming-support vector machine (SDP-SVM), relevance vector machine (RVM) and Ada-boost relevance vector machine are compared and evaluated with hypertension and two cancer data sets in our study. In general, kernel-based algorithms create more complex models and require longer computation time, but they tend to perform better than graph-based algorithms. The performance of graph-based algorithms has the advantage of being faster computationally. The empirical results demonstrate that composite association network, relevance vector machine, and Ada-boost RVM are the better performers. We provide recommendations on how to choose an appropriate algorithm for integrating data from multiple sources.

  8. Watson for Genomics: Moving Personalized Medicine Forward.

    PubMed

    Rhrissorrakrai, Kahn; Koyama, Takahiko; Parida, Laxmi

    2016-08-01

    The confluence of genomic technologies and cognitive computing has brought us to the doorstep of widespread usage of personalized medicine. Cognitive systems, such as Watson for Genomics (WG), integrate massive amounts of new omic data with the current body of knowledge to assist physicians in analyzing and acting on patient's genomic profiles. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. COBRApy: COnstraints-Based Reconstruction and Analysis for Python.

    PubMed

    Ebrahim, Ali; Lerman, Joshua A; Palsson, Bernhard O; Hyduke, Daniel R

    2013-08-08

    COnstraint-Based Reconstruction and Analysis (COBRA) methods are widely used for genome-scale modeling of metabolic networks in both prokaryotes and eukaryotes. Due to the successes with metabolism, there is an increasing effort to apply COBRA methods to reconstruct and analyze integrated models of cellular processes. The COBRA Toolbox for MATLAB is a leading software package for genome-scale analysis of metabolism; however, it was not designed to elegantly capture the complexity inherent in integrated biological networks and lacks an integration framework for the multiomics data used in systems biology. The openCOBRA Project is a community effort to promote constraints-based research through the distribution of freely available software. Here, we describe COBRA for Python (COBRApy), a Python package that provides support for basic COBRA methods. COBRApy is designed in an object-oriented fashion that facilitates the representation of the complex biological processes of metabolism and gene expression. COBRApy does not require MATLAB to function; however, it includes an interface to the COBRA Toolbox for MATLAB to facilitate use of legacy codes. For improved performance, COBRApy includes parallel processing support for computationally intensive processes. COBRApy is an object-oriented framework designed to meet the computational challenges associated with the next generation of stoichiometric constraint-based models and high-density omics data sets. http://opencobra.sourceforge.net/

  10. A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data.

    PubMed

    Kang, Tianyu; Ding, Wei; Zhang, Luoyan; Ziemek, Daniel; Zarringhalam, Kourosh

    2017-12-19

    Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.

  11. A mutli-omic systems approach to elucidating Yersinia virulence mechanisms

    PubMed Central

    Ansong, Charles; Schrimpe-Rutledge, Alexandra C.; Mitchell, Hugh; Chauhan, Sadhana; Jones, Marcus B.; Kim, Young-Mo; McAteer, Kathleen; Deatherage Kaiser, Brooke L.; Dubois, Jennifer L.; Brewer, Heather M.; Frank, Bryan C.; McDermott, Jason E.; Metz, Thomas O.; Peterson, Scott N.; Smith, Richard D.; Motin, Vladimir L.; Adkins, Joshua N.

    2012-01-01

    The underlying mechanisms that lead to dramatic differences between closely related pathogens are not always readily apparent. For example, the genomes of Yersinia pestis (YP) the causative agent of plague with a high mortality rate and Yersinia pseudotuberculosis (YPT) an enteric pathogen with a modest mortality rate are highly similar with some species specific differences; however the molecular causes of their distinct clinical outcomes remain poorly understood. In this study, a temporal multi-omic analysis of YP and YPT at physiologically relevant temperatures was performed to gain insights into how an acute and highly lethal bacterial pathogen, YP, differs from its less virulent progenitor, YPT. This analysis revealed higher gene and protein expression levels of conserved major virulence factors in YP relative to YPT, including the Yop virulon and the pH6 antigen. This suggests that adaptation in the regulatory architecture, in addition to the presence of unique genetic material, may contribute to the increased pathogenenicity of YP relative to YPT. Additionally, global transcriptome and proteome responses of YP and YPT revealed conserved post-transcriptional control of metabolism and the translational machinery including the modulation of glutamate levels in Yersiniae. Finally, the omics data was coupled with a computational network analysis, allowing an efficient prediction of novel Yersinia virulence factors based on gene and protein expression patterns. PMID:23147219

  12. The Omics Dashboard for interactive exploration of gene-expression data.

    PubMed

    Paley, Suzanne; Parker, Karen; Spaulding, Aaron; Tomb, Jean-Francois; O'Maille, Paul; Karp, Peter D

    2017-12-01

    The Omics Dashboard is a software tool for interactive exploration and analysis of gene-expression datasets. The Omics Dashboard is organized as a hierarchy of cellular systems. At the highest level of the hierarchy the Dashboard contains graphical panels depicting systems such as biosynthesis, energy metabolism, regulation and central dogma. Each of those panels contains a series of X-Y plots depicting expression levels of subsystems of that panel, e.g. subsystems within the central dogma panel include transcription, translation and protein maturation and folding. The Dashboard presents a visual read-out of the expression status of cellular systems to facilitate a rapid top-down user survey of how all cellular systems are responding to a given stimulus, and to enable the user to quickly view the responses of genes within specific systems of interest. Although the Dashboard is complementary to traditional statistical methods for analysis of gene-expression data, we show how it can detect changes in gene expression that statistical techniques may overlook. We present the capabilities of the Dashboard using two case studies: the analysis of lipid production for the marine alga Thalassiosira pseudonana, and an investigation of a shift from anaerobic to aerobic growth for the bacterium Escherichia coli. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. The Omics Dashboard for interactive exploration of gene-expression data

    PubMed Central

    Paley, Suzanne; Parker, Karen; Spaulding, Aaron; Tomb, Jean-Francois; O’Maille, Paul

    2017-01-01

    Abstract The Omics Dashboard is a software tool for interactive exploration and analysis of gene-expression datasets. The Omics Dashboard is organized as a hierarchy of cellular systems. At the highest level of the hierarchy the Dashboard contains graphical panels depicting systems such as biosynthesis, energy metabolism, regulation and central dogma. Each of those panels contains a series of X–Y plots depicting expression levels of subsystems of that panel, e.g. subsystems within the central dogma panel include transcription, translation and protein maturation and folding. The Dashboard presents a visual read-out of the expression status of cellular systems to facilitate a rapid top-down user survey of how all cellular systems are responding to a given stimulus, and to enable the user to quickly view the responses of genes within specific systems of interest. Although the Dashboard is complementary to traditional statistical methods for analysis of gene-expression data, we show how it can detect changes in gene expression that statistical techniques may overlook. We present the capabilities of the Dashboard using two case studies: the analysis of lipid production for the marine alga Thalassiosira pseudonana, and an investigation of a shift from anaerobic to aerobic growth for the bacterium Escherichia coli. PMID:29040755

  14. Cytoplasmic male sterility (CMS) in hybrid breeding in field crops.

    PubMed

    Bohra, Abhishek; Jha, Uday C; Adhimoolam, Premkumar; Bisht, Deepak; Singh, Narendra P

    2016-05-01

    A comprehensive understanding of CMS/Rf system enabled by modern omics tools and technologies considerably improves our ability to harness hybrid technology for enhancing the productivity of field crops. Harnessing hybrid vigor or heterosis is a promising approach to tackle the current challenge of sustaining enhanced yield gains of field crops. In the context, cytoplasmic male sterility (CMS) owing to its heritable nature to manifest non-functional male gametophyte remains a cost-effective system to promote efficient hybrid seed production. The phenomenon of CMS stems from a complex interplay between maternally-inherited (mitochondrion) and bi-parental (nucleus) genomic elements. In recent years, attempts aimed to comprehend the sterility-inducing factors (orfs) and corresponding fertility determinants (Rf) in plants have greatly increased our access to candidate genomic segments and the cloned genes. To this end, novel insights obtained by applying state-of-the-art omics platforms have substantially enriched our understanding of cytoplasmic-nuclear communication. Concomitantly, molecular tools including DNA markers have been implicated in crop hybrid breeding in order to greatly expedite the progress. Here, we review the status of diverse sterility-inducing cytoplasms and associated Rf factors reported across different field crops along with exploring opportunities for integrating modern omics tools with CMS-based hybrid breeding.

  15. Inflammaging and human longevity in the omics era.

    PubMed

    Monti, Daniela; Ostan, Rita; Borelli, Vincenzo; Castellani, Gastone; Franceschi, Claudio

    2017-07-01

    Inflammaging is a recent theory of aging originally proposed in 2000 where data and conceptualizations regarding the aging of the immune system (immunosenescence) and the evolution of immune responses from invertebrates to mammals converged. This theory has received an increasing number of citations and experimental confirmations. Here we present an updated version of inflammaging focused on omics data - particularly on glycomics - collected on centenarians, semi-supercentenarians and their offspring. Accordingly, we arrived to the following conclusions: i) inflammaging has a structure where specific combinations of pro- and anti-inflammatory mediators are involved; ii) inflammaging is systemic and more complex than we previously thought, as many organs, tissues and cell types participate in producing pro- and anti-inflammatory stimuli defined "molecular garbage"; iii) inflammaging is dynamic, can be propagated locally to neighboring cells and systemically from organ to organ by circulating products and microvesicles, and amplified by chronic age-related diseases constituting a "local fire", which in turn produces additional inflammatory stimuli and molecular garbage; iv) an integrated Systems Medicine approach is urgently needed to let emerge a robust and highly informative set/combination of omics markers able to better grasp the complex molecular core of inflammaging in elderly and centenarians. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  16. A statistical framework for applying RNA profiling to chemical hazard detection.

    PubMed

    Kostich, Mitchell S

    2017-12-01

    Use of 'omics technologies in environmental science is expanding. However, application is mostly restricted to characterizing molecular steps leading from toxicant interaction with molecular receptors to apical endpoints in laboratory species. Use in environmental decision-making is limited, due to difficulty in elucidating mechanisms in sufficient detail to make quantitative outcome predictions in any single species or in extending predictions to aquatic communities. Here we introduce a mechanism-agnostic statistical approach, supplementing mechanistic investigation by allowing probabilistic outcome prediction even when understanding of molecular pathways is limited, and facilitating extrapolation from results in laboratory test species to predictions about aquatic communities. We use concepts familiar to environmental managers, supplemented with techniques employed for clinical interpretation of 'omics-based biomedical tests. We describe the framework in step-wise fashion, beginning with single test replicates of a single RNA variant, then extending to multi-gene RNA profiling, collections of test replicates, and integration of complementary data. In order to simplify the presentation, we focus on using RNA profiling for distinguishing presence versus absence of chemical hazards, but the principles discussed can be extended to other types of 'omics measurements, multi-class problems, and regression. We include a supplemental file demonstrating many of the concepts using the open source R statistical package. Published by Elsevier Ltd.

  17. ISOL@: an Italian SOLAnaceae genomics resource.

    PubMed

    Chiusano, Maria Luisa; D'Agostino, Nunzio; Traini, Alessandra; Licciardello, Concetta; Raimondo, Enrico; Aversano, Mario; Frusciante, Luigi; Monti, Luigi

    2008-03-26

    Present-day '-omics' technologies produce overwhelming amounts of data which include genome sequences, information on gene expression (transcripts and proteins) and on cell metabolic status. These data represent multiple aspects of a biological system and need to be investigated as a whole to shed light on the mechanisms which underpin the system functionality. The gathering and convergence of data generated by high-throughput technologies, the effective integration of different data-sources and the analysis of the information content based on comparative approaches are key methods for meaningful biological interpretations. In the frame of the International Solanaceae Genome Project, we propose here ISOLA, an Italian SOLAnaceae genomics resource. ISOLA (available at http://biosrv.cab.unina.it/isola) represents a trial platform and it is conceived as a multi-level computational environment.ISOLA currently consists of two main levels: the genome and the expression level. The cornerstone of the genome level is represented by the Solanum lycopersicum genome draft sequences generated by the International Tomato Genome Sequencing Consortium. Instead, the basic element of the expression level is the transcriptome information from different Solanaceae species, mainly in the form of species-specific comprehensive collections of Expressed Sequence Tags (ESTs). The cross-talk between the genome and the expression levels is based on data source sharing and on tools that enhance data quality, that extract information content from the levels' under parts and produce value-added biological knowledge. ISOLA is the result of a bioinformatics effort that addresses the challenges of the post-genomics era. It is designed to exploit '-omics' data based on effective integration to acquire biological knowledge and to approach a systems biology view. Beyond providing experimental biologists with a preliminary annotation of the tomato genome, this effort aims to produce a trial computational environment where different aspects and details are maintained as they are relevant for the analysis of the organization, the functionality and the evolution of the Solanaceae family.

  18. Framework for the quality assurance of 'omics technologies considering GLP requirements.

    PubMed

    Kauffmann, Hans-Martin; Kamp, Hennicke; Fuchs, Regine; Chorley, Brian N; Deferme, Lize; Ebbels, Timothy; Hackermüller, Jörg; Perdichizzi, Stefania; Poole, Alan; Sauer, Ursula G; Tollefsen, Knut E; Tralau, Tewes; Yauk, Carole; van Ravenzwaay, Ben

    2017-12-01

    'Omics technologies are gaining importance to support regulatory toxicity studies. Prerequisites for performing 'omics studies considering GLP principles were discussed at the European Centre for Ecotoxicology and Toxicology of Chemicals (ECETOC) Workshop Applying 'omics technologies in Chemical Risk Assessment. A GLP environment comprises a standard operating procedure system, proper pre-planning and documentation, and inspections of independent quality assurance staff. To prevent uncontrolled data changes, the raw data obtained in the respective 'omics data recording systems have to be specifically defined. Further requirements include transparent and reproducible data processing steps, and safe data storage and archiving procedures. The software for data recording and processing should be validated, and data changes should be traceable or disabled. GLP-compliant quality assurance of 'omics technologies appears feasible for many GLP requirements. However, challenges include (i) defining, storing, and archiving the raw data; (ii) transparent descriptions of data processing steps; (iii) software validation; and (iv) ensuring complete reproducibility of final results with respect to raw data. Nevertheless, 'omics studies can be supported by quality measures (e.g., GLP principles) to ensure quality control, reproducibility and traceability of experiments. This enables regulators to use 'omics data in a fit-for-purpose context, which enhances their applicability for risk assessment. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. SECIMTools: a suite of metabolomics data analysis tools.

    PubMed

    Kirpich, Alexander S; Ibarra, Miguel; Moskalenko, Oleksandr; Fear, Justin M; Gerken, Joseph; Mi, Xinlei; Ashrafi, Ali; Morse, Alison M; McIntyre, Lauren M

    2018-04-20

    Metabolomics has the promise to transform the area of personalized medicine with the rapid development of high throughput technology for untargeted analysis of metabolites. Open access, easy to use, analytic tools that are broadly accessible to the biological community need to be developed. While technology used in metabolomics varies, most metabolomics studies have a set of features identified. Galaxy is an open access platform that enables scientists at all levels to interact with big data. Galaxy promotes reproducibility by saving histories and enabling the sharing workflows among scientists. SECIMTools (SouthEast Center for Integrated Metabolomics) is a set of Python applications that are available both as standalone tools and wrapped for use in Galaxy. The suite includes a comprehensive set of quality control metrics (retention time window evaluation and various peak evaluation tools), visualization techniques (hierarchical cluster heatmap, principal component analysis, modular modularity clustering), basic statistical analysis methods (partial least squares - discriminant analysis, analysis of variance, t-test, Kruskal-Wallis non-parametric test), advanced classification methods (random forest, support vector machines), and advanced variable selection tools (least absolute shrinkage and selection operator LASSO and Elastic Net). SECIMTools leverages the Galaxy platform and enables integrated workflows for metabolomics data analysis made from building blocks designed for easy use and interpretability. Standard data formats and a set of utilities allow arbitrary linkages between tools to encourage novel workflow designs. The Galaxy framework enables future data integration for metabolomics studies with other omics data.

  20. A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification.

    PubMed

    Xie, Xin-Ping; Xie, Yu-Feng; Wang, Hong-Qiang

    2017-08-23

    Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity.

  1. Omics studies of citrus, grape and rosaceae fruit trees

    PubMed Central

    Shiratake, Katsuhiro; Suzuki, Mami

    2016-01-01

    Recent advance of bioinformatics and analytical apparatuses such as next generation DNA sequencer (NGS) and mass spectrometer (MS) has brought a big wave of comprehensive study to biology. Comprehensive study targeting all genes, transcripts (RNAs), proteins, metabolites, hormones, ions or phenotypes is called genomics, transcriptomics, proteomics, metabolomics, hormonomics, ionomics or phenomics, respectively. These omics are powerful approaches to identify key genes for important traits, to clarify events of physiological mechanisms and to reveal unknown metabolic pathways in crops. Recently, the use of omics approach has increased dramatically in fruit tree research. Although the most reported omics studies on fruit trees are transcriptomics, proteomics and metabolomics, and a few is reported on hormonomics and ionomics. In this article, we reviewed recent omics studies of major fruit trees, i.e. citrus, grapevine and rosaceae fruit trees. The effectiveness and prospects of omics in fruit tree research will as well be highlighted. PMID:27069397

  2. Omics studies of citrus, grape and rosaceae fruit trees.

    PubMed

    Shiratake, Katsuhiro; Suzuki, Mami

    2016-01-01

    Recent advance of bioinformatics and analytical apparatuses such as next generation DNA sequencer (NGS) and mass spectrometer (MS) has brought a big wave of comprehensive study to biology. Comprehensive study targeting all genes, transcripts (RNAs), proteins, metabolites, hormones, ions or phenotypes is called genomics, transcriptomics, proteomics, metabolomics, hormonomics, ionomics or phenomics, respectively. These omics are powerful approaches to identify key genes for important traits, to clarify events of physiological mechanisms and to reveal unknown metabolic pathways in crops. Recently, the use of omics approach has increased dramatically in fruit tree research. Although the most reported omics studies on fruit trees are transcriptomics, proteomics and metabolomics, and a few is reported on hormonomics and ionomics. In this article, we reviewed recent omics studies of major fruit trees, i.e. citrus, grapevine and rosaceae fruit trees. The effectiveness and prospects of omics in fruit tree research will as well be highlighted.

  3. Evaluation of diversity among common beans (Phaseolus vulgaris L.) from two centers of domestication using 'omics' technologies

    PubMed Central

    2010-01-01

    Background Genetic diversity among wild accessions and cultivars of common bean (Phaseolus vulgaris L.) has been characterized using plant morphology, seed protein allozymes, random amplified polymorphic DNA, restriction fragment length polymorphisms, DNA sequence analysis, chloroplast DNA, and microsatellite markers. Yet, little is known about whether these traits, which distinguish among genetically distinct types of common bean, can be evaluated using omics technologies. Results Three 'omics' approaches: transcriptomics, proteomics, and metabolomics were used to qualitatively evaluate the diversity of common bean from two Centers of Domestication (COD). All three approaches were able to classify common bean according to their COD using unsupervised analyses; these findings are consistent with the hypothesis that differences exist in gene transcription, protein expression, and synthesis and metabolism of small molecules among common bean cultivars representative of different COD. Metabolomic analyses of multiple cultivars within two common bean gene pools revealed cultivar differences in small molecules that were of sufficient magnitude to allow identification of unique cultivar fingerprints. Conclusions Given the high-throughput and low cost of each of these 'omics' platforms, significant opportunities exist for their use in the rapid identification of traits of agronomic and nutritional importance as well as to characterize genetic diversity. PMID:21126341

  4. Global metabolite profiling analysis of lipotoxicity in HER2/neu-positive breast cancer cells.

    PubMed

    Baumann, Jan; Kokabee, Mostafa; Wong, Jason; Balasubramaniyam, Rakshika; Sun, Yan; Conklin, Douglas S

    2018-06-05

    Recent work has shown that HER2/neu-positive breast cancer cells rely on a unique Warburg-like metabolism for survival and aggressive behavior. These cells are dependent on fatty acid (FA) synthesis, show markedly increased levels of stored fats and disruption of the synthetic process results in apoptosis. In this study, we used global metabolite profiling and a multi-omics network analysis approach to model the metabolic changes in this physiology under palmitate-supplemented growth conditions to gain insights into the molecular mechanism and its relevance to disease prevention and treatment. Computational analyses were used to define pathway enrichment based on the dataset of significantly altered metabolites and to integrate metabolomics and transcriptomics data in a multi-omics network analysis. Network-predicted changes and functional relationships were tested with cell assays in vitro . Palmitate-supplemented growth conditions induce distinct metabolic alterations. Growth of HER2-normal MCF7 cells is unaffected under these conditions whereas HER2/neu-positive cells display unchanged neutral lipid content, AMPK activation, inhibition of fatty acid synthesis and significantly altered glutamine, glucose and serine/glycine metabolism. The predominant upregulated lipid species is the novel bioactive lipid N-palmitoylglycine, which is non-toxic to these cells. Limiting the availability of glutamine significantly ameliorates the lipotoxic effects of palmitate, reduces CHOP and XBP1(s) induction and restores the expression levels of HER2 and HER3. The study shows that HER2/neu-positive breast cancer cells change their metabolic phenotype in the presence of palmitate. Palmitate induces AMPK activation and inhibition of fatty acid synthesis that feeds back into glycolysis as well as anaplerotic glutamine metabolism.

  5. Data Mining Methods for Omics and Knowledge of Crude Medicinal Plants toward Big Data Biology

    PubMed Central

    Afendi, Farit M.; Ono, Naoaki; Nakamura, Yukiko; Nakamura, Kensuke; Darusman, Latifah K.; Kibinge, Nelson; Morita, Aki Hirai; Tanaka, Ken; Horai, Hisayuki; Altaf-Ul-Amin, Md.; Kanaya, Shigehiko

    2013-01-01

    Molecular biological data has rapidly increased with the recent progress of the Omics fields, e.g., genomics, transcriptomics, proteomics and metabolomics that necessitates the development of databases and methods for efficient storage, retrieval, integration and analysis of massive data. The present study reviews the usage of KNApSAcK Family DB in metabolomics and related area, discusses several statistical methods for handling multivariate data and shows their application on Indonesian blended herbal medicines (Jamu) as a case study. Exploration using Biplot reveals many plants are rarely utilized while some plants are highly utilized toward specific efficacy. Furthermore, the ingredients of Jamu formulas are modeled using Partial Least Squares Discriminant Analysis (PLS-DA) in order to predict their efficacy. The plants used in each Jamu medicine served as the predictors, whereas the efficacy of each Jamu provided the responses. This model produces 71.6% correct classification in predicting efficacy. Permutation test then is used to determine plants that serve as main ingredients in Jamu formula by evaluating the significance of the PLS-DA coefficients. Next, in order to explain the role of plants that serve as main ingredients in Jamu medicines, information of pharmacological activity of the plants is added to the predictor block. Then N-PLS-DA model, multiway version of PLS-DA, is utilized to handle the three-dimensional array of the predictor block. The resulting N-PLS-DA model reveals that the effects of some pharmacological activities are specific for certain efficacy and the other activities are diverse toward many efficacies. Mathematical modeling introduced in the present study can be utilized in global analysis of big data targeting to reveal the underlying biology. PMID:24688691

  6. A multi-objective constraint-based approach for modeling genome-scale microbial ecosystems.

    PubMed

    Budinich, Marko; Bourdon, Jérémie; Larhlimi, Abdelhalim; Eveillard, Damien

    2017-01-01

    Interplay within microbial communities impacts ecosystems on several scales, and elucidation of the consequent effects is a difficult task in ecology. In particular, the integration of genome-scale data within quantitative models of microbial ecosystems remains elusive. This study advocates the use of constraint-based modeling to build predictive models from recent high-resolution -omics datasets. Following recent studies that have demonstrated the accuracy of constraint-based models (CBMs) for simulating single-strain metabolic networks, we sought to study microbial ecosystems as a combination of single-strain metabolic networks that exchange nutrients. This study presents two multi-objective extensions of CBMs for modeling communities: multi-objective flux balance analysis (MO-FBA) and multi-objective flux variability analysis (MO-FVA). Both methods were applied to a hot spring mat model ecosystem. As a result, multiple trade-offs between nutrients and growth rates, as well as thermodynamically favorable relative abundances at community level, were emphasized. We expect this approach to be used for integrating genomic information in microbial ecosystems. Following models will provide insights about behaviors (including diversity) that take place at the ecosystem scale.

  7. Benchmark Dose Modeling Estimates of the Concentrations of Inorganic Arsenic That Induce Changes to the Neonatal Transcriptome, Proteome, and Epigenome in a Pregnancy Cohort.

    PubMed

    Rager, Julia E; Auerbach, Scott S; Chappell, Grace A; Martin, Elizabeth; Thompson, Chad M; Fry, Rebecca C

    2017-10-16

    Prenatal inorganic arsenic (iAs) exposure influences the expression of critical genes and proteins associated with adverse outcomes in newborns, in part through epigenetic mediators. The doses at which these genomic and epigenomic changes occur have yet to be evaluated in the context of dose-response modeling. The goal of the present study was to estimate iAs doses that correspond to changes in transcriptomic, proteomic, epigenomic, and integrated multi-omic signatures in human cord blood through benchmark dose (BMD) modeling. Genome-wide DNA methylation, microRNA expression, mRNA expression, and protein expression levels in cord blood were modeled against total urinary arsenic (U-tAs) levels from pregnant women exposed to varying levels of iAs. Dose-response relationships were modeled in BMDExpress, and BMDs representing 10% response levels were estimated. Overall, DNA methylation changes were estimated to occur at lower exposure concentrations in comparison to other molecular endpoints. Multi-omic module eigengenes were derived through weighted gene co-expression network analysis, representing co-modulated signatures across transcriptomic, proteomic, and epigenomic profiles. One module eigengene was associated with decreased gestational age occurring alongside increased iAs exposure. Genes/proteins within this module eigengene showed enrichment for organismal development, including potassium voltage-gated channel subfamily Q member 1 (KCNQ1), an imprinted gene showing differential methylation and expression in response to iAs. Modeling of this prioritized multi-omic module eigengene resulted in a BMD(BMDL) of 58(45) μg/L U-tAs, which was estimated to correspond to drinking water arsenic concentrations of 51(40) μg/L. Results are in line with epidemiological evidence supporting effects of prenatal iAs occurring at levels <100 μg As/L urine. Together, findings present a variety of BMD measures to estimate doses at which prenatal iAs exposure influences neonatal outcome-relevant transcriptomic, proteomic, and epigenomic profiles.

  8. Personalized Medicine: What's in it for Rare Diseases?

    PubMed

    Schee Genannt Halfmann, Sebastian; Mählmann, Laura; Leyens, Lada; Reumann, Matthias; Brand, Angela

    2017-01-01

    Personalised Medicine has become a reality over the last years. The emergence of 'omics' and big data has started revolutionizing healthcare. New 'omics' technologies lead to a better molecular characterization of diseases and a new understanding of the complexity of diseases. The approach of PM is already successfully applied in different healthcare areas such as oncology, cardiology, nutrition and for rare diseases. However, health systems across the EU are often still promoting the 'one-size fits all' approach, even if it is known that patients do greatly vary in their molecular characteristics and response to drugs and other interventions. To make use of the full potentials of PM in the next years ahead several challenges need to be addressed such as the integration of big data, patient empowerment, translation of basic to clinical research, bringing the innovation to the market and shaping sustainable healthcare systems.

  9. Advantages and Pitfalls of Mass Spectrometry Based Metabolome Profiling in Systems Biology.

    PubMed

    Aretz, Ina; Meierhofer, David

    2016-04-27

    Mass spectrometry-based metabolome profiling became the method of choice in systems biology approaches and aims to enhance biological understanding of complex biological systems. Genomics, transcriptomics, and proteomics are well established technologies and are commonly used by many scientists. In comparison, metabolomics is an emerging field and has not reached such high-throughput, routine and coverage than other omics technologies. Nevertheless, substantial improvements were achieved during the last years. Integrated data derived from multi-omics approaches will provide a deeper understanding of entire biological systems. Metabolome profiling is mainly hampered by its diversity, variation of metabolite concentration by several orders of magnitude and biological data interpretation. Thus, multiple approaches are required to cover most of the metabolites. No software tool is capable of comprehensively translating all the data into a biologically meaningful context yet. In this review, we discuss the advantages of metabolome profiling and main obstacles limiting progress in systems biology.

  10. Advantages and Pitfalls of Mass Spectrometry Based Metabolome Profiling in Systems Biology

    PubMed Central

    Aretz, Ina; Meierhofer, David

    2016-01-01

    Mass spectrometry-based metabolome profiling became the method of choice in systems biology approaches and aims to enhance biological understanding of complex biological systems. Genomics, transcriptomics, and proteomics are well established technologies and are commonly used by many scientists. In comparison, metabolomics is an emerging field and has not reached such high-throughput, routine and coverage than other omics technologies. Nevertheless, substantial improvements were achieved during the last years. Integrated data derived from multi-omics approaches will provide a deeper understanding of entire biological systems. Metabolome profiling is mainly hampered by its diversity, variation of metabolite concentration by several orders of magnitude and biological data interpretation. Thus, multiple approaches are required to cover most of the metabolites. No software tool is capable of comprehensively translating all the data into a biologically meaningful context yet. In this review, we discuss the advantages of metabolome profiling and main obstacles limiting progress in systems biology. PMID:27128910

  11. Nutrigenomics, the Microbiome, and Gene-Environment Interactions: New Directions in Cardiovascular Disease Research, Prevention, and Treatment: A Scientific Statement From the American Heart Association.

    PubMed

    Ferguson, Jane F; Allayee, Hooman; Gerszten, Robert E; Ideraabdullah, Folami; Kris-Etherton, Penny M; Ordovás, José M; Rimm, Eric B; Wang, Thomas J; Bennett, Brian J

    2016-06-01

    Cardiometabolic diseases are the leading cause of death worldwide and are strongly linked to both genetic and nutritional factors. The field of nutrigenomics encompasses multiple approaches aimed at understanding the effects of diet on health or disease development, including nutrigenetic studies investigating the relationship between genetic variants and diet in modulating cardiometabolic risk, as well as the effects of dietary components on multiple "omic" measures, including transcriptomics, metabolomics, proteomics, lipidomics, epigenetic modifications, and the microbiome. Here, we describe the current state of the field of nutrigenomics with respect to cardiometabolic disease research and outline a direction for the integration of multiple omics techniques in future nutrigenomic studies aimed at understanding mechanisms and developing new therapeutic options for cardiometabolic disease treatment and prevention. © 2016 American Heart Association, Inc.

  12. Next-Generation Pathology.

    PubMed

    Caie, Peter D; Harrison, David J

    2016-01-01

    The field of pathology is rapidly transforming from a semiquantitative and empirical science toward a big data discipline. Large data sets from across multiple omics fields may now be extracted from a patient's tissue sample. Tissue is, however, complex, heterogeneous, and prone to artifact. A reductionist view of tissue and disease progression, which does not take this complexity into account, may lead to single biomarkers failing in clinical trials. The integration of standardized multi-omics big data and the retention of valuable information on spatial heterogeneity are imperative to model complex disease mechanisms. Mathematical modeling through systems pathology approaches is the ideal medium to distill the significant information from these large, multi-parametric, and hierarchical data sets. Systems pathology may also predict the dynamical response of disease progression or response to therapy regimens from a static tissue sample. Next-generation pathology will incorporate big data with systems medicine in order to personalize clinical practice for both prognostic and predictive patient care.

  13. Predicting clinical outcome of neuroblastoma patients using an integrative network-based approach.

    PubMed

    Tranchevent, Léon-Charles; Nazarov, Petr V; Kaoma, Tony; Schmartz, Georges P; Muller, Arnaud; Kim, Sang-Yoon; Rajapakse, Jagath C; Azuaje, Francisco

    2018-06-07

    One of the main current challenges in computational biology is to make sense of the huge amounts of multidimensional experimental data that are being produced. For instance, large cohorts of patients are often screened using different high-throughput technologies, effectively producing multiple patient-specific molecular profiles for hundreds or thousands of patients. We propose and implement a network-based method that integrates such patient omics data into Patient Similarity Networks. Topological features derived from these networks were then used to predict relevant clinical features. As part of the 2017 CAMDA challenge, we have successfully applied this strategy to a neuroblastoma dataset, consisting of genomic and transcriptomic data. In particular, we observe that models built on our network-based approach perform at least as well as state of the art models. We furthermore explore the effectiveness of various topological features and observe, for instance, that redundant centrality metrics can be combined to build more powerful models. We demonstrate that the networks inferred from omics data contain clinically relevant information and that patient clinical outcomes can be predicted using only network topological data. This article was reviewed by Yang-Yu Liu, Tomislav Smuc and Isabel Nepomuceno.

  14. Integrated -Omics: A Powerful Approach to Understanding the Heterogeneous Lignification of Fibre Crops

    PubMed Central

    Gea, Guerriero; Kjell, Sergeant; Jean-François, Hausman

    2013-01-01

    Lignin and cellulose represent the two main components of plant secondary walls and the most abundant polymers on Earth. Quantitatively one of the principal products of the phenylpropanoid pathway, lignin confers high mechanical strength and hydrophobicity to plant walls, thus enabling erect growth and high-pressure water transport in the vessels. Lignin is characterized by a high natural heterogeneity in its composition and abundance in plant secondary cell walls, even in the different tissues of the same plant. A typical example is the stem of fibre crops, which shows a lignified core enveloped by a cellulosic, lignin-poor cortex. Despite the great value of fibre crops for humanity, however, still little is known on the mechanisms controlling their cell wall biogenesis, and particularly, what regulates their spatially-defined lignification pattern. Given the chemical complexity and the heterogeneous composition of fibre crops’ secondary walls, only the use of multidisciplinary approaches can convey an integrated picture and provide exhaustive information covering different levels of biological complexity. The present review highlights the importance of combining high throughput -omics approaches to get a complete understanding of the factors regulating the lignification heterogeneity typical of fibre crops. PMID:23708098

  15. Omics methods for probing the mode of action of natural and synthetic phytotoxins.

    PubMed

    Duke, Stephen O; Bajsa, Joanna; Pan, Zhiqiang

    2013-02-01

    For a little over a decade, omics methods (transcriptomics, proteomics, metabolomics, and physionomics) have been used to discover and probe the mode of action of both synthetic and natural phytotoxins. For mode of action discovery, the strategy for each of these approaches is to generate an omics profile for phytotoxins with known molecular targets and to compare this library of responses to the responses of compounds with unknown modes of action. Using more than one omics approach enhances the probability of success. Generally, compounds with the same mode of action generate similar responses with a particular omics method. Stress and detoxification responses to phytotoxins can be much clearer than effects directly related to the target site. Clues to new modes of action must be validated with in vitro enzyme effects or genetic approaches. Thus far, the only new phytotoxin target site discovered with omics approaches (metabolomics and physionomics) is that of cinmethylin and structurally related 5-benzyloxymethyl-1,2-isoxazolines. These omics approaches pointed to tyrosine amino-transferase as the target, which was verified by enzyme assays and genetic methods. In addition to being a useful tool of mode of action discovery, omics methods provide detailed information on genetic and biochemical impacts of phytotoxins. Such information can be useful in understanding the full impact of natural phytotoxins in both agricultural and natural ecosystems.

  16. Modifying Yeast Tolerance to Inhibitory Conditions of Ethanol Production Processes

    PubMed Central

    Caspeta, Luis; Castillo, Tania; Nielsen, Jens

    2015-01-01

    Saccharomyces cerevisiae strains having a broad range of substrate utilization, rapid substrate consumption, and conversion to ethanol, as well as good tolerance to inhibitory conditions are ideal for cost-competitive ethanol production from lignocellulose. A major drawback to directly design S. cerevisiae tolerance to inhibitory conditions of lignocellulosic ethanol production processes is the lack of knowledge about basic aspects of its cellular signaling network in response to stress. Here, we highlight the inhibitory conditions found in ethanol production processes, the targeted cellular functions, the key contributions of integrated -omics analysis to reveal cellular stress responses according to these inhibitors, and current status on design-based engineering of tolerant and efficient S. cerevisiae strains for ethanol production from lignocellulose. PMID:26618154

  17. Pathway Analysis and Omics Data Visualization Using Pathway Genome Databases: FragariaCyc, a Case Study.

    PubMed

    Naithani, Sushma; Jaiswal, Pankaj

    2017-01-01

    The species-specific plant Pathway Genome Databases (PGDBs) based on the BioCyc platform provide a conceptual model of the cellular metabolic network of an organism. Such frameworks allow analysis of the genome-scale expression data to understand changes in the overall metabolisms of an organism (or organs, tissues, and cells) in response to various extrinsic (e.g. developmental and differentiation) and/or extrinsic signals (e.g. pathogens and abiotic stresses) from the surrounding environment. Using FragariaCyc, a pathway database for the diploid strawberry Fragaria vesca, we show (1) the basic navigation across a PGDB; (2) a case study of pathway comparison across plant species; and (3) an example of RNA-Seq data analysis using Omics Viewer tool. The protocols described here generally apply to other Pathway Tools-based PGDBs.

  18. aGEM: an integrative system for analyzing spatial-temporal gene-expression information

    PubMed Central

    Jiménez-Lozano, Natalia; Segura, Joan; Macías, José Ramón; Vega, Juanjo; Carazo, José María

    2009-01-01

    Motivation: The work presented here describes the ‘anatomical Gene-Expression Mapping (aGEM)’ Platform, a development conceived to integrate phenotypic information with the spatial and temporal distributions of genes expressed in the mouse. The aGEM Platform has been built by extending the Distributed Annotation System (DAS) protocol, which was originally designed to share genome annotations over the WWW. DAS is a client-server system in which a single client integrates information from multiple distributed servers. Results: The aGEM Platform provides information to answer three main questions. (i) Which genes are expressed in a given mouse anatomical component? (ii) In which mouse anatomical structures are a given gene or set of genes expressed? And (iii) is there any correlation among these findings? Currently, this Platform includes several well-known mouse resources (EMAGE, GXD and GENSAT), hosting gene-expression data mostly obtained from in situ techniques together with a broad set of image-derived annotations. Availability: The Platform is optimized for Firefox 3.0 and it is accessed through a friendly and intuitive display: http://agem.cnb.csic.es Contact: natalia@cnb.csic.es Supplementary information: Supplementary data are available at http://bioweb.cnb.csic.es/VisualOmics/aGEM/home.html and http://bioweb.cnb.csic.es/VisualOmics/index_VO.html and Bioinformatics online. PMID:19592395

  19. Environmental OMICS: Current Status and Future Directions.

    EPA Science Inventory

    Objectives: Applications of OMICS to high throughput studies of changes of genes, RNAs, proteins and metabolites, and their associated functions in cells or organisms exposed to environmental chemicals has led to the emergence of a very active research field: environmental OMICS....

  20. Epigenetics Research on the International Space Station

    NASA Technical Reports Server (NTRS)

    Love, John; Cooley, Vic

    2016-01-01

    The International Space Station (ISS) is a state-of-the orbiting laboratory focused on advancing science and technology research. Experiments being conducted on the ISS include investigations in the emerging field of Epigenetics. Epigenetics refers to stably heritable changes in gene expression or cellular phenotype (the transcriptional potential of a cell) resulting from changes in a chromosome without alterations to the underlying DNA nucleotide sequence (the genetic code), which are caused by external or environmental factors, such as spaceflight microgravity. Molecular mechanisms associated with epigenetic alterations regulating gene expression patterns include covalent chemical modifications of DNA (e.g., methylation) or histone proteins (e.g., acetylation, phorphorylation, or ubiquitination). For example, Epigenetics ("Epigenetics in Spaceflown C. elegans") is a recent JAXA investigation examining whether adaptations to microgravity transmit from one cell generation to another without changing the basic DNA of the organism. Mouse Epigenetics ("Transcriptome Analysis and Germ-Cell Development Analysis of Mice in Space") investigates molecular alterations in organ-specific gene expression patterns and epigenetic modifications, and analyzes murine germ cell development during long term spaceflight, as well as assessing changes in offspring DNA. NASA's first foray into human Omics research, the Twins Study ("Differential effects of homozygous twin astronauts associated with differences in exposure to spaceflight factors"), includes investigations evaluating differential epigenetic effects via comprehensive whole genome analysis, the landscape of DNA and RNA methylation, and biomolecular changes by means of longitudinal integrated multi-omics research. And the inaugural Genes in Space student challenge experiment (Genes in Space-1) is aimed at understanding how epigenetics plays a role in immune system dysregulation by assaying DNA methylation in immune cells directly in space using miniPCR technology. In addition, NASA's geneLAB campaign covers the epigenome as part of the "expressome", by employing an innovative open source science platform for multi-investigator high throughput utilization of the ISS. Earth benefits of Epigenetics research onboard the ISS range from contributions to the fundamental understanding of epigenetic phenomena with applications in countermeasure development for biomedical conditions, to the generation of integrated strategies for personalized medicine based on unique physiological responses.

  1. Strategies for the profiling, characterisation and detailed structural analysis of N-linked oligosaccharides.

    PubMed

    Tharmalingam, Tharmala; Adamczyk, Barbara; Doherty, Margaret A; Royle, Louise; Rudd, Pauline M

    2013-02-01

    Many post-translational modifications, including glycosylation, are pivotal for the structural integrity, location and functional activity of glycoproteins. Sub-populations of proteins that are relocated or functionally changed by such modifications can change resting proteins into active ones, mediating specific effector functions, as in the case of monoclonal antibodies. To ensure safe and efficacious drugs it is essential to employ appropriate robust, quantitative analytical strategies that can (i) perform detailed glycan structural analysis, (ii) characterise specific subsets of glycans to assess known critical features of therapeutic activities (iii) rapidly profile glycan pools for at-line monitoring or high level batch to batch screening. Here we focus on these aspects of glycan analysis, showing how state-of-the-art technologies are required at all stages during the production of recombinant glycotherapeutics. These data can provide insights into processing pathways and suggest markers for intervention at critical control points in bioprocessing and also critical decision points in disease and drug monitoring in patients. Importantly, these tools are now enabling the first glycome/genome studies in large populations, allowing the integration of glycomics into other 'omics platforms in a systems biology context.

  2. Computational approaches to standard-compliant biofilm data for reliable analysis and integration.

    PubMed

    Sousa, Ana Margarida; Ferreira, Andreia; Azevedo, Nuno F; Pereira, Maria Olivia; Lourenço, Anália

    2012-12-01

    The study of microorganism consortia, also known as biofilms, is associated to a number of applications in biotechnology, ecotechnology and clinical domains. Nowadays, biofilm studies are heterogeneous and data-intensive, encompassing different levels of analysis. Computational modelling of biofilm studies has become thus a requirement to make sense of these vast and ever-expanding biofilm data volumes. The rationale of the present work is a machine-readable format for representing biofilm studies and supporting biofilm data interchange and data integration. This format is supported by the Biofilm Science Ontology (BSO), the first ontology on biofilms information. The ontology is decomposed into a number of areas of interest, namely: the Experimental Procedure Ontology (EPO) which describes biofilm experimental procedures; the Colony Morphology Ontology (CMO) which characterises morphologically microorganism colonies; and other modules concerning biofilm phenotype, antimicrobial susceptibility and virulence traits. The overall objective behind BSO is to develop semantic resources to capture, represent and share data on biofilms and related experiments in a regularized fashion manner. Furthermore, the present work also introduces a framework in assistance of biofilm data interchange and analysis - BiofOmics (http://biofomics.org) - and a public repository on colony morphology signatures - MorphoCol (http://stardust.deb.uminho.pt/morphocol).

  3. Computational approaches to standard-compliant biofilm data for reliable analysis and integration.

    PubMed

    Sousa, Ana Margarida; Ferreira, Andreia; Azevedo, Nuno F; Pereira, Maria Olivia; Lourenço, Anália

    2012-07-24

    The study of microorganism consortia, also known as biofilms, is associated to a number of applications in biotechnology, ecotechnology and clinical domains. Nowadays, biofilm studies are heterogeneous and data-intensive, encompassing different levels of analysis. Computational modelling of biofilm studies has become thus a requirement to make sense of these vast and ever-expanding biofilm data volumes. The rationale of the present work is a machine-readable format for representing biofilm studies and supporting biofilm data interchange and data integration. This format is supported by the Biofilm Science Ontology (BSO), the first ontology on biofilms information. The ontology is decomposed into a number of areas of interest, namely: the Experimental Procedure Ontology (EPO) which describes biofilm experimental procedures; the Colony Morphology Ontology (CMO) which characterises morphologically microorganism colonies; and other modules concerning biofilm phenotype, antimicrobial susceptibility and virulence traits. The overall objective behind BSO is to develop semantic resources to capture, represent and share data on biofilms and related experiments in a regularized fashion manner. Furthermore, the present work also introduces a framework in assistance of biofilm data interchange and analysis - BiofOmics (http://biofomics.org) - and a public repository on colony morphology signatures - MorphoCol (http://stardust.deb.uminho.pt/morphocol).

  4. Newt-omics: a comprehensive repository for omics data from the newt Notophthalmus viridescens

    PubMed Central

    Bruckskotten, Marc; Looso, Mario; Reinhardt, Richard; Braun, Thomas; Borchardt, Thilo

    2012-01-01

    Notophthalmus viridescens, a member of the salamander family is an excellent model organism to study regenerative processes due to its unique ability to replace lost appendages and to repair internal organs. Molecular insights into regenerative events have been severely hampered by the lack of genomic, transcriptomic and proteomic data, as well as an appropriate database to store such novel information. Here, we describe ‘Newt-omics’ (http://newt-omics.mpi-bn.mpg.de), a database, which enables researchers to locate, retrieve and store data sets dedicated to the molecular characterization of newts. Newt-omics is a transcript-centred database, based on an Expressed Sequence Tag (EST) data set from the newt, covering ∼50 000 Sanger sequenced transcripts and a set of high-density microarray data, generated from regenerating hearts. Newt-omics also contains a large set of peptides identified by mass spectrometry, which was used to validate 13 810 ESTs as true protein coding. Newt-omics is open to implement additional high-throughput data sets without changing the database structure. Via a user-friendly interface Newt-omics allows access to a huge set of molecular data without the need for prior bioinformatical expertise. PMID:22039101

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schaumberg, Andrew

    The Omics Tools package provides several small trivial tools for work in genomics. This single portable package, the “omics.jar” file, is a toolbox that works in any Java-based environment, including PCs, Macs, and supercomputers. The number of tools is expected to grow. One tool (called cmsearch.hadoop or cmsearch.local), calls the external cmsearch program to predict non-coding RNA in a genome. The cmsearch program is part of the third-party Infernal package. Omics Tools does not contain Infernal. Infernal may be installed separately. The cmsearch.hadoop subtool requires Apache Hadoop and runs on a supercomputer, though cmsearch.local does not and runs on amore » server. Omics Tools does not contain Hadoop. Hadoop mat be installed separartely The other tools (cmgbk, cmgff, fastats, pal, randgrp, randgrpr, randsub) do not interface with third-party tools. Omics Tools is written in Java and Scala programming languages. Invoking the “help” command shows currently available tools, as shown below: schaumbe@gpint06:~/proj/omics$ java -jar omics.jar help Known commands are: cmgbk : compare cmsearch and GenBank Infernal hits cmgff : compare hits among two GFF (version 3) files cmsearch.hadoop : find Infernal hits in a genome, on your supercomputer cmsearch.local : find Infernal hits in a genome, on your workstation fastats : FASTA stats, e.g. # bases, GC content pal : stem-loop motif detection by palindromic sequence search (code stub) randgrp : random subsample without replacement, of groups randgrpr : random subsample with replacement, of groups (fast) randsub : random subsample without replacement, of file lines For more help regarding a particular command, use: java -jar omics.jar command help Usage: java -jar omics.jar command args« less

  6. An R package for the integrated analysis of metabolomics and spectral data.

    PubMed

    Costa, Christopher; Maraschin, Marcelo; Rocha, Miguel

    2016-06-01

    Recently, there has been a growing interest in the field of metabolomics, materialized by a remarkable growth in experimental techniques, available data and related biological applications. Indeed, techniques as nuclear magnetic resonance, gas or liquid chromatography, mass spectrometry, infrared and UV-visible spectroscopies have provided extensive datasets that can help in tasks as biological and biomedical discovery, biotechnology and drug development. However, as it happens with other omics data, the analysis of metabolomics datasets provides multiple challenges, both in terms of methodologies and in the development of appropriate computational tools. Indeed, from the available software tools, none addresses the multiplicity of existing techniques and data analysis tasks. In this work, we make available a novel R package, named specmine, which provides a set of methods for metabolomics data analysis, including data loading in different formats, pre-processing, metabolite identification, univariate and multivariate data analysis, machine learning, and feature selection. Importantly, the implemented methods provide adequate support for the analysis of data from diverse experimental techniques, integrating a large set of functions from several R packages in a powerful, yet simple to use environment. The package, already available in CRAN, is accompanied by a web site where users can deposit datasets, scripts and analysis reports to be shared with the community, promoting the efficient sharing of metabolomics data analysis pipelines. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  7. Multi-omics Frontiers in Algal Research: Techniques and Progress to Explore Biofuels in the Postgenomics World.

    PubMed

    Rai, Vineeta; Karthikaichamy, Anbarasu; Das, Debasish; Noronha, Santosh; Wangikar, Pramod P; Srivastava, Sanjeeva

    2016-07-01

    Current momentum of microalgal research rests extensively in tapping the potential of multi-omics methodologies in regard to sustainable biofuels. Microalgal biomass is fermented to bioethanol; while lipids, particularly triacylglycerides (TAGs), are transesterified to biodiesels. Biodiesel has emerged as an ideal biofuel candidate; hence, its commercialization and use are increasingly being emphasized. Abiotic stresses exaggerate TAG accumulation, but the precise mechanisms are yet to be known. More recently, comprehensive multi-omics studies in microalgae have emerged from the biofuel perspective. Genomics and transcriptomics of microalgae have provided crucial leads and basic understanding toward lipid biosynthesis. Proteomics and metabolomics are now complementing "algal omics" and offer precise functional insights into the attendant static and dynamic physiological contexts. Indeed, the field has progressed from shotgun to targeted approaches. Notably, targeted proteomics studies in microalga are not yet reported. Several multi-omics tools and technologies that may be used to dig deeper into the microalgal physiology are examined and highlighted in this review. The article therefore aims to both introduce various available high-throughput biotechnologies and applications of "omics" in microalgae, and enlists a compendium of the emerging cutting edge literature. We suggest that a strategic and thoughtful combination of data streams from different omics platforms can provide a system-wide overview. The algal omics warrants closer attention in the future, with a view to technical, economic, and societal impacts that are anticipated in the current postgenomics era.

  8. Pharmacometabolomics Informs Quantitative Radiomics for Glioblastoma Diagnostic Innovation.

    PubMed

    Katsila, Theodora; Matsoukas, Minos-Timotheos; Patrinos, George P; Kardamakis, Dimitrios

    2017-08-01

    Applications of omics systems biology technologies have enormous promise for radiology and diagnostics in surgical fields. In this context, the emerging fields of radiomics (a systems scale approach to radiology using a host of technologies, including omics) and pharmacometabolomics (use of metabolomics for patient and disease stratification and guiding precision medicine) offer much synergy for diagnostic innovation in surgery, particularly in neurosurgery. This synthesis of omics fields and applications is timely because diagnostic accuracy in central nervous system tumors still challenges decision-making. Considering the vast heterogeneity in brain tumors, disease phenotypes, and interindividual variability in surgical and chemotherapy outcomes, we believe that diagnostic accuracy can be markedly improved by quantitative radiomics coupled to pharmacometabolomics and related health information technologies while optimizing economic costs of traditional diagnostics. In this expert review, we present an innovation analysis on a systems-level multi-omics approach toward diagnostic accuracy in central nervous system tumors. For this, we suggest that glioblastomas serve as a useful application paradigm. We performed a literature search on PubMed for articles published in English between 2006 and 2016. We used the search terms "radiomics," "glioblastoma," "biomarkers," "pharmacogenomics," "pharmacometabolomics," "pharmacometabonomics/pharmacometabolomics," "collaborative informatics," and "precision medicine." A list of the top 4 insights we derived from this literature analysis is presented in this study. For example, we found that (i) tumor grading needs to be better refined, (ii) diagnostic precision should be improved, (iii) standardization in radiomics is lacking, and (iv) quantitative radiomics needs to prove clinical implementation. We conclude with an interdisciplinary call to the metabolomics, pharmacy/pharmacology, radiology, and surgery communities that pharmacometabolomics coupled to information technologies (chemoinformatics tools, databases, collaborative systems) can inform quantitative radiomics, thus translating Big Data and information growth to knowledge growth, rational drug development and diagnostics innovation for glioblastomas, and possibly in other brain tumors.

  9. geneLAB: Expanding the Impact of NASA's Biological Research in Space

    NASA Technical Reports Server (NTRS)

    Rayl, Nicole; Smith, Jeffrey D.

    2014-01-01

    The geneLAB project is designed to leverage the value of large 'omics' datasets from molecular biology projects conducted on the ISS by making these datasets available, citable, discoverable, interpretable, reusable, and reproducible. geneLAB will create a collaboration space with an integrated set of tools for depositing, accessing, analyzing, and modeling these diverse datasets from spaceflight and related terrestrial studies.

  10. methylPipe and compEpiTools: a suite of R packages for the integrative analysis of epigenomics data.

    PubMed

    Kishore, Kamal; de Pretis, Stefano; Lister, Ryan; Morelli, Marco J; Bianchi, Valerio; Amati, Bruno; Ecker, Joseph R; Pelizzola, Mattia

    2015-09-29

    Numerous methods are available to profile several epigenetic marks, providing data with different genome coverage and resolution. Large epigenomic datasets are then generated, and often combined with other high-throughput data, including RNA-seq, ChIP-seq for transcription factors (TFs) binding and DNase-seq experiments. Despite the numerous computational tools covering specific steps in the analysis of large-scale epigenomics data, comprehensive software solutions for their integrative analysis are still missing. Multiple tools must be identified and combined to jointly analyze histone marks, TFs binding and other -omics data together with DNA methylation data, complicating the analysis of these data and their integration with publicly available datasets. To overcome the burden of integrating various data types with multiple tools, we developed two companion R/Bioconductor packages. The former, methylPipe, is tailored to the analysis of high- or low-resolution DNA methylomes in several species, accommodating (hydroxy-)methyl-cytosines in both CpG and non-CpG sequence context. The analysis of multiple whole-genome bisulfite sequencing experiments is supported, while maintaining the ability of integrating targeted genomic data. The latter, compEpiTools, seamlessly incorporates the results obtained with methylPipe and supports their integration with other epigenomics data. It provides a number of methods to score these data in regions of interest, leading to the identification of enhancers, lncRNAs, and RNAPII stalling/elongation dynamics. Moreover, it allows a fast and comprehensive annotation of the resulting genomic regions, and the association of the corresponding genes with non-redundant GeneOntology terms. Finally, the package includes a flexible method based on heatmaps for the integration of various data types, combining annotation tracks with continuous or categorical data tracks. methylPipe and compEpiTools provide a comprehensive Bioconductor-compliant solution for the integrative analysis of heterogeneous epigenomics data. These packages are instrumental in providing biologists with minimal R skills a complete toolkit facilitating the analysis of their own data, or in accelerating the analyses performed by more experienced bioinformaticians.

  11. Integrated Omic Analysis of a Guinea Pig Model of Heart Failure and Sudden Cardiac Death.

    PubMed

    Foster, D Brian; Liu, Ting; Kammers, Kai; O'Meally, Robert; Yang, Ni; Papanicolaou, Kyriakos N; Talbot, C Conover; Cole, Robert N; O'Rourke, Brian

    2016-09-02

    Here, we examine key regulatory pathways underlying the transition from compensated hypertrophy (HYP) to decompensated heart failure (HF) and sudden cardiac death (SCD) in a guinea pig pressure-overload model by integrated multiome analysis. Relative protein abundances from sham-operated HYP and HF hearts were assessed by iTRAQ LC-MS/MS. Metabolites were quantified by LC-MS/MS or GC-MS. Transcriptome profiles were obtained using mRNA microarrays. The guinea pig HF proteome exhibited classic biosignatures of cardiac HYP, left ventricular dysfunction, fibrosis, inflammation, and extravasation. Fatty acid metabolism, mitochondrial transcription/translation factors, antioxidant enzymes, and other mitochondrial procsses, were downregulated in HF but not HYP. Proteins upregulated in HF implicate extracellular matrix remodeling, cytoskeletal remodeling, and acute phase inflammation markers. Among metabolites, acylcarnitines were downregulated in HYP and fatty acids accumulated in HF. The correlation of transcript and protein changes in HF was weak (R(2) = 0.23), suggesting post-transcriptional gene regulation in HF. Proteome/metabolome integration indicated metabolic bottlenecks in fatty acyl-CoA processing by carnitine palmitoyl transferase (CPT1B) as well as TCA cycle inhibition. On the basis of these findings, we present a model of cardiac decompensation involving impaired nuclear integration of Ca(2+) and cyclic nucleotide signals that are coupled to mitochondrial metabolic and antioxidant defects through the CREB/PGC1α transcriptional axis.

  12. Omics methods for probing the mode of action of natural phytotoxins

    USDA-ARS?s Scientific Manuscript database

    For a little over a decade, omics methods (transcriptomics, proteomics, metabolomics, and physionomics) have been used to discover and probe the mode of action of both synthetic and natural phytotoxins. For mode of action discovery, the strategy for each of these approaches is to generate an omics...

  13. OMICS DATA IN THE QUALITATIVE AND QUANTITATIVE CHARACTERIZATION OF THE MODE OF ACTION IN SUPPORT OF IRIS ASSESSMENTS

    EPA Science Inventory

    Knowledge and information generated using new tools/methods collectively called "Omics" technologies could have a profound effect on qualitative and quantitative characterizations of human health risk assessments.

    The suffix "Omics" is a descriptor used for a series of e...

  14. Multi-platform ’Omics Analysis of Human Ebola Virus Disease Pathogenesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eisfeld, Amie J.; Halfmann, Peter J.; Wendler, Jason P.

    The pathogenesis of human Ebola virus disease (EVD) is complex. EVD is characterized by high levels of virus replication and dissemination, dysregulated immune responses, extensive virus- and host-mediated tissue damage, and disordered coagulation. To clarify how host responses contribute to EVD pathophysiology, we performed multi-platform ’omics analysis of peripheral blood mononuclear cells and plasma from EVD patients. Our results indicate that EVD molecular signatures overlap with those of sepsis, imply that pancreatic enzymes contribute to tissue damage in fatal EVD, and suggest that Ebola virus infection may induce aberrant neutrophils whose activity could explain hallmarks of fatal EVD. Moreover, integratedmore » biomarker prediction identified putative biomarkers from different data platforms that differentiated survivors and fatalities early after infection. This work reveals insight into EVD pathogenesis, suggests an effective approach for biomarker identification, and provides an important community resource for further analysis of human EVD severity.« less

  15. Metabolome Integrated Analysis of High-Temperature Response in Pinus radiata.

    PubMed

    Escandón, Mónica; Meijón, Mónica; Valledor, Luis; Pascual, Jesús; Pinto, Gloria; Cañal, María Jesús

    2018-01-01

    The integrative omics approach is crucial to identify the molecular mechanisms underlying high-temperature response in non-model species. Based on future scenarios of heat increase, Pinus radiata plants were exposed to a temperature of 40°C for a period of 5 days, including recovered plants (30 days after last exposure to 40°C) in the analysis. The analysis of the metabolome using complementary mass spectrometry techniques (GC-MS and LC-Orbitrap-MS) allowed the reliable quantification of 2,287 metabolites. The analysis of identified metabolites and highlighter metabolic pathways across heat time exposure reveal the dynamism of the metabolome in relation to high-temperature response in P. radiata , identifying the existence of a turning point (on day 3) at which P. radiata plants changed from an initial stress response program (shorter-term response) to an acclimation one (longer-term response). Furthermore, the integration of metabolome and physiological measurements, which cover from the photosynthetic state to hormonal profile, suggests a complex metabolic pathway interaction network related to heat-stress response. Cytokinins (CKs), fatty acid metabolism and flavonoid and terpenoid biosynthesis were revealed as the most important pathways involved in heat-stress response in P. radiata , with zeatin riboside (ZR) and isopentenyl adenosine (iPA) as the key hormones coordinating these multiple and complex interactions. On the other hand, the integrative approach allowed elucidation of crucial metabolic mechanisms involved in heat response in P. radiata , as well as the identification of thermotolerance metabolic biomarkers (L-phenylalanine, hexadecanoic acid, and dihydromyricetin), crucial metabolites which can reschedule the metabolic strategy to adapt to high temperature.

  16. Metabolome Integrated Analysis of High-Temperature Response in Pinus radiata

    PubMed Central

    Escandón, Mónica; Meijón, Mónica; Valledor, Luis; Pascual, Jesús; Pinto, Gloria; Cañal, María Jesús

    2018-01-01

    The integrative omics approach is crucial to identify the molecular mechanisms underlying high-temperature response in non-model species. Based on future scenarios of heat increase, Pinus radiata plants were exposed to a temperature of 40°C for a period of 5 days, including recovered plants (30 days after last exposure to 40°C) in the analysis. The analysis of the metabolome using complementary mass spectrometry techniques (GC-MS and LC-Orbitrap-MS) allowed the reliable quantification of 2,287 metabolites. The analysis of identified metabolites and highlighter metabolic pathways across heat time exposure reveal the dynamism of the metabolome in relation to high-temperature response in P. radiata, identifying the existence of a turning point (on day 3) at which P. radiata plants changed from an initial stress response program (shorter-term response) to an acclimation one (longer-term response). Furthermore, the integration of metabolome and physiological measurements, which cover from the photosynthetic state to hormonal profile, suggests a complex metabolic pathway interaction network related to heat-stress response. Cytokinins (CKs), fatty acid metabolism and flavonoid and terpenoid biosynthesis were revealed as the most important pathways involved in heat-stress response in P. radiata, with zeatin riboside (ZR) and isopentenyl adenosine (iPA) as the key hormones coordinating these multiple and complex interactions. On the other hand, the integrative approach allowed elucidation of crucial metabolic mechanisms involved in heat response in P. radiata, as well as the identification of thermotolerance metabolic biomarkers (L-phenylalanine, hexadecanoic acid, and dihydromyricetin), crucial metabolites which can reschedule the metabolic strategy to adapt to high temperature. PMID:29719546

  17. Managing, Analysing, and Integrating Big Data in Medical Bioinformatics: Open Problems and Future Perspectives

    PubMed Central

    Merelli, Ivan; Pérez-Sánchez, Horacio; Gesing, Sandra; D'Agostino, Daniele

    2014-01-01

    The explosion of the data both in the biomedical research and in the healthcare systems demands urgent solutions. In particular, the research in omics sciences is moving from a hypothesis-driven to a data-driven approach. Healthcare is additionally always asking for a tighter integration with biomedical data in order to promote personalized medicine and to provide better treatments. Efficient analysis and interpretation of Big Data opens new avenues to explore molecular biology, new questions to ask about physiological and pathological states, and new ways to answer these open issues. Such analyses lead to better understanding of diseases and development of better and personalized diagnostics and therapeutics. However, such progresses are directly related to the availability of new solutions to deal with this huge amount of information. New paradigms are needed to store and access data, for its annotation and integration and finally for inferring knowledge and making it available to researchers. Bioinformatics can be viewed as the “glue” for all these processes. A clear awareness of present high performance computing (HPC) solutions in bioinformatics, Big Data analysis paradigms for computational biology, and the issues that are still open in the biomedical and healthcare fields represent the starting point to win this challenge. PMID:25254202

  18. Evaluating biological variation in non-transgenic crops: executive summary from the ILSI Health and Environmental Sciences Institute workshop, November 16-17, 2009, Paris, France.

    PubMed

    Doerrer, Nancy; Ladics, Gregory; McClain, Scott; Herouet-Guicheney, Corinne; Poulsen, Lars K; Privalle, Laura; Stagg, Nicola

    2010-12-01

    The International Life Sciences Institute Health and Environmental Sciences Institute Protein Allergenicity Technical Committee hosted an international workshop November 16-17, 2009, in Paris, France, with over 60 participants from academia, government, and industry to review and discuss the potential utility of "-omics" technologies for assessing the variability in plant gene, protein, and metabolite expression. The goal of the workshop was to illustrate how a plant's constituent makeup and phenotypic processes can be surveyed analytically. Presentations on the "-omics" techniques (i.e., genomics, proteomics, and metabolomics) highlighted the workshop, and summaries of these presentations are published separately in this supplemental issue. This paper summarizes key messages, as well as the consensus points reached, in a roundtable discussion on eight specific questions posed during the final session of the workshop. The workshop established some common, though not unique, challenges for all "-omics" techniques, and include (a) standardization of separation/extraction and analytical techniques; (b) difficulty in associating environmental impacts (e.g., planting, soil texture, location, climate, stress) with potential alterations in plants at genomic, proteomic, and metabolomic levels; (c) many independent analytical measurements, but few replicates/subjects--poorly defined accuracy and precision; and (d) bias--a lack of hypothesis-driven science. Information on natural plant variation is critical in establishing the utility of new technologies due to the variability in specific analytes that may result from genetic differences (crop genotype), different crop management practices (conventional high input, low input, organic), interaction between genotype and environment, and the use of different breeding methods. For example, variations of several classes of proteins were reported among different soybean, rice, or wheat varieties or varieties grown at different locations. Data on the variability of allergenic proteins are important in defining the risk of potential allergenicity. Once established as a standardized assay, survey approaches such as the "-omics" techniques can be considered in a hypothesis-driven analysis of plants, such as determining unintended effects in genetically modified (GM) crops. However, the analysis should include both the GM and control varieties that have the same breeding history and exposure to the same environmental conditions. Importantly, the biological relevance and safety significance of changes in "-omic" data are still unknown. Furthermore, the current compositional assessment for evaluating the substantial equivalence of GM crops is robust, comprehensive, and a good tool for food safety assessments. The overall consensus of the workshop participants was that many "-omics" techniques are extremely useful in the discovery and research phases of biotechnology, and are valuable for hypothesis generation. However, there are many methodological shortcomings identified with "-omics" approaches, a paucity of reference materials, and a lack of focused strategy for their use that currently make them not conducive for the safety assessment of GM crops. Copyright © 2010 Elsevier Inc. All rights reserved.

  19. Integrative omics analysis. A study based on Plasmodium falciparum mRNA and protein data.

    PubMed

    Tomescu, Oana A; Mattanovich, Diethard; Thallinger, Gerhard G

    2014-01-01

    Technological improvements have shifted the focus from data generation to data analysis. The availability of large amounts of data from transcriptomics, protemics and metabolomics experiments raise new questions concerning suitable integrative analysis methods. We compare three integrative analysis techniques (co-inertia analysis, generalized singular value decomposition and integrative biclustering) by applying them to gene and protein abundance data from the six life cycle stages of Plasmodium falciparum. Co-inertia analysis is an analysis method used to visualize and explore gene and protein data. The generalized singular value decomposition has shown its potential in the analysis of two transcriptome data sets. Integrative Biclustering applies biclustering to gene and protein data. Using CIA, we visualize the six life cycle stages of Plasmodium falciparum, as well as GO terms in a 2D plane and interpret the spatial configuration. With GSVD, we decompose the transcriptomic and proteomic data sets into matrices with biologically meaningful interpretations and explore the processes captured by the data sets. IBC identifies groups of genes, proteins, GO Terms and life cycle stages of Plasmodium falciparum. We show method-specific results as well as a network view of the life cycle stages based on the results common to all three methods. Additionally, by combining the results of the three methods, we create a three-fold validated network of life cycle stage specific GO terms: Sporozoites are associated with transcription and transport; merozoites with entry into host cell as well as biosynthetic and metabolic processes; rings with oxidation-reduction processes; trophozoites with glycolysis and energy production; schizonts with antigenic variation and immune response; gametocyctes with DNA packaging and mitochondrial transport. Furthermore, the network connectivity underlines the separation of the intraerythrocytic cycle from the gametocyte and sporozoite stages. Using integrative analysis techniques, we can integrate knowledge from different levels and obtain a wider view of the system under study. The overlap between method-specific and common results is considerable, even if the basic mathematical assumptions are very different. The three-fold validated network of life cycle stage characteristics of Plasmodium falciparum could identify a large amount of the known associations from literature in only one study.

  20. Integrative omics analysis. A study based on Plasmodium falciparum mRNA and protein data

    PubMed Central

    2014-01-01

    Background Technological improvements have shifted the focus from data generation to data analysis. The availability of large amounts of data from transcriptomics, protemics and metabolomics experiments raise new questions concerning suitable integrative analysis methods. We compare three integrative analysis techniques (co-inertia analysis, generalized singular value decomposition and integrative biclustering) by applying them to gene and protein abundance data from the six life cycle stages of Plasmodium falciparum. Co-inertia analysis is an analysis method used to visualize and explore gene and protein data. The generalized singular value decomposition has shown its potential in the analysis of two transcriptome data sets. Integrative Biclustering applies biclustering to gene and protein data. Results Using CIA, we visualize the six life cycle stages of Plasmodium falciparum, as well as GO terms in a 2D plane and interpret the spatial configuration. With GSVD, we decompose the transcriptomic and proteomic data sets into matrices with biologically meaningful interpretations and explore the processes captured by the data sets. IBC identifies groups of genes, proteins, GO Terms and life cycle stages of Plasmodium falciparum. We show method-specific results as well as a network view of the life cycle stages based on the results common to all three methods. Additionally, by combining the results of the three methods, we create a three-fold validated network of life cycle stage specific GO terms: Sporozoites are associated with transcription and transport; merozoites with entry into host cell as well as biosynthetic and metabolic processes; rings with oxidation-reduction processes; trophozoites with glycolysis and energy production; schizonts with antigenic variation and immune response; gametocyctes with DNA packaging and mitochondrial transport. Furthermore, the network connectivity underlines the separation of the intraerythrocytic cycle from the gametocyte and sporozoite stages. Conclusion Using integrative analysis techniques, we can integrate knowledge from different levels and obtain a wider view of the system under study. The overlap between method-specific and common results is considerable, even if the basic mathematical assumptions are very different. The three-fold validated network of life cycle stage characteristics of Plasmodium falciparum could identify a large amount of the known associations from literature in only one study. PMID:25033389

  1. Biomedical data integration in computational drug design and bioinformatics.

    PubMed

    Seoane, Jose A; Aguiar-Pulido, Vanessa; Munteanu, Cristian R; Rivero, Daniel; Rabunal, Juan R; Dorado, Julian; Pazos, Alejandro

    2013-03-01

    In recent years, in the post genomic era, more and more data is being generated by biological high throughput technologies, such as proteomics and transcriptomics. This omics data can be very useful, but the real challenge is to analyze all this data, as a whole, after integrating it. Biomedical data integration enables making queries to different, heterogeneous and distributed biomedical data sources. Data integration solutions can be very useful not only in the context of drug design, but also in biomedical information retrieval, clinical diagnosis, system biology, etc. In this review, we analyze the most common approaches to biomedical data integration, such as federated databases, data warehousing, multi-agent systems and semantic technology, as well as the solutions developed using these approaches in the past few years.

  2. Application of meta-omics techniques to understand greenhouse gas emissions originating from ruminal metabolism.

    PubMed

    Wallace, Robert J; Snelling, Timothy J; McCartney, Christine A; Tapio, Ilma; Strozzi, Francesco

    2017-01-16

    Methane emissions from ruminal fermentation contribute significantly to total anthropological greenhouse gas (GHG) emissions. New meta-omics technologies are beginning to revolutionise our understanding of the rumen microbial community structure, metabolic potential and metabolic activity. Here we explore these developments in relation to GHG emissions. Microbial rumen community analyses based on small subunit ribosomal RNA sequence analysis are not yet predictive of methane emissions from individual animals or treatments. Few metagenomics studies have been directly related to GHG emissions. In these studies, the main genes that differed in abundance between high and low methane emitters included archaeal genes involved in methanogenesis, with others that were not apparently related to methane metabolism. Unlike the taxonomic analysis up to now, the gene sets from metagenomes may have predictive value. Furthermore, metagenomic analysis predicts metabolic function better than only a taxonomic description, because different taxa share genes with the same function. Metatranscriptomics, the study of mRNA transcript abundance, should help to understand the dynamic of microbial activity rather than the gene abundance; to date, only one study has related the expression levels of methanogenic genes to methane emissions, where gene abundance failed to do so. Metaproteomics describes the proteins present in the ecosystem, and is therefore arguably a better indication of microbial metabolism. Both two-dimensional polyacrylamide gel electrophoresis and shotgun peptide sequencing methods have been used for ruminal analysis. In our unpublished studies, both methods showed an abundance of archaeal methanogenic enzymes, but neither was able to discriminate high and low emitters. Metabolomics can take several forms that appear to have predictive value for methane emissions; ruminal metabolites, milk fatty acid profiles, faecal long-chain alcohols and urinary metabolites have all shown promising results. Rumen microbial amino acid metabolism lies at the root of excessive nitrogen emissions from ruminants, yet only indirect inferences for nitrogen emissions can be drawn from meta-omics studies published so far. Annotation of meta-omics data depends on databases that are generally weak in rumen microbial entries. The Hungate 1000 project and Global Rumen Census initiatives are therefore essential to improve the interpretation of sequence/metabolic information.

  3. imDEV: a graphical user interface to R multivariate analysis tools in Microsoft Excel

    PubMed Central

    Grapov, Dmitry; Newman, John W.

    2012-01-01

    Summary: Interactive modules for Data Exploration and Visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data through a user-friendly interface. Individual modules enables interactive and dynamic analyses of large data by interfacing R's multivariate statistics and highly customizable visualizations with the spreadsheet environment, aiding robust inferences and generating information-rich data visualizations. This tool provides access to multiple comparisons with false discovery correction, hierarchical clustering, principal and independent component analyses, partial least squares regression and discriminant analysis, through an intuitive interface for creating high-quality two- and a three-dimensional visualizations including scatter plot matrices, distribution plots, dendrograms, heat maps, biplots, trellis biplots and correlation networks. Availability and implementation: Freely available for download at http://sourceforge.net/projects/imdev/. Implemented in R and VBA and supported by Microsoft Excel (2003, 2007 and 2010). Contact: John.Newman@ars.usda.gov Supplementary Information: Installation instructions, tutorials and users manual are available at http://sourceforge.net/projects/imdev/. PMID:22815358

  4. Systems biology of yeast: enabling technology for development of cell factories for production of advanced biofuels.

    PubMed

    de Jong, Bouke; Siewers, Verena; Nielsen, Jens

    2012-08-01

    Transportation fuels will gradually shift from oil based fuels towards alternative fuel resources like biofuels. Current bioethanol and biodiesel can, however, not cover the increasing demand for biofuels and there is therefore a need for advanced biofuels with superior fuel properties. Novel cell factories will provide a production platform for advanced biofuels. However, deep cellular understanding is required for improvement of current biofuel cell factories. Fast screening and analysis (-omics) methods and metabolome-wide mathematical models are promising techniques. An integrated systems approach of these techniques drives diversity and quantity of several new biofuel compounds. This review will cover the recent technological developments that support improvement of the advanced biofuels 1-butanol, biodiesels and jetfuels. Copyright © 2011 Elsevier Ltd. All rights reserved.

  5. Novel insights, challenges and practical implications of DOHaD-omics research.

    PubMed

    Hodyl, Nicolette A; Muhlhausler, Beverly

    2016-02-15

    Research investigating the developmental origins of health and disease (DOHaD) has never had the technology to investigate physiology in such a data-rich capacity and at such a microlevel as it does now. A symposium at the inaugural meeting of the DOHaD Society of Australia and New Zealand outlined the advantages and challenges of using "-omics" technologies in DOHaD research. DOHaD studies with -omics approaches to generate large, rich datasets were discussed. We discuss implications for policy and practice and make recommendations to facilitate successful translation of results of future DOHaD-omics studies.

  6. Metabolomics and transcriptomics profiles reveal the dysregulation of the tricarboxylic acid cycle and related mechanisms in prostate cancer.

    PubMed

    Shao, Yaping; Ye, Guozhu; Ren, Shancheng; Piao, Hai-Long; Zhao, Xinjie; Lu, Xin; Wang, Fubo; Ma, Wang; Li, Jia; Yin, Peiyuan; Xia, Tian; Xu, Chuanliang; Yu, Jane J; Sun, Yinghao; Xu, Guowang

    2018-07-15

    Genetic alterations drive metabolic reprograming to meet increased biosynthetic precursor and energy demands for cancer cell proliferation and survival in unfavorable environments. A systematic study of gene-metabolite regulatory networks and metabolic dysregulation should reveal the molecular mechanisms underlying prostate cancer (PCa) pathogenesis. Herein, we performed gas chromatography-mass spectrometry (GC-MS)-based metabolomics and RNA-seq analyses in prostate tumors and matched adjacent normal tissues (ANTs) to elucidate the molecular alterations and potential underlying regulatory mechanisms in PCa. Significant accumulation of metabolic intermediates and enrichment of genes in the tricarboxylic acid (TCA) cycle were observed in tumor tissues, indicating TCA cycle hyperactivation in PCa tissues. In addition, the levels of fumarate and malate were highly correlated with the Gleason score, tumor stage and expression of genes encoding related enzymes and were significantly related to the expression of genes involved in branched chain amino acid degradation. Using an integrated omics approach, we further revealed the potential anaplerotic routes from pyruvate, glutamine catabolism and branched chain amino acid (BCAA) degradation contributing to replenishing metabolites for TCA cycle. Integrated omics techniques enable the performance of network-based analyses to gain a comprehensive and in-depth understanding of PCa pathophysiology and may facilitate the development of new and effective therapeutic strategies. © 2018 UICC.

  7. Integrating 'omic' data and biogeochemical modeling: the key to understanding the microbial regulation of matter cycling in soil

    NASA Astrophysics Data System (ADS)

    Pagel, Holger; Kandeler, Ellen; Seifert, Jana; Camarinha-Silva, Amélia; Kügler, Philipp; Rennert, Thilo; Poll, Christian; Streck, Thilo

    2016-04-01

    Matter cycling in soils and associated soil functions are intrinsically controlled by microbial dynamics. It is therefore crucial to consider functional traits of microorganisms in biogeochemical models. Tremendous advances in 'omic' methods provide a plethora of data on physiology, metabolic capabilities and ecological life strategies of microorganisms in soil. Combined with isotopic techniques, biochemical pathways and transformations can be identified and quantified. Such data have been, however, rarely used to improve the mechanistic representation of microbial dynamics in soil organic matter models. It is the goal of the Young Investigator Group SoilReg to address this challenge. Our general approach is to tightly integrate experiments and biochemical modeling. NextGen sequencing will be applied to identify key functional groups. Active microbial groups will be quantified by measurements of functional genes and by stable isotope probing methods of DNA and proteins. Based on this information a biogeochemical model that couples a mechanistic representation of microbial dynamics with physicochemical processes will be set up and calibrated. Sensitivity and stability analyses of the model as well as scenario simulations will reveal the importance of intrinsic and extrinsic controls of organic matter turnover. We will demonstrate our concept and present first results of two case studies on pesticide degradation and methane oxidation.

  8. Is laboratory medicine ready for the era of personalized medicine? A survey addressed to laboratory directors of hospitals/academic schools of medicine in Europe.

    PubMed

    Malentacchi, Francesca; Mancini, Irene; Brandslund, Ivan; Vermeersch, Pieter; Schwab, Matthias; Marc, Janja; van Schaik, Ron H N; Siest, Gerard; Theodorsson, Elvar; Pazzagli, Mario; Di Resta, Chiara

    2015-06-01

    Developments in "-omics" are creating a paradigm shift in laboratory medicine leading to personalized medicine. This allows the increase in diagnostics and therapeutics focused on individuals rather than populations. In order to investigate whether laboratory medicine is ready to play a key role in the integration of personalized medicine in routine health care and set the state-of-the-art knowledge about personalized medicine and laboratory medicine in Europe, a questionnaire was constructed under the auspices of the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) and the European Society of Pharmacogenomics and Personalised Therapy (ESPT). The answers of the participating laboratory medicine professionals indicate that they are aware that personalized medicine can represent a new and promising health model, and that laboratory medicine should play a key role in supporting the implementation of personalized medicine in the clinical setting. Participants think that the current organization of laboratory medicine needs additional/relevant implementations such as (i) new technological facilities in -omics; (ii) additional training for the current personnel focused on the new methodologies; (iii) incorporation in the laboratory of new competencies in data interpretation and counseling; and (iv) cooperation and collaboration among professionals of different disciplines to integrate information according to a personalized medicine approach.

  9. Clinical omics analysis of colorectal cancer incorporating copy number aberrations and gene expression data.

    PubMed

    Yoshida, Tsuyoshi; Kobayashi, Takumi; Itoda, Masaya; Muto, Taika; Miyaguchi, Ken; Mogushi, Kaoru; Shoji, Satoshi; Shimokawa, Kazuro; Iida, Satoru; Uetake, Hiroyuki; Ishikawa, Toshiaki; Sugihara, Kenichi; Mizushima, Hiroshi; Tanaka, Hiroshi

    2010-07-29

    Colorectal cancer (CRC) is one of the most frequently occurring cancers in Japan, and thus a wide range of methods have been deployed to study the molecular mechanisms of CRC. In this study, we performed a comprehensive analysis of CRC, incorporating copy number aberration (CRC) and gene expression data. For the last four years, we have been collecting data from CRC cases and organizing the information as an "omics" study by integrating many kinds of analysis into a single comprehensive investigation. In our previous studies, we had experienced difficulty in finding genes related to CRC, as we observed higher noise levels in the expression data than in the data for other cancers. Because chromosomal aberrations are often observed in CRC, here, we have performed a combination of CNA analysis and expression analysis in order to identify some new genes responsible for CRC. This study was performed as part of the Clinical Omics Database Project at Tokyo Medical and Dental University. The purpose of this study was to investigate the mechanism of genetic instability in CRC by this combination of expression analysis and CNA, and to establish a new method for the diagnosis and treatment of CRC. Comprehensive gene expression analysis was performed on 79 CRC cases using an Affymetrix Gene Chip, and comprehensive CNA analysis was performed using an Affymetrix DNA Sty array. To avoid the contamination of cancer tissue with normal cells, laser micro-dissection was performed before DNA/RNA extraction. Data analysis was performed using original software written in the R language. We observed a high percentage of CNA in colorectal cancer, including copy number gains at 7, 8q, 13 and 20q, and copy number losses at 8p, 17p and 18. Gene expression analysis provided many candidates for CRC-related genes, but their association with CRC did not reach the level of statistical significance. The combination of CNA and gene expression analysis, together with the clinical information, suggested UGT2B28, LOC440995, CXCL6, SULT1B1, RALBP1, TYMS, RAB12, RNMT, ARHGDIB, S1000A2, ABHD2, OIT3 and ABHD12 as genes that are possibly associated with CRC. Some of these genes have already been reported as being related to CRC. TYMS has been reported as being associated with resistance to the anti-cancer drug 5-fluorouracil, and we observed a copy number increase for this gene. RALBP1, ARHGDIB and S100A2 have been reported as oncogenes, and we observed copy number increases in each. ARHGDIB has been reported as a metastasis-related gene, and our data also showed copy number increases of this gene in cases with metastasis. The combination of CNA analysis and gene expression analysis was a more effective method for finding genes associated with the clinicopathological classification of CRC than either analysis alone. Using this combination of methods, we were able to detect genes that have already been associated with CRC. We also identified additional candidate genes that may be new markers or targets for this form of cancer.

  10. Air pollution and the fetal origin of disease: A systematic review of the molecular signatures of air pollution exposure in human placenta.

    PubMed

    Luyten, Leen J; Saenen, Nelly D; Janssen, Bram G; Vrijens, Karen; Plusquin, Michelle; Roels, Harry A; Debacq-Chainiaux, Florence; Nawrot, Tim S

    2018-06-13

    Fetal development is a crucial window of susceptibility in which exposure-related alterations can be induced on the molecular level, leading to potential changes in metabolism and development. The placenta serves as a gatekeeper between mother and fetus, and is in contact with environmental stressors throughout pregnancy. This makes the placenta as a temporary organ an informative non-invasive matrix suitable to investigate omics-related aberrations in association with in utero exposures such as ambient air pollution. To summarize and discuss the current evidence and define the gaps of knowledge concerning human placental -omics markers in association with prenatal exposure to ambient air pollution. Two investigators independently searched the PubMed, ScienceDirect, and Scopus databases to identify all studies published until January 2017 with an emphasis on epidemiological research on prenatal exposure to ambient air pollution and the effect on placental -omics signatures. From the initial 386 articles, 25 were retained following an a priori set inclusion and exclusion criteria. We identified eleven studies on the genome, two on the transcriptome, five on the epigenome, five on the proteome category, one study with both genomic and proteomic topics, and one study with both genomic and transcriptomic topics. Six studies discussed the triple relationship between exposure to air pollution during pregnancy, the associated placental -omics marker(s), and the potential effect on disease development later in life. So far, no metabolomic or exposomic data discussing associations between the placenta and prenatal exposure to air pollution have been published. Integration of placental biomarkers in an environmental epidemiological context enables researchers to address fundamental questions essential in unraveling the fetal origin of disease and helps to better define the pregnancy exposome of air pollution. Copyright © 2018 Elsevier Inc. All rights reserved.

  11. OMICS and 21st century brain surgery from education to practice: James Rutka of the University of Toronto interviewed by Joseph B. Martin (Boston) and Türker Kılıç (İstanbul).

    PubMed

    Rutka, James; Martin, Joseph; Kılıç, Türker

    2014-12-01

    The Science-in-Backstage interviews aim to share experiences by global medical and life sciences thought leaders on emergent technologies and novel scientific, medical, and educational practices, situating them in both a historical and contemporary science context so as to "look into the biotechnology and innovation futures" reflexively and intelligently. OMICS systems diagnostics and personalized medicine are greatly impacting brain surgery, not to forget the training of the next generation of neurosurgeons. What do the futures hold for the practice of, and education in 21(st) century brain surgery in the age of OMICS systems science, personalized medicine, and the use of simulation in surgeon training? James Rutka is a clinician scientist and a world leader in diagnosis and treatment of brain tumors. He is Professor and Chair of the Department of Surgery at the Faculty of Medicine, University of Toronto, a President Emeritus of the American Association of Neurological Surgeons, and Editor-in-Chief of the Journal of Neurosurgery. Professor Rutka was interviewed for the global medical, biotechnology, and life sciences readership of the OMICS: A Journal of Integrative Biology to speak on these pressing questions in his personal capacity as an independent senior scholar. The issues debated in the present interview are of broad relevance for 21(st) century surgery and postgenomics medicine. The interviewers were Professor Joseph B. Martin, Harvard Medical School Dean Emeritus in Boston and Joint Dean of Medicine at Bahçeşehir University in İstanbul, and the author of "Alfalfa to Ivy: Memoir of a Harvard Medical School Dean," and Professor Türker Kılıç, Dean of Medicine at Bahçeşehir University in İstanbul, and an elected member of the Turkish Academy of Sciences.

  12. Biomedical analysis of formalin-fixed, paraffin-embedded tissue samples: The Holy Grail for molecular diagnostics.

    PubMed

    Donczo, Boglarka; Guttman, Andras

    2018-06-05

    More than a century ago in 1893, a revolutionary idea about fixing biological tissue specimens was introduced by Ferdinand Blum, a German physician. Since then, a plethora of fixation methods have been investigated and used. Formalin fixation with paraffin embedment became the most widely used types of fixation and preservation method, due to its proper architectural conservation of tissue structures and cellular shape. The huge collection of formalin-fixed, paraffin-embedded (FFPE) sample archives worldwide holds a large amount of unearthed information about diseases that could be the Holy Grail in contemporary biomarker research utilizing analytical omics based molecular diagnostics. The aim of this review is to critically evaluate the omics options for FFPE tissue sample analysis in the molecular diagnostics field. Copyright © 2018. Published by Elsevier B.V.

  13. [Nutrigenetics and nutrigenomics - application of „omics” technologies in optimization of human nutrition].

    PubMed

    Panczyk, Mariusz

    2013-01-01

    Nowadays nutrigenetics and nutrigenomics are perceived as one of the most important research areas ensuring better understanding of an impact of nutrition on human health. Since such researches are interdisciplinary in type, there is a problem with their widespread acceptance and practical clinical application of obtained results. Understanding the new ideas and hypotheses published in researches on nutrigenetics/nutrigenomics requires some knowledge of genetics, biochemistry, molecular biology, and capabilities and limitations that are associated with the use of statistical and bioinformatic analysis, and above all „omics” research technologies (genomics, transcriptomics, proteomics, metabolomics). Highly efficient genome and proteome analysis techniques allow to obtain data necessary for profiling of an individual patient. The main problem is still our insufficient knowledge of cell physiology and biochemistry. The vast amount of information is obtained with the use of „omics” technologies what makes it difficult to interpret and infer. An unquestionable advantage of this type of research is the possibility to utilize system analysis (system biology) which is important in the context of a holistic interpretation of biological phenomena. This review is an attempt to present the main hypotheses and objectives which are carried out by researchers in nutrigenetics/nutrigenomics. This article describes the most important directions of research and anticipated results that are related to the practical use of nutritional genomics as well as the critical assessment of the possible impact of future developments on public health.

  14. Genome, transcriptome and methylome sequencing of a primitively eusocial wasp reveal a greatly reduced DNA methylation system in a social insect.

    PubMed

    Standage, Daniel S; Berens, Ali J; Glastad, Karl M; Severin, Andrew J; Brendel, Volker P; Toth, Amy L

    2016-04-01

    Comparative genomics of social insects has been intensely pursued in recent years with the goal of providing insights into the evolution of social behaviour and its underlying genomic and epigenomic basis. However, the comparative approach has been hampered by a paucity of data on some of the most informative social forms (e.g. incipiently and primitively social) and taxa (especially members of the wasp family Vespidae) for studying social evolution. Here, we provide a draft genome of the primitively eusocial model insect Polistes dominula, accompanied by analysis of caste-related transcriptome and methylome sequence data for adult queens and workers. Polistes dominula possesses a fairly typical hymenopteran genome, but shows very low genomewide GC content and some evidence of reduced genome size. We found numerous caste-related differences in gene expression, with evidence that both conserved and novel genes are related to caste differences. Most strikingly, these -omics data reveal a major reduction in one of the major epigenetic mechanisms that has been previously suggested to be important for caste differences in social insects: DNA methylation. Along with a conspicuous loss of a key gene associated with environmentally responsive DNA methylation (the de novo DNA methyltransferase Dnmt3), these wasps have greatly reduced genomewide methylation to almost zero. In addition to providing a valuable resource for comparative analysis of social insect evolution, our integrative -omics data for this important behavioural and evolutionary model system call into question the general importance of DNA methylation in caste differences and evolution in social insects. © 2016 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  15. [In silico, in vitro, in omic experimental models and drug safety evaluation].

    PubMed

    Claude, Nancy; Goldfain-Blanc, Françoise; Guillouzo, André

    2009-01-01

    Over the last few decades, toxicology has benefited from scientific, technical, and bioinformatic developments relating to patient safety assessment during clinical and drug marketing studies. Based on this knowledge, new in silico, in vitro, and "omic" experimental models are emerging. Although these models cannot currently replace classic safety evaluations performed on laboratory animals, they allow compounds with unacceptable toxicity to be rejected in the early stages of drug development, thereby reducing the number of laboratory animals needed. In addition, because these models are particularly adapted to mechanistic studies, they can help to improve the relevance of the data obtained, thus enabling better prevention and screening of the adverse effects that may occur in humans. Much progress remains to be done, especially in the field of validation. Nevertheless, current efforts by industrial, academic laboratories, and regulatory agencies should, in coming years, significantly improve preclinical drug safety evaluation thanks to the integration of these new methods into the drug research and development process.

  16. Big data: the next frontier for innovation in therapeutics and healthcare.

    PubMed

    Issa, Naiem T; Byers, Stephen W; Dakshanamurthy, Sivanesan

    2014-05-01

    Advancements in genomics and personalized medicine not only effect healthcare delivery from patient and provider standpoints, but also reshape biomedical discovery. We are in the era of the '-omics', wherein an individual's genome, transcriptome, proteome and metabolome can be scrutinized to the finest resolution to paint a personalized biochemical fingerprint that enables tailored treatments, prognoses, risk factors, etc. Digitization of this information parlays into 'big data' informatics-driven evidence-based medical practice. While individualized patient management is a key beneficiary of next-generation medical informatics, this data also harbors a wealth of novel therapeutic discoveries waiting to be uncovered. 'Big data' informatics allows for networks-driven systems pharmacodynamics whereby drug information can be coupled to cellular- and organ-level physiology for determining whole-body outcomes. Patient '-omics' data can be integrated for ontology-based data-mining for the discovery of new biological associations and drug targets. Here we highlight the potential of 'big data' informatics for clinical pharmacology.

  17. Application of proteomics in research on traditional Chinese medicine.

    PubMed

    Suo, Tongchuan; Wang, Haixia; Li, Zheng

    2016-09-01

    Traditional Chinese medicine (TCM) is a widely used complementary alternative medicine approach. Although many aspects of its effectiveness have been approved clinically, rigorous scientific techniques are highly required to translate the promises from TCM into powerful modern therapies. In this respect, proteomics is useful because of its ability to unveil the underlying target proteins and/or protein biomarkers. In this review, we summarize the recent interplay between proteomics and research on TCM, ranging from exploration of the medicinal materials to the biological basis of TCM concepts, and from pathological studies to pharmacological investigations. We show that proteomic analyses provide preliminary biological evidence of the promises in TCM, and the integration of proteomics with other omics and bioinformatics offers a comprehensive methodology to address the complications of TCM. Expert commentary: Currently, only limited information can be obtained regarding TCM issues and thus more work is required to resolve the ambiguity. As such, more collaborations between proteomics and other techniques (other omics, network pharmacology, etc.) are essential for deciphering the underlying biological basis in TCM topics.

  18. Active Interaction Mapping as a tool to elucidate hierarchical functions of biological processes.

    PubMed

    Farré, Jean-Claude; Kramer, Michael; Ideker, Trey; Subramani, Suresh

    2017-07-03

    Increasingly, various 'omics data are contributing significantly to our understanding of novel biological processes, but it has not been possible to iteratively elucidate hierarchical functions in complex phenomena. We describe a general systems biology approach called Active Interaction Mapping (AI-MAP), which elucidates the hierarchy of functions for any biological process. Existing and new 'omics data sets can be iteratively added to create and improve hierarchical models which enhance our understanding of particular biological processes. The best datatypes to further improve an AI-MAP model are predicted computationally. We applied this approach to our understanding of general and selective autophagy, which are conserved in most eukaryotes, setting the stage for the broader application to other cellular processes of interest. In the particular application to autophagy-related processes, we uncovered and validated new autophagy and autophagy-related processes, expanded known autophagy processes with new components, integrated known non-autophagic processes with autophagy and predict other unexplored connections.

  19. Metabolomics through the lens of precision cardiovascular medicine.

    PubMed

    Lam, Sin Man; Wang, Yuan; Li, Bowen; Du, Jie; Shui, Guanghou

    2017-03-20

    Metabolomics, which targets at the extensive characterization and quantitation of global metabolites from both endogenous and exogenous sources, has emerged as a novel technological avenue to advance the field of precision medicine principally driven by genomics-oriented approaches. In particular, metabolomics has revealed the cardinal roles that the environment exerts in driving the progression of major diseases threatening public health. Herein, the existent and potential applications of metabolomics in two key areas of precision cardiovascular medicine will be critically discussed: 1) the use of metabolomics in unveiling novel disease biomarkers and pathological pathways; 2) the contribution of metabolomics in cardiovascular drug development. Major issues concerning the statistical handling of big data generated by metabolomics, as well as its interpretation, will be briefly addressed. Finally, the need for integration of various omics branches and adopting a multi-omics approach to precision medicine will be discussed. Copyright © 2017 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.

  20. Network diffusion-based analysis of high-throughput data for the detection of differentially enriched modules

    PubMed Central

    Bersanelli, Matteo; Mosca, Ettore; Remondini, Daniel; Castellani, Gastone; Milanesi, Luciano

    2016-01-01

    A relation exists between network proximity of molecular entities in interaction networks, functional similarity and association with diseases. The identification of network regions associated with biological functions and pathologies is a major goal in systems biology. We describe a network diffusion-based pipeline for the interpretation of different types of omics in the context of molecular interaction networks. We introduce the network smoothing index, a network-based quantity that allows to jointly quantify the amount of omics information in genes and in their network neighbourhood, using network diffusion to define network proximity. The approach is applicable to both descriptive and inferential statistics calculated on omics data. We also show that network resampling, applied to gene lists ranked by quantities derived from the network smoothing index, indicates the presence of significantly connected genes. As a proof of principle, we identified gene modules enriched in somatic mutations and transcriptional variations observed in samples of prostate adenocarcinoma (PRAD). In line with the local hypothesis, network smoothing index and network resampling underlined the existence of a connected component of genes harbouring molecular alterations in PRAD. PMID:27731320

  1. cual-id: Globally Unique, Correctable, and Human-Friendly Sample Identifiers for Comparative Omics Studies.

    PubMed

    Chase, John H; Bolyen, Evan; Rideout, Jai Ram; Caporaso, J Gregory

    2016-01-01

    The number of samples in high-throughput comparative "omics" studies is increasing rapidly due to declining experimental costs. To keep sample data and metadata manageable and to ensure the integrity of scientific results as the scale of these projects continues to increase, it is essential that we transition to better-designed sample identifiers. Ideally, sample identifiers should be globally unique across projects, project teams, and institutions; short (to facilitate manual transcription); correctable with respect to common types of transcription errors; opaque, meaning that they do not contain information about the samples; and compatible with existing standards. We present cual-id, a lightweight command line tool that creates, or mints, sample identifiers that meet these criteria without reliance on centralized infrastructure. cual-id allows users to assign universally unique identifiers, or UUIDs, that are globally unique to their samples. UUIDs are too long to be conveniently written on sampling materials, such as swabs or microcentrifuge tubes, however, so cual-id additionally generates human-friendly 4- to 12-character identifiers that map to their UUIDs and are unique within a project. By convention, we use "cual-id" to refer to the software, "CualID" to refer to the short, human-friendly identifiers, and "UUID" to refer to the globally unique identifiers. CualIDs are used by humans when they manually write or enter identifiers, while the longer UUIDs are used by computers to unambiguously reference a sample. Finally, cual-id optionally generates printable label sticker sheets containing Code 128 bar codes and CualIDs for labeling of sample collection and processing materials. IMPORTANCE The adoption of identifiers that are globally unique, correctable, and easily handwritten or manually entered into a computer will be a major step forward for sample tracking in comparative omics studies. As the fields transition to more-centralized sample management, for example, across labs within an institution, across projects funded under a common program, or in systems designed to facilitate meta- and/or integrated analysis, sample identifiers generated with cual-id will not need to change; thus, costly and error-prone updating of data and metadata identifiers will be avoided. Further, using cual-id will ensure that transcription errors in sample identifiers do not require the discarding of otherwise-useful samples that may have been expensive to obtain. Finally, cual-id is simple to install and use and is free for all use. No centralized infrastructure is required to ensure global uniqueness, so it is feasible for any lab to get started using these identifiers within their existing infrastructure.

  2. Security controls in an integrated Biobank to protect privacy in data sharing: rationale and study design.

    PubMed

    Takai-Igarashi, Takako; Kinoshita, Kengo; Nagasaki, Masao; Ogishima, Soichi; Nakamura, Naoki; Nagase, Sachiko; Nagaie, Satoshi; Saito, Tomo; Nagami, Fuji; Minegishi, Naoko; Suzuki, Yoichi; Suzuki, Kichiya; Hashizume, Hiroaki; Kuriyama, Shinichi; Hozawa, Atsushi; Yaegashi, Nobuo; Kure, Shigeo; Tamiya, Gen; Kawaguchi, Yoshio; Tanaka, Hiroshi; Yamamoto, Masayuki

    2017-07-06

    With the goal of realizing genome-based personalized healthcare, we have developed a biobank that integrates personal health, genome, and omics data along with biospecimens donated by volunteers of 150,000. Such a large-scale of data integration involves obvious risks of privacy violation. The research use of personal genome and health information is a topic of global discussion with regard to the protection of privacy while promoting scientific advancement. The present paper reports on our plans, current attempts, and accomplishments in addressing security problems involved in data sharing to ensure donor privacy while promoting scientific advancement. Biospecimens and data have been collected in prospective cohort studies with the comprehensive agreement. The sample size of 150,000 participants was required for multiple researches including genome-wide screening of gene by environment interactions, haplotype phasing, and parametric linkage analysis. We established the T ohoku M edical M egabank (TMM) data sharing policy: a privacy protection rule that requires physical, personnel, and technological safeguards against privacy violation regarding the use and sharing of data. The proposed policy refers to that of NCBI and that of the Sanger Institute. The proposed policy classifies shared data according to the strength of re-identification risks. Local committees organized by TMM evaluate re-identification risk and assign a security category to a dataset. Every dataset is stored in an assigned segment of a supercomputer in accordance with its security category. A security manager should be designated to handle all security problems at individual data use locations. The proposed policy requires closed networks and IP-VPN remote connections. The mission of the biobank is to distribute biological resources most productively. This mission motivated us to collect biospecimens and health data and simultaneously analyze genome/omics data in-house. The biobank also has the mission of improving the quality and quantity of the contents of the biobank. This motivated us to request users to share the results of their research as feedback to the biobank. The TMM data sharing policy has tackled every security problem originating with the missions. We believe our current implementation to be the best way to protect privacy in data sharing.

  3. Integrative methods for analyzing big data in precision medicine.

    PubMed

    Gligorijević, Vladimir; Malod-Dognin, Noël; Pržulj, Nataša

    2016-03-01

    We provide an overview of recent developments in big data analyses in the context of precision medicine and health informatics. With the advance in technologies capturing molecular and medical data, we entered the area of "Big Data" in biology and medicine. These data offer many opportunities to advance precision medicine. We outline key challenges in precision medicine and present recent advances in data integration-based methods to uncover personalized information from big data produced by various omics studies. We survey recent integrative methods for disease subtyping, biomarkers discovery, and drug repurposing, and list the tools that are available to domain scientists. Given the ever-growing nature of these big data, we highlight key issues that big data integration methods will face. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Metabolomics for Biomarker Discovery in Gastroenterological Cancer

    PubMed Central

    Nishiumi, Shin; Suzuki, Makoto; Kobayashi, Takashi; Matsubara, Atsuki; Azuma, Takeshi; Yoshida, Masaru

    2014-01-01

    The study of the omics cascade, which involves comprehensive investigations based on genomics, transcriptomics, proteomics, metabolomics, etc., has developed rapidly and now plays an important role in life science research. Among such analyses, metabolome analysis, in which the concentrations of low molecular weight metabolites are comprehensively analyzed, has rapidly developed along with improvements in analytical technology, and hence, has been applied to a variety of research fields including the clinical, cell biology, and plant/food science fields. The metabolome represents the endpoint of the omics cascade and is also the closest point in the cascade to the phenotype. Moreover, it is affected by variations in not only the expression but also the enzymatic activity of several proteins. Therefore, metabolome analysis can be a useful approach for finding effective diagnostic markers and examining unknown pathological conditions. The number of studies involving metabolome analysis has recently been increasing year-on-year. Here, we describe the findings of studies that used metabolome analysis to attempt to discover biomarker candidates for gastroenterological cancer and discuss metabolome analysis-based disease diagnosis. PMID:25003943

  5. Systems view of adipogenesis via novel omics-driven and tissue-specific activity scoring of network functional modules

    NASA Astrophysics Data System (ADS)

    Nassiri, Isar; Lombardo, Rosario; Lauria, Mario; Morine, Melissa J.; Moyseos, Petros; Varma, Vijayalakshmi; Nolen, Greg T.; Knox, Bridgett; Sloper, Daniel; Kaput, Jim; Priami, Corrado

    2016-07-01

    The investigation of the complex processes involved in cellular differentiation must be based on unbiased, high throughput data processing methods to identify relevant biological pathways. A number of bioinformatics tools are available that can generate lists of pathways ranked by statistical significance (i.e. by p-value), while ideally it would be desirable to functionally score the pathways relative to each other or to other interacting parts of the system or process. We describe a new computational method (Network Activity Score Finder - NASFinder) to identify tissue-specific, omics-determined sub-networks and the connections with their upstream regulator receptors to obtain a systems view of the differentiation of human adipocytes. Adipogenesis of human SBGS pre-adipocyte cells in vitro was monitored with a transcriptomic data set comprising six time points (0, 6, 48, 96, 192, 384 hours). To elucidate the mechanisms of adipogenesis, NASFinder was used to perform time-point analysis by comparing each time point against the control (0 h) and time-lapse analysis by comparing each time point with the previous one. NASFinder identified the coordinated activity of seemingly unrelated processes between each comparison, providing the first systems view of adipogenesis in culture. NASFinder has been implemented into a web-based, freely available resource associated with novel, easy to read visualization of omics data sets and network modules.

  6. Cross-study and cross-omics comparisons of three nephrotoxic compounds reveal mechanistic insights and new candidate biomarkers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Matheis, Katja A., E-mail: katja.matheis@boehringer-ingelheim.com; Com, Emmanuelle; High-Throughput Proteomics Core Facility OUEST-genopole

    2011-04-15

    The European InnoMed-PredTox project was a collaborative effort between 15 pharmaceutical companies, 2 small and mid-sized enterprises, and 3 universities with the goal of delivering deeper insights into the molecular mechanisms of kidney and liver toxicity and to identify mechanism-linked diagnostic or prognostic safety biomarker candidates by combining conventional toxicological parameters with 'omics' data. Mechanistic toxicity studies with 16 different compounds, 2 dose levels, and 3 time points were performed in male Crl: WI(Han) rats. Three of the 16 investigated compounds, BI-3 (FP007SE), Gentamicin (FP009SF), and IMM125 (FP013NO), induced kidney proximal tubule damage (PTD). In addition to histopathology and clinicalmore » chemistry, transcriptomics microarray and proteomics 2D-DIGE analysis were performed. Data from the three PTD studies were combined for a cross-study and cross-omics meta-analysis of the target organ. The mechanistic interpretation of kidney PTD-associated deregulated transcripts revealed, in addition to previously described kidney damage transcript biomarkers such as KIM-1, CLU and TIMP-1, a number of additional deregulated pathways congruent with histopathology observations on a single animal basis, including a specific effect on the complement system. The identification of new, more specific biomarker candidates for PTD was most successful when transcriptomics data were used. Combining transcriptomics data with proteomics data added extra value.« less

  7. Advances in the integration of transcriptional regulatory information into genome-scale metabolic models.

    PubMed

    Vivek-Ananth, R P; Samal, Areejit

    2016-09-01

    A major goal of systems biology is to build predictive computational models of cellular metabolism. Availability of complete genome sequences and wealth of legacy biochemical information has led to the reconstruction of genome-scale metabolic networks in the last 15 years for several organisms across the three domains of life. Due to paucity of information on kinetic parameters associated with metabolic reactions, the constraint-based modelling approach, flux balance analysis (FBA), has proved to be a vital alternative to investigate the capabilities of reconstructed metabolic networks. In parallel, advent of high-throughput technologies has led to the generation of massive amounts of omics data on transcriptional regulation comprising mRNA transcript levels and genome-wide binding profile of transcriptional regulators. A frontier area in metabolic systems biology has been the development of methods to integrate the available transcriptional regulatory information into constraint-based models of reconstructed metabolic networks in order to increase the predictive capabilities of computational models and understand the regulation of cellular metabolism. Here, we review the existing methods to integrate transcriptional regulatory information into constraint-based models of metabolic networks. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  8. Integrated multi-omic analysis of host-microbiota interactions in acute oak decline.

    PubMed

    Broberg, Martin; Doonan, James; Mundt, Filip; Denman, Sandra; McDonald, James E

    2018-01-30

    Britain's native oak species are currently under threat from acute oak decline (AOD), a decline-disease where stem bleeds overlying necrotic lesions in the inner bark and larval galleries of the bark-boring beetle, Agrilus biguttatus, represent the primary symptoms. It is known that complex interactions between the plant host and its microbiome, i.e. the holobiont, significantly influence the health status of the plant. In AOD, necrotic lesions are caused by a microbiome shift to a pathobiome consisting predominantly of Brenneria goodwinii, Gibbsiella quercinecans, Rahnella victoriana and potentially other bacteria. However, the specific mechanistic processes of the microbiota causing tissue necrosis, and the host response, have not been established and represent a barrier to understanding and managing this decline. We profiled the metagenome, metatranscriptome and metaproteome of inner bark tissue from AOD symptomatic and non-symptomatic trees to characterise microbiota-host interactions. Active bacterial virulence factors such as plant cell wall-degrading enzymes, reactive oxygen species defence and flagella in AOD lesions, along with host defence responses including reactive oxygen species, cell wall modification and defence regulators were identified. B. goodwinii dominated the lesion microbiome, with significant expression of virulence factors such as the phytopathogen effector avrE. A smaller proportion of microbiome activity was attributed to G. quercinecans and R. victoriana. In addition, we describe for the first time the potential role of two previously uncharacterised Gram-positive bacteria predicted from metagenomic binning and identified as active in the AOD lesion metatranscriptome and metaproteome, implicating them in lesion formation. This multi-omic study provides novel functional insights into microbiota-host interactions in AOD, a complex arboreal decline disease where polymicrobial-host interactions result in lesion formation on tree stems. We present the first descriptions of holobiont function in oak health and disease, specifically, the relative lesion activity of B. goodwinii, G. quercinecans, Rahnella victoriana and other bacteria. Thus, the research presented here provides evidence of some of the mechanisms used by members of the lesion microbiome and a template for future multi-omic research into holobiont characterisation, plant polymicrobial diseases and pathogen defence in trees.

  9. A review on machine learning principles for multi-view biological data integration.

    PubMed

    Li, Yifeng; Wu, Fang-Xiang; Ngom, Alioune

    2018-03-01

    Driven by high-throughput sequencing techniques, modern genomic and clinical studies are in a strong need of integrative machine learning models for better use of vast volumes of heterogeneous information in the deep understanding of biological systems and the development of predictive models. How data from multiple sources (called multi-view data) are incorporated in a learning system is a key step for successful analysis. In this article, we provide a comprehensive review on omics and clinical data integration techniques, from a machine learning perspective, for various analyses such as prediction, clustering, dimension reduction and association. We shall show that Bayesian models are able to use prior information and model measurements with various distributions; tree-based methods can either build a tree with all features or collectively make a final decision based on trees learned from each view; kernel methods fuse the similarity matrices learned from individual views together for a final similarity matrix or learning model; network-based fusion methods are capable of inferring direct and indirect associations in a heterogeneous network; matrix factorization models have potential to learn interactions among features from different views; and a range of deep neural networks can be integrated in multi-modal learning for capturing the complex mechanism of biological systems.

  10. Qupe--a Rich Internet Application to take a step forward in the analysis of mass spectrometry-based quantitative proteomics experiments.

    PubMed

    Albaum, Stefan P; Neuweger, Heiko; Fränzel, Benjamin; Lange, Sita; Mertens, Dominik; Trötschel, Christian; Wolters, Dirk; Kalinowski, Jörn; Nattkemper, Tim W; Goesmann, Alexander

    2009-12-01

    The goal of present -omics sciences is to understand biological systems as a whole in terms of interactions of the individual cellular components. One of the main building blocks in this field of study is proteomics where tandem mass spectrometry (LC-MS/MS) in combination with isotopic labelling techniques provides a common way to obtain a direct insight into regulation at the protein level. Methods to identify and quantify the peptides contained in a sample are well established, and their output usually results in lists of identified proteins and calculated relative abundance values. The next step is to move ahead from these abstract lists and apply statistical inference methods to compare measurements, to identify genes that are significantly up- or down-regulated, or to detect clusters of proteins with similar expression profiles. We introduce the Rich Internet Application (RIA) Qupe providing comprehensive data management and analysis functions for LC-MS/MS experiments. Starting with the import of mass spectra data the system guides the experimenter through the process of protein identification by database search, the calculation of protein abundance ratios, and in particular, the statistical evaluation of the quantification results including multivariate analysis methods such as analysis of variance or hierarchical cluster analysis. While a data model to store these results has been developed, a well-defined programming interface facilitates the integration of novel approaches. A compute cluster is utilized to distribute computationally intensive calculations, and a web service allows to interchange information with other -omics software applications. To demonstrate that Qupe represents a step forward in quantitative proteomics analysis an application study on Corynebacterium glutamicum has been carried out. Qupe is implemented in Java utilizing Hibernate, Echo2, R and the Spring framework. We encourage the usage of the RIA in the sense of the 'software as a service' concept, maintained on our servers and accessible at the following location: http://qupe.cebitec.uni-bielefeld.de. Supplementary data are available at Bioinformatics online.

  11. A roadmap towards personalized immunology.

    PubMed

    Delhalle, Sylvie; Bode, Sebastian F N; Balling, Rudi; Ollert, Markus; He, Feng Q

    2018-01-01

    Big data generation and computational processing will enable medicine to evolve from a "one-size-fits-all" approach to precise patient stratification and treatment. Significant achievements using "Omics" data have been made especially in personalized oncology. However, immune cells relative to tumor cells show a much higher degree of complexity in heterogeneity, dynamics, memory-capability, plasticity and "social" interactions. There is still a long way ahead on translating our capability to identify potentially targetable personalized biomarkers into effective personalized therapy in immune-centralized diseases. Here, we discuss the recent advances and successful applications in "Omics" data utilization and network analysis on patients' samples of clinical trials and studies, as well as the major challenges and strategies towards personalized stratification and treatment for infectious or non-communicable inflammatory diseases such as autoimmune diseases or allergies. We provide a roadmap and highlight experimental, clinical, computational analysis, data management, ethical and regulatory issues to accelerate the implementation of personalized immunology.

  12. 'Omics' techniques for identifying flooding-response mechanisms in soybean.

    PubMed

    Komatsu, Setsuko; Shirasaka, Naoki; Sakata, Katsumi

    2013-11-20

    Plant growth and productivity are adversely influenced by various environmental stresses, which often lead to reduced seedling growth and decreased crop yields. Plants respond to stressful conditions through changes in 'omics' profiles, including transcriptomics, proteomics, and metabolomics. Linking plant phenotype to gene expression patterns, protein abundance, and metabolite accumulation is one of the main challenges for improving agricultural production. 'Omics' approaches may shed insight into the mechanisms that function in soybean in response to environmental stresses, particularly flooding by frequent rain, which occurs worldwide due to changes in global climate. Flooding causes significant reductions in the growth and yield of several crops, especially soybean. The application of 'omics' techniques may facilitate the development of flood-tolerant cultivars of soybean. In this review, the use of 'omics' techniques towards understanding the flooding-responsive mechanisms of soybeans is discussed, as the findings from these studies are expected to have applications in both breeding and agronomy. This article is part of a Special Issue entitled: Translational Plant Proteomics. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. Omicseq: a web-based search engine for exploring omics datasets

    PubMed Central

    Sun, Xiaobo; Pittard, William S.; Xu, Tianlei; Chen, Li; Zwick, Michael E.; Jiang, Xiaoqian; Wang, Fusheng

    2017-01-01

    Abstract The development and application of high-throughput genomics technologies has resulted in massive quantities of diverse omics data that continue to accumulate rapidly. These rich datasets offer unprecedented and exciting opportunities to address long standing questions in biomedical research. However, our ability to explore and query the content of diverse omics data is very limited. Existing dataset search tools rely almost exclusively on the metadata. A text-based query for gene name(s) does not work well on datasets wherein the vast majority of their content is numeric. To overcome this barrier, we have developed Omicseq, a novel web-based platform that facilitates the easy interrogation of omics datasets holistically to improve ‘findability’ of relevant data. The core component of Omicseq is trackRank, a novel algorithm for ranking omics datasets that fully uses the numerical content of the dataset to determine relevance to the query entity. The Omicseq system is supported by a scalable and elastic, NoSQL database that hosts a large collection of processed omics datasets. In the front end, a simple, web-based interface allows users to enter queries and instantly receive search results as a list of ranked datasets deemed to be the most relevant. Omicseq is freely available at http://www.omicseq.org. PMID:28402462

  14. Assessment of berberine as a multi-target antimicrobial: a multi-omics study for drug discovery and repositioning.

    PubMed

    Karaosmanoglu, Kubra; Sayar, Nihat Alpagu; Kurnaz, Isil Aksan; Akbulut, Berna Sariyar

    2014-01-01

    Postgenomics drug development is undergoing major transformation in the age of multi-omics studies and drug repositioning. Rather than applications solely in personalized medicine, omics science thus additionally offers a better understanding of a broader range of drug targets and drug repositioning. Berberine is an isoquinoline alkaloid found in many medicinal plants. We report here a whole genome microarray study in tandem with proteomics techniques for mining the plethora of targets that are putatively involved in the antimicrobial activity of berberine against Escherichia coli. We found DNA replication/repair and transcription to be triggered by berberine, indicating that nucleic acids, in general, are among its targets. Our combined transcriptomics and proteomics multi-omics findings underscore that, in the presence of berberine, cell wall or cell membrane transport and motility-related functions are also specifically regulated. We further report a general decline in metabolism, as seen by repression of genes in carbohydrate and amino acid metabolism, energy production, and conversion. An involvement of multidrug efflux pumps, as well as reduced membrane permeability for developing resistance against berberine in E. coli was noted. Collectively, these findings offer original and significant leads for omics-guided drug discovery and future repositioning approaches in the postgenomics era, using berberine as a multi-omics case study.

  15. Renal injury in neonates: use of "omics" for developing precision medicine in neonatology.

    PubMed

    Joshi, Mandar S; Montgomery, Kelsey A; Giannone, Peter J; Bauer, John A; Hanna, Mina H

    2017-01-01

    Preterm birth is associated with increased risks of morbidity and mortality along with increased healthcare costs. Advances in medicine have enhanced survival for preterm infants but the overall incidence of major morbidities has changed very little. Abnormal renal development is an important consequence of premature birth. Acute kidney injury (AKI) in the neonatal period is multifactorial and may increase lifetime risk of chronic kidney disease.Traditional biomarkers in newborns suffer from considerable confounders, limiting their use for early identification of AKI. There is a need to develop novel biomarkers that can identify, in real time, the evolution of renal dysfunction in an early diagnostic, monitoring and prognostic fashion. Use of "omics", particularly metabolomics, may provide valuable information regarding functional pathways underlying AKI and prediction of clinical outcomes.The emerging knowledge generated by the application of "omics" (genomics, proteomics, metabolomics) in neonatology provides new insights that can help to identify markers of early diagnosis, disease progression, and identify new therapeutic targets. Additionally, omics will have major implications in the field of personalized healthcare in the future. Here, we will review the current knowledge of different omics technologies in neonatal-perinatal medicine including biomarker discovery, defining as yet unrecognized biologic therapeutic targets, and linking of omics to relevant standard indices and long-term outcomes.

  16. Understanding butanol tolerance and assimilation in Pseudomonas putida BIRD-1: an integrated omics approach.

    PubMed

    Cuenca, María del Sol; Roca, Amalia; Molina-Santiago, Carlos; Duque, Estrella; Armengaud, Jean; Gómez-Garcia, María R; Ramos, Juan L

    2016-01-01

    Pseudomonas putida BIRD-1 has the potential to be used for the industrial production of butanol due to its solvent tolerance and ability to metabolize low-cost compounds. However, the strain has two major limitations: it assimilates butanol as sole carbon source and butanol concentrations above 1% (v/v) are toxic. With the aim of facilitating BIRD-1 strain design for industrial use, a genome-wide mini-Tn5 transposon mutant library was screened for clones exhibiting increased butanol sensitivity or deficiency in butanol assimilation. Twenty-one mutants were selected that were affected in one or both of the processes. These mutants exhibited insertions in various genes, including those involved in the TCA cycle, fatty acid metabolism, transcription, cofactor synthesis and membrane integrity. An omics-based analysis revealed key genes involved in the butanol response. Transcriptomic and proteomic studies were carried out to compare short and long-term tolerance and assimilation traits. Pseudomonas putida initiates various butanol assimilation pathways via alcohol and aldehyde dehydrogenases that channel the compound to central metabolism through the glyoxylate shunt pathway. Accordingly, isocitrate lyase - a key enzyme of the pathway - was the most abundant protein when butanol was used as the sole carbon source. Upregulation of two genes encoding proteins PPUBIRD1_2240 and PPUBIRD1_2241 (acyl-CoA dehydrogenase and acyl-CoA synthetase respectively) linked butanol assimilation with acyl-CoA metabolism. Butanol tolerance was found to be primarily linked to classic solvent defense mechanisms, such as efflux pumps, membrane modifications and control of redox state. Our results also highlight the intensive energy requirements for butanol production and tolerance; thus, enhancing TCA cycle operation may represent a promising strategy for enhanced butanol production. © 2015 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.

  17. Integrated omics analyses reveal the details of metabolic adaptation of Clostridium thermocellum to lignocellulose-derived growth inhibitors released during the deconstruction of switchgrass.

    PubMed

    Poudel, Suresh; Giannone, Richard J; Rodriguez, Miguel; Raman, Babu; Martin, Madhavi Z; Engle, Nancy L; Mielenz, Jonathan R; Nookaew, Intawat; Brown, Steven D; Tschaplinski, Timothy J; Ussery, David; Hettich, Robert L

    2017-01-01

    Clostridium thermocellum is capable of solubilizing and converting lignocellulosic biomass into ethanol. Although much of the work-to-date has centered on characterizing this microbe's growth on model cellulosic substrates, such as cellobiose, Avicel, or filter paper, it is vitally important to understand its metabolism on more complex, lignocellulosic substrates to identify relevant industrial bottlenecks that could undermine efficient biofuel production. To this end, we have examined a time course progression of C. thermocellum grown on switchgrass to assess the metabolic and protein changes that occur during the conversion of plant biomass to ethanol. The most striking feature of the metabolome was the observed accumulation of long-chain, branched fatty acids over time, implying an adaptive restructuring of C. thermocellum's cellular membrane as the culture progresses. This is undoubtedly a response to the gradual accumulation of lignocellulose-derived inhibitory compounds as the organism deconstructs the switchgrass to access the embedded cellulose. Corroborating the metabolomics data, proteomic analysis revealed a corresponding time-dependent increase in various enzymes, including those involved in the interconversion of branched amino acids valine, leucine, and isoleucine to iso- and anteiso-fatty acid precursors. Additionally, the metabolic accumulation of hemicellulose-derived sugars and sugar alcohols concomitant with increased abundance of enzymes involved in C5 sugar metabolism/pentose phosphate pathway indicates that C. thermocellum shifts glycolytic intermediates to alternate pathways to modulate overall carbon flux in response to C5 sugar metabolites that increase during lignocellulose deconstruction. Integrated omic platforms provided complementary systems biological information that highlight C. thermocellum 's specific response to cytotoxic inhibitors released during the deconstruction and utilization of switchgrass. These additional viewpoints allowed us to fully realize the level to which the organism adapts to an increasingly challenging culture environment-information that will prove critical to C. thermocellum 's industrial efficacy.

  18. Chianina beef tenderness investigated through integrated Omics.

    PubMed

    D'Alessandro, Angelo; Marrocco, Cristina; Rinalducci, Sara; Mirasole, Cristiana; Failla, Sebastiana; Zolla, Lello

    2012-07-19

    In the present study we performed an integrated proteomics, interactomics and metabolomics analysis of Longissimus dorsi tender and tough meat samples from Chianina beef cattle. Results were statistically handled as to obtain Pearson's correlation coefficients of the results from Omics investigation in relation to canonical tenderness-related parameters, including Warner Bratzler shear force, myofibrillar degradation (at 48 h and 10 days after slaughter), sarcomere length and total collagen content. As a result, we could observe that the tender meat group was characterized by higher levels of glycolytic enzymes, which were over-phosphorylated and produced accumulation of glycolytic intermediates. Oxidative stress promoted meat tenderness and elicited heat shock protein responses, which in turn triggered apoptosis-like cascades along with PARP fragmentation. Phosphorylation was found to be a key process in post mortem muscle conversion to meat, as it was shown not only to modulate glycolytic enzyme activities, but also mediate the stability of structural proteins at the Z-disk. On the other hand, phosphorylation of HSPs has been supposed to alter their functions through changing their affinity for target interactors. Analogies and breed-specific differences are highlighted throughout the text via a direct comparison of the present results against the ones obtained in a parallel study on Maremmana Longissimus dorsi. It emerges that, while the main cornerstones and the final outcome are maintained, post mortem metabolism in tender and tough meat yielding individuals is subtly modulated via specific higher levels of enzymes and amino acidic residue phosphorylation in a breed-specific fashion, and whether calcium homeostasis dysregulation was a key factor in Maremmana, higher early post mortem phosphocreatine levels in the Chianina tender group could favor a slower and prolonged glycolytic rate, prolonging the extent of the minimum hanging period necessary to obtain tender meat from this breed by a few days. Copyright © 2012 Elsevier B.V. All rights reserved.

  19. The High-Throughput Analyses Era: Are We Ready for the Data Struggle?

    PubMed

    D'Argenio, Valeria

    2018-03-02

    Recent and rapid technological advances in molecular sciences have dramatically increased the ability to carry out high-throughput studies characterized by big data production. This, in turn, led to the consequent negative effect of highlighting the presence of a gap between data yield and their analysis. Indeed, big data management is becoming an increasingly important aspect of many fields of molecular research including the study of human diseases. Now, the challenge is to identify, within the huge amount of data obtained, that which is of clinical relevance. In this context, issues related to data interpretation, sharing and storage need to be assessed and standardized. Once this is achieved, the integration of data from different -omic approaches will improve the diagnosis, monitoring and therapy of diseases by allowing the identification of novel, potentially actionably biomarkers in view of personalized medicine.

  20. BioContainers: an open-source and community-driven framework for software standardization.

    PubMed

    da Veiga Leprevost, Felipe; Grüning, Björn A; Alves Aflitos, Saulo; Röst, Hannes L; Uszkoreit, Julian; Barsnes, Harald; Vaudel, Marc; Moreno, Pablo; Gatto, Laurent; Weber, Jonas; Bai, Mingze; Jimenez, Rafael C; Sachsenberg, Timo; Pfeuffer, Julianus; Vera Alvarez, Roberto; Griss, Johannes; Nesvizhskii, Alexey I; Perez-Riverol, Yasset

    2017-08-15

    BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters). The software is freely available at github.com/BioContainers/. yperez@ebi.ac.uk. © The Author(s) 2017. Published by Oxford University Press.

  1. BioContainers: an open-source and community-driven framework for software standardization

    PubMed Central

    da Veiga Leprevost, Felipe; Grüning, Björn A.; Alves Aflitos, Saulo; Röst, Hannes L.; Uszkoreit, Julian; Barsnes, Harald; Vaudel, Marc; Moreno, Pablo; Gatto, Laurent; Weber, Jonas; Bai, Mingze; Jimenez, Rafael C.; Sachsenberg, Timo; Pfeuffer, Julianus; Vera Alvarez, Roberto; Griss, Johannes; Nesvizhskii, Alexey I.; Perez-Riverol, Yasset

    2017-01-01

    Abstract Motivation BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters). Availability and Implementation The software is freely available at github.com/BioContainers/. Contact yperez@ebi.ac.uk PMID:28379341

  2. Challenges of molecular nutrition research 6: the nutritional phenotype database to store, share and evaluate nutritional systems biology studies

    PubMed Central

    Bouwman, Jildau; Dragsted, Lars O.; Drevon, Christian A.; Elliott, Ruan; de Groot, Philip; Kaput, Jim; Mathers, John C.; Müller, Michael; Pepping, Fre; Saito, Jahn; Scalbert, Augustin; Radonjic, Marijana; Rocca-Serra, Philippe; Travis, Anthony; Wopereis, Suzan; Evelo, Chris T.

    2010-01-01

    The challenge of modern nutrition and health research is to identify food-based strategies promoting life-long optimal health and well-being. This research is complex because it exploits a multitude of bioactive compounds acting on an extensive network of interacting processes. Whereas nutrition research can profit enormously from the revolution in ‘omics’ technologies, it has discipline-specific requirements for analytical and bioinformatic procedures. In addition to measurements of the parameters of interest (measures of health), extensive description of the subjects of study and foods or diets consumed is central for describing the nutritional phenotype. We propose and pursue an infrastructural activity of constructing the “Nutritional Phenotype database” (dbNP). When fully developed, dbNP will be a research and collaboration tool and a publicly available data and knowledge repository. Creation and implementation of the dbNP will maximize benefits to the research community by enabling integration and interrogation of data from multiple studies, from different research groups, different countries and different—omics levels. The dbNP is designed to facilitate storage of biologically relevant, pre-processed—omics data, as well as study descriptive and study participant phenotype data. It is also important to enable the combination of this information at different levels (e.g. to facilitate linkage of data describing participant phenotype, genotype and food intake with information on study design and—omics measurements, and to combine all of this with existing knowledge). The biological information stored in the database (i.e. genetics, transcriptomics, proteomics, biomarkers, metabolomics, functional assays, food intake and food composition) is tailored to nutrition research and embedded in an environment of standard procedures and protocols, annotations, modular data-basing, networking and integrated bioinformatics. The dbNP is an evolving enterprise, which is only sustainable if it is accepted and adopted by the wider nutrition and health research community as an open source, pre-competitive and publicly available resource where many partners both can contribute and profit from its developments. We introduce the Nutrigenomics Organisation (NuGO, http://www.nugo.org) as a membership association responsible for establishing and curating the dbNP. Within NuGO, all efforts related to dbNP (i.e. usage, coordination, integration, facilitation and maintenance) will be directed towards a sustainable and federated infrastructure. PMID:21052526

  3. Benchmarking Water Quality from Wastewater to Drinking Waters Using Reduced Transcriptome of Human Cells.

    PubMed

    Xia, Pu; Zhang, Xiaowei; Zhang, Hanxin; Wang, Pingping; Tian, Mingming; Yu, Hongxia

    2017-08-15

    One of the major challenges in environmental science is monitoring and assessing the risk of complex environmental mixtures. In vitro bioassays with limited key toxicological end points have been shown to be suitable to evaluate mixtures of organic pollutants in wastewater and recycled water. Omics approaches such as transcriptomics can monitor biological effects at the genome scale. However, few studies have applied omics approach in the assessment of mixtures of organic micropollutants. Here, an omics approach was developed for profiling bioactivity of 10 water samples ranging from wastewater to drinking water in human cells by a reduced human transcriptome (RHT) approach and dose-response modeling. Transcriptional expression of 1200 selected genes were measured by an Ampliseq technology in two cell lines, HepG2 and MCF7, that were exposed to eight serial dilutions of each sample. Concentration-effect models were used to identify differentially expressed genes (DEGs) and to calculate effect concentrations (ECs) of DEGs, which could be ranked to investigate low dose response. Furthermore, molecular pathways disrupted by different samples were evaluated by Gene Ontology (GO) enrichment analysis. The ability of RHT for representing bioactivity utilizing both HepG2 and MCF7 was shown to be comparable to the results of previous in vitro bioassays. Finally, the relative potencies of the mixtures indicated by RHT analysis were consistent with the chemical profiles of the samples. RHT analysis with human cells provides an efficient and cost-effective approach to benchmarking mixture of micropollutants and may offer novel insight into the assessment of mixture toxicity in water.

  4. Biodiversity Meets Neuroscience: From the Sequencing Ship (Ship-Seq) to Deciphering Parallel Evolution of Neural Systems in Omic's Era.

    PubMed

    Moroz, Leonid L

    2015-12-01

    The origins of neural systems and centralized brains are one of the major transitions in evolution. These events might occur more than once over 570-600 million years. The convergent evolution of neural circuits is evident from a diversity of unique adaptive strategies implemented by ctenophores, cnidarians, acoels, molluscs, and basal deuterostomes. But, further integration of biodiversity research and neuroscience is required to decipher critical events leading to development of complex integrative and cognitive functions. Here, we outline reference species and interdisciplinary approaches in reconstructing the evolution of nervous systems. In the "omic" era, it is now possible to establish fully functional genomics laboratories aboard of oceanic ships and perform sequencing and real-time analyses of data at any oceanic location (named here as Ship-Seq). In doing so, fragile, rare, cryptic, and planktonic organisms, or even entire marine ecosystems, are becoming accessible directly to experimental and physiological analyses by modern analytical tools. Thus, we are now in a position to take full advantages from countless "experiments" Nature performed for us in the course of 3.5 billion years of biological evolution. Together with progress in computational and comparative genomics, evolutionary neuroscience, proteomic and developmental biology, a new surprising picture is emerging that reveals many ways of how nervous systems evolved. As a result, this symposium provides a unique opportunity to revisit old questions about the origins of biological complexity. © The Author 2015. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.

  5. A semantic web framework to integrate cancer omics data with biological knowledge.

    PubMed

    Holford, Matthew E; McCusker, James P; Cheung, Kei-Hoi; Krauthammer, Michael

    2012-01-25

    The RDF triple provides a simple linguistic means of describing limitless types of information. Triples can be flexibly combined into a unified data source we call a semantic model. Semantic models open new possibilities for the integration of variegated biological data. We use Semantic Web technology to explicate high throughput clinical data in the context of fundamental biological knowledge. We have extended Corvus, a data warehouse which provides a uniform interface to various forms of Omics data, by providing a SPARQL endpoint. With the querying and reasoning tools made possible by the Semantic Web, we were able to explore quantitative semantic models retrieved from Corvus in the light of systematic biological knowledge. For this paper, we merged semantic models containing genomic, transcriptomic and epigenomic data from melanoma samples with two semantic models of functional data - one containing Gene Ontology (GO) data, the other, regulatory networks constructed from transcription factor binding information. These two semantic models were created in an ad hoc manner but support a common interface for integration with the quantitative semantic models. Such combined semantic models allow us to pose significant translational medicine questions. Here, we study the interplay between a cell's molecular state and its response to anti-cancer therapy by exploring the resistance of cancer cells to Decitabine, a demethylating agent. We were able to generate a testable hypothesis to explain how Decitabine fights cancer - namely, that it targets apoptosis-related gene promoters predominantly in Decitabine-sensitive cell lines, thus conveying its cytotoxic effect by activating the apoptosis pathway. Our research provides a framework whereby similar hypotheses can be developed easily.

  6. Meat science: From proteomics to integrated omics towards system biology.

    PubMed

    D'Alessandro, Angelo; Zolla, Lello

    2013-01-14

    Since the main ultimate goal of farm animal raising is the production of proteins for human consumption, research tools to investigate proteins play a major role in farm animal and meat science. Indeed, proteomics has been applied to the field of farm animal science to monitor in vivo performances of livestock animals (growth performances, fertility, milk quality etc.), but also to further our understanding of the molecular processes at the basis of meat quality, which are largely dependent on the post mortem biochemistry of the muscle, often in a species-specific way. Post mortem alterations to the muscle proteome reflect the biological complexity of the process of "muscle to meat conversion," a process that, despite decades of advancements, is all but fully understood. This is mainly due to the enormous amounts of variables affecting meat tenderness per se, including biological factors, such as animal species, breed specific-characteristic, muscle under investigation. However, it is rapidly emerging that the tender meat phenotype is not only tied to genetics (livestock breeding selection), but also to extrinsic factors, such as the rearing environment, feeding conditions, physical activity, administration of hormonal growth promotants, pre-slaughter handling and stress, post mortem handling. From this intricate scenario, biochemical approaches and systems-wide integrated investigations (metabolomics, transcriptomics, interactomics, phosphoproteomics, mathematical modeling), which have emerged as complementary tools to proteomics, have helped establishing a few milestones in our understanding of the events leading from muscle to meat conversion. The growing integration of omics disciplines in the field of systems biology will soon contribute to take further steps forward. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Microbial community structure and functioning in marine sediments associated with diffuse hydrothermal venting assessed by integrated meta-omics.

    PubMed

    Urich, Tim; Lanzén, Anders; Stokke, Runar; Pedersen, Rolf B; Bayer, Christoph; Thorseth, Ingunn H; Schleper, Christa; Steen, Ida H; Ovreas, Lise

    2014-09-01

    Deep-sea hydrothermal vents are unique environments on Earth, as they host chemosynthetic ecosystems fuelled by geochemical energy with chemolithoautotrophic microorganisms at the basis of the food webs. Whereas discrete high-temperature venting systems have been studied extensively, the microbiotas associated with low-temperature diffuse venting are not well understood. We analysed the structure and functioning of microbial communities in two diffuse venting sediments from the Jan Mayen vent fields in the Norwegian-Greenland Sea, applying an integrated 'omics' approach combining metatranscriptomics, metaproteomics and metagenomics. Polymerase chain reaction-independent three-domain community profiling showed that the two sediments hosted highly similar communities dominated by Epsilonproteobacteria, Deltaproteobacteria and Gammaproteobacteria, besides ciliates, nematodes and various archaeal taxa. Active metabolic pathways were identified through transcripts and peptides, with genes of sulphur and methane oxidation, and carbon fixation pathways highly expressed, in addition to genes of aerobic and anaerobic (nitrate and sulphate) respiratory chains. High expression of chemotaxis and flagella genes reflected a lifestyle in a dynamic habitat rich in physico-chemical gradients. The major metabolic pathways could be assigned to distinct taxonomic groups, thus enabling hypotheses about the function of the different prokaryotic and eukaryotic taxa. This study advances our understanding of the functioning of microbial communities in diffuse hydrothermal venting sediments. © 2013 Society for Applied Microbiology and John Wiley & Sons Ltd.

  8. From big data analysis to personalized medicine for all: challenges and opportunities.

    PubMed

    Alyass, Akram; Turcotte, Michelle; Meyre, David

    2015-06-27

    Recent advances in high-throughput technologies have led to the emergence of systems biology as a holistic science to achieve more precise modeling of complex diseases. Many predict the emergence of personalized medicine in the near future. We are, however, moving from two-tiered health systems to a two-tiered personalized medicine. Omics facilities are restricted to affluent regions, and personalized medicine is likely to widen the growing gap in health systems between high and low-income countries. This is mirrored by an increasing lag between our ability to generate and analyze big data. Several bottlenecks slow-down the transition from conventional to personalized medicine: generation of cost-effective high-throughput data; hybrid education and multidisciplinary teams; data storage and processing; data integration and interpretation; and individual and global economic relevance. This review provides an update of important developments in the analysis of big data and forward strategies to accelerate the global transition to personalized medicine.

  9. Experimental Design in Clinical 'Omics Biomarker Discovery.

    PubMed

    Forshed, Jenny

    2017-11-03

    This tutorial highlights some issues in the experimental design of clinical 'omics biomarker discovery, how to avoid bias and get as true quantities as possible from biochemical analyses, and how to select samples to improve the chance of answering the clinical question at issue. This includes the importance of defining clinical aim and end point, knowing the variability in the results, randomization of samples, sample size, statistical power, and how to avoid confounding factors by including clinical data in the sample selection, that is, how to avoid unpleasant surprises at the point of statistical analysis. The aim of this Tutorial is to help translational clinical and preclinical biomarker candidate research and to improve the validity and potential of future biomarker candidate findings.

  10. Single-cell sequencing in stem cell biology.

    PubMed

    Wen, Lu; Tang, Fuchou

    2016-04-15

    Cell-to-cell variation and heterogeneity are fundamental and intrinsic characteristics of stem cell populations, but these differences are masked when bulk cells are used for omic analysis. Single-cell sequencing technologies serve as powerful tools to dissect cellular heterogeneity comprehensively and to identify distinct phenotypic cell types, even within a 'homogeneous' stem cell population. These technologies, including single-cell genome, epigenome, and transcriptome sequencing technologies, have been developing rapidly in recent years. The application of these methods to different types of stem cells, including pluripotent stem cells and tissue-specific stem cells, has led to exciting new findings in the stem cell field. In this review, we discuss the recent progress as well as future perspectives in the methodologies and applications of single-cell omic sequencing technologies.

  11. Multiscale Integration of -Omic, Imaging, and Clinical Data in Biomedical Informatics

    PubMed Central

    Phan, John H.; Quo, Chang F.; Cheng, Chihwen; Wang, May Dongmei

    2016-01-01

    This paper reviews challenges and opportunities in multiscale data integration for biomedical informatics. Biomedical data can come from different biological origins, data acquisition technologies, and clinical applications. Integrating such data across multiple scales (e.g., molecular, cellular/tissue, and patient) can lead to more informed decisions for personalized, predictive, and preventive medicine. However, data heterogeneity, community standards in data acquisition, and computational complexity are big challenges for such decision making. This review describes genomic and proteomic (i.e., molecular), histopathological imaging (i.e., cellular/tissue), and clinical (i.e., patient) data; it includes case studies for single-scale (e.g., combining genomic or histopathological image data), multiscale (e.g., combining histopathological image and clinical data), and multiscale and multiplatform (e.g., the Human Protein Atlas and The Cancer Genome Atlas) data integration. Numerous opportunities exist in biomedical informatics research focusing on integration of multiscale and multiplatform data. PMID:23231990

  12. Multiscale integration of -omic, imaging, and clinical data in biomedical informatics.

    PubMed

    Phan, John H; Quo, Chang F; Cheng, Chihwen; Wang, May Dongmei

    2012-01-01

    This paper reviews challenges and opportunities in multiscale data integration for biomedical informatics. Biomedical data can come from different biological origins, data acquisition technologies, and clinical applications. Integrating such data across multiple scales (e.g., molecular, cellular/tissue, and patient) can lead to more informed decisions for personalized, predictive, and preventive medicine. However, data heterogeneity, community standards in data acquisition, and computational complexity are big challenges for such decision making. This review describes genomic and proteomic (i.e., molecular), histopathological imaging (i.e., cellular/tissue), and clinical (i.e., patient) data; it includes case studies for single-scale (e.g., combining genomic or histopathological image data), multiscale (e.g., combining histopathological image and clinical data), and multiscale and multiplatform (e.g., the Human Protein Atlas and The Cancer Genome Atlas) data integration. Numerous opportunities exist in biomedical informatics research focusing on integration of multiscale and multiplatform data.

  13. Raising orphans from a metadata morass: A researcher's guide to re-use of public 'omics data.

    PubMed

    Bhandary, Priyanka; Seetharam, Arun S; Arendsee, Zebulun W; Hur, Manhoi; Wurtele, Eve Syrkin

    2018-02-01

    More than 15 petabases of raw RNAseq data is now accessible through public repositories. Acquisition of other 'omics data types is expanding, though most lack a centralized archival repository. Data-reuse provides tremendous opportunity to extract new knowledge from existing experiments, and offers a unique opportunity for robust, multi-'omics analyses by merging metadata (information about experimental design, biological samples, protocols) and data from multiple experiments. We illustrate how predictive research can be accelerated by meta-analysis with a study of orphan (species-specific) genes. Computational predictions are critical to infer orphan function because their coding sequences provide very few clues. The metadata in public databases is often confusing; a test case with Zea mays mRNA seq data reveals a high proportion of missing, misleading or incomplete metadata. This metadata morass significantly diminishes the insight that can be extracted from these data. We provide tips for data submitters and users, including specific recommendations to improve metadata quality by more use of controlled vocabulary and by metadata reviews. Finally, we advocate for a unified, straightforward metadata submission and retrieval system. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. [Exposome: from an intuition to a mandatory research field in occupational and enviromental medicine.

    PubMed

    Paganelli, Matteo; De Palma, Giuseppe; Apostoli, Pietro

    2017-11-01

    As Genomics aims at the collective characterization and quantification of genes, exposomics refers to the totality of lifetime environmental exposures, consisting in a novel approach to studying the role of the environment in human disease. The aim is to assess all human environmental and occupational exposures in order to better understand their contribution to human diseases. The "omics" revolution infact mostly regards the underlying method: scientific knowledge is expected to come from the analysis of increasingly extensive databases. The primary focus is on air pollution and water contaminants, but all the determinants of human exposure are conceptually part of the idea of exposome, including physical and psychological factors. Using 'omic' techniques the collected exposure data can be linked to biochemical and molecular changes in our body. Since the first formulation of the idea itself of Exposome many efforts have been made to translate the concept into research, in particular two important studies have been started in Europe. We herein suggest that Occupational Medicine could be a precious contributor to the growth of exposure science also in its omic side thanks to the methods and to the knowledges part of our background. Copyright© by Aracne Editrice, Roma, Italy.

  15. An integrated omic analysis of hepatic alteration in medaka fish chronically exposed to cyanotoxins with possible mechanisms of reproductive toxicity.

    PubMed

    Qiao, Qin; Le Manach, Séverine; Huet, Hélène; Duvernois-Berthet, Evelyne; Chaouch, Soraya; Duval, Charlotte; Sotton, Benoit; Ponger, Loïc; Marie, Arul; Mathéron, Lucrèce; Lennon, Sarah; Bolbach, Gérard; Djediat, Chakib; Bernard, Cécile; Edery, Marc; Marie, Benjamin

    2016-12-01

    Cyanobacterial blooms threaten human health as well as the population of other living organisms in the aquatic environment, particularly due to the production of natural toxic components, the cyanotoxin. So far, the most studied cyanotoxins are microcystins (MCs). In this study, the hepatic alterations at histological, proteome and transcriptome levels were evaluated in female and male medaka fish chronically exposed to 1 and 5 μg L -1 microcystin-LR (MC-LR) and to the extract of MC-producing Microcystis aeruginosa PCC 7820 (5 μg L -1 of equivalent MC-LR) by balneation for 28 days, aiming at enhancing our understanding of the potential reproductive toxicity of cyanotoxins in aquatic vertebrate models. Indeed, both MC and Microcystis extract adversely affect reproductive parameters including fecundity and egg hatchability. The liver of toxin treated female fish present glycogen storage loss and cellular damages. The quantitative proteomics analysis revealed that the quantities of 225 hepatic proteins are dysregulated. In particular, a notable decrease in protein quantities of vitellogenin and choriogenin was observed, which could explain the decrease in reproductive output. Liver transcriptome analysis through Illumina RNA-seq reveals that over 100-400 genes are differentially expressed under 5 μg L -1  MC-LR and Microcystis extract treatments, respectively. Ingenuity pathway analysis of the omic data attests that various metabolic pathways, such as energy production, protein biosynthesis and lipid metabolism, are disturbed by both MC-LR and the Microcystis extract, which could provoke the observed reproductive impairment. The transcriptomics analysis also constitutes the first report of the impairment of circadian rhythm-related gene induced by MCs. This study contributes to a better understanding of the potential consequences of chronic exposure of fish to environmental concentrations of cyanotoxins, suggesting that Microcystis extract could impact a wider range of biological pathways, compared with pure MC-LR, and even 1 μg L -1  MC-LR potentially induces a health risk for aquatic organisms. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Analyzing and interpreting genome data at the network level with ConsensusPathDB.

    PubMed

    Herwig, Ralf; Hardt, Christopher; Lienhard, Matthias; Kamburov, Atanas

    2016-10-01

    ConsensusPathDB consists of a comprehensive collection of human (as well as mouse and yeast) molecular interaction data integrated from 32 different public repositories and a web interface featuring a set of computational methods and visualization tools to explore these data. This protocol describes the use of ConsensusPathDB (http://consensuspathdb.org) with respect to the functional and network-based characterization of biomolecules (genes, proteins and metabolites) that are submitted to the system either as a priority list or together with associated experimental data such as RNA-seq. The tool reports interaction network modules, biochemical pathways and functional information that are significantly enriched by the user's input, applying computational methods for statistical over-representation, enrichment and graph analysis. The results of this protocol can be observed within a few minutes, even with genome-wide data. The resulting network associations can be used to interpret high-throughput data mechanistically, to characterize and prioritize biomarkers, to integrate different omics levels, to design follow-up functional assay experiments and to generate topology for kinetic models at different scales.

  17. Target-Pathogen: a structural bioinformatic approach to prioritize drug targets in pathogens

    PubMed Central

    Sosa, Ezequiel J; Burguener, Germán; Lanzarotti, Esteban; Radusky, Leandro; Pardo, Agustín M; Marti, Marcelo

    2018-01-01

    Abstract Available genomic data for pathogens has created new opportunities for drug discovery and development to fight them, including new resistant and multiresistant strains. In particular structural data must be integrated with both, gene information and experimental results. In this sense, there is a lack of an online resource that allows genome wide-based data consolidation from diverse sources together with thorough bioinformatic analysis that allows easy filtering and scoring for fast target selection for drug discovery. Here, we present Target-Pathogen database (http://target.sbg.qb.fcen.uba.ar/patho), designed and developed as an online resource that allows the integration and weighting of protein information such as: function, metabolic role, off-targeting, structural properties including druggability, essentiality and omic experiments, to facilitate the identification and prioritization of candidate drug targets in pathogens. We include in the database 10 genomes of some of the most relevant microorganisms for human health (Mycobacterium tuberculosis, Mycobacterium leprae, Klebsiella pneumoniae, Plasmodium vivax, Toxoplasma gondii, Leishmania major, Wolbachia bancrofti, Trypanosoma brucei, Shigella dysenteriae and Schistosoma Smanosoni) and show its applicability. New genomes can be uploaded upon request. PMID:29106651

  18. Redundancy control in pathway databases (ReCiPa): an application for improving gene-set enrichment analysis in Omics studies and "Big data" biology.

    PubMed

    Vivar, Juan C; Pemu, Priscilla; McPherson, Ruth; Ghosh, Sujoy

    2013-08-01

    Abstract Unparalleled technological advances have fueled an explosive growth in the scope and scale of biological data and have propelled life sciences into the realm of "Big Data" that cannot be managed or analyzed by conventional approaches. Big Data in the life sciences are driven primarily via a diverse collection of 'omics'-based technologies, including genomics, proteomics, metabolomics, transcriptomics, metagenomics, and lipidomics. Gene-set enrichment analysis is a powerful approach for interrogating large 'omics' datasets, leading to the identification of biological mechanisms associated with observed outcomes. While several factors influence the results from such analysis, the impact from the contents of pathway databases is often under-appreciated. Pathway databases often contain variously named pathways that overlap with one another to varying degrees. Ignoring such redundancies during pathway analysis can lead to the designation of several pathways as being significant due to high content-similarity, rather than truly independent biological mechanisms. Statistically, such dependencies also result in correlated p values and overdispersion, leading to biased results. We investigated the level of redundancies in multiple pathway databases and observed large discrepancies in the nature and extent of pathway overlap. This prompted us to develop the application, ReCiPa (Redundancy Control in Pathway Databases), to control redundancies in pathway databases based on user-defined thresholds. Analysis of genomic and genetic datasets, using ReCiPa-generated overlap-controlled versions of KEGG and Reactome pathways, led to a reduction in redundancy among the top-scoring gene-sets and allowed for the inclusion of additional gene-sets representing possibly novel biological mechanisms. Using obesity as an example, bioinformatic analysis further demonstrated that gene-sets identified from overlap-controlled pathway databases show stronger evidence of prior association to obesity compared to pathways identified from the original databases.

  19. Omicseq: a web-based search engine for exploring omics datasets.

    PubMed

    Sun, Xiaobo; Pittard, William S; Xu, Tianlei; Chen, Li; Zwick, Michael E; Jiang, Xiaoqian; Wang, Fusheng; Qin, Zhaohui S

    2017-07-03

    The development and application of high-throughput genomics technologies has resulted in massive quantities of diverse omics data that continue to accumulate rapidly. These rich datasets offer unprecedented and exciting opportunities to address long standing questions in biomedical research. However, our ability to explore and query the content of diverse omics data is very limited. Existing dataset search tools rely almost exclusively on the metadata. A text-based query for gene name(s) does not work well on datasets wherein the vast majority of their content is numeric. To overcome this barrier, we have developed Omicseq, a novel web-based platform that facilitates the easy interrogation of omics datasets holistically to improve 'findability' of relevant data. The core component of Omicseq is trackRank, a novel algorithm for ranking omics datasets that fully uses the numerical content of the dataset to determine relevance to the query entity. The Omicseq system is supported by a scalable and elastic, NoSQL database that hosts a large collection of processed omics datasets. In the front end, a simple, web-based interface allows users to enter queries and instantly receive search results as a list of ranked datasets deemed to be the most relevant. Omicseq is freely available at http://www.omicseq.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. 77 FR 16246 - National Heart, Lung, and Blood Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-03-20

    ... unwarranted invasion of personal privacy. Name of Committee: National Heart, Lung, and Blood Institute Special Emphasis Panel; Omics and Genetic Analysis. Date: April 6, 2012. Time: 8:30 a.m. to 12 p.m. Agenda: To...

Top